CN117290726A

CN117290726A - CAE-BiLSTM-based fault early warning method for mobile equipment

Info

Publication number: CN117290726A
Application number: CN202311237531.8A
Authority: CN
Inventors: 孙贺贺; 李郭敏; 曾珍
Original assignee: Weibiran Data Technology Beijing Co ltd
Current assignee: Weibiran Data Technology Beijing Co ltd
Priority date: 2023-09-25
Filing date: 2023-09-25
Publication date: 2023-12-26

Abstract

The invention discloses a dynamic equipment fault early warning method based on CAE-BiLSTM, which comprises the following steps: acquiring original data running in a normal state of the mobile equipment and preprocessing the original data; intercepting part of the original data as a training set, and processing to obtain a standardized model; constructing a CAE-BiLSTM prediction model, taking original data in a training set as a target value of the prediction model, and taking other original data which are not taken as the training set as input of the prediction model; establishing a security threshold of the RMSE; operating the monitoring data in a state to be monitored, and obtaining a standardized structure after processing; calculating a residual error between the true value and the predicted value by using the RMSE; and judging the magnitude relation between the calculated RMSE of each monitoring parameter and the safety threshold value. The safety threshold value is compared with the root mean square error, so that the risk of false alarm is reduced; by constructing the CAE-BiLSTM training model, the running reliability of the mobile equipment is improved without expert experience.

Description

CAE-BiLSTM-based fault early warning method for mobile equipment

Technical Field

The invention relates to a fault early-warning method for mobile equipment, in particular to a fault early-warning method for mobile equipment based on CAE-BiLSTM.

Background

With the development of modern industrial Internet technology and production modernization, the traditional field inspection and off-line monitoring method cannot effectively monitor sudden and sporadic faults of equipment. The advantages and disadvantages of the unit state relate to the safe and effective operation of the production platform, the unplanned shutdown of the movable equipment can cause a certain influence on the continuous production of the platform, and the on-line real-time monitoring of the movable equipment can effectively solve the problem. Firstly, the online real-time monitoring technology can convert the traditional periodic maintenance into the optionally maintenance; secondly, the off-line monitoring mode can not effectively record equipment fault data in real time, the on-line real-time monitoring technology can record equipment operation data in real time, the equipment operation state is evaluated accurately by summarizing and analyzing the long-time monitoring data, the equipment management mode is improved, and the existing moving equipment fault early warning method are insufficient:

1. model based on dynamic device operation mechanism: the method is tightly combined with a control theory, a mathematical model is established through an equipment operation mechanism to predict, and the mathematical model is compared with an actual measured value to obtain a residual error; the residual is analyzed to determine if the process is malfunctioning. The method has the defects that most of mechanism models are simplified linear systems, and in the actual industrial process, nonlinear, high-degree-of-freedom and multivariable coupling systems are often adopted, so that the use effect is not ideal.

2. A method based on expertise: the method is based on experience knowledge of people, fault characteristics are deduced, namely after the fault of the movable equipment occurs, an expert discovers a problem by dissecting the equipment, qualitative or quantitative characteristics of the equipment fault are summarized by combining the change condition of historical monitoring parameters before the fault, and equipment fault early warning and monitoring are completed through the characteristics. The method has the defects that the early warning accuracy has strong dependence on the richness of expert knowledge and the level of expert knowledge, and many experiences are difficult to describe in a reasonable formal expression mode.

Disclosure of Invention

In order to solve the defects of the technology, the invention provides a dynamic equipment fault early warning method based on CAE-BiLSTM.

In order to solve the technical problems, the invention adopts the following technical scheme: a dynamic equipment fault early warning method based on CAE-BiLSTM comprises the following steps:

step S1: acquiring original data of the mobile equipment running in a normal state and preprocessing the original data;

step S2: intercepting the original data after the pretreatment of the part in the S1 as a training set, and carrying out parameter processing on the training set to obtain a standardized model;

step S3: constructing a CAE-BiLSTM prediction model, taking original data in a training set as a target value of the prediction model, taking other original data which are not taken as the training set as input of the prediction model, and performing data dimension reduction on the original data in the training set by using a convolution self-encoder CAE to obtain reconstructed data after reconstruction output; modeling the reconstructed data from the forward and reverse directions in a bi-directional short-term neural network BiLSTM, classifying the new data obtained after modeling in a full-connection layer, obtaining a target value of a prediction model, and storing the trained prediction model;

step S4: establishing a safety threshold of Root Mean Square Error (RMSE) by using the original data running in a normal state and applying a CAE-BiLSTM prediction model;

step S5: operating the monitoring data in a state to be monitored, and preprocessing the monitoring data to obtain a standardized structure;

step S6: calculating residual errors between real monitoring data and predicted values in the CAE-BiLSTM prediction model by using the RMSE;

step S7: and judging the magnitude relation between the RMSE of each monitoring parameter calculated in the step S6 and the safety threshold of the RMSE, and further judging whether the mobile equipment fails or not.

Further, the step S1 of preprocessing the original data specifically includes the following steps:

step S11, collecting original data of running of the mobile equipment in a normal state, and obtaining a data set formed by all running monitoring parameters of the mobile equipment;

step S12, analyzing a mechanism of operation in a data set and a data missing condition;

step S13, selecting original data parameters capable of reflecting the performance of equipment as monitoring parameters;

step S14, saving the names of the parameter items which are selected and used as monitoring parameters.

Further, in step S2, the original data serving as the monitoring parameter is intercepted from the original data of the historical operation of the slave device to form a training set. The parameter processing training set comprises parameter screening, data cleaning and parameter standardization, and a standardized model of the training set is stored; in the cleaning process, the training set only keeps the original data operated in the normal period and eliminates the original data operated in the fault period; in the process of parameter standardization of the training set and the method of Z-score is utilized for obtaining the standardized structure in the step S5, so that the variance of the new X data set is 1, the mean value is 0, and the processed original data and the processed monitoring data both accord with standard normal distribution, and the application formula is as follows:

X＝(x-μ)/σ；

wherein μ is a normal distribution of position parameters, X is an axis of symmetry with μ, and σ is the degree of dispersion.

In step S3, the convolution encoder CAE is used to perform data dimension reduction on the original data, which specifically includes the following steps:

step S31, convolution operation: initializing k convolution kernels W in a convolution layer, wherein each convolution kernel is matched with a bias b, generating k feature graphs h after convolution with an input x, and the activation function is sigma, and the formula is:

h ^k ＝σ(x*w ^k +b ^k )；

wherein h is a feature map; w is a convolution kernel; k is the number of convolution kernels; sigma is an activation function; x is input data; b is the current bias;

step S32, pooling operation: carrying out pooling operation on the feature map generated in the step S31 at a pooling layer, and reserving a matrix of the position relationship during pooling;

step S33, reverse pooling operation: performing reverse pooling operation on the feature map generated in the step S32 at a pooling layer, and restoring the pooled original data to the corresponding position of the matrix with the original size by using the matrix with the position relation when pooling is reserved;

step S34, deconvolution operation: each feature map h performs a convolution operation with its corresponding transpose of the convolution kernel and sums the results, then adds the offset c, with the formula:

y＝σ(∑h ^k *W ^k +c)；

wherein σ is the activation function; h is a feature map; w is a convolution kernel; k is the number of convolution kernels; c is the current bias;

step S35, updating the weight value: at the convolution layer, updating the weight, firstly determining a cost function costfunction by using a minimum mean square error MSE, wherein the formula is as follows:

wherein n is the number of samples; x is x _i Is the target value, y _i Is a true value.

In the step S3, modeling is carried out on the original data after CAE processing by utilizing BiLSTM, namely modeling is carried out on the original data after CAE processing by utilizing LSTM from the front direction and the back direction, and then information is spliced together; the information useful for calculation at the subsequent moment is transmitted by forgetting the information in the cell state and memorizing the new information, and useless information is discarded, and the forward propagation formula is as follows:

f _t ＝σ(W _f ·[h _t-1 ，x _t ]+b _f )；

i _t ＝σ(W _i ·[h _t-1 ，x _t ]+b _i )；

o _t ＝σ(W _o [h _t-1 ，x _t ]+b _o )；

h _t ＝o _t ·Relu(C _t )；

wherein f _t Is a forgetful door; i.e _t Is a memory gate; o (o) _t Is an output door; c (C) _t Is in a cellular state;is a temporary cellular state; h is a _t 、h _t-1 The output of the LSTM and the output of the last unit; x is x _t Is the current input; sigma and Relu are sigmoid functions and linear rectification activation functions; w (W) _f 、b _f A forgetting gate weight matrix and a bias roof; w (W) _i 、b _i Biasing the top for the weight matrix of the input gate; w (W) _c 、b _c A weight matrix and a bias roof calculated for the state values; w (W) _o 、b _o A weight matrix and a bias roof for the output gate;

the back propagation algorithm iteratively updates the original data by using a gradient descent method, derives each parameter by using a loss function, and updates the parameter along the direction of the bias.

Further, in step S4, an exponential weighted moving average method is used to establish a security threshold for RMSE.

Further, in step S5, the monitoring data is preprocessed, which specifically includes the following steps:

step S51, obtaining monitoring data of the current dynamic equipment to be evaluated;

step S52, screening parameter items of the monitoring data according to the names of the saved and selected parameter items;

step S53, the parameter values of the screened parameter items are subjected to standardization processing.

Further, in step S7, it is judged that the RMSE of each calculated monitoring parameter does not exceed the safety threshold of RMSE three times, and if the RMSE is normal, the alarm is not given; and if the calculated RMSE comparison of each monitoring parameter exceeds the safety threshold value of the RMSE for three times, alarming.

The invention discloses a dynamic equipment fault early warning method based on CAE-BiLSTM, which utilizes the sequence structure analysis function of BiLSTM and the feature extraction and transformation function of CAE to construct a deep learning model based on a CAE-BiLSTM network, wherein the model can quantify key parameters of equipment and further early warn the equipment by observing the change trend of residual errors, and has the following beneficial effects:

1. by comparing the safety threshold with the root mean square error, the residual error of the target parameter is statistically analyzed, the running condition of the equipment can be known by observing the variation trend of the residual error, the risk of false alarm is reduced, the residual error is predicted by replacing the mathematical model of most mechanism resume of the equipment, and the fault early warning of the mobile equipment based on CAE-BiLSTM is more suitable for a nonlinear and multivariable coupling system of the mobile equipment in the industrial process.

2. Constructing a CAE-BiLSTM training model, performing data dimension reduction and reconstruction on data in a training set by using a convolution self-encoder CAE, thereby being beneficial to improving the performance of the running equipment and reducing the consumption of a memory; modeling reconstructed data from the front direction and the back direction by utilizing a bi-directional short-term neural network BiLSTM, and learning long-term dependence in the data through a memory unit and a memory gate of the bi-directional short-term neural network BiLSTM, and combining with other deep learning architectures, so that the running performance and reliability of the mobile equipment are improved, the cost can be reduced, and the safe running of the mobile equipment is ensured; the new data after modeling enter the full-connection layer for classification, and due to the structural characteristics of the full-connection layer, when the complexity of the model is increased and the number of full-connection layers is deepened, the learning capacity of the model can be improved, and the learning capacity of the training model is utilized, so that the learning capacity of any expert knowledge and experience is not needed to be used as fault prediction reasoning, and the early warning accuracy is improved.

Drawings

Fig. 1 is a logic diagram of the technical scheme of the present invention.

FIG. 2 is a schematic diagram of the CAE-BiLSTM model structure of the present invention.

FIG. 3 is a schematic diagram of the BiLSTM model structure of the present invention.

Detailed Description

The invention will be described in further detail with reference to the drawings and the detailed description.

step S2: intercepting part of the preprocessed original data in the S1 as a training set, and carrying out parameter processing on the training set to obtain a standardized model;

step S4: establishing a safety threshold of Root Mean Square Error (RMSE) by operating original data in a normal state and applying a CAE-BiLSTM prediction model;

Specifically, as shown in a logic diagram of a technical scheme of a CAE-BiLSTM-based dynamic equipment fault early warning method shown in fig. 1, in a training process, after original data are operated in a normal state, the original data are preprocessed, after correlation analysis is carried out, input parameters are selected, a prediction model is built, and a threshold value of RMSE (root mean square error) is built through the original data in the normal operation state; in the monitoring process, running data in a state to be monitored, preprocessing the monitored data, selecting input and output parameters determined during training, putting the data to be monitored into a prediction model established during training to obtain a prediction sequence, judging whether the calculated RMSE exceeds a threshold established during training for 3 times, and if the RMSE does not exceed the threshold for 3 times, ensuring that the state is normal; if the RMSE is 3 times, the threshold value is exceeded, then an alarm is given.

The embodiment of the invention provides a mobile equipment fault early warning method based on CAE-BiLSTM, which comprises the following specific steps:

1: raw data of running in a normal state of the mobile equipment are obtained, a data set formed by all running parameters of the mobile equipment is obtained, the running mechanism in the data set is analyzed, the raw data parameters capable of reflecting the performance of the mobile equipment are selected as monitoring parameters, and parameter item names of the monitoring parameters are saved for a training set.

2: performing operations such as parameter screening, data cleaning, parameter standardization and the like on the training set, and storing a standardized model of the training set; in the parameter screening process, the selection of the monitoring parameters is obtained from the data items stored in the step 1; in the data cleaning process, only the original data operated in the normal operation period is reserved, and the original data operated in the fault period is required to be removed; the normalization process adopts a Z-score method, namely, a formula of X= (X-mu)/sigma is applied, wherein mu is a position parameter of normal distribution, X is a symmetry axis with mu, sigma is a degree of dispersion, so that a new X data set variance is 1, and a mean value is 0, and the processed data accords with the normal distribution.

3: the CAE-BiLSTM model shown in figure 2 is constructed, the original data of the dynamic equipment in the training set are respectively used as the targets of the model, other original data which are not used as the training set are used as the input of the model, the supervised training is carried out, and the training results are saved.

31: CAE automatic encoder is a typical unsupervised godA network model, namely a self-encoder; based on a back propagation algorithm and an optimization method, the self-encoder guides the neural network to try to learn a mapping relation by taking input data X as supervision at an input layer to obtain a reconstruction output X ^R Typically, an algorithm model consists of two parts: an encoder and a decoder, the CAE network structure including an encoding process and a decoding process; in the encoding process, original data input in an input layer are encoded into a pooling layer through a CAE layer, and high-dimensional input X is encoded into a low-dimensional hidden variable h through an encoder, so that the neural network is forced to learn the characteristics of the most information quantity; in the decoding process, the coded variable in the pooling layer of the coding process is decoded to the up-sampling layer through the CAE layer, the decoder restores the hidden variable h of the hidden layer to the initial dimension, and the best state is that the output of the decoder can perfectly or approximately restore the original input, namely X ^R The pressure is approximately equal to X; the convolution self-encoder replaces a simple self-encoding full-connection layer by using the CAE layer, is matched with a two-dimensional topological structure of the image, avoids the loss of reconstruction errors, changes the encoding process into a convolution process, and changes the decoding process into a deconvolution process; the decoded data enter a BiLSTM layer to model the original data after CAE processing from the positive direction and the negative direction, then the data enter a full-connection layer to be classified and integrated, the full-connection layer is replaced by the CAE layer in the convolution self-encoder, and finally the original data after CAE processing is output to an output layer.

311: convolution layer-convolution operation: initializing k convolution kernels (W), wherein each convolution kernel is matched with one offset b, and generating k feature graphs h after convolution with an input x, and the activation function is sigma, and the formula is:

h ^k ＝σ(x*w ^k +b ^k )；

312: pooling layer-pooling operation (MaxPooling): pooling operation is carried out on the feature map generated in 311, and a matrix of the position relation during pooling is reserved, so that the subsequent anti-pooling operation is facilitated;

313: pooling layer-self-coding (anti-pooling operation): performing inverse pooling operation on the feature map generated in 311, and restoring the pooled original data to the corresponding position of the original size matrix by using the matrix reserved to the position relation during pooling;

314: convolutional layer-self-encoding (deconvolution operation): performing convolution operation on each feature diagram h and the corresponding transpose of the convolution kernel, summing the results, and adding the offset c, wherein the formula is as follows:

y＝σ(∑h ^k *W ^k +c)；

315: convolution layer-update weights: to update the weights, firstly, a costfunction cost function is determined, and the MSE (minimum mean square error) function adopted here, that is, the square sum of the target value minus the actual value is averaged, where the formula is:

wherein 2n is the simplification of derivation, x _i Is the target value, y _i Is an actual value.

32: the LSTM is a special recurrent neural network, the BiLSTM does not change the internal structure of the LSTM, but models the CAE processed original data by using the LSTM from the front direction and the back direction, and then the information is spliced together, but the problem that the importance degree of the information of the original data is changed before and after the LSTM is processed due to the model structure, so that the prediction accuracy is reduced is solved, and meanwhile, gradient disappearance and explosion are avoided. The calculation process of LSTM is summarized as: by forgetting information in the cell state and memorizing new information, information useful for calculation at a later time is transferred, and useless information is discarded, and forward propagation thereof can be expressed as:

f _t ＝σ(W _f ·[h _t-1 ，x _t ]+b _f )；

i _t ＝σ(W _i ·[h _t-1 ，x _t ]+b _i )；

o _t ＝σ(W _o [h _t-1 ，x _t ]+b _o )；

h _t ＝o _t ·Relu(C _t )；

the back propagation algorithm is to iteratively update the CAE-processed original data by using a gradient descent method, derive each parameter by using a loss function, and update the parameters along the direction of the derivation.

FIG. 3 shows a BiLSTM model structure, wherein W _f As a forward weight matrix, W _b For the inverse weight matrix, o is the temporary cell state and the cell state, x is the current input, x= [ x ] ₁ ，x ₂ ，x ₃ …x _n ]；In order to output the data it is possible,α ^f for the output of the last neuron when the original data after CAE processing is forward propagated, alpha ^b And outputting the last neuron when the CAE processed data are back-propagated, and finally combining the result with the result output by the LSTM to obtain an output result. Constructing a CAE-BiLSTM training model, performing original data dimension reduction and reconstruction on data in a training set by using a convolution self-encoder CAE, thereby being beneficial to improving the performance of the running equipment and reducing the consumption of a memory; modeling reconstructed data from the front direction and the back direction by utilizing a bi-directional short-term neural network BiLSTM, and learning long-term dependence in the data through a memory unit and a memory gate of the bi-directional short-term neural network BiLSTM, and combining with other deep learning architectures, so that the running performance and reliability of the mobile equipment are improved, the cost can be reduced, and the safe running of the mobile equipment is ensured; the new data after modeling enter the full-connection layer for classification, and due to the structural characteristics of the full-connection layer, when the complexity of the model is increased, the number of full-connection layers is deepened, the learning capacity of the model can be improved, no expert experience is needed, no manual label establishment is needed, and a large amount of manpower is saved.

4: after the CAE-BiLSTM model is built, before the CAE-BiLSTM model is applied, running monitoring data in a state to be monitored, obtaining the monitoring data of the current dynamic equipment to be evaluated, screening the monitoring parameter items according to the names of the selected parameter items stored in 1, and standardizing a screening result set of the monitoring parameter items.

5: and (3) applying a CAE-BiLSTM training model, putting the data to be monitored into the CAE-BiLSTM model to obtain a predicted sequence, and establishing a Root Mean Square Error (RMSE) safety threshold by using an exponential weighted moving average method.

6: the residual error between the predicted value and the true value is calculated by the Root Mean Square Error (RMSE), and the smaller the root mean square error is, the higher the measurement accuracy is.

7: judging whether the RMSE exceeds a threshold value for three times according to the RMSE of each parameter calculated in the step 6; if the RMSE does not exceed the threshold value for three times, the RMSE is normal; if the RMSE is compared with the threshold value for three times, alarming; the monitoring accuracy can be improved, and the number of false alarms can be reduced.

The above embodiments are not intended to limit the present invention, and the present invention is not limited to the above examples, but is also intended to be limited to the following claims.

Claims

1. The utility model provides a mobile equipment fault early warning method based on CAE-BiLSTM, which is characterized by comprising the following steps:

2. The CAE-BiLSTM based dynamic device fault alerting method of claim 1, wherein: the step S1 of preprocessing the original data specifically comprises the following steps:

3. The CAE-BiLSTM based dynamic device fault alerting method of claim 2, wherein: in step S2, the original data serving as the monitoring parameter is intercepted from the original data of the historical operation of the slave device to form a training set.

4. The CAE-BiLSTM based dynamic device fault alerting method of claim 3, wherein: the step S2 of processing the parameters in the training set comprises parameter screening, data cleaning and parameter standardization, and a standardized model of the training set is stored; in the cleaning process, the training set only keeps the original data operated in the normal period and eliminates the original data operated in the fault period; in the process of parameter standardization of the training set and the method of Z-score is utilized for obtaining the standardized structure in the step S5, so that the variance of the new X data set is 1, the mean value is 0, and the processed original data and the processed monitoring data both accord with standard normal distribution, and the application formula is as follows:

X＝(x-μ)/σ；

5. The CAE-BiLSTM based dynamic device fault alerting method of claim 1, wherein: in the step S3, the convolution encoder CAE is used to perform data dimension reduction on the original data, and specifically includes the following steps:

h ^k =σ(x*W ^k +b ^k )；

y=σ(Σh ^k *W ^k +c)；

6. The CAE-BiLSTM based dynamic device fault alerting method of claim 1, wherein: in the step S3, modeling is carried out on the original data after CAE processing by utilizing BiLSTM, namely modeling is carried out on the original data after CAE processing by utilizing LSTM from the front direction and the back direction, and then information is spliced together; the information useful for calculation at the subsequent moment is transmitted by forgetting the information in the cell state and memorizing the new information, and useless information is discarded, and the forward propagation formula is as follows:

f _t =σ(W _f ·[h _t-1 ，x _t ]+b _f ；

i _t =σ(W _i ·[h _t-1 ，x _t ]+b _i )；

o _t ＝σ(w _o [h _t-1 ，x _t ]+b _o )；

h _t ＝o _t ·Relu(C _t )；

7. The CAE-BiLSTM based dynamic device fault alerting method of claim 1, wherein: in the step S4, an exponential weighted moving average method is used to establish a security threshold of RMSE.

8. The CAE-BiLSTM based dynamic device fault alerting method of claim 2, wherein: the step S5 of preprocessing the monitoring data specifically includes the following steps:

9. The CAE-BiLSTM based dynamic device fault alerting method of claim 1, wherein: in the step S7, if the calculated RMSE of each monitoring parameter is judged to be beyond the safety threshold value of the RMSE for three times, the RMSE is normal and does not alarm; and if the calculated RMSE comparison of each monitoring parameter exceeds the safety threshold value of the RMSE for three times, alarming.