CN112488235A - Elevator time sequence data abnormity diagnosis method based on deep learning - Google Patents

Elevator time sequence data abnormity diagnosis method based on deep learning

Info

Publication number
CN112488235A
Authority
CN
China
Prior art keywords
sample
encoder
data
elevator
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011459448.1A
Other languages
Chinese (zh)
Inventor
陈华
曾亚辉
方建文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Special Equipment Safety Supervision Inspection Institute of Jiangsu Province
Original Assignee
Special Equipment Safety Supervision Inspection Institute of Jiangsu Province
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Special Equipment Safety Supervision Inspection Institute of Jiangsu Province filed Critical Special Equipment Safety Supervision Inspection Institute of Jiangsu Province
Priority to CN202011459448.1A priority Critical patent/CN112488235A/en
Publication of CN112488235A publication Critical patent/CN112488235A/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 Pattern recognition
    • G06F 18/20 Analysing
    • G06F 18/24 Classification techniques
    • G06F 18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • B PERFORMING OPERATIONS; TRANSPORTING
    • B66 HOISTING; LIFTING; HAULING
    • B66B ELEVATORS; ESCALATORS OR MOVING WALKWAYS
    • B66B 5/00 Applications of checking, fault-correcting, or safety devices in elevators
    • B66B 5/0006 Monitoring devices or performance analysers
    • B66B 5/0018 Devices monitoring the operating condition of the elevator system
    • B66B 5/0031 Devices monitoring the operating condition of the elevator system for safety reasons
    • B PERFORMING OPERATIONS; TRANSPORTING
    • B66 HOISTING; LIFTING; HAULING
    • B66B ELEVATORS; ESCALATORS OR MOVING WALKWAYS
    • B66B 5/00 Applications of checking, fault-correcting, or safety devices in elevators
    • B66B 5/02 Applications of checking, fault-correcting, or safety devices in elevators responsive to abnormal operating conditions
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/04 Architecture, e.g. interconnection topology
    • G06N 3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/04 Architecture, e.g. interconnection topology
    • G06N 3/048 Activation functions
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/08 Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Elevator Control (AREA)

Abstract

The invention provides an abnormality diagnosis method for elevator time-series data based on deep learning, belonging to the field of soft measurement modeling and application for the operation process of special equipment. The method mainly comprises the following steps: determining the auxiliary variables required for elevator soft measurement modeling based on the data, normalizing the auxiliary variables, and dividing the elevator data into fixed-length time-series segments with a sliding window; performing trend prediction on the time-series data with an attention-based long short-term memory network to predict the state value at the next moment; and inputting the state value into the normal range region reconstructed by a variational autoencoder and judging whether it lies within that region, so as to obtain the abnormal condition of the data. The method improves the accuracy of abnormality diagnosis during elevator operation, provides real-time detection, and can be effectively applied to the field of fault diagnosis for special inspection equipment.

Description

Abnormality diagnosis method for elevator time-series data based on deep learning
Technical Field
The invention belongs to the field of soft measurement modeling and application for the operation process of special equipment, and particularly relates to an abnormality diagnosis method for elevator time-series data based on deep learning.
Background
With the steady advance of modernization, special equipment such as elevators, hoisting machinery and passenger ropeways is widely used and has greatly improved the convenience and efficiency of production and daily life. Special equipment therefore occupies a very important position in everyday life and is closely related to it. With its popularization, however, the safety of special equipment has also drawn great concern, because once such equipment fails it directly endangers human life and property, with unimaginable consequences. The measure currently taken for this safety problem is to inspect special equipment periodically and to evaluate its operating condition and service life. This measure, however, has four problems: 1. it places high professional demands on quality inspectors, who need extensive experience in diagnosing special-equipment faults for the evaluation results to be reliable; 2. the system structure of special equipment is complex, so the faults it causes carry too much uncertainty and cannot be evaluated accurately; 3. equipment abnormalities occur with low probability and at no fixed time, and cannot be controlled effectively through periodic inspection; 4. periodic inspection suffers from serious lag and cannot monitor special equipment in real time to safeguard life and property.
With the arrival of the artificial-intelligence era, many manual operations are gradually being replaced by intelligent equipment; using intelligent equipment to monitor a specific device in real time yields higher detection efficiency at low cost. As artificial-intelligence technology has matured, it has produced many effective results in abnormality diagnosis and greatly promoted the development of abnormality-diagnosis technology. Abnormality diagnosis based on artificial intelligence mainly uses sensors to collect data at key positions of special equipment and determines the operating condition and service life of the equipment by observing changes in the overall data in combination with an expert diagnosis system. With this approach, the safety condition of an elevator can be evaluated effectively simply by recording the sensor readings at different moments in real time and transmitting the data over the Internet of Things to the user's computer for analysis. Once a special device develops an abnormality, large or small, the sensors on it exhibit phenomena such as waveform jumps and faults, forming abnormal data. At that point, the abnormal condition of the special equipment during operation can be diagnosed by effectively analyzing the real-time data it generates to find the abnormal points, so that early-warning measures can be issued in real time.
Disclosure of Invention
The purpose of the invention is as follows: aiming at the problem that traditional abnormality diagnosis of special equipment requires enormous manpower and material resources for manual inspection, the invention adopts a data-driven approach to comprehensively evaluate the operating condition of special equipment and provides an abnormality diagnosis method for elevator time-series data based on deep learning.
The technical scheme is as follows: an abnormality diagnosis method for elevator time-series data based on deep learning comprises the following steps:
step 1, collecting data: the time-series data of the elevator are obtained by sampling a door ball-bearing sensor, a humidity sensor and a vibration sensor at key positions on the elevator, and are stored in time-series form;
step 2, preprocessing the time-series data to obtain modeling samples, and dividing the modeling samples into training samples and test samples;
step 3, establishing a prediction model of the attention-based long short-term memory (LSTM) network with the training samples, and predicting with the test samples to obtain the state value at the next moment;
step 4, inputting the training samples into a variational autoencoder to obtain the target optimal loss value α;
step 5, inputting the state values into the range region reconstructed by the variational autoencoder, and determining the upper and lower limits of the normal range interval;
step 6, collecting online, in real time, the time-series data measured by the sensors on the elevator, and preprocessing them;
step 7, inputting the preprocessed new sample into the prediction model of the attention-based LSTM network of step 3 to obtain the corresponding state value, then passing this state value to step 5 and judging whether it lies within the upper and lower limits of the normal range interval, so as to judge the abnormal condition of the data in real time; a minimal sketch of this decision pipeline follows.
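For orientation, the following minimal Python sketch shows how steps 6 and 7 fit together at inference time. The names `predict_next` and `bounds` are hypothetical stand-ins for the attention-based LSTM model of step 3 and the normal range interval [δmin, δmax] of step 5; this is an illustrative sketch, not the patent's implementation.

```python
import numpy as np

def diagnose(window, predict_next, bounds):
    """Steps 6-7: predict the next state value for one preprocessed
    window and check it against the normal range interval."""
    state = predict_next(window)
    delta_min, delta_max = bounds
    return state, bool(delta_min <= state <= delta_max)

# Usage with a stand-in predictor (mean of the last row of the window):
state, is_normal = diagnose(np.random.rand(8, 12),
                            predict_next=lambda w: float(w[-1].mean()),
                            bounds=(0.2, 0.8))
print(state, is_normal)
```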
In step 2, the specific preprocessing process for obtaining the modeling samples from the time-series data is min-max normalization:
x_i,d = (x′_i,d − min_{1≤k≤m} x′_k,d) / (max_{1≤k≤m} x′_k,d − min_{1≤k≤m} x′_k,d)
where the time-series data D′ = {(x′_1, y_1), (x′_2, y_2), …, (x′_m, y_m)}, x′_i ∈ R^d denotes a single sample, and x′_i,d denotes the d-th attribute value of the i-th sample; preprocessing the time-series data yields the modeling samples, expressed as: D = {(x_1, y_1), (x_2, y_2), …, (x_m, y_m)}.
In step 3, the prediction model of the attention-based long short-term memory (LSTM) network is built in two stages: an encoding stage and a decoding stage. The encoder stage is implemented as follows:
step 1, the training samples are divided equally into n batches of time-series data, and the input of the attention-based LSTM network at the current time t is X_t ∈ R^(n×d);
step 2, at a given time t, the hidden state of the previous time step is denoted H_(t−1) ∈ R^(n×h); the output is computed jointly by the fully connected layer and the activation function Sigmoid (σ), which is defined by:
σ(x) = 1 / (1 + e^(−x))
step 3, let the number of hidden units be h; given the mini-batch input X_t at time t and the previous hidden state H_(t−1), the input gate I_t ∈ R^(n×h), forget gate F_t ∈ R^(n×h) and output gate O_t ∈ R^(n×h) at time t are respectively:
I_t = σ(X_t W_xi + H_(t−1) W_hi + b_i)
F_t = σ(X_t W_xf + H_(t−1) W_hf + b_f)
O_t = σ(X_t W_xo + H_(t−1) W_ho + b_o)
where W_xi, W_xf, W_xo ∈ R^(d×h) and W_hi, W_hf, W_ho ∈ R^(h×h) are weight parameters, and b_i, b_f, b_o ∈ R^(1×h) are bias parameters;
step 4, the candidate memory cell C̃_t ∈ R^(n×h) uses the Tanh function as its activation function, defined as:
Tanh(x) = (1 − e^(−2x)) / (1 + e^(−2x))
step 5, C̃_t is calculated as:
C̃_t = Tanh(X_t W_xc + H_(t−1) W_hc + b_c)
where W_xc ∈ R^(d×h) and W_hc ∈ R^(h×h) are weight parameters, and b_c ∈ R^(1×h) is a bias parameter;
step 6, the memory cell C_t at the current time combines the information of the previous memory cell and the current candidate cell, controlled by the input gate and the forget gate through element-wise multiplication (symbol ⊙):
C_t = F_t ⊙ C_(t−1) + I_t ⊙ C̃_t
step 7, the flow of information from the memory cell to the hidden state H_t is controlled by the output gate:
H_t = O_t ⊙ Tanh(C_t);
The decoder stage is implemented as follows:
step 1, with t indexing the encoder, the hidden state of the decoder at time t′ is expressed through the hidden state H_(t′−1) of the previous moment; the current hidden state can be expressed as:
H_t′ = g(c_t′, H_(t′−1))
where g is a matrix multiplication operation;
step 2, the context variable c_t′ of the decoder at time t′ is a weighted average over all encoder hidden states:
c_t′ = Σ_{t=1..T} α_t′t H_t
step 3, the attention weights α_t′t form a probability distribution obtained with the softmax operation:
α_t′t = exp(e_t′t) / Σ_{k=1..T} exp(e_t′k)
step 4, the score e_t′t takes the hidden variables of times t and t′ as input through the function f:
e_t′t = f(H_(t′−1), H_t)
where f is an inner-product operation with a perceptron transformation through the activation function Tanh:
f(s, h) = Tanh(W_s H_(t′−1) + W_h H_t) v^T
where v ∈ R^(1×h) and W_s, W_h ∈ R^(h×h) are all learnable model parameters, and all matrix parameters are initialized from a Gaussian distribution;
step 5, the final training error function is the mean squared error between the predicted and true values:
error = (1/T) Σ_{t′=1..T} (ŷ_t′ − y_t′)²
the method comprises the following specific steps of:
step 1, initializing encoder H0,Wxi,Wxf,Wxo、Whi,Whf,Who、bi,bf,boWxc、Whc、bc、v,Ws,WhThe matrix which satisfies corresponding dimensionality and has a numerical value of 0 to 1, wherein the sample batch size n is 8, T is m/n, and epoach of neural network training is 1000;
step 2, setting epoach to be 1;
step 3, setting t to be 1;
step 4, letting Ht-1=H0And sample XtSubstituting the formula in the step 2 to obtain H1
Step 5, changing T to T +1, and turning to step 64 when T is less than or equal to T, or turning to step 66;
step 6, obtaining HTLet t' be 1;
step 7, entering a decoder part and enabling Ht′-1=HTIs substituted into Ht′=g(ct′,Ht′-1) To obtain Ht′Then make it possible to
Figure BDA0002830868830000051
Wherein
Figure BDA0002830868830000052
Step 8, changing T 'to T' +1, and turning to step 67 when T is less than or equal to T, or turning to step 69;
step 9, finally according to
Figure BDA0002830868830000053
Obtaining an error value error, and training an initialization parameter through back propagation; the parameter iterative optimization is carried out by adopting a random gradient descent algorithm, and the learning rate lr is 0.0003;
step 10, when the epoach is equal to or less than 1000, turning to step 63, otherwise, finishing training, establishing a long and short memory network prediction model based on an attention mechanism, and storing the model;
step 11, placing a test sample into the trained model for prediction, and predicting a state value at the next moment;
in the step 4, the variational encoder comprises two parts: the encoder and the decoder are all full-connection layer neural networks, the encoder uses a four-layer full-connection network, the input dimensionality is d, the output dimensionality is 4, and the dimensionalities of two middle hidden layers are 8 and 6 respectively; the dimension of the decoder is opposite to that of the encoder, the input and output dimensions are respectively 4 and d, and the dimension of the middle hidden layer is 6 and 8; the encoder mainly maps the extracted data features into low-dimensional potential feature vectors, and the decoder decodes and reconstructs the low-dimensional potential feature vectors to be as close as possible to the original data features. The method comprises the following concrete steps:
step 1, k represents that the original sample dimension is 12, subscript j represents the dimension of the potential feature vector, namely output dimension 4, the encoder part of the training sample encoder obtains the low-dimensional potential feature vector, and the sample is
Figure BDA0002830868830000054
d is 12, then the output is
Figure BDA0002830868830000055
Obtaining the latent variable zi
Figure BDA0002830868830000056
Figure BDA0002830868830000057
Figure BDA0002830868830000058
Where randn _ like () denotes the return of a random number satisfying a normal distribution, and then the latent variable ziSample data x' with the same dimension as the training sample is restored through a decoderi
Step 2, finding the correct distribution of the latent variables by using the lower bound of the marginal likelihood estimation of the input data as a target loss function:
Figure BDA0002830868830000061
comparing errors of the output sample data and the training sample, calculating a target loss value through the formula, wherein the target to be achieved is to make the difference between the sample data and the training sample as small as possible, and when a variational encoder is trained, a random gradient descent algorithm is used for training, the initialization parameter of a neural network is optimized, and the obtained target loss value is alpha;
in the step 5, the state value is input into the range region reconstructed by the variational encoder, and the upper and lower limits of the normal range region are determined, wherein the specific process is as follows:
step 1, inputting a test sample, and obtaining a state value of the test sample through model training based on a reconstruction probability threshold value alpha obtained by a training sample;
and 2, inputting the state values into a variational encoder model for carrying out abnormity diagnosis, and recording the maximum value and the minimum value of the state values of the time series data with the target loss value lower than the target loss value to obtain [ delta max, delta min ], thereby obtaining the upper limit and the lower limit [ delta min, delta max ] of the normal range interval.
Beneficial effects: the invention diagnoses the abnormal condition of special equipment during operation by analyzing the real-time data it generates, so that the equipment is monitored effectively and detection lag is avoided; the method is proposed for monitoring faults of special equipment in real time during operation.
Drawings
Fig. 1 is an overall architecture diagram of the abnormality diagnosis method for elevator time-series data based on deep learning.
Fig. 2 shows the prediction results after soft measurement modeling of the elevator time-series data based on deep learning.
Fig. 3 shows the abnormality diagnosis effect of the abnormality diagnosis method for elevator time-series data based on deep learning.
Detailed Description
The following further describes embodiments of the present invention with reference to the drawings. Referring to fig. 1, the abnormality diagnosis method for elevator time-series data based on deep learning comprises the following steps:
step 1, collecting data: the time-series data of the elevator are obtained by sampling a door ball-bearing sensor, a humidity sensor and a vibration sensor at key positions on the elevator, and are stored in time-series form;
step 2, preprocessing the time-series data to obtain modeling samples, and dividing the modeling samples into training samples and test samples;
step 3, establishing a prediction model of the attention-based long short-term memory (LSTM) network with the training samples, and predicting with the test samples to obtain the state value at the next moment;
step 4, inputting the training samples into a variational autoencoder to obtain the target optimal loss value α;
step 5, inputting the state values into the range region reconstructed by the variational autoencoder, and determining the upper and lower limits of the normal range interval;
step 6, collecting online, in real time, the time-series data measured by the sensors on the elevator, and preprocessing them;
step 7, inputting the preprocessed new sample into the prediction model of the attention-based LSTM network of step 3 to obtain the corresponding state value, then passing this state value to step 5 and judging whether it lies within the upper and lower limits of the normal range interval, so as to judge the abnormal condition of the data in real time.
In a further embodiment, the specific preprocessing process of step 2 for obtaining the modeling samples from the time-series data is min-max normalization (a code sketch follows):
x_i,d = (x′_i,d − min_{1≤k≤m} x′_k,d) / (max_{1≤k≤m} x′_k,d − min_{1≤k≤m} x′_k,d)
where the time-series data D′ = {(x′_1, y_1), (x′_2, y_2), …, (x′_m, y_m)}, x′_i ∈ R^d denotes a single sample, and x′_i,d denotes the d-th attribute value of the i-th sample; preprocessing the time-series data yields the modeling samples, expressed as: D = {(x_1, y_1), (x_2, y_2), …, (x_m, y_m)}.
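As an illustration of this preprocessing, the sketch below assumes min-max scaling and a stride-1 sliding window; the patent specifies normalization and fixed-length windows but not these exact choices.

```python
import numpy as np

def preprocess(raw, window):
    """Min-max normalize each attribute (assumed form of the
    normalization), then cut fixed-length segments with a stride-1
    sliding window. raw has shape (m, d)."""
    lo, hi = raw.min(axis=0), raw.max(axis=0)
    scaled = (raw - lo) / (hi - lo + 1e-12)  # guard against constant columns
    return np.stack([scaled[i:i + window]
                     for i in range(len(scaled) - window + 1)])

samples = preprocess(np.random.rand(100, 12), window=8)
print(samples.shape)  # (93, 8, 12)
```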
In a further embodiment, the prediction model of the attention-based LSTM network in step 3 is built in two stages: an encoding stage and a decoding stage. The encoder stage is implemented as follows:
step 1, the training samples are divided equally into n batches of time-series data, and the input of the attention-based LSTM network at the current time t is X_t ∈ R^(n×d);
step 2, at a given time t, the hidden state of the previous time step is denoted H_(t−1) ∈ R^(n×h); the output is computed jointly by the fully connected layer and the activation function Sigmoid (σ), which is defined by:
σ(x) = 1 / (1 + e^(−x))
step 3, let the number of hidden units be h; given the mini-batch input X_t at time t and the previous hidden state H_(t−1), the input gate I_t ∈ R^(n×h), forget gate F_t ∈ R^(n×h) and output gate O_t ∈ R^(n×h) at time t are respectively:
I_t = σ(X_t W_xi + H_(t−1) W_hi + b_i)
F_t = σ(X_t W_xf + H_(t−1) W_hf + b_f)
O_t = σ(X_t W_xo + H_(t−1) W_ho + b_o)
where W_xi, W_xf, W_xo ∈ R^(d×h) and W_hi, W_hf, W_ho ∈ R^(h×h) are weight parameters, and b_i, b_f, b_o ∈ R^(1×h) are bias parameters;
step 4, the candidate memory cell C̃_t ∈ R^(n×h) uses the Tanh function as its activation function, defined as:
Tanh(x) = (1 − e^(−2x)) / (1 + e^(−2x))
step 5, C̃_t is calculated as:
C̃_t = Tanh(X_t W_xc + H_(t−1) W_hc + b_c)
where W_xc ∈ R^(d×h) and W_hc ∈ R^(h×h) are weight parameters, and b_c ∈ R^(1×h) is a bias parameter;
step 6, the memory cell C_t at the current time combines the information of the previous memory cell and the current candidate cell, controlled by the input gate and the forget gate through element-wise multiplication (symbol ⊙):
C_t = F_t ⊙ C_(t−1) + I_t ⊙ C̃_t
step 7, the flow of information from the memory cell to the hidden state H_t is controlled by the output gate:
H_t = O_t ⊙ Tanh(C_t);
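The encoder's gate equations translate directly into code. The following PyTorch sketch performs one encoder time step (steps 2 to 7 above); the random parameter values are for the demo only and would in practice be the initialized, trained parameters.

```python
import torch

def lstm_step(X_t, H_prev, C_prev, p):
    """One encoder time step following the gate equations above.
    p maps names to weights W_x* (d, h), W_h* (h, h) and biases b_* (1, h)."""
    I = torch.sigmoid(X_t @ p["Wxi"] + H_prev @ p["Whi"] + p["bi"])   # input gate
    F = torch.sigmoid(X_t @ p["Wxf"] + H_prev @ p["Whf"] + p["bf"])   # forget gate
    O = torch.sigmoid(X_t @ p["Wxo"] + H_prev @ p["Who"] + p["bo"])   # output gate
    C_tilde = torch.tanh(X_t @ p["Wxc"] + H_prev @ p["Whc"] + p["bc"])
    C = F * C_prev + I * C_tilde          # element-wise multiplication (⊙)
    H = O * torch.tanh(C)
    return H, C

d, h, n = 12, 16, 8
p = {k: (torch.rand(d if k.startswith("Wx") else h, h) if k.startswith("W")
         else torch.rand(1, h))
     for k in ["Wxi", "Wxf", "Wxo", "Wxc",
               "Whi", "Whf", "Who", "Whc", "bi", "bf", "bo", "bc"]}
H, C = lstm_step(torch.rand(n, d), torch.zeros(n, h), torch.zeros(n, h), p)
print(H.shape)  # torch.Size([8, 16])
```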
The decoder stage is implemented as follows:
step 1, with t indexing the encoder, the hidden state of the decoder at time t′ is expressed through the hidden state H_(t′−1) of the previous moment; the current hidden state can be expressed as:
H_t′ = g(c_t′, H_(t′−1))
where g is a matrix multiplication operation;
step 2, the context variable c_t′ of the decoder at time t′ is a weighted average over all encoder hidden states:
c_t′ = Σ_{t=1..T} α_t′t H_t
step 3, the attention weights α_t′t form a probability distribution obtained with the softmax operation:
α_t′t = exp(e_t′t) / Σ_{k=1..T} exp(e_t′k)
step 4, the score e_t′t takes the hidden variables of times t and t′ as input through the function f:
e_t′t = f(H_(t′−1), H_t)
where f is an inner-product operation with a perceptron transformation through the activation function Tanh:
f(s, h) = Tanh(W_s H_(t′−1) + W_h H_t) v^T
where v ∈ R^(1×h) and W_s, W_h ∈ R^(h×h) are all learnable model parameters, and all matrix parameters are initialized from a Gaussian distribution;
step 5, the final training error function is the mean squared error between the predicted and true values:
error = (1/T) Σ_{t′=1..T} (ŷ_t′ − y_t′)²
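The decoder's attention computation (steps 2 to 4 above) can be sketched as follows: the score f(s, h) = Tanh(W_s H_(t′−1) + W_h H_t) v^T is computed for every encoder state, softmaxed into weights α_t′t, and used to form the context c_t′. The tensor shapes are assumptions chosen for the demo.

```python
import torch

def attention_context(H_enc, H_dec_prev, Ws, Wh, v):
    """Score each encoder hidden state against the previous decoder state,
    softmax the scores into attention weights, and return the weighted
    context vector c_t'."""
    # H_enc: (T, h) stacked encoder states; H_dec_prev: (1, h)
    scores = torch.tanh(H_dec_prev @ Ws + H_enc @ Wh) @ v.T   # e_t't, shape (T, 1)
    alpha = torch.softmax(scores, dim=0)                      # α_t't, shape (T, 1)
    return (alpha * H_enc).sum(dim=0, keepdim=True)           # c_t', shape (1, h)

T, h = 10, 16
Ws, Wh, v = torch.randn(h, h), torch.randn(h, h), torch.randn(1, h)
c = attention_context(torch.randn(T, h), torch.randn(1, h), Ws, Wh, v)
print(c.shape)  # torch.Size([1, 16])
```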
In a further embodiment, the prediction model of the attention-based LSTM network is trained by the following specific steps:
step 1, initialize the encoder parameters H_0, W_xi, W_xf, W_xo, W_hi, W_hf, W_ho, b_i, b_f, b_o, W_xc, W_hc, b_c, v, W_s, W_h as matrices of the corresponding dimensions with values between 0 and 1; the sample batch size n = 8, T = m/n, and the number of epochs of neural network training is 1000;
step 2, set epoch = 1;
step 3, set t = 1;
step 4, let H_(t−1) = H_0 and substitute the sample X_t into the encoder formulas to obtain H_1;
step 5, let t = t + 1; if t ≤ T, go to step 4, otherwise go to step 6;
step 6, obtain H_T and let t′ = 1;
step 7, enter the decoder part: let H_(t′−1) = H_T, substitute into H_t′ = g(c_t′, H_(t′−1)) to obtain H_t′, and then compute the prediction ŷ_t′ from H_t′ through the output layer;
step 8, let t′ = t′ + 1; if t′ ≤ T, go to step 7, otherwise go to step 9;
step 9, finally compute the error value error according to the training error function, and train the initialized parameters through back propagation; parameter iterative optimization uses the stochastic gradient descent algorithm with learning rate lr = 0.0003;
step 10, let epoch = epoch + 1; if epoch ≤ 1000, go to step 3; otherwise training ends, the prediction model of the attention-based LSTM network is established, and the model is saved;
step 11, put the test samples into the trained model for prediction, and predict the state value at the next moment;
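A compact training loop with the stated hyper-parameters (batch size n = 8, 1000 epochs, stochastic gradient descent with learning rate 0.0003) might look as follows. The mean-squared training error and the stand-in linear model are illustrative assumptions.

```python
import torch

def train(model, windows, targets, epochs=1000, lr=3e-4, batch=8):
    """SGD training loop with the hyper-parameters stated above; the loss
    is the mean squared error between prediction and target."""
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    for _ in range(epochs):
        for i in range(0, len(windows), batch):
            X, y = windows[i:i + batch], targets[i:i + batch]
            loss = torch.mean((model(X) - y) ** 2)
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model

# Demo with a stand-in linear model over flattened (8 x 12) windows:
m = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(8 * 12, 1))
m = train(m, torch.rand(80, 8, 12), torch.rand(80, 1), epochs=2)
```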
in a further embodiment, in the step 4, the variational encoder comprises two parts: the encoder and the decoder are all full-connection layer neural networks, the encoder uses a four-layer full-connection network, the input dimensionality is d, the output dimensionality is 4, and the dimensionalities of two middle hidden layers are 8 and 6 respectively; the dimension of the decoder is opposite to that of the encoder, the input and output dimensions are respectively 4 and d, and the dimension of the middle hidden layer is 6 and 8; the encoder mainly maps the extracted data features into low-dimensional potential feature vectors, and the decoder decodes and reconstructs the low-dimensional potential feature vectors to be as close as possible to the original data features. The method comprises the following concrete steps:
step 1, k represents that the original sample dimension is 12, subscript j represents the dimension of the potential feature vector, namely output dimension 4, the encoder part of the training sample encoder obtains the low-dimensional potential feature vector, and the sample is
Figure BDA0002830868830000102
d is 12, then the output is
Figure BDA0002830868830000103
Obtaining the latent variable zi
Figure BDA0002830868830000104
Figure BDA0002830868830000105
Figure BDA0002830868830000106
Where randn _ like () denotes the return of a random number satisfying a normal distribution, and then the latent variable ziSample data x' with the same dimension as the training sample is restored through a decoderi
Step 2, finding the correct distribution of the latent variables by using the lower bound of the marginal likelihood estimation of the input data as a target loss function:
Figure BDA0002830868830000107
comparing errors of the output sample data and the training sample, calculating a target loss value through the formula, wherein the target to be achieved is to make the difference between the sample data and the training sample as small as possible, and when a variational encoder is trained, a random gradient descent algorithm is used for training, the initialization parameter of a neural network is optimized, and the obtained target loss value is alpha;
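The stated layer dimensions (encoder d→8→6 with a 4-dimensional latent vector, decoder 4→6→8→d) and the randn_like reparameterization can be sketched directly in PyTorch. The ReLU activations and the exact form of the loss terms are assumptions; the patent specifies only the dimensions and the marginal-likelihood lower bound.

```python
import torch
from torch import nn

class VAE(nn.Module):
    """Fully connected variational autoencoder with the stated dimensions:
    encoder d -> 8 -> 6 -> (mu, logvar of size 4), decoder 4 -> 6 -> 8 -> d."""
    def __init__(self, d=12, z=4):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(d, 8), nn.ReLU(),
                                 nn.Linear(8, 6), nn.ReLU())
        self.mu = nn.Linear(6, z)
        self.logvar = nn.Linear(6, z)
        self.dec = nn.Sequential(nn.Linear(z, 6), nn.ReLU(),
                                 nn.Linear(6, 8), nn.ReLU(),
                                 nn.Linear(8, d))

    def forward(self, x):
        h = self.enc(x)
        mu, logvar = self.mu(h), self.logvar(h)
        sigma = torch.exp(0.5 * logvar)
        z = mu + sigma * torch.randn_like(sigma)   # reparameterization trick
        return self.dec(z), mu, logvar

def loss_fn(x, x_rec, mu, logvar):
    """Negative lower bound: reconstruction error plus KL divergence."""
    rec = torch.mean((x_rec - x) ** 2)
    kl = -0.5 * torch.mean(1 + logvar - mu ** 2 - logvar.exp())
    return rec + kl

vae = VAE()
x = torch.rand(8, 12)
x_rec, mu, lv = vae(x)
print(loss_fn(x, x_rec, mu, lv).item())
```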
in a further embodiment, in step 5, the state value is input into the range region reconstructed by the variational encoder, and the upper and lower limits of the normal range interval are determined, where the specific process is as follows:
step 1, inputting a test sample, and obtaining a state value of the test sample through model training based on a reconstruction probability threshold value alpha obtained by a training sample;
and 2, inputting the state values into a variational encoder model for carrying out abnormity diagnosis, and recording the maximum value and the minimum value of the state values of the time series data with the target loss value lower than the target loss value to obtain [ delta max, delta min ], thereby obtaining the upper limit and the lower limit [ delta min, delta max ] of the normal range interval.
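Finally, a sketch of how the normal range interval is determined and then used online: among the state values whose loss falls below the threshold α, the minimum and maximum are recorded as [δmin, δmax], and a new state value outside this interval is flagged as abnormal. All names here are illustrative.

```python
import torch

def normal_range(states, losses, alpha):
    """Record [delta_min, delta_max] over state values whose loss is
    below the target loss value alpha."""
    kept = states[losses < alpha]
    return kept.min().item(), kept.max().item()

states, losses = torch.rand(50), torch.rand(50) * 0.2
d_min, d_max = normal_range(states, losses, alpha=0.15)
is_abnormal = lambda s: not (d_min <= s <= d_max)   # online check of step 7
print(d_min, d_max, is_abnormal(0.99))
```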
Although the preferred embodiments of the present invention have been described in detail, the present invention is not limited to the details of those embodiments; various equivalent modifications can be made within the technical conception of the present invention, and these equivalent modifications all fall within its scope of protection.

Claims (9)

1. An abnormality diagnosis method for elevator time-series data based on deep learning, characterized by comprising the following steps:
step 1, collecting data: the time-series data of the elevator are obtained by sampling a door ball-bearing sensor, a humidity sensor and a vibration sensor at key positions on the elevator, and are stored in time-series form;
step 2, preprocessing the time-series data to obtain modeling samples, and dividing the modeling samples into training samples and test samples;
step 3, establishing a prediction model of the attention-based long short-term memory (LSTM) network with the training samples, and predicting with the test samples to obtain the state value at the next moment;
step 4, inputting the training samples into a variational autoencoder to obtain the target optimal loss value α;
step 5, inputting the state values into the range region reconstructed by the variational autoencoder, and determining the upper and lower limits of the normal range interval;
step 6, collecting online, in real time, the time-series data measured by the sensors on the elevator, and preprocessing them;
step 7, inputting the preprocessed new sample into the prediction model of the attention-based LSTM network of step 3 to obtain the corresponding state value, then passing this state value to step 5 and judging whether it lies within the upper and lower limits of the normal range interval, so as to judge the abnormal condition of the data in real time.
2. The abnormality diagnosis method for elevator time-series data based on deep learning according to claim 1, characterized in that in step 2 the specific preprocessing process for obtaining the modeling samples is min-max normalization:
x_i,d = (x′_i,d − min_{1≤k≤m} x′_k,d) / (max_{1≤k≤m} x′_k,d − min_{1≤k≤m} x′_k,d)
where the time-series data D′ = {(x′_1, y_1), (x′_2, y_2), …, (x′_m, y_m)}, x′_i ∈ R^d denotes a single sample, and x′_i,d denotes the d-th attribute value of the i-th sample; preprocessing the time-series data yields the modeling samples, expressed as: D = {(x_1, y_1), (x_2, y_2), …, (x_m, y_m)}.
3. The abnormality diagnosis method for elevator time-series data based on deep learning according to claim 1, characterized in that in step 3 the prediction model of the attention-based long short-term memory (LSTM) network is built in two stages: an encoding stage and a decoding stage.
4. The abnormality diagnosis method for elevator time-series data based on deep learning according to claim 3, characterized in that the encoder stage is implemented by the following steps:
step 41, the training samples are divided equally into n batches of time-series data, and the input of the attention-based LSTM network at the current time t is X_t ∈ R^(n×d);
step 42, at a given time t, the hidden state of the previous time step is denoted H_(t−1) ∈ R^(n×h); the output is computed jointly by the fully connected layer and the activation function Sigmoid (σ), which is defined by:
σ(x) = 1 / (1 + e^(−x))
step 43, let the number of hidden units be h; given the mini-batch input X_t at time t and the previous hidden state H_(t−1), the input gate I_t ∈ R^(n×h), forget gate F_t ∈ R^(n×h) and output gate O_t ∈ R^(n×h) at time t are respectively:
I_t = σ(X_t W_xi + H_(t−1) W_hi + b_i)
F_t = σ(X_t W_xf + H_(t−1) W_hf + b_f)
O_t = σ(X_t W_xo + H_(t−1) W_ho + b_o)
where W_xi, W_xf, W_xo ∈ R^(d×h) and W_hi, W_hf, W_ho ∈ R^(h×h) are weight parameters, and b_i, b_f, b_o ∈ R^(1×h) are bias parameters;
step 44, the candidate memory cell C̃_t ∈ R^(n×h) uses the Tanh function as its activation function, defined as:
Tanh(x) = (1 − e^(−2x)) / (1 + e^(−2x))
step 45, C̃_t is calculated as:
C̃_t = Tanh(X_t W_xc + H_(t−1) W_hc + b_c)
where W_xc ∈ R^(d×h) and W_hc ∈ R^(h×h) are weight parameters, and b_c ∈ R^(1×h) is a bias parameter;
step 46, the memory cell C_t at the current time combines the information of the previous memory cell and the current candidate cell, controlled by the input gate and the forget gate through element-wise multiplication (symbol ⊙):
C_t = F_t ⊙ C_(t−1) + I_t ⊙ C̃_t
step 47, the flow of information from the memory cell to the hidden state H_t is controlled by the output gate:
H_t = O_t ⊙ Tanh(C_t).
5. the method for diagnosing the abnormality of the elevator time series data based on the deep learning as claimed in claim 3, wherein the decoder stage is implemented by the following steps:
step 51, for the index t of the encoder, the implicit state of the decoder time t' can be represented by the implicit state H of the previous momentt′-1By way of representation, the current implicit state may be represented as:
Ht′=g(ct′,Ht′-1)
wherein g is a matrix multiplication operation;
step 52, weighted average is performed on all encoder implicit states in the background of the decoder at time t':
Figure FDA0002830868820000031
and 53, obtaining probability distribution by using softmax operation:
Figure FDA0002830868820000032
step 54, for et′tThe implicit variables of time t and t' are input by the f-function:
ett=f(Ηt′-1,Ht)
wherein the f function is inner product operation, and perception transformation is carried out through an activation function tanh:
f(s,h)=tanh(WsΗt′-1+WhHt)vT
wherein
Figure FDA0002830868820000033
All the model parameters can be learned, and all the matrix parameters are initialized through Gaussian distribution;
step 55, the function of the last training error is:
Figure FDA0002830868820000034
6. the method for diagnosing the abnormality of the elevator time series data based on the deep learning as claimed in claim 3, wherein the prediction model of the long and short memory network based on the attention mechanism comprises the following specific steps:
step 61, initialize encoder H0,Wxi,Wxf,Wxo、Whi,Whf,Who、bi,bf,bo Wxc、Whc、bc、v,Ws,WhThe matrix which satisfies corresponding dimensionality and has a numerical value of 0 to 1, wherein the sample batch size n is 8, T is m/n, and epoach of neural network training is 1000;
step 62, setting epoach to be 1;
step 63, setting t to be 1;
step 64, let Ht-1=H0And sample XtSubstituting the formula in the step 2 to obtain H1
Step 65, changing T to T +1, and turning to step 64 when T is less than or equal to T, or turning to step 66;
step 66, obtaining HTLet t' be 1;
step 67, enter the decoder part, let Ht′-1=HTIs substituted into Ht′=g(ct′,Ht′-1) To obtain Ht′Then make it possible to
Figure FDA0002830868820000041
Wherein
Figure FDA0002830868820000042
Step 68, changing T 'to T' +1, and turning to step 67 when T is less than or equal to T, or turning to step 69;
step 69, finally according to
Figure FDA0002830868820000043
Obtaining an error value error, and training an initialization parameter through back propagation; the parameter iterative optimization is carried out by adopting a random gradient descent algorithm, and the learning rate lr is 0.0003;
step 610, when the epoach is equal to or less than 1000, turning to step 63, otherwise, finishing training, so as to establish a prediction model of the long and short memory network based on the attention mechanism, and storing the model;
and 611, putting the test sample into the trained model for prediction, and predicting the state value at the next moment.
7. The abnormality diagnosis method for elevator time-series data based on deep learning according to claim 1, characterized in that the variational autoencoder of step 4 consists of two parts, an encoder and a decoder, both fully connected neural networks; the encoder uses a four-layer fully connected network with input dimension d and output dimension 4, the two intermediate hidden layers having dimensions 8 and 6; the decoder mirrors the encoder, with input and output dimensions of 4 and d and intermediate hidden layers of dimensions 6 and 8; the encoder maps the extracted data features to a low-dimensional latent feature vector, and the decoder reconstructs the latent feature vector to be as close as possible to the original data features.
8. The abnormality diagnosis method for elevator time-series data based on deep learning according to claim 7, characterized by comprising the following steps:
step 81, k denotes the original sample dimension, 12, and the subscript j indexes the dimensions of the latent feature vector, i.e. output dimension 4; the encoder part maps a training sample x_i ∈ R^d (d = 12) to the low-dimensional latent feature vector: the encoder outputs a mean vector μ_i ∈ R^4 and a standard deviation vector σ_i ∈ R^4, from which the latent variable z_i is obtained by reparameterization:
ε = randn_like(σ_i)
z_i = μ_i + σ_i ⊙ ε
where randn_like() returns random numbers drawn from a standard normal distribution; the latent variable z_i is then restored through the decoder to sample data x′_i with the same dimension as the training sample;
step 82, the correct distribution of the latent variable is found by using the lower bound of the marginal likelihood of the input data as the target loss function:
L(θ, φ; x_i) = E_{q_φ(z|x_i)}[log p_θ(x_i|z)] − KL(q_φ(z|x_i) ‖ p_θ(z))
the error between the reconstructed sample data and the training sample is compared and the target loss value is calculated by the above formula; the objective is to make this difference as small as possible; when the variational autoencoder is trained, the stochastic gradient descent algorithm is used to optimize the initialized network parameters, and the target loss value obtained is α.
9. The abnormality diagnosis method for elevator time-series data based on deep learning according to claim 1, characterized in that step 5 comprises the following specific process:
step 91, the test samples are input, and their state values are obtained through the trained model, based on the reconstruction-probability threshold α obtained from the training samples;
step 92, the state values are input into the variational autoencoder model for abnormality diagnosis; the maximum and minimum state values of the time-series data whose loss falls below the target loss value α are recorded as δmax and δmin, giving the upper and lower limits [δmin, δmax] of the normal range interval.
CN202011459448.1A 2020-12-11 2020-12-11 Elevator time sequence data abnormity diagnosis method based on deep learning Pending CN112488235A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011459448.1A CN112488235A (en) 2020-12-11 2020-12-11 Elevator time sequence data abnormity diagnosis method based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011459448.1A CN112488235A (en) 2020-12-11 2020-12-11 Elevator time sequence data abnormity diagnosis method based on deep learning

Publications (1)

Publication Number Publication Date
CN112488235A true CN112488235A (en) 2021-03-12

Family

ID=74916525

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011459448.1A Pending CN112488235A (en) 2020-12-11 2020-12-11 Elevator time sequence data abnormity diagnosis method based on deep learning

Country Status (1)

Country Link
CN (1) CN112488235A (en)


Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200131003A1 (en) * 2017-10-20 2020-04-30 China University Of Mining And Technology Multiple-state health monitoring apparatus and monitoring method for critical components in hoisting system
CN108197736A (en) * 2017-12-29 2018-06-22 北京工业大学 A kind of Air Quality Forecast method based on variation self-encoding encoder and extreme learning machine
CN110689075A (en) * 2019-09-26 2020-01-14 北京工业大学 Fault prediction method of self-adaptive threshold of refrigeration equipment based on multi-algorithm fusion
CN111914873A (en) * 2020-06-05 2020-11-10 华南理工大学 Two-stage cloud server unsupervised anomaly prediction method
CN111861272A (en) * 2020-07-31 2020-10-30 西安交通大学 Multi-source data-based complex electromechanical system abnormal state detection method

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113111096A (en) * 2021-04-08 2021-07-13 东方电气集团科学技术研究院有限公司 Abnormity detection method for high-dimensional time sequence working condition data of power generation equipment
CN113111096B (en) * 2021-04-08 2023-09-05 东方电气集团科学技术研究院有限公司 Abnormality detection method for high-dimensional time sequence working condition data of power generation equipment
CN113239614A (en) * 2021-04-22 2021-08-10 西北工业大学 Atmospheric turbulence phase space-time prediction algorithm
CN114538234A (en) * 2022-02-14 2022-05-27 深圳市爱丰达盛科技有限公司 Internet of things big data elevator safe operation standard AI self-building system and method
CN114538234B (en) * 2022-02-14 2023-06-30 深圳市爱丰达盛科技有限公司 Automatic construction system and method for safe operation standard AI of big data elevator of Internet of things
CN114266201B (en) * 2022-03-01 2022-07-22 杭州市特种设备检测研究院(杭州市特种设备应急处置中心) Self-attention elevator trapping prediction method based on deep learning
CN114266201A (en) * 2022-03-01 2022-04-01 杭州市特种设备检测研究院(杭州市特种设备应急处置中心) Self-attention elevator trapping prediction method based on deep learning
CN114626467A (en) * 2022-03-17 2022-06-14 湖南优湖科技有限公司 Feature cross elevator trapping time series prediction model construction method based on deep learning, obtained model and prediction method
CN114675118A (en) * 2022-05-30 2022-06-28 广东电网有限责任公司佛山供电局 Transformer winding abnormality detection method, device, equipment and storage medium
CN115078894A (en) * 2022-08-22 2022-09-20 广东电网有限责任公司肇庆供电局 Method, device and equipment for detecting abnormity of electric power machine room and readable storage medium
CN115983087A (en) * 2022-09-16 2023-04-18 山东财经大学 Method for detecting time sequence data abnormity by combining attention mechanism and LSTM and terminal
CN115983087B (en) * 2022-09-16 2023-10-13 山东财经大学 Method for detecting time sequence data abnormality by combining attention mechanism with LSTM (link state machine) and terminal
CN115859202A (en) * 2022-11-24 2023-03-28 浙江邦盛科技股份有限公司 Abnormal detection method and device under non-stationary time sequence data flow field scene
CN115859202B (en) * 2022-11-24 2023-10-10 浙江邦盛科技股份有限公司 Abnormality detection method and device under non-stationary time sequence data stream scene
CN115797708A (en) * 2023-02-06 2023-03-14 南京博纳威电子科技有限公司 Power transmission and distribution synchronous data acquisition method

Similar Documents

Publication Publication Date Title
CN112488235A (en) Elevator time sequence data abnormity diagnosis method based on deep learning
Ma et al. Discriminative deep belief networks with ant colony optimization for health status assessment of machine
CN111813084B (en) Mechanical equipment fault diagnosis method based on deep learning
CN108095716B (en) Electrocardiosignal detection method based on confidence rule base and deep neural network
She et al. Wear indicator construction of rolling bearings based on multi-channel deep convolutional neural network with exponentially decaying learning rate
CN110555230B (en) Rotary machine residual life prediction method based on integrated GMDH framework
CN113762486B (en) Method and device for constructing fault diagnosis model of converter valve and computer equipment
CN108664690A (en) Long-life electron device reliability lifetime estimation method under more stress based on depth belief network
CN108398268A (en) A kind of bearing performance degradation assessment method based on stacking denoising self-encoding encoder and Self-organizing Maps
CN110705812A (en) Industrial fault analysis system based on fuzzy neural network
CN113096818B (en) Method for evaluating occurrence probability of acute diseases based on ODE and GRUD
CN115673596B (en) Welding abnormity real-time diagnosis method based on Actor-Critic reinforcement learning model
CN113917334A (en) Battery health state estimation method based on evolution LSTM self-encoder
CN114297918A (en) Aero-engine residual life prediction method based on full-attention depth network and dynamic ensemble learning
CN111222798B (en) Complex industrial process key index soft measurement method
CN112763967A (en) BiGRU-based intelligent electric meter metering module fault prediction and diagnosis method
CN114239397A (en) Soft measurement modeling method based on dynamic feature extraction and local weighted deep learning
CN113011102B (en) Multi-time-sequence-based Attention-LSTM penicillin fermentation process fault prediction method
CN117521512A (en) Bearing residual service life prediction method based on multi-scale Bayesian convolution transducer model
CN112381213A (en) Industrial equipment residual life prediction method based on bidirectional long-term and short-term memory network
CN117290726A (en) CAE-BiLSTM-based fault early warning method for mobile equipment
CN116430164A (en) Cable online monitoring method based on distributed temperature measurement and fault waveform analysis
CN115153549A (en) BP neural network-based man-machine interaction interface cognitive load prediction method
CN115828744A (en) White light LED fault on-line diagnosis and service life prediction method
CN115376638A (en) Physiological characteristic data analysis method based on multi-source health perception data fusion

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication. Application publication date: 20210312