CN111353482A

CN111353482A - LSTM-based fatigue factor recessive anomaly detection and fault diagnosis method

Info

Publication number: CN111353482A
Application number: CN202010445803.3A
Authority: CN
Inventors: 冯海领; 焦正杉; 孙敬哲; 王汉奇; 王向敏; 赵宜斌
Original assignee: Tianjin Development Zone Jingnuo Hanhai Data Technology Co ltd
Current assignee: Tianjin Development Zone Jingnuo Hanhai Data Technology Co ltd
Priority date: 2020-05-25
Filing date: 2020-05-25
Publication date: 2020-06-30
Anticipated expiration: 2040-05-25
Also published as: CN111353482B

Abstract

The invention discloses a fatigue factor recessive anomaly detection and fault diagnosis method based on LSTM, which comprises the following steps: s1, collecting time sequence data of target equipment of a diagnosis object, and carrying out empirical mode decomposition on the data; s2, constructing an equipment vibration signal prediction model based on the LSTM by using normal data; s3, classifying the collected abnormal data, and constructing a fault time sequence data classification model based on the LSTM; s4, taking the mean square error MSE of the obtained vibration signal prediction model based on the LSTM as an initial fatigue factor threshold; s5, predicting the equipment production data by using a vibration signal prediction model based on the LSTM, calculating the mean square error of a predicted value and an actual value, and comparing the mean square error with an initial fatigue factor threshold value to detect an abnormal signal; and S6, classifying the abnormal signals through a fault time sequence data classification model to obtain a fault diagnosis result.

Description

LSTM-based fatigue factor recessive anomaly detection and fault diagnosis method

Technical Field

The invention relates to the field of equipment fault diagnosis, in particular to a fatigue factor recessive abnormality detection and fault diagnosis method based on LSTM.

Background

With the explosion of the technology of the internet of things, 5G, artificial intelligence, cloud computing and the like in the form of "nuclear fusion", the strategy of "re-industrialization" made by major industrial countries around intelligent manufacturing is also very dust-laden. China firstly proposes an intelligent plus concept in a government work report of 2019, and determines intelligent manufacturing as an important development direction of new kinetic energy for national economic development.

The current fault diagnosis methods are mainly classified into two categories: a signal processing based method and a machine learning based method. Common methods based on signal processing include Spectral Kurtosis Analysis (Spectral Kurtosis), sparse Decomposition Analysis (sparse Decomposition Analysis), Time-frequency domain Analysis (Time-frequency Analysis), Wavelet Transform (WT), Empirical Mode Decomposition (EMD); the Machine learning-based methods mainly include Hidden Markov Models (HMMs), bayesian networks (bayesian networks), Support Vector Machines (SVMs) and Recurrent Neural Networks (RNNs), and the methods have achieved better research results in the field of fault diagnosis. Signal processing-based methods generally require a large number of highly experienced engineers to analyze the fault information contained in the signal, making it difficult to obtain a fault diagnosis result in a programmed manner; the method based on machine learning just makes up for the above defects of the signal processing method, and well expands the fault diagnosis method, but brings new challenges on fault diagnosis data. Generally, data recorded by a digital infrastructure in an industrial scene are mostly normal data, the data contain huge amount of information, and influence factors in aspects of product quality, energy consumption, production cost and the like are concerned, but the fault information in the data is few and few, so that how much equipment reliability is, how much fault data is, and fault information extraction is another new challenge in a large intelligent manufacturing background. In order to deal with the problem of unbalanced data, technologies such as undersampling, oversampling, and synthesizing a few classes of oversampling have been provided, and these methods have achieved good results in corresponding documents, but the undersampling technology may result in a reduction in data volume, and the oversampling technology may result in an increase in noise, and these problems may ultimately affect the accuracy of the training model.

Disclosure of Invention

In order to solve the problems caused by characteristics of unbalanced equipment data, high noise, time sequence and the like in the field of intelligent manufacturing, the invention provides a fatigue factor recessive abnormality detection and fault diagnosis method based on LSTM. The invention improves the unbalanced data processing method, the classification model and the abnormality detection method in the traditional fault diagnosis, can effectively avoid the problems caused by unbalanced data, can detect the slight change of equipment, and can early warn abnormal signals in time so as to detect the problems of equipment abrasion, aging and the like.

In order to solve the technical problems, the technical scheme adopted by the invention is as follows:

a fatigue factor recessive abnormality detection and fault diagnosis method based on LSTM comprises the following steps:

s1, collecting time sequence sample data of the target equipment of the diagnosis object, wherein the sample comprises normal data, abnormal data and fault types of the abnormal data, and performing empirical mode decomposition on the time sequence sample data of the equipment by using an EMD method;

s2, extracting normal data samples in the time sequence sample data, and constructing an equipment vibration signal prediction model by using an LSTM-based deep neural network;

s3, extracting abnormal data samples in the time sequence sample data, and constructing a fault time sequence data classification model by using an LSTM-based deep neural network;

s4, acquiring the Mean Square Error (MSE) of the vibration signal prediction model based on the LSTM, and taking the MSE as an initial fatigue factor threshold;

s5, using vibration signal prediction model based on LSTMPredicting the production data of the equipment, calculating the mean square error between the predicted value and the actual value, comparing the mean square error with the initial fatigue factor threshold value to detect abnormal signals, and dynamically adjusting the threshold value according to the actual situation

；

And S6, classifying the abnormal signals through a fault time sequence data classification model to obtain a fault diagnosis result.

Further, the process of constructing the device vibration signal prediction model based on the LSTM deep neural network in step S2 includes the following steps:

s2-1: characteristic engineering: firstly, processing collected original data, performing auxiliary judgment by using EMD (empirical mode decomposition) and waveform visualization technology, and identifying abnormal vibration frequency by observing IMF (intrinsic mode function) components;

s2-2: resampling: original data are re-sampled, abnormal data fragments in the original data are deleted, the fact that continuous normal data exist in the remaining data and only exist in the remaining data is guaranteed, and the problem of data unbalance caused by unbalanced data is solved;

s2-3: data normalization: normalizing the data of each dimension in the time series, and calculating the mean value of each dimension

And standard deviation of

The time-series sample data X is normalized by the following formula, and the normalized data is X':

（1）

s2-4: and (3) carrying out format conversion on the data set: converting the unsupervised data set into a supervised data set, dividing the normalized data into a continuous time series X of length l_r，X_rLength of (d) is input by the model as length l_sAnd an output length l_pDetermined together, their relationship is the total length

Dividing time series sample data X into X_sAnd X_pTwo parts, which are used as input and output samples of the model;

s2-5: dividing the data set: randomly arranging data, and dividing input data X according to same proportion_sAnd output data X_pObtaining a training data set

And validating the data set

；

S2-6: constructing a model: setting check points, saving model parameters once per Epoch, adjusting Epoch and Dropout parameters, and observing trainloss and valloss, using an early stop mechanism when an over-fit condition occurs;

s2-7: classification using models: inputting a segment of time sequence sample data X to obtain a prediction vector

Then, calculating a mean square error MSE according to the actual value y for evaluating the prediction effect, wherein n is the sequence number:

（2）。

further, the process of constructing the fault time series data classification model based on the LSTM deep neural network in step S3 includes the following steps:

s3-1: firstly, processing fault data samples of time sequence sample data of equipment, using data which has abnormality and has determined fault types, and additionally adding normal samples with the same quantity as each fault sample;

s3-2: empirical mode decomposition of time series sample data(EMD) to obtain an m-dimensional vector containing IMF components

M is determined by empirical mode decomposition, imf_iIs the ith dimension component;

s3-3: numbering imf of n-type time sequence data to obtain model input data X_c

（3）

Determining the output data y of the model from the numbers_c

（4）

S3-4: normalizing the data and calculating the mean of the samples

And standard deviation of

Normalized data is

：

（5）

S3-5: dividing data set, randomly arranging data, and dividing input data X according to same proportion_cAnd output data y_cObtaining a training data set

And validating the data set

；

S3-6: training model, setupChecking points, saving model parameters once per Epoch, adjusting Epoch and Dropout parameters, and observing trainloss and valloss, using an early stop mechanism when an over-fit condition occurs;

s3-7: classifying by using model, inputting a time sequence X_cObtaining a classification vector y', then the fault number

The corresponding fault is found by numbering, argmax being the parameter of the maximum value.

Further, the step S5 of predicting the equipment production data by using the vibration signal prediction model based on LSTM specifically includes:

predicting the value of time t using a prediction model

Calculated according to the actual value

Calculating a prediction error:

（6）

wherein

I is the dimension of the true monitor value, and then every error is calculated

Added to the vector of prediction errors:

（7）

h is the number of historical errors used to estimate the current error, l_sFor the length of the input sequence, the set of errors e is then smoothed, and an error threshold is selected for the set of errors

-the value of the mean square error above the threshold is classified as abnormal:

（8）

is a vector of threshold errors, e_sIs a sequence error vector, z is a constant,

is a mean value

Is a standard deviation of wherein

Determined by equation (9):

（9）

wherein:

（10）

e in the formula (9)_seqIs composed of

A continuous sequence of (a).

Is estimated by

Determining that z is an ordered set of positive integers representing greater than

The number of standard deviations, the value of z depending on experimental data, once

Is determined to produce a sequence of anomalies for each of the mean square errors

A score is made and the score s is used to indicate the severity of the anomaly, i.e. the fatigue factor:

（11）

the highest mean square error in each anomalous error sequence is then determined based on its correlation with a selected threshold

Is normalized.

Further, in the step S5, the threshold is dynamically adjusted according to the actual situation

The method specifically comprises the following steps:

in the step S5, the threshold is dynamically adjusted according to the actual situation

The method specifically comprises the following steps:

creating a new collection

Including all in descending order

Maximum value of

Then, howeverPost-cursor maximum non-abnormal mean square error

Is added to

At the end, the sequence is gradually decreased, and the decrease of the ith step is:

（12）

if the amplitude d is reduced at a certain step i⁽ⁱ⁾Greater than the minimum drop amplitude p, all satisfy

Is still an abnormal sequence; if the amplitude is decreased by d⁽ⁱ⁾Less than the minimum reduction p, then the reduction of all subsequent errors

Are reclassified as normal values.

Adopt the produced beneficial effect of above-mentioned technical scheme to lie in:

(1) aiming at the problem that enough fault samples are difficult to collect in the initial stage of a production environment, an LSTM-based vibration signal prediction model anomaly detection method is provided, the existing data non-equilibrium characteristics are fully utilized, and anomaly signals generated by equipment are effectively detected;

(2) aiming at the problem that the equipment aging wear condition is not evaluated in the current research situation, a fatigue factor recessive fault detection method is provided on the basis of (1), a threshold value can be found in fluctuating data to distinguish normal data from abnormal data, and the threshold value is dynamically adjusted in incremental learning so as to achieve the purpose of detecting potential abnormality;

(3) in consideration of the characteristic that fault time sequence data have complex time relevance, a fault time sequence data classification model fusing EMD feature extraction is provided, the fault diagnosis method uses EMD to perform feature processing and denoising, and uses a LSTM structure-based deep neural network to extract features on a time dimension so as to improve the fault classification accuracy.

Drawings

FIG. 1 is a flow chart of a vibration signal prediction model based on LSTM;

FIG. 2 is a fault timing data classification model incorporating EMD feature extraction;

FIG. 3 is a flowchart of the overall structure of the LSTM-based incremental learning latent fault diagnosis method;

FIG. 4 ARIMA, RNN, Stacked LSTM predict effect;

FIG. 5 a vibration signal;

FIG. 6 Normal data EMD decomposition;

fig. 7 inner ring fault data EMD decomposition.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings and specific embodiments.

the experimental verification of the invention uses experimental data containing three faults of an Inner ring (Inner Raceway Fault), an Outer ring (Outer Raceway Fault) and a rolling body (Ball Fault);

the flow of the vibration signal prediction model based on the LSTM proposed by the present invention is described as follows, and the flow chart of the model is shown in fig. 1.

Step 1: characteristic engineering: firstly, processing collected original data, performing auxiliary judgment by using EMD (empirical mode decomposition) and waveform visualization technology, and identifying abnormal vibration frequency by observing IMF (intrinsic mode function) components;

step 2: resampling: original data are re-sampled, abnormal data fragments in the original data are deleted, the fact that continuous normal data exist in the remaining data and only exist in the remaining data is guaranteed, and the problem of data unbalance caused by unbalanced data is solved;

and step 3: data normalization: normalizing the data of each dimension in the time series, and calculating the mean value of each dimension

And standard deviation of

The time-series sample data X is normalized by the following formula. The data normalization processing is beneficial to the initialization of the model, avoids the influence on the updating of the gradient value due to the numerical value deviation problem, and is convenient to use the fixed learning rate to iterate the model, thereby accelerating the convergence speed of the model;

（1）

and 4, step 4: and (3) carrying out format conversion on the data set: converting the unsupervised data set into a supervised data set, dividing the normalized data into a continuous time series X of length l_r，X_rLength of (d) is input by the model as length l_sAnd an output length l_pAre determined jointly, their relationship is

and 5: dividing the data set: randomly arranging data, and dividing input data X according to same proportion_sAnd output data X_pObtaining a training data set

And validating the data set

；

Step 6: constructing a model: setting check points, saving model parameters once per Epoch, adjusting Epoch and Dropout parameters, and observing trainloss and valloss, using an Early Stop mechanism (Early-Stop) when an overfitting condition occurs;

and 7: classification using models: inputting a segment of time sequence sample data X to obtain a prediction vector

（2）。

the fault time sequence data classification model process integrating EMD feature extraction provided by the invention is described as follows, and a model flow chart is shown in FIG. 2.

Step 1: firstly, processing fault samples of equipment time sequence data, using data which has abnormality and has determined fault types, and additionally adding normal samples with the same quantity as each fault sample;

step 2: EMD decomposition is carried out on the sample time sequence data to obtain an m-dimensional vector containing IMF components

and step 3: numbering imf of n-type time sequence data to obtain model input data X_c，

（3）

Determining the output data y of the model from the numbers_c：

（4）

And 4, step 4: normalizing the data and calculating the mean of the samples

And standard deviation of

Inputting data X to the model by equation (5)_cNormalization was performed:

（5）；

and 5: dividing data set, randomly arranging data, and dividing model input data X according to same proportion_cAnd output data y_cObtaining a training data set

And validating the data set

；

Step 6: training the model, setting check points, storing model parameters once per Epoch, adjusting Epoch and Dropout parameters, and observing trainloss and valloss, using an Early Stop mechanism (Early-Stop) when an overfitting condition occurs;

and 7: classifying by using model, inputting a time sequence X_cObtaining a classification vector y', then the fault number

And finding out the corresponding fault through the number.

s5, predicting the production data of the equipment by using a vibration signal prediction model based on LSTM, calculating the mean square error between a predicted value and an actual value, comparing the mean square error with an initial fatigue factor threshold value to detect an abnormal signal, and dynamically adjusting the threshold value according to the actual condition

(ii) a Predicting the value of time t using a prediction model

Calculating, from the actual value, a prediction error:

（6）

wherein

And i is the dimension of the true monitoring value. Then each error is compared

Added to the vector of prediction errors:

（7）

h is the number of historical errors used to estimate the current error, l_sIs the length of the input sequence. This set of errors e is then smoothed to suppress abrupt changes in errors that often occur in LSTM-based prediction-abrupt changes that are typically difficult for prediction models to predict, and that can also lead to abrupt changes in error values even if such data is normal. To determine whether these values are normal, the present invention sets a threshold for their smoothed prediction error-values of mean square error above the threshold are classified as abnormal.

In calculating the prediction error threshold, the method can be implemented by using a supervisor with a sample labelThe method learns to find a suitable anomaly threshold, but in general there are not enough label samples, especially if the device is operating properly without a large amount of fault data. The threshold calculation method provided by the invention does not need to use sample label data or statistical hypothesis test about errors, and can have higher performance under the condition of lower resource consumption. Selecting an error threshold in the error set

：

（8）

is a mean value

Is a standard deviation of wherein

Determined by equation (9):

（9）

wherein:

（10）

e in the formula (9)_seqIs composed of

A continuous sequence of (a).

Is estimated by

Number of standard deviations. The value of z depends on experimental data, the normal range in the experiment of the invention is 2-8, and when z is less than 2, the false alarm is generally excessive. Once the cover is closed

（11）

in short, if a threshold is found

The minus score s is greater than

Then the mean square error e of the remaining sequence_sMean value of

And standard deviation of

Will be greatly reduced. This method is also applicable to sequences with a large number of anomalies (

) Penalizing to avoid excessive deviation, then, the highest mean square error in each abnormal error sequence according to the selected threshold

Is normalized.

In order to reduce the false alarm rate and the calculation amount, the invention provides the following threshold value adjusting strategy: creating a new collection

Including all in descending order

Maximum value of

Then the maximum non-abnormal mean square error

Is added to

And (5) ending. The sequence is gradually decreased, and the decreasing amplitude of each step is d⁽ⁱ⁾,

（12）

Is still an abnormal sequence. If the amplitude is decreased by d⁽ⁱ⁾Less than the minimum reduction p, then the reduction of all subsequent errors

Are reclassified as normal values. This approach helps to ensure that the sequence of anomalies is not a regular noise in the time series data stream, and identification of the sequence of outliers is achieved by thresholding, detecting only sequence segments with potential anomalies, more efficiently than comparing sequence values one by one without using a threshold.

The general structural flow of the specific fault diagnosis method is shown in fig. 3.

The invention discloses a fatigue factor recessive anomaly detection and fault diagnosis method based on LSTM, which comprises the following steps:

description of data

The experimental data are derived from bearing fault data of a university of Kaiser West reservoir (CWRU) electrical engineering laboratory, and total 1,341,856 data points, wherein the bearing model is 6205-2RS JEM SKF deep groove ball bearing. Single-point failures of 3 failure levels are respectively arranged on an Inner ring (Inner Raceway Fault), an Outer ring (Outer Raceway Fault) and a rolling body (ballFault) on a bearing by using an electric spark machining technology, the failure diameters are respectively 0.007 inches, 0.014 inches and 0.021 inches, and the failure depths are respectively 0.011 inches, 0.050 inches and 0.150 inches. Three kinds of faults are respectively arranged at a motor driving End (Driver End) and a Fan End (Fan End), and 21 groups of data including 6 fault types are collected by vibration sensors arranged at the motor driving End, the Fan End and a base. A description of specific bearing failure data is shown in table 1.

TABLE 1 bearing failure data description (in inches)

Four types of vibration data drawing graphs are selected, as shown in fig. 5, a normal signal, a rolling element fault signal, an inner ring fault signal and an outer ring fault signal are sequentially arranged from top to bottom, and a sequence with larger amplitude can be periodically shown in the fault signal, wherein the amplitude of the fault vibration signal is obviously larger than that of the normal signal, and the sequence with larger amplitude can be observed from the graph of sample data.

And decomposing the original vibration signal by using EMD, and observing the IMF component characteristics of each stage. After the original signal decomposition, 7 components (called IMF 1-IMF 7 respectively) and a residual map are obtained, wherein each IMF component represents an connotative modal component existing in the original signal. FIG. 6 is an EMD decomposition of normal data, FIG. 7 is an EMD decomposition of inner ring fault data, in the diagram, IMFs 1-7 represent signal components under different frequencies, the signal components are sequentially arranged from high frequency to low frequency, and the right side part is the instantaneous frequency of each IMF component. It can be seen from the graph that the normal signal and the abnormal signal have a large difference in residual error and instantaneous frequency distribution, the instantaneous frequency distribution of the normal signal is smooth, and the instantaneous frequency distribution of the abnormal signal has a large fluctuation.

In order to compare the effect difference between the LSTM-based vibration signal prediction model and ARIMA and RNN methods, the used experimental data comprise normal data of a driving end and a fan end, the data are divided into single-step prediction data and multi-step prediction data, the multi-step prediction data are provided with two prediction lengths of 30 and 100, 6 data sets are provided in total, and each data set comprises 20000 training samples and 4000 test samples.

The ARIMA Model is called an autoregressive Moving Average Model (ARIMA, Autoregesived Moving Average Model). Also known as ARIMA (p, d, q), is the most common one of statistical models (statisticmodel) for time series prediction. The ARIMA model parameters p (Auto-Regressive), d (integrated), q (moving average) are shown in Table 2.

TABLE 2 ARIMA model parameters

p(Auto-Regressive)	d(Integrated)	q(Moving Average)
			1	1	1

RNN (Recurrent Neural Network) processes the most common, most traditional deep learning model of sequence data. The RNN model parameters used for the comparative experiments are shown in table 3.

TABLE 3RNN model parameters

The test was performed with the above-identified model parameters and structure, and the comparative experiment results are shown in table 4.

TABLE 4 comparative experiment test results

As can be seen from the experimental test results in Table 4, the mean square errors of the three models have strong correlation with the data prediction length (the mean square errors of the RNN and StackedLSTM models are subject to ValLoss in Table 4), that is, the mean square error is smaller when the data prediction length is shorter, and the effect of single-step prediction is significantly better than that of multi-step prediction. Fig. 4(a), (b) and (c) are graphs of the prediction effect with prediction step size of 30 for the three models respectively. The dark grey lines represent the true value (TrueValue), the light grey lines represent the predicted value (Prediction), which represent the Acceleration value (left y-axis, Acceleration), and the dashed lines represent the Residual of the actual and predicted values (Residual, right y-axis, Res). ARIMA has a slight advantage in single-step prediction experiments (datasets 1 and 2) over both RNN and StackedLSTM neural networks, and its higher MSE is not as satisfactory in long-term prediction (

datasets

3, 4, 5, 6). The ARIMA and the neural network show two different characteristics, the neural network has better effect than the ARIMA in the aspect of long-term prediction and has more advantage in the aspect of short-term prediction. The prediction model effect based on the StackLSTM is superior to that of the RNN model, so that a better prediction effect can be obtained, and the StackLSTM is more suitable for long-term prediction. Structurally, the ARIMA is a linear model, and the neural network is a nonlinear model, so that the ARIMA can obtain good effect in a short time when the relationship is simple, but the fault data prediction model based on the StackedLSTM is very suitable for the scene when the complex association relationship is faced.

Claims

1. A fatigue factor recessive abnormality detection and fault diagnosis method based on LSTM is characterized by comprising the following steps:

s4, taking the Mean Square Error (MSE) of the vibration signal prediction model based on the LSTM acquired in the S2 as an initial fatigue factor threshold;

；

2. The LSTM-based fatigue factor latent anomaly detection and fault diagnosis method of claim 1, wherein the process of constructing the device vibration signal prediction model based on the LSTM deep neural network in step S2 comprises the following steps:

And standard deviation of

（1）

And validating the data set

；

（2）。

3. the LSTM-based fatigue factor latent anomaly detection and fault diagnosis method of claim 1, wherein the process of constructing the fault timing sequence data classification model based on the LSTM deep neural network in step S3 comprises the following steps:

s3-2: performing Empirical Mode Decomposition (EMD) on the time sequence sample data to obtain an m-dimensional vector containing IMF components

s3-3: numbering imf of n-type time sequence data to obtain model input data X_c

（3）

Determining the output data y of the model from the numbers_c

（4）

S3-4: normalizing the data and calculating the mean of the samples

And standard deviation of

Normalized data is

：

（5）

And validating the data set

；

S3-6: training the model, setting check points, and storing one for each EpochSecondary model parameters, adjust Epoch and Dropout parameters, and observe trainloss and valloss, using an early stop mechanism when an over-fit condition occurs;

s3-7: classifying by using model, inputting a segment of model input data X_cObtaining a classification vector y', then the fault number

4. The LSTM-based fatigue factor latent anomaly detection and fault diagnosis method of claim 1, wherein the step S5 of predicting the equipment production data by using the LSTM-based vibration signal prediction model specifically comprises:

predicting the value of time t using a prediction model

Calculated according to the actual value

Calculating a prediction error:

（6）

wherein

Added to the vector of prediction errors:

（7）

（8）

is a mean value

Is a standard deviation of wherein

Determined by equation (9):

（9）

wherein:

（10）

e in the formula (9)_seqIs composed of

A continuous sequence of (a);

is estimated by

（11）

Is normalized.

5. The LSTM-based fatigue factor latent anomaly detection and fault diagnosis method of claim 1, wherein the threshold is dynamically adjusted according to actual conditions in step S5

The method specifically comprises the following steps:

creating a new collection

Including all in descending order

Maximum value of

Then the maximum non-abnormal mean square error

Is added to

Are reclassified as normal values.