CN112507785A

CN112507785A - Rolling bearing fault analysis based on CNN and LSTM

Info

Publication number: CN112507785A
Application number: CN202011205524.6A
Authority: CN
Inventors: 刘瑞军; 章博华; 王俊; 张伦
Original assignee: Beijing Technology and Business University
Current assignee: Beijing Technology and Business University
Priority date: 2020-11-02
Filing date: 2020-11-02
Publication date: 2021-03-16

Abstract

The present application discloses a rolling bearing fault classification method based on CNN and LSTM. Acquire bearing vibration data and preprocess it to construct a dataset. Use Butterworth Filter to remove noise on the processed vibration data, and perform Fast Fourier Transform (FFT) to convert the preprocessed time domain signal into frequency domain signal. The CNN network is used for learning to obtain the image features of the time domain map and the frequency domain map, and the image feature fusion is performed through the Add layer. The image features learned by CNN are input into the LSTM network, the time series features contained in the features are learned through the LSTM network, and the classification function is realized through a fully connected layer and the Softmax function. Using the trained network to classify the test samples has important practical significance and practical value for the early detection and analysis of weak faults of rolling bearings.

Description

Rolling bearing fault analysis based on CNN and LSTM

Technical Field

The application belongs to the technical field of machine learning and fault identification, and particularly relates to a rolling bearing fault analysis method of a deep learning model CNN and a deep learning model LSTM.

Background

The rolling bearing fault diagnosis technology is an important research subject in the field of mechanical fault diagnosis. The train is widely applied to public trips as a complex mechanical device. The mechanical system of the train breaks down, the operation of a traffic system is interfered, and the safety of personnel can be endangered in serious cases, so that huge economic loss is caused. The rolling bearing is a core part of the train. According to statistics, 30% of mechanical equipment faults are caused by bearing faults, and the method has important practical significance and practical value for early detection and analysis of weak faults of the rolling bearing. Different fault types have different signal characteristics, and the vibration signal can intuitively reflect the health state of the bearing, so the application is wide. The signal characteristic main packet expression forms are divided into two types: time domain and frequency domain, however, the failure characteristics in the vibration signal cannot be fully represented by the time domain analysis or the frequency domain analysis alone. In recent years, a Convolutional Neural Network (CNN) is widely used in the fields of image processing, voice recognition, failure analysis, and the like. In bearing fault diagnosis, one-dimensional time sequence data or a two-dimensional time-frequency graph is used as input, training is carried out through a neural network, and fault characteristics are extracted, so that the fault can be analyzed in a noisy working condition scene.

And Jing and the like learn characteristics from the frequency spectrum of the vibration signal by using CNN (CNN), and realize the health state monitoring of the gearbox. Qu and the like provide a one-dimensional convolutional neural network fault diagnosis algorithm, and self-adaptive feature extraction and fault diagnosis based on a deep network are realized. SE, etc. trains CNN from the frequency spectrum of the motor using frequency components through fast fourier transforms. Zhuang et al uses a Long Short-Term Memory (LSTM) network to extract time characteristics of time series data through a gate structure, thereby enhancing the generalization capability of the model. The integrated deep learning method for multi-bearing residual service life collaborative prediction is provided by combining time domain and frequency domain characteristics of Ren and the like, and a good prediction effect is obtained. Liu uses a CNN + LSTM model and is mapped into a service life index through an activation layer, and then the service life of the bearing is predicted.

Although the CNN model can sufficiently extract spatial features of data, it cannot extract temporal features of data. Under the condition of large data quantity, the LSTM network is difficult to extract the nonlinear features of the data, the data features are not extracted sufficiently, and the convergence speed is slow. And compared with a model taking the time domain features or the frequency domain features as input, the model combining the time domain features and the frequency domain features can show more excellent classification effect.

Disclosure of Invention

The invention aims to provide a rolling bearing fault analysis method based on CNN and LSTM. The method fully utilizes the spatial feature extraction capability of CNN and the time sequence feature learning capability of LSTM, fully extracts the relation between the vibration image and time dependence, and carries out classification recognition and fault judgment on the vibration feature of the rolling bearing through the full-connection layer and the softmax layer, thereby enhancing the classification precision. And compared with the traditional RNN, the LSTM solves the problem of gradient disappearance and can reduce the difficulty of model training.

According to an aspect of the present application, there is provided a CNN and LSTM-based bearing fault analysis method, including:

and acquiring bearing vibration data, and processing the respiratory frequency signal of the vibration data containing noise by adopting a Butterworth filter.

Wherein the Butterworth filter has the formula:

where n is the order of the filter, ω_cThe frequency at which the amplitude drops to-3 db is taken for the cut-off frequency. The default order of a filter of the filter is 2, although a high-order Butterworth filter can realize clearer roll-off near a cut-off frequency, the high-order Butterworth filter can also cause serious signal distortion and influence the precision of a result, and experiments find that the performance of the filter with the order of 1 is better, namely a first-order Butterworth filter is used for processing vibration data containing noise;

converting the description of the preprocessed signal from the time domain to the frequency domain by a Fast Fourier Transform (FFT);

the FFT calculates the spectrum of the signal x (k) using discrete signals. Wherein the formula of the FFT is:

namely, the FFT algorithm can reduce the calculation amount of frequency domain conversion and improve the conversion speed, and the time complexity is o (nolg)₂n)。

And taking the time domain graph and the frequency domain graph obtained after processing as input, respectively training through a preset CNN network, carrying out convolution operation on the data on the one-dimensional time axis by the CNN, and moving along the time t axis of the time domain signal and the time frequency graph to extract image characteristics. CNN performs feature extraction by 3 convolutional layers. The pooling layer is positioned behind each convolution layer to reduce the dimension of the feature map, wherein the time feature of the data is reserved by reducing the complexity of output and preventing overfitting of the data by using the maximum pooling operation;

wherein the maximum pooling layer calculation formula is:

w and d are the preset length and width of the maximum pooling filter.

The time domain graph and the frequency domain graph obtained by CNN learning are subjected to feature fusion through the add layer, the dimensionality of the image is not increased, the information amount under each dimension is increased, and the time feature can be reserved by performing feature fusion through the add layer. Taking the fusion characteristics as the input of a long-time memory network layer;

inputting the fusion characteristics obtained after the add layer characteristics are fused into a long-time memory network layer, connecting the long-time memory network layer and the short-time memory network layer in a serial mode, and taking data containing time sequence as input to obtain space-time characteristics; the long-time memory network layer is internally provided with 3 gates: forget gate, input gate, output gate, the update formula of every moment t gate is as follows:

forget door f_tThe purpose of this is to let the LSTM network forget information that was previously useless:

f_t＝σ(W_f·[h_t-1,x_t]+b_f)

input door i_tThe role of (2) is to determine the input information of the LSTM network:

c_t′＝tanh(W_c·[h_t-1,x_t]+b_c)

c_t＝f_t*c_t-1+i_ic_t′

output gate o_tThe role of (a) is to determine the outcome of the neuron:

o_t＝σ(W_o[h_t-1,x_t]+b_o)

h_t＝o_t*tanh(c_t)

wherein, W_f、W_i、W_c、W_oWeight matrix b of forgetting gate, input gate and output gate_f、b_i、b_c、b_oIs its bias term; h is_t-1The state of the hidden layer at the time t-1; σ is a logistic function with an output of (0, 1); x is the number of_tIs the input vector at the time t; h is_t-1The state of the previous moment; tanh is the activation function.

And taking the acquired space-time characteristics as input, mapping the space-time characteristics to a sample mark space in a full connection layer, and obtaining a classification probability result through subsequent softmax layer operation to identify and classify the rolling bearing faults.

Wherein the formula of the softmax function is:

denotes the sample vector x when there are K linear functions^TProbability of belonging to class j.

Drawings

Some specific embodiments of the present application will be described in detail hereinafter by way of illustration and not limitation with reference to the accompanying drawings. The same reference numbers in the drawings identify the same or similar elements or components. Those skilled in the art will appreciate that the drawings are not necessarily drawn to scale. In the drawings:

FIG. 1 is a schematic flow chart of a fault analysis method of a rolling bearing based on CNN and LSTM in the method of the present application;

fig. 2 is a schematic structural block diagram of a rolling bearing fault analysis method based on CNN and LSTM according to an embodiment of the present application.

Fig. 3 is a computing device provided in an embodiment of the present application.

Fig. 4 is a computer-readable storage medium according to an embodiment of the present application.

Detailed Description

Fig. 1 is a schematic flow chart of a CNN and LSTM-based rolling bearing fault analysis method according to an embodiment of the present application. Referring to fig. 1, a method and a system for analyzing a fault of a rolling bearing based on CNN and LSTM provided in an embodiment of the present application may include:

step S1: denoising the processed vibration data by using a Butterworth Filter (Butterworth Filter), and converting the preprocessed time domain signals into frequency domain signals by using Fast Fourier Transform (FFT);

step S2: learning by using a CNN (CNN network) to obtain image characteristics of a time domain graph and a frequency domain graph;

step S3: performing image feature fusion through the add layer;

step S4: inputting the fusion image characteristics obtained by the add layer into an LSTM network, and further learning the time sequence characteristics contained in the characteristics through the LSTM network;

step S5: the classification function is realized through a full connection layer and a Softmax function, and the trained network is used for carrying out fault classification on the test sample;

the invention aims to provide a rolling bearing fault analysis method based on CNN and LSTM. Firstly, noise filtering is carried out on a data set, vibration data are represented by a time domain graph and a time frequency graph, and feature extraction is better carried out. The vibration characteristics of the time-frequency graph and the time-domain graph are respectively extracted by utilizing the space characteristic extraction capability of the CNN, the characteristics are fused, the relation between the vibration characteristics and the time dependence is fully extracted by utilizing the time sequence characteristic learning capability of the LSTM, the vibration characteristics of the rolling bearing are classified and identified and the fault is judged through the full connection layer and the Softmax layer, and the classification precision is enhanced.

The source of an experimental data set adopted by the method is CWRU (Kaiser Sichu university bearing data center), and the data set is a rolling bearing fault data set which is most widely used internationally at present. The data set records actual test conditions for the motor and bearing fault conditions, using Electrical Discharge Machining (EDM) techniques to implant faults into the motor bearings. Faults ranging from 0.007 inches to 0.040 inches in diameter were introduced on the bearing inner race, the rolling elements and the bearing outer race, respectively. The failed bearing was reinstalled into the test motor and the bearing experiment recorded vibration data of 0 to 3 horsepower (motor speed 1797 to 1720RPM) at 12,000 samples/second and 48,000 samples/second.

And S1, acquiring bearing vibration data, and processing the respiratory frequency signal of the vibration data containing noise by adopting a Butterworth filter.

Wherein the Butterworth filter has the formula:

where n is the order of the filter, ω_cThe frequency at which the amplitude drops to-3 db is taken for the cut-off frequency. The default order of the filter is 2, although the high-order Butterworth filter can realize clearer roll-off near the cut-off frequency, the high-order Butterworth filter can cause serious signal distortion and influence the precision of the result, and experiments find that the performance of the filter with the order of 1 is better, namely the filter with the first orderThe Butterworth filter processes the vibration data containing noise;

the FFT calculates the spectrum of the signal x (k) using discrete signals. Where the FFT can be expressed as:

namely, the FFT algorithm can reduce the calculation amount of frequency domain conversion and improve the conversion speed, and the time complexity is o (nlog)₂n)。

And S2, taking the processed time domain graph and frequency domain graph as input, respectively training through a preset CNN network, carrying out convolution operation on the data on the one-dimensional time axis by the CNN, and moving along the time t axis of the time domain signal and the time frequency graph to extract image characteristics. CNN performs feature extraction by 3 convolutional layers. The pooling layer is positioned behind each convolution layer to reduce the dimension of the feature map, wherein the time feature of the data is reserved by reducing the complexity of output and preventing overfitting of the data by using the maximum pooling operation;

wherein the maximum pooling layer calculation formula is:

w and d are the preset length and width of the maximum pooling filter.

S3, carrying out feature fusion on the time domain graph and the frequency domain graph obtained by CNN learning through the add layer, wherein the dimensionality of the image is not increased, the information content under each dimension is increased, and the time feature can be reserved by carrying out feature fusion through the add layer. Taking the fusion characteristics as the input of a long-time memory network layer;

s4, inputting the fusion characteristics obtained by fusing the characteristics of the Add layers into a long-short time memory network layer, connecting the layers of the long-short time memory network in a series mode, and taking data containing time sequence as input to obtain space-time characteristics; the long-time memory network layer is internally provided with 3 gates: forget gate, input gate, output gate, the update formula of every moment t gate is as follows:

f_t＝σ(W_f·[h_t-1,x_t]+b_f)

c_t′＝tanh(W_c·[h_t-1,x_t]+b_c)

c_t＝f_t*c_t-1+i_ic_t′

output gate o_tThe role of (a) is to determine the outcome of the neuron:

o_t＝σ(W_o[h_t-1,x_t]+b_o)

h_t＝o_t*tanh(c_t)

And S5, taking the acquired space-time characteristics as input, mapping the space-time characteristics to a sample mark space in a full connection layer, and obtaining a classification probability result through subsequent softmax layer operation to identify and classify the rolling bearing faults.

Wherein the formula of the softmax function is:

The embodiment of the present application in fig. 3 also provides a computing device comprising a memory 320, a processor 310 and a computer program stored in said memory 320 and executable by said processor 310, the computer program being stored in a space 330 for program code in the memory 320, the computer program, when executed by the processor 310, implementing the method steps 331 for performing any of the methods according to the present invention.

The embodiment of the application in fig. 4 also provides a computer-readable storage medium. The computer readable storage medium comprises a storage unit for program code provided with a program 331' for performing the steps of the method according to the invention, which program is executed by a processor.

The embodiment of the application also provides a computer program product containing instructions. Which, when run on a computer, causes the computer to carry out the steps of the method according to the invention.

In the above embodiments, the implementation may be wholly or partially realized by software, hardware, firmware, or any combination thereof. When implemented in software, may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When loaded and executed by a computer, cause the computer to perform, in whole or in part, the procedures or functions described in accordance with the embodiments of the application. The computer may be a general purpose computer, a special purpose computer, a network of computers, or other programmable device. The computer instructions may be stored in a computer readable storage medium or transmitted from one computer readable storage medium to another, for example, from one website site, computer, server, or data center to another website site, computer, server, or data center via wired (e.g., coaxial cable, fiber optic, Digital Subscriber Line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.). The computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device, such as a server, a data center, etc., that incorporates one or more of the available media. The usable medium may be a magnetic medium (e.g., floppy Disk, hard Disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.

Those of skill would further appreciate that the various illustrative components and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that the various illustrative components and steps have been described above generally in terms of their functionality in order to clearly illustrate this interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.

It will be understood by those skilled in the art that all or part of the steps in the method for implementing the above embodiments may be implemented by a program, and the program may be stored in a computer-readable storage medium, where the storage medium is a non-transitory medium, such as a random access memory, a read only memory, a flash memory, a hard disk, a solid state disk, a magnetic tape (magnetic tape), a floppy disk (floppy disk), an optical disk (optical disk), and any combination thereof.

The above description is only for the preferred embodiment of the present application, but the scope of the present application is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present application should be covered within the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims

1. A rolling bearing fault classification method based on CNN and SLTM comprises the following steps:

acquiring and preprocessing bearing vibration data to construct a data set;

denoising the processed vibration data by using a Butterworth Filter (Butterworth Filter), and converting the preprocessed time domain signals into frequency domain signals by using Fast Fourier Transform (FFT);

the CNN network is used for learning to obtain the image characteristics of a time domain graph and a frequency domain graph, and the image characteristics are fused through an Add layer;

inputting image features obtained by CNN learning into an LSTM network, learning time sequence features contained in the features through the LSTM network, and realizing a classification function through a full connection layer and a Softmax function;

and carrying out fault classification on the test sample by using the trained network.

2. The method of claim 1, wherein the breathing rate signal is processed using a Butterworth filter (Butterworth filter) on noisy vibration data.

Wherein the Butterworth filter has the formula:

where n is the order of the filter, ω_cThe frequency at which the amplitude drops to-3 db is taken for the cut-off frequency. The default order of the filter is 2, although a high-order Butterworth filter can realize clearer roll-off near a cut-off frequency, the high-order Butterworth filter can cause serious signal distortion and influence the precision of a result, and experiments find that the performance of the filter with the order of 1 is better, namely the first-order Butterworth filter is used for processing vibration data containing noise.

3. The method of claim 1, wherein the description of the preprocessed signal is converted from the time domain to the frequency domain by a Fast Fourier Transform (FFT).

4. The method according to claim 1, wherein the time domain graph and the frequency domain graph obtained after the processing are used as input and are respectively trained through a preset CNN network, and the CNN only performs convolution operation on data on a one-dimensional time axis and moves along the time t axis of the time domain signal and the time frequency graph to extract image features. CNN performs feature extraction by 3 convolutional layers. The pooling layer is positioned behind each convolution layer to reduce the dimension of the feature map, wherein the time feature of the data is reserved by reducing the complexity of output and preventing overfitting of the data by using the maximum pooling operation;

wherein the maximum pooling layer calculation formula is:

w and d are the preset length and width of the maximum pooling filter.

5. The method according to claim 1 and claim 4, wherein the time domain graph and the frequency domain graph obtained by CNN learning are subjected to feature fusion through an add layer, the dimensionality of the image is not increased, the information amount under each dimension is increased, and the time feature can be retained by performing feature fusion through the add layer. And taking the fusion characteristics as the input of the long-time and short-time memory network layer.

6. The method according to claim 1 and claim 5, characterized in that the fused feature obtained by fusing the add layer features is input to the long-short term memory network layer, the layers of the long-short term memory network are connected in series, and the time-space feature is obtained by using the data containing time sequence as input; the long-time memory network layer is internally provided with 3 gates: forget gate, input gate, output gate, the update formula of every moment t gate is as follows:

f_t＝σ(W_f·[h_t-1,x_t]+b_f)

c_t′＝tanh(W_c·[h_t-1,x_t]+b_c)

c_t＝f_t*c_t-1+i_ic_t′

output gate o_tThe role of (a) is to determine the outcome of the neuron:

o_t＝σ(W_o[h_t-1,x_t]+b_o)

h_t＝o_t*tanh(c_t)

7. The method according to claim 1, characterized in that the acquired spatiotemporal features are used as input, mapped to a sample mark space in a full connection layer, and a classification probability result is obtained through subsequent softmax layer operation, so as to identify and classify the rolling bearing faults.

Wherein the formula of the softmax function is:

8. The method of any one of claims 1 to 7, wherein the data set used in the method is a CWRU (university of Kaiser storage bearing data centre) data set in a laboratory environment.