CN113935460B

CN113935460B - Intelligent diagnosis method for mechanical faults under unbalanced-like data set

Info

Publication number: CN113935460B
Application number: CN202111136682.5A
Authority: CN
Inventors: 王俊; 戴俊; 石娟娟; 江星星; 姚林泉
Original assignee: Suzhou University
Current assignee: Suzhou University
Priority date: 2021-09-27
Filing date: 2021-09-27
Publication date: 2023-08-11
Anticipated expiration: 2041-09-27
Also published as: CN113935460A; WO2023044979A1

Abstract

The application discloses an intelligent diagnosis method for mechanical faults under unbalanced data sets, which comprises the following steps: step (1), data preprocessing: converting the mechanical vibration signal to the frequency domain and normalizing the amplitude to the [0,1] range; step (2), building a model: combining an automatic encoder with a generation countermeasure network, and building a data generation model; step (3), model training: training the data generation model by using fault data according to a preset loss function and an optimization algorithm; step (4), data generation: the low-dimensional characteristics of fault data learned in training by the data generation model are utilized, and fault data of corresponding classes are generated after repeated interpolation and noise addition, so that various data balances are realized; step (5), fault diagnosis: training a preset fault diagnosis model by using the class balance data set, and performing intelligent diagnosis on mechanical faults by using the trained fault diagnosis model. And (3) utilizing an automatic encoder to generate the combination of the countermeasure network to realize mechanical fault diagnosis.

Description

Intelligent diagnosis method for mechanical faults under unbalanced-like data set

Technical Field

The application relates to the field of intelligent fault diagnosis, in particular to an intelligent mechanical fault diagnosis method under an unbalanced data set.

Background

As rotary mechanical devices continue to develop in an intelligent, precise, and complex direction, the structure of the mechanical devices is becoming more complex and compact. In the service process of the mechanical equipment, once a certain part fails, the operation of the whole mechanical equipment can be influenced, and even safety accidents are caused. In order to ensure healthy operation of mechanical equipment, deep learning theory is gradually applied to intelligent diagnosis of mechanical faults as the latest research results in the fields of pattern recognition and machine learning. Compared with the traditional fault diagnosis method, the intelligent diagnosis model based on deep learning utilizes the deep network model to adaptively extract effective fault characteristics from signals, has high diagnosis efficiency and does not depend on the signal processing experience of operators, and is widely paid attention to.

Models currently in common use in intelligent diagnosis of mechanical faults include Convolutional Neural Networks (CNNs), deep Belief Networks (DBNs), residual networks (res net), and the like. In the training process of the models, a large amount of historical data sets are often required to be input as training samples, so that the corresponding relation between the data and the health state category is established. Although the mechanical equipment fails to bring great potential safety hazard to the operation of the equipment, the failure is an occasional event, and the equipment cannot operate for a long time in a failure state, so that the normal state data are more, the failure state data are less, and the class imbalance problem of a data set is caused. The imbalance of the normal class and the fault class presents great difficulty and challenges for identifying the mechanical health state, and the unbalanced class data set easily causes the performance of the diagnosis model to be reduced, namely, the model is easy to be over-fitted to normal signals with a large number of samples, and under-fit to fault signals with a small number of samples. In addition, due to the rarity of fault samples, the model can easily learn some redundant and even irrelevant features in the process of extracting fault data features, and the features reduce the generalization capability of the model.

In order to solve the problem of performance degradation of the intelligent diagnosis model of mechanical faults caused by unbalance of the class, a dynamic weight method and a data generation method are commonly used. The dynamic weighting method gives more attention to a smaller number of faulty samples by adjusting the weight parameters in the network, thereby improving the under-fitting problem for the faulty samples. The data generation rule is to generate a new sample of the same type by using a small amount of fault data to expand the fault sample, balance the fault data with normal data, and train an intelligent diagnosis model by using the balanced data set. The traditional data generation method includes a few types of synthesis upsampling technology (SMOTE), adaptive synthesis sampling (ADASYN) and the like.

The traditional technology has the following technical problems:

in the intelligent diagnosis of mechanical faults under an actual unbalanced data set, the dynamic weighting method needs to dynamically adjust the weights according to the unbalance rate between normal and fault samples, so the method is suitable for being applied to the condition that the unbalance rate is known. And when the data is in extreme class unbalance, the dynamic weighting method is easily interfered by redundant features in a small number of fault samples, so that the model is over-fitted, and the accuracy of fault diagnosis is reduced. The data generation method enables various types in the data set to reach balance through upsampling a small number of fault signals, and fundamentally solves the phenomenon of unbalanced types. However, the mechanical structure is complex, the vibration signal of the vibration sensor is nonlinear, the vibration signal of the vibration sensor has stronger background noise under actual working conditions, and the vibration sensor shows obvious nonstationary characteristics under fault conditions. The traditional data generation method does not learn the distribution characteristics of the data, directly generates signals in time domain signals through interpolation technology, is easily interfered by measurement noise components, has low quality of the generated data, and is also easy to cause performance degradation of an intelligent diagnosis model.

Disclosure of Invention

Aiming at the problems that the application scene of a dynamic weight method is limited, the traditional data generation method is easy to be interfered by noise and the quality of generated data is low, the application provides a novel data generation method, which is based on a deep neural network, learns the low-dimensional distribution characteristic of fault signals through the combination of an automatic encoder and a generation countermeasure network, generates fault data according to the low-dimensional characteristic, thereby avoiding the interference of noise, obtaining high-quality generated data and realizing high-performance intelligent diagnosis of mechanical faults by utilizing the balanced data set.

In order to solve the technical problems, the application provides an intelligent diagnosis method for mechanical faults under an unbalanced-like data set, which comprises the following steps:

step (1), data preprocessing: converting the mechanical vibration signal to the frequency domain and normalizing the amplitude to the [0,1] range;

step (2), building a model: combining an automatic encoder with a generation countermeasure network, and building a data generation model;

step (3), model training: training the data generation model by using fault data according to a preset loss function and an optimization algorithm;

step (4), data generation: the low-dimensional characteristics of fault data learned in training by the data generation model are utilized, and fault data of corresponding classes are generated after repeated interpolation and noise addition, so that various data balances are realized;

step (5), fault diagnosis: training a preset fault diagnosis model by using the class balance data set, and performing intelligent diagnosis on mechanical faults by using the trained fault diagnosis model.

In one embodiment, in step (2), the automatic encoder is composed of an encoder and a decoder, the generation countermeasure network is composed of a generator and a discriminator, and the decoder is the generator; the automatic encoder learns the low-dimensional characteristics of the input data, namely the true data, and outputs the generated data, namely the false data, consistent with the distribution characteristics of the input data through the decoder by the low-dimensional characteristics and class labels thereof; and the discriminator in the generated countermeasure network respectively carries out true and false discrimination and category classification on the input data and the generated data.

In one embodiment, the encoder, the decoder, and the arbiter each comprise a construction by one of a deep convolutional network, a deep confidence network, and a residual network.

In one embodiment, in step (3), the preset loss function includes a mean square error loss function between the generator generated data and the encoder input data, a cross entropy classification loss function of the arbiter for true and false data, a waserstein distance or a binary cross entropy loss function of the arbiter for true and false authentication of the data, and a mean square error loss function between the encoder output feature and an intermediate implicit feature of the arbiter.

In one embodiment, in the step (3), the preset optimization algorithm includes, but is not limited to, one of a random gradient descent method (SGD), a random gradient descent with Momentum (Momentum), a Nesterov Momentum method, an adagard algorithm, and an adaptive moment estimation method (Adam).

In one embodiment, in step (4), the interpolation is performed in different low-dimensional features of the same class of fault samples, the class of labels are embedded before generating the fault data, and the added noise is low-amplitude random noise.

In one embodiment, in step (5), the preset fault diagnosis model includes one of a support vector machine, a k-nearest neighbor algorithm, a random forest, a fuzzy system, or a deep neural network.

Based on the same inventive concept, the present application also provides a computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, said processor implementing the steps of any one of the methods when executing said program.

Based on the same inventive concept, the present application also provides a computer-readable storage medium, on which a computer program is stored, which program, when being executed by a processor, implements the steps of any of the methods.

Based on the same inventive concept, the present application also provides a processor for running a program, wherein the program runs to execute the method of any one of the above.

The application has the beneficial effects that:

compared with the prior art, the application discloses an intelligent diagnosis method for mechanical faults under an unbalanced-like data set. Aiming at the problem of diagnosis precision reduction caused by unbalanced data sets in mechanical fault diagnosis, the method provides a novel data generation method, the characteristic mining capability of deep learning and an countermeasure training mechanism are utilized to learn the data distribution characteristics of a small number of fault samples, interpolation and noise adding are utilized to generate novel characteristics in a low-dimensional characteristic space of data, and after labels are embedded, the novel fault samples are obtained through a generator. The influence of measurement noise in the signal can be eliminated by interpolation in a low-dimensional space, the diversity of generated samples can be improved by adding random noise, and the consistency of the generated samples and the data distribution of the same-category fault samples can be ensured by embedding the labels. Thus, the method has at least the following advantages: (1) The method can learn the low-dimensional distribution characteristics of the data and eliminate the interference of measurement noise; (2) The generated data has consistency with the same type of fault data, and has certain diversity, so that the quality of the generated data is high; and (3) the accuracy of intelligent recognition of mechanical faults is high.

Drawings

FIG. 1 is a flow chart of a method for intelligent diagnosis of mechanical faults under an unbalanced data set of the present application.

Fig. 2 is a comparison chart of generated data and real data of four fault types obtained in the mechanical fault intelligent diagnosis method under the unbalanced data set, wherein the left side is the real data under four fault states, and the right side is the generated data corresponding to the real data.

FIG. 3 is a graph showing the classification accuracy of the intelligent diagnosis method of mechanical failure under the unbalanced data set and the conventional method under five unbalanced rates.

Detailed Description

The present application will be further described with reference to the accompanying drawings and specific examples, which are not intended to be limiting, so that those skilled in the art will better understand the application and practice it.

The intelligent diagnosis method for mechanical faults under the unbalanced data set is shown in the flow chart of fig. 1, and the technology specifically comprises the following steps:

step 101: and (5) preprocessing data. The vibration signal is fourier transformed, the mechanical vibration signal is converted to the frequency domain, and the amplitude is normalized to the [0,1] range.

Step 102: and (5) building a model. Combining the automatic encoder with the generation countermeasure network to build a data generation model.

The automatic encoder consists of an encoder and a decoder, the generation countermeasure network consists of a generator and a discriminator, and the decoder is the generator. The automatic encoder learns the low-dimensional characteristics of the input data (true data) through the encoder, and then outputs the generated data (false data) consistent with the distribution characteristics of the input data through the decoder through the low-dimensional characteristics and class labels thereof. The discriminator in the generation countermeasure network performs true and false discrimination and category classification on the input data and the generated data, respectively.

The encoder, decoder, arbiter include, but are not limited to, being constructed by one of a deep convolutional network, a deep belief network, a residual network.

Step 103: and (5) model training. And training data to generate a model according to a preset loss function and an optimization algorithm by using the fault data.

The loss function of the data generation model in the training process comprises the following steps:

1) The generator generates a mean square error loss function between the data and the encoder input data. Optimizing the loss function may ensure consistency of the generated data and the input data distribution characteristics.

2) The discriminator classifies the loss function for the cross entropy of the true and false data. Optimizing the cross entropy classification loss function of the real data can improve the classification capacity of the discriminator on the real data; optimizing the cross entropy classification loss function of the false data can improve the class discrimination capability of the discriminator for the generated data and the learning capability of the generator for class features, and ensure the feature consistency between the generated data in the same class and the feature difference between the generated data in different classes.

3) The discriminator discriminates the data true and false from the Wasserstein distance or the binary cross entropy loss function. Optimizing the loss function may further improve the quality of the data generated by the generator and the discriminatory power of the arbiter.

4) A mean square error loss function between the encoder output feature and the intermediate implicit feature of the arbiter. Optimizing the loss function may improve consistency of the extracted features of the same class of data by the generator and the arbiter.

Through optimizing the loss functions, the generated data is finally close to the data distribution of the input data of the same class, the discriminator is difficult to discriminate the true and false of the generated data and the input data of the same class, and the balance is achieved between the generator and the discriminator, so that the training of the data generation model is completed.

Preset optimization algorithms include, but are not limited to, one of random gradient descent (SGD), momentum-loaded random gradient descent (Momentum), nestrov Momentum, adagard algorithm, adaptive moment estimation (Adam).

Step 104: and (5) generating data. And generating corresponding fault data by utilizing the low-dimensional characteristics of the fault data learned by the data generation model in training through repeated interpolation and noise addition, so as to realize the balance of various data.

Interpolation is performed in different low-dimensional features of the same class of fault samples, the class of labels are embedded before generating fault data, and the added noise is low-amplitude random noise.

Step 105, fault diagnosis. Training a preset fault diagnosis model by using the class balance data set, and performing intelligent diagnosis on mechanical faults by using the trained fault diagnosis model.

The preset fault diagnosis model comprises one of a support vector machine, a k nearest neighbor algorithm, a random forest, a fuzzy system and a deep neural network, but is not limited to the support vector machine, the k nearest neighbor algorithm and the fuzzy system.

In order to more clearly understand the technical scheme and the effect of the present application, the following detailed description is provided with reference to a specific embodiment.

Taking intelligent diagnosis of gear box faults as an example, building a planetary gear box fault simulation test platform, and manually setting four fault states respectively: five health states including broken teeth, missing teeth, tooth root cracks and tooth surface abrasion and normal states are added. An acceleration sensor is arranged on the planetary gear box to collect vibration signals of the gear box, and the sampling frequency is 5kHz. Each health state contained 2000 sets of signals, 1000 of which did not participate in training as test data, each set of signals being 2048 data points in length. In order to verify the effectiveness of the intelligent diagnosis method for mechanical faults under the unbalanced data set, 5 unbalanced rates are set in the example, namely the ratio of the number of healthy samples of the gear box to the number of the fault samples of each type is 5:1, 10:1, 20:1, 50:1 and 100:1, and the data size of the healthy samples under each unbalanced rate is 1000.

The technology disclosed by the application is adopted to process the 5 groups of unbalanced data sets, the steps are shown in fig. 1, and the detailed information is as follows.

And (3) preprocessing data. The vibration signal is fourier transformed, the mechanical vibration signal is converted to the frequency domain, and the amplitude is normalized to the [0,1] range. The length of the original time domain signal is 2048 data points, and 1024-length frequency domain signals are taken as input data of the model after Fourier transformation.

And (2) building a model. Combining an automatic encoder with a generation countermeasure network, and constructing a data generation model, wherein the specific embodiment is as follows:

(1) automatic encoder: including encoders and decoders, which function primarily to encode and decode input data. The encoder adopts a four-layer one-dimensional convolutional neural network structure, the dimensions of each layer are 8, 16, 32 and 64, a convolutional kernel with the length of 15 is adopted, the convolutional layers are connected with a LeakyReLU activation function layer, and a sample outputs a 64-dimensional potential feature vector after passing through the encoder. The decoder adopts a four-layer one-dimensional deconvolution neural network structure, each layer of dimension is 64, 32, 16 and 8, a deconvolution core with the length of 15 is adopted, a ReLU activation function layer is connected between deconvolution layers, the last deconvolution layer of the decoder is connected to a Sigmoid activation function, and the amplitude of generated data is limited in the range of [0,1 ].

(2) Generating an antagonizing network: comprising a generator and a discriminator. The generator is the decoder in the automatic encoder. The discriminator designs four one-dimensional convolution layers and two full-connection layers, wherein each layer has dimensions of 8, 16, 32 and 64, a convolution kernel with the length of 15 is adopted, a LeakyRelu activation function layer is connected between each convolution layer, and the convolution layers finally output feature vectors with the length of 64 dimensions. The feature is then input into two fully connected layers, respectively, the first fully connected layer reduces the 64-dimensional feature vector to 1-dimensional for calculation of the wasperstein distance between the generated data and the real data. The second fully-connected layer reduces the 64-dimensional feature vector to 4-dimensional (i.e. the number of fault categories requiring up-sampling) and connects with the Softmax active layer to judge the category of the signal.

And (3) training a model. And training data to generate a model according to a preset loss function and an optimization algorithm by using the fault data. In this embodiment there is a 4-part loss function:

(1) the generator generates a mean square error loss function between the data and the encoder input data;

(2) classifying loss functions of the cross entropy of the true and false data by the discriminator;

(3) a Wasserstein distance function for discriminating the true and false of the data by the discriminator;

(4) a mean square error loss function between the encoder output feature and the intermediate implicit feature of the arbiter.

After the loss functions of all the parts are added, the back propagation is carried out through a root mean square transfer algorithm (RmsPorp), and the discriminator and the automatic encoder are optimized in sequence. And repeatedly executing the model training, and after the model training is iterated for 2000 times, the model loss tends to be balanced, and ending the network training.

And (4) generating data. The low-dimensional characteristics of fault data learned in training by the data generation model are utilized, and fault data of corresponding classes are generated after repeated interpolation and noise addition, so that various data balances are realized;

the training samples of the same class are input into an encoder, and potential feature vectors of input data are acquired. Then selecting similar feature vectors for interpolation, in the embodiment, selecting a feature vector by adopting a K nearest neighbor method, selecting one feature vector from low-dimensional feature vectors, finding out 3 nearest neighbor vectors, and then selecting one from adjacent vectors for vector interpolation. After interpolation amplification, adding 0.02 times of standard Gaussian white noise to the newly acquired vector, and embedding a label of a sample into the noisy vector to realize amplification of the potential feature vector. And finally, inputting the processed feature vector into a decoder to generate a new sample. Fig. 2 shows a comparison of the generated signals of the four fault types and the real signals, and it can be seen that the generated signals follow the distribution rule of the real signals, and have a certain difference.

And (5) fault diagnosis. Training a preset fault diagnosis model by using the class balance data set, and performing intelligent diagnosis on mechanical faults by using the trained fault diagnosis model.

The fault diagnosis model selects a support vector machine, and the input data of the fault diagnosis model is 6 main features of each data sample extracted by using a principal component analysis method. Firstly training a support vector machine by adopting a class balance data set, and then testing the classification accuracy of the trained support vector machine by adopting test set data (each class data amount is 1000). FIG. 2 shows the classification accuracy obtained after training the support vector machine by using the class balance data set obtained by the method and the synthetic minority class upsampling technology respectively, and also shows the classification accuracy obtained without adopting the data generation method. Under different unbalance rates, the method and the synthetic minority class upsampling technology provided by the application can improve the classification accuracy of the classifier, and the method provided by the application can obtain the highest classification accuracy, so that the quality of generated data obtained by the data generation method provided by the application is high, and the performance of the classifier is improved.

In summary, the present application can learn the data distribution characteristics of a small number of failure samples by combining the automatic encoder with the generation countermeasure network, using the feature mining capability of deep learning and the countermeasure training mechanism. In addition, potential characteristics are generated by interpolation and noise addition in a low-dimensional space, and data are generated by a decoder, so that the anti-interference capability and the data quality of measured noise can be improved, and the performance of intelligent diagnosis of mechanical faults is improved.

The above-described embodiments are merely preferred embodiments for fully explaining the present application, and the scope of the present application is not limited thereto. Equivalent substitutions and modifications will occur to those skilled in the art based on the present application, and are intended to be within the scope of the present application. The protection scope of the application is subject to the claims.

Claims

1. An intelligent diagnosis method for mechanical faults under an unbalanced-like data set is characterized by comprising the following steps:

step (2), building a model: combining an automatic encoder with a generation countermeasure network, and building a data generation model, wherein the automatic encoder learns low-dimensional characteristics of input data, namely true data, through the encoder, and then outputs generated data, namely false data, consistent with the distribution characteristics of the input data through a decoder by the aid of the low-dimensional characteristics and class labels thereof; the discriminator in the generated countermeasure network respectively carries out true and false discrimination and category classification on the input data and the generated data;

step (3), model training: training the data generation model by using fault data according to a preset loss function and an optimization algorithm, wherein the preset loss function comprises a mean square error loss function between the generated data of the generator and the input data of the encoder, a cross entropy classification loss function of a discriminator on true and false data, a Wasserstein distance or a binary cross entropy loss function of the discriminator on true and false discrimination of the data, and a mean square error loss function between an output characteristic of the encoder and an intermediate implicit characteristic of the discriminator;

step (4), data generation: the fault data low-dimensional characteristics learned by the data generation model in training are utilized, corresponding classes of fault data are generated through repeated interpolation and noise adding, so that various types of data balance is realized, the interpolation is carried out in different low-dimensional characteristics of the same class of fault samples, labels of the class are needed to be embedded before the fault data are generated, and the added noise is low-amplitude random noise, and the method comprises the following steps: inputting training samples of the same class into an encoder to obtain potential feature vectors of input data; then selecting similar feature vectors for interpolation, selecting a feature vector by adopting a K nearest neighbor method, selecting one feature vector from the low-dimensional feature vector, finding out 3 nearest neighbor vectors, and then selecting one from the nearest neighbor vectors for vector interpolation; after interpolation amplification, adding 0.02 times of standard Gaussian white noise to the newly acquired vector, and embedding a label of a sample into the noisy vector to realize amplification of potential feature vectors; finally, inputting the processed feature vector into a decoder to generate a new sample;

2. The intelligent diagnosis method for mechanical failure under unbalanced-like data set of claim 1, wherein in the step (2), the automatic encoder is composed of an encoder and a decoder, the generation countermeasure network is composed of a generator and a discriminator, and the decoder is the generator.

3. The method of intelligent diagnosis of mechanical failure under an unbalanced-like data set of claim 2, wherein the encoder, the decoder and the arbiter are each constructed by one of a deep convolutional network, a deep belief network, and a residual network.

4. The intelligent diagnosis method for mechanical failure under unbalanced-like data set of claim 1, wherein in the step (3), the preset optimization algorithm comprises one of a random gradient descent method, a random gradient descent with momentum, a Nesterov momentum method, an adagard algorithm and an adaptive moment estimation method.

5. The intelligent diagnosis method for mechanical failure under unbalanced-like data set of claim 1, wherein in step (5), the preset failure diagnosis model comprises one of a support vector machine, a k-nearest neighbor algorithm, a random forest, a fuzzy system or a deep neural network.

6. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of the method according to any one of claims 1 to 5 when the program is executed by the processor.

7. A computer readable storage medium, on which a computer program is stored, characterized in that the program, when being executed by a processor, implements the steps of the method according to any one of claims 1 to 5.

8. A processor for running a program, wherein the program when run performs the method of any one of claims 1 to 5.