CN113312719B

CN113312719B - Rotary machine fault diagnosis method based on class unbalance weight cross entropy

Info

Publication number: CN113312719B
Application number: CN202110584346.0A
Authority: CN
Inventors: 王奇斌; 杨胜康; 孔宪光; 程涵; 余粼钖; 吉王辉
Original assignee: Xidian University
Current assignee: Xidian University
Priority date: 2021-05-27
Filing date: 2021-05-27
Publication date: 2023-02-10
Anticipated expiration: 2041-05-27
Also published as: CN113312719A

Abstract

The invention provides a rotary machine fault diagnosis method based on class unbalance weight cross entropy, which comprises the following steps: acquiring a training sample set and a testing sample set; (2) constructing a sparse self-encoder model; (3) carrying out iterative training on the sparse self-encoder model; (4) constructing a fault diagnosis model of the rotary machine; (5) Carrying out iterative training on the rotating machine fault diagnosis model; and (6) acquiring a fault diagnosis result of the rotary machine. The method takes the category unbalance weight cross entropy loss function as the classification loss function of the fault diagnosis model, eliminates the influence of the fault category unbalance on the precision of the fault diagnosis model through the category unbalance weight, improves the diagnosis precision of the fault diagnosis model, and realizes the fault diagnosis of the rotary machine.

Description

Rotary machine fault diagnosis method based on class unbalance weight cross entropy

Technical Field

The invention belongs to the technical field of machinery, relates to a rotary machine fault diagnosis method, and particularly relates to a rotary machine fault diagnosis method based on class imbalance weight cross entropy, which can be used for carrying out intelligent detection, state monitoring and equipment maintenance on rotary machines.

Background

The rotating machine is one of the most widely used mechanical devices in industrial equipment, including gears, cams, bearings and other machines capable of rotating, the stability and reliability of the rotating machine directly affect the industrial production activity, and the fault diagnosis of the rotating machine is very important. The fault diagnosis is a process of finding out a fault of a device or a system, in an actual production environment, a rotating machine normally operates for a long time, and a device fault does not occur frequently, so that a problem of serious data imbalance is often faced, and the fault diagnosis is difficult to be used under an actual production condition. The method for solving the data unbalance problem mainly comprises a traditional method and a learning-based method, wherein the traditional method changes data distribution from a data level and reduces the data unbalance degree. The learning-based method learns unbalanced data characteristics by using deep learning technologies of deep characteristic learning such as a deep confidence network, a deep self-encoder and a convolutional neural network, and performs fault diagnosis of class imbalance.

In recent years, learning-based methods have been widely used in the field of fault diagnosis. For example, the application publication number is CN 112364706A, the name is 'a small sample bearing fault diagnosis method based on class imbalance', and discloses a small sample bearing fault diagnosis method based on class imbalance, the method comprises the steps of firstly constructing a class imbalance data set, segmenting one-dimensional original vibration signals through a sliding window, and enhancing one sample into a plurality of small samples with similar characteristics; then, taking the plurality of enhanced samples as input, and extracting signal features by utilizing a deep convolutional neural network; and finally, fault classification is realized by combining an integrated learning classification algorithm. The method adopts a data enhancement processing method to enlarge the sample size, improve the utilization rate of sample points and improve the accuracy of fault diagnosis by adopting an integrated learning method, but has the defects that the sample is enhanced into a small sample with similar characteristics, so that the overfitting of a network is easily caused, more characteristics of the sample cannot be learned, the generalization capability of the network is reduced, and the fault diagnosis precision is lower.

Disclosure of Invention

The invention aims to overcome the defects in the prior art, and provides a rotary machine fault diagnosis method based on class unbalance weight cross entropy, which is used for solving the technical problem of low rotary machine fault diagnosis precision under unbalanced data in the prior art.

In order to achieve the purpose, the technical scheme adopted by the invention comprises the following steps:

(1) Acquiring a training sample set and a testing sample set:

(1a) Carrying out Fourier transform on I vibration signals of the rotating machinery under C fault categories acquired by an acceleration sensor to obtain a frequency domain vibration signal set S = { x = _i |1≤i≤I}，x _i Representing the ith frequency domain vibration signal, wherein C is more than or equal to 2,I and more than or equal to 50000;

(1b) Performing one-hot coding on the fault types corresponding to more than half of the frequency domain vibration signals in the frequency domain vibration signal set S, and taking more than half of the frequency domain vibration signals and the fault type labels corresponding to each signal as a training sample set X ₁ Taking the rest frequency domain vibration signals in the frequency domain vibration signal set S as a test sample set X ₂ Wherein, training sample set X ₁ And test sample set X ₂ Each fault category contains an unbalanced number of samples, i.e.

N _c A number of samples representing a class c fault category;

(2) Constructing a sparse self-encoder model H:

constructing K sparse autoencoders F = { F) including sequential connections ₁ ,F ₂ ,...,F _k ,...,F _K H, wherein each sparse self-encoder F _k Comprises an encoder and a decoder which are connected in sequence; the encoder and the decoder both adopt a full connection layer with an activation function sigmoid, wherein K is more than or equal to 3,F _k Denotes the kth network parameter as

The sparse self-encoder of (1);

(3) Performing iterative training on the sparse self-encoder model H:

(3a) Initializing the iteration number to be N, wherein the maximum iteration number is N, N is more than or equal to 200, and N =0;

(3b) Initializing the kth sparse autoencoder F _k Is X _k And let k =1,X _k ＝X ₁ ；

(3c) Mixing X _k As a sparse self-encoder modelInput of H, sparse autoencoder F _k The encoder in (1) encodes each training sample to obtain a code set X' _k ，F _k Of the decoder to the encoder of coding set X' _k Decoding to obtain a decoded set X ″) _k ；

(3d) Calculate X ″) _k And X _k Loss of mean square error of

And using a counter-propagating method by

Compute sparse autoencoder F _k Network parameter gradient of

Then using a gradient descent algorithm through F _k Network parameter gradient of

To F _k Network parameters of

Updating is carried out;

(3e) Judging whether N is more than or equal to N, if so, obtaining a trained sparse self-encoder F' _k And executing the step (3 f), otherwise, executing the step (3 c);

(3f) Judging whether K = K is true, if so, obtaining a trained sparse self-coding model H ', otherwise, enabling K = K +1, and simultaneously enabling X' _k Performing step (3 c) as an input to the (k + 1) th sparse self-encoder;

(4) Constructing a rotary machine fault diagnosis model O:

(4a) Constructing a structure of a rotary machine fault diagnosis model O:

constructing a rotary machine fault diagnosis model O comprising K encoders and a classifier which are sequentially connected, wherein the encoders adopt sparse self-encoders in a trained sparse self-encoder model H', and the classifier comprises a full connection layer and a softmax activation function output layer which are sequentially connected;

(4b) Defining a class imbalance weight cross entropy loss function J (theta) of a rotary machine fault diagnosis model O:

where Σ denotes a summation operation, 1[ ·]The function of the index is expressed,

and f _i Respectively representing the predictive label and the characteristic vector of the ith training sample, log (-) represents the logarithm operation based on a natural constant e,

transpose of classifier parameter vector, α, at the time of representing the ith training sample _c A class imbalance weight representing a class c fault;

(5) Performing iterative training on a rotating machine fault diagnosis model O:

(5a) The number of initialization iterations is M, the maximum number of iterations is M, M is more than or equal to 100, and the network parameters of the mth iteration classifier are

And let m =0;

(5b) Will train sample set X ₁ The forward propagation is carried out as the input of a fault diagnosis model O of the rotating machinery to obtain X ₁ Classification result of fault category

And a feature vector f, and using a class imbalance weight cross entropy loss function J (theta) by

And f calculating a loss value of the classifier

Then using a back propagation method and passing through the loss value

Calculating network parameter gradients for classifiers

Finally adopting gradient descent algorithm and passing

Network parameters to classifiers

Updating is carried out;

(5c) Judging whether M is greater than or equal to M, if so, obtaining a trained fault diagnosis model O' of the rotary machine, otherwise, enabling M = M +1, and executing the step (5 b);

(6) Acquiring a fault diagnosis result of the rotary machine:

(6a) Set X of test samples ₂ The forward propagation is carried out as the input of a trained rotary machine fault diagnosis model O' to obtain X ₂ Predictive labelset for failure categories

And look up in the index table

And the subscript corresponding to the medium maximum value corresponds to the fault category.

Compared with the prior art, the invention has the following advantages:

1. the method adopts the loss value of the classifier calculated by the class unbalanced weight cross entropy loss function, and updates the network parameters through the network parameter gradient calculated by the loss value, thereby avoiding the influence of class unbalanced data on the loss value of the classifier, further avoiding the influence of the class unbalanced data on the network parameters, and effectively improving the diagnosis precision of the rotary machine fault.

2. The sparse self-encoder model and the rotary machine fault diagnosis model constructed by the method both comprise a plurality of sparse self-encoders which are connected in sequence, and in the process of training the two models and acquiring the fault diagnosis result of the rotary machine, the sparse self-encoders can better improve the feature extraction capability of the rotary machine fault diagnosis model on the vibration signal, and further improve the fault diagnosis precision of the rotary machine.

Drawings

FIG. 1 is a flow chart of an implementation of the present invention;

fig. 2 is a comparison graph of the fault diagnosis precision of the rolling bearing of the present invention and the prior art.

Detailed Description

The invention is described in further detail below with reference to the figures and the specific embodiments.

Referring to fig. 1, the present invention includes the steps of:

step 1) obtaining a training sample set and a testing sample set:

step 1 a) performing Fourier transform on I vibration signals of the rotating machine under C fault categories acquired by an acceleration sensor to obtain a frequency domain vibration signal set S = { x = _i |1≤i≤I}，x _i The ith frequency domain vibration signal is represented, wherein C is not less than 2,I not less than 50000, in the example, the rotating machinery adopts a rolling bearing, C =12, I =50000, and Fourier transform is adopted to obtain the frequency domain information of the vibration signal, so that the sparse self-encoder model can extract more useful features to a certain extent, and the Fourier transform formula is as follows:

where F (ω) represents a frequency domain vibration signal, [ integral ] represents an integral operation, [ infinity ] represents infinity, F (t) represents a time domain vibration signal, [ omega ] represents frequency, t represents time, e represents ^-jωt Represents the complex function, dt represents the derivative of t;

step 1 b) to the frequency domainPerforming one-hot coding on fault categories corresponding to three-quarter frequency domain vibration signals in the vibration signal set S, and taking the three-quarter frequency domain vibration signals in the S and fault category labels corresponding to each signal as a training sample set X ₁ Taking the rest frequency domain vibration signals in the frequency domain vibration signal set S as a test sample set X ₂ Wherein, training sample set X ₁ And test sample set X ₂ In which the number of samples per fault category is not equal, i.e. in which

N _c The number of samples representing class c fault category, in this example, 6 unbalanced cases of category are set, as shown in table 1:

TABLE 1

Class of failure	Case1	Case 2	Case 3	Case 4	Case 5	Case 6
							C1	200	200	200	200	200	200
C2	200	180	170	160	150	180
							C3	200	180	170	160	150	180
C4	200	180	170	160	150	180
							C5	200	180	170	160	150	160
C6	200	180	170	160	150	160
							C7	200	180	170	160	150	160
C8	200	180	170	160	150	150
							C9	200	180	170	160	150	150
C10	200	180	170	160	150	150
							C11	200	180	170	160	150	150
C12	200	180	170	160	150	150

Step 2), constructing a sparse autoencoder model H:

constructing K sparse autoencoders F = { F) including sequential connections ₁ ,F ₂ ,...,F _k ,...,F _K A sparse autoencoder model of each sparse autoencoder F _k Comprises an encoder and a decoder which are connected in sequence; the encoder and the decoder both adopt a full connection layer with an activation function sigmoid, wherein K is more than or equal to 3,F _k Denotes the kth network parameter as

In this example, K =3, the 3 sparse autoencoders connected in sequence can better extract the depth characteristics of the frequency domain vibration signal, and the hyper-parameters of the sparse autoencoder model H include the network node number, the learning rate, the penalty coefficient, and the like, which are set as shown in table 2:

TABLE 2

Step 3), carrying out iterative training on the sparse self-encoder model H:

step 3 a) initializing the iteration number to be N, wherein the maximum iteration number is N, N is more than or equal to 200, and N =0, in this example, N =200;

step 3 b) initializing the kth sparse autoencoder F _k Is X _k And let k =1,X _k ＝X ₁ ；

Step 3 c) converting X _k Sparse autoencoder F as input to sparse autoencoder model H _k The encoder in (1) encodes each training sample to obtain a code set X' _k ，F _k Decoder of (2) encodes encoder set X' _k Decoding to obtain a decoded set X ″) _k ；

Step 3 d) calculating X ″ _k And X _k Loss of mean square error of

And using a counter-propagating method by

Compute sparse autoencoder F _k Network parameter gradient of

To F _k Network parameters of

Updating is performed, wherein X ″' is calculated _k And X _k Loss of mean square error of

Compute sparse autoencoder F _k Network parameter gradient of

To F _k Network parameters of

Updating, wherein the calculation formula and the updating formula are respectively as follows:

wherein p represents a training sample set X ₁ The number of training samples in the training sequence,

in order to represent the derivation of the partial derivatives,

sparse autoencoder F at time n +1 of representation iteration _k The network parameters of (a) are set,

sparse autoencoder F at time representing iteration nth _k The network parameters of (a) are set,

for sparse self-encoders F _k The learning rate of (c);

step 3 e) judging whether N is more than or equal to N, if so, obtaining a trained sparse self-encoder F' _k And executing step 3 f), otherwise, executing step 3 c);

step 3 f) judging whether K = K is true, if so, obtaining a trained sparse self-coding model H ', otherwise, enabling K = K +1, and simultaneously enabling X' _k Performing step 3 c) as an input to the (k + 1) th sparse self-encoder;

step 4), constructing a fault diagnosis model O of the rotating machinery:

step 4 a) constructing a structure of a fault diagnosis model O of the rotary machine:

the method comprises the following steps of constructing a rotary machine fault diagnosis model O comprising K encoders and a classifier which are connected in sequence, wherein the encoders adopt sparse self-encoders in trained sparse self-encoder models H', the classifier comprises a full-connection layer and a softmax activation function output layer which are connected in sequence, and in the example, the hyper-parameters of the rotary machine fault diagnosis model O comprise learning rate, the number of network nodes and the like, and are set as shown in a table 3:

TABLE 3

Number of network nodes	200-100-50-25-12
		Learning rate	0.0005
Learning rate attenuation coefficient	0.99

Step 4 b), defining a category unbalance weight cross entropy loss function J (theta) of the rotary machine fault diagnosis model O:

where Σ represents the summation operation, 1[ ·]The function of the index is expressed,

and f _i Respectively representing the prediction label and the feature vector of the ith training sample, log (-) represents a logarithm operation based on a natural constant e,

step 5) iterative training is carried out on the rotary machine fault diagnosis model O:

step 5 a) initializing the iteration times to be M, the maximum iteration times to be M, M is more than or equal to 100, and the network parameters of the mth iteration classifier are

Let M =0, M =100 in this example;

step 5 b) training sample set X ₁ The forward propagation is carried out as the input of a fault diagnosis model O of the rotating machinery to obtain X ₁ Classification result of fault category

And f calculating a loss value of the classifier

Then using a back propagation method and passing through the loss value

Calculating network parameter gradients for classifiers

Finally, the network parameters of the classifier are subjected to gradient pair by adopting a gradient descent algorithm through the network parameter gradient of the classifier

Performing an update in which a loss value of the classifier is calculated

Calculating network parameter gradients for classifiers

Network parameters of classifier

The update formulas of (a) and (b) are respectively:

α _c ＝exp(-A _c )

wherein A is _c Representing the prediction accuracy rate of the rotary machine fault diagnosis model O on the class c fault category, n _c The number of samples representing that the rotating machinery fault diagnosis model O correctly predicts that the training samples belong to the class c fault category, exp (-) represents exponential operation with e as base,

network parameter, l, representing the classifier at time m +1 of iteration _C Representing the learning rate of the classifier.

In the above steps, the category imbalance weight cross entropy loss function is used as a classification loss function of the fault diagnosis model, and the influence of the fault category imbalance on the classifier loss is eliminated through the category imbalance weight;

step 5 c) judging whether M is larger than or equal to M, if so, obtaining a trained rotary machine fault diagnosis model O', otherwise, enabling M = M +1, and executing the step (5 b);

step 6) obtaining a fault diagnosis result of the rotary machine:

set X of test samples ₂ The forward propagation is carried out as the input of a trained rotary machine fault diagnosis model O' to obtain X ₂ Predictive labelset for failure categories

And look up in the index table

The technical effects of the invention are further explained by combining simulation experiments as follows:

1. simulation conditions and contents:

the hardware platform of the simulation experiment is as follows: the central processing unit is Intel (R) Core (TM) i5-7500 CPU, the main frequency is 3.40GHZ, and the memory is 16G.

The software platform of the simulation experiment is as follows: WINDOWS 7 operating system and Python 3.7.

The time domain vibration signals used in the simulation experiment are all from the bearing time domain vibration signals collected by the bearing accelerated life test bench PRONOSTIA. The platform consists of three parts: the device comprises a driving module, a load module and a data acquisition module. The main function of the test device is to provide signals of different fault types, the main components of the test device comprise a driving motor, a torque sensor and a dynamometer, the power of the driving motor is 1.2Kw, and the maximum rotating speed is 6000r/min. The bearing model is 6205-2RS JEM SKF, an acceleration sensor (DYTRAN 3035B) is arranged near the driving end, and the sampling frequency is 12kHz. The working conditions are as follows: rotation speed 1800rpm, load 4000N. The test bearing mainly comprises four fault states of a normal state, a roller defect (BD), an outer ring defect (OR) and an inner ring defect (IR). Using electric discharge machining to introduce a single point fault into the test bearings, with fault diameters of 0.007, 0.014, 0.021 and 0.028 inches, for four size types, a total of 12 fault categories of rolling bearing time domain vibration signals were obtained including different fault conditions, different fault diameter sizes and different fault orientations, as shown in table 4:

TABLE 4

Position of	Diameter (inches)	Direction	Label (R)
				Is normal	0	-	1
Roller	0.007	-	2
				Roller	0.014	-	3
Roller	0.021	-	4
				Inner ring	0.007	-	5
Inner ring	0.014	-	6
				Inner ring	0.021	-	7
Outer ring	0.007	Vertical 3 o' clock direction	8
				Outer ring	0.007	Center 6 o' clock direction	9
Outer ring	0.014	Center 6 o' clock direction	10
				Outer ring	0.021	Vertical 3 o' clock direction	11
Outer ring	0.021	Center 6 o' clock direction	12

The rolling bearing fault diagnosis precision of the normalized CNN based on data imbalance of the invention and the prior art is compared and simulated, and the result is shown in FIG. 2.

2. And (3) simulation result analysis:

the simulation experiment of the invention is to classify 6 different cases listed in table 1 by respectively adopting the method of the invention and the prior art (based on the normalized CNN rolling bearing fault diagnosis of data imbalance) and compare the results.

The prior art normalized CNN rolling bearing fault diagnosis Based on data imbalance refers to a rolling bearing fault diagnosis method proposed in the paper published by Bo ZHao et al, "Intelligent fault diagnosis of rolling bearings Based on normalized CNN conditioning data and variable working conditions, knowledge-Based Systems, volume 199,8July 2020, 105971".

In order to evaluate the effect of the invention, the accuracy of the classification results of 6 cases in the simulation experiment of the invention is respectively calculated by using the following formula of evaluation indexes (classification accuracy Acc), and the calculation results are drawn as table 5, wherein the expression of Acc is as follows:

in the formula (I), the compound is shown in the specification,

a prediction tag, y, representing the jth test sample _j The actual label representing the jth test sample, J representing the total number of samples in the test sample set.

TABLE 5

Method	Case1	Case 2	Case 3	Case 4	Case 5	Case 6
							The invention	0.9958	0.9892	0.9908	0.9742	0.9867	0.9850
Prior Art	0.9850	0.9793	0.9775	0.9741	0.9686	0.9680

Referring to fig. 2, the abscissa represents different cases, and the ordinate represents the accuracy, and it can be seen from fig. 2 that the accuracy of the classification result of the method of the present invention is higher than that of the prior art in 6 cases compared with the prior art, and is significantly improved in case 3, case 5 and case 6, which proves that the present invention can better perform fault diagnosis and simultaneously improve the fault diagnosis accuracy under the condition of unequal sample numbers of each fault category.

The above simulation experiments show that: the sparse self-encoder model and the rotary machine fault diagnosis model constructed by the method both comprise a plurality of sparse self-encoders which are connected in sequence, in the process of training the two models and acquiring the fault diagnosis result of the rotary machine, the sparse self-encoders can better improve the feature extraction capability of the rotary machine fault diagnosis model on vibration signals, a category unbalance weight cross entropy loss function is used as a classification loss function of the fault diagnosis model, the influence of fault category unbalance on classification loss is eliminated through the category unbalance weight, the diagnosis precision of the fault diagnosis model is improved, and the technical problem of low rotary machine fault diagnosis precision under unbalanced data in the prior art is solved.

Claims

1. A rotary machine fault diagnosis method based on class imbalance weight cross entropy is characterized by comprising the following steps:

(1) Acquiring a training sample set and a testing sample set:

(1a) Fourier transformation is carried out on I vibration signals of the rotating machinery under C fault categories acquired through the acceleration sensor, and a frequency domain vibration signal set S = { x = (x) } is obtained _i |1≤i≤I}，x _i Representing the ith frequency domain vibration signal, wherein C is more than or equal to 2,I and more than or equal to 50000;

(1b) Performing one-hot coding on fault categories corresponding to more than half of frequency domain vibration signals in the frequency domain vibration signal set S, and taking more than half of frequency domain vibration signals and fault category labels corresponding to each signal thereof as a training sample set X ₁ Taking the rest frequency domain vibration signals in the frequency domain vibration signal set S as a test sample set X ₂ Wherein, training sample set X ₁ And a test specimenCollection X ₂ Each fault class contains an unbalanced number of samples, i.e.

N _c A number of samples representing a class c fault category;

(2) Constructing a sparse self-encoder model H:

The sparse self-encoder of (1);

(3) Performing iterative training on the sparse self-encoder model H:

(3c) X is to be _k Sparse autoencoder F as input to sparse autoencoder model H _k The encoder in (1) encodes each training sample to obtain a code set X' _k ，F _k Of the decoder to the encoder of coding set X' _k Decoding to obtain a decoded set X ″) _k ；

(3d) Calculate X ″) _k And X _k Loss of mean square error of

And using a counter-propagating method by

Compute sparse autoencoder F _k Network parameter gradient of

To F _k Network parameters of

Updating is carried out;

(4) Constructing a rotary machine fault diagnosis model O:

(4a) Constructing a structure of a rotary machine fault diagnosis model O:

And let m =0;

And f calculating a loss value of the classifier

Then using a back propagation method and passing through the loss value

Calculating network parameter gradients for classifiers

Finally, a gradient descent algorithm is adopted andby passing

Network parameters to classifiers

Updating is carried out;

(6) Acquiring a fault diagnosis result of the rotary machine:

And look up in the index table

2. The method for diagnosing faults of rotating machinery based on class imbalance weight cross entropy as claimed in claim 1, wherein the step (3 d) calculates X ″ " _k And X _k Loss of mean square error of

Compute sparse autoencoder F _k Network parameter gradient of

To F _k Network parameters of

in order to represent the derivation of the partial derivatives,

for sparse self-encoders F _k The learning rate of (2).

3. The method for diagnosing faults of rotating machinery based on class-C imbalance weight cross entropy as claimed in claim 1, wherein the class-C imbalance weight α of the fault in step (4 b) _c The calculation formula is as follows:

α _c ＝exp(-A _c )

where exp (. Cndot.) denotes the exponential operation with e as base, A _c Representing the prediction accuracy rate of the rotary machine fault diagnosis model O on the class c fault category, n _c And the number of samples representing that the rotating machinery fault diagnosis model O correctly predicts that the training samples belong to the class c fault category.