CN117171544B

CN117171544B - Motor vibration fault diagnosis method based on multichannel fusion convolutional neural network

Info

Publication number: CN117171544B
Application number: CN202311143804.2A
Authority: CN
Inventors: 谢颖; 王丽婧; 赵加明; 李道璐
Original assignee: Harbin University of Science and Technology; Qiqihar University
Current assignee: Harbin University of Science and Technology; Qiqihar University
Priority date: 2023-09-06
Filing date: 2023-09-06
Publication date: 2024-04-12
Anticipated expiration: 2043-09-06
Also published as: CN117171544A

Abstract

A motor vibration fault diagnosis method based on a multichannel fusion convolutional neural network relates to the technical field of motor fault diagnosis. The invention aims to solve the problem of low accuracy of the existing method for judging the fault of the rotary machine by adopting a single signal. The motor vibration fault diagnosis method based on the multichannel fusion convolutional neural network is characterized in that a one-dimensional vibration signal of a diagnosed motor is collected, the one-dimensional vibration signal is decomposed into a plurality of PF components by adopting a local mean decomposition method, each PF component is converted into an M multiplied by N two-dimensional matrix, the obtained two-dimensional matrix is connected with two-dimensional data of the one-dimensional vibration signal in channel dimension, a three-dimensional input matrix with multiple channels is obtained, and the three-dimensional input matrix is input into a trained multichannel deep convolutional neural network to obtain a diagnosis result.

Description

Motor vibration fault diagnosis method based on multichannel fusion convolutional neural network

Technical Field

The invention belongs to the technical field of motor fault diagnosis.

Background

In practical application of the motor, due to a severe working environment, an overload running state and a complex electromagnetic relationship, some fault hidden dangers often occur, for example: stator winding turn-to-turn short circuit, rotor broken bar, air gap eccentricity, bearing abrasion and the like. For most rotating equipment, either in a normal or in a fault condition, a certain degree of vibration is generated during operation. The vibration signal can effectively represent the fault characteristics of equipment, such as the conditions of asymmetric shafting, loose connection of parts, damage to a rotor bearing and the like. Therefore, the acquisition and analysis of vibration signals has become one of the fault diagnosis methods commonly used in the field of rotary machines. However, at present, it is difficult to achieve an ideal diagnostic effect with a single signal processing method.

Disclosure of Invention

The invention aims to solve the problem of low accuracy of the existing method for judging the fault of the rotary machine by adopting a single signal, and provides a motor vibration fault diagnosis method based on a multichannel fusion convolutional neural network.

A motor vibration fault diagnosis method based on a multichannel fusion convolutional neural network specifically comprises the following steps:

collecting one-dimensional vibration signals of a motor to be diagnosed, decomposing the one-dimensional vibration signals into a plurality of PF components by adopting a local mean decomposition method, converting each PF component into an MxN two-dimensional matrix, connecting the obtained two-dimensional matrix with two-dimensional data of the one-dimensional vibration signals in channel dimensions to obtain a three-dimensional input matrix with multiple channels, and inputting the three-dimensional input matrix into a trained multichannel deep convolutional neural network to obtain a diagnosis result;

the multichannel deep convolutional neural network includes: the device comprises a plurality of feature extraction modules, a full-connection module and a Softmax module, wherein the feature extraction modules are serially connected step by step and used for carrying out feature extraction on input signals until obtaining features meeting the precision requirement, and the feature extraction modules comprise a multi-channel weighted fusion module, a multi-scale feature extraction module and a multi-dimensional attention module;

the multi-channel weighted fusion module is used for extracting the characteristics of the three-dimensional input matrix through a plurality of characteristic extraction channels containing different convolution kernels, and carrying out weighted fusion on the characteristics output by the same convolution layer of the different channels to obtain a fusion result;

the multi-scale feature extraction module is used for extracting features of the fusion result through feature extraction branches 3*3 and 7*7 respectively, performing concat operation on the extraction results of the two branches and the fusion result, and finally performing dimension reduction on the concat operation result through 1*1 convolution to obtain a dimension reduction result;

the multidimensional attention module is used for carrying out 1*1 convolution operation on the dimension reduction result, inputting the operation result into a Softmax function, multiplying the operation result by attention weight coefficients of the width dimension, the height dimension and the channel dimension respectively, multiplying the sum of the three products by the dimension reduction result, and then adding the sum of the three products with the dimension reduction result to obtain a three-dimensional attention feature;

the full-connection module comprises two full-connection layers, wherein the two full-connection layers are used for carrying out nonlinear adjustment on the three-dimensional attention characteristics to obtain an adjustment result;

the Softmax module is used for carrying out Softmax operation on the adjustment result to obtain a diagnosis result.

Further, in the multi-channel weighted fusion module, after the feature extraction is performed on the three-dimensional input matrix by the plurality of feature extraction channels, the output obtained by each channel is:

wherein,for the output of the first convolution layer in the jth channel,/>For the ith input in the ith convolutional layer of the first-1 layer in the jth channel,/th channel>K is the total number of inputs, which is the convolution kernel of the first-1 layer of convolution layers in the jth lane.

Further, in the multi-channel weighted fusion module, the characteristics output by the same convolution layer of different channels are weighted and fused by the following formula:

wherein X is ^l For the output of the first convolution layer, f _l-1 []Representing the activation function of the layer 1 convolutional layer,weights for the first-1 layer convolution layer in the jth lane, +.>For the offset vector of the first-1 layer convolution layer in the jth lane, +.>Z is the total number of channels for the output of the first-1 layer convolution layer in the jth channel.

Further, the operation formula of the multi-scale feature extraction module is as follows:

m ₁ ＝c ₃₂ (c ₃₁ m+b ₃₁ )+b ₃₂ ，

m ₂ ＝c ₇₂ (c ₇₁ m+b ₇₁ )+b ₇₂ ，

M＝Cat(m ₁ ,m ₂ ,m)，

Q＝c _f M+b _f ，

wherein m is a fusion result obtained by the multi-channel weighted fusion module, and m is ₁ And m ₂ Extracting branch extracted features 3*3 and 7*7 respectively, c ₃₁ And c ₃₂ B is the convolution kernel weight of the first and second 3*3 ₃₁ And b ₃₂ C respectively ₃₁ And c ₃₂ Offset of c ₇₁ And c ₇₂ Convolution kernel weights, b, for the first and second 7*7, respectively ₇₁ And b ₇₂ C respectively ₇₁ And c ₇₂ M is the result of the concat operation, cat () represents the concat operation, Q is the result of dimension reduction, c _f Convolution kernel weight for 1*1, b _f Convolutionally offset for 1*1.

Further, the operation formula of the multidimensional attention module is as follows:

wherein X is _out For the three-dimensional attention feature output by the multi-dimensional attention module, X is the input of the multi-dimensional attention module, soft max () represents Softmax function, f () represents convolution operation, C, H, W represents width, height, channel of X and has i=c, H, W, Z, respectively _i Is characteristic of i, R _i Sum [ for the weight of the feature of i]Representing an addition operation.

Further, the training process of the multichannel deep convolutional neural network is as follows:

extracting historical one-dimensional vibration signals of a motor, amplifying, decomposing each amplified one-dimensional vibration signal into a plurality of PF components by adopting a local mean decomposition method, converting each PF component into an M multiplied by N two-dimensional matrix, connecting the two-dimensional matrix with two-dimensional data of one-dimensional vibration signals corresponding to the two-dimensional matrix in a channel dimension, and taking a three-dimensional input matrix corresponding to each one-dimensional vibration signal as a sample to establish a sample set;

marking a fault label on samples containing faults in the sample set, and dividing the samples in the sample set into a training set and a testing set;

training the multichannel deep convolutional neural network model by utilizing samples in a training set to obtain model parameters;

substituting the model parameters into the multichannel deep convolutional neural network model, testing by using a testing set so as to judge the accuracy of the multichannel deep convolutional neural network model, if the accuracy is met, completing training, otherwise, retraining the multichannel deep convolutional neural network model.

Furthermore, the historical one-dimensional vibration signal is amplified by adopting an overlapping sampling mode.

Further, the number of the amplified historical one-dimensional vibration signals is Y, and there are:

wherein L is the length of the historical one-dimensional vibration signalThe degree, n is the sampling step size,representing a rounding down.

Further, the ratio of the number of samples in the training set to the number of samples in the test set is 7:3.

Further, in the training process of the multichannel deep convolutional neural network, the loss function is obtained by performing cross entropy calculation on the output vector of Softmax and the actual label of the sample, and the calculation formula of the loss function is as follows:

wherein Loss is Loss, y' _f The f-th value, y, of the actual label of the sample _f The f-th value of the vector is output for Softmax.

Compared with the prior art, the technical method of the invention has the beneficial effects that:

(1) The invention provides a multichannel convolutional neural network mechanism which utilizes a multichannel dynamic receptive field to comprehensively extract multi-scale abstract features of vibration signals.

(2) The invention provides a multi-feature extraction module which can grasp the features of different layers of a target area, so that the diversity of the features is increased, and important semantic information is prevented from being lost; the three-dimensional attention mechanism improves the attention degree of the network to fault characteristics and improves the sensitivity of the whole network to fault differences.

(3) The invention provides a method for extracting motor fault characteristics in a two-dimensional image converted from vibration signals by using a multichannel fused deep neural network, and improves the motor fault diagnosis and identification precision.

Drawings

FIG. 1 is a schematic diagram of a motor vibration signal fault detection network based on multi-channel input;

FIG. 2 is a schematic diagram of a multi-scale feature extraction module structure;

FIG. 3 is a schematic diagram of a three-dimensional attention module structure;

FIG. 4 is a graph of training process loss and accuracy variation;

fig. 5 is a bar graph of accuracy versus experimental condition for each network.

Detailed Description

The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention. It should be noted that, without conflict, the embodiments of the present invention and features of the embodiments may be combined with each other.

At present, a single signal processing method is difficult to realize an ideal diagnosis effect. Thus, there is a need for further feature extraction and identification of the processed signals. In recent years, a deep learning method has made remarkable progress in the field of mechanical failure diagnosis. The advantage of deep learning is its strong feature learning ability and end-to-end training pattern. Through the structure of the deep neural network, more complex and abstract feature representation can be learned, so that the accuracy and the robustness of fault diagnosis are improved. The convolutional neural network has unique advantages in the aspects of local feature learning and mode classification, is convenient for processing a large amount of data, can adaptively learn signal features, reduces dependence on expert knowledge, realizes automation and intellectualization of fault diagnosis, and has a deep network structure capable of accurately measuring high-dimensional data.

Referring to fig. 1 to 5, a specific description is given of a motor vibration fault diagnosis method based on a multi-channel fusion convolutional neural network according to the present embodiment, where a required multi-channel fusion convolutional neural network is trained first, specifically as follows:

step one: and extracting a historical one-dimensional vibration signal of the motor and amplifying the historical one-dimensional vibration signal in an overlapping sampling mode. Assuming that the length of the historical one-dimensional vibration signal is L, the sampling step length is n, and the number of amplified one-dimensional vibration signals is Y, the calculation formula of the number of the available sub-signals is as follows:

wherein the method comprises the steps ofRepresenting a rounding down.

In practical application, the method uses 8 motor vibration data fault types including turn-to-turn short circuit, air gap eccentricity, rotor broken bar, bearing seat damage, bearing abrasion and the like. Data enhancement operation is performed to finally obtain 16000 samples, each sample containing 1024 sampling points.

TABLE 1 failure type

Each fault signal is converted into a two-dimensional matrix, the original vibration signal comprises 1024 sampling points, and the size of the two-dimensional matrix after conversion is 32 x 32. Similarly, each PF component is also converted into a two-dimensional matrix, and connected to the two-dimensional matrix of the original vibration signal in the channel dimension, to finally generate the data format of 6×32×32 required by CNN.

Step two: and decomposing each amplified one-dimensional vibration signal into a plurality of PF components by adopting a local mean decomposition method, converting each PF component into an MxN two-dimensional matrix, connecting the two-dimensional matrix with two-dimensional data of the corresponding one-dimensional vibration signal in the channel dimension, and taking the three-dimensional input matrix corresponding to each one-dimensional vibration signal as a sample to establish a sample set. And marking a fault label for samples containing faults in the sample set, wherein the fault label comprises a plurality of motor vibration data fault types in total, such as turn-to-turn short circuit, air gap eccentricity, rotor broken bars, bearing pedestal damage, bearing abrasion and the like. 1000 samples/class of fault data are taken, 2000 samples after the enhancement operation, each sample containing 1024 sampling points. And then dividing the samples in the sample set into a training set and a testing set, wherein the ratio of the number of the samples in the training set to the number of the samples in the testing set is 7:3.

In this step, the Local Mean Decomposition (LMD) method is used to decompose the nonlinear and unstable vibration signals into a Product Function (PF) with physical meaning for a plurality of instantaneous frequencies. The local mean decomposition method adaptively decomposes a complex non-stationary multi-component signal into the sum of product functions with physical significance of a plurality of instantaneous frequencies, wherein each PF component consists of the product of a pure frequency modulation signal and an envelope signal, and can show the distribution rule of signal energy in a time-frequency domain. The envelope signal is the instantaneous amplitude of the PF component, and the instantaneous frequency of the PF component can be directly obtained from the pure FM signal, and further, the instantaneous amplitude and the instantaneous frequency of all the PF components are combined, thereby obtaining the complete time-frequency distribution of the original signal.

In the LMD process, the average value m of adjacent local average value points is calculated first _i Smoothing to obtain local mean function m _ij And envelope estimation function a _ij . Then, separating the local mean function from the original vibration signal to obtain h _ij (t) and demodulating to obtain s _ij (t). If s is _ij (t) is identified as a pure FM signal according to the instantaneous amplitude function a _i (t) calculating the corresponding PF component PF _i (t) residual Signal u _i (t). This process is repeated until the residual signal u _i (t) becomes a monotonic function, indicating that the decomposition is complete and all PF components are obtained.

And each PF component after the vibration signal is decomposed is correspondingly converted into an MxN two-dimensional matrix, and meanwhile, the two-dimensional matrices are connected with the two-dimensional data of the original vibration signal in the channel dimension to form a three-dimensional input matrix with a plurality of channels. The processing mode can enhance the characteristic expression of the vibration signal in the space dimension and obtain the final three-dimensional input matrix Xin epsilon R ^c*M*N 。

Step three: establishing a multi-channel deep convolutional neural network (MCI-CNN) model as in fig. 1, the multi-channel deep convolutional neural network comprising: the system comprises a multi-channel weighted fusion module, a multi-scale feature extraction module, a multi-dimensional attention module, a full connection module and a Softmax module.

The Multichannel weighted fusion module (Multichannel-weighted fusion input, MCI): the feature of the vibration signal after data conversion has larger difference of the feature of the component decomposed by the same vibration signal, so that the feature extraction is independently carried out by adopting different convolution kernels to form different receptive fields in each channel by combining the features of the signals of each channel, the size of the receptive fields changes along with the change of the convolution kernels, the difference of the receptive fields can be realized by changing the size of the convolution kernels, and the two have the following relationship:

rf _n ＝(rf _n-1 )*s _n-1 +k _n-1 ，

rf _last ＝k _last ，

rf _n-1 and rf _n The receptive fields of the n-1 layer and the n-layer convolution pooling layer are respectively the size, rf _last The receptive field size, s, for the last convolutional pooling layer _n-1 Compensation, k, for the n-1 layer convolutional pooling layer _n-1 Convolving the kernel, k, of the pooling layer for the n-1 th layer convolution _last The convolution kernels of the layers are pooled for the last layer of convolution.

After feature extraction is performed on the three-dimensional input matrix through different convolution kernels, the output obtained by each channel after feature extraction is performed on the three-dimensional input matrix through a plurality of feature extraction channels is as follows:

wherein,for the output of the first convolution layer in the jth channel,/>For the ith input in the ith convolutional layer of the first-1 layer in the jth channel,/th channel>Is the j th passThe convolution kernel of the layer 1 convolution layer in the track, k, is the total number of inputs.

The convolution kernels of the channels are different, so that a dynamic receptive field is obtained. After feature extraction based on dynamic receptive fields, each channel acquires the same number of feature maps, which then need to be effectively fused. The characteristics output by the same convolution layer of different channels are weighted and fused through the following steps:

The Multi-scale feature extraction module (Multi-feature extraction module, MF). The feature extraction double branches of 3*3 and 7*7 are arranged, so that the network can extract features of different levels, and detail feature loss is avoided. The residual structure is used for merging the input of the module and the characteristics extracted by the receptive fields through concat operation, so that the network extracts richer semantic information, the problem of network degradation is solved, the generation of gradient explosion is effectively prevented, and the deep network can be effectively learned. And finally, performing dimension reduction operation on the output characteristics through 1*1 convolution to obtain a dimension reduction result. The multi-scale feature extraction module is shown in fig. 2. The operation formula of the multi-scale feature extraction module is as follows:

m ₁ ＝c ₃₂ (c ₃₁ m+b ₃₁ )+b ₃₂ ，

m ₂ ＝c ₇₂ (c ₇₁ m+b ₇₁ )+b ₇₂ ，

M＝Cat(m ₁ ,m ₂ ,m)，

Q＝c _f M+b _f ，

wherein m is a fusion result obtained by the multi-channel weighted fusion module, and m is ₁ And m ₂ The feature extraction branches 3*3 and 7*7 respectively extract features,

c ₃₁ and c ₃₂ B is the convolution kernel weight of the first and second 3*3 ₃₁ And b ₃₂ C respectively ₃₁ And c ₃₂ Offset of c ₇₁ And c ₇₂ Convolution kernel weights, b, for the first and second 7*7, respectively ₇₁ And b ₇₂ C respectively ₇₁ And c ₇₂ Is set in the above-mentioned state,

m is the result of the concat operation, cat () represents the concat operation, Q is the result of dimension reduction, c _f Convolution kernel weight for 1*1, b _f Convolutionally offset for 1*1.

The multi-dimensional attention module (Multidimensional attention, MA). And 1*1 convolution operation is carried out on the dimension reduction result, the operation result is input into a Softmax function and multiplied by the attention weight coefficients of the width dimension, the height dimension and the channel dimension respectively, the sum of the three products is multiplied by the dimension reduction result, and then the three products are added with the dimension reduction result, so that the three-dimensional attention characteristic is obtained.

The three-dimensional attention branches are constructed, the attention degree of the network to the insulator is improved through the attention mechanisms of three dimensions of the width, the height and the channel, more effective defect characteristics are obtained, meanwhile, the weights of the three dimensions are dynamically adjusted, and the sensitivity of the whole network to defect differences is improved. The three-dimensional attention module structure is shown in fig. 3.

The input feature map is subjected to 1*1 convolution operation, and then is input into Softmax to be multiplied by the attention weight coefficients of three dimensions of width, height and channel respectively. The weight coefficient is dynamically adjusted, when the network is in a shallow layer network, the specific gravity of the wide and high is increased, and the specific gravity of the channel is reduced, so that the network is more focused on the wide and high characteristics of the image. When in a deep network, as the number of channels increases, the proportion of channels increases correspondingly, the attention of the network to wide and high features decreases, and the attention to the channel features increases. After the operation, all the attention is added and multiplied by the original input image to obtain the three-dimensional attention characteristic, and finally the addition operation is carried out on the three-dimensional attention characteristic and the original image to further fuse the attention characteristic.

The operation formula of the multidimensional attention module is as follows:

The full-connection module comprises two full-connection layers, wherein the two full-connection layers are used for carrying out nonlinear adjustment on the three-dimensional attention characteristic radix-Nylon bin to obtain an adjustment result;

Step four: and inputting a training set to obtain the neural network parameters. An early-stop algorithm is used to optimize the training of the model. Updating model parameters by adopting an adaptive matrix estimation optimizer; the loss function employs cross entropy loss.

The classifier loss function of the MCI-CNN is obtained by cross entropy calculation of the output vector of the Softmax and the actual label of the sample. The calculation formula of the loss function is as follows:

wherein Loss is Loss, y' _f The f-th value, y, of the actual label of the sample _f Is Softmax outputs the f-th value of the vector. In the back propagation stage, the Adam algorithm is used for optimizing the parameters, and the learning rate of each parameter is dynamically adjusted by utilizing the first moment estimation and the second moment estimation of the parameters, so that the weight is updated to obtain the optimal solution.

Step five: a network model is deployed, a test set is input, fault classification is determined, and diagnosis accuracy is evaluated.

Step six: and saving the network model.

Step seven: collecting one-dimensional vibration signals of a motor to be diagnosed, decomposing the one-dimensional vibration signals into a plurality of PF components by adopting a local mean decomposition method, converting each PF component into an MxN two-dimensional matrix, connecting the obtained two-dimensional matrix with two-dimensional data of the one-dimensional vibration signals in channel dimensions to obtain a three-dimensional input matrix with multiple channels, and inputting the three-dimensional input matrix into a trained multi-channel deep convolutional neural network to perform fault classification and identification to obtain a diagnosis result.

The experimental parameter settings of this embodiment are shown in table 2. An early-stop algorithm is used to optimize the training of the model. And updating model parameters by adopting an adaptive moment estimation optimizer. The loss function employs cross entropy loss.

Table 2 experimental parameter settings

The loss and accuracy of the observation network training set samples and the test set samples are changed with the iteration number as shown in fig. 4. The test set sample loss increases sharply in the first 5 rounds of training, but the training set sample loss and the test set sample loss decrease gradually along with the increase of the iteration times Epoch, which indicates that the model is continuously converged and is continuously approaching to 0. After the iteration times are 20, the training set loss and the test set loss basically coincide, no large fluctuation exists, the stability is high, and the fitting problem does not occur. After model training is finished, each type of faults is verified, the result is shown in table 3, the accuracy of the network in identifying each type of faults is over 99.9%, and the model has high identification accuracy.

TABLE 3 results of various fault identifications

In order to further verify the effectiveness of each module of the MCI-CNN network on the whole network, 4 groups of ablation experiments are carried out on the whole network so as to verify the performances of MW, MF and MA modules, and the MA, MF and MF are respectively removed to form NO-MA+CNN, NO-MF+CNN and NO-MW+CNN networks based on the MCI-CNN network.

Table 4 shows the results of the ablation experiment training. The NO-MA+CNN achieves 99.03% accuracy in 30 rounds of training period, which is 0.89% lower than MCI-CNN because the network lacks attention to fault features in the feature extraction stage, resulting in reduced network accuracy. NO-mf+cnn achieves 99.56% lower accuracy than MCI-CNN by 0.36% due to the lack of multi-scale feature extraction capability of the network during the feature extraction stage. The accuracy of NO-MW+CNN is 99.76%, which is reduced by 0.16% compared with MCI-CNN, because the vibration signal input by a single channel cannot fully express the fault characteristics of the motor. The MCI-CNN obtains 99.92% accuracy in the iteration period, and further illustrates that MW, MF and MA modules have promotion effect on the improvement of the overall network performance.

Table 4 ablation experimental results

In order to further verify the performance and error Back Propagation (BP) of the MCI-CNN, a support vector machine (Support Vector Machine, SVM), long and short time memories (Long Short Term Memory, LSTM), stack denoising self-coding (Stacked Denoised Autoencoder, SDAE), one-dimensional convolutional neural networks (1D-CNN) and multi-channel fusion convolutional neural networks (multi-channel-CNN), the MCI-CNN network has the highest accuracy rate compared with other 6 networks, has better recognition precision on fault signals, and has the accuracy rate which is 3.16%, 2.14%, 1.69%, 1.03%, 1.29% and 0.46% higher than that of the 6 networks respectively, and further proves the effectiveness of the method.

According to the invention, a deep convolutional neural network data processing technology is introduced into the field of motor vibration fault diagnosis, and the accuracy of fault diagnosis is improved. According to the invention, self-adaptive time-frequency analysis is carried out on an original vibration signal by adopting local mean decomposition, the original vibration signal is converted into multi-dimensional data, the multi-dimensional data input by a multi-channel is extracted by adopting a convolutional neural network and is weighted and fused, meanwhile, a multi-feature extraction module is constructed to avoid important semantic information loss, the attention of the network to fault signals is improved by utilizing a multi-dimensional attention mechanism, the fault diagnosis of the motor vibration signal is realized, and the fault diagnosis recognition precision of the motor is improved.

Although the invention herein has been described with reference to particular embodiments, it is to be understood that these embodiments are merely illustrative of the principles and applications of the present invention. It is therefore to be understood that numerous modifications may be made to the illustrative embodiments and that other arrangements may be devised without departing from the spirit and scope of the present invention as defined by the appended claims. It should be understood that the different dependent claims and the features described herein may be combined in ways other than as described in the original claims. It is also to be understood that features described in connection with separate embodiments may be used in other described embodiments.

Claims

1. A motor vibration fault diagnosis method based on a multichannel fusion convolutional neural network is characterized in that,

the Softmax module is used for carrying out Softmax operation on the adjustment result to obtain a diagnosis result;

in the multi-channel weighted fusion module, after a plurality of feature extraction channels perform feature extraction on a three-dimensional input matrix, the output obtained by each channel is as follows:

wherein,for the output of the first convolution layer in the jth channel,/>For the ith input in the ith convolutional layer of the first-1 layer in the jth channel,/th channel>A convolution kernel of a first-1 layer convolution layer in a jth channel, wherein k is the total number of inputs;

in the multi-channel weighted fusion module, the characteristics output by the same convolution layer of different channels are weighted and fused by the following formula:

wherein X is ^l For the output of the first convolution layer, f _l-1 []Representing the activation function of the layer 1 convolutional layer,weights for the first-1 layer convolution layer in the jth lane, +.>For the offset vector of the first-1 layer convolution layer in the jth lane, +.>Z is the total number of channels, which is the output of the first-1 layer convolution layer in the jth channel;

the operation formula of the multi-scale feature extraction module is as follows:

m ₁ ＝c ₃₂ (c ₃₁ m+b ₃₁ )+b ₃₂ ，

m ₂ ＝c ₇₂ (c ₇₁ m+b ₇₁ )+b ₇₂ ，

M＝Cat(m ₁ ,m ₂ ,m)，

Q＝c _f M+b _f ，

wherein m is a fusion result obtained by the multi-channel weighted fusion module, and m is ₁ And m ₂ Extracting branch extracted features 3*3 and 7*7 respectively, c ₃₁ And c ₃₂ B is the convolution kernel weight of the first and second 3*3 ₃₁ And b ₃₂ C respectively ₃₁ And c ₃₂ Offset of c ₇₁ And c ₇₂ Convolution kernel weights, b, for the first and second 7*7, respectively ₇₁ And b ₇₂ C respectively ₇₁ And c ₇₂ M is the result of the concat operation, cat () represents the concat operation, Q is the result of dimension reduction, c _f Convolution kernel weight for 1*1, b _f Convolutionally offset 1*1;

the operation formula of the multidimensional attention module is as follows:

wherein X is _out For the three-dimensional attention feature output by the multi-dimensional attention module, X is the input of the multi-dimensional attention module, softmax () represents Softmax function, f () represents convolution operation, C, H, W represents width, height, channel of X and has i=c, H, W, Z, respectively _i Is characteristic of i, R _i Sum [ for the weight of the feature of i]Representing an addition operation.

2. The motor vibration fault diagnosis method based on the multichannel fusion convolutional neural network according to claim 1, wherein the training process of the multichannel deep convolutional neural network is as follows:

3. The motor vibration fault diagnosis method based on the multichannel fusion convolutional neural network according to claim 2, wherein the historical one-dimensional vibration signals are amplified by adopting an overlapped sampling mode.

4. The motor vibration fault diagnosis method based on the multichannel fusion convolutional neural network according to claim 3, wherein the number of amplified historical one-dimensional vibration signals is Y, and the method comprises the following steps:

wherein L is the length of the historical one-dimensional vibration signal, n is the sampling step length,representing a rounding down.

5. The motor vibration fault diagnosis method based on the multichannel fusion convolutional neural network according to claim 2, wherein the ratio of the number of samples in the training set to the number of samples in the test set is 7:3.

6. The motor vibration fault diagnosis method based on the multichannel fusion convolutional neural network according to claim 2, wherein in the training process of the multichannel deep convolutional neural network, a loss function is obtained by performing cross entropy calculation on an output vector of Softmax and an actual label of a sample, and a calculation formula of the loss function is as follows: