CN114372492A

CN114372492A - An interpretable fault diagnosis method for rolling bearings

Info

Publication number: CN114372492A
Application number: CN202111676747.5A
Authority: CN
Inventors: 刘之航; 丁康; 何国林; 蒋飞
Original assignee: South China University of Technology SCUT
Current assignee: South China University of Technology SCUT
Priority date: 2021-12-31
Filing date: 2021-12-31
Publication date: 2022-04-19
Anticipated expiration: 2041-12-31
Also published as: CN114372492B

Abstract

The invention discloses an interpretable rolling bearing fault diagnosis method, which comprises the following steps: collecting one-dimensional time sequence signals of the rolling bearing, expanding samples, establishing an initial 1D-CNN-BilSTM neural network model, adding a Grad-CAM + + interpretation layer to the neural network model, and establishing the neural network model with convolution interpretation capability. And training the neural network model by using various one-dimensional fault data to obtain a model with fault diagnosis capability, and performing fault diagnosis on the rolling bearing through the fault diagnosis model. The invention explains the characteristic extraction process of the neural network with CNN as the basic structure, adds the BilSTM, utilizes the characteristic of bidirectional analysis capability, realizes better diagnosis precision, and improves the noise immunity and robustness of the fault diagnosis neural network model.

Description

An interpretable fault diagnosis method for rolling bearings

技术领域technical field

本发明属于旋转机械故障诊断和信号处理领域，具体涉及一种可解释性滚动轴承故障诊断方法。The invention belongs to the field of fault diagnosis and signal processing of rotating machinery, and particularly relates to an interpretable rolling bearing fault diagnosis method.

背景技术Background technique

滚动轴承广泛应用于各个行业中，如车辆，航空航天。滚动轴承的健康是整个设备平稳运行的基础保障,因而对滚动轴承运行状态监测和综合评价具有显著的意义。滚动轴承极易发生疲劳损伤和性能衰退，并影响整个系统的安全性和可靠性，研究故障诊断与健康管理具有重要的理论研究意义和工程应用价值。Rolling bearings are widely used in various industries, such as vehicles, aerospace. The health of the rolling bearing is the basic guarantee for the smooth operation of the whole equipment, so it is of great significance to monitor and comprehensively evaluate the running state of the rolling bearing. Rolling bearings are prone to fatigue damage and performance degradation, and affect the safety and reliability of the entire system. Research on fault diagnosis and health management has important theoretical research significance and engineering application value.

传统的故障诊断方法存在两个问题，其一是需要较多的机械工程及统计学相关的专家知识，其二是随着滚动轴承的运行工况越来越复杂，传统的故障诊断方法通常表现得不是很理想，并且不具备自适应性，因此需要寻找基于特征学习的方法进行故障特征提取和诊断研究。There are two problems in the traditional fault diagnosis method. One is that it requires a lot of expert knowledge related to mechanical engineering and statistics. It is not ideal and does not have self-adaptation, so it is necessary to find a method based on feature learning for fault feature extraction and diagnosis research.

随着人工智能和深度学习的不断发展，卷积神经网络凭借其共享卷积核，对高维数据处理无压力，不需要太对先验知识和自动提取特征等优点，已在很多领域中得到广泛应用。但大多数CNN都不具有可解释性，导致该过程常被诟病为“黑箱”。此外，当噪声较大时，常规CNN的诊断表现并不令人满意，所以提出一种具有良好抗噪性的可解释性轴承故障诊断方法是极其重要的。With the continuous development of artificial intelligence and deep learning, the convolutional neural network, with its shared convolution kernel, has no pressure on high-dimensional data processing, does not require too much prior knowledge, and automatically extracts features and other advantages, has been obtained in many fields. widely used. But most CNNs are not interpretable, resulting in the process often criticized as a "black box". In addition, the diagnostic performance of conventional CNNs is not satisfactory when the noise is large, so it is extremely important to propose an interpretable bearing fault diagnosis method with good noise immunity.

现有技术一种基于卷积神经网络的轴承检测方法(CN201910985498.4)中的不足如下：The deficiencies in the prior art bearing detection method based on convolutional neural network (CN201910985498.4) are as follows:

1、其使用的Grad-CAM的推导过程中通过特征图的大小Z来估计权重，所以权重大小与特征图的大小关系很大，会影响到结果。此外，本发明方法对输入图像中多次出现的同一类，则无法很好地进行激活。1. In the derivation process of Grad-CAM used, the weight is estimated by the size Z of the feature map, so the size of the weight has a great relationship with the size of the feature map, which will affect the result. In addition, the method of the present invention cannot activate the same class that appears multiple times in the input image.

2、其抗噪性效果未知。2. Its anti-noise effect is unknown.

发明内容SUMMARY OF THE INVENTION

本发明的目的是针对现有技术的不足之处，提出一种可解释性滚动轴承故障诊断方法。旨在解决传统故障诊断方法存在的需要较多先验知识、CNN过程常被诟病为黑箱和普通CNN抗噪性差等技术问题。The purpose of the present invention is to propose an interpretable rolling bearing fault diagnosis method aiming at the deficiencies of the prior art. It aims to solve the technical problems of traditional fault diagnosis methods that require more prior knowledge, CNN process is often criticized as black box and poor noise resistance of ordinary CNN.

本发明至少通过如下技术方案之一实现。The present invention is realized by at least one of the following technical solutions.

一种可解释性滚动轴承故障诊断方法，包括以下步骤：An interpretable rolling bearing fault diagnosis method, comprising the following steps:

S1、采集滚动轴承的一维时间序列信号并进行样本扩充；S1. Collect the one-dimensional time series signal of the rolling bearing and expand the sample;

S2、建立初始1D-CNN-BiLSTM神经网络模型；S2. Establish an initial 1D-CNN-BiLSTM neural network model;

S3、向所述神经网络模型添加解释层；S3, adding an explanation layer to the neural network model;

S4、建立具有卷积解释能力的神经网络模型；S4. Establish a neural network model with convolution interpretation capability;

S5、利用多种一维故障数据训练所述神经网络模型，获得具有故障诊断能力的模型；S5, using a variety of one-dimensional fault data to train the neural network model to obtain a model with fault diagnosis capability;

S6、通过所述故障诊断模型对所述滚动轴承进行故障诊断。S6. Perform fault diagnosis on the rolling bearing by using the fault diagnosis model.

进一步地，步骤S1包括以下步骤：Further, step S1 includes the following steps:

S11、收集加速度传感器所收集到的正常、外圈故障、内圈故障和滚动体故障四种类别的一维时间信号；S11. Collect four types of one-dimensional time signals of normal, outer ring fault, inner ring fault and rolling element fault collected by the acceleration sensor;

S12、将所述一维时间信号信号利用重叠采样的方式进行扩充，并将数据样本划分为训练集和测试集；S12, the one-dimensional time signal signal is expanded by means of overlapping sampling, and the data samples are divided into a training set and a test set;

S13、将所述数据样本采用K折交叉验证的方式多次划分出不同的训练集和测试集，最后取平均结果以提高模型的鲁棒性。S13. Divide the data sample into different training sets and test sets multiple times by means of K-fold cross-validation, and finally average the results to improve the robustness of the model.

进一步地，所述初始1D-CNN-BiLSTM神经网络模型的结构包括输入层(input)、卷积神经网络层(CNN)、BiLSTM层、全连接层(Fully Connected)以及输出层(output)：Further, the structure of the initial 1D-CNN-BiLSTM neural network model includes an input layer (input), a convolutional neural network layer (CNN), a BiLSTM layer, a fully connected layer (Fully Connected) and an output layer (output):

CNN层包括卷积层(ConV)、池化层(Max Pooling)，每个卷积层的激活函数都为Relu函数；The CNN layer includes a convolution layer (ConV) and a pooling layer (Max Pooling), and the activation function of each convolution layer is a Relu function;

所述池化层的池化方法为最大池化；The pooling method of the pooling layer is maximum pooling;

所述输出层的激活函数为Softmax函数。The activation function of the output layer is a Softmax function.

进一步地，所述BiLSTM层包括双向LSTM层，所述LSTM层的组成具体为：Further, the BiLSTM layer includes a bidirectional LSTM layer, and the composition of the LSTM layer is specifically:

f_t＝σ(W_f[h_t-1,x_t]+b_f)f _t =σ(W _f [h _t-1 ,x _t ]+b _f )

i_t＝σ(W_i[h_t-1,x_t]+b_i)i _t =σ(W _i [h _t-1 ,x _t ]+b _i )

o_t＝σ(W_o[h_t-1,x_t]+b_o)o _t =σ(W _o [h _t-1 ,x _t ]+b _o )

c_t＝f_tc_t-1+i_ttanh(W_c[h_t-1,x_t]+b_c)c _t =f _t c _t-1 +i _t tanh(W _c [h _t-1 ,x _t ]+b _c )

h_t＝o_ttanhc_t h _t =o _t tanhc _t

式中，f_t为遗忘门，i_t为输入门，o_t为输出门，c_t为细胞状态，σ为sigmoid激活函数，W_f、W_i、W_o、W_c为训练参数中的权重，b_f、b_i、b_o、b_c为训练参数中的偏置项，h_t为当前时刻隐藏层的状态，h_t-1为上一时刻隐藏层的状态，x_t为当前时刻的输入。where f _t is the forgetting gate, it is the input gate, _{o t is the output gate, c t is the cell state, σ is the sigmoid activation function, and W f , Wi , W o} _, _and _W _c _are _the weights in the training parameters , b _f , b _i , b _o , b _c are the bias items in the training parameters, h _t is the state of the hidden layer at the current moment, h _t-1 is the state of the hidden layer at the previous moment, and x _t is the current moment’s state of the hidden layer. enter.

进一步地，所述Relu函数为：Further, the Relu function is:

式中，f(x)是Relu函数，x是上一层输入的向量。In the formula, f(x) is the Relu function, and x is the input vector of the previous layer.

进一步地，所述Softmax激活函数为：Further, the Softmax activation function is:

式中，z_i+1是Z中第i+1个点输出的值，C是输出节点的个数，z_c是Z中第c个点输出的值，e是自然常数。In the formula, z _i+1 is the output value of the i+1th point in Z, _C is the number of output nodes, zc is the output value of the cth point in Z, and e is a natural constant.

进一步地，所述解释层为Grad-CAM++。Further, the interpretation layer is Grad-CAM++.

进一步地，步骤S3包括以下步骤：Further, step S3 includes the following steps:

S31、计算1D-CNN-BiLSTM神经网络模型最后一个卷积层之后各通道的各类别的权重：S31. Calculate the weights of each channel after the last convolutional layer of the 1D-CNN-BiLSTM neural network model:

式中，Y^x为类别x的得分值即重要程度，

为第k个特征图对于类别x的权重，

为第k个特征图的(i,j)处的像素值，

为第k个特征图的(i,j)处对于类别x的权重；In the formula, Y ^x is the score value of category x, that is, the importance degree,

is the weight of the k-th feature map for the category x,

is the pixel value at (i, j) of the k-th feature map,

is the weight for category x at (i, j) of the k-th feature map;

S32、根据所述的权重生成类激活图：S32. Generate a class activation map according to the weight:

S33、将所述的类激活图的尺寸变换到原输入图像的尺寸大小，再通过热力图覆盖到原图上，热力图值的大小对应1D-CNN-BiLSTM神经网络模型对输入信号的激活程度。S33. Transform the size of the class activation map to the size of the original input image, and then overlay it on the original image through a heatmap. The size of the heatmap value corresponds to the activation degree of the 1D-CNN-BiLSTM neural network model to the input signal. .

进一步地，步骤S4的具体步骤为：Further, the specific steps of step S4 are:

S41、在1D-CNN-BiLSTM神经网络模型后连接解释层和BiLSTM层；S41. Connect the explanation layer and the BiLSTM layer after the 1D-CNN-BiLSTM neural network model;

S42、将解释层作为整个模型的中间层，对卷积过程进行解释；S42. Use the interpretation layer as the middle layer of the entire model to explain the convolution process;

S43、将所述BiLSTM层作为整个模型的后端，将CNN最后一个卷积层的输出作为后端的输入，进行故障诊断。S43. Use the BiLSTM layer as the back end of the entire model, and use the output of the last convolutional layer of the CNN as the input of the back end to perform fault diagnosis.

进一步地，步骤S5的具体步骤为：Further, the specific steps of step S5 are:

S51、将步骤S1收集到的样本进行归类，共有四种类别，分别是正常、外圈故障、内圈故障和滚动体故障；S51, classify the samples collected in step S1 into four categories, namely normal, outer ring fault, inner ring fault and rolling element fault;

S52、将所述故障样本划分为训练集和测试集，利用K折交叉验证方法进行训练和测试；S52, dividing the fault sample into a training set and a test set, and using the K-fold cross-validation method for training and testing;

S53、将所述训练集作为具有卷积解释能力的神经网络模型的输入，进行模型训练，获得具有故障诊断能力的模型。S53. Use the training set as an input of a neural network model with convolution interpretation capability, and perform model training to obtain a model with fault diagnosis capability.

本发明与现有技术相比，具有如下的优点及效果：Compared with the prior art, the present invention has the following advantages and effects:

(1)具有可解释性和通用性。在仿真分析和实验验证中，利用Grad-CAM++证明了卷积神经网络特征提取的合理性和可解释性。本发明方法不仅能应用于二维图片，也可以应用在旋转机械故障等一维时间信号，拓展类激活映射图的适用范围，具有通用性。(1) It is interpretable and universal. In simulation analysis and experimental verification, Grad-CAM++ is used to prove the rationality and interpretability of feature extraction of convolutional neural network. The method of the invention can not only be applied to two-dimensional pictures, but also can be applied to one-dimensional time signals such as rotating machinery faults, so as to expand the scope of application of the class activation map and has universality.

(2)诊断精度和分类结果好。采用凯斯西储大学的滚动轴承故障数据集进行实验，诊断结果对于相同类别的聚类效果和对于不同类别的分类效果均表现优异。(2) The diagnostic accuracy and classification results are good. Using the rolling bearing fault data set of Case Western Reserve University to conduct experiments, the diagnosis results have excellent clustering effects for the same category and classification effects for different categories.

(3)抗噪性和鲁棒性好，本发明所述模型的诊断精度更高，并且当噪声逐渐降低时，其诊断精度收敛得更快。(3) Good anti-noise and robustness, the model of the present invention has higher diagnostic accuracy, and when the noise gradually decreases, its diagnostic accuracy converges faster.

附图说明Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案，下面将对实施例中所需要使用的附图作简单地介绍。附图构成本申请的一部分，但仅是作为体现发明概念的非限制性示例，并非用于做出任何限制。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the accompanying drawings required in the embodiments will be briefly introduced below. The accompanying drawings form a part of this application, but only serve as non-limiting examples embodying the inventive concept and are not intended to be limiting in any way.

图1是本发明方法实施的流程图；Fig. 1 is the flow chart that the method of the present invention is implemented;

图2是本发明方法中神经网络架构图；Fig. 2 is a neural network architecture diagram in the method of the present invention;

图3a是1797r/min转速下滚动轴承实验外圈故障的Grad-CAM++图；Figure 3a is the Grad-CAM++ diagram of the outer ring failure of the rolling bearing experiment at a rotational speed of 1797r/min;

图3b是1797r/min转速下滚动轴承实验内圈故障的Grad-CAM++图；Figure 3b is the Grad-CAM++ diagram of the inner ring failure of the rolling bearing experiment at a rotational speed of 1797r/min;

图3c是1797r/min转速下滚动轴承实验滚动体故障的Grad-CAM++图；Figure 3c is the Grad-CAM++ diagram of the rolling element failure of the rolling bearing experiment at a rotational speed of 1797r/min;

图3d是1797r/min转速下滚动轴承实验正常状态的Grad-CAM++图；Figure 3d is the Grad-CAM++ diagram of the normal state of the rolling bearing experiment at 1797r/min;

图4是本发明方法中的在1797r/min转速下故障诊断结果t-SNE分析图；Fig. 4 is the fault diagnosis result t-SNE analysis diagram under the rotating speed of 1797r/min in the method of the present invention;

图5是本发明方法中的在1797r/min转速下故障诊断结果混淆矩阵图；Fig. 5 is the confusion matrix diagram of the fault diagnosis result under the rotating speed of 1797r/min in the method of the present invention;

图6a是1772r/min转速下滚动轴承实验外圈故障的Grad-CAM++图；Figure 6a is the Grad-CAM++ diagram of the outer ring failure of the rolling bearing experiment at a rotational speed of 1772r/min;

图6b是1772r/min转速下滚动轴承实验内圈故障的Grad-CAM++图；Figure 6b is the Grad-CAM++ diagram of the inner ring failure of the rolling bearing experiment at a rotational speed of 1772r/min;

图6c是1772r/min转速下滚动轴承实验滚动体故障的Grad-CAM++图；Figure 6c is the Grad-CAM++ diagram of the rolling element failure of the rolling bearing experiment at a rotational speed of 1772r/min;

图6d是1772r/min转速下滚动轴承实验正常状态的Grad-CAM++图；Figure 6d is the Grad-CAM++ diagram of the normal state of the rolling bearing experiment at a rotational speed of 1772r/min;

图7是本发明方法中的在1772r/min转速下故障诊断结果t-SNE分析图；Fig. 7 is the fault diagnosis result t-SNE analysis diagram under the rotating speed of 1772r/min in the method of the present invention;

图8是本发明方法中的在1772r/min转速下故障诊断结果混淆矩阵图；Fig. 8 is the confusion matrix diagram of the fault diagnosis result under the rotating speed of 1772r/min in the method of the present invention;

图9a是1750r/min转速下滚动轴承实验外圈故障的Grad-CAM++图；Figure 9a is the Grad-CAM++ diagram of the outer ring failure of the rolling bearing experiment at a rotational speed of 1750r/min;

图9b是1750r/min转速下滚动轴承实验内圈故障的Grad-CAM++图；Figure 9b is the Grad-CAM++ diagram of the inner ring failure of the rolling bearing experiment at a rotational speed of 1750r/min;

图9c是1750r/min转速下滚动轴承实验滚动体故障的Grad-CAM++图；Figure 9c is the Grad-CAM++ diagram of the rolling element failure of the rolling bearing experiment at a rotational speed of 1750 r/min;

图9d是1750r/min转速下滚动轴承实验正常状态的Grad-CAM++图；Figure 9d is the Grad-CAM++ diagram of the normal state of the rolling bearing experiment at a rotational speed of 1750r/min;

图10是本发明方法中的在1750r/min转速下故障诊断结果t-SNE分析图；Fig. 10 is the t-SNE analysis diagram of the fault diagnosis result under the rotating speed of 1750r/min in the method of the present invention;

图11是本发明方法中的在1750r/min转速下故障诊断结果混淆矩阵图；Fig. 11 is the confusion matrix diagram of the fault diagnosis result under the rotating speed of 1750r/min in the method of the present invention;

图12是本发明方法中不同信噪比下各种网络模型的诊断精度图。Fig. 12 is a diagram showing the diagnostic accuracy of various network models under different signal-to-noise ratios in the method of the present invention.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚，下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments These are some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

在本发明的描述中，“多个”的含义是两个或两个以上，除非另有明确具体的限定。In the description of the present invention, "plurality" means two or more, unless otherwise expressly and specifically defined.

本发明提供了一种可解释性滚动轴承故障诊断方法，以下分别进行详细说明。The present invention provides an interpretable rolling bearing fault diagnosis method, which will be described in detail below.

如图1所示，为本发明实施例提供的基于Grad-CAM++和CNN-BiLSTM的可解释性滚动轴承故障诊断方法的一个实施例流程示意图，包括下述步骤：As shown in FIG. 1 , a schematic flowchart of an embodiment of an interpretable rolling bearing fault diagnosis method based on Grad-CAM++ and CNN-BiLSTM provided by an embodiment of the present invention includes the following steps:

S3、向该神经网络模型添加Grad-CAM++解释层；S3. Add a Grad-CAM++ interpretation layer to the neural network model;

数据具体为轴承外圈故障、内圈故障、滚动体故障和正常数据。将所述故障样本划分为训练集和测试集，利用K折交叉验证方法进行训练和测试，取K＝10，每次取其中9份进行训练，剩余1份进行测试，将所述训练集作为1D-CNN-BiLSTM神经网络模型的输入，每个样本维度为2048×1，进行模型训练，获得具有故障诊断能力的模型。The data are specifically bearing outer ring failure, inner ring failure, rolling element failure and normal data. The fault samples are divided into a training set and a test set, and the K-fold cross-validation method is used for training and testing, taking K=10, taking 9 of them for training each time, and the remaining 1 for testing, using the training set as The input of 1D-CNN-BiLSTM neural network model, the dimension of each sample is 2048×1, and the model is trained to obtain a model with fault diagnosis ability.

本发明实施例提供的基于Grad-CAM++和CNN-BiLSTM的可解释性滚动轴承故障诊断方法，通过基于Grad-CAM++算法对卷积神经网络的卷积结果以可视化热力图的形式很好地进行了解释性说明，通过CNN局部特征提取上的优势和BiLSTM能很好处理非线性时间序列的天然特点，提高诊断结果的可靠性，且能减少训练网络时的参数，从而提升训练速度。进一步将其与具有相同结构和参数的1D-CNN、1D-CNN-SVM和1D-CNN-LSTM作为本发明所述模型的对比，验证所述的1D-CNN-BiLSTM可解释神经网络的优秀抗噪性。The interpretable rolling bearing fault diagnosis method based on Grad-CAM++ and CNN-BiLSTM provided by the embodiment of the present invention, the convolution result of the convolutional neural network based on the Grad-CAM++ algorithm is well explained in the form of a visual heat map This shows that the advantages of CNN in local feature extraction and BiLSTM can well handle the natural characteristics of nonlinear time series, improve the reliability of diagnosis results, and reduce the parameters when training the network, thereby improving the training speed. It is further compared with 1D-CNN, 1D-CNN-SVM and 1D-CNN-LSTM with the same structure and parameters as the model described in the present invention, and it is verified that the described 1D-CNN-BiLSTM can explain the excellent resistance of neural network. noise.

在本发明的一些实施例，如图1所示，S1包括：In some embodiments of the present invention, as shown in FIG. 1 , S1 includes:

S11、收集加速度传感器所收集到的正常、外圈故障、内圈故障和滚动体故障四种类别的一维时间信号，这里以凯斯西储大学滚动轴承故障数据集为例，进行三次不同转速下的实验，分别为1797r/min、1772r/min、1750r/min；S11. Collect the one-dimensional time signals of four categories of normal, outer ring fault, inner ring fault and rolling element fault collected by the acceleration sensor. Here, taking the rolling bearing fault data set of Case Western Reserve University as an example, carry out three different rotation speeds. The experiments were 1797r/min, 1772r/min, 1750r/min;

在本发明的一些实施例，如图2所示，所述初始1D-CNN-BiLSTM神经网络模型的结构中，包括输入层(input)、两层卷积层(ConV)、两层池化层(Max Pooling)、BiLSTM层、全连接层(Fully Connected)以及输出层(output)：In some embodiments of the present invention, as shown in FIG. 2 , the structure of the initial 1D-CNN-BiLSTM neural network model includes an input layer (input), two layers of convolution layers (ConV), and two layers of pooling layers (Max Pooling), BiLSTM layer, fully connected layer (Fully Connected) and output layer (output):

卷积层的第一个卷积层具有16个4×1维度的卷积核，第二个卷积核具有32个2×1维度的卷积核，每个卷积层的激活函数都为Relu函数；The first convolutional layer of the convolutional layer has 16 convolution kernels of 4×1 dimension, the second convolutional kernel has 32 convolutional kernels of 2×1 dimension, and the activation function of each convolutional layer is Relu function;

具体地，Relu函数为：Specifically, the Relu function is:

池化层的池化方法为最大池化，第一个池化层的步长为4，第二个池化层的步长为2；The pooling method of the pooling layer is maximum pooling, the step size of the first pooling layer is 4, and the step size of the second pooling layer is 2;

第一层全连接层的神经元个数为64个，第二层全连接层的神经元个数为4个，设置dropout丢弃率为0.5，防止过拟合。The number of neurons in the first fully connected layer is 64, the number of neurons in the second fully connected layer is 4, and the dropout dropout rate is set to 0.5 to prevent overfitting.

输出层的神经元个数与故障类型数量一致，激活函数为Softmax函数。The number of neurons in the output layer is consistent with the number of fault types, and the activation function is the Softmax function.

具体地，Softmax函数为：Specifically, the Softmax function is:

进一步地，在本发明的一些实施例中，所述向该神经网络模型添加Grad-CAM++解释层，获得具有卷积解释能力的模型，其结果如图3所示，S3包括：Further, in some embodiments of the present invention, the Grad-CAM++ interpretation layer is added to the neural network model to obtain a model with convolution interpretation capability. The result is shown in Figure 3, and S3 includes:

S31、计算卷积层后各通道对各类别的权重，如下所示：S31. Calculate the weights of each channel for each category after the convolutional layer, as shown below:

式中，Y^x为类别x的得分值即重要程度，

为第k个特征图对于类别x的权重，

为第k个特征图的(i,j)处的像素值，

为第k个特征图的(i,j)处对于类别x的权重。In the formula, Y ^x is the score value of category x, that is, the importance degree,

is the weight of the k-th feature map for the category x,

is the pixel value at (i, j) of the k-th feature map,

is the weight for category x at (i, j) of the k-th feature map.

S32、所述的生成的类激活图的计算方式如下所示：S32. The calculation method of the generated class activation graph is as follows:

S33、将所述的类激活图需要将尺寸变换到原输入图像的尺寸大小，再通过热力图覆盖到原图上，热力图的值对应着卷积神经网络对此位置的激活程度。S33, the size of the class activation map needs to be transformed to the size of the original input image, and then overlaid on the original image through a heat map, the value of the heat map corresponds to the activation degree of the convolutional neural network at this position.

选取三种实施例进行说明。实施例1为转速等于1797r/min时的情况，实施例2为转速等于1772r/min时的情况，实施例3为转速等于1750r/min时的情况，现将三种实施例一起分析如下。Three embodiments are selected for description. Example 1 is the situation when the rotating speed is equal to 1797r/min, Example 2 is the situation when the rotating speed is equal to 1772r/min, and Example 3 is the situation when the rotating speed is equal to 1750r/min. Now the three embodiments are analyzed together as follows.

滚动轴承外圈故障的轴承如图3a、图6a、图9a所示，该1D-CNN-BiLSTM神经网络对于该输入信号的冲击部分有较大的激活程度，而神经网络的卷积层在此实验中负责故障特征的提取，故说明该网络对此冲击部分的权重较大，更为关注。The bearing with the fault of the outer ring of the rolling bearing is shown in Figure 3a, Figure 6a, and Figure 9a. The 1D-CNN-BiLSTM neural network has a greater degree of activation for the impact part of the input signal, and the convolution layer of the neural network is used in this experiment. It is responsible for the extraction of fault features in the network, so it shows that the network has a larger weight on the impact part and pays more attention to it.

内圈故障的轴承如图3b、图6b、图9b所示，与外圈故障的情况类似，该神经网络对于该输入信号的冲击部分同样有较大的激活程度，说明网络对此部分的权重更大。The bearing with inner ring fault is shown in Figure 3b, Figure 6b, and Figure 9b. Similar to the case of the outer ring fault, the neural network also has a greater degree of activation for the impact part of the input signal, indicating the weight of the network on this part. bigger.

滚动体故障的轴承如图3c、图6c、图9c所示，与内外圈故障不同，滚动体故障样本的时域信号冲击部分分布不均匀，没有特别明显的冲击区域，但与正常轴承的信号图3d、图6d、图9d相比，仍表现出神经网络对此类信号进行了有选择的激活，该激活区域权重更大。The bearing with rolling element fault is shown in Figure 3c, Figure 6c, and Figure 9c. Different from the inner and outer ring faults, the time-domain signal impact part of the rolling element fault samples is unevenly distributed, and there is no particularly obvious impact area, but the signal of the normal bearing is different. Compared with Figure 3d, Figure 6d, and Figure 9d, it still shows that the neural network has selectively activated such signals, and the activation area has a greater weight.

正常轴承的时域信号是均匀分布的，如图3d、图6d、图9d所示，同样地，其Grad-CAM++图的激活部分也是分散在各个区域，即各部分的权重相近，故能够表明正常轴承的状态。The time domain signals of normal bearings are evenly distributed, as shown in Figure 3d, Figure 6d, and Figure 9d. Similarly, the activation part of the Grad-CAM++ map is also scattered in each area, that is, the weights of each part are similar, so it can be shown that Normal bearing condition.

具体地，将辅助的可视化方法图4、图7和图10对故障诊断结果进行分析，故障诊断准确率均超过98.5％，可以明显看出本发明方法的优越性。图5、图8和图11为t-SNE降维可视化图，以图5为例，在类别3中，有个别样本被分类在类别2和类别4，在类别4中，有少量样本被分类在类别2中，故诊断结果没有达到100％。但根据降维可视化的结果可以看出，采用凯斯西储大学的滚动轴承故障数据集进行实验，诊断结果和t-SNE分析表明本发明方法对于相同类别的聚类效果和对于不同类别的分类效果均表现优异。本发明的网络对相同类别的样本可以很好地进行聚类，对不同的样本也可以很好地进行分类。Specifically, by analyzing the fault diagnosis results of the auxiliary visualization method Fig. 4, Fig. 7 and Fig. 10, the fault diagnosis accuracy rate is all over 98.5%, and the superiority of the method of the present invention can be clearly seen. Figure 5, Figure 8 and Figure 11 are t-SNE dimensionality reduction visualization diagrams. Taking Figure 5 as an example, in category 3, some samples are classified in category 2 and category 4, and in category 4, a small number of samples are classified In category 2, the diagnostic result does not reach 100%. However, according to the results of dimensionality reduction visualization, it can be seen that the rolling bearing fault data set of Case Western Reserve University is used for experiments. The diagnosis results and t-SNE analysis show that the method of the present invention has clustering effects for the same category and classification effects for different categories. All performed excellently. The network of the present invention can cluster samples of the same category well, and can also classify different samples well.

进一步地，将其与具有相同结构和参数的1D-CNN、1D-CNN-SVM和1D-CNN-LSTM作为本发明所述模型的对比，结果如图6、图12所示，结果表明在相同噪声条件下本发明所述模型的诊断精度更高，并且当噪声逐渐降低时，其诊断精度收敛得更快，验证了所述的1D-CNN-BiLSTM可解释神经网络的优秀抗噪性。Further, compare it with 1D-CNN, 1D-CNN-SVM and 1D-CNN-LSTM with the same structure and parameters as the model described in the present invention, the results are shown in Figure 6 and Figure 12, the results show that the same The diagnostic accuracy of the model of the present invention is higher under noise conditions, and when the noise is gradually reduced, the diagnostic accuracy converges faster, which verifies the excellent noise immunity of the 1D-CNN-BiLSTM interpretable neural network.

以上公开的本发明优选实施例只是用于帮助阐述本发明。优选实施例并没有详尽叙述所有的细节，也不限制该发明仅为所述的具体实施方式。显然，根据本说明书的内容，可作很多的修改和变化。本说明书选取并具体描述这些实施例，是为了更好地解释本发明的原理和实际应用，从而使所属技术领域技术人员能很好地理解和利用本发明。本发明仅受权利要求书及其全部范围和等效物的限制。The above-disclosed preferred embodiments of the present invention are provided only to help illustrate the present invention. The preferred embodiments do not exhaust all the details, nor do they limit the invention to only the described embodiments. Obviously, many modifications and variations are possible in light of the content of this specification. The present specification selects and specifically describes these embodiments in order to better explain the principles and practical applications of the present invention, so that those skilled in the art can well understand and utilize the present invention. The present invention is to be limited only by the claims and their full scope and equivalents.

Claims

1. A fault diagnosis method for an interpretable rolling bearing, comprising the following steps:

S1. Collect the one-dimensional time series signal of the rolling bearing and expand the sample;

S2. Establish an initial 1D-CNN-BiLSTM neural network model;

S3, adding an explanation layer to the neural network model;

S4. Establish a neural network model with convolution interpretation capability;

S5, using a variety of one-dimensional fault data to train the neural network model to obtain a model with fault diagnosis capability;

S6. Perform fault diagnosis on the rolling bearing by using the fault diagnosis model.

2. An interpretable rolling bearing fault diagnosis method according to claim 1, wherein step S1 comprises the following steps:

S11. Collect four types of one-dimensional time signals of normal, outer ring fault, inner ring fault and rolling element fault collected by the acceleration sensor;

S12, the one-dimensional time signal signal is expanded by means of overlapping sampling, and the data samples are divided into a training set and a test set;

S13. Divide the data sample into different training sets and test sets multiple times by means of K-fold cross-validation, and finally average the results to improve the robustness of the model.

3. An interpretable rolling bearing fault diagnosis method according to claim 1, wherein the structure of the initial 1D-CNN-BiLSTM neural network model comprises an input layer, a convolutional neural network layer (CNN), a BiLSTM layer, fully connected layer and output layer:

The CNN layer includes a convolution layer and a pooling layer, and the activation function of each convolution layer is the Relu function;

The pooling method of the pooling layer is maximum pooling;

The activation function of the output layer is a Softmax function.

4. An interpretable rolling bearing fault diagnosis method according to claim 3, wherein the BiLSTM layer comprises a bidirectional LSTM layer, and the composition of the LSTM layer is specifically:

f _t =σ(W _f [h _t-1 ,x _t ]+b _f )

i _t =σ(W _i [h _t-1 ,x _t ]+b _i )

o _t =σ(W _o [h _t-1 ,x _t ]+b _o )

c _t =f _t c _t-1 +i _t tanh(W _c [h _t-1 ,x _t ]+b _c )

h _t =o _t tanhc _t

where f _t is the forgetting gate, it is the input gate, _{o t is the output gate, c t is the cell state, σ is the sigmoid activation function, and W f , Wi , W o} _, _and _W _c _are _the weights in the training parameters , b _f , b _i , b _o , b _c are the bias items in the training parameters, h _t is the state of the hidden layer at the current moment, h _t-1 is the state of the hidden layer at the previous moment, and x _t is the current moment’s state of the hidden layer. enter.

5. A kind of interpretable rolling bearing fault diagnosis method according to claim 3, is characterized in that, described Relu function is:

In the formula, f(x) is the Relu function, and x is the input vector of the previous layer.

6. A kind of interpretable rolling bearing fault diagnosis method according to claim 3, is characterized in that, described Softmax activation function is:

In the formula, z _i+1 is the output value of the i+1th point in Z, _C is the number of output nodes, zc is the output value of the cth point in Z, and e is a natural constant.

7. An interpretable rolling bearing fault diagnosis method according to claim 1, wherein the interpretation layer is Grad-CAM++.

8. An interpretable rolling bearing fault diagnosis method according to claim 1 or 7, wherein step S3 comprises the following steps:

S31. Calculate the weights of each channel after the last convolutional layer of the 1D-CNN-BiLSTM neural network model:

In the formula, Y ^x is the score value of category x, that is, the importance degree,

is the weight of the k-th feature map for the category x,

is the pixel value at (i, j) of the k-th feature map,

is the weight for category x at (i, j) of the k-th feature map;

S32. Generate a class activation map according to the weight:

S33. Transform the size of the class activation map to the size of the original input image, and then overlay it on the original image through a heatmap. The size of the heatmap value corresponds to the activation degree of the 1D-CNN-BiLSTM neural network model to the input signal. .

9. The interpretable rolling bearing fault diagnosis method according to claim 8, wherein the specific steps of step S4 are:

S41. Connect the explanation layer and the BiLSTM layer after the 1D-CNN-BiLSTM neural network model;

S42. Use the interpretation layer as the middle layer of the entire model to explain the convolution process;

S43. Use the BiLSTM layer as the back end of the entire model, and use the output of the last convolutional layer of the CNN as the input of the back end to perform fault diagnosis.

10. An interpretable rolling bearing fault diagnosis method according to any one of claims 1 to 9, wherein the specific steps of step S5 are:

S51, classify the samples collected in step S1 into four categories, namely normal, outer ring fault, inner ring fault and rolling element fault;

S52, dividing the fault sample into a training set and a test set, and using the K-fold cross-validation method for training and testing;

S53. Use the training set as the input of the neural network model with convolution interpretation capability, and perform model training to obtain a model with fault diagnosis capability.