CN113486931B

CN113486931B - Rolling bearing enhanced diagnosis method based on PDA-WGANGP

Info

Publication number: CN113486931B
Application number: CN202110685339.XA
Authority: CN
Inventors: 陈嘉宇; 林翠颖; 张清华; 葛红娟
Original assignee: Nanjing University of Aeronautics and Astronautics
Current assignee: Nanjing University of Aeronautics and Astronautics
Priority date: 2021-06-21
Filing date: 2021-06-21
Publication date: 2023-05-26
Anticipated expiration: 2041-06-21
Also published as: CN113486931A

Abstract

The invention discloses a rolling bearing enhancement diagnosis method based on PDA-WGANGP, and relates to the field of mechanical equipment fault data enhancement. Firstly, training a pre-training network by using health data and fault data under laboratory conditions, and taking the structure and parameters of the pre-training network as feature extraction layers of the first several layers of a discriminator and a Dense classifier; secondly, introducing a residual error network, constructing a brand new generator, solving the gradient descent problem of the convolutional neural network, and preventing the generation of overfitting, so that a few high-quality samples are stably generated by using an unbalanced training set; finally, model training is completed, high-quality fault samples are generated, and verification is carried out on the generated samples. The method can accurately generate the high-quality fault samples, improve the running efficiency of the model, solve the problem of unbalanced data and improve the accuracy and stability of the generation of the fault samples of the rolling bearing.

Description

An enhanced diagnosis method for rolling bearings based on PDA-WGANGP

技术领域Technical Field

本发明涉及轴承故障诊断领域，具体为一种基于PDA-WGANGP的滚动轴承增强诊断方法。The invention relates to the field of bearing fault diagnosis, and in particular to a rolling bearing enhanced diagnosis method based on PDA-WGANGP.

背景技术Background Art

作为机械系统和电力系统中最重要的部件之一，滚动轴承经常在大负荷、高速旋转的恶劣环境中工作，导致其发生故障的概率大大增加，而一旦发生故障，将会造成巨大的经济损失或安全事故。因此，对滚动轴承进行健康状态的监控以及故障诊断研究十分必要。As one of the most important components in mechanical and power systems, rolling bearings often work in harsh environments with heavy loads and high-speed rotation, which greatly increases the probability of failure. Once a failure occurs, it will cause huge economic losses or safety accidents. Therefore, it is necessary to monitor the health status of rolling bearings and conduct fault diagnosis research.

基于数据驱动的故障诊断方法是当前检测设备健康状况的重要手段。但在实际应用中，滚动轴承大多处于正常工作状态，采集到的故障数据相对较少。因此，出现了数据不平衡问题。所谓数据不平衡，是指采集的故障数据量远小于正常样本，导致数据驱动的故障诊断方法存在模型偏差。因此，提高数据不平衡问题的预测性能，特别是对少数类别数据的预测性能具有重要意义。基于梯度惩罚与瓦瑟斯坦距离的辅助分类生成对抗网络(ACWGAN-GP)为故障诊断中的数据不平衡问题提供了一种可行的解决方案。ACWGAN-GP能够学习原始样本的数据分布特征，生成具有相似分布的新的合成样本，并且加入了瓦瑟斯坦距离增加了生成样本的真实度，添加了梯度惩罚的方法以防止模型崩塌。然而，ACWGAN-GP也存在着一定的不足。首先，模型的网络结构为从头开始训练的，需要更多的迭代步数才能生成出肉眼来看比较相近的故障信号样本。虽然针对特定的任务从头开始训练模型在最终会拟合出不错的样本，但是其需要花费更巨大的时间成本，大大缩短了算法的计算效率。其次，模型采用在判别器的最后一层加入分类层，分类的泛化能力不高，分类效果不好。而且，对于分类器，若只采用原始样本进行训练，容易导致过拟合问题。最后，在生成器中，随着网络层数的增加，卷积神经网络会出现梯度消失的现象，这会导致生成器的低精度和过拟合问题。The data-driven fault diagnosis method is an important means to detect the health status of equipment. However, in practical applications, rolling bearings are mostly in normal working state, and the collected fault data is relatively small. Therefore, the problem of data imbalance occurs. The so-called data imbalance means that the amount of collected fault data is much smaller than the normal sample, which leads to model bias in the data-driven fault diagnosis method. Therefore, it is of great significance to improve the prediction performance of data imbalance problem, especially the prediction performance of minority category data. The auxiliary classification generative adversarial network based on gradient penalty and Wasserstein distance (ACWGAN-GP) provides a feasible solution to the data imbalance problem in fault diagnosis. ACWGAN-GP can learn the data distribution characteristics of the original samples, generate new synthetic samples with similar distribution, and add Wasserstein distance to increase the authenticity of the generated samples, and add gradient penalty to prevent model collapse. However, ACWGAN-GP also has certain shortcomings. First, the network structure of the model is trained from scratch, and more iterations are required to generate fault signal samples that are relatively similar to the naked eye. Although training a model from scratch for a specific task will eventually fit a good sample, it takes a greater time cost, which greatly reduces the computational efficiency of the algorithm. Secondly, the model adds a classification layer to the last layer of the discriminator, which has low generalization ability and poor classification effect. Moreover, for the classifier, if only the original samples are used for training, it is easy to cause overfitting problems. Finally, in the generator, as the number of network layers increases, the convolutional neural network will experience the phenomenon of gradient disappearance, which will lead to low precision and overfitting problems of the generator.

综上所述，针对轴承故障数据不平衡问题，基于ACWGAN-GP的数据增强方法可以很好的弥补数据不平衡的不足，但却也产生了一些新的问题，比如模型生成样本质量不高、训练效率不高、分类器泛化能力不好、生成器容易产生梯度消失现象。这将降低模型的训练效率，制约后续模型故障诊断的精度。因此，一种快速有效提高模型生成样本的质量与训练效率，增强分类器的泛化能力，避免生成器的梯度消失问题，实现更准确、高效的故障样本生成的滚动轴承增强诊断方法亟待研究。In summary, the data enhancement method based on ACWGAN-GP can make up for the problem of unbalanced data in bearing fault data, but it also produces some new problems, such as low quality of model generated samples, low training efficiency, poor generalization ability of classifier, and easy gradient vanishing phenomenon of generator. This will reduce the training efficiency of the model and restrict the accuracy of subsequent model fault diagnosis. Therefore, a rolling bearing enhanced diagnosis method that can quickly and effectively improve the quality and training efficiency of model generated samples, enhance the generalization ability of classifier, avoid the gradient vanishing problem of generator, and achieve more accurate and efficient fault sample generation needs to be studied urgently.

发明内容Summary of the invention

为了解决上述问题，本发明公开了一种基于PDA-WGANGP的滚动轴承增强诊断方法，基于瓦瑟斯坦距离与梯度惩罚的预训练数据增强生成对抗网络方法，利用健康数据以及实验室条件下的故障数据进行预训练网络的训练，利用预训练网络的结构和参数作为判别器和Dense分类器前几层的特征提取层，利用不平衡训练集稳定地生成少数类的高质量样本，最后，完成模型训练，产生高质量的故障样本，并对生成样本进行验证。In order to solve the above problems, the present invention discloses a rolling bearing enhanced diagnosis method based on PDA-WGANGP, a pre-trained data enhanced generative adversarial network method based on Wasserstein distance and gradient penalty, the pre-trained network is trained using health data and fault data under laboratory conditions, the structure and parameters of the pre-trained network are used as the feature extraction layer of the discriminator and the first few layers of the Dense classifier, the unbalanced training set is used to stably generate high-quality samples of the minority class, and finally, the model training is completed, high-quality fault samples are generated, and the generated samples are verified.

一种基于PDA-WGANGP的滚动轴承增强诊断方法，其特征在于：包括如下步骤：A rolling bearing enhanced diagnosis method based on PDA-WGANGP, characterized in that it includes the following steps:

步骤1，传感器采集健康信号与故障信号数据，基于PDA方法预处理该数据并预训练网络模型；Step 1: The sensor collects healthy signal and fault signal data, pre-processes the data based on the PDA method, and pre-trains the network model;

步骤2，将所述数据作为输入量并调用预训练后的网络模型，基于WGANGP方法生成样本，评估该生成样本的质量以输出高质量的故障模型；Step 2, using the data as input and calling the pre-trained network model, generating samples based on the WGANGP method, and evaluating the quality of the generated samples to output a high-quality fault model;

步骤3，将所述高质量的故障模型作为输入样本训练故障诊断模型，将所述数据输入训练后的故障诊断模型中，输出故障诊断结果，验证网络模型的诊断能力。Step 3: Use the high-quality fault model as an input sample to train a fault diagnosis model, input the data into the trained fault diagnosis model, output the fault diagnosis result, and verify the diagnostic capability of the network model.

作为优选，步骤1具体包括：步骤1.1，对传感器采集的健康信号与故障信号进行快速傅里叶变换，使得信号相对应的健康、故障特征更为显著，方便之后模型进行特征提取，并对数据进行归一化到(-1，1)之间；步骤1.2，对数据进行划分，划分为预训练数据集、不平衡训练集以及测试集。步骤1.3，改进判别器与Dense分类器：利用步骤1.2中的预训练数据集对判别器与Dense分类器一维卷积网络的前几层进行预训练，在其后加高维卷积网络的结构，使网络拥有更具泛化性的特征提取层。Preferably, step 1 specifically includes: step 1.1, performing fast Fourier transform on the healthy signal and fault signal collected by the sensor, so that the health and fault features corresponding to the signal are more significant, which is convenient for the subsequent model to extract features, and the data is normalized to between (-1, 1); step 1.2, dividing the data into a pre-training data set, an unbalanced training set, and a test set. Step 1.3, improving the discriminator and the dense classifier: using the pre-training data set in step 1.2 to pre-train the first few layers of the one-dimensional convolutional network of the discriminator and the dense classifier, and then adding a high-dimensional convolutional network structure, so that the network has a more generalized feature extraction layer.

作为优选，改进的判别器、改进的Dense分类器预训练方法为：在改进的判别器、Dense分类器中采用预训练数据集预训练一维卷积网络的前几层，在其后加高维卷积网络的结构，使网络拥有更具泛化性的特征提取层。如此可以加快纳什均衡的到来，快速的生成质量好的生成信号。预训练网络参照VGG网络做一些改变，由于实验室条件下的故障数据和健康状态的数据十分地丰富，故预训练网络的训练数据采用实验室条件下故障轴承以及健康状态的轴承产生的数据集。Preferably, the pre-training method of the improved discriminator and the improved Dense classifier is: in the improved discriminator and the Dense classifier, the pre-training data set is used to pre-train the first few layers of the one-dimensional convolutional network, and then the high-dimensional convolutional network structure is added to make the network have a more generalized feature extraction layer. This can accelerate the arrival of the Nash equilibrium and quickly generate a good quality signal. The pre-training network is modified with reference to the VGG network. Since the fault data and healthy status data under laboratory conditions are very rich, the training data of the pre-training network uses the data set generated by the faulty bearings and the healthy bearings under laboratory conditions.

作为优选，步骤2具体包括：步骤2.1，建立WGANGP模型并初始化参数；步骤2.2，训练WGANGP模型：引入残差网络以构建改进的生成器，将随机噪声与故障类别标签输入改进的生成器中，以产生带有标签的合成样本；为不平衡训练集贴上类别标签作为输入的原始样本；将该合成样本与原始样本输入改进的判别器与Dense分类器中分别进行样本的真伪判断与类别分类；步骤2.3，若改进的判别器判断数据为生成样本则给出“假”的判别结果，若改进的判别器判断数据为原始样本则给出“真”的判别结果；Dense分类器最终输出生成样本与原始样本的故障类别；步骤2.4，当改进的判别器不能够准确判断某个样本是原始样本还是生成样本时，整个模型达到了纳什均衡，输出基于纳什均衡的生成样本；步骤2.5，评价步骤2.4输出的生成样本的质量，将高质量的生成样本作为步骤3中平衡训练集条件下轴承故障诊断的输入数据。Preferably, step 2 specifically includes: step 2.1, establishing a WGANGP model and initializing parameters; step 2.2, training the WGANGP model: introducing a residual network to construct an improved generator, inputting random noise and fault category labels into the improved generator to generate labeled synthetic samples; labeling the unbalanced training set with category labels as the original samples input; inputting the synthetic samples and the original samples into the improved discriminator and the Dense classifier to respectively judge the authenticity and classify the samples; step 2.3, if the improved discriminator judges that the data is a raw data, If the improved discriminator determines that the data is an original sample, a "false" discrimination result is given. If the improved discriminator determines that the data is an original sample, a "true" discrimination result is given; the dense classifier finally outputs the fault category of the generated sample and the original sample; step 2.4, when the improved discriminator cannot accurately determine whether a sample is an original sample or a generated sample, the entire model reaches a Nash equilibrium and outputs a generated sample based on the Nash equilibrium; step 2.5, evaluate the quality of the generated samples output by step 2.4, and use the high-quality generated samples as the input data for bearing fault diagnosis under the balanced training set condition in step 3.

作为优选，步骤2.1中建立WGANGP模型的伪代码如下，其中λ＝10，n_D＝5，

批量大小m，Adam超参数α、β₁、β₂，超参数ρ，Preferably, the pseudo code for establishing the WGANGP model in step 2.1 is as follows, where λ=10, n _D =5,

Batch size m, Adam hyperparameters α, β ₁ , β ₂ , hyperparameter ρ,

作为优选，步骤2.2中改进的生成器为引入残差网络后构建的全新生成器。具体为：将残差块A添加至生成器的网络结构中，残差块的输出为G(A)，F(A)为残差块的映射：G(A)＝F(A)+A。以解决卷积神经网络梯度随着卷积层增加而导致的梯度消失的现象。Preferably, the improved generator in step 2.2 is a new generator constructed after introducing the residual network. Specifically, the residual block A is added to the network structure of the generator, the output of the residual block is G(A), and F(A) is the mapping of the residual block: G(A) = F(A) + A. This solves the problem that the gradient of the convolutional neural network disappears as the number of convolutional layers increases.

作为优选，生成器的网络采用Instance Normalization(IN)算法代替BatchNormalization(BN)算法进行模型优化，从而使网络拥有更好的拟合能力。在生成器中使用IN不仅可以加速模型收敛，并且可以保持每个信号实例之间的独立。As a preference, the generator network uses the Instance Normalization (IN) algorithm instead of the Batch Normalization (BN) algorithm for model optimization, so that the network has better fitting ability. Using IN in the generator can not only accelerate model convergence, but also maintain the independence of each signal instance.

作为优选，步骤2.2中将随机噪声与故障类别标签输入改进的生成器中，具体为：将随机噪声向量集Z＝(z₁，z₂，z₃...z_m)，m＝(1，2…)与标签集Y＝(y¹，y²，y³...y^k)，k＝(1，2…)同时输入改进的生成器中，以使生成器产生带有标签的合成样本

其中，m为需产生样本的数量，k为故障类别数。Preferably, in step 2.2, random noise and fault category labels are input into the improved generator, specifically: random noise vector set Z = (z ₁ , z ₂ , z ₃ ... z _m ), m = (1, 2 ...) and label set Y = (y ¹ , y ² , y ³ ... y ^k ), k = (1, 2 ...) are simultaneously input into the improved generator, so that the generator generates synthetic samples with labels.

Among them, m is the number of samples to be generated, and k is the number of fault categories.

作为优选，Dense分类器采用交替池化方法和激活函数以减少训练过程中噪声的影响。原始的辅助分类生成对抗网络是通过向判别器添加分类层，在样本生成过程中提供不同类别控制的简单方法。而将分类器从判别器中单独提炼出来，提出基于Dense的分类器，使其专注于分类任务，如此拥有更好的分类能力，从而对生成器控制类别的生成起到约束监督作用，有效提高分类的精度。然而，分类器最初是为真实数据集设计的，而不是为对抗网络生成的数据设计的。由于生成器将随机噪声矢量作为输入并生成合成数据，因此所提出的分类器应考虑噪声输入的影响。所以，采用交替池化方法和分类器的激活函数来减少训练过程中噪声的影响。As a preferred method, the Dense classifier adopts an alternating pooling method and an activation function to reduce the influence of noise during training. The original auxiliary classification generative adversarial network is a simple method to provide different category controls during sample generation by adding a classification layer to the discriminator. The classifier is extracted from the discriminator separately, and a Dense-based classifier is proposed to focus on the classification task, so that it has better classification ability, thereby constraining the generation of the generator-controlled category and effectively improving the classification accuracy. However, the classifier was originally designed for real data sets, not for data generated by adversarial networks. Since the generator takes a random noise vector as input and generates synthetic data, the proposed classifier should consider the influence of noise input. Therefore, the alternating pooling method and the activation function of the classifier are used to reduce the influence of noise during training.

作为优选，改进的判别器、Dense分类器的原理为：As a preferred method, the principles of the improved discriminator and the Dense classifier are as follows:

L_D，

L_C是改进的判别器、Dense分类器对真实数据的分类、Dense分类器对生成数据的分类、Dense分类器的损失函数。P_r(s)是真实数据的分布，P_z(z)是噪声向量z的先验分布。θ_D、θ_G、θ_C是改进的判别器、生成器、Dense分类器的参数。c_g是从条件噪声中采样的类别，c_r是真实数据的类别。ρ是一个超参数，用来控制改进的生成器对真实数据分类的准确性和对生成数据分类的准确性的重要性。

为真实数据和生成数据的插值，λ是梯度惩罚项的权重以满足1-lipschitz条件。 _LD ,

L _C is the classification of real data by the improved discriminator, the classification of generated data by the Dense classifier, and the loss function of the Dense classifier. _{P r} (s) is the distribution of real data, and P _z (z) is the prior distribution of the noise vector z. _{θ D} , θ _G , θ _C are the parameters of the improved discriminator, generator, and Dense classifier. c _g is the category sampled from the conditional noise, and _cr is the category of the real data. ρ is a hyperparameter used to control the accuracy of the improved generator in classifying real data and the importance of the accuracy of the generated data classification.

is the interpolation of the real data and the generated data, and λ is the weight of the gradient penalty term to satisfy the 1-lipschitz condition.

作为优选，步骤2.5中采取余弦相似度和皮尔逊相关系数两个评价指标，以评价生成样本和真实样本的相似度。Preferably, in step 2.5, two evaluation indicators, cosine similarity and Pearson correlation coefficient, are used to evaluate the similarity between the generated samples and the real samples.

作为优选，步骤3中的故障诊断包括：步骤3.1，扩充不平衡训练集：将步骤2.4输出的基于纳什均衡的生成样本输入至诸如卷积神经网络、支持向量机等故障诊断模型中进行模型的训练；步骤3.2，将步骤1中划分的测试集输入训练后的故障诊断模型中进行模型诊断能力测试，最终得出基于PDA-WGANGP的滚动轴承增强诊断结果。Preferably, the fault diagnosis in step 3 includes: step 3.1, expanding the unbalanced training set: inputting the generated samples based on Nash equilibrium outputted in step 2.4 into fault diagnosis models such as convolutional neural networks and support vector machines for model training; step 3.2, inputting the test set divided in step 1 into the trained fault diagnosis model for model diagnosis capability testing, and finally obtaining the enhanced diagnosis results of rolling bearings based on PDA-WGANGP.

有益效果：Beneficial effects:

(1)本发明在判别器和分类器中采用预训练的网络结构，拥有更具泛化性的特征提取层，这在一定程度上缓解了模型泛化性不强的问题，减少了计算量，提高了样本生成的效率；(1) The present invention adopts a pre-trained network structure in the discriminator and the classifier, and has a more generalized feature extraction layer, which to some extent alleviates the problem of weak generalization of the model, reduces the amount of calculation, and improves the efficiency of sample generation;

(2)本发明将对于分类器，不止用原始数据进行训练，同时加入生成数据减缓过拟合，保障生成器的训练稳定度；(2) The present invention not only trains the classifier with original data, but also adds generated data to reduce overfitting and ensure the training stability of the generator;

(3)本发明采用全新的生成器网络结构，设计了一个全新的深度残差网络进行数据分布的拟合，并采用Instance Normalization(IN)代替Batch Normalization(BN)，从而拥有更好的拟合能力；(3) The present invention adopts a new generator network structure, designs a new deep residual network to fit the data distribution, and uses Instance Normalization (IN) instead of Batch Normalization (BN), so as to have better fitting ability;

(4)本发明将分类器从判别器中单独提炼出来，提出基于Dense的分类器，使其专注于分类任务，如此拥有更好的分类能力，从而对生成器控制类别的生成起到约束监督作用；(4) The present invention separates the classifier from the discriminator and proposes a Dense-based classifier, which focuses on the classification task and has better classification ability, thereby constraining and supervising the generation of the generator control category;

(5)本发明所提的技术方法能够应用在有关滚动轴承故障诊断的生产作业领域，实现工业生产中对滚动轴承健康状态的监测，延长滚动轴承的使用寿命，保证装备的持续适航。(5) The technical method proposed in the present invention can be applied in the field of production operations related to rolling bearing fault diagnosis, realize the monitoring of the health status of rolling bearings in industrial production, extend the service life of rolling bearings, and ensure the continuous airworthiness of equipment.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为本发明的一个实施例基于PDA-WGANGP的滚动轴承增强诊断方法方案图；FIG1 is a schematic diagram of a rolling bearing enhanced diagnosis method based on PDA-WGANGP according to an embodiment of the present invention;

图2为本发明的一个实施例基于WGANGP的样本生成方案图；FIG2 is a sample generation scheme diagram based on WGANGP according to an embodiment of the present invention;

图3为本发明的一个实施例不平衡训练集与4种方法下的诊断精度对比图。FIG3 is a comparison diagram of the diagnostic accuracy of an unbalanced training set and four methods according to an embodiment of the present invention.

具体实施方式DETAILED DESCRIPTION

下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例仅仅是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。The following will be combined with the drawings in the embodiments of the present invention to clearly and completely describe the technical solutions in the embodiments of the present invention. Obviously, the described embodiments are only part of the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of the present invention.

本发明针对机械系统中滚动轴承故障、健康数据不平衡的需求，以解决ACWGAN-GP存在的训练效率低、生成样本质量较低等问题为目标，提出了一种基于PDA-WGANGP的滚动轴承增强诊断方法。Aiming at the needs of rolling bearing fault and health data imbalance in mechanical systems, this paper proposes a rolling bearing enhanced diagnosis method based on PDA-WGANGP with the goal of solving the problems of low training efficiency and low quality of generated samples existing in ACWGAN-GP.

结合图1至图3，一种基于PDA-WGANGP的滚动轴承增强诊断方法，步骤1，传感器采集健康信号与故障信号数据，基于PDA方法对数据进行预处理并对模型进行预训练。为了具体说明本发明的方法，本发明对2016年帕德伯恩大学轴承数据集进行实验验证。在这个数据集中，轴承损坏分布广泛，将分为三个主要组：6个健康轴承，12个人为损坏的轴承和14个实际情况损坏的轴承。以64khz的采样率获得振动信号。In conjunction with Figures 1 to 3, a rolling bearing enhanced diagnosis method based on PDA-WGANGP, step 1, the sensor collects health signal and fault signal data, pre-processes the data based on the PDA method and pre-trains the model. In order to specifically illustrate the method of the present invention, the present invention experimentally verifies the 2016 Paderborn University bearing data set. In this data set, bearing damage is widely distributed and will be divided into three main groups: 6 healthy bearings, 12 artificially damaged bearings and 14 actually damaged bearings. The vibration signal is obtained at a sampling rate of 64khz.

通过快速傅里叶变换将数据的时域信号转换为频域信号，以增强信号特征，并对信号进行归一化处理。对振动信号进行快速傅里叶变换，使得信号相对应的健康、故障特征更为显著，方便之后模型进行特征提取，并对数据进行归一化到(-1，1)之间。The time domain signal of the data is converted into a frequency domain signal through fast Fourier transform to enhance the signal characteristics and normalize the signal. Fast Fourier transform is performed on the vibration signal to make the corresponding health and fault characteristics of the signal more significant, which is convenient for the subsequent model to extract features and normalize the data to between (-1, 1).

对数据进行划分，划分为预训练数据集、不平衡训练集以及测试集。因为在实验室条件下的故障数据和健康状态的数据十分地丰富，所以采用实验室条件下故障轴承以及健康状态的轴承产生的数据集作为预训练数据，基于预训练数据集预训练Dense分类器与判别器。然而，在实际工业生产中，滚动轴承大多处于正常工作状态，因此，健康样本远远多于故障样本。基于此条件，假设在训练集中，在原始样本中取每一类故障样本个数为n个，健康状态下的样本为5n个，二者相差5倍，意味着生成器每一类故障需要生成4n个样本。而在测试集中，健康状况与每一类故障状况的样本数目为2n个。其中，每个样本的长度都为1024。操作工况N09_M07_F10代表旋转速度为900rpm，加载力矩为0.7NM，径向力为1000N。具体预训练数据集如表1。The data is divided into pre-training data set, unbalanced training set and test set. Because the fault data and health status data under laboratory conditions are very rich, the data set generated by the faulty bearings and the healthy bearings under laboratory conditions is used as the pre-training data, and the Dense classifier and discriminator are pre-trained based on the pre-training data set. However, in actual industrial production, most rolling bearings are in normal working condition, so the healthy samples are far more than the fault samples. Based on this condition, it is assumed that in the training set, the number of each type of fault samples in the original sample is n, and the number of samples in the healthy state is 5n, which is 5 times different, which means that the generator needs to generate 4n samples for each type of fault. In the test set, the number of samples of the healthy state and each type of fault condition is 2n. Among them, the length of each sample is 1024. The operating condition N09_M07_F10 represents a rotation speed of 900rpm, a loading torque of 0.7NM, and a radial force of 1000N. The specific pre-training data set is shown in Table 1.

表1.预训练网络数据集Table 1. Pre-trained network datasets

对于不平衡的数据集，选取K001、KA04、KA15、KA16、KI04、KI14、KI17、KI18共8类作为训练集和测试集，包括1类健康轴承，7类真实故障轴承。采用的操作工况为N09_M07_F10。其中K001为健康轴承，故障轴承的具体信息如表2所示。For the unbalanced data set, 8 categories, including K001, KA04, KA15, KA16, KI04, KI14, KI17, and KI18, are selected as training and test sets, including 1 healthy bearing and 7 real faulty bearings. The operating condition used is N09_M07_F10. K001 is a healthy bearing, and the specific information of the faulty bearing is shown in Table 2.

表2.故障轴承的损伤说明Table 2. Damage description of faulty bearings

在真实情况下，由于噪声等影响下，对于同样工况同样损伤程度不同操作时间的信号测量也有些许差距，这对故障诊断精度带来不小的阻碍。因此本文选取两次不同的操作时间，分为为时间一和时间二，来更加真实地体现实际情况。训练集和测试集如表3表4所示。In real situations, due to the influence of noise, there are some differences in the signal measurement of different operation times under the same working conditions and the same degree of damage, which brings a considerable obstacle to the accuracy of fault diagnosis. Therefore, this paper selects two different operation times, divided into time one and time two, to more realistically reflect the actual situation. The training set and test set are shown in Table 3 and Table 4.

表3.不平衡训练数据集Table 3. Imbalanced training dataset

表4.测试数据集Table 4. Test dataset

针对判别器与Dense分类器进行预训练使其通过在大量数据下的学习得到更为通用的特征提取器。在改进的判别器、Dense分类器中采用预训练数据集预训练一维卷积网络的前几层，在其后加高维卷积网络的结构，使网络拥有更具泛化性的特征提取层。如此可以加快纳什均衡的到来，快速的生成质量好的生成信号。对于判别器第1-9层采用预训练网络结构与参数，设置第1-9层为不可训练，其余层为添加层，从头开始训练。对于分类器第1-19层采用预训练网络结构与参数，设置第1-11层为不可训练，设置第12-19层为可训练。其余层为添加层，从头开始训练。其中，预训练网络结构、Dense分类器网络结构、改进的判别器网络结构如表5、6、7所示。当网络预训练完成后，保存预训练模型，以便正式训练时的调用。Pre-training is performed on the discriminator and the dense classifier so that they can obtain more general feature extractors through learning under a large amount of data. In the improved discriminator and dense classifier, the pre-training data set is used to pre-train the first few layers of the one-dimensional convolutional network, and the structure of the high-dimensional convolutional network is added afterwards, so that the network has a more generalized feature extraction layer. This can accelerate the arrival of Nash equilibrium and quickly generate high-quality generation signals. For the 1st to 9th layers of the discriminator, the pre-trained network structure and parameters are used, and the 1st to 9th layers are set as untrainable, and the remaining layers are added layers, and training is started from scratch. For the 1st to 19th layers of the classifier, the pre-trained network structure and parameters are used, and the 1st to 11th layers are set as untrainable, and the 12th to 19th layers are set as trainable. The remaining layers are added layers and training is started from scratch. Among them, the pre-trained network structure, the dense classifier network structure, and the improved discriminator network structure are shown in Tables 5, 6, and 7. When the network pre-training is completed, the pre-trained model is saved for calling during formal training.

表5.预训练网络结构Table 5. Pre-trained network structure

表6.Dense分类器网络结构Table 6. Dense classifier network structure

表7.改进的判别器网络结构Table 7. Improved discriminator network structure

步骤2，将步骤1中的数据作为输入量并调用预训练后的网络模型，基于WGANGP方法生成样本，评估该生成样本的质量以输出高质量的故障模型。Step 2: Use the data in step 1 as input and call the pre-trained network model, generate samples based on the WGANGP method, and evaluate the quality of the generated samples to output a high-quality fault model.

首先，建立WGANGP模型并初始化参数：First, establish the WGANGP model and initialize the parameters:

如图2所示，为基于WGANGP的样本生成方案图。将随机噪声与种故障类别标签输入改进的生成器中，以产生带有标签的合成样本；为不平衡训练集贴上类别标签作为输入量；将该合成样本与输入量输入改进的判别器与Dense分类器中分别进行样本的真伪判断与类别分类。As shown in Figure 2, it is a sample generation scheme diagram based on WGANGP. Random noise and fault category labels are input into the improved generator to generate labeled synthetic samples; the category labels are attached to the imbalanced training set as input; the synthetic samples and input are input into the improved discriminator and Dense classifier to respectively judge the authenticity and classify the samples.

WGANGP的训练模型的伪代码如下，其中λ＝10，n_D＝5，

批量大小m，Adam超参数α、β₁、β₂，超参数ρ：The pseudo code of the training model of WGANGP is as follows, where λ = 10, n _D = 5,

Batch size m, Adam hyperparameters α, β ₁ , β ₂ , hyperparameter ρ:

训练WGANGP模型，首先，将随机噪声z与7种故障类别标签Y＝{y¹,y²...y⁷}相结合，输入至生成器G中，以便使生成器产生带有标签的合成样本

其中，m为需产生样本的数量，k为故障类别数。其中，生成器G的网络结构如表8所示。To train the WGANGP model, first, random noise z is combined with seven fault category labels Y = {y ¹ , y ² ...y ⁷ } and input into the generator G so that the generator generates synthetic samples with labels.

Among them, m is the number of samples to be generated, and k is the number of fault categories. The network structure of the generator G is shown in Table 8.

为了解决卷积神经网络梯度随着卷积层增加而导致的梯度消失的现象，将残差块A添加至网络结构中，以改进生成器。其中，残差块的输出为G(A)，F(A)为残差块的映射：In order to solve the problem of gradient vanishing of convolutional neural network as the number of convolutional layers increases, the residual block A is added to the network structure to improve the generator. The output of the residual block is G(A), and F(A) is the mapping of the residual block:

G(A)＝F(A)+AG(A)＝F(A)+A

并且，采用Instance Normalization(IN)代替Batch Normalization(BN)，从而使网络拥有更好的拟合能力。在生成器中使用IN不仅可以加速模型收敛，并且可以保持每个信号实例之间的独立。因此在生成器中采用IN。In addition, Instance Normalization (IN) is used instead of Batch Normalization (BN) to make the network have better fitting ability. Using IN in the generator can not only accelerate the convergence of the model, but also keep each signal instance independent. Therefore, IN is used in the generator.

然后，生成器将持续生成新的样本，持续生成的新样本与原始的每类100个的故障样本按照批次的形式随机输入Dense分类器C与改进的判别器D中分别进行样本的类别分类与真伪判断。Then, the generator will continue to generate new samples. The continuously generated new samples and the original 100 fault samples of each category are randomly input into the Dense classifier C and the improved discriminator D in the form of batches for sample category classification and authenticity judgment respectively.

ACWGAN-GP是通过向判别器添加分类层，在样本生成过程中提供不同类别控制的简单方法。而将分类器从判别器中单独提炼出来，提出基于Dense的分类器，使其专注于分类任务，如此拥有更好的分类能力，从而对生成器控制类别的生成起到约束监督作用。ACWGAN-GP is a simple method to provide different category controls in the sample generation process by adding a classification layer to the discriminator. The classifier is extracted from the discriminator and a Dense-based classifier is proposed to focus on the classification task, which has better classification capabilities and thus plays a role in constraining and supervising the generation of the generator control category.

然而分类器最初是为真实数据集设计的，而不是为对抗网络生成的数据设计的。由于生成器将随机噪声矢量作为输入并生成合成数据，因此所提出的分类器应考虑噪声输入的影响。所以，采用交替池化方法和分类器的激活函数来减少训练过程中噪声的影响。改进的判别器、Dense分类器的原理为：However, the classifier was originally designed for real datasets, not for data generated by adversarial networks. Since the generator takes a random noise vector as input and generates synthetic data, the proposed classifier should consider the impact of the noise input. Therefore, an alternating pooling method and the activation function of the classifier are used to reduce the impact of noise during training. The principle of the improved discriminator and dense classifier is:

L_D，

最后，当判别器无法判断样本是来自生成器还是原始训练样本时，整个模型达到纳什均衡，此时，生成器生成的样本近乎拟合于原始样本，可以作为后序故障诊断的训练样本，扩充不平衡训练数据集。Finally, when the discriminator cannot determine whether the sample comes from the generator or the original training sample, the entire model reaches a Nash equilibrium. At this time, the sample generated by the generator is almost consistent with the original sample and can be used as a training sample for subsequent fault diagnosis to expand the unbalanced training data set.

如表3所示，不平衡训练集下的健康状态的样本个数为500，因此，在模型达到纳什均衡时将7种故障状态下的各100个故障样本扩充至500个，为进行故障诊断的实验验证做准备。As shown in Table 3, the number of samples in the healthy state under the unbalanced training set is 500. Therefore, when the model reaches the Nash equilibrium, the 100 fault samples in each of the seven fault states are expanded to 500 to prepare for the experimental verification of fault diagnosis.

表8.改进的生成器网络结构Table 8. Improved generator network structure

为验证生成样本质量的好坏，本文引入了余弦相似度和皮尔逊相关系数评价生成样本和真实样本的相似度，验证基于WGANGP方法的优越性。In order to verify the quality of generated samples, this paper introduces cosine similarity and Pearson correlation coefficient to evaluate the similarity between generated samples and real samples, and verifies the superiority of the WGANGP method.

余弦相似度：衡量两个样本向量的相关性。给出的相似性范围从-1到1：-1意味着两个向量指向的方向正好截然相反，1表示它们的指向是完全相同的，0通常表示它们之间是独立的，而在这之间的值则表示中间的相似性或相异性。样本向量x,y的余弦相似度为：Cosine similarity: measures the correlation between two sample vectors. The given similarity ranges from -1 to 1: -1 means that the two vectors point in opposite directions, 1 means that they point in exactly the same direction, 0 usually means that they are independent, and the values in between indicate intermediate similarities or dissimilarities. The cosine similarity of sample vectors x and y is:

皮尔逊相关系数：衡量两个随机样本变量的相关性，绝对值越大，相关程度越大。随机样本变量X,Y的皮尔逊相关系数为：Pearson correlation coefficient: measures the correlation between two random sample variables. The larger the absolute value, the greater the correlation. The Pearson correlation coefficient of random sample variables X and Y is:

其中，Cov(X,Y)为X与Y的协方差，σ_x为X的方差，σ_Y为Y的方差。Where Cov(X,Y) is the covariance of X and Y, _σx is the variance of X, and _σY is the variance of Y.

合成少数类过采样(SMOTE)、自适应综合过采样(ADASYN)、ACWGAN-GP这三种方法将被用以完成对比分析。四种方法的余弦相似度对比见表9，皮尔逊相关系数对比见表10。The three methods of synthetic minority oversampling (SMOTE), adaptive synthetic oversampling (ADASYN), and ACWGAN-GP will be used to complete the comparative analysis. The cosine similarity comparison of the four methods is shown in Table 9, and the Pearson correlation coefficient comparison is shown in Table 10.

表9.四种方法的余弦相似度对比Table 9. Cosine similarity comparison of four methods

表10.四种方法的皮尔逊相关系数对比Table 10. Comparison of Pearson correlation coefficients of four methods

通过表9、表10余弦相似度、皮尔逊相关系数的计算可知，WGANGP生成的故障信号具有较高的真实度，所以将会对数据平衡后的故障诊断起到了积极作用。而SMOTE与ADASYN的余弦相似度、皮尔逊相关系数相对于WGANGP与AC-WGANGP较高，由此可见，AC-WGANGP没有很好的拟合原始故障信号的数据分布，因此其生成器产生的故障信号质量较差，无法对真实情况下的故障诊段进行有效的帮助。而传统SMOTE、ADASYN由于为线性插值产生新信号，即使在小样本条件下，产生的信号相似度较高，对于数据增强有一定的效果。Through the calculation of cosine similarity and Pearson correlation coefficient in Table 9 and Table 10, it can be seen that the fault signal generated by WGANGP has a high degree of authenticity, so it will play a positive role in fault diagnosis after data balancing. The cosine similarity and Pearson correlation coefficient of SMOTE and ADASYN are higher than those of WGANGP and AC-WGANGP. It can be seen that AC-WGANGP does not fit the data distribution of the original fault signal well, so the fault signal generated by its generator is of poor quality and cannot effectively help the fault diagnosis in real situations. Traditional SMOTE and ADASYN generate new signals by linear interpolation, so even under small sample conditions, the generated signals have a high degree of similarity, which has a certain effect on data enhancement.

步骤3，通过步骤2输出的高质量的故障模型来扩充不平衡训练集，将数量为n个的少数类故障样本扩充至与健康状态的样本数5n一致，也就是说每一类故障生成4n个样本，将扩充完成的训练集输入至诸如卷积神经网络、支持向量机等故障诊断模型中进行模型的训练，将测试集输入训练后的故障诊断模型中，输出故障诊断结果，验证网络模型的诊断能力，得出最终基于PDA-WGANGP的滚动轴承增强诊断结果。Step 3, expand the unbalanced training set through the high-quality fault model output by step 2, expand the number of n minority class fault samples to the same number of healthy state samples 5n, that is, generate 4n samples for each type of fault, input the expanded training set into fault diagnosis models such as convolutional neural networks and support vector machines for model training, input the test set into the trained fault diagnosis model, output the fault diagnosis results, verify the diagnostic ability of the network model, and obtain the final enhanced diagnosis results of rolling bearings based on PDA-WGANGP.

采用本发明提出的基于PDA-WGANGP的滚动轴承增强诊断方法，为比较验证所提方法的有效性与优越性，SMOTE、ADASYN、ACWGAN-GP这三种方法将被用以完成对比分析，本文将通过这四种方法对少类故障信号进行补充，得到数据增强的训练数据集。最后对不平衡训练数据集以及四种方法增强的数据集分别进行一维卷积神经网(1D-CNN)、支持向量机(SVM)、稠密卷积网络(DENSE)三种网络结构的故障诊断，比较各个模型的诊断精度，来验证模型产生样本的精确性。不平衡训练集与四种方法下的诊断精度对比图如图3所示。The rolling bearing enhanced diagnosis method based on PDA-WGANGP proposed in this invention is adopted. In order to compare and verify the effectiveness and superiority of the proposed method, the three methods of SMOTE, ADASYN and ACWGAN-GP will be used to complete the comparative analysis. This paper will use these four methods to supplement the minority fault signals and obtain the data enhanced training data set. Finally, the unbalanced training data set and the data set enhanced by the four methods are respectively subjected to fault diagnosis of three network structures, namely one-dimensional convolutional neural network (1D-CNN), support vector machine (SVM) and dense convolutional network (DENSE), and the diagnostic accuracy of each model is compared to verify the accuracy of the samples generated by the model. The comparison chart of the diagnostic accuracy of the unbalanced training set and the four methods is shown in Figure 3.

如图3所示，基于三种故障诊断模型的PDA-WGANGP方法呈现出稳定的特质，在DENSE上诊断精度最高，能够达到93.35％的精确度，1D-CNN上最低，为92.25％，二者仅相差1.1％。而基于AC-WGANGP数据增强方法的波动最大，最大故障诊断精度为91.63％，最小为87.13％，二者相差4.5％，表明AC-WGANGP所生成的样本质量不高。对于不平衡训练数据集，因为模型缺少训练数据的输入，其诊断效果最差。虽然SMOTE、ADASYN在生成样本上与真实样本具有强的相关性，但其严重依赖数据特征，没有考虑少数类样本的真实分布特性，其故障诊断的效果不如PDA-WGANGP。As shown in Figure 3, the PDA-WGANGP method based on the three fault diagnosis models shows stable characteristics. The highest diagnostic accuracy is achieved on DENSE, which can reach 93.35% accuracy, and the lowest is 92.25% on 1D-CNN, with a difference of only 1.1%. The AC-WGANGP data enhancement method has the largest fluctuation, with the maximum fault diagnosis accuracy of 91.63% and the minimum of 87.13%, with a difference of 4.5%, indicating that the quality of samples generated by AC-WGANGP is not high. For unbalanced training data sets, the diagnostic effect is the worst because the model lacks the input of training data. Although SMOTE and ADASYN have a strong correlation with real samples in terms of generated samples, they rely heavily on data features and do not consider the real distribution characteristics of minority class samples. Their fault diagnosis effect is not as good as PDA-WGANGP.

由此，通过数据分析实施了本发明的应用过程，并且验证了本发明方法的有效性与优越性。Thus, the application process of the present invention was implemented through data analysis, and the effectiveness and superiority of the method of the present invention were verified.

最后应说明的是：以上所述仅为本发明的优选实施例而已，并不用于限制本发明，尽管参照前述实施例对本发明进行了详细的说明，对于本领域的技术人员来说，其依然可以对前述各实施例所记载的技术方案进行修改，或者对其中部分技术特征进行等同替换。凡在本发明的精神和原则之内，所作的任何修改、等同替换、改进等，均应包含在本发明的保护范围之内。Finally, it should be noted that the above description is only a preferred embodiment of the present invention and is not intended to limit the present invention. Although the present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art can still modify the technical solutions described in the aforementioned embodiments or replace some of the technical features therein by equivalents. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention shall be included in the protection scope of the present invention.

Claims

1. A rolling bearing enhanced diagnosis method based on PDA-WGANGP, which is characterized in that: the method comprises the following steps: step 1, a sensor collects health signals and fault signal data, preprocesses the health signals and the fault signal data based on a PDA method and pretrains a network model; the method specifically comprises the following steps:

step 1.1, converting a time domain signal of data into a frequency domain signal through fast Fourier transform to enhance signal characteristics, and carrying out normalization processing on the signal;

step 1.2, dividing the data into a pre-training data set, an unbalanced training set and a test set;

step 1.3, improving a discriminator and a Dense classifier: pre-training the one-dimensional convolution network of the discriminator and the Dense classifier by using the pre-training data set in the step 1.2, and adding the structure of the high-dimensional convolution network;

step 2, taking the fault signal data as input quantity, calling a pre-trained network model, generating a sample based on a WGANGP method, and evaluating the quality of the generated sample to output a high-quality fault model, wherein the specific steps are as follows: step 2.1, establishing a WGANGP model and initializing parameters;

step 2.2, training a WGANGP model: introducing a residual network to improve a generator, inputting random noise and fault class labels into the improved generator to produce labeled composite samples; attaching a class label to the unbalanced training set as an input original sample; inputting the synthesized sample and the original sample into an improved discriminator and a Dense classifier to respectively judge the authenticity and classify the samples;

step 2.3, if the improved discriminator judges that the data is a synthesized sample, a false discrimination result is given, and if the improved discriminator judges that the data is an original sample, a true discrimination result is given; the Dense classifier outputs the fault category of the synthesized sample and the original sample finally;

step 2.4, when the improved discriminator cannot accurately judge whether a certain sample is an original sample or a synthesized sample, the whole model achieves Nash equilibrium, and a generated sample based on Nash equilibrium is output;

step 2.5, evaluating the quality of the generated sample output in the step 2.4, and taking the high-quality generated sample as input data of bearing fault diagnosis under the condition of the balance training set in the step 3;

and 3, after the unbalance training set is expanded by the high-quality generated sample, training a fault diagnosis model by taking the high-quality generated sample as an input sample under the condition of the balance training set, inputting the test set divided in the step 1.2 into the trained fault diagnosis model, outputting a fault diagnosis result, verifying the diagnosis capability of the network model, and finally obtaining the rolling bearing enhanced diagnosis result based on the PDA-WGANGP.

2. The PDA-WGANGP based rolling bearing enhanced diagnostic method according to claim 1, wherein: the method for improving the generator in the step 2.2 is to introduce a residual network, and add a residual block a to the network structure of the generator to solve the problem that the gradient of the convolutional neural network disappears due to the increase of the convolutional layer, wherein the output of the residual block is G (a), and F (a) is the mapping of the residual block: g (a) =f (a) +a.

3. The PDA-WGANGP based rolling bearing enhanced diagnostic method according to claim 2, wherein: the network of generators uses an IN algorithm instead of a BN algorithm for model optimization to speed up model convergence and maintain independence between each signal instance.

4. A PDA-WGANGP-based rolling bearing enhanced diagnostic method according to claim 3, wherein: the step 2.2 is to input random noise and fault class labels into an improved generator, specifically: set of random noise vectors z= (Z) ₁ ，z ₂ ，z ₃ ...z _m ) M= (1, 2 …) and tag set y= (Y) ¹ ，y ² ，y ³ …y ^k ) K= (1, 2 …) are simultaneously input into the modified generator such that the generator produces labeled composite samples

Where m is the number of samples to be generated and k is the number of fault categories. />

5. The PDA-WGANGP based rolling bearing enhanced diagnostic method according to claim 1, wherein: the Dense classifier adopts an alternate pooling method and an activation function to reduce the influence of noise in the training process.

6. The PDA-WGANGP-based rolling bearing enhanced diagnostic method according to claim 5, wherein: the principle of the improved discriminator and the Dense classifier is as follows:

wherein L is _D ，

L _C The method is an improved discriminator, classification of real data by a Dense classifier, classification of generated data by the Dense classifier and a loss function of the Dense classifier; p (P) _r (s) is the distribution of real data, P _z (z) is an a priori distribution of noise vector z; θ _D 、θ _G 、θ _C Is a parameter of an improved discriminator, generator and Dense classifier; c _g Is the category sampled from the conditional noise, c _r Is a category of real data; ρ is a superparameter for controlling the accuracy of the improved generator for true data classification and the importance of the accuracy for generating data classification;

For interpolation of the real data and the generated data λ is the weight of the gradient penalty term to satisfy the 1-lipschitz condition.

7. The PDA-WGANGP-based rolling bearing enhanced diagnostic method according to claim 6, wherein: and in the step 2.5, two evaluation indexes of cosine similarity and pearson correlation coefficient are adopted to evaluate the similarity of the generated sample and the real sample.