CN116858531A

CN116858531A - A wind turbine gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt

Info

Publication number: CN116858531A
Application number: CN202310579050.9A
Authority: CN
Inventors: 雷钧; 张彬桥; 舒勇; 刘雷; 杨洋
Original assignee: China Three Gorges University CTGU
Current assignee: China Three Gorges University CTGU
Priority date: 2023-05-22
Filing date: 2023-05-22
Publication date: 2023-10-10

Abstract

A wind turbine gearbox fault diagnosis method based on data enhancement and CSP‑ResNeXt, including the following steps: using an acceleration sensor installed on the gearbox to collect fault vibration signal data collected from different fault types of the gearbox; using the wavelet packet decomposition method to decompose the collected data Decompose the fault vibration signal to obtain different wavelet packet coefficients. Randomly select a set of wavelet coefficients, distort them and restore them to time domain signals to expand the fault samples and complete the data enhancement operation of the fault samples; through the relative position matrix Relative Position Matrix , RPM converts the time domain signal into a grayscale image and inputs it into the built CSP‑ResNeXt network to train the diagnostic model. Finally, an intelligent fault diagnosis model of the wind turbine gearbox can be obtained, which can be used to input vibration signals collected in real time for fault diagnosis and identification. The present invention uses wavelet packet distortion technology for data enhancement, and at the same time adopts an improved ResNeXt network to achieve accurate fault identification of wind turbine gearboxes when fault sample classes are seriously unbalanced.

Description

A wind turbine gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt

技术领域Technical Field

本发明涉及风电机组齿轮箱的健康状态监测技术领域，具体涉及一种基于数据增强和CSP-ResNeXt的风机齿轮箱故障诊断方法。The present invention relates to the technical field of health status monitoring of a wind turbine gearbox, and in particular to a wind turbine gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt.

背景技术Background Art

风电机组由于长期工作在诸如极端温度、暴雨、暴雪、盐雾等环境下，随着运行时间的增加，叶片、主轴承、齿轮箱、发电机及其他部件的疲劳强度、运行性能等不断下降，引起异常和故障，导致风机不正常运行甚至停机。针对故障部件进行分析，风机齿轮箱在所有故障部件所占比例最高，因此对风机齿轮箱建立故障诊断模型十分重要。Wind turbines work in extreme temperature, rainstorm, snowstorm, salt spray and other environments for a long time. As the operation time increases, the fatigue strength and operating performance of blades, main bearings, gearboxes, generators and other components continue to decline, causing abnormalities and failures, resulting in abnormal operation or even shutdown of the wind turbine. Analysis of the faulty components shows that the wind turbine gearbox accounts for the highest proportion of all faulty components, so it is very important to establish a fault diagnosis model for the wind turbine gearbox.

一般针对旋转部件常采用振动信号做为信号源进行故障的分析，在信号采集过程中参杂了强噪音信号，信号本身有具有非线性以及非平稳性的特点，给故障诊断带来了挑战。同时，风电机组由加速度传感器采集的一般为正常状态的振动信号，而故障信号采集样例较少，使得故障诊断模型训练带来了困难。Generally, vibration signals are used as signal sources for fault analysis of rotating parts. Strong noise signals are mixed in the signal acquisition process, and the signal itself has nonlinear and non-stationary characteristics, which brings challenges to fault diagnosis. At the same time, the vibration signals collected by the acceleration sensor of the wind turbine are generally normal, and there are few samples of fault signal collection, which makes it difficult to train the fault diagnosis model.

发明内容Summary of the invention

本发明所要解决的技术问题是提供一种基于数据增强和CSP-ResNeXt的风机齿轮箱故障诊断方法，解决风机齿轮箱故障振动信号能量微弱，常常被强噪声污染，传统的故障诊断采用时域、频域分析方法的人工故障特征提取具有主观性，诊断效果差。The technical problem to be solved by the present invention is to provide a fan gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt, so as to solve the problem that the energy of the fan gearbox fault vibration signal is weak and is often polluted by strong noise. The traditional fault diagnosis uses time domain and frequency domain analysis methods to extract artificial fault features, which is subjective and has poor diagnostic effect.

为解决上述技术问题，本发明所采用的技术方案是：In order to solve the above technical problems, the technical solution adopted by the present invention is:

一种基于数据增强和CSP-ResNeXt的风机齿轮箱故障诊断方法，包括以下步骤：A wind turbine gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt comprises the following steps:

Step1、利用安装在齿轮箱上的加速度传感器采集齿轮箱不同故障类型采集的故障振动信号数据；Step 1, using the acceleration sensor installed on the gearbox to collect fault vibration signal data of different fault types of the gearbox;

Step2、采用小波包畸变对样本数据进行扩充；Step 2, use wavelet packet distortion to expand the sample data;

Step3、将故障振动信号通过相对位置矩阵转化为灰度图；Step 3, convert the fault vibration signal into a grayscale image through the relative position matrix;

Step4、将样本扩充后的数据输入改进ResNeXt网络，最终可以得到风机齿轮箱的智能故障诊断模型，利用该模型输入实时采集的振动信号进行故障诊断和识别。Step 4: Input the expanded sample data into the improved ResNeXt network, and finally obtain the intelligent fault diagnosis model of the wind turbine gearbox. Use this model to input the real-time collected vibration signal for fault diagnosis and identification.

上述的Step2的具体步骤如下：The specific steps of Step 2 above are as follows:

Step2.1、采用小波包变换将原始振动信号做2层分解；Step 2.1, use wavelet packet transform to decompose the original vibration signal into two layers;

Step2.2、随机选取一组小波系数进行畸变操作；Step 2.2, randomly select a set of wavelet coefficients for distortion operation;

Step2.3、由畸变操作后的小波包分解系数还原为时域信号。Step 2.3, restore the wavelet packet decomposition coefficients after the distortion operation to the time domain signal.

上述的Step2.1中，使用以下函数实现原始振动信号的2层分解：In the above Step 2.1, the following function is used to implement the 2-layer decomposition of the original vibration signal:

其中，i＝0,...,(2j-1)，k和l为元素的顺序索引，(h,g)为一对长度为L的有限脉冲响应滤波器，设w_j,i表示第j个分解层的第i组小波系数，w_0,0是原始信号。Where i=0,...,(2j-1), k and l are the sequential indices of the elements, (h,g) is a pair of finite impulse response filters of length L, let wj _,i represent the i-th group of wavelet coefficients of the j-th decomposition layer, and _w0,0 is the original signal.

上述的Step2.2中畸变操作的畸变函数为：The distortion function of the distortion operation in Step 2.2 above is:

其中d是从预设范围中随机选取的失真系数，sign为符号函数，在数学和计算机运算中，其功能是取某个数的符号(正或负)：当x>0，sign(x)＝1；当x＝0，sign(x)＝0；当x<0，sign(x)＝-1；abs为绝对值函数，返回w值的绝对值。Where d is the distortion coefficient randomly selected from a preset range, sign is the sign function, which is used to take the sign (positive or negative) of a number in mathematics and computer operations: when x>0, sign(x)=1; when x=0, sign(x)=0; when x<0, sign(x)=-1; abs is the absolute value function, which returns the absolute value of w.

上述的Step4的具体步骤为：The specific steps of Step 4 above are:

Step4.1、输入数据经过视野域为7×7的卷积核，维度数为64，步长为2的预处理层；Step 4.1, the input data passes through a preprocessing layer with a convolution kernel of 7×7 field of view, a dimension of 64, and a step size of 2;

Step4.2、Step4.1输出数据经过卷积核为3×3，步长为2的全局最大池化层；Step4.3、Step4.2输出数据进入四个由不同层的ResNeXt残差块组成的主干网络。The output data of Step 4.2 and Step 4.1 pass through the global maximum pooling layer with a convolution kernel of 3×3 and a step size of 2; the output data of Step 4.3 and Step 4.2 enter the backbone network composed of four ResNeXt residual blocks of different layers.

上述的Step4.3中四个由不同层的ResNeXt残差块组成的主干网络的组成结构为：The structure of the backbone network composed of four ResNeXt residual blocks of different layers in the above Step 4.3 is:

一部分由3层基数为32的卷积核为1×1，3×3，1×1的残差结构构成，同时包含一条残差连接旁路，最后将通道拼接输出；One part consists of 3 layers of residual structures with a base of 32 and convolution kernels of 1×1, 3×3, and 1×1, and also includes a residual connection bypass, and finally the channels are spliced and output;

第二部分由3层基数为32的卷积核为1×1，3×3，1×1的残差结构构成，同时包含一条残差连接旁路，最后将通道拼接输出；The second part consists of 3 layers of residual structures with a base of 32 and convolution kernels of 1×1, 3×3, and 1×1, and also includes a residual connection bypass, and finally concatenates the channels for output;

第三部分由3层基数为32的卷积核为1×1，3×3，1×1的残差结构构成，同时包含一条残差连接旁路，最后将通道拼接输出；The third part consists of 3 layers of residual structures with a base of 32 and convolution kernels of 1×1, 3×3, and 1×1, and also includes a residual connection bypass, and finally concatenates the channels for output;

第四部分由3层基数为32的卷积核为1×1，3×3，1×1的残差结构构成，同时包含一条残差连接旁路，最后将通道拼接输出。The fourth part consists of 3 layers of residual structures with a base size of 32 and convolution kernels of 1×1, 3×3, and 1×1, and also includes a residual connection bypass, and finally the channels are concatenated and output.

上述的四个由不同层的ResNeXt残差块组成的主干网络残差块堆叠结构为3-4-6-3，在每部分网络结构中插入CSP模块，将输入通道在进入每部分堆叠网络时做分组，将一半的输入通道进入堆叠网络，而另一半的输入通道则进入CSP模块；然后将每部分的输出通道与CSP模块的输出通道拼接，作为每部分堆叠网络结构的总输出；最后，将第四部分的输出做汇总，进入卷积核为1×1的全局平均池化层与全连接层实现故障分类。The above four backbone network residual blocks composed of different layers of ResNeXt residual blocks have a stacking structure of 3-4-6-3. A CSP module is inserted into each part of the network structure, and the input channels are grouped when entering each part of the stacked network. Half of the input channels enter the stacked network, and the other half of the input channels enter the CSP module; then the output channel of each part is spliced with the output channel of the CSP module as the total output of each part of the stacked network structure; finally, the output of the fourth part is summarized and enters the global average pooling layer and the fully connected layer with a convolution kernel of 1×1 to realize fault classification.

上述的ResNeXt残差块激活函数采用sigmoid函数，其计算公式为：The above ResNeXt residual block activation function uses the sigmoid function, and its calculation formula is:

其中x为输入，f(x)为输出。Where x is the input and f(x) is the output.

上述的四个ResNeXt残差块网络部分中每个卷积核后和过渡层所使用的激活函数均采用Relu激活函数，其计算公式为：The activation function used in each convolution kernel and transition layer in the above four ResNeXt residual block network parts adopts the Relu activation function, and its calculation formula is:

f(x)＝max(0,x)f(x)＝max(0,x)

其中x为输入，f(x)为输出。Where x is the input and f(x) is the output.

上述的改进ResNeXt网络即CSP-ResNeXt网络的全连接层损失函数采用交叉熵损失函数，其计算公式为：The fully connected layer loss function of the above-mentioned improved ResNeXt network, namely the CSP-ResNeXt network, adopts the cross entropy loss function, and its calculation formula is:

其中p(x_i)和q(x_i)分别表示真实概率分布与预测概率分布，H(p,q)表示预测值和真实值的差距；Where p( _xi ) and q( _xi ) represent the true probability distribution and the predicted probability distribution, respectively, and H(p,q) represents the gap between the predicted value and the true value;

交叉熵损失函数搭配softmax分类器使用，在全连接层将输出的结果进行处理，使其多个分类的预测值和为1，再通过交叉熵来计算损失，其中，softmax函数计算公式为：The cross entropy loss function is used with the softmax classifier. The output results are processed in the fully connected layer so that the sum of the predicted values of multiple classifications is 1, and then the loss is calculated by cross entropy. The calculation formula of the softmax function is:

其中x_i为模型上一层的输出，作为softmax分类器的输入；输出计算结果softmax(x)可视为预测结果为真实结果的置信度。Among them, _xi is the output of the previous layer of the model, which serves as the input of the softmax classifier; the output calculation result softmax(x) can be regarded as the confidence that the predicted result is the true result.

本发明提供的一种基于数据增强和CSP-ResNeXt的风机齿轮箱故障诊断方法，本发明具有如下有益效果：The present invention provides a wind turbine gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt, which has the following beneficial effects:

1.在小波包变换的基础上发展了小波包失真，以增加错误训练样本的数量。在这里，增强样本与原始样本相似，但有不同的值。这样既可以达到类之间的平衡，又可以提高训练数据集的样本多样性。1. Wavelet packet distortion is developed based on wavelet packet transform to increase the number of erroneous training samples. Here, the enhanced samples are similar to the original samples but have different values. This can achieve a balance between classes and improve the sample diversity of the training data set.

2.CSP-ResNeXt在原有Resnet基础上引入分组卷积思想，使得网络结构在复杂度相同的情况下分类精度更高，同时在不同应用场景中由于每个残差块内采用VGG的堆叠结构归一化的思想，使得模型的泛化能力更强；在采用CSP模块后，减少了模型计算量并使得梯度组合更加丰富，避免了模型在前向梯度传播中重复学习同样的梯度值，增强了模型在训练时的收敛能力。2. CSP-ResNeXt introduces the idea of group convolution based on the original Resnet, which makes the network structure have higher classification accuracy under the same complexity. At the same time, in different application scenarios, due to the idea of VGG stacking structure normalization in each residual block, the generalization ability of the model is stronger; after adopting the CSP module, the amount of model calculation is reduced and the gradient combination is richer, which avoids the model from repeatedly learning the same gradient value in the forward gradient propagation and enhances the convergence ability of the model during training.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

下面结合附图和实施例对本发明作进一步说明：The present invention will be further described below in conjunction with the accompanying drawings and embodiments:

图1为本发明实施例整体流程示意图；FIG1 is a schematic diagram of the overall process of an embodiment of the present invention;

图2为残差结构设计图；Figure 2 is a residual structure design diagram;

图3为实施例中转化后的图形差分场特征图谱；FIG3 is a graph showing a characteristic spectrum of a differential field after conversion in an embodiment;

图4为实施例中模型训练损失函数曲线示意图；FIG4 is a schematic diagram of a model training loss function curve in an embodiment;

图5为实施例模型训练的准确率曲线示意图。FIG5 is a schematic diagram of an accuracy curve of the model training of the embodiment.

具体实施方式DETAILED DESCRIPTION

以下结合附图和实施例详细说明本发明技术方案。The technical solution of the present invention is described in detail below in conjunction with the accompanying drawings and embodiments.

如图1-2中所示，一种基于数据增强和CSP-ResNeXt的风机齿轮箱故障诊断方法，包括以下步骤：As shown in FIG1-2, a wind turbine gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt includes the following steps:

Step1中，在振动信号提取与原始数据集构建方面，至少取正常振动信号，齿轮箱轴承外圈故障振动信号、齿轮箱轴承内圈故障振动信号、断齿状态下的故障振动信号、点蚀状态下的故障振动信号以及磨损状态下的故障振动信号六种状态下的故障振动信号构成数据集，并采用独热编码的形式对数据集分类标签进行标号，方便后期采用softmax分类器进行故障诊断类型的分类。In Step 1, in terms of vibration signal extraction and original data set construction, at least six types of fault vibration signals, including normal vibration signals, gearbox bearing outer ring fault vibration signals, gearbox bearing inner ring fault vibration signals, broken tooth state fault vibration signals, pitting state fault vibration signals, and wear state fault vibration signals, are taken to form a data set, and the classification labels of the data set are numbered in the form of unique hot encoding to facilitate the subsequent classification of fault diagnosis types using a softmax classifier.

Step2.1、采用小波包变换将故障振动信号分解，分解阶数为2，得到小波包分解系数；Step 2.1, use wavelet packet transform to decompose the fault vibration signal, the decomposition order is 2, and the wavelet packet decomposition coefficient is obtained;

Step2.3、通过小波包反变换还原为时序信号，完成故障振动信号的数据增强；Step 2.3, restore the time series signal through wavelet packet inverse transformation to complete the data enhancement of the fault vibration signal;

同时，在Step2.3中，将数据输入改进ResNeXt中可用相对位置矩阵(RelativePosition Matrix,RPM)将数据转化为二维图像以利于诊断模型的训练。At the same time, in Step 2.3, the data is input into the improved ResNeXt and the available relative position matrix (RPM) is used to convert the data into a two-dimensional image to facilitate the training of the diagnostic model.

小波包变换作为一种经典的时频域分析方法，可以将一个信号分解为一组不同频段的小波系数；而小波包反变换则相反，可以通过小波包分解系数重构还原出原始信号；若按照相同的分解规则分解与重构信号，则重构信号与原始信号保持一致；本发明对一组随机选取的小波系数进行失真处理，使重构后的信号与原始样本相似但略有不同。重构信号可以作为增强样本，特别是对少数类振动信号样本。As a classic time-frequency domain analysis method, wavelet packet transform can decompose a signal into a set of wavelet coefficients in different frequency bands; on the contrary, the inverse wavelet packet transform can reconstruct and restore the original signal through the wavelet packet decomposition coefficients; if the signal is decomposed and reconstructed according to the same decomposition rule, the reconstructed signal is consistent with the original signal; the present invention performs distortion processing on a set of randomly selected wavelet coefficients so that the reconstructed signal is similar to the original sample but slightly different. The reconstructed signal can be used as an enhanced sample, especially for minority vibration signal samples.

对部分小波系数进行畸变后，进行小波包反变换得到输出信号v，其公式表示为：After distorting some wavelet coefficients, the inverse wavelet packet transform is performed to obtain the output signal v, which is expressed as follows:

其中为畸变后小波系数，v为重构小波系数，为一对长度为L的有限脉冲响应滤波器，i＝0,...,(2j-1)，k和l为元素的顺序索引。in is the distorted wavelet coefficient, v is the reconstructed wavelet coefficient, is a pair of finite impulse response filters of length L, i = 0, ..., (2j-1), k and l are the sequential indices of the elements.

在Step3中，将数据输入改进ResNeXt中可用相对位置矩阵(Relative PositionMatrix,RPM)将数据转化为二维图像以利于诊断模型的训练，具体实施步骤如下：In Step 3, the data is input into the improved ResNeXt and the relative position matrix (RPM) can be used to convert the data into a two-dimensional image to facilitate the training of the diagnostic model. The specific implementation steps are as follows:

相对位置矩阵(Relative Position Matrix,RPM)包含了原始时间序列的冗余特征，使转换后的图像中，类间和类内的相似度信息更容易被捕捉；对于一个时间序列X＝(xt，t＝1，2，…，N)，可以通过以下步骤得到RPM图：The relative position matrix (RPM) contains the redundant features of the original time series , making it easier to capture the similarity information between and within classes in the converted image. For a time series X = (xt, t = 1, 2, ..., N), the RPM graph can be obtained by the following steps:

1)针对原始时间序列，通过以下z-分值标准化的方法得到一个标准正态分布Z：1) For the original time series, a standard normal distribution Z is obtained by the following z-score standardization method:

其中μ表示X的平均值，σ表示X的标准差。Where μ represents the mean of X and σ represents the standard deviation of X.

2)2)采用分段聚合近似(PAA)方法，选择一个合适的缩减因子k，生成一个新的平滑时间序列将维度N减少到m：2)2) Using the piecewise aggregation approximation (PAA) method, select a suitable reduction factor k to generate a new smoothed time series Reduce the dimension N to m:

通过计算分段常数的平均值进行降维，可以保持原始时间序列的近似趋势，最终新的平滑时间序列的长度为m。By calculating the average value of the piecewise constants to reduce the dimension, the approximate trend of the original time series can be maintained, and the new smoothed time series is finally The length is m.

3)计算两个时间戳之间的相对位置，将预处理后的时间序列转换为二维矩阵M：3) Calculate the relative position between the two timestamps and convert the preprocessed time series Convert to a two-dimensional matrix M:

如上所示，该矩阵表征了时间序列中每两个时间戳之间的相对位置关系；其每一行和每一列都以某一个时间戳为参考，进一步表征整个序列的信息。As shown above, the matrix represents the relative position relationship between every two timestamps in the time series; each row and column is referenced to a certain timestamp, further representing the information of the entire sequence.

4)最后利用最小-最大归一化将M转换为灰度值矩阵，最终得到相对位移矩阵F：4) Finally, the minimum-maximum normalization is used to convert M into a gray value matrix, and finally the relative displacement matrix F is obtained:

最终将时间序列振动信号通过相对位移矩阵转化为灰度图，可以输入到诊断分类模型中训练。Finally, the time series vibration signal is converted into a grayscale image through the relative displacement matrix, which can be input into the diagnostic classification model for training.

对CSP-ResNeXt网络的构建中，应按照如下方法搭建：ResNeXt网络结构同时采用VGG堆叠的思想和Inception的分割-转换-合并思想，在其基本的残差块中采用分组卷积的方法，保持每个聚合的拓扑结构一致，从而增强了模型的可拓展性，同时也减轻了针对特定数据集时网络的结构设计难度；在此基础上引入CSP模块，加强了神经网络的学习能力，在一定程度上消除了计算瓶颈，降低了内存成本，在ResNeXt网络中引入CSP模块，可以减少推理计算所耗费的时间；CSP模块通过跨阶段来实现推理计算量的减少，依靠优化网络中的重复梯度信息来实现这一功能，通过从网络阶段的开始和结束集成特征映射来注重梯度的可变性，丰富和增加了梯度的多种组合。在ResNet的bottleneck层的基础上引入分组卷积的方法，将原有的1×1、3×3、1×1卷积结构共享卷积核参数改为分组共享各自组内的卷积核参数，保证计算量相差不大，增强网络的学习能力；同时为了提升推理计算的速度，引入CSP局部跨阶段方法，将bottleneck层的输入分为两部分，一部分通过改进Resnet块，另一部分直接与通过改进Resnet块的输入得到的输出进行拼接，减少冗余梯度的复用；通过分割梯度流来使梯度流通过不同网络路径传播，以及通过切换级联和转换的步骤，传播的梯度信息可以有很大的差异，以此做到减少冗余梯度的复用，残差结构设计思路如图2所示。In the construction of the CSP-ResNeXt network, the following method should be used: the ResNeXt network structure adopts the VGG stacking idea and the Inception split-conversion-merge idea at the same time, and adopts the group convolution method in its basic residual block to keep the topological structure of each aggregation consistent, thereby enhancing the scalability of the model and reducing the difficulty of network structure design for specific data sets; on this basis, the CSP module is introduced to enhance the learning ability of the neural network, eliminate the computing bottleneck to a certain extent, and reduce the memory cost. The introduction of the CSP module in the ResNeXt network can reduce the time spent on reasoning calculations; the CSP module reduces the amount of reasoning calculations by crossing stages, and achieves this function by optimizing the repeated gradient information in the network. It focuses on the variability of gradients by integrating feature maps from the beginning and end of the network stage, enriching and increasing the variety of gradient combinations. Based on the bottleneck layer of ResNet, the group convolution method is introduced, and the original 1×1, 3×3, and 1×1 convolution structures that share convolution kernel parameters are changed to group sharing of convolution kernel parameters within each group to ensure that the amount of calculation is not much different and enhance the learning ability of the network; at the same time, in order to improve the speed of reasoning calculation, the CSP local cross-stage method is introduced to divide the input of the bottleneck layer into two parts, one part is obtained by improving the Resnet block, and the other part is directly spliced with the output obtained by improving the input of the Resnet block to reduce the reuse of redundant gradients; by splitting the gradient flow to make the gradient flow propagate through different network paths, and by switching the cascade and conversion steps, the propagated gradient information can be very different, so as to reduce the reuse of redundant gradients. The residual structure design idea is shown in Figure 2.

上述的Step4的具体步骤为：The specific steps of Step 4 above are:

四层卷积进行特征提取，通过四层层基数及卷积核相同的卷积层进行特征提取，效果更佳。Four-layer convolution is used for feature extraction. Feature extraction is performed through four convolution layers with the same layer cardinality and convolution kernel, which has a better effect.

Step4中，将训练集输入搭建好的CSP-ResNeXt网络进行训练；设置超参数批大小为128，梯度下降优化算法采用adam随机优化算法：学习率α设置为0.001，一阶矩估计衰减系数β₁设置为0.9，二阶矩估计衰减系数β₂设置为0.999，平滑项ε设置为1e^-8，具体迭代公式为：In Step 4, the training set is input into the built CSP-ResNeXt network for training; the hyperparameter batch size is set to 128, and the gradient descent optimization algorithm adopts the adam random optimization algorithm: the learning rate α is set to 0.001, the first-order moment estimation attenuation coefficient _β1 is set to 0.9, the second-order moment estimation attenuation coefficient _β2 is set to 0.999, and the smoothing term ε is set to 1e ^-8 . The specific iteration formula is:

m_t＝β₁m_t-1+(1-β₁)g_t m _t =β ₁ m _t-1 +(1-β ₁ )g _t

其中，t为训练次数；m_t和v_t分别是对梯度的一阶矩估计和二阶矩估计，可以看作是对期望E[g_t]、E[g_t ²]的近似；g_t为对应每个参数θ计算出的梯度值；和是对m_t和v_t的矫正，这样可以近似为对期望的无偏估计；为防止分母为0，设置平滑项ε。Where t is the number of training times; m _t and v _t are the first-order moment estimate and second-order moment estimate of the gradient, respectively, which can be regarded as the approximation of the expected E[g _t ] and E[g _t ² ]; g _t is the gradient value calculated for each parameter θ; and is a correction to m _t and v _t , which can be approximated as an unbiased estimate of the expectation; to prevent the denominator from being 0, a smoothing term ε is set.

其中x为输入，f(x)为输出。Where x is the input and f(x) is the output.

f(x)＝max(0,x)f(x)＝max(0,x)

其中x为输入，f(x)为输出。Where x is the input and f(x) is the output.

实施例：Example:

为验证本发明的可行性，采用东南大学公开齿轮箱数据集实现本发明的方法。In order to verify the feasibility of the present invention, the public gearbox data set of Southeast University is used to implement the method of the present invention.

本实验算法处理平台设备采用高性能计算机模型训练与测试的软件环境为Windows11，选择Python3.10编程语言，GPU加速库为CUDA11.8，深度学习框架为Pytorch2.0.1。The experimental algorithm processing platform equipment uses Windows 11 as the software environment for high-performance computer model training and testing, Python 3.10 programming language, CUDA 11.8 GPU acceleration library, and Pytorch 2.0.1 deep learning framework.

实验数据集共分为5类，分别为健康、断齿、齿轮表面磨损、齿轮脚裂纹、齿轮根部裂纹，每类样本数量为500，总的样本数量为2500,数据集按照训练集与测试集8：2的比例进行随机划分后，训练集样本数量为2000，验证集样本数量为500。The experimental data set is divided into five categories, namely healthy, broken teeth, gear surface wear, gear foot cracks, and gear root cracks. The number of samples in each category is 500, and the total number of samples is 2500. After the data set is randomly divided into a training set and a test set at a ratio of 8:2, the number of samples in the training set is 2000 and the number of samples in the validation set is 500.

模型网络代码为：The model network code is:

按照本发明所述方法转化后的图形差分场特征图谱示例如下图3所示。An example of a graphic differential field characteristic spectrum converted according to the method of the present invention is shown in FIG3 below.

模型训练方面，设置模型迭代次数为500次，批量大小设置为8，线程设置为2，其模型损失函数和准确率如下图4和5中所示，可以看出，当模型迭代500次后，模型基本收敛，准确率可以达到95％以上。In terms of model training, the number of model iterations is set to 500, the batch size is set to 8, and the thread is set to 2. The model loss function and accuracy are shown in Figures 4 and 5 below. It can be seen that after 500 iterations of the model, the model basically converges and the accuracy can reach more than 95%.

Claims

1. A fan gear box fault diagnosis method based on data enhancement and CSP-ResNeXt is characterized by comprising the following steps:

step1, collecting fault vibration signal data collected by different fault types of the gear box by using an acceleration sensor arranged on the gear box;

step2, expanding sample data by adopting wavelet packet distortion;

step3, converting the fault vibration signal into a gray level map through a relative position matrix;

step4, inputting the data after sample expansion into an improved ResNeXt network, and finally obtaining an intelligent fault diagnosis model of the fan gear box, and inputting vibration signals acquired in real time by using the model to perform fault diagnosis and identification.

2. The fan gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt according to claim 1, wherein the specific steps of Step2 are as follows:

step2.1, performing 2-layer decomposition on the original vibration signal by adopting wavelet packet transformation;

step2.2, randomly selecting a group of wavelet coefficients to carry out distortion operation;

step2.3, reducing the wavelet packet decomposition coefficient after distortion operation into a time domain signal.

3. The fan gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt according to claim 2, wherein in step2.1, the following function is used to implement 2-layer decomposition of the original vibration signal:

where i=0, (2 j-1), k and L are sequential indices of elements, (h, g) is a pair of finite impulse response filters of length L, let w be _j,i Ith group of wavelet coefficients representing jth decomposition level, w _0,0 Is the original signal.

4. The fan gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt according to claim 2, wherein the distortion function of the distortion operation in the step2.2 is:

wherein d is a distortion coefficient randomly selected from a preset range, sign is a sign function, and in mathematical and computer operations, the function is to take a certain number of signs (positive or negative): when x >0, sign (x) =1; when x=0, sign (x) =0; when x <0, sign (x) = -1; abs is an absolute function, returning the absolute value of the w value.

5. The fan gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt according to claim 1, wherein the Step4 specifically comprises the following steps:

step4.1, the input data passes through a convolution kernel with a field of view of 7×7, a dimension number of 64, and a step length of 2;

the output data of Step4.2 and Step4.1 pass through a global maximum pooling layer with convolution kernel of 3 multiplied by 3 and step length of 2;

the step4.3 and step4.2 output data enter a backbone network consisting of four different layers of ResNeXt residual blocks.

6. The fan gear box fault diagnosis method based on data enhancement and CSP-ResNeXt according to claim 5, wherein the four main networks consisting of ResNeXt residual blocks of different layers in the step4.3 have the following composition structure:

one part of the three-layer joint structure consists of 3 layers of residual structures with the base number of 32 and convolution kernels of 1 multiplied by 1,3 multiplied by 3 and 1 multiplied by 1, and comprises a residual connection bypass, and finally, the channels are spliced and output;

the second part is composed of residual structures with 3 layers of convolution kernels of 1 multiplied by 1,3 multiplied by 3 and 1 multiplied by 1, the number of the base numbers of the layers is 32, and the residual structures comprise a residual connection bypass, and finally channels are spliced and output;

the third part is composed of residual structures with 3 layers of convolution kernels of 1 multiplied by 1,3 multiplied by 3 and 1 multiplied by 1, the number of the base numbers of the layers is 32, and the residual structures comprise a residual connection bypass, and finally channels are spliced and output;

the fourth part is composed of a residual structure with 3 layers of convolution kernels of 1×1,3×3 and 1×1 and with a base number of 32, and comprises a residual connection bypass, and finally channels are spliced and output.

7. The fan gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt according to claim 6, wherein the four main network residual blocks formed by ResNeXt residual blocks of different layers are stacked in a structure of 3-4-6-3, CSP modules are inserted into each part of the network structure, input channels are grouped when entering each part of the stacked network, half of the input channels enter the stacked network, and the other half of the input channels enter the CSP modules; then splicing the output channel of each part with the output channel of the CSP module to be used as the total output of each part of stacked network structure; and finally, summarizing the output of the fourth part, and entering a global average pooling layer with the convolution kernel of 1 multiplied by 1 and a full connection layer to realize fault classification.

8. The fan gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt according to claim 7, wherein the ResNeXt residual error block activation function adopts a sigmoid function, and the calculation formula is as follows:

where x is the input and f (x) is the output.

9. The fan gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt according to claim 8, wherein the activation function used by each convolution kernel and transition layer in the four ResNeXt residual block network parts adopts a Relu activation function, and the calculation formula is as follows:

f(x)＝max(0,x)

where x is the input and f (x) is the output.

10. The fan gearbox fault diagnosis method based on data enhancement and CSP-ResNeXt according to claim 9, wherein the improved ResNeXt network, namely the full connection layer loss function of the CSP-ResNeXt network, adopts a cross entropy loss function, and the calculation formula is as follows:

wherein p (x) _i ) And q (x) _i ) Respectively representing a true probability distribution and a predicted probability distribution, and H (p, q) represents the difference between the predicted value and the true value;

the cross entropy loss function is matched with the softmax classifier to be used, the output result is processed at the full connection layer, the sum of the predictive values of a plurality of classifications is 1, and then the loss is calculated through the cross entropy, wherein the softmax function has the calculation formula:

wherein x is _i The output of the upper layer of the model is used as the input of a softmax classifier; the output calculation result softmax (x) may be regarded as the confidence that the predicted result is a true result.