CN116007937A - Intelligent fault diagnosis method and device for mechanical equipment transmission part - Google Patents

Intelligent fault diagnosis method and device for mechanical equipment transmission part Download PDF

Info

Publication number
CN116007937A
CN116007937A CN202211575101.2A CN202211575101A CN116007937A CN 116007937 A CN116007937 A CN 116007937A CN 202211575101 A CN202211575101 A CN 202211575101A CN 116007937 A CN116007937 A CN 116007937A
Authority
CN
China
Prior art keywords
self
layer
network
fault diagnosis
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202211575101.2A
Other languages
Chinese (zh)
Other versions
CN116007937B (en
Inventor
江星星
杨立冬
王前
石娟娟
杜贵府
王俊
丁传仓
沈长青
黄伟国
朱忠奎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou University
Original Assignee
Suzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou University filed Critical Suzhou University
Priority to CN202211575101.2A priority Critical patent/CN116007937B/en
Publication of CN116007937A publication Critical patent/CN116007937A/en
Application granted granted Critical
Publication of CN116007937B publication Critical patent/CN116007937B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Testing Of Devices, Machine Parts, Or Other Structures Thereof (AREA)

Abstract

The invention relates to the technical field of intelligent operation and maintenance of mechanical equipment, in particular to an intelligent fault diagnosis method and device for a transmission part of mechanical equipment. According to the invention, vibration signals under different working conditions are collected as training samples, the problem of performance degradation of a model caused by working condition change is avoided, and a self-supervision pre-training network is established, so that an easily-obtained label-free sample training network is fully utilized, more effective characteristics can be extracted from the network, and dependence on label samples is reduced; in addition, through the encoder model and the decoder model based on the self-attention mechanism, more comprehensive global features are extracted, redundant features are restrained, effective features are enhanced, pre-enhancement of input data through preprocessing is not needed, and diagnosis efficiency is improved.

Description

Intelligent fault diagnosis method and device for mechanical equipment transmission part
Technical Field
The invention relates to the technical field of intelligent operation and maintenance of mechanical equipment, in particular to an intelligent fault diagnosis method and device for a transmission part of mechanical equipment.
Background
The mechanical equipment is the most main carrier in industrial production, and plays an important role in ensuring the quality of products, improving the production efficiency and creating economic benefits. The transmission part of the mechanical equipment is a core part of the mechanical equipment and plays key roles of bearing load, transmitting power and the like. However, mechanical transmission components often operate in harsh operating environments and require prolonged uninterrupted service, with serious consequences upon failure. The device maintenance time can be reasonably planned by timely monitoring and diagnosing the transmission parts of the mechanical device, so that the economic benefit is increased, and catastrophic results are avoided. Therefore, the research of the intelligent diagnosis method for the mechanical equipment transmission part is significant for improving the stability and reliability of the mechanical equipment transmission part.
In recent years, with the development of industrial big data and artificial intelligence technology, deep learning is widely applied to intelligent fault diagnosis methods of mechanical equipment transmission parts. By constructing a deep neural network, a deep confidence network, a cyclic neural network, a convolutional neural network and other multi-layer deep structures, the intelligent fault diagnosis model can approximately map complex functional relations between signals and corresponding fault types, and therefore accurate diagnosis results are obtained. Compared with the traditional method, the intelligent fault diagnosis method based on deep learning has the advantages of no need of data preprocessing, automatic feature extraction, capability of processing big data and the like, so that the intelligent fault diagnosis method based on deep learning is more and more widely studied.
Although the intelligent diagnosis method based on deep learning has the advantages, the intelligent diagnosis method based on deep learning has some defects which prevent the intelligent diagnosis method from being applied to engineering, on one hand, the intelligent diagnosis method based on deep learning needs to use a large number of label samples in the process of training a network model, but is unrealistic in actual engineering, because expert knowledge and engineering experience are needed for judging the type of faults, a large amount of manpower and material resources are needed for labeling the samples, and the network model needs to be trained from the beginning when facing a new diagnosis task or a new data set; on the other hand, most of the existing methods are insufficient in capability of extracting high-level features, the features are pre-enhanced by means of pre-processing, which reduces the efficiency of the whole diagnosis process and the diagnosis performance is reduced when it processes the raw vibration data. Second, existing deep learning models typically extract features from the local but cannot determine whether they are redundant features or fault type related valid features. Finally, the existing method generally sets training data and test data under the same working condition, and when the working condition is changed, the performance of the model is inevitably reduced, so that the practicability of the method is reduced. These shortcomings will lead to the fact that the existing intelligent fault diagnosis method based on self-supervision learning is not comprehensive in learned characteristics, and the generalization capability and the practicability of the diagnosis model are poor.
Disclosure of Invention
Therefore, the invention aims to solve the technical problems of insufficient comprehensive learned characteristics, poor generalization performance and high dependence on label samples in the prior art.
In order to solve the technical problems, the invention provides an intelligent fault diagnosis method for a transmission part of mechanical equipment, which comprises the following steps:
collecting vibration signals of a mechanical equipment transmission part under different working conditions, intercepting the vibration signals for a plurality of times according to preset data point lengths to obtain a plurality of sample data, dividing the plurality of sample data according to preset proportions, calibrating one part of the sample data according to fault types to be used as a labeled data set, and the other part of the sample data is used as a non-labeled data set;
establishing a self-supervision pre-training network by constructing a random mask module, an encoder model and a decoder model based on a self-attention mechanism, masking and reconstructing samples in the unlabeled dataset by using the self-supervision pre-training network, wherein the reconstruction comprises an encoding process and a decoding process, calculating reconstruction loss between signals before and after reconstruction, optimizing the self-supervision pre-training network by taking minimized reconstruction loss as an objective function, and updating network parameters of the self-supervision pre-training network;
establishing a fine tuning network by constructing the encoder model and the classifier model, transferring parameters corresponding to the encoder model in the optimized self-supervision pre-training network to the encoder model of the fine tuning network, classifying samples in the labeled data set by using the fine tuning network, calculating classification loss, optimizing the fine tuning network by taking the minimized classification loss as an objective function, and updating network parameters of the fine tuning network to obtain an intelligent fault diagnosis model of the mechanical equipment transmission part;
and inputting the vibration signal to be tested into the intelligent fault diagnosis model of the mechanical equipment transmission part to obtain the fault type of the mechanical equipment transmission part.
Preferably, said masking samples in said unlabeled dataset with said self-supervised pretraining network includes:
for input samples [ X ] 1 ;X 2 ;...;X n ]Performing random masking operation to obtain a mask output matrix X m =[X 1 ;X 2 ;...;X n ][c 1 ;c 2 ;...;c n ] T Wherein n is the preset data point length, x 1 To x n Representing n row vectors of the input matrix c 1 To c n Representing randomly generated mask vectors, half of which have values of 1 and the rest of which are 0, T representing the transpose;
removing the row vector with 0 in the mask output matrix to obtain a final mask output matrix X M =X m -X 0
Preferably, the reconstructed encoding process includes:
adding one row of category token vectors for aggregating the classification information to the final mask output matrix, and superposing position codes for indicating the position information to obtain a first coding characteristic
Figure BDA0003989066530000031
wherein ,
Figure BDA0003989066530000032
representing a category token vector, ">
Figure BDA0003989066530000033
Representing a merge matrix->
Figure BDA0003989066530000034
Representing a position code;
the first coding feature is subjected to layer normalization processing to obtain a second coding feature
Figure BDA0003989066530000035
Wherein, gamma is a scaling parameter, beta is a translation parameter, and a standardized value +.>
Figure BDA0003989066530000036
Average of first coding feature in last dimension
Figure BDA0003989066530000037
Variance->
Figure BDA0003989066530000038
N dim Dimension of the last dimension of the first code feature;
processing the second coding feature through multi-stage stacked multi-head self-attention layers and multi-layer perceptron layers to obtain a target coding feature
Figure BDA0003989066530000039
The processing procedure of the d-level multi-head self-attention layer and the d-level multi-layer perceptron layer is as follows:
calculating the self-attention value of the d-1 level multi-layer sensor layer output
Figure BDA00039890665300000310
The output of the d-1 level multi-layer sensor layer is added with>
Figure BDA00039890665300000311
Residual connection to obtain the output of the d-level multi-head self-attention layer>
Figure BDA0003989066530000041
Wherein depth is the number of levels of the multi-headed self-focusing layer and multi-layered sensor layer stack, when d=1, the +.>
Figure BDA0003989066530000042
Self-attention value for the second coding feature, is->
Figure BDA0003989066530000043
For the first encoding feature;
outputting the d-level multi-head self-attention layer
Figure BDA0003989066530000044
And first linear layer weight +.>
Figure BDA0003989066530000045
Product of (c) and first linear layer deviation b 1 And, through GeLU excitationAfter the living function processing, the second linear layer weight is +.>
Figure BDA0003989066530000046
Multiplied by the second linear layer deviation b 2 Adding, finally, with the output of the d-level multi-head self-attention layer->
Figure BDA0003989066530000047
Residual connection to obtain the output +.>
Figure BDA0003989066530000048
Preferably, the decoding process of the reconstruction includes:
restoring the target coding feature to the vector y removed at the random masking block d =y+X 0 And the third linear layer is used for reducing the dimension of the first linear layer to obtain a first decoding characteristic y d ′=y d *w d +b d, wherein ,
Figure BDA0003989066530000049
is a third linear layer weight matrix, b d Is a third linear layer bias;
and processing the first decoding characteristic through the multi-stage stacked multi-head self-attention layer and the multi-layer perceptron layer, and recovering dimension through a fourth linear layer to obtain a reconstruction characteristic.
Preferably, the reconstruction loss is:
Figure BDA00039890665300000410
wherein ,nB Representing the size of the batch and,
Figure BDA00039890665300000411
representing the reconstruction feature, x i Representing the mask output matrix.
Preferably, the optimization method of the objective function is a back propagation and gradient descent method.
Preferably, the classifier model is a Softmax classifier, probability values of each fault belonging to different categories are output through the Softmax classifier, and a calculation formula of the corresponding fault probability values is as follows:
Figure BDA00039890665300000412
wherein ,xi ' represents the ith input feature, y i Output probability value, n, representing corresponding class s Indicating the total number of fault categories.
Preferably, the classification loss is:
Figure BDA0003989066530000051
wherein ,nb Indicating batch size, n s ' total number of categories representing health status, 1 {. Cndot. } represents a function at y j Return 1 when k, otherwise return 0, y j A real label representing a sample is provided,
Figure BDA0003989066530000052
representing the predicted probability of all health states.
The invention also provides an intelligent fault diagnosis device for the mechanical equipment transmission part, which comprises the following components:
the data set construction module is used for collecting vibration signals of the mechanical equipment transmission part under different working conditions, intercepting the vibration signals for a plurality of times according to preset data point lengths to obtain a plurality of sample data, dividing the plurality of sample data according to preset proportions, calibrating one part of the sample data according to fault types to be used as a labeled data set, and the other part of the sample data is used as a non-labeled data set;
the self-supervision pre-training network construction module is used for constructing a self-supervision pre-training network by constructing a random mask module, a self-attention mechanism-based encoder model and a self-attention mechanism-based decoder model, masking and reconstructing samples in the unlabeled dataset by utilizing the self-supervision pre-training network, wherein the reconstruction comprises an encoding process and a decoding process, calculating reconstruction loss between signals before and after reconstruction, optimizing the self-supervision pre-training network by taking the minimized reconstruction loss as an objective function, and updating network parameters of the self-supervision pre-training network;
the prediction model construction module is used for building a fine tuning network by constructing the encoder model and the classifier model, transferring parameters corresponding to the encoder model in the optimized self-supervision pre-training network to the encoder model of the fine tuning network, classifying samples in the labeled data set by using the fine tuning network, calculating classification loss, optimizing the fine tuning network by taking the minimized classification loss as an objective function, and updating network parameters of the fine tuning network to obtain an intelligent fault diagnosis model of the mechanical equipment transmission part;
and the fault diagnosis module is used for inputting the vibration signal to be tested into the intelligent fault diagnosis model of the mechanical equipment transmission part to obtain the fault type of the mechanical equipment transmission part.
The invention also provides a computer readable storage medium, wherein the computer readable storage medium is stored with a computer program, and the computer program realizes the steps of the intelligent fault diagnosis method for the mechanical equipment transmission part when being executed by a processor.
Compared with the prior art, the technical scheme of the invention has the following advantages:
according to the intelligent fault diagnosis method for the mechanical equipment transmission part, vibration signals under different working conditions are collected to serve as training samples, the problem that the performance of a model is reduced due to the change of the working conditions is avoided, a self-supervision pre-training network is built, an easily-obtained label-free sample training network is fully utilized, the network can extract more effective characteristics, and dependence on label samples is reduced; in addition, through the encoder model and the decoder model based on the self-attention mechanism, more comprehensive global features are extracted, redundant features are restrained, effective features are enhanced, pre-enhancement of input data through preprocessing is not needed, and diagnosis efficiency is improved. The intelligent fault diagnosis method for the mechanical equipment transmission part can complete the diagnosis task of the intelligent fault diagnosis model under the new working condition, has good generalization performance, is suitable for identifying the health state of the mechanical equipment transmission part on site in real time, provides a reliable and convenient tool for the intelligent diagnosis method based on deep self-supervision learning, and has important field significance and wide application prospect.
Drawings
In order that the invention may be more readily understood, a more particular description of the invention will be rendered by reference to specific embodiments thereof which are illustrated in the appended drawings, in which:
FIG. 1 is a flow chart of an intelligent fault diagnosis method for a mechanical equipment transmission part;
FIG. 2 is a schematic diagram of a self-supervising pre-training network of the present invention;
FIG. 3 is a schematic diagram of a fine-tuning network of the present invention;
FIG. 4 is a schematic diagram of an encoder model in a self-supervising pre-training network and a fine tuning network according to the present invention;
fig. 5 is a block diagram of a mechanical device transmission component intelligent fault diagnosis device according to an embodiment of the present invention.
Detailed Description
The invention provides an intelligent fault diagnosis method and device for a mechanical equipment transmission part and a computer storage medium, the learned characteristics are more comprehensive, and the generalization capability and practicability of a diagnosis model are improved.
In order to better understand the aspects of the present invention, the present invention will be described in further detail with reference to the accompanying drawings and detailed description. It will be apparent that the described embodiments are only some, but not all, embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Referring to fig. 1, fig. 1 is a flowchart illustrating an implementation method of intelligent fault diagnosis for a transmission component of a mechanical device according to the present invention; the specific operation steps are as follows:
s101, collecting vibration signals of a mechanical equipment transmission part under different working conditions, intercepting the vibration signals for a plurality of times according to preset data point lengths to obtain a plurality of sample data, dividing the plurality of sample data according to preset proportions, calibrating one part of the sample data according to fault types to obtain a labeled data set, and using the other part of the sample data as a non-labeled data set;
the method comprises the steps of collecting vibration signals of a mechanical equipment transmission part under various working conditions by using a sensor, intercepting the vibration signals according to a certain data point length to obtain a large number of sample sets X, and selecting samples under one working condition as test data sets
Figure BDA0003989066530000071
The samples under other working conditions are training data sets, samples with known fault types in the training data sets are calibrated according to fault types, category labels Y are set, and the training data sets are divided into label-free data sets +.>
Figure BDA0003989066530000072
There is tag dataset->
Figure BDA0003989066530000073
wherein nT ,n U ,n L Representing the number of samples in the test dataset, the unlabeled dataset and the labeled dataset, respectively. Wherein the data sets are all derived from a laboratory bench.
As shown in fig. 2:
s102, establishing a self-supervision pre-training network by constructing a random mask module, an encoder model and a decoder model based on a self-attention mechanism, masking and reconstructing samples in the unlabeled dataset by using the self-supervision pre-training network, wherein the reconstruction comprises an encoding process and a decoding process, calculating reconstruction loss between signals before and after reconstruction, optimizing the self-supervision pre-training network by taking the minimized reconstruction loss as an objective function, and updating network parameters of the self-supervision pre-training network;
the random masking module masks and rejects the input label-free signal, the encoder model extracts the representation from the reserved original signal, the decoder model reconstructs the signal according to the extracted representation, and the encoder model and the decoder model are both based on a self-attention mechanism, so that the problems of incomplete characteristics and lack of pertinence extracted by the model can be effectively avoided.
The encoder model includes a multi-headed self-attention layer, a layer normalization layer, a multi-layered perceptron layer, and a residual connection.
The decoder model includes a linear layer, a multi-headed self-attention layer, a layer normalization layer, a multi-layered perceptron layer, and a residual connection.
As shown in fig. 3:
s103, establishing a fine tuning network by constructing the encoder model and the classifier model, transferring parameters corresponding to the encoder model in the optimized self-supervision pre-training network to the encoder model of the fine tuning network, classifying samples in the labeled data set by using the fine tuning network, calculating classification loss, optimizing the fine tuning network by taking minimized classification loss as an objective function, and updating network parameters of the fine tuning network to obtain an intelligent fault diagnosis model of the mechanical equipment transmission part;
the building of the fine tuning network comprises an encoder model and a classifier model, wherein the encoder model is consistent with the encoder model in the pre-training network, the repetition is omitted, the corresponding parameters of the encoder model in the pre-training network are migrated, the characteristic extraction capacity is inherited, and then a small amount of tagged data is utilized to train the network to adapt to the fault diagnosis task.
The optimization method of the objective function is a back propagation and gradient descent method.
The classifier model is a Softmax classifier, probability values of each fault belonging to different categories are output through the Softmax classifier, and a corresponding calculation formula of the fault probability values is as follows:
Figure BDA0003989066530000081
wherein ,xi ' represents the ith input feature, y i Output probability value, n, representing corresponding class s Indicating the total number of fault categories.
The classification loss is:
Figure BDA0003989066530000091
wherein ,nb Indicating batch size, n s ' total number of categories representing health status, 1 {. Cndot. } represents a function at y j Return 1 when k, otherwise return 0, y j A real label representing a sample is provided,
Figure BDA0003989066530000092
representing the predicted probability of all health states.
S104, inputting the vibration signal to be tested into the intelligent fault diagnosis model of the mechanical equipment transmission part to obtain the fault type of the mechanical equipment transmission part.
According to the intelligent fault diagnosis method for the mechanical equipment transmission part, vibration signals under different working conditions are collected to serve as training samples, the problem that the performance of a model is reduced due to the change of the working conditions is avoided, a self-supervision pre-training network is built, an easily-obtained label-free sample training network is fully utilized, the network can extract more effective characteristics, and dependence on label samples is reduced; in addition, through the encoder model and the decoder model based on the self-attention mechanism, more comprehensive global features are extracted, redundant features are restrained, effective features are enhanced, pre-enhancement of input data through preprocessing is not needed, and diagnosis efficiency is improved. The intelligent fault diagnosis method for the mechanical equipment transmission part can complete the diagnosis task of the intelligent fault diagnosis model under the new working condition, has good generalization performance, is suitable for identifying the health state of the mechanical equipment transmission part on site in real time, provides a reliable and convenient tool for the intelligent diagnosis method based on deep self-supervision learning, and has important field significance and wide application prospect.
Based on the above embodiments, the present embodiment further describes in detail step S102:
the masking samples in the unlabeled dataset with the self-supervised pretraining network includes:
for input samples [ X ] 1 ;X 2 ;...;X n ]Performing random masking operation to obtain a mask output matrix X m =[X 1 ;X 2 ;...;X n ][c 1 ;c 2 ;...;c n ] T Wherein n is the preset data point length, x 1 To x n Representing n row vectors of the input matrix c 1 To c n Representing randomly generated mask vectors, half of which have values of 1 and the rest of which are 0, T representing the transpose;
converting the row vector with 0 in the mask output matrix into a trainable parameter and temporarily removing to obtain a final mask output matrix X M =X m -X 0
Mask matrix for final output
Figure BDA0003989066530000101
The input is to the encoder model for encoding characterization.
The reconstructing of the samples in the unlabeled dataset using the self-supervised pretraining network specifically includes:
as shown in fig. 4, the reconstructed encoding process includes:
adding one row of category token vectors for aggregating the classification information to the final mask output matrix, and superposing position codes for indicating the position information to obtain a first coding characteristic
Figure BDA0003989066530000102
wherein ,
Figure BDA0003989066530000103
representing a category token vector, ">
Figure BDA0003989066530000104
Representing a merge matrix->
Figure BDA0003989066530000105
Representing a position code;
and carrying out layer standardization processing on the first coding feature to obtain a second coding feature X, wherein the mathematical expression is as follows:
Figure BDA0003989066530000106
Figure BDA0003989066530000107
Figure BDA0003989066530000108
Figure BDA0003989066530000109
wherein μ is the average of the input in the last dimension, σ 2 As a function of the variance of the values,
Figure BDA00039890665300001010
for normalized values, ε is a smoothing factor, the output is infinity when the variance is prevented from being 0, and the final batch normalized output is introduced with a scaling parameter γ and a shifting parameter β that are updated with a back-propagation algorithm during network training to further improve the numerical output stability, N dim Dimension of the last dimension of the first code feature;
processing the second coding feature through multi-stage stacked multi-head self-attention layers and multi-layer perceptron layers to obtain a target coding feature
Figure BDA00039890665300001011
Wherein, the d-level multi-head self-attention layer and the d-level multi-head self-attention layerThe processing procedure of the d-level multilayer sensor layer is as follows:
calculating the self-attention value of the d-1 level multi-layer sensor layer output
Figure BDA0003989066530000111
The output of the d-1 level multi-layer sensor layer is added with>
Figure BDA0003989066530000112
Residual connection to prevent gradient extinction and obtain output of d-level multi-head self-attention layer
Figure BDA00039890665300001118
d=1, 2, 3.., depth, where depth is the number of levels of the multi-headed self-attention layer and multi-layered sensor layer stack, when d=1, the _on_is>
Figure BDA0003989066530000113
Self-attention value for the second coding feature, is->
Figure BDA0003989066530000114
For the first encoding feature;
outputting the d-level multi-head self-attention layer
Figure BDA0003989066530000115
And first linear layer weight +.>
Figure BDA0003989066530000116
Product of (c) and first linear layer deviation b 1 After GeLU activation function processing, and the second linear layer weight +.>
Figure BDA0003989066530000117
Multiplied by the second linear layer deviation b 2 Adding, finally, with the output of the d-level multi-head self-attention layer->
Figure BDA0003989066530000118
Residual connection is carried out to obtain the output of the d-level multi-layer perceptron layer so as to realize the multiplexingHybrid nonlinear mapping
Figure BDA0003989066530000119
d=1,2,3,...,depth。
The mathematical expression for calculating the multi-headed self-attention value is:
Q=X·W q ,K=X·W k ,V=X·W v
Figure BDA00039890665300001110
head i =Attention(Q·W i Q ,K·W i K ,V·W i V )
y MSA ′=Concat(head 1 ,...head i )W O
wherein ,
Figure BDA00039890665300001111
representing a query matrix for subsequent matching with the key matrix,/->
Figure BDA00039890665300001112
Representing a key matrix for being matched by the query matrix, < >>
Figure BDA00039890665300001113
Representing a matrix of values representing the input +.>
Figure BDA00039890665300001114
Information extracted from the Chinese herb, jersey>
Figure BDA00039890665300001115
and />
Figure BDA00039890665300001116
Respectively, a respective leachable transformation matrix, < >>
Figure BDA00039890665300001117
For scaling factorThe sub-matching process calculates the product of the query matrix and the key matrix, divides the scaling factor and passes through a Softmax function, the calculation result is the correlation of the query matrix and the key matrix, and the calculated result is then used as the weight of the value matrix to be multiplied by the value matrix; acquisition from different feature subspaces through multiple sets of different transformation matrices
Information, combining multiple groups of self-attention calculation results to obtain a multi-head self-attention value y MSA
The decoding process of the reconstruction includes:
restoring the target coding feature to the vector y removed at the random masking block d =y+X 0
Figure BDA0003989066530000121
And the third linear layer is used for reducing the dimension to obtain a first decoding characteristic y d ′=y d *w d +b d ,/>
Figure BDA0003989066530000122
wherein ,
Figure BDA0003989066530000123
is a third linear layer weight matrix, b d Is a third linear layer bias;
and processing the first decoding characteristic through the multi-stage stacked multi-head self-attention layer and the multi-layer perceptron layer, and recovering dimension through a fourth linear layer to obtain a reconstruction characteristic.
The reconstruction loss is as follows:
Figure BDA0003989066530000124
wherein ,nB Representing the size of the batch and,
Figure BDA0003989066530000125
representing the reconstruction feature, x i Representing the mask output matrix.
Based on the above examples, the present example verifies the effectiveness of the present invention in one specific experiment:
the data used in the experiment come from a bearing fault simulation experiment table of Suzhou university, and the bearing fault data set has seven health states, including 0.2mm fault of the inner ring of the bearing, 0.2mm fault of the rolling body, 0.2mm fault of the outer ring and compound fault types (including compound fault of the inner ring and the rolling body, compound fault of the inner ring and the outer ring, compound fault of the outer ring and the rolling body, compound fault of the inner ring, the outer ring and the rolling body). The data for each health state were collected at 800rpm and four different loads (0 KN,0.8KN,1.6KN,2.5 KN). Each state of health for each condition contained 200 samples, each sample containing 1024 data points. The data under the load of 2.5KN is selected as a test data set, the data under the rest loads are training test sets, the number of partial sample labels under the load of 1.6KN in the training test sets is given, the number is 5, 10, 15 and 20 of each health state, and the rest are label-free samples. The details of the samples used in the experiments are shown in table 1.
Table 1 number of samples for each health status
Figure BDA0003989066530000126
To verify the effectiveness of the proposed invention, the experimental case performs diagnostic tasks with four different few label samples, the number of label samples being 5, 10, 15 and 20 for each health state, respectively, 2.5%,5%,7.5% and 10% of the total number of samples. In order to verify the effectiveness of the intelligent fault diagnosis method and system for the mechanical equipment transmission part provided by the invention, five other advanced intelligent diagnosis methods based on self-supervision learning are used as comparison, and the intelligent fault diagnosis method comprises a self-attention encoder (SAE), a convolution self-encoder (CAE), a context self-encoder (CE), a Denoising Convolutional Neural Network (DCNN) and a denoising self-encoder (DAE). The parameters during training of each network model are set as follows: the initial learning rate was 0.001, the batch size was 128, and the number of iterations was 50.
Compared with other self-supervision learning intelligent diagnosis methods, the classification accuracy of the method has the best effect on diagnosis tasks under the condition of four different few-label samples, and the superiority of the method is shown, and the comparison result is shown in table 2.
Table 2 accuracy of each method at each diagnostic task
Figure BDA0003989066530000131
Aiming at the problems that the existing intelligent fault diagnosis method seriously depends on a label sample and the characteristics extracted by a diagnosis network are not comprehensive and lack of pertinence, the invention takes a mechanical equipment transmission part as a research object, extracts supervision information from a label-free signal by a self-supervision learning method, enhances the performance of effective characterization extracted by the network, builds a network model by using a self-attention mechanism, extracts comprehensive and pertinence characteristics, and improves the generalization performance of the model.
Referring to fig. 5, fig. 5 is a block diagram of a mechanical device transmission component intelligent fault diagnosis apparatus according to an embodiment of the present invention; the specific apparatus may include:
the data set construction module 100 is configured to collect vibration signals of a mechanical device transmission component under different working conditions, intercept the vibration signals for multiple times according to a preset data point length to obtain multiple sample data, divide the multiple sample data according to a preset proportion, calibrate one part of the sample data according to a fault type to be used as a labeled data set, and use the other part of the sample data as an unlabeled data set;
a self-supervision pre-training network construction module 200, configured to construct a self-supervision pre-training network by constructing a random mask module, a self-attention mechanism-based encoder model, and a decoder model, mask and reconstruct samples in the unlabeled dataset using the self-supervision pre-training network, wherein the reconstruction includes an encoding process and a decoding process, calculate reconstruction loss between signals before and after reconstruction, and optimize the self-supervision pre-training network with a minimized reconstruction loss as an objective function, and update network parameters of the self-supervision pre-training network;
the prediction model construction module 300 is configured to build a fine tuning network by constructing the encoder model and the classifier model, migrate parameters corresponding to the encoder model in the optimized self-supervision pre-training network to the encoder model of the fine tuning network, classify the samples in the labeled dataset by using the fine tuning network, calculate classification loss, optimize the fine tuning network by taking minimized classification loss as an objective function, and update network parameters of the fine tuning network to obtain an intelligent fault diagnosis model of the mechanical equipment transmission part;
the fault diagnosis module 400 is configured to input a vibration signal to be tested into the intelligent fault diagnosis model of the mechanical device transmission component, so as to obtain a fault class of the mechanical device transmission component.
The intelligent fault diagnosis device for a mechanical device driving part of the present embodiment is used to implement the foregoing intelligent fault diagnosis method for a mechanical device driving part, so that the specific implementation of the intelligent fault diagnosis device for a mechanical device driving part can be found in the foregoing example portions of the intelligent fault diagnosis method for a mechanical device driving part, for example, the data set building module 100, the self-monitoring pre-training network building module 200, the prediction model building module 300, and the fault diagnosis module 400 are respectively used to implement steps S101, S102, S103, and S104 in the foregoing intelligent fault diagnosis method for a mechanical device driving part, so that the specific implementation thereof can refer to the description of the corresponding examples of each portion and will not be repeated herein.
The specific embodiment of the invention also provides a computer readable storage medium, wherein the computer readable storage medium is stored with a computer program, and the computer program realizes the steps of the intelligent fault diagnosis method for the mechanical equipment transmission part when being executed by a processor.
It will be appreciated by those skilled in the art that embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It is apparent that the above examples are given by way of illustration only and are not limiting of the embodiments. Other variations and modifications of the present invention will be apparent to those of ordinary skill in the art in light of the foregoing description. It is not necessary here nor is it exhaustive of all embodiments. While still being apparent from variations or modifications that may be made by those skilled in the art are within the scope of the invention.

Claims (10)

1. An intelligent fault diagnosis method for a transmission part of a mechanical device is characterized by comprising the following steps:
collecting vibration signals of a mechanical equipment transmission part under different working conditions, intercepting the vibration signals for a plurality of times according to preset data point lengths to obtain a plurality of sample data, dividing the plurality of sample data according to preset proportions, calibrating one part of the sample data according to fault types to be used as a labeled data set, and the other part of the sample data is used as a non-labeled data set;
establishing a self-supervision pre-training network by constructing a random mask module, an encoder model and a decoder model based on a self-attention mechanism, masking and reconstructing samples in the unlabeled dataset by using the self-supervision pre-training network, wherein the reconstruction comprises an encoding process and a decoding process, calculating reconstruction loss between signals before and after reconstruction, optimizing the self-supervision pre-training network by taking minimized reconstruction loss as an objective function, and updating network parameters of the self-supervision pre-training network;
establishing a fine tuning network by constructing the encoder model and the classifier model, transferring parameters corresponding to the encoder model in the optimized self-supervision pre-training network to the encoder model of the fine tuning network, classifying samples in the labeled data set by using the fine tuning network, calculating classification loss, optimizing the fine tuning network by taking the minimized classification loss as an objective function, and updating network parameters of the fine tuning network to obtain an intelligent fault diagnosis model of the mechanical equipment transmission part;
and inputting the vibration signal to be tested into the intelligent fault diagnosis model of the mechanical equipment transmission part to obtain the fault type of the mechanical equipment transmission part.
2. The method of claim 1, wherein masking samples in the unlabeled dataset with the self-supervised pre-training network comprises:
for input samples [ X ] 1 ;X 2 ;...;X n ]Performing random masking operation to obtain a mask output matrix X m =[X 1 ;X 2 ;...;X n ][c 1 ;c 2 ;...;c n ] T Wherein n is the preset data point length, x 1 To x n Representing n row vectors of the input matrix c 1 To c n Representing randomly generated mask vectors, half of which have values of 1 and the rest of which are 0, T representing the transpose;
removing the row vector with 0 in the mask output matrix to obtain a final mask output matrix X M =X m -X 0
3. The intelligent fault diagnosis method for a mechanical device transmission component according to claim 2, wherein the reconstructed encoding process comprises:
adding one row of category token vectors for aggregating the classification information to the final mask output matrix, and superposing position codes for indicating the position information to obtain a first coding characteristic
Figure FDA0003989066520000021
wherein ,
Figure FDA0003989066520000022
representing a category token vector, ">
Figure FDA0003989066520000023
Representing a merge matrix->
Figure FDA0003989066520000024
Representing a position code;
the first coding feature is subjected to layer normalization processing to obtain a second coding feature
Figure FDA0003989066520000025
Wherein, gamma is a scaling parameter, beta is a translation parameter, and a standardized value +.>
Figure FDA0003989066520000026
Average of first coding feature in last dimension
Figure FDA0003989066520000027
Variance->
Figure FDA0003989066520000028
Epsilon is a smoothing factor, N dim Dimension of the last dimension of the first code feature;
processing the second coding feature through multi-stage stacked multi-head self-attention layers and multi-layer perceptron layers to obtain a target coding feature
Figure FDA0003989066520000029
The processing procedure of the d-level multi-head self-attention layer and the d-level multi-layer perceptron layer is as follows:
calculating the self-attention value of the d-1 level multi-layer sensor layer output
Figure FDA00039890665200000210
The output of the d-1 level multi-layer sensor layer is added with>
Figure FDA00039890665200000211
Residual connection to obtain the output of the d-level multi-head self-attention layer>
Figure FDA00039890665200000212
Wherein depth is the number of levels of the multi-headed self-focusing layer and multi-layered sensor layer stack, when d=1, the +.>
Figure FDA00039890665200000213
Self-attention value for the second coding feature, is->
Figure FDA0003989066520000031
For the first encoding feature;
outputting the d-level multi-head self-attention layer
Figure FDA0003989066520000032
And first linear layer weight +.>
Figure FDA0003989066520000033
Product of (c) and first linear layer deviation b 1 After GeLU activation function processing, and the second linear layer weight +.>
Figure FDA0003989066520000034
Multiplied by the second linear layer deviation b 2 Adding, finally, with the output of the d-level multi-head self-attention layer->
Figure FDA0003989066520000035
Residual connection to obtain the output +.>
Figure FDA0003989066520000036
4. A mechanical device transmission component intelligent fault diagnosis method according to claim 3, wherein the reconstructed decoding process comprises:
restoring the target coding feature to the vector y removed at the random masking block d =y+X 0 And the third linear layer is used for reducing the dimension of the first linear layer to obtain a first decoding characteristic y d ′=y d *w d +b d, wherein ,
Figure FDA0003989066520000037
is a third linear layer weight matrix, b d Is a third linear layer bias;
and processing the first decoding characteristic through the multi-stage stacked multi-head self-attention layer and the multi-layer perceptron layer, and recovering dimension through a fourth linear layer to obtain a reconstruction characteristic.
5. The intelligent fault diagnosis method for a mechanical equipment transmission component according to claim 4, wherein the reconstruction loss is:
Figure FDA0003989066520000038
wherein ,nB Representing the size of the batch and,
Figure FDA0003989066520000039
representing the reconstruction feature, x i Representing the mask output matrix.
6. The intelligent fault diagnosis method for a mechanical equipment transmission component according to claim 1, wherein the optimization method of the objective function is a back propagation and gradient descent method.
7. The intelligent fault diagnosis method for the mechanical equipment transmission component according to claim 1, wherein the classifier model is a Softmax classifier, probability values of each fault belonging to different categories are output through the Softmax classifier, and a calculation formula of the corresponding fault probability values is as follows:
Figure FDA0003989066520000041
wherein ,xi ' represents the ith input feature, y i Output probability value, n, representing corresponding class s Indicating the total number of fault categories.
8. The intelligent fault diagnosis method for a mechanical equipment transmission component according to claim 1, wherein the classification loss is:
Figure FDA0003989066520000042
wherein ,nb Indicating batch size, n s ' total number of categories representing health status, 1 {. Cndot. } represents a function at y j Return 1 when k, otherwise return 0, y j A real label representing a sample is provided,
Figure FDA0003989066520000043
representing the predicted probability of all health states.
9. An intelligent fault diagnosis device for a transmission part of a mechanical device, comprising:
the data set construction module is used for collecting vibration signals of the mechanical equipment transmission part under different working conditions, intercepting the vibration signals for a plurality of times according to preset data point lengths to obtain a plurality of sample data, dividing the plurality of sample data according to preset proportions, calibrating one part of the sample data according to fault types to be used as a labeled data set, and the other part of the sample data is used as a non-labeled data set;
the self-supervision pre-training network construction module is used for constructing a self-supervision pre-training network by constructing a random mask module, a self-attention mechanism-based encoder model and a self-attention mechanism-based decoder model, masking and reconstructing samples in the unlabeled dataset by utilizing the self-supervision pre-training network, wherein the reconstruction comprises an encoding process and a decoding process, calculating reconstruction loss between signals before and after reconstruction, optimizing the self-supervision pre-training network by taking the minimized reconstruction loss as an objective function, and updating network parameters of the self-supervision pre-training network;
the prediction model construction module is used for building a fine tuning network by constructing the encoder model and the classifier model, transferring parameters corresponding to the encoder model in the optimized self-supervision pre-training network to the encoder model of the fine tuning network, classifying samples in the labeled data set by using the fine tuning network, calculating classification loss, optimizing the fine tuning network by taking the minimized classification loss as an objective function, and updating network parameters of the fine tuning network to obtain an intelligent fault diagnosis model of the mechanical equipment transmission part;
and the fault diagnosis module is used for inputting the vibration signal to be tested into the intelligent fault diagnosis model of the mechanical equipment transmission part to obtain the fault type of the mechanical equipment transmission part.
10. A computer readable storage medium, characterized in that the computer readable storage medium has stored thereon a computer program which, when executed by a processor, implements the steps of a method for intelligent fault diagnosis of a transmission component of a mechanical device according to any of claims 1 to 8.
CN202211575101.2A 2022-12-08 2022-12-08 Intelligent fault diagnosis method and device for mechanical equipment transmission part Active CN116007937B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211575101.2A CN116007937B (en) 2022-12-08 2022-12-08 Intelligent fault diagnosis method and device for mechanical equipment transmission part

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211575101.2A CN116007937B (en) 2022-12-08 2022-12-08 Intelligent fault diagnosis method and device for mechanical equipment transmission part

Publications (2)

Publication Number Publication Date
CN116007937A true CN116007937A (en) 2023-04-25
CN116007937B CN116007937B (en) 2024-09-06

Family

ID=86020785

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211575101.2A Active CN116007937B (en) 2022-12-08 2022-12-08 Intelligent fault diagnosis method and device for mechanical equipment transmission part

Country Status (1)

Country Link
CN (1) CN116007937B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117874593A (en) * 2023-12-11 2024-04-12 华北电力大学 Fan bearing fault diagnosis model training method and device, electronic equipment and medium
CN117992862A (en) * 2024-04-07 2024-05-07 山东大学 Two-stage rotating mechanical equipment reliability assessment method and system based on large model

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111562109A (en) * 2020-05-19 2020-08-21 重庆邮电大学 Deep learning state identification and diagnosis method for mechanical equipment
CN111916067A (en) * 2020-07-27 2020-11-10 腾讯科技(深圳)有限公司 Training method and device of voice recognition model, electronic equipment and storage medium
CN112990578A (en) * 2021-03-16 2021-06-18 北京航空航天大学 Industrial low-quality high-noise data modeling and product quality prediction method
WO2022110730A1 (en) * 2020-11-27 2022-06-02 平安科技(深圳)有限公司 Label-based optimization model training method, apparatus, device, and storage medium
CN114993677A (en) * 2022-05-11 2022-09-02 山东大学 Rolling bearing fault diagnosis method and system based on unbalanced small sample data
US20220327035A1 (en) * 2020-06-03 2022-10-13 Soochow University Intra-class adaptation fault diagnosis method for bearing under variable working conditions
CN115410032A (en) * 2022-07-26 2022-11-29 北京工业大学 OCTA image classification structure training method based on self-supervision learning

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111562109A (en) * 2020-05-19 2020-08-21 重庆邮电大学 Deep learning state identification and diagnosis method for mechanical equipment
US20220327035A1 (en) * 2020-06-03 2022-10-13 Soochow University Intra-class adaptation fault diagnosis method for bearing under variable working conditions
CN111916067A (en) * 2020-07-27 2020-11-10 腾讯科技(深圳)有限公司 Training method and device of voice recognition model, electronic equipment and storage medium
WO2022110730A1 (en) * 2020-11-27 2022-06-02 平安科技(深圳)有限公司 Label-based optimization model training method, apparatus, device, and storage medium
CN112990578A (en) * 2021-03-16 2021-06-18 北京航空航天大学 Industrial low-quality high-noise data modeling and product quality prediction method
CN114993677A (en) * 2022-05-11 2022-09-02 山东大学 Rolling bearing fault diagnosis method and system based on unbalanced small sample data
CN115410032A (en) * 2022-07-26 2022-11-29 北京工业大学 OCTA image classification structure training method based on self-supervision learning

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
岳增营 等: "基于语言模型的预训练技术研究综述", 中文信息学报, vol. 35, no. 9, 30 September 2021 (2021-09-30), pages 15 - 29 *
戴俊 等: "基于生成对抗网络和自动编码器的机械系统异常检测", 仪器仪表学报, vol. 40, no. 9, 30 September 2019 (2019-09-30), pages 16 - 26 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117874593A (en) * 2023-12-11 2024-04-12 华北电力大学 Fan bearing fault diagnosis model training method and device, electronic equipment and medium
CN117992862A (en) * 2024-04-07 2024-05-07 山东大学 Two-stage rotating mechanical equipment reliability assessment method and system based on large model
CN117992862B (en) * 2024-04-07 2024-07-12 山东大学 Two-stage rotating mechanical equipment reliability assessment method and system based on large model

Also Published As

Publication number Publication date
CN116007937B (en) 2024-09-06

Similar Documents

Publication Publication Date Title
CN112149316B (en) Aero-engine residual life prediction method based on improved CNN model
CN111914873B (en) Two-stage cloud server unsupervised anomaly prediction method
CN116007937B (en) Intelligent fault diagnosis method and device for mechanical equipment transmission part
CN115293280A (en) Power equipment system anomaly detection method based on space-time feature segmentation reconstruction
CN115017826B (en) Method for predicting residual service life of equipment
Nasser et al. A hybrid of convolutional neural network and long short-term memory network approach to predictive maintenance
CN110851654A (en) Industrial equipment fault detection and classification method based on tensor data dimension reduction
CN114757432A (en) Future execution activity and time prediction method and system based on flow log and multi-task learning
CN117034143B (en) Distributed system fault diagnosis method and device based on machine learning
CN114266301A (en) Intelligent power equipment fault prediction method based on graph convolution neural network
CN112947385A (en) Aircraft fault diagnosis method and system based on improved Transformer model
Xu et al. Global attention mechanism based deep learning for remaining useful life prediction of aero-engine
CN114169091A (en) Method for establishing prediction model of residual life of engineering mechanical part and prediction method
Pan et al. A meta network pruning framework for remaining useful life prediction of rocket engine bearings with temporal distribution discrepancy
CN116842379A (en) Mechanical bearing residual service life prediction method based on DRSN-CS and BiGRU+MLP models
CN113435321A (en) Method, system and equipment for evaluating state of main shaft bearing and readable storage medium
Ma et al. A collaborative central domain adaptation approach with multi-order graph embedding for bearing fault diagnosis under few-shot samples
CN116227586A (en) Meta learning fault diagnosis method and device based on depth residual error shrinkage prototype network
CN117667495B (en) Association rule and deep learning integrated application system fault prediction method
CN117390407B (en) Fault identification method, system, medium and equipment of substation equipment
Liu et al. A novel health prognosis method for system based on improved degenerated Hidden Markov model
CN116383757B (en) Bearing fault diagnosis method based on multi-scale feature fusion and migration learning
Bond et al. A hybrid learning approach to prognostics and health management applied to military ground vehicles using time-series and maintenance event data
CN112560252A (en) Prediction method for residual life of aircraft engine
CN116628444A (en) Water quality early warning method based on improved meta-learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant