CN117574259A - Attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment - Google Patents

Attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment Download PDF

Info

Publication number
CN117574259A
CN117574259A CN202311316905.5A CN202311316905A CN117574259A CN 117574259 A CN117574259 A CN 117574259A CN 202311316905 A CN202311316905 A CN 202311316905A CN 117574259 A CN117574259 A CN 117574259A
Authority
CN
China
Prior art keywords
attention
module
feature
model
twin
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202311316905.5A
Other languages
Chinese (zh)
Other versions
CN117574259B (en
Inventor
缪小冬
徐坤
李舜酩
江星星
王�华
陆建涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Tech University
Original Assignee
Nanjing Tech University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Tech University filed Critical Nanjing Tech University
Priority to CN202311316905.5A priority Critical patent/CN117574259B/en
Publication of CN117574259A publication Critical patent/CN117574259A/en
Application granted granted Critical
Publication of CN117574259B publication Critical patent/CN117574259B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01MTESTING STATIC OR DYNAMIC BALANCE OF MACHINES OR STRUCTURES; TESTING OF STRUCTURES OR APPARATUS, NOT OTHERWISE PROVIDED FOR
    • G01M13/00Testing of machine parts
    • G01M13/02Gearings; Transmission mechanisms
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01MTESTING STATIC OR DYNAMIC BALANCE OF MACHINES OR STRUCTURES; TESTING OF STRUCTURES OR APPARATUS, NOT OTHERWISE PROVIDED FOR
    • G01M13/00Testing of machine parts
    • G01M13/04Bearings
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]

Abstract

The invention discloses an attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment, which comprises the steps of firstly obtaining vibration signal data of different fault parts of a bearing and a gear under different loads or rotating speeds, and preprocessing the data; constructing a convolution neural network model of a double-branch architecture, so that the model can simultaneously receive unbalanced samples under two different working conditions for training; adding a cross-module serial dual-attention mechanism comprising a channel feature attention enhancement module and a segment feature attention enhancement module into the model, wherein the cross-module serial dual-attention mechanism is used for reasonably enhancing the feature information of small samples, enhancing the domain invariant features of unbalanced samples and realizing the attention interpretability of the features; meanwhile, a twin feature fusion module is introduced, so that domain invariant features in an unbalanced sample are better extracted; finally, various proposed feature visualization methods are adopted to realize feature visualization interpretation of the migration diagnosis model, and high-efficiency migration diagnosis accuracy is completed on the test data set.

Description

Attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment
Technical Field
The invention relates to an intelligent diagnosis technology of high-end equipment, in particular to an attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment.
Background
The rotating parts such as the bearing or the gear are core parts for power transmission of high-end equipment, and the health state of the rotating parts directly influences the operation reliability of the high-end equipment, so that the fault monitoring and diagnosis of the rotating parts such as the bearing or the gear are very important. At present, fault monitoring and diagnosing methods are mainly divided into three categories:a method based on a kinetic model; />A method of extracting a mechanism feature based on signal processing; />An intelligent fault diagnosis method based on artificial intelligence. The method based on the dynamic model is the basis of fault diagnosis, but because the method is difficult to simulate the fault dynamic characteristics of real high-end equipment in operation, researchers gradually start to study a signal processing method capable of extracting fault mechanism characteristics, and the running state of the rotating part can be judged according to the extracted corresponding mechanism characteristics. In recent decades, intelligent fault diagnosis methods have been increasingly applied to the field of fault diagnosis due to the rise of artificial intelligence technology. In the intelligent fault diagnosis application process, the problem of migration working conditions caused by rotation speed or load change is frequently encountered; also frequently encounter a failure from normalSample imbalance caused by more samples and fewer fault samples; for high-end equipment, such as geared turbofan aircraft engines, problems are also often encountered in which both bearings and gears need to be diagnosed.
In intelligent diagnosis engineering application, the problem of migration working conditions often causes drift of fault characteristics, so various migration learning methods are often used to reasonably measure and limit the drift of characteristics, such as the migration measurement method mentioned in patent CN 113435375B. The problem of sample unbalance often causes excessive misclassification of a small sample by a model and also causes simultaneous occurrence of migration problems, and a method for fault migration diagnosis of a rolling bearing with multitasking and freedom of the unbalanced sample is proposed in the patent CN 113469066B, so that the problem is solved.
However, aiming at the synchronous diagnosis problem of the migration working conditions of the bearing and the gear under unbalanced samples of high-end equipment such as a gear-driven turbofan aeroengine, corresponding patents are fewer; and also lacks various feature visualization methods for such cases. For example: the CN113191215A uses two-dimensional time-frequency data, has low processing speed, has good effect only under the condition of unbalanced sample, and has general diagnosis on the migration of working conditions; patent CN113505655a fails to address the conditions of variable rotational speed or variable load shifting for simultaneous diagnostics. In summary, the existing intelligent diagnosis method is difficult to realize synchronous feature interpretable fault diagnosis of the migration working condition of the bearing and the gear under the unbalanced sample.
Disclosure of Invention
The invention aims to: the invention aims to solve the defects in the prior art, provides a method for diagnosing the intelligent migration interpretability of the attention twin suitable for high-end equipment, can diagnose migration working conditions of a bearing and a gear under unbalanced samples at the same time, adopts various feature visualization methods based on the attention or twin networks to visually interpret the features learned by an intelligent diagnosis model, and promotes the practical application process of intelligent fault diagnosis.
The technical scheme is as follows: the invention discloses an attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment, which comprises the following steps of:
step S1, obtaining fault signals of a bearing and a gear in a high-end equipment (such as a gear drive turbofan aeroengine) simulation test bed to obtain a bearing fault sample and a gear fault sample;
step S2, carrying out data preprocessing on a bearing fault sample and a gear fault sample: sequentially combining three aero-bearing data under variable load and three planetary gear data under variable rotation speed, adding healthy samples (namely normal sample data), setting each type of healthy samples into different unbalanced sample sizes, performing time-frequency conversion on the unbalanced samples into frequency domain signals (for example, short-time Fourier conversion can be adopted), and performing standardized processing;
s3, constructing an attention twin intelligent migration interpretable diagnostic model
Step S3.1, building a double-branch convolutional neural network model, wherein the double-branch convolutional neural network model is divided into 5 convolutional layers, 5 pooling layers and a full-connection layer, and the model comprises a batch standardization layer and L2 regularization;
s3.2, designing a cross-module double-attention mechanism module which is connected in series with the cross-module for better diagnosis effect and better interpretability of the intelligent diagnosis model, and establishing a feature visualization method of the corresponding module;
the cross-module dual-attention mechanism module comprises a channel feature attention module CFAE and a fragment feature attention module FFAE, wherein the feature values in different channels of the unbalanced sample are enhanced through the channel feature attention module CFAE, and the feature values of different feature fragments of the unbalanced sample are enhanced through the fragment feature attention module FFAE;
step S3.3, in order to better extract the characteristic information in the unbalanced sample, and simultaneously extract as many fusion domain invariant features under different loads or rotating speeds as possible for the model, and introduce a contrast loss function to construct a twin feature fusion network module;
s3.4, inserting a cross-module double-attention mechanism module and a twin feature fusion network module into the double-branch convolutional neural network model to construct an attention twin intelligent migration interpretable diagnostic model under an unbalanced sample; wherein after the channel feature attention module CFAE is placed at the 3 rd pooling layer, the segment feature attention module FFAE is placed at the 5 th convolution layer, applying a twinning feature fusion network at the Global Average Pooling (GAP) layer;
step S4, training an attention twin intelligent migration interpretable diagnostic model by using unbalanced sample data standardized in the step S2, wherein the attention twin intelligent migration interpretable diagnostic model is trained by using unbalanced sample data under any two loads or rotating speeds, the unbalanced sample data under two working conditions are used for training at first, and then data under other working conditions are used for testing (the limitation that the traditional migration diagnostic method needs to use target domain data of the same class as the source domain sample for follow-up training is avoided), so that the practical fault diagnosis application requirements (the target domain data which is of the same class as the source domain data and the quantity of which is as consistent as possible is difficult to obtain in the practical application of migration diagnosis) are more satisfied;
s5, testing the diagnostic effect of the attention twin intelligent migration explanatory diagnostic model by using a test set; parameters such as internal weight and bias of the trained attention twin intelligent migration interpretable diagnostic model are fixed, further, diagnostic effects of the model are verified by using data under other rotating speeds or loads, a diagnostic comparison box graph and a confusion matrix with other models are drawn, and model performance is displayed;
step S6, visually displaying statistical features learned by the attention twin intelligent migration interpretable diagnostic model by means of the cross-module double-attention mechanism module in step S3, firstly displaying the enhancement of the channel feature attention module CFAE on different channel features, then displaying the enhancement of the segment feature attention module FFAE on the features of different segments, and simultaneously activating weightsW2Visual display is carried out, and then the statistical characteristic information which is specifically learned by a display model is combined with the spectrum waveform;
and S7, displaying cross-domain fusion and invariant features which are not changed along with working conditions in the unbalanced sample under the working conditions learned by the attention twin intelligent migration interpretability diagnostic model by means of the step S3 twin feature fusion network module, and further explaining feature information learned by the intelligent model.
Further, the dual-branch convolutional neural network model in the step S3.1 includes 5 convolutional layers, 5 pooling layers and a full-connection layer, and further includes a batch normalization layer and L2 regularization, and the formulas thereof are as follows:
in the method, in the process of the invention,mrepresenting the total amount of sample per batch,sample representing each lot, +.>And->Represents mean and standard deviation, respectively, +.>And->Respectively representing corrected sample values, +.>For avoiding the phenomenon that the standard deviation of the denominator is 0,γandβscale factors and translation factors, respectively;
in the method, in the process of the invention,E in is the original class-loss function,is a regularization parameter which is a function of the data,wrepresenting weight parameters in the model.
Further, the details of the cross-module dual-attention mechanism module in step S3.2 are as follows: the expression of the channel feature attention module CFAE is:
in the method, in the process of the invention,cfaeis the characteristic output of the CFAE module,z(Pool 3) represents the characteristics of the third pooling layer output,z activate a feature representing the activation of the attention,is a correction coefficient which is used to correct the image,Crepresenting the number of classes of failure,eis a constant;
the expression of the segment feature attention module FFAE is:
in the method, in the process of the invention,ffaeis the characteristic output of the FFAE module,W2representing activation weights corresponding to different segments of each sample,z j,m (i) Is the characteristic output of the layer above FFAE,A 1 ~A 8 it is the sample that corresponds to each feature fragment activation value.
Further, the expression of the loss function in the step S3.3 is:
in the method, in the process of the invention,a parameter representing the contrast loss layer,L Siamese is the loss of the twinning feature fusion network,D(G 1 ,G 2 ) Representing the values from two speeds or loadsG 1 ,G 2 ) The distance between the lower unbalanced sample features,Qis a constant value, which is a constant value,lon behalf of the network layer,nthe number of the representative number is represented by,Nrefers to the total number of samples.
Further, the total loss function of the attention twin intelligent migration interpretable diagnostic model obtained in step S3.4 may be expressed as:
in the method, in the process of the invention,Nthe number of the representative number is represented by,Crepresenting the number of classes of failure,yis a sample tag that is to be read,pfor the probability that the tag is correct,L sum representing the total loss of the product,parameters representing cross entropy loss, < >>A parameter representing the regularization loss is provided,L entropy is a class-loss cross-entropy loss term,L L is a regularization loss term that is used to determine the regularization loss,Nrepresenting the total number of samples.
Further, the enhanced values of the channel feature attention module CFAE for different channel features in step S7 are as follows:
in the method, in the process of the invention,z activate representing the value of the feature after attention activation,W1is the weight of the attention to the person,is a correction coefficient which is used to correct the image,z(Pool 3) represents the characteristics of the third pooling layer output,z activate characteristic of attention activation, +_>Is a correction coefficient which is used to correct the image,Crepresenting the number of fault categories;
the value of the segment feature attention module FFAE weight is as follows:
in the method, in the process of the invention,W2representing activation weights corresponding to different segments of each sample,z j,m (i) Is the characteristic output of the layer above FFAE,A 1 ~A 8 it is the sample that corresponds to each feature fragment activation value.
The beneficial effects are that: the invention provides an intelligent diagnosis method for variable rotation speed or variable load migration under an unbalanced sample for different types of faults of a bearing and a gear, and the method effectively enhances the characteristic information of a small sample under the unbalanced sample through a cross-module serial dual-attention mechanism designed by the invention, so that the model can better extract the characteristics in the unbalanced sample; meanwhile, under the constraint of the proposed twin feature fusion network, the model extracts domain invariant features under an unbalanced sample which does not change along with working conditions, so that data under other working conditions can be better diagnosed; finally, the information learned by the model is visually interpreted through the attention mechanism and the characteristics in the twin characteristic fusion network respectively, so that the internal interpretability of the intelligent diagnosis model is enhanced.
Drawings
FIG. 1 is an overall flow chart of the intelligent diagnosis of the present invention;
FIG. 2 is a schematic diagram of an illustrative model of attention twin intelligent migration under an unbalanced sample of the present invention;
FIG. 3 is a graph showing the diagnostic performance of the model of the present invention under different migration conditions;
FIG. 4 is a confusion matrix of the model in the working condition of D1/D2-D3 in the invention;
FIG. 5 is a graph of the enhancement effect of the CFAE module on different channel characteristics in the present invention;
FIG. 6 is a graph of the enhancement effect of the FFAE module on different segment characteristics in the present invention;
fig. 7 is a plot of attention versus frequency domain feature matching.
FIG. 8 is a graph of the unchanged feature of the unbalanced sample domain extracted by the twinning feature fusion network layer in the present invention.
Detailed Description
The technical scheme of the present invention is described in detail below, but the scope of the present invention is not limited to the embodiments.
As shown in fig. 1 and fig. 2, the attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment can diagnose a bearing and a gear at the same time, and adopts various attention or twin network-based feature visualization methods to carry out visual interpretation on the features learned by an intelligent diagnosis model so as to promote the practical application process of intelligent fault diagnosis. The method specifically comprises the following steps:
step S1: and acquiring fault signals of the bearing and the gear in the simulation test bed of the high-end equipment of the gear drive turbofan aeroengine, and further acquiring a bearing fault sample and a gear fault sample.
The embodiment adopts the data of variable load (1000N, 1400N and 1800N) of the ultra-high-speed aviation bearing of the university of Duling engineering at 18000r/min, and the model is verified for simultaneously diagnosing the bearing and the planet gear by adopting the data of the self-contained planet gear at variable rotation speed (1800 r/min,2000r/min and 2500 r/min) and combining the two data because the aviation planet gear is very expensive.
Step S2: data preprocessing of bearing failure samples and gear failure samples
Sequentially combining three aviation bearing data under variable load and three planetary gear data under variable rotation speed, adding health samples, setting each type of health samples into different unbalanced sample sizes, performing time-frequency conversion on the health samples to convert the health samples into frequency domain signals, and performing standardized processing.
The present embodiment is divided into seven health conditions: normal samples (NR), bearing inner ring indentations 150 μm (IRI 150), bearing inner ring indentations 450 μm (IRI 450), bearing roller indentations 450 μm (RI 450), planet wheel pitting (PGP), sun wheel pitting (SGP) and sun wheel wear (SGW); for the seven types of health conditions, different numbers of sample sizes are respectively set, namely, the minimum number of samples is 100, and the maximum number of samples is 500. As shown in fig. 1, in this embodiment, the unbalanced sample, the working condition 1 and the working condition 2 may form two training sets, and the balanced sample and the working condition 3 may form a test set; the first two training sets input an untrained diagnosis model at the same time, and after the model is trained, the data set under the third working condition is used for verification.
Step S3: construction of attention twin intelligent migration interpretable diagnostic model
Step S3.1: building a double-branch convolutional neural network model, wherein the double-branch convolutional neural network model is divided into 5 convolutional layers, 5 pooling layers and a full-connection layer, the model comprises a batch normalization layer and L2 regularization, and the formulas are as follows:
in the method, in the process of the invention,and->Represents mean and standard deviation, respectively, +.>For avoiding the phenomenon that the standard deviation of the denominator is 0,γandβscale factors and translation factors, respectively;
in the method, in the process of the invention,E in is the original class-loss function,is a regularization parameter which is a function of the data,wrepresenting weight parameters in the model;
step S3.2: in order to make the diagnosis effect of the intelligent diagnosis model better and more interpretable, a cross-module double-attention mechanism module is designed, and a feature visualization method of a corresponding module is established.
The cross-module dual-attention mechanism module of the present embodiment includes a channel feature attention module (CFAE) and a segment feature attention module (FFAE), where the channel feature attention module (CFAE) enhances feature values in different channels of an unbalanced sample by attention, and the total formulas after attention activation and after residual linking are expressed as:
in the method, in the process of the invention,z activate representing the value of the feature after attention activation,W1is the weight of the attention to the person,is a correction coefficient which is used to correct the image,cfaeis the characteristic output of the CFAE module,z(Pool 3) represents the characteristics of the third pooling layer output,z activate characteristic of attention activation, +_>Is a correction coefficient which is used to correct the image,Crepresentative species;
segment feature attention module (FFAE) enhances the feature value of different feature segments of an unbalanced sample by attention, its attention weightW2And the attention formulas are expressed as:
in the method, in the process of the invention,ffaeis the characteristic output of the FFAE module,W2representing activation weights corresponding to different segments of each sample,z j,m (i) Is the characteristic output of the layer above FFAE,A 1 ~A 8 is that the sample corresponds to each feature fragment activation value;
step S3.3: in order to better extract the characteristic information in the unbalanced sample, and simultaneously extract as many fusion domain invariant features under different loads or rotating speeds as possible for the model, a contrast loss function is introduced to construct a twin feature fusion network module, and the loss function and a distance formula of the twin feature fusion network module are respectively expressed as follows:
in the method, in the process of the invention,a parameter representing the contrast loss layer,L Siamese is the loss of the twinning feature fusion network,D(G 1 ,G 2 ) Representing the values from two speeds or loadsG 1 ,G 2 ) Lower part (C)The distance between the sample features is not balanced,Qis a constant value, which is a constant value,lon behalf of the network layer,nthe number of the representative number is represented by,dimrepresenting dimensions;
step S3.4: inserting the dual-attention mechanism module and the twin feature fusion network module into a dual-branch convolution network model to construct an attention twin intelligent migration interpretable diagnostic model under an unbalanced sample;
in the attention twin intelligent migration interpretable diagnostic model, after the channel feature attention module CFAE is placed at the third pooling layer, the segment feature attention module FFAE is placed at the 5 th convolution layer, and a twin feature fusion network is applied at the Global Average Pooling (GAP) layer;
the total loss function of the attention twin intelligent migration interpretable diagnostic model is expressed as:
in the method, in the process of the invention,Nthe number of the representative number is represented by,Crepresenting the number of classes of failure,yis a sample tag that is to be read,pfor the probability that the tag is correct,L sum representing the total loss of the product,parameters representing cross entropy loss, < >>A parameter representing the regularization loss is provided,L entropy is a class-loss cross-entropy loss term,L L is a regularization loss term.
The detailed parameters of the attention twin smart migration interpretable diagnostic model of the present embodiment are shown in Table 1:
table 1 model detailed parameter table
Layer number Layer type Core size/step size Number of channels Batch normalization Activation function
1 Conv_1 45*1/3*1 16 Yes Relu
2 Pool_1 2*1/2*1 16 / /
3 Conv_2 15*1/3*1 32 Yes Relu
4 Pool_2 2*1/2*1 32 / /
5 Conv_3 3*1/1*1 48 Yes Relu
6 Pool_3 2*1/2*1 48 / /
7 CFAE / 48 / /
8 Conv_4 3*1/1*1 64 Yes Relu
9 Pool_4 2*1/2*1 64 / /
10 Conv_5 3*1/1*1 64 Yes Relu
11 FFAE / 64 /
12 GAP / 64 / /
13 SFF / / / /
14 Fc1 64*10 / / /
15 / / / / Softmax
Step S4: training the attention twin intelligent migration interpretive diagnostic model using the unbalanced sample data (two training sets in fig. 1) prepared in step S2, wherein the attention twin intelligent migration interpretive diagnostic model is trained by using unbalanced sample data under any two loads or rotating speeds, namely: firstly training by using unbalanced sample data under two working conditions, and then testing by using data under other working conditions;
step S5: performing model diagnosis effect test on the attention twin intelligent migration interpretable diagnosis model;
the parameters such as the internal weight and bias of the attention twin intelligent migration interpretable diagnostic model trained in the step S4 are fixed, further, the diagnostic effect of the model is verified by using data under other rotating speeds or loads, a diagnostic performance of the model under different migration working conditions and a confusion matrix of the model of FIG. 4 under the working conditions of D1/D2-D3 are drawn, and the performance of the model is displayed.
Step S6: and visually displaying the statistical features learned by the attention twin intelligent migration explanatory diagnosis model by means of the cross-module double-attention mechanism module.
Firstly, enhancing the characteristics of different channels by the CFAE module, and drawing an effect diagram of enhancing the characteristics of different channels by the CFAE module of FIG. 5; feature enhancement of the FFAE module for different fragments is then demonstrated, along with weighting of activationW2Performing visual display, and drawing an effect graph of the FFAE module of FIG. 6 on the enhancement of different segment characteristics; and further, combining the spectrum waveform, showing which statistical characteristic information is specifically learned by the model, and drawing a attention and frequency domain characteristic matching diagram of fig. 7.
Step S7: and displaying the cross-domain fusion and invariant features which are not changed along with the working condition in the unbalanced sample under the variable working condition learned by the attention twin intelligent migration interpretability diagnostic model by means of the twin feature fusion network module, drawing an unbalanced sample domain invariant feature map extracted by the twin feature fusion network layer of FIG. 8, and further explaining the feature information learned by the intelligent model.
Examples:
the feasibility of the present invention was verified by taking as an example the intelligent fault diagnosis of bearings and gears in the geared turbofan aeroengine simulation test stand described in steps 1 and 2 of the above-described embodiments (see fig. 2).
The imbalance samples under each condition of this example were prepared as shown in table 2:
TABLE 2 sample size for bearing and gear imbalance samples under different conditions
In Table 2, D1 to D3 represent migration conditions 1 to 3, respectively. It can be seen from the table that the number of samples of each type is ragged, with a minimum of 100 samples and a maximum of 500 samples. The sample data are used, unbalanced sample data under two working conditions are used for training, and then data under a third working condition are used for testing the model, and the diagnosis performance of the model under different working conditions is shown in figure 3. When the abscissa of fig. 3 represents each operating condition, the dataset symbol "D" is omitted for convenience. In FIG. 3, 1/2.fwdarw.3 represents the working condition of D1/D2.fwdarw.D3, and so on. It can be seen from the figure that the proposed model achieves a rather good diagnostic effect. Even under such severe conditions, uneven unbalanced samples are used as training, and the diagnosis accuracy is higher than 96% for a plurality of detection components, and the diagnosis stability is good. However, for D1/D3- > D2 and D3/D2- > D1, the test accuracy of the model is even slightly higher than the training accuracy, which also proves the adaptive robustness of the model to some extent.
In addition, as can be seen from the confusion matrix of fig. 4, the attention twin intelligent migration interpretable diagnostic model of the present invention correctly classifies these confusing unbalanced samples, further demonstrating the advancement of the present invention.
As can be seen from fig. 5, the CFAE module enhances the characteristics of the CFAE layer relative to the characteristics of the upper layer (pool_3). This enhancement occurs whether the absolute value is large or small. The feature increase amplitude with smaller absolute value is relatively smaller, and the feature increase amplitude with larger absolute value is relatively larger. Since the principle of CFAE is to enhance the eigenvalues of different channels in the model, each channel (channel 1, channel 13, channel 25, channel 37) has a certain effect, it is placed in front of the segment feature enhancement network. To apply the performance of the CFAE network to the more specific features extracted by the model as early as possible, it is placed after the third pooling layer.
As can be seen from the segment feature enhancement effect of the FFAE model in fig. 6, the FFAE enhances the segment with a larger absolute value of the selected feature, ignores the part with a smaller feature value, and makes the model have a better classification effect on the feature with significant enhancement. Selected from fig. 6 are channel 1, channel 20, channel 38, and channel 54.
As can be seen from fig. 7 (the abscissa indicates the feature segment attention, i.e. the feature attention segment, and the ordinate indicates the number of samples of different fault types), the attention profile of the FFAE network is almost uniform for normal sample signals without any faults, with no particular attention to the feature segment. For the two types of faults which are difficult to distinguish between C1 and C3, the model focuses on the low-frequency part of the C1 fault signal and the high-frequency part of the C3 fault signal respectively, so that the model maximally distinguishes the two characteristics.
Fig. 8 shows the domain invariant feature for different conditions learned by the model, since the highlighting is the same, and the left and right graphs in fig. 8 are imbalance sample training data conditions 1 and 2 in order. Finally, from the above visual analysis of various features, it can be derived that the intelligent diagnostic model learns not the failure feature frequency with actual physical meaning, but features of a certain waveform signal that are statistically different from other waveform signals.
In summary, the invention can synchronously diagnose the bearing and the gear in an unbalanced sample, and simultaneously adopts a plurality of feature visualization methods, so that a learning mechanism in the intelligent diagnosis model is more interpretable, statistical feature information learned by the model is interpreted, and the application process of the intelligent diagnosis model is further promoted.

Claims (6)

1. An attention twin intelligent migration interpretability diagnostic method for high-end equipment, comprising the steps of:
s1, acquiring fault signals of a bearing and a gear in a high-end equipment simulation test bed, and obtaining a bearing fault sample and a gear fault sample;
step S2, data preprocessing: sequentially combining the bearing data under three variable loads and the gear data under three variable rotation speeds, adding health samples, setting each type of health samples into different unbalanced sample sizes, performing time-frequency conversion on the unbalanced samples to convert the unbalanced samples into frequency domain signals, and performing standardized processing;
s3, constructing an attention twin intelligent migration interpretable diagnostic model
Step S3.1, building a double-branch convolutional neural network model, wherein the double-branch convolutional neural network model is divided into 5 convolutional layers, 5 pooling layers and a full-connection layer, and the model comprises a batch standardization layer and L2 regularization;
s3.2, designing a cross-module dual-attention mechanism module with cross modules connected in series, and establishing a feature visualization method of the corresponding module;
the cross-module dual-attention mechanism module comprises a channel feature attention module CFAE and a fragment feature attention module FFAE, wherein the feature values in different channels of the unbalanced sample are enhanced through the channel feature attention module CFAE, and the feature values of different feature fragments of the unbalanced sample are enhanced through the fragment feature attention module FFAE;
s3.3, introducing a contrast loss function to construct a twin feature fusion network module;
s3.4, inserting a cross-module double-attention mechanism module and a twin feature fusion network module into the double-branch convolutional neural network model to construct an attention twin intelligent migration interpretable diagnostic model under an unbalanced sample; wherein after the CFAE module is placed at the 3 rd pooling layer, the FFAE module is placed at the 5 th convolution layer, and a twinning feature fusion network is applied at the global average pooling GAP layer;
step S4, training an attention twin intelligent migration interpretable diagnostic model by using the unbalanced sample data obtained in the step S2, wherein the attention twin intelligent migration interpretable diagnostic model is trained by using unbalanced sample data under any two loads or rotating speeds, and the unbalanced sample data under two working conditions are firstly used for training;
s5, testing the diagnostic effect of the attention twin intelligent migration explanatory diagnostic model by using a test set; parameters such as internal weight and bias of the trained attention twin intelligent migration interpretable diagnostic model are fixed, further, diagnostic effects of the model are verified by using data under other rotating speeds or loads, a diagnostic comparison box graph and a confusion matrix with other models are drawn, and model performance is displayed;
step S6, visually displaying statistical features learned by the attention twin intelligent migration interpretable diagnostic model by means of the cross-module double-attention mechanism module in step S3, firstly displaying the enhancement of the channel feature attention module CFAE on different channel features, then displaying the enhancement of the segment feature attention module FFAE on the features of different segments, and simultaneously activating weightsW2Visual display is carried out, and then the statistical characteristic information which is specifically learned by a display model is combined with the spectrum waveform;
and S7, displaying cross-domain fusion and invariant features which are not changed along with working conditions in the unbalanced sample under the working conditions learned by the attention twin intelligent migration interpretability diagnostic model by means of the step S3 twin feature fusion network module, and further explaining feature information learned by the intelligent model.
2. The method for diagnosing the interpretability of the migration of the attention twin for high-end equipment according to claim 1, wherein the two-branch convolutional neural network model in the step S3.1 comprises 5 convolutional layers, 5 pooling layers and one full-connection layer, and further comprises a batch normalization layer and L2 regularization, and the formulas are as follows:
in the method, in the process of the invention,mrepresenting the total amount of sample per batch,sample representing each lot, +.>And->Represents mean and standard deviation, respectively, +.>And->Respectively representing corrected sample values, +.>For avoiding the phenomenon that the standard deviation of the denominator is 0,γandβscale factors and translation factors, respectively;
in the method, in the process of the invention,E in is the original class-loss function,is a regularization parameter which is a function of the data,wrepresenting weight parameters in the model.
3. The attention twin smart migration interpretable diagnostic method for high-end equipment of claim 1, wherein the step S3.2 cross-module dual-attention mechanism module is detailed as follows: the expression of the channel feature attention module CFAE is:
in the method, in the process of the invention,cfaeis the characteristic output of the CFAE module,z(Pool 3) represents the characteristics of the third pooling layer output,z activate a feature representing the activation of the attention,is a correction coefficient which is used to correct the image,Crepresenting the number of classes of failure,eis a constant;
the expression of the segment feature attention module FFAE is:
in the method, in the process of the invention,ffaeis the characteristic output of the FFAE module,W2representing activation weights corresponding to different segments of each sample,z j,m (i) Is the characteristic output of the layer above FFAE,A 1 ~A 8 it is the sample that corresponds to each feature fragment activation value.
4. The attention twin smart migration interpretable diagnostic method for high-end equipment of claim 1, wherein the expression of the loss function in step S3.3 is:
in the method, in the process of the invention,a parameter representing the contrast loss layer,L Siamese is the loss of the twinning feature fusion network,D(G 1 ,G 2 ) Representing the values from two speeds or loadsG 1 ,G 2 ) The distance between the lower unbalanced sample features,Qis a constant value, which is a constant value,lon behalf of the network layer,nthe number of the representative number is represented by,Nrefers to the total number of samples.
5. The attention twin smart migration interpretable diagnostic method for high-end equipment of claim 1, wherein the total loss function of the attention twin smart migration interpretable diagnostic model obtained at step S3.4 is expressed as:
in the method, in the process of the invention,Nthe number of the representative number is represented by,Crepresenting the number of classes of failure,yis a sample tag that is to be read,pfor the probability that the tag is correct,L sum representing the total loss of the product,parameters representing cross entropy loss, < >>A parameter representing the regularization loss is provided,L entropy is a class-loss cross-entropy loss term,L L is a regularization loss term that is used to determine the regularization loss,Nrepresenting the total number of samples.
6. The method for diagnosing the interpretive migration of attention twin for high-end equipment according to claim 1, wherein the enhanced values of the channel feature attention module CFAE for different channel features in step S7 are as follows:
in the method, in the process of the invention,z activate representing the value of the feature after attention activation,W1is the weight of the attention to the person,is a correction coefficient which is used to correct the image, z(Pool 3) represents the characteristics of the third pooling layer output,z activate characteristic of attention activation, +_>Is a correction coefficient which is used to correct the image,Crepresenting the number of fault categories;
the value of the segment feature attention module FFAE weight is as follows:
in the method, in the process of the invention,W2representing activation weights corresponding to different segments of each sample,z j,m (i) Is the characteristic output of the layer above FFAE,A 1 ~A 8 it is the sample that corresponds to each feature fragment activation value.
CN202311316905.5A 2023-10-12 2023-10-12 Attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment Active CN117574259B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311316905.5A CN117574259B (en) 2023-10-12 2023-10-12 Attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311316905.5A CN117574259B (en) 2023-10-12 2023-10-12 Attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment

Publications (2)

Publication Number Publication Date
CN117574259A true CN117574259A (en) 2024-02-20
CN117574259B CN117574259B (en) 2024-05-07

Family

ID=89886949

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311316905.5A Active CN117574259B (en) 2023-10-12 2023-10-12 Attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment

Country Status (1)

Country Link
CN (1) CN117574259B (en)

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109035297A (en) * 2018-07-19 2018-12-18 深圳市唯特视科技有限公司 A kind of real-time tracing method based on dual Siam's network
CN111626171A (en) * 2020-05-21 2020-09-04 青岛科技大学 Group behavior identification method based on video segment attention mechanism and interactive relation activity diagram modeling
CN113191215A (en) * 2021-04-12 2021-07-30 西安理工大学 Rolling bearing fault diagnosis method integrating attention mechanism and twin network structure
CN113609896A (en) * 2021-06-22 2021-11-05 武汉大学 Object-level remote sensing change detection method and system based on dual-correlation attention
CN114842363A (en) * 2022-07-04 2022-08-02 南方电网科学研究院有限责任公司 Identification method and system for key power equipment in digital twin platform area
CN115545092A (en) * 2022-08-31 2022-12-30 哈尔滨工业大学(威海) Aero-engine small sample fault diagnosis method based on deep twin self-attention network
WO2023007198A1 (en) * 2021-07-27 2023-02-02 Számítástechnikai És Automatizálási Kutatóintézet Training method for training a change detection system, training set generating method therefor, and change detection system
CN116106012A (en) * 2021-11-11 2023-05-12 中国科学院沈阳自动化研究所 Rolling bearing domain adaptive fault diagnosis method based on attention mechanism
CN116401603A (en) * 2023-04-26 2023-07-07 哈尔滨工程大学 Multi-mode bearing fault intelligent diagnosis method based on transfer learning
CN116563337A (en) * 2023-04-11 2023-08-08 武汉大学 Target tracking method based on double-attention mechanism
US20230274531A1 (en) * 2022-02-09 2023-08-31 Soochow University Global and local feature reconstruction network-based medical image segmentation method

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109035297A (en) * 2018-07-19 2018-12-18 深圳市唯特视科技有限公司 A kind of real-time tracing method based on dual Siam's network
CN111626171A (en) * 2020-05-21 2020-09-04 青岛科技大学 Group behavior identification method based on video segment attention mechanism and interactive relation activity diagram modeling
CN113191215A (en) * 2021-04-12 2021-07-30 西安理工大学 Rolling bearing fault diagnosis method integrating attention mechanism and twin network structure
CN113609896A (en) * 2021-06-22 2021-11-05 武汉大学 Object-level remote sensing change detection method and system based on dual-correlation attention
WO2023007198A1 (en) * 2021-07-27 2023-02-02 Számítástechnikai És Automatizálási Kutatóintézet Training method for training a change detection system, training set generating method therefor, and change detection system
CN116106012A (en) * 2021-11-11 2023-05-12 中国科学院沈阳自动化研究所 Rolling bearing domain adaptive fault diagnosis method based on attention mechanism
US20230274531A1 (en) * 2022-02-09 2023-08-31 Soochow University Global and local feature reconstruction network-based medical image segmentation method
CN114842363A (en) * 2022-07-04 2022-08-02 南方电网科学研究院有限责任公司 Identification method and system for key power equipment in digital twin platform area
CN115545092A (en) * 2022-08-31 2022-12-30 哈尔滨工业大学(威海) Aero-engine small sample fault diagnosis method based on deep twin self-attention network
CN116563337A (en) * 2023-04-11 2023-08-08 武汉大学 Target tracking method based on double-attention mechanism
CN116401603A (en) * 2023-04-26 2023-07-07 哈尔滨工程大学 Multi-mode bearing fault intelligent diagnosis method based on transfer learning

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
KAI HUANG等: "SiamCAM: A real-time Siamese network for object tracking with compensating attention mechanism", 《APPL. SCI.》, vol. 12, no. 8, 13 April 2022 (2022-04-13), pages 1 - 16 *
KUN XU等: "Interpretable Siamese dual attention enhancement transfer compound diagnostic model for unbalanced samples", 《STRUCTURAL HEALTH MONITORING》, 7 March 2024 (2024-03-07) *
薛俊康: "基于双支结构深度学习模型的建筑物变化检测", 《中国优秀硕士学位论文全文数据库_工程科技Ⅱ辑》, 15 August 2023 (2023-08-15), pages 028 - 24 *
赵文清等: "基于改进YOLOv5s的输电线路螺栓缺销检测方法", 《华北电力大学学报(自然科学版)》, 7 September 2022 (2022-09-07), pages 1 - 10 *

Also Published As

Publication number Publication date
CN117574259B (en) 2024-05-07

Similar Documents

Publication Publication Date Title
CN109765053B (en) Rolling bearing fault diagnosis method using convolutional neural network and kurtosis index
CN109186973B (en) Mechanical fault diagnosis method of unsupervised deep learning network
CN111914883B (en) Spindle bearing state evaluation method and device based on deep fusion network
CN109858352B (en) Fault diagnosis method based on compressed sensing and improved multi-scale network
CN112036547B (en) Rolling bearing residual life prediction method combining automatic feature extraction with LSTM
CN110866365A (en) Mechanical equipment intelligent fault diagnosis method based on partial migration convolutional network
CN112418013B (en) Complex working condition bearing fault diagnosis method based on meta-learning under small sample
CN110991295B (en) Self-adaptive fault diagnosis method based on one-dimensional convolutional neural network
CN111046945A (en) Fault type and damage degree diagnosis method based on combined convolutional neural network
CN110595780A (en) Bearing fault identification method based on vibration gray level image and convolution neural network
CN112257528B (en) Wind power gear box fault diagnosis method based on wavelet transformation and dense connection expansion convolutional neural network
CN112378660A (en) Intelligent fault diagnosis method for aero-engine bearing based on data driving
CN116358871B (en) Rolling bearing weak signal composite fault diagnosis method based on graph rolling network
CN114034481A (en) Fault diagnosis system and method for rolling mill gearbox
CN111444871A (en) Fault diagnosis method for multi-scale deep convolution neural network planetary gearbox
CN114169377A (en) G-MSCNN-based fault diagnosis method for rolling bearing in noisy environment
CN114755017B (en) Variable-speed bearing fault diagnosis method of cross-domain data driving unsupervised field shared network
CN113537152A (en) Flow field state fault detection method based on deep neural network
CN116793682A (en) Bearing fault diagnosis method based on iCORAL-MMD and anti-migration learning
CN116186641A (en) Cross-equipment rolling bearing small sample fault diagnosis method based on feature fusion
CN113916535B (en) Bearing diagnosis method, system, equipment and medium based on time frequency and CNN
CN113776835B (en) Rolling bearing fault diagnosis method based on enhanced Mel linear frequency cepstrum coefficient
Liang et al. Multibranch and multiscale dynamic convolutional network for small sample fault diagnosis of rotating machinery
CN117574259B (en) Attention twin intelligent migration interpretability diagnosis method suitable for high-end equipment
CN112766200A (en) Planetary gear box small sample fault diagnosis method based on one-dimensional convolutional neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant