CN112232252B - Transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation - Google Patents
Transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation Download PDFInfo
- Publication number
- CN112232252B CN112232252B CN202011152085.7A CN202011152085A CN112232252B CN 112232252 B CN112232252 B CN 112232252B CN 202011152085 A CN202011152085 A CN 202011152085A CN 112232252 B CN112232252 B CN 112232252B
- Authority
- CN
- China
- Prior art keywords
- domain
- sample
- target domain
- target
- fault
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 70
- 238000003745 diagnosis Methods 0.000 title claims abstract description 45
- 230000003044 adaptive effect Effects 0.000 title claims abstract description 35
- 230000005540 biological transmission Effects 0.000 title claims abstract description 26
- 230000006978 adaptation Effects 0.000 claims abstract description 15
- 238000000605 extraction Methods 0.000 claims abstract description 14
- 238000009826 distribution Methods 0.000 claims abstract description 13
- 238000013145 classification model Methods 0.000 claims abstract description 12
- 230000006870 function Effects 0.000 claims description 44
- 230000008569 process Effects 0.000 claims description 22
- 239000011159 matrix material Substances 0.000 claims description 18
- 239000013598 vector Substances 0.000 claims description 18
- 230000004913 activation Effects 0.000 claims description 6
- 238000012545 processing Methods 0.000 claims description 4
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 claims description 3
- 230000002195 synergetic effect Effects 0.000 claims description 3
- 238000010276 construction Methods 0.000 claims description 2
- 238000012549 training Methods 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 235000014653 Carica parviflora Nutrition 0.000 description 1
- 241000243321 Cnidaria Species 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000010248 power generation Methods 0.000 description 1
- 238000012706 support-vector machine Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
- G06F2218/08—Feature extraction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
- G06F2218/12—Classification; Matching
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Probability & Statistics with Applications (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Signal Processing (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Computational Mathematics (AREA)
- Algebra (AREA)
- Computing Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Testing And Monitoring For Control Systems (AREA)
Abstract
The invention discloses a transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation, which comprises the following steps: constructing a transmission chain fault database; constructing an unsupervised domain adaptive feature extraction and classification model; establishing an unsupervised domain adaptive fault diagnosis model fused with optimal transportation; and sending the target domain data into a trained feature extractor and a softmax classifier to obtain a predictive label of the target domain data, and calculating the classification accuracy of the predictive label. According to the invention, the optimal transport theory and the domain adaptation theory are combined together, and the distribution of the feature space and the label space is aligned by minimizing the distance between the features of the source domain and the target domain and the labels, so that the domain adaptation problem of the source domain and the target domain in fault diagnosis is solved, and the precision of fault diagnosis is improved.
Description
Technical Field
The invention relates to a transmission chain fault diagnosis method, in particular to a transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation.
Background
Transmission chains play an important role in industrial applications, being part of many industrial systems, such as high-speed rail, aircraft, wind power systems, etc., but transmission chains tend to fail. The fault diagnosis of the transmission chain is important to ensure the safe operation of the system, reduce the machine downtime and save the maintenance cost. Therefore, the failure diagnosis of the power train is receiving a great deal of attention, and a large number of failure diagnosis methods are proposed.
Fault diagnosis methods can be broadly classified into model-based methods and data-driven methods. Early fault diagnosis is mainly based on a physical model, and can accurately describe the connection between faults and related industrial systems; however, this fault diagnosis method has two main disadvantages: 1) Highly dependent on a priori knowledge of the system; 2) Disturbances in the industrial operation process and some assumptions about the system, such as the form of noise and the operating conditions of the system, may not be appropriate, which may lead to uncertainties and misdiagnosis. The data-driven fault diagnosis method directly analyzes the acquired data, and uses techniques such as signal processing, machine learning and the like, so that dependence on priori knowledge of a system can be reduced, and the method is more suitable for modern industrial application; such as Hilbert transform and wavelet transform, are widely used for fault signature analysis. While these methods can achieve good performance in fault diagnosis tasks, they still face the challenge of automatically extracting fault signatures from the initial fault signal. Moreover, manually extracting fault signatures is very time consuming.
In recent years, deep learning has been greatly advanced in the field of fault diagnosis, such as deep stack networks and convolutional neural networks, and has been used to solve the problem of fault diagnosis due to good capability of automatically extracting features. Most existing methods can accurately detect faults, but the success of these methods is based on two assumptions: 1) There is a large amount of tagged training data; 2) The training data of the source domain and the test data of the target domain are distributed identically. When these two assumptions are violated, the performance of these algorithms may drop significantly. However, in practical industrial applications, such as wind power generation systems, these assumptions often fail due to changes in the operating environment and instability in the load torque. Also, collecting data is very time consuming, so it is often difficult to collect a large amount of tagged training data. It is also widely appreciated that models trained from raw data may be less reliable. Even though tag data may be available under certain operating conditions, the data distribution may change with new operating conditions.
Domain adaptation is an effective method for solving the problem of data imbalance or data scarcity, and can effectively relieve the influence of data distribution difference and remarkably improve the performance of the classifier. However, the existing fault diagnosis method based on domain adaptation theory mainly has two disadvantages: 1) When the distribution of the source domain and the target domain is multi-modal, the probability measures used by these methods cannot represent the difference between the source domain and the target domain; 2) Only the differences in the source and target domains are considered in the high-dimensional feature space, while the differences in the tag space are often ignored, resulting in inadequate adaptation. The optimal transport (Optimal transport, OT) theory is a powerful tool to calculate the distance between probability distributions. The OT distance can be calculated directly from the distributed samples without the need for density estimation or other non-parametric methods.
Disclosure of Invention
In order to solve the technical problems, the invention provides the transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation, which is simple in algorithm and high in diagnosis precision.
The technical scheme for solving the problems is as follows: an optimal transportation-based transmission chain unsupervised domain adaptation fault diagnosis method comprises the following steps:
(1) Constructing a transmission chain fault database: collecting fault data of a source domain and a target domain of a transmission chain, wherein a fault sample set of the source domain of an active label is as followsIts corresponding label is->Wherein->For the mth source domain failure sample, +.>Is->Corresponding labels, s represents a source domain, and m is the total number of source domain samples; the unlabeled target domain fault sample set isWherein->The method comprises the steps that an nth target domain fault sample is obtained, t represents a target domain, and n is the total number of target domain samples; the source domain fault sample set and the target domain fault sample set together form a transmission chain fault database;
(2) Constructing an unsupervised domain adaptive feature extraction and classification model: an automatic encoder is adopted as a feature extractor f, features of fault samples of a source domain and a target domain are automatically extracted, a softmax classifier g is trained by using the source domain samples, so that a prediction label of the fault samples of the target domain is obtained, and unsupervised domain adaptation feature extraction of the fault samples of the target domain and construction of a classification model are realized;
(3) Establishing an unsupervised domain adaptive fault diagnosis model integrating optimal transportation: processing fault samples of a source domain and a target domain through unsupervised domain adaptive feature extraction and classification models to obtain sample features and corresponding labels; the feature extractor f and the softmax classifier g are fused into an objective function of an optimal transportation solver, a transportation planning matrix gamma is calculated through sample features and labels, the Ji Yuanyu and the target domain are distributed, and the source domain label information is transported to the target domain; alternately optimizing the unsupervised domain adaptive feature extraction and classification model and the optimal transport solver, so as to obtain an unsupervised domain adaptive fault diagnosis model fused with optimal transport;
(4) And sending the target domain data into a trained feature extractor f and a softmax classifier g to obtain a predictive label of the target domain data, and calculating the classification accuracy of the predictive label.
In the above-mentioned transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation, in the step (2), an automatic encoder is adopted as a feature extractor f, and the specific process of automatically extracting the features of the source domain and the target domain fault samples is as follows:
the automatic encoder passes through a function h θ Encoding the input data x into a representative feature y; this process is expressed as:
y=h θ (x)=σ(Wx+b)
wherein θ represents a parameter of the encoding portion; w represents a weight matrix of the coding process, and b represents a bias vector of the coding process; sigma represents an activation function;
accordingly, the decoding part passes through the function h' θ′ Reconstructing the representative features back into the input data:
x′=h′ θ′ (y)=σ′(W′y+b′)
wherein θ ' represents parameters of the decoding part, x ' represents reconstructed input data, W ' represents a weight matrix of the decoding process, b ' represents a bias vector of the decoding process, and σ ' represents an activation function of the decoding process;
the loss function of the entire automatic encoder network is:
wherein x is ρ And x' ρ Rho=1, 2, …, I is the number of input data, |x ', representing the rho-th input data and the decoder reconstructed input data, respectively' ρ -x ρ || 2 Represents x' ρ -x ρ 2-norms of (2).
In the above-mentioned optimal transport-based drive chain unsupervised domain adaptive fault diagnosis method, in the step (2), the softmax classifier is used for estimating the probability that each sample belongs to each category, and the category with the highest probability is used as the category of the sample;
given input data and corresponding tags { (x) (1) ,y (1) ),(x (2) ,y (2) ),…,(x (τ) ,y (τ) ) X, where x (τ) Represents the τ input data, y (τ) Represents x (τ) Corresponding tag, and y (τ) E {1,2, …, k }, k representing the number of categories of tags; input data x (τ) Tag probability of (2)The method comprises the following steps:
wherein,parameters representing a softmax classifier, T representing the transpose of the vector,/for the vectors>Representing all +.>Summing to normalize the probability value; />Represents the first parameter->Is a transpose of (2);
cross entropy loss is used as a loss function of a softmax classifier
Wherein 1{ expression is true } = 1,1{ expression is false } = 0, y (α) For the alpha-th input data x (α) α=1, 2, …, τ, β represents a class β tag, β=1, 2, …, k;represents the beta parameter->Is a transpose of (a).
In the above-mentioned transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transport, in the step (3), the optimal transport solver is used to calculate the distance between the probability distribution of the source domain and the target domain, and the domain adaptation problem is regarded as a special case of the discrete optimal transport problem;
let the edge distribution of the source domain and target domain samples be μ s Sum mu t The distance between the source domain and target domain feature space and the tag space is:
wherein C (i, j) represents the cost of moving the probability mass from the i-th source domain sample feature and label to the j-th target domain sample feature and label; c is a cost matrix, i=1, 2, …, m, j=1, 2, …, n;features representing the ith source domain sample; />Features representing a jth target domain sample; η represents a trade-off parameter; />Representing calculating an L2 distance between sample features of the source domain and the target domain; />Representing a cross entropy loss between the computed source domain and target domain labels;
since the target domain samples do not have corresponding tags,cannot be used directly, so generating a tag of the target domain sample feature with the source domain trained softmax classifier g>Namely:
the objective function for solving the optimal transport plan is:
wherein, pi= { γ∈ (R + ) m×n |γ1 n =μ s ,γ T 1 m =μ t -represents a set of all transport plans γ; gamma (i, j) is the probability mass of the ith source domain sample feature and label being transferred to the jth target domain sample feature and label; 1 n N×1 vectors representing elements 1; 1 m An mx1 vector representing an element of 1; (R) + ) m×n Representing a positive real matrix with dimensions m x n;<·,·>representing the Frobenius dot product;
initial value gamma of transport planning 0 The method comprises the following steps:
wherein C is 0 Cost matrix for moving probability mass from source domain sample to target domain sample, andγ 0 (i, j) is the probability mass of the ith source domain sample transitioning to the jth target domain sample.
In the above-mentioned transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation, in the step (3), the objective loss function to be optimized for the whole model is:
wherein f' denotes a decoder of the automatic encoder,and->Respectively represent the source domain featuresAnd->Decoding into input data; l (L) f And L g Representing the loss functions of the feature extractor f and softmax classifier g, respectively;
alternately solving the objective loss function using a synergistic descent algorithm:
when the parameters of the feature extractor f and softmax classifier g are fixed, the target loss function is written as:
the above is a standard linear programming problem, and is solved by using a network simplex flow algorithm;
when the transport plan γ is fixed, the target loss function is written as:
the Adam algorithm is used to solve the above equation.
In the above optimal transportation-based driving chain unsupervised domain adaptive fault diagnosis method, in the step (4), the prediction accuracy of the target domain sample label is calculated according to the following formula:
wherein,for sample->Predicted tag,/->Is a corresponding true label; 1 {. Cndot. } is a binary function, whenWhen (I)>1 is shown in the specification; when->When (I)>Is 0.
The invention has the beneficial effects that:
1. according to the invention, the AE network is used as a feature extractor to extract representative features in the original data, and the softmax classifier trained by the source domain sample is used for classifying the target domain fault sample, so that the unsupervised fault diagnosis of the target domain sample is realized.
2. According to the invention, the optimal transmission theory is fused into the domain adaptation problem, and the feature extractor and the softmax classifier trained on the source domain sample are used for the target domain sample through the feature space and the label space of Ji Yuanyu and the target domain, so that the feature extraction capability and the fault diagnosis capability are improved.
Drawings
FIG. 1 is a flow chart of the present invention.
FIG. 2 is a schematic diagram of an automatic encoder according to the present invention.
FIG. 3 is a schematic diagram of the optimal transportation problem of the present invention.
FIG. 4 is a bar graph of a comparative experiment of the present invention.
Detailed Description
The invention is further described below with reference to the drawings and examples.
As shown in fig. 1, a transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation includes the following steps:
(1) Constructing a transmission chain fault database: collecting fault data of a source domain and a target domain of a transmission chain, wherein a fault sample set of the source domain of an active label is as followsIts corresponding label is->Wherein->For the mth source domain failure sample, +.>Is->Corresponding labels, s represents a source domain, and m is the total number of source domain samples; the unlabeled target domain fault sample set isWherein->For the nth target domain failure sample, t represents the target domain, and n is the total number of target domain samples.
(2) Constructing an unsupervised domain adaptive feature extraction and classification model: and an automatic encoder is adopted as a feature extractor f to automatically extract the features of the source domain fault samples and the target domain fault samples, and a softmax classifier g is trained by using the source domain samples, so that a prediction label of the target domain fault samples is obtained, and the unsupervised domain adaptation feature extraction and classification of the target domain fault samples are realized.
Automatic encoders are used to learn more compact sample features with class discrimination, thereby speeding up the training process;
as shown in fig. 2, the automatic encoder passes through a function h θ The input data x is encoded as a representative feature y. This process is expressed as:
y=h θ (x)=σ(Wx+b)
where θ represents a parameter of the encoded section. W represents a weight matrix of the encoding process, and b represents a bias vector of the encoding process. Sigma represents an activation function;
accordingly, the decoding part passes through the function h' θ′ Reconstructing the representative features back into the input data:
x′=h′ θ′ (y)=σ′(W′y+b′)
where θ' represents a parameter of the decoding section. x' is reconstructed input data. W 'represents a weight matrix of the decoding process, and b' represents a bias vector of the decoding process. σ' represents an activation function of the decoding process;
the loss function of the entire automatic encoder network is:
wherein x is ρ And x' ρ The ρ -th input data and the decoder reconstructed input data are represented by ρ=1, 2, …, I, respectively, which are the number of input data. ||x' ρ -x ρ || 2 Represents x' ρ -x ρ 2-norms of (2);
the softmax classifier is the most common algorithm to solve the multi-classification problem. The main function of Softmax is to estimate the probability that each sample belongs to each category, and the category with the highest probability is taken as the category of the sample;
given input data and corresponding tags { (x) (1) ,y (1) ),(x (2) ,y (2) ),…,(x (τ) ,y (τ) ) }. Wherein x is (τ) Represents the τ input data, y (τ) Represents x (τ) Corresponding tag, and y (τ) E {1,2, …, k }, k representing the number of categories of tags. Input data x (τ) The tag probability of (2) is:
wherein,parameters representing a softmax classifier, T representing the transpose of the vector,/for the vectors>Representing all +.>Summing, and normalizing the probability value. />Represents the first parameter->Is a transpose of (2);
the cross entropy loss is used as a loss function of the softmax classifier:
where 1{ expression is true } = 1,1{ expression is false } = 0.y is (α) For the alpha-th input data x (α) α=1, 2, …, τ. β represents a class β tag, β=1, 2, …, k.Represents the beta parameter->Is a transpose of (a).
(3) Establishing an unsupervised domain adaptive fault diagnosis model integrating optimal transportation: processing fault samples of a source domain and a target domain through unsupervised domain adaptive feature extraction and classification models to obtain sample features and corresponding labels; the feature extractor f and the softmax classifier g are fused into an objective function of an optimal transport OT solver, a transport planning matrix gamma is calculated through sample features and labels, so that the Ji Yuanyu and the target domain are distributed, and the source domain label information is transported to the target domain; and alternately optimizing the unsupervised domain adaptive feature extraction and classification model and the OT solver, so as to obtain the unsupervised domain adaptive fault diagnosis model fused with optimal transportation.
The optimal transport OT is used to calculate the distance between the source domain and the target domain probability distribution. Domain adaptation problems can be seen as a special case of discrete OT problems;
as shown in FIG. 3, let the edge distributions of the source domain and target domain samples be μ, respectively s Sum mu t The distance between the source domain and target domain feature space and the tag space is:
where C (i, j) represents the cost of moving the probability mass from the i-th source domain sample feature and label to the j-th target domain sample feature and label. C is a cost matrix, i=1, 2, …, m, j=1, 2, …, n.Representing the characteristics of the ith source domain sample. />Features representing the jth target field sample. η represents a trade-off parameter. />Representing the L2 distance between the sample features of the computed source and target domains. />Representing a cross entropy loss between the computed source domain and target domain labels;
since the target domain samples do not have corresponding tags,cannot be used directly, so generating a tag of the target domain sample feature with the source domain trained softmax classifier g>Namely:
the objective function for solving the optimal transport plan is:
wherein, pi= { γ∈ (R + ) m×n |γ1 n =μ s ,γ T 1 m =μ t And, representing the aggregate of all transport plans γ. Gamma (i, j) is the probability mass of the ith source domain sample feature and label transitioning to the jth target domain sample feature and label. 1 n N×1 vector representing element 1,1 m An mx1 vector representing an element of 1. (R) + ) m×n Representing a positive real matrix with dimensions m x n.<·,·>Representing the Frobenius dot product;
initial value gamma of transport planning 0 The method comprises the following steps:
wherein C is 0 Cost matrix for moving probability mass from source domain sample to target domain sample, andγ 0 (i, j) is the probability mass of the ith source domain sample transitioning to the jth target domain sample;
the target loss function to be optimized for the whole model is:
wherein f' denotes a decoder of the automatic encoder,and->Respectively represent the source domain featuresAnd->Decoded into input data. L (L) f And L g Representing the loss functions of the feature extractor f and softmax classifier g, respectively;
the objective loss function is solved alternately using a synergistic descent algorithm. When the parameters of the feature extractor f and softmax classifier g are fixed, the target loss function can be written as:
the above is a standard linear programming problem, which can be solved by using a network simplex flow algorithm;
when the transport plan γ is fixed, the objective loss function can be written as:
the Adam algorithm is used to solve the above equation.
(4) And sending the target domain data into a trained feature extractor f and a softmax classifier g to obtain a predictive label of the target domain data, and calculating the classification accuracy of the predictive label.
The prediction accuracy of the target domain sample tag is calculated as follows:
wherein,for sample->Predicted tag,/->Is the corresponding true tag. 1 {. Cndot. } is a binary function, whenWhen (I)>1 is shown in the specification; when->When (I)>Is 0.
In order to verify the effectiveness of the invention, the invention selects the real bearing fault data in the transmission system for verification. The source domain samples are normal and fault samples in a load state and are labeled. While the target domain samples are normal and fault samples in the other state and have no label. The motor is loaded with four types of motor, namely 0hp, 1hp, 2hp and 3hp, and 6 types of cross domains are formed for experimental verification: 0-1hp, 0-2hp, 0-3hp, 1-2hp, 1-3hp and 2-3hp. The fault diameters were 0.007 inches and 0.014 inches. The experiment selects the methods of a support vector machine SVM, a K-nearest neighbor, a softmax classifier, a back propagation neural network BP, migration component analysis TCA, a joint distribution adaptation JDA, a correlation alignment method CORAL and the like, and the method is used for carrying out a comparison experiment. The experimental result is shown in fig. 4, and the method can realize bearing fault diagnosis under various working conditions, and the accuracy is higher than that of other comparison methods.
In summary, according to the optimal transportation-based driving chain unsupervised domain adaptive fault diagnosis method, ji Yuanyu and target domains are distributed by minimizing the distances between the fault feature space and the label space of the source domain and the target domain, so that the depth features of target domain samples are better extracted, the label classifier trained by the source domain can be well used for target domain data, the target domain unsupervised domain adaptive label prediction is realized, and the fault diagnosis precision is improved.
Claims (6)
1. An optimal transportation-based transmission chain unsupervised domain adaptation fault diagnosis method is characterized by comprising the following steps of: the method comprises the following steps:
(1) Constructing a transmission chain fault database: collecting fault data of a source domain and a target domain of a transmission chain, wherein a fault sample set of the source domain of an active label is as followsIts corresponding label is->Wherein->For the mth source domain failure sample, +.>Is->Corresponding labels, s represents a source domain, and m is the total number of source domain samples; the unlabeled target domain fault sample set isWherein->The method comprises the steps that an nth target domain fault sample is obtained, t represents a target domain, and n is the total number of target domain samples; the source domain fault sample set and the target domain fault sample set together form a transmission chain fault database;
(2) Constructing an unsupervised domain adaptive feature extraction and classification model: an automatic encoder is adopted as a feature extractor f, features of fault samples of a source domain and a target domain are automatically extracted, a softmax classifier g is trained by using the source domain samples, so that a prediction label of the fault samples of the target domain is obtained, and unsupervised domain adaptation feature extraction of the fault samples of the target domain and construction of a classification model are realized;
(3) Establishing an unsupervised domain adaptive fault diagnosis model integrating optimal transportation: processing fault samples of a source domain and a target domain through unsupervised domain adaptive feature extraction and classification models to obtain sample features and corresponding labels; the feature extractor f and the softmax classifier g are fused into an objective function of an optimal transportation solver, a transportation planning matrix gamma is calculated through sample features and labels, the Ji Yuanyu and the target domain are distributed, and the source domain label information is transported to the target domain; alternately optimizing the unsupervised domain adaptive feature extraction and classification model and the optimal transport solver, so as to obtain an unsupervised domain adaptive fault diagnosis model fused with optimal transport;
(4) And sending the target domain data into a trained feature extractor f and a softmax classifier g to obtain a predictive label of the target domain data, and calculating the classification accuracy of the predictive label.
2. The optimal transport-based drive train unsupervised domain adaptive fault diagnosis method according to claim 1, characterized by: in the step (2), an automatic encoder is adopted as a feature extractor f, and the specific process of automatically extracting the features of the source domain and target domain fault samples is as follows:
the automatic encoder passes through a function h θ Encoding the input data x into a representative feature y; this process is expressed as:
y=h θ (x)=σ(Wx+b)
wherein θ represents a parameter of the encoding portion; w represents a weight matrix of the coding process, and b represents a bias vector of the coding process; sigma represents an activation function;
accordingly, the decoding part passes through the function h' θ′ Reconstructing the representative features back into the input data:
x′=h′ θ′ (y)=σ′(W′y+b′)
wherein θ ' represents parameters of the decoding part, x ' represents reconstructed input data, W ' represents a weight matrix of the decoding process, b ' represents a bias vector of the decoding process, and σ ' represents an activation function of the decoding process;
the loss function of the entire automatic encoder network is:
wherein x is ρ And x' ρ Rho=1, 2, …, I is the number of input data, |x ', representing the rho-th input data and the decoder reconstructed input data, respectively' ρ -x ρ || 2 Represents x' ρ -x ρ 2-norms of (2).
3. The optimal transport-based drive train unsupervised domain adaptive fault diagnosis method according to claim 2, characterized by: in the step (2), the softmax classifier has the functions of estimating the probability that each sample belongs to each category, and taking the category with the highest probability as the category of the sample;
given input data and corresponding tags { (x) (1) ,y (1) ),(x (2) ,y (2) ),…,(x (τ) ,y (τ) ) X, where x (τ) Represents the τ input data, y (τ) Represents x (τ) Corresponding tag, and y (τ) E {1,2, …, k }, k representing the number of categories of tags; input data x (τ) Tag probability of (2)The method comprises the following steps:
wherein,parameters representing a softmax classifier, T representing the transpose of the vector,/for the vectors>Representing all +.>Summing to normalize the probability value; />Represents the first parameter->Is a transpose of (2);
cross entropy loss is used as a loss function of a softmax classifier
Wherein 1{ expression is true } = 1,1{ expression is false } = 0, y (α) For the alpha-th input data x (α) α=1, 2, …, τ, β represents a class β tag, β=1, 2, …, k;represents the beta parameter->Is a transpose of (a).
4. The optimal transport-based drive train unsupervised domain adaptive fault diagnosis method according to claim 3, characterized by: in the step (3), the optimal transportation solver is used for calculating the distance between the probability distribution of the source domain and the probability distribution of the target domain, and the domain adaptation problem is regarded as a special case of the discrete optimal transportation problem;
let the edge distribution of the source domain and target domain samples be μ s Sum mu t The distance between the source domain and target domain feature space and the tag space is:
wherein C (i, j) represents the cost of moving the probability mass from the i-th source domain sample feature and label to the j-th target domain sample feature and label; c is a cost matrix, i=1, 2, …, m, j=1, 2, …, n;features representing the ith source domain sample; />Features representing a jth target domain sample; η represents a trade-off parameter; />Representing calculating an L2 distance between sample features of the source domain and the target domain; />Representing a cross entropy loss between the computed source domain and target domain labels;
since the target domain samples do not have corresponding tags,cannot be used directly, so generating a tag of the target domain sample feature with the source domain trained softmax classifier g>Namely:
the objective function for solving the optimal transport plan is:
wherein,representing a set of all transport plans γ; gamma (i, j) is the probability mass of the ith source domain sample feature and label being transferred to the jth target domain sample feature and label; 1 n N×1 vectors representing elements 1; 1 m An mx1 vector representing an element of 1; (R) + ) m×n Representing a positive real matrix with dimensions m x n;<·,·>representing the Frobenius dot product;
initial value gamma of transport planning 0 The method comprises the following steps:
wherein C is 0 Cost matrix for moving probability mass from source domain sample to target domain sample, andγ 0 (i, j) is the probability mass of the ith source domain sample transitioning to the jth target domain sample.
5. The optimal transport-based drive train unsupervised domain adaptive fault diagnosis method according to claim 4, characterized by: in the step (3), the objective loss function to be optimized for the whole model is as follows:
wherein f' denotes a decoder of the automatic encoder,and->Respectively represent the source domain feature->And->Decoding into input data; l (L) f And L g Representing the loss functions of the feature extractor f and softmax classifier g, respectively;
alternately solving the objective loss function using a synergistic descent algorithm:
when the parameters of the feature extractor f and softmax classifier g are fixed, the target loss function is written as:
the above is a standard linear programming problem, and is solved by using a network simplex flow algorithm;
when the transport plan γ is fixed, the target loss function is written as:
the Adam algorithm is used to solve the above equation.
6. The optimal transport-based drive train unsupervised domain adaptive fault diagnosis method according to claim 5, characterized by: in the step (4), the prediction accuracy of the target domain sample tag is calculated according to the following formula:
wherein,for sample->Predicted tag,/->Is a corresponding true label; 1 {. Cndot. } is a binary function, when +.>When (I)>1 is shown in the specification; when->When (I)>Is 0.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011152085.7A CN112232252B (en) | 2020-10-23 | 2020-10-23 | Transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011152085.7A CN112232252B (en) | 2020-10-23 | 2020-10-23 | Transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112232252A CN112232252A (en) | 2021-01-15 |
CN112232252B true CN112232252B (en) | 2023-12-01 |
Family
ID=74109601
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011152085.7A Active CN112232252B (en) | 2020-10-23 | 2020-10-23 | Transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112232252B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112784920B (en) * | 2021-02-03 | 2024-02-13 | 湖南科技大学 | Yun Bianduan coordinated rotating component reactance domain self-adaptive fault diagnosis method |
CN112990259B (en) * | 2021-02-04 | 2023-12-26 | 西交利物浦大学 | Early fault diagnosis method for rotary mechanical bearing based on improved transfer learning |
CN113935406B (en) * | 2021-09-27 | 2023-05-16 | 苏州大学 | Mechanical equipment unsupervised fault diagnosis method based on countermeasure flow model |
CN113822228B (en) * | 2021-10-27 | 2024-03-22 | 南京大学 | User expression recognition method and system based on continuous learning |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109376620A (en) * | 2018-09-30 | 2019-02-22 | 华北电力大学 | A kind of migration diagnostic method of gearbox of wind turbine failure |
CN109580215A (en) * | 2018-11-30 | 2019-04-05 | 湖南科技大学 | A kind of wind-powered electricity generation driving unit fault diagnostic method generating confrontation network based on depth |
CN109902393A (en) * | 2019-03-01 | 2019-06-18 | 哈尔滨理工大学 | Fault Diagnosis of Roller Bearings under a kind of variable working condition based on further feature and transfer learning |
CN110837850A (en) * | 2019-10-23 | 2020-02-25 | 浙江大学 | Unsupervised domain adaptation method based on counterstudy loss function |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190325861A1 (en) * | 2018-04-18 | 2019-10-24 | Maneesh Kumar Singh | Systems and Methods for Automatic Speech Recognition Using Domain Adaptation Techniques |
-
2020
- 2020-10-23 CN CN202011152085.7A patent/CN112232252B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109376620A (en) * | 2018-09-30 | 2019-02-22 | 华北电力大学 | A kind of migration diagnostic method of gearbox of wind turbine failure |
CN109580215A (en) * | 2018-11-30 | 2019-04-05 | 湖南科技大学 | A kind of wind-powered electricity generation driving unit fault diagnostic method generating confrontation network based on depth |
CN109902393A (en) * | 2019-03-01 | 2019-06-18 | 哈尔滨理工大学 | Fault Diagnosis of Roller Bearings under a kind of variable working condition based on further feature and transfer learning |
CN110837850A (en) * | 2019-10-23 | 2020-02-25 | 浙江大学 | Unsupervised domain adaptation method based on counterstudy loss function |
Non-Patent Citations (1)
Title |
---|
基于域对抗迁移的变工况滚动轴承故障诊断模型;金余丰;刘晓锋;姚美常;黄凤良;;自动化仪表(第12期);55-60 * |
Also Published As
Publication number | Publication date |
---|---|
CN112232252A (en) | 2021-01-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112232252B (en) | Transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation | |
CN112784965B (en) | Large-scale multi-element time series data anomaly detection method oriented to cloud environment | |
CN109992668B (en) | Self-attention-based enterprise public opinion analysis method and device | |
CN110532542B (en) | Invoice false invoice identification method and system based on positive case and unmarked learning | |
CN111860677A (en) | Rolling bearing transfer learning fault diagnosis method based on partial domain confrontation | |
CN112015863B (en) | Multi-feature fusion Chinese text classification method based on graphic neural network | |
Li et al. | Confidence-based active learning | |
CN111461157B (en) | Self-learning-based cross-modal Hash retrieval method | |
CN112926303A (en) | Malicious URL detection method based on BERT-BiGRU | |
CN116976857B (en) | Predictive maintenance method and system for railway power equipment based on deep learning | |
Dai et al. | Hybrid deep model for human behavior understanding on industrial internet of video things | |
CN115099296A (en) | Sea wave height prediction method based on deep learning algorithm | |
CN116910013A (en) | System log anomaly detection method based on semantic flowsheet mining | |
CN114299326A (en) | Small sample classification method based on conversion network and self-supervision | |
CN111177010A (en) | Software defect severity identification method | |
CN114943233A (en) | Parallelization-improvement-based entity naming identification method for Transformer-CRF network | |
Kamruzzaman et al. | A hybrid learning algorithm for text classification | |
CN113569048A (en) | Method and system for automatically dividing affiliated industries based on enterprise operation range | |
CN116610846A (en) | Breeding screening method and system for cattle breeding | |
CN117216264A (en) | Machine tool equipment fault analysis method and system based on BERT algorithm | |
CN116502123A (en) | Non-supervision cross-domain prediction method for residual service life of aero-engine | |
Passalis et al. | Deep temporal logistic bag-of-features for forecasting high frequency limit order book time series | |
CN115965026A (en) | Model pre-training method and device, text analysis method and device and storage medium | |
CN111523557A (en) | Wind power intelligent fault diagnosis method based on big data | |
Brønstad | Data-driven detection and identification of undesirable events in subsea oil wells |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |