CN112232252A - Optimal transport-based transmission chain unsupervised domain adaptive fault diagnosis method - Google Patents
Optimal transport-based transmission chain unsupervised domain adaptive fault diagnosis method Download PDFInfo
- Publication number
- CN112232252A CN112232252A CN202011152085.7A CN202011152085A CN112232252A CN 112232252 A CN112232252 A CN 112232252A CN 202011152085 A CN202011152085 A CN 202011152085A CN 112232252 A CN112232252 A CN 112232252A
- Authority
- CN
- China
- Prior art keywords
- domain
- target domain
- sample
- fault
- target
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 66
- 238000003745 diagnosis Methods 0.000 title claims abstract description 44
- 230000003044 adaptive effect Effects 0.000 title claims abstract description 40
- 230000005540 biological transmission Effects 0.000 title claims abstract description 28
- 238000009826 distribution Methods 0.000 claims abstract description 17
- 238000000605 extraction Methods 0.000 claims abstract description 15
- 238000013145 classification model Methods 0.000 claims abstract description 12
- 230000006978 adaptation Effects 0.000 claims abstract description 10
- 230000006870 function Effects 0.000 claims description 45
- 230000008569 process Effects 0.000 claims description 22
- 239000011159 matrix material Substances 0.000 claims description 18
- 239000000126 substance Substances 0.000 claims description 7
- 230000004913 activation Effects 0.000 claims description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 6
- 239000000284 extract Substances 0.000 claims description 4
- 238000012545 processing Methods 0.000 claims description 4
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 claims description 3
- 238000010606 normalization Methods 0.000 claims description 3
- 239000004576 sand Substances 0.000 claims description 3
- 238000012546 transfer Methods 0.000 claims description 3
- 238000010276 construction Methods 0.000 claims description 2
- 238000012549 training Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000012706 support-vector machine Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000010248 power generation Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
- G06F2218/08—Feature extraction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
- G06F2218/12—Classification; Matching
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Probability & Statistics with Applications (AREA)
- Bioinformatics & Computational Biology (AREA)
- Mathematical Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Biology (AREA)
- Signal Processing (AREA)
- Algebra (AREA)
- Computational Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Testing And Monitoring For Control Systems (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation, which comprises the following steps of: building a transmission chain fault database; constructing an unsupervised domain adaptive feature extraction and classification model; establishing an unsupervised domain adaptive fault diagnosis model fusing optimal transportation; and (3) sending the target domain data into the trained feature extractor and softmax classifier to obtain a prediction label of the target domain data, and calculating the classification precision of the target domain data. The invention combines the optimal transport theory and the domain adaptation theory, aligns the distribution of the feature space and the label space by minimizing the distance between the features and the labels of the fault source domain and the target domain, solves the domain adaptation problem of the source domain and the target domain in fault diagnosis, and improves the precision of fault diagnosis.
Description
Technical Field
The invention relates to a fault diagnosis method for a transmission chain, in particular to an unsupervised domain adaptive fault diagnosis method for the transmission chain based on optimal transportation.
Background
The transmission chain plays an important role in industrial application and is a component of many industrial systems, such as high-speed rails, airplanes, wind power systems and the like, but the transmission chain is prone to failure. The fault diagnosis of the transmission chain is important for ensuring the safe operation of the system, reducing the machine downtime and saving the maintenance cost. Therefore, the diagnosis of the failure of the drive train has received a great deal of attention, and a large number of failure diagnosis methods have been proposed.
Fault diagnosis methods can be broadly classified into model-based methods and data-driven methods. Early fault diagnosis is mainly based on a physical model, which can accurately describe the relation between a fault and a related industrial system; however, this fault diagnosis method has two main disadvantages: 1) highly dependent on a priori knowledge of the system; 2) disturbances in the industrial operation process and some assumptions about the system (e.g., the form of noise and the operating conditions of the system) may be inappropriate, which may lead to uncertainty and misdiagnosis. The data-driven fault diagnosis method directly analyzes the acquired data, and the technology of signal processing, machine learning and the like is used, so that the dependence on the prior knowledge of the system can be reduced, and the method is more suitable for modern industrial application; such as Hilbert transform and wavelet transform, are widely used for fault signature analysis. While these methods may achieve good performance on the task of fault diagnosis, they still face the challenge of automatically extracting fault signatures from the initial fault signal. Moreover, manual extraction of fault features is very time consuming.
In recent years, deep learning has been greatly developed for application in the field of fault diagnosis, such as a deep stack network and a convolutional neural network, which have good capability of automatically extracting features, and are used to solve the problem of fault diagnosis. Most existing methods are able to accurately detect faults, but the success of these methods is based on two assumptions: 1) a plurality of labeled training data; 2) the training data of the source domain and the test data of the target domain are distributed identically. When these two assumptions are violated, the performance of these algorithms may be significantly degraded. However, in practical industrial applications, such as wind power generation systems, these assumptions often cannot be established due to changes in the operating environment and instability in the load torque. Moreover, collecting data is very time consuming, so it is often difficult to collect a large amount of labeled training data. It is also generally recognized that models trained from raw data may be less reliable. Even if tag data is available under certain operating conditions, the data distribution may change with new operating conditions.
The domain adaptation is an effective method for solving the problem of data imbalance or data scarcity, can effectively relieve the influence of data distribution difference, and obviously improves the performance of the classifier. However, the existing fault diagnosis method based on the domain adaptation theory mainly has two disadvantages: 1) when the distribution of the source domain and the target domain is multi-modal, the probability measures used by these methods cannot represent the difference between the source domain and the target domain; 2) the difference between the source domain and the target domain is only considered in the high-dimensional feature space, and the difference in the label space is often ignored, resulting in insufficient adaptation. Optimal Transport (OT) theory, as a powerful tool, can calculate the distance between probability distributions. The OT distance can be calculated directly from the samples of the distribution without the need for density estimation or other non-parametric methods.
Disclosure of Invention
In order to solve the technical problems, the invention provides the optimal transport-based transmission chain unsupervised domain adaptive fault diagnosis method which is simple in algorithm and high in diagnosis precision.
The technical scheme for solving the problems is as follows: an optimal transportation-based transmission chain unsupervised domain adaptive fault diagnosis method comprises the following steps:
(1) building a transmission chain fault database: acquiring fault data of a source domain and a target domain of a transmission chain, wherein a labeled source domain fault sample set isThe corresponding label isWhereinFor the m-th source domain fault sample,is composed ofCorresponding labels, s represents a source domain, and m is the total number of source domain samples; unlabeled target domain fault sample set isWhereinAn nth target domain fault sample is obtained, t represents a target domain, and n is the total number of target domain samples; the source domain fault sample set and the target domain fault sample set jointly form a transmission chain fault database;
(2) constructing an unsupervised domain adaptive feature extraction and classification model: an automatic encoder is used as a feature extractor f to automatically extract features of source domain and target domain fault samples, and a softmax classifier g is trained by using the source domain samples, so that a prediction label of the target domain fault samples is obtained, and unsupervised domain adaptive feature extraction and classification model construction of the target domain fault samples are realized;
(3) establishing an unsupervised domain adaptive fault diagnosis model fusing optimal transportation: processing fault samples of a source domain and a target domain through an unsupervised domain adaptive feature extraction and classification model to obtain sample features and corresponding labels; integrating the feature extractor f and the softmax classifier g into a target function of an optimal transport solver, calculating a transport planning matrix gamma through sample features and labels, aligning the distribution of a source domain and a target domain, and transporting source domain label information to the target domain; alternately optimizing the unsupervised domain adaptive feature extraction and classification model and the optimal transport solver, thereby obtaining an unsupervised domain adaptive fault diagnosis model fusing optimal transport;
(4) and (3) sending the target domain data into a trained feature extractor f and a softmax classifier g to obtain a prediction label of the target domain data, and calculating the classification precision of the target domain data.
In the optimal transportation-based unsupervised domain adaptive fault diagnosis method for the transmission chain, in the step (2), an automatic encoder is adopted as a feature extractor f, and the specific process of automatically extracting the features of the fault samples of the source domain and the target domain comprises the following steps:
the automatic encoder passes a function hθEncoding input data x into a representative feature y; this process is represented as:
y=hθ(x)=σ(Wx+b)
wherein θ represents a parameter of the encoding portion; w represents a weight matrix of the encoding process, b represents a bias vector of the encoding process; σ represents an activation function;
accordingly, the decoding part passes through a function h'θ′Reconstructing the representative features back into the input data:
x′=h′θ′(y)=σ′(W′y+b′)
wherein θ ' represents a parameter of the decoding part, x ' is reconstructed input data, W ' represents a weight matrix of the decoding process, b ' represents a bias vector of the decoding process, and σ ' represents an activation function of the decoding process;
the loss function of the entire autoencoder network is:
wherein x isρAnd x'ρRespectively represents the ρ -th input data and the input data reconstructed by the decoder, ρ ═ 1,2, …, I is the number of input data, | x'ρ-xρ||2Represents x'ρ-xρ2-norm of (d).
In the optimal transport-based drive chain unsupervised domain adaptive fault diagnosis method, in the step (2), the softmax classifier has the function of estimating the probability that each sample belongs to each class, and taking the class with the highest probability as the class of the sample;
given input data and corresponding labels { (x)(1),y(1)),(x(2),y(2)),…,(x(τ),y(τ)) In which x(τ)Denotes the τ th input data, y(τ)Denotes x(τ)Corresponding label, and y(τ)E {1,2, …, k }, where k represents the number of categories of the tag; then data x is input(τ)Probability of label ofComprises the following steps:
wherein the content of the first and second substances,denotes the parameters of the softmax classifier, T denotes the transpose of the vector,is shown to allSumming to realize the normalization of the probability value;denotes the l parameterTransposing;
Where 1{ expression is true } -, 1{ expression is false } -, 0, and y(α)For alpha-th input data x(α)α ═ 1,2, …, τ, β denotes group β tags, β ═ 1,2, …, k;denotes the beta parameterThe transposing of (1).
In the optimal transportation-based unsupervised domain adaptive fault diagnosis method for the transmission chain, in the step (3), an optimal transportation solver is used for calculating the distance between the probability distributions of a source domain and a target domain, and the domain adaptation problem is regarded as a special case of a discrete optimal transportation problem;
let the edge distribution of the source domain and target domain samples be mu respectivelysAnd mutAnd then the distance between the feature space and the label space of the source domain and the target domain is:
wherein C (i, j) represents the cost of moving the probability mass from the ith source domain sample feature and label to the jth target domain sample feature and label; c is a cost matrix, i is 1,2, …, m, j is 1,2, …, n;features representing an ith source domain sample;features representing a jth target domain sample; η represents a trade-off parameter;indicating the L2 distance between the sample features of the computed source domain and the target domain;representing a cross-entropy loss between the compute source domain and the target domain label;
since the target domain exemplar does not have a corresponding label,cannot be used directly, so the source domain trained softmax classifier g generates labels for target domain sample featuresNamely:
the objective function for solving the optimal transport plan is:
wherein, n ═ { gamma ∈ (R)+)m×n|γ1n=μs,γ T1m=μtDenotes the set of all transport plans γ; gamma (i, j) is the probability quality of the ith source domain sample feature and label transferring to the jth target domain sample feature and label; 1nAn n × 1 vector representing an element of 1; 1mAn mx 1 vector representing an element of 1; (R)+)m×nRepresenting a positive real matrix with dimension m × n;<·,·>expressing Frobenius dot product;
initial value gamma for transport planning0The method comprises the following steps:
wherein, C0A cost matrix for moving the probability mass from the source domain samples to the target domain samples, andγ0(i, j) is the probability mass of the transfer of the ith source domain sample to the jth target domain sample.
In the above optimal transportation-based transmission chain unsupervised domain adaptive fault diagnosis method, in the step (3), the target loss function to be optimized by the whole model is as follows:
where f' denotes the decoder of the auto-encoder,andrespectively representing source domain featuresAnddecoding into input data; l isfAnd LgRespectively representing the loss functions of the feature extractor f and the softmax classifier g;
alternately solving the objective loss function using a cooperative descent algorithm:
when the parameters of the feature extractor f and the softmax classifier g are fixed, the target loss function is written as:
the above formula is a standard linear programming problem, which is solved by a network simplex flow algorithm;
when the transport plan γ is fixed, the target loss function is written as:
the Adam algorithm is used to solve the above equation.
In the optimal transportation-based transmission chain unsupervised domain adaptive fault diagnosis method, in the step (4), the prediction accuracy accurve of the target domain sample label is calculated according to the following formula:
wherein the content of the first and second substances,is a sampleThe predicted label of the tag is used to predict,is a corresponding true label; 1 {. is a binary function whenWhen the temperature of the water is higher than the set temperature,is 1; when in useWhen the temperature of the water is higher than the set temperature,is 0.
The invention has the beneficial effects that:
1. according to the invention, an AE network is used as a feature extractor to extract representative features in original data, and a softmax classifier trained by source domain samples is used for classifying the target domain fault samples, so that unsupervised fault diagnosis of the target domain samples is realized.
2. According to the method, an optimal transmission theory is fused in the domain adaptation problem, the feature extractor and the softmax classifier trained by the source domain samples are used for the target domain samples by aligning the feature space and the label space of the source domain and the target domain, and the feature extraction capability and the fault diagnosis capability are improved.
Drawings
FIG. 1 is a flow chart of the present invention.
FIG. 2 is a schematic diagram of an automatic encoder according to the present invention.
FIG. 3 is a schematic diagram of the optimal transport problem of the present invention.
FIG. 4 is a bar graph of a comparative experiment of the present invention.
Detailed Description
The invention is further described below with reference to the figures and examples.
As shown in fig. 1, a method for diagnosing an unsupervised domain adaptive fault of a transmission chain based on optimal transportation includes the following steps:
(1) building a transmission chain fault database: acquiring fault data of a source domain and a target domain of a transmission chain, wherein a labeled source domain fault sample set isThe corresponding label isWhereinFor the m-th source domain fault sample,is composed ofCorresponding labels, s represents a source domain, and m is the total number of source domain samples; unlabeled target domain fault sample set isWhereinIs the nth target domain fault sample, t represents the target domain, and n is the total number of target domain samples.
(2) Constructing an unsupervised domain adaptive feature extraction and classification model: and (3) adopting an automatic encoder as a feature extractor f to automatically extract the features of the source domain fault samples and the target domain fault samples, and training a softmax classifier g by using the source domain samples, so as to obtain a prediction label of the target domain fault samples and realize unsupervised domain adaptive feature extraction and classification of the target domain fault samples.
The auto-encoder is used to learn more compact sample features with class discrimination, thereby speeding up the training process;
as shown in FIG. 2, the autoencoder passes a function hθInput data x is encoded as representative characteristics y. This process is represented as:
y=hθ(x)=σ(Wx+b)
where θ represents a parameter of the encoding portion. W denotes a weight matrix of the encoding process, and b denotes a bias vector of the encoding process. σ represents an activation function;
accordingly, the decoding part passes through a function h'θ′Reconstructing the representative features back into the input data:
x′=h′θ′(y)=σ′(W′y+b′)
where θ' represents a parameter of the decoding portion. x' is the reconstructed input data. W 'denotes a weight matrix of the decoding process, and b' denotes a bias vector of the decoding process. σ' represents an activation function of the decoding process;
the loss function of the entire autoencoder network is:
wherein x isρAnd x'ρThe input data ρ is the input data reconstructed by the decoder, and ρ is 1,2, …, and I are the number of input data. L x'ρ-xρ||2Represents x'ρ-xρ2-range ofCounting;
the softmax classifier is the most common algorithm to solve the multi-classification problem. The main function of Softmax is to estimate the probability that each sample belongs to each class and to take the class with the highest probability as the class of the sample;
given input data and corresponding labels { (x)(1),y(1)),(x(2),y(2)),…,(x(τ),y(τ))}. Wherein x is(τ)Denotes the τ th input data, y(τ)Denotes x(τ)Corresponding label, and y(τ)E {1,2, …, k }, k representing the number of categories of the tag. Then data x is input(τ)The label probability of (a) is:
wherein the content of the first and second substances,denotes the parameters of the softmax classifier, T denotes the transpose of the vector,is shown to allAnd summing to realize the normalization of the probability value.Denotes the l parameterTransposing;
the cross entropy loss is used as a loss function for the softmax classifier:
where 1{ expression is true } -, 1, and 1{ expression is false } -, 0. y is(α)For alpha-th input data x(α)α ═ 1,2, …, τ. β represents a β -th class label, β ═ 1,2, …, k.Denotes the beta parameterThe transposing of (1).
(3) Establishing an unsupervised domain adaptive fault diagnosis model fusing optimal transportation: processing fault samples of a source domain and a target domain through an unsupervised domain adaptive feature extraction and classification model to obtain sample features and corresponding labels; integrating the feature extractor f and the softmax classifier g into a target function of an optimal transport OT solver, calculating a transport planning matrix gamma through sample features and labels to align the distribution of a source domain and a target domain, and transporting label information of the source domain to the target domain; and alternately optimizing the unsupervised domain adaptive feature extraction and classification model and the OT solver, thereby obtaining the unsupervised domain adaptive fault diagnosis model fusing the optimal transportation.
The optimal transport OT is used to calculate the distance between the source and target domain probability distributions. The domain adaptation problem can be seen as a special case of the discrete OT problem;
as shown in FIG. 3, let the edge distributions of the source domain and target domain samples be μsAnd mutAnd then the distance between the feature space and the label space of the source domain and the target domain is:
where C (i, j) represents the cost of moving the probability mass from the ith source domain sample feature and label to the jth target domain sample feature and label. C is a cost matrix, i is 1,2, …, m, j is 1,2, …, n.Representing the characteristics of the ith source domain sample.Representing the characteristics of the jth target domain sample. η represents a trade-off parameter.Indicating the L2 distance between the computed source and target domain sample features.Representing a cross-entropy loss between the compute source domain and the target domain label;
since the target domain exemplar does not have a corresponding label,cannot be used directly, so the source domain trained softmax classifier g generates labels for target domain sample featuresNamely:
the objective function for solving the optimal transport plan is:
wherein, n ═ { gamma ∈ (R)+)m×n|γ1n=μs,γ T1m=μtRepresents the set of all transport plans γ. And gamma (i, j) is the probability quality of the transfer of the ith source domain sample characteristic and label to the jth target domain sample characteristic and label. 1nRepresenting an n × 1 vector with an element of 1, 1mRepresenting an m 1 vector with elements of 1. (R)+)m×nRepresenting a matrix of positive real numbers of dimension m n.<·,·>Expressing Frobenius dot product;
initial value gamma for transport planning0The method comprises the following steps:
wherein, C0A cost matrix for moving the probability mass from the source domain samples to the target domain samples, andγ0(i, j) is the probability mass of the ith source domain sample being transferred to the jth target domain sample;
the objective loss function to be optimized for the entire model is:
where f' denotes the decoder of the auto-encoder,andrespectively representing source domain featuresAnddecoded into input data. L isfAnd LgRespectively representing the loss functions of the feature extractor f and the softmax classifier g;
and alternately solving the target loss function by using a cooperative descent algorithm. When the parameters of the feature extractor f and the softmax classifier g are fixed, the target loss function can be written as:
the above formula is a standard linear programming problem, which can be solved by a network simplex flow algorithm;
when the transport plan γ is fixed, the target loss function can be written as:
the Adam algorithm is used to solve the above equation.
(4) And (3) sending the target domain data into a trained feature extractor f and a softmax classifier g to obtain a prediction label of the target domain data, and calculating the classification precision of the target domain data.
The prediction accuracy of the target domain sample label is calculated according to the following formula:
wherein the content of the first and second substances,is a sampleThe predicted label of the tag is used to predict,is the corresponding true label. 1 {. is a binary function whenWhen the temperature of the water is higher than the set temperature,is 1; when in useWhen the temperature of the water is higher than the set temperature,is 0.
In order to verify the effectiveness of the invention, the invention selects real bearing fault data in the transmission system for verification. The source domain samples are normal and fault samples under a load condition and are labeled. And the target domain samples are normal and fault samples in another state and have no label. The motor loads are four, namely 0hp, 1hp, 2hp and 3hp, and 6 cross domains are formed for experimental verification: 0-1hp, 0-2hp, 0-3hp, 1-2hp, 1-3hp and 2-3 hp. The failure diameters were 0.007 inches and 0.014 inches. And selecting methods such as a Support Vector Machine (SVM), K-nearest neighbor, softmax classifier, back propagation neural network (BP), migration component analysis (TCA), Joint Distribution Adaptation (JDA), correlation alignment method (CORAL) and the like for comparison experiment with the method. The experimental result is shown in fig. 4, the method can realize bearing fault diagnosis under various working conditions, and the precision is higher than that of other comparison methods.
In conclusion, the optimal transportation-based transmission chain unsupervised domain adaptive fault diagnosis method aligns the distribution of the source domain and the target domain by minimizing the distance between the fault feature space of the source domain and the target domain and the label space, better extracts the depth feature of the target domain sample, ensures that a label classifier trained by the source domain can be well used for target domain data, realizes unsupervised domain adaptive label prediction of the target domain, and improves the fault diagnosis precision.
Claims (6)
1. A transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation is characterized by comprising the following steps: the method comprises the following steps:
(1) building a transmission chain fault database: acquiring fault data of a source domain and a target domain of a transmission chain, wherein a labeled source domain fault sample set isThe corresponding label isWhereinFor the m-th source domain fault sample,is composed ofCorresponding labels, s represents a source domain, and m is the total number of source domain samples; unlabeled target domain fault sample set isWhereinAn nth target domain fault sample is obtained, t represents a target domain, and n is the total number of target domain samples; the source domain fault sample set and the target domain fault sample set jointly form a transmission chain fault database;
(2) constructing an unsupervised domain adaptive feature extraction and classification model: an automatic encoder is used as a feature extractor f to automatically extract features of source domain and target domain fault samples, and a softmax classifier g is trained by using the source domain samples, so that a prediction label of the target domain fault samples is obtained, and unsupervised domain adaptive feature extraction and classification model construction of the target domain fault samples are realized;
(3) establishing an unsupervised domain adaptive fault diagnosis model fusing optimal transportation: processing fault samples of a source domain and a target domain through an unsupervised domain adaptive feature extraction and classification model to obtain sample features and corresponding labels; integrating the feature extractor f and the softmax classifier g into a target function of an optimal transport solver, calculating a transport planning matrix gamma through sample features and labels, aligning the distribution of a source domain and a target domain, and transporting source domain label information to the target domain; alternately optimizing the unsupervised domain adaptive feature extraction and classification model and the optimal transport solver, thereby obtaining an unsupervised domain adaptive fault diagnosis model fusing optimal transport;
(4) and (3) sending the target domain data into a trained feature extractor f and a softmax classifier g to obtain a prediction label of the target domain data, and calculating the classification precision of the target domain data.
2. The optimal transport based drive chain unsupervised domain adaptive fault diagnosis method according to claim 1, characterized in that: in the step (2), an automatic encoder is used as a feature extractor f, and the specific process of automatically extracting the features of the source domain fault sample and the target domain fault sample comprises the following steps:
the automatic encoder passes a function hθEncoding input data x into a representative feature y; this process is represented as:
y=hθ(x)=σ(Wx+b)
wherein θ represents a parameter of the encoding portion; w represents a weight matrix of the encoding process, b represents a bias vector of the encoding process; σ represents an activation function;
accordingly, the decoding part passes through a function h'θ′Reconstructing the representative features back into the input data:
x′=h′θ′(y)=σ′(W′y+b′)
wherein θ ' represents a parameter of the decoding part, x ' is reconstructed input data, W ' represents a weight matrix of the decoding process, b ' represents a bias vector of the decoding process, and σ ' represents an activation function of the decoding process;
the loss function of the entire autoencoder network is:
wherein x isρAnd x'ρRespectively represents the ρ -th input data and the input data reconstructed by the decoder, ρ ═ 1,2, …, I is the number of input data, | x'ρ-xρ||2Represents x'ρ-xρ2-norm of (d).
3. The optimal transport based drive chain unsupervised domain adaptive fault diagnosis method according to claim 2, characterized in that: in the step (2), the softmax classifier has the function of estimating the probability of each sample belonging to each class and taking the class with the highest probability as the class of the sample;
given input data and corresponding labels { (x)(1),y(1)),(x(2),y(2)),…,(x(τ),y(τ)) In which x(τ)Denotes the τ th input data, y(τ)Denotes x(τ)Corresponding label, and y(τ)E {1,2, …, k }, where k represents the number of categories of the tag; then data x is input(τ)Probability of label ofComprises the following steps:
wherein the content of the first and second substances,denotes the parameters of the softmax classifier, T denotes the transpose of the vector,is shown to allSumming to realize the normalization of the probability value;denotes the l parameterTransposing;
4. The optimal transport based drive chain unsupervised domain adaptive fault diagnosis method according to claim 3, characterized in that: in the step (3), the optimal transport solver is used for calculating the distance between the probability distributions of the source domain and the target domain, and the domain adaptation problem is regarded as a special case of the discrete optimal transport problem;
let the edge distribution of the source domain and target domain samples be mu respectivelysAnd mutAnd then the distance between the feature space and the label space of the source domain and the target domain is:
wherein C (i, j) represents the cost of moving the probability mass from the ith source domain sample feature and label to the jth target domain sample feature and label; c is a cost matrix, i is 1,2, …, m, j is 1,2, …, n;features representing an ith source domain sample;features representing a jth target domain sample; η represents a trade-off parameter;indicating the L2 distance between the sample features of the computed source domain and the target domain;representing a cross-entropy loss between the compute source domain and the target domain label;
since the target domain exemplar does not have a corresponding label,cannot be used directly, so the source domain trained softmax classifier g generates labels for target domain sample featuresNamely:
the objective function for solving the optimal transport plan is:
wherein the content of the first and second substances,represents a set of all transport plans γ; gamma (i, j) is the probability quality of the ith source domain sample feature and label transferring to the jth target domain sample feature and label; 1nAn n × 1 vector representing an element of 1; 1mAn mx 1 vector representing an element of 1; (R)+)m×nRepresenting a positive real matrix with dimension m × n;<·,·>expressing Frobenius dot product;
initial value gamma for transport planning0The method comprises the following steps:
5. The optimal transport based drive chain unsupervised domain adaptive fault diagnosis method according to claim 4, characterized in that: in the step (3), the objective loss function to be optimized by the whole model is as follows:
where f' denotes the decoder of the auto-encoder,andrespectively representing source domain featuresAnddecoding into input data; l isfAnd LgRespectively representing the loss functions of the feature extractor f and the softmax classifier g;
alternately solving the objective loss function using a cooperative descent algorithm:
when the parameters of the feature extractor f and the softmax classifier g are fixed, the target loss function is written as:
the above formula is a standard linear programming problem, which is solved by a network simplex flow algorithm;
when the transport plan γ is fixed, the target loss function is written as:
the Adam algorithm is used to solve the above equation.
6. The optimal transport based drive chain unsupervised domain adaptive fault diagnosis method according to claim 5, characterized in that: in the step (4), the prediction accuracy accurve of the target domain sample label is calculated according to the following formula:
wherein the content of the first and second substances,is a sampleThe predicted label of the tag is used to predict,is a corresponding true label; 1 {. is a binary function whenWhen the temperature of the water is higher than the set temperature,is 1; when in useWhen the temperature of the water is higher than the set temperature,is 0.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011152085.7A CN112232252B (en) | 2020-10-23 | 2020-10-23 | Transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011152085.7A CN112232252B (en) | 2020-10-23 | 2020-10-23 | Transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112232252A true CN112232252A (en) | 2021-01-15 |
CN112232252B CN112232252B (en) | 2023-12-01 |
Family
ID=74109601
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011152085.7A Active CN112232252B (en) | 2020-10-23 | 2020-10-23 | Transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112232252B (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112784920A (en) * | 2021-02-03 | 2021-05-11 | 湖南科技大学 | Cloud-side-end-coordinated dual-anti-domain self-adaptive fault diagnosis method for rotating part |
CN112990259A (en) * | 2021-02-04 | 2021-06-18 | 西交利物浦大学 | Early fault diagnosis method of rotary mechanical bearing based on improved transfer learning |
CN113822228A (en) * | 2021-10-27 | 2021-12-21 | 南京大学 | User expression recognition method and system based on continuous learning |
WO2023044978A1 (en) * | 2021-09-27 | 2023-03-30 | 苏州大学 | Adversarial-flow-model-based unsupervised fault diagnosis method for mechanical device |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109376620A (en) * | 2018-09-30 | 2019-02-22 | 华北电力大学 | A kind of migration diagnostic method of gearbox of wind turbine failure |
CN109580215A (en) * | 2018-11-30 | 2019-04-05 | 湖南科技大学 | A kind of wind-powered electricity generation driving unit fault diagnostic method generating confrontation network based on depth |
CN109902393A (en) * | 2019-03-01 | 2019-06-18 | 哈尔滨理工大学 | Fault Diagnosis of Roller Bearings under a kind of variable working condition based on further feature and transfer learning |
US20190325861A1 (en) * | 2018-04-18 | 2019-10-24 | Maneesh Kumar Singh | Systems and Methods for Automatic Speech Recognition Using Domain Adaptation Techniques |
CN110837850A (en) * | 2019-10-23 | 2020-02-25 | 浙江大学 | Unsupervised domain adaptation method based on counterstudy loss function |
-
2020
- 2020-10-23 CN CN202011152085.7A patent/CN112232252B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190325861A1 (en) * | 2018-04-18 | 2019-10-24 | Maneesh Kumar Singh | Systems and Methods for Automatic Speech Recognition Using Domain Adaptation Techniques |
CN109376620A (en) * | 2018-09-30 | 2019-02-22 | 华北电力大学 | A kind of migration diagnostic method of gearbox of wind turbine failure |
CN109580215A (en) * | 2018-11-30 | 2019-04-05 | 湖南科技大学 | A kind of wind-powered electricity generation driving unit fault diagnostic method generating confrontation network based on depth |
CN109902393A (en) * | 2019-03-01 | 2019-06-18 | 哈尔滨理工大学 | Fault Diagnosis of Roller Bearings under a kind of variable working condition based on further feature and transfer learning |
CN110837850A (en) * | 2019-10-23 | 2020-02-25 | 浙江大学 | Unsupervised domain adaptation method based on counterstudy loss function |
Non-Patent Citations (1)
Title |
---|
金余丰;刘晓锋;姚美常;黄凤良;: "基于域对抗迁移的变工况滚动轴承故障诊断模型", 自动化仪表, no. 12, pages 55 - 60 * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112784920A (en) * | 2021-02-03 | 2021-05-11 | 湖南科技大学 | Cloud-side-end-coordinated dual-anti-domain self-adaptive fault diagnosis method for rotating part |
CN112784920B (en) * | 2021-02-03 | 2024-02-13 | 湖南科技大学 | Yun Bianduan coordinated rotating component reactance domain self-adaptive fault diagnosis method |
CN112990259A (en) * | 2021-02-04 | 2021-06-18 | 西交利物浦大学 | Early fault diagnosis method of rotary mechanical bearing based on improved transfer learning |
CN112990259B (en) * | 2021-02-04 | 2023-12-26 | 西交利物浦大学 | Early fault diagnosis method for rotary mechanical bearing based on improved transfer learning |
WO2023044978A1 (en) * | 2021-09-27 | 2023-03-30 | 苏州大学 | Adversarial-flow-model-based unsupervised fault diagnosis method for mechanical device |
CN113822228A (en) * | 2021-10-27 | 2021-12-21 | 南京大学 | User expression recognition method and system based on continuous learning |
CN113822228B (en) * | 2021-10-27 | 2024-03-22 | 南京大学 | User expression recognition method and system based on continuous learning |
Also Published As
Publication number | Publication date |
---|---|
CN112232252B (en) | 2023-12-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112232252B (en) | Transmission chain unsupervised domain adaptive fault diagnosis method based on optimal transportation | |
CN110209823B (en) | Multi-label text classification method and system | |
Li et al. | Confidence-based active learning | |
CN111860677B (en) | Rolling bearing migration learning fault diagnosis method based on partial domain countermeasure | |
CN109492610B (en) | Pedestrian re-identification method and device and readable storage medium | |
CN110705607A (en) | Industry multi-label noise reduction method based on cyclic re-labeling self-service method | |
CN116304905B (en) | Permanent magnet synchronous motor demagnetizing fault diagnosis method under multi-load working condition | |
CN116610803B (en) | Industrial chain excellent enterprise information management method and system based on big data | |
CN112560948B (en) | Fundus image classification method and imaging method under data deviation | |
CN116911929B (en) | Advertisement service terminal and method based on big data | |
Dai et al. | Hybrid deep model for human behavior understanding on industrial internet of video things | |
CN116910013A (en) | System log anomaly detection method based on semantic flowsheet mining | |
CN114299326A (en) | Small sample classification method based on conversion network and self-supervision | |
CN114780767A (en) | Large-scale image retrieval method and system based on deep convolutional neural network | |
CN114943233A (en) | Parallelization-improvement-based entity naming identification method for Transformer-CRF network | |
CN111177010A (en) | Software defect severity identification method | |
CN112949097B (en) | Bearing residual life prediction model and method based on deep migration learning | |
CN113239720B (en) | Subway vehicle running gear fault diagnosis method based on deep migration learning | |
Li et al. | Dual-source gramian angular field method and its application on fault diagnosis of drilling pump fluid end | |
CN116610846A (en) | Breeding screening method and system for cattle breeding | |
CN109978013B (en) | Deep clustering method for character action recognition | |
CN115098681A (en) | Open service intention detection method based on supervised contrast learning | |
Brønstad | Data-driven detection and identification of undesirable events in subsea oil wells | |
CN115795037B (en) | Multi-label text classification method based on label perception | |
CN111158640B (en) | One-to-many demand analysis and identification method based on deep learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |