CN117520905B

CN117520905B - Anti-fact fault data generation method based on causal intervention

Info

Publication number: CN117520905B
Application number: CN202410005964.9A
Authority: CN
Inventors: 丁煦; 陈冠华; 夏鹏华; 张一琦; 徐娟; 王松; 周辉; 翟华
Original assignee: Hefei University of Technology; Bengbu Triumph Engineering and Technology Co Ltd
Current assignee: Hefei University of Technology; Bengbu Triumph Engineering and Technology Co Ltd
Priority date: 2024-01-03
Filing date: 2024-01-03
Publication date: 2024-03-22
Anticipated expiration: 2044-01-03
Also published as: CN117520905A

Abstract

The invention relates to the technical field of anti-fact fault data generation, in particular to an anti-fact fault data generation method based on causal intervention. Firstly, describing a generation mechanism of fault data through a structural causal model, considering that the fault data consists of causal features and non-causal features, and only the causal features can influence the judgment of fault types. And then decoupling through a CycleGAN network to generate causal features and non-causal features. And introducing causal relation loss and characteristic information contrast loss in the CycleGAN network to constrain the model, and further reserving causal factors and intervening non-causal factors. Training a generator and a discriminator in the CycleGAN network to obtain an optimal generator, and generating anti-fact fault data through the optimal generator. The invention can generate high-quality anti-reality fault data through the proposed network model, and improve the precision of fault diagnosis.

Description

Anti-fact fault data generation method based on causal intervention

Technical Field

The invention relates to the technical field of anti-fact fault data generation, in particular to an anti-fact fault data generation method based on causal intervention.

Background

At present, a new path is opened up for intelligent fault diagnosis based on a deep learning method, and the real-time detection of the mechanical equipment faults is realized by extracting the characteristics in fault data. However, in the deep learning diagnostic model, the collected fault data are also quite different due to different working environments of the mechanical equipment, so that the robustness and accuracy of the same diagnostic model using the fault data are also problematic, and the following main reasons are:

1. due to the difference of working conditions of mechanical equipment and data acquisition methods, fault data acquired for fault diagnosis contain a large number of environmental features which have no causal relation with fault types but affect the diagnosis accuracy of a diagnosis model; and the fault data collected under different environments usually have different distribution characteristics, so that the generalization capability of the same diagnosis model on the fault data is reduced, and finally, the prediction accuracy is reduced.

2. The number of fault data which can be obtained at present and does not contain excessive noise is small, so that the diagnosis model trained by using limited data quantity is difficult to fully mine key features for judging fault types, and further the diagnosis precision of the diagnosis model is reduced.

It follows that further research is now required in terms of acquisition of fault data.

Disclosure of Invention

In order to avoid and overcome the technical problems in the prior art, the invention provides a causal intervention-based anti-reality fault data generation method. The invention can decouple the causal feature and the non-causal feature, and generate diversified anti-reality fault data through the intervention of the non-causal feature, thereby reducing or eliminating the influence of the non-causal feature on the fault type judgment.

In order to achieve the above purpose, the present invention provides the following technical solutions:

a method for generating anti-fact fault data based on causal intervention comprises the following steps:

s1, acquiring original working condition fault data and target working condition fault data; extracting real fault characteristics in original working condition fault data, inputting the real fault characteristics into a generator in a generating countermeasure network, performing characteristic decoupling on the real fault characteristics, and decoupling the real fault characteristics into real causal characteristics with causal relation between fault types and real non-causal characteristics without causal relation between the fault types; the real causal features generate corresponding anti-facts causal features through a generator, and the anti-facts causal features form corresponding anti-facts fault features;

s2, inputting the counter-facts fault characteristics and the real fault characteristics in the target working condition fault data into a discriminator in a generating countermeasure network at the same time, performing error optimization training on the discriminator and the generator to obtain an optimal generator, and forming a mapping for generating the real fault characteristics in the target working condition fault data from the real fault characteristics in the original working condition fault data in the optimal generator;

s3, inputting a real fault sample in the original working condition fault data into an optimal generator, and generating corresponding anti-facts fault data through mapping.

As a further scheme of the invention: true faults in original working condition fault dataAfter the characteristics are input into a generator in a generating countermeasure network, the generator firstly performs characteristic decoupling on the real fault characteristics, namely, real vibration signals of the real fault characteristics are divided into a plurality of real vibration signals in time sequenceKEach real patch corresponds to a position in the real vibration signal, and each real patch is respectivelyz ₁ 、z ₂ 、…、z _k 、…、z _K The method comprises the steps of carrying out a first treatment on the surface of the Wherein,z ₁ representing the 1 st actual patch to be applied,z ₂ representing the 2 nd real patch;z _k represent the firstkA real patch;z _K represent the firstKA real patch;

real causal features with causal relation between each real patch representation and the fault type;

inputting the real patches into a generator, sequentially passing through an input layer, a network of each layer and an output layer in the generator, and finally outputting corresponding anti-facts patches in the generator, wherein each anti-facts patch is respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Wherein (1)>Representing the 1 st counterfactual patch output by the generator, i.ez ₁ A counterfactual patch generated in the generator; />Representing the 2 nd counterfactual patch output by the generator, i.e.z ₂ A counterfactual patch generated in the generator; />Represents the first output of the generatorkCounter-facts patches, i.ez _k A counterfactual patch generated in the generator; />Represents the first output of the generatorKInverse of each otherFact patches, i.e.z _K A counterfactual patch generated in the generator; the anti-facts patches represent corresponding anti-facts causal features, and each anti-facts patch is sequentially arranged and combined according to the positions of the corresponding real patches in the real vibration signals, so that anti-facts fault data generated by the generator through the original working condition fault data are formed.

As still further aspects of the invention: the generation of the antagonism network adopts a CycleGAN network, and the antagonism loss function of the CycleGAN network is expressed as follows:

；

wherein,representing a fight loss function for the CycleGAN network;Ga representation generator;Da representation discriminator;Xrepresenting a sample set of raw operating condition fault data,xrepresentation ofXA true fault sample in (a); />Data distribution representing original operating mode fault data;E _x representation ofXExpectations of the true fault samples in (a);D(. Cndot.) represents the discriminant function of the CycleGAN network;G(. Cndot.) represents the generation function of the CycleGAN network;Ya sample set of target operating condition fault data is represented,yrepresentation ofYA true fault sample in (a); />Data distribution representing target operating condition fault data;E _y representation ofYIs a true failure sample.

As still further aspects of the invention: in a real vibration signal of a real fault feature with a time stamp in original working condition fault data, the relation between real patches at different moments is inconsistent, and causal relation between different real patches is calculated by dot product operation;

in the original working condition fault data, for a given real patch expressed by a vectorz _k It is associated with a real patchz _i The correlation relationship between them is expressed as:

；

wherein P is _k (i) Representing a real patchz _k And a real patchz _i The correlation between them evaluates the score, i.e. characterizes the real patchz _k And a real patchz _i A causal relationship distribution between the two;exp(. Cndot.) expressed in terms of natural constantseAn exponential function of the base;urepresenting the super-parameters;z _j the first generated in the true vibration signal representing the original condition fault datajA real patch, j=1, …,K，Krepresenting the total number of real patches;Trepresenting a matrix transpose operation;

solving real patches in original working condition fault dataz _k And all other real patches, and is noted asP _k ；

Similarly, in the counterfactual fault data, for a given counterfactual patch represented by a vectorIt is associated with a counterfactual patch>The correlation relationship between them is expressed as:

；

wherein Q is _k (i) Representing a counterfactual patchAnd counterfactual patch->The correlation between them evaluates the score, i.e. characterizes the counterfactual patch +.>And counterfactual patch->A causal relationship distribution between the two; />Representing the first of the anti-facts vibration signals formed in the anti-facts fault datajA counter fact patch;

solving the counterfactual patch in the counterfactual fault dataCorrelation with all other counterfactual patches and is noted asQ _k ；

Via JS divergence measureP _k AndQ _k similarity between the two, the measurement formula is as follows:

；

wherein,representation ofP _k AndQ _k similarity between; />Representation ofQ _k For a pair ofP _k KL divergence of (2); />Representation ofP _k For a pair ofQ _k KL divergence of (2);

by minimizingI.e.The real patch can be further constrainedz _k And corresponding anti-facts patchIntegrity of causal features in between;

based on minimizationThe similarity between all the real patches and the corresponding anti-facts patches is solved through JS divergence, and the solved result can be used as a causal relation loss, wherein the causal relation loss is expressed as follows:

；

wherein,representation generatorGIs lost in causal relationship in (a).

As still further aspects of the invention: acquiring characteristic information extracted from different layers of networks in a discriminator, wherein the discriminator is sharedLA layer network in whichlLayer common outputN _l With dimensions of 1XM _l Is used for the feature vector of (a),M _l representing the length of the feature vector,lthe value ranges from 1 toLThe method comprises the steps of carrying out a first treatment on the surface of the The number of the real fault characteristics and the counter-fact fault characteristics generated in the same layer network of the discriminator is the same;

the loss between the counter fact feature vector generated by the counter fact fault data in the layers of the network of the discriminator and the real feature vector generated by the target working condition fault data in the layers of the network of the discriminator is called layer contrast loss, and the layer contrast loss is expressed as follows:

；

wherein,L _{con l h,,} representing the counterfactual fault data at the arbiterlAdverse events generated in a hierarchical networkThe real characteristic vector and the target working condition fault data are in the first of the discriminatorslThe third layer is a layer network of the true feature vectorshLoss of individual layer contrast;representing the counterfactual fault data at the arbiterlGenerated in a layer networkhA counter-fact feature vector is used to determine,hthe value ranges from 1 toN _l ，N _l Representation of the discriminantlThe total number of the counter fact feature vectors or the real feature vectors generated in the layer network;representing the counterfactual fault data at the arbiterlGenerated in a layer network except->A feature vector set composed of all other inverse feature vectors except for the others; />Representation->The number of the middle-inverse fact feature vectors; />Representation ofThe first of (3)rA counter-fact feature vector is used to determine,rthe value range is 1 to->；/>Representation->The first of (3)fA plurality of inverse fact feature vectors; />Indicating the target working condition fault data in the first discriminatorlA feature vector set formed by all real feature vectors generated in the layer network; />Representation->The first of (3)rTrue feature vectors;ωrepresenting the super-parameters;

the contrast losses of all layers in the discriminator are added to form the total contrast loss of the discriminator, and the total contrast loss is expressed as follows:

；

wherein,representing the total contrast loss value of the discriminator;Lindicating the total number of layers of the network in the arbiter.

As still further aspects of the invention: selecting the anti-facts fault characteristics output by different layers of networks from the generator to perform patch comparison, and obtaining multi-layer patch comparison loss, wherein the comparison process is specifically as follows:

S2A1, in the process of processing original working condition fault data by the generator, obtaining the counterfactual patches output by each layer of network from the generator, and inputting the counterfactual patches into the multi-layer perceptron network one by one, wherein the multi-layer perceptron network outputs the counterfactual patch characteristics corresponding to each layer of network, and the counterfactual patch characteristics are expressed as follows:

；

wherein,representation generatorGFirst, thebLayer network ofb _i The true patches are inverse fact patches generated by the generating function;MLPrepresentation ofA multi-layer perceptron; />Representation->After the multi-layer perceptron is input, the corresponding output counterfactual patch features;

S2A2, calculating losses among the anti-fact patch features in the generator layer networks, the positive sample patch features in the anti-fact patch features and the negative sample patch features of the anti-fact patch features to obtain anti-fact patch losses of the anti-fact fault features, wherein the anti-fact patch losses are expressed as follows:

；

wherein,the counterfactual patch loss of original working condition fault data is represented; />Representation->Corresponding negative example patch feature, +.>Representation->Corresponding positive sample patch features;Orepresenting a loss function;Brepresenting the total number of layers of the network in the generator;b _I the first of the representation generatorsbThe total number of counterfactual patch features in the layer network;

S2A3, inputting target working condition fault data into a generation countermeasure network, and processing the target working condition fault data according to the processing process of the original working condition fault data in the generator so as to obtain a corresponding real patch; processing the real patch according to the same processing mode as the step S2A1 and the step S2A2 to obtain the counterfactual patch loss corresponding to the target working condition fault data;

S2A4, combining the counterfactual patch loss of the original working condition fault data and the counterfactual patch loss of the target working condition fault data to obtain a multi-layer patch loss of the generator, wherein the multi-layer patch loss is expressed as follows:

；

wherein,a multi-layer patch loss representing a generator; />Representation ofWeight parameters of (2); />The counterfactual patch loss of the target working condition fault data is represented; />Representation->Weight parameters of (c).

As still further aspects of the invention: error optimization refers to simultaneously solving minimum value of fight loss function in CycleGAN network, generatorGMinimum and generator of causal relation losses of (a)GMulti-layer patch loss minima and arbiterDBy repeating the error optimization process and reaching the set stop condition, an optimal generator can be generated, and the mapping of the real fault characteristics in the original working condition fault data to the real fault characteristics in the target working condition fault data can be obtained.

Compared with the prior art, the invention has the beneficial effects that:

1. the invention extracts real causal features related to fault types and irrelevant real non-causal features from real fault samples, and intervenes the non-causal features through decoupling action of a generator to generate anti-fact fault data; and through the error optimization training generator, the accuracy of the anti-fact fault data generated by using the mapping is improved, and the prediction accuracy of the fault type classification model after subsequent training is improved.

2. The fault data of the invention are decoupled into causal and non-causal features, and the anti-fact fault data are generated through independent intervention of the latter. On one hand, causal relation loss is designed, and causal characteristics of original working condition fault data are captured. On the other hand, the contrast loss of the characteristic information is added, and the intervention effect on the non-causal characteristics is improved. The generated anti-facts fault data is helpful to build a robust downstream fault classifier model, and the influence of confounding factors is reduced.

Drawings

Fig. 1 is a schematic diagram of a counterfactual fault data generation model in the present invention.

FIG. 2 is a schematic diagram of a causal relationship between different patches according to the present invention.

FIG. 3 is a schematic diagram of a comparison between different non-causal features in the inventive arbiter.

Detailed Description

The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

Referring to fig. 1 to 3, the bearing is taken as an example of the present invention. In the working process of the bearing, the working environment of the bearing is complex and changeable. On the same or different machines, the different fault vibration signals acquired follow different marginal distributions due to changes in environmental factors such as load conditions, temperature, etc. However, in the failure samples, the damage-related characteristics of the components such as the outer ring, the inner ring, the rollers, etc. related to the failure type are not interfered by these environmental factors. I.e. the non-causal features related to environmental factors and the causal features related to fault type are independent of each other, i.e. the independence of the causal mechanisms. The independence of causal mechanisms suggests that intervention on one mechanism does not make changes to the other. And the types and the number of the causal features affecting the fault type are fixed, while the non-causal features can change along with the change of the external environment. The invention aims to separate the causal features and the non-causal features, further reduce or eliminate the influence of the non-causal features on fault type judgment, and improve the accuracy of data.

The anti-facts fault data obtained by the anti-facts reasoning are data which have never appeared. Given real fault data, if the anti-facts fault data generated by the real fault data exist in the real distribution of the real fault samples, the obtained anti-facts fault data is called anti-facts confidence. This process of generating the anti-fact fault data of the anti-fact confidence from the real fault data is referred to as mapping. The mapping process is actually to generate causal features and non-causal features through group decoupling fault data, reserve the causal features, and intervene the non-causal features to ensure the anti-fact confidence.

To achieve the anti-facts confidence, an anti-facts fault data generation model is then built, as shown in fig. 1, mainly divided into two phases, phase one: training a counterfactual generator; stage two: and generating the counterfactual data and diagnosing faults. The generation model takes a CycleGAN network in a generation countermeasure network as a backbone network, and combines a plurality of loss functions to generate the counterfactual fault data.

By arranging various sensors on the fault bearing of which the bearing fault type is determined, the fault bearing is installed in corresponding detection equipment, corresponding working condition data such as load, rotating speed and the like are set, and vibration signals of the fault bearing are detected. And recording fault type and working condition data of the fault bearing and corresponding vibration signals to form fault data. And acquiring real fault data under different environments through multiple times of time, and respectively naming the real fault data as target working condition fault data and original working condition fault data.

After the original working condition fault data are input into a generator in the CycleGAN network, the generator firstly extracts all real fault characteristics and performs characteristic decoupling, the real fault data are divided into a plurality of fragments which are not overlapped with each other, namely real vibration signals of the real fault characteristics are sequentially divided into K patches according to time sequence, the patches are named as real patches, and each real patch corresponds to one position in the real vibration signals. Each real patch representation has real causal features of causal relation with the fault type.

The real patches are input into the generator, each real patch sequentially passes through an input layer, each layer of network and an output layer in the generator to be processed, and finally, the corresponding anti-facts patch is output in the generator. The anti-facts patches represent the corresponding anti-facts and cause-effect characteristics, and the anti-facts fault data generated by the generator through the original working condition fault data can be formed by sequentially arranging and combining the anti-facts patches according to the positions of the corresponding real patches in the real vibration signals.

The aim of the generator is to generate the anti-fact fault data which is as close to the real fault data distribution in the target working condition fault data as possible; the objective of the discriminator is to distinguish the counterfactual fault data generated by the original condition fault data from the target condition fault data. The countermeasures against losses enable the generator to generate counterfactual fault data having specific target operating condition attributes by facilitating the mutual boosting of the generator and the arbiter. Both the generator and the arbiter in the CycleGAN network are typically constructed using neural networks.

Selecting the anti-facts fault characteristics output by different layers of networks from the generator to perform patch comparison, and obtaining multi-layer patch comparison loss, wherein the comparison process is specifically as follows:

and in the process of normally processing the original working condition fault data by the generator, obtaining the counterfactual patches output by the networks of all layers from the generator, and inputting the counterfactual patches into the multi-layer perceptron networks of the two layers one by one, wherein the multi-layer perceptron networks output patch characteristics corresponding to the networks of all layers.

The losses among the patch features in the network of each layer of the generator, the positive sample patch features in the patch features and the negative sample patch features of the patch features are calculated to obtain the counterfactual patch losses of the counterfactual fault features.

Inputting the target working condition fault data into a generation countermeasure network, and processing the target working condition fault data according to the processing process of the original working condition fault data in the generator so as to obtain a corresponding real patch; and processing the real patch according to the same processing mode as the above content to obtain the counterfactual patch loss corresponding to the target working condition fault data.

The counterfactual patch loss of the original operating condition fault data and the counterfactual patch loss of the target operating condition fault data are combined to obtain the multi-layer patch loss of the generator.

On the basis of the multi-layer patch loss, causal relation loss and characteristic information contrast loss are introduced to restrict the causal intervention effect of the generated model, and high-efficiency class-specific characteristic transfer is realized.

As shown in fig. 2, in the real vibration signal of the real fault feature with the time stamp in the original working condition fault data, the relation between the real patches at different moments is inconsistent, and the causal relation between the different real patches is calculated by using dot product operation. And solving the similarity between all the real patches and the corresponding anti-facts patches through JS divergence, wherein the solved result can be used as a causality loss.

Machines and environments mainly contain low-level, operating-specific attributes, i.e., non-causal features. The performance of the arbiter is good or bad in its ability to discriminate against non-causal features. The comparison concept was introduced into the arbiter in order to further improve the performance of the arbiter. The method is characterized in that the network structure of the discriminator is reconstructed, and the characteristic information in the fault data is further extracted by utilizing a plurality of characteristic vectors output by different layers of the network structure.

Acquiring characteristic information extracted from different layers in own network of discriminators, wherein the discriminators are sharedLA layer, wherein the firstlLayer outputN _l With dimensions of 1XM _l Is used for the feature vector of (a),ltake a value of 1 toLThe method comprises the steps of carrying out a first treatment on the surface of the The number of real fault signatures and anti-facts fault signatures generated in the same layer network of the arbiter are the same.

As shown in fig. 3, the loss between the anti-facts feature vector generated in the respective layers of the arbiter by the anti-facts fault data and the true feature vector generated in the respective layers of the arbiter by the target operating condition fault data is referred to as a layer contrast loss. The contrast losses of all layers in the discriminator are added to form the total contrast loss of the discriminator.

Intuitively, through total contrast loss, the generator may possess greater discrimination, directing the generator to increase the degree of intervention on non-causal features, thereby increasing sample variance.

Error optimization refers to simultaneously solving the minimum value and generator of the fight loss function of the CycleGAN networkGMinimum and generator of causal relation losses of (a)GMulti-layer patch loss minima and arbiterDAnd (3) repeating the error optimization process, stopping iteration when the error reaches a set precision range or reaches corresponding iteration times, and obtaining the mapping of generating the real fault characteristics in the target working condition fault data from the real fault characteristics in the original working condition fault data.

And inputting the original working condition fault data containing the real fault characteristics and the real fault samples of the real fault types corresponding to the real fault characteristics into an optimal generator, and generating corresponding anti-facts fault data through mapping. The generated anti-reality fault data is more real, and the generated data volume can also be determined according to the actual situation. And the generated anti-reality fault data can be mixed with the real fault data and input into a neural network for fault type prediction, so as to train the neural network and improve the prediction accuracy.

The foregoing is only a preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art, who is within the scope of the present invention, should make equivalent substitutions or modifications according to the technical scheme of the present invention and the inventive concept thereof, and should be covered by the scope of the present invention.

Claims

1. The method for generating the anti-fact fault data based on causal intervention is characterized by comprising the following steps of:

s3, inputting a real fault sample in original working condition fault data into an optimal generator, and generating corresponding anti-facts fault data through mapping;

after the real fault characteristics in the original working condition fault data are input into the generator in the generation countermeasure network, the generator firstly performs characteristic decoupling on the real fault characteristics, namely, real vibration signals of the real fault characteristics are divided into time sequences in turnKEach real patch corresponds to a position in the real vibration signal, and each real patch is respectivelyz ₁ 、z ₂ 、…、z _k 、…、z _K The method comprises the steps of carrying out a first treatment on the surface of the Wherein,z ₁ representing the 1 st actual patch to be applied,z ₂ representing the 2 nd real patch;z _k represent the firstkA real patch;z _K represent the firstKA real patch;

inputting the real patches into a generator, sequentially passing through an input layer, a network of each layer and an output layer in the generator, and finally outputting corresponding anti-facts patches in the generator, wherein each anti-facts patch is respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Wherein (1)>Representing the 1 st counterfactual patch output by the generator, i.ez ₁ A counterfactual patch generated in the generator; />Representing the 2 nd counterfactual patch output by the generator, i.e.z ₂ A counterfactual patch generated in the generator; />Represents the first output of the generatorkCounter-facts patches, i.ez _k A counterfactual patch generated in the generator; />Represents the first output of the generatorKCounter-facts patches, i.ez _K A counterfactual patch generated in the generator; the anti-facts patches represent corresponding anti-facts causal features, and the anti-facts patches are sequentially arranged and combined according to the positions of the corresponding real patches in the real vibration signals to form anti-facts fault data generated by the original working condition fault data through the generator;

the generation of the antagonism network adopts a CycleGAN network, and the antagonism loss function of the CycleGAN network is expressed as follows:

wherein,representing a fight loss function for the CycleGAN network;Ga representation generator;Da representation discriminator;Xrepresenting a sample set of raw operating condition fault data,xrepresentation ofXA true fault sample in (a); />Data distribution representing original operating mode fault data;E _x representation ofXExpectations of the true fault samples in (a);D(. Cndot.) represents the discriminant function of the CycleGAN network;G(. Cndot.) represents the generation function of the CycleGAN network;Ya sample set of target operating condition fault data is represented,yrepresentation ofYA true fault sample in (a); />Data distribution representing target operating condition fault data;E _y representation ofYExpectations of the true fault samples in (a);

in a real vibration signal of a real fault feature with a time stamp in original working condition fault data, the relation between real patches at different moments is inconsistent, and causal relation between different real patches is calculated by dot product operation;

wherein Q is _k (i) Representing a counterfactual patch->And counterfactual patch->The correlation between them evaluates the score, i.e. characterizes the counterfactual patch +.>And counterfactual patch->A causal relationship distribution between the two; />Representing the first of the anti-facts vibration signals formed in the anti-facts fault datajA counter fact patch;

wherein (1)>Representation ofP _k AndQ _k similarity between; />Representation ofQ _k For a pair ofP _k KL divergence of (2); />Representation ofP _k For a pair ofQ _k KL divergence of (2);

by minimizingCan further restrict the real patchz _k And corresponding counterfactual patch->Integrity of causal features in between;

based on minimizationThe similarity between all the real patches and the corresponding anti-facts patches is solved through JS divergence, and the similarity can be used as a causal relation loss, and the causal relation loss is expressed as follows:

wherein (1)>Representation generatorGIs lost in causal relationship in (a).

2. The method for generating anti-facts fault data based on causal intervention according to claim 1, wherein the feature information extracted from different layers of networks in the discriminators is obtained, and the discriminators are sharedLA layer network in whichlLayer common outputN _l Each dimension is 1×M _l Is used for the feature vector of (a),M _l representing the length of the feature vector,lthe value ranges from 1 toLThe method comprises the steps of carrying out a first treatment on the surface of the The number of the real fault characteristics and the counter-fact fault characteristics generated in the same layer network of the discriminator is the same;

wherein,L _{con l h,,} representing the counterfactual fault data at the arbiterlThe inverse fact feature vector and the target working condition fault data generated in the layer network are in the first of the discriminatorslThe third layer is a layer network of the true feature vectorshLoss of individual layer contrast; />Representing the counterfactual fault data at the arbiterlGenerated in a layer networkhA counter-fact feature vector is used to determine,hthe value ranges from 1 toN _l ，N _l Representation of the discriminantlThe total number of the counter fact feature vectors or the real feature vectors generated in the layer network; />Representing the counterfactual fault data at the arbiterlGenerated in a layer network except->A feature vector set composed of all other inverse feature vectors except for the others; />Representation->The number of the middle-inverse fact feature vectors; />Representation->The first of (3)rA counter-fact feature vector is used to determine,rthe value range is 1 to->；/>Representation->The first of (3)fA plurality of inverse fact feature vectors; />Indicating the target working condition fault data in the first discriminatorlFeature vector set composed of all true feature vectors generated in layer networkCombining; />Representation->The first of (3)rTrue feature vectors;ωrepresenting the super-parameters;

wherein (1)>Representing the total contrast loss value of the discriminator;Lindicating the total number of layers of the network in the arbiter.

3. The method for generating the anti-facts fault data based on causal intervention according to claim 2, wherein the anti-facts fault characteristics output by different layers of networks are selected from the generator for patch comparison, and a multi-layer patch comparison loss is obtained, and the comparison process is as follows:

wherein (1)>Representation generatorGFirst, thebLayer network ofb _i The true patches are inverse fact patches generated by the generating function;MLPrepresentation ofA multi-layer perceptron; />Representation->After the multi-layer perceptron is input, the corresponding output counterfactual patch features;

wherein,a multi-layer patch loss representing a generator; />Representation->Weight parameters of (2); />The counterfactual patch loss of the target working condition fault data is represented; />Representation ofWeight parameters of (c).

4. A causal intervention based anti-facts fault data generation method according to claim 3, characterized in that the error optimisation is the simultaneous solution of the minimum of the counterdamage function in the CycleGAN network, generatorGMinimum and generator of causal relation losses of (a)GMulti-layer patch loss minima and arbiterDBy repeating the error optimization process and reaching a set stop condition, a new product can be producedAnd forming an optimal generator, and obtaining a mapping of the real fault characteristics in the original working condition fault data to the real fault characteristics in the target working condition fault data.