CN111240279B

CN111240279B - Confrontation enhancement fault classification method for industrial unbalanced data

Info

Publication number: CN111240279B
Application number: CN201911369696.4A
Authority: CN
Inventors: 葛志强; 江肖禹
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2019-12-26
Filing date: 2019-12-26
Publication date: 2021-04-06
Anticipated expiration: 2039-12-26
Also published as: CN111240279A

Abstract

The invention discloses an confrontation enhancement fault classification method for industrial unbalanced data, and belongs to the field of fault diagnosis and classification in industrial processes. According to the method, through countertraining between a multi-classification discriminator and a small sample generator, the small sample generator respectively generates data directionally for each type of unbalanced small samples, and through data screening based on the Mahalanobis distance of a principal element space, generated data closer to real data are obtained. A dynamic table of a supplementary database is established, real data are supplemented by utilizing a quantitative updating sample set to obtain a new data set, and the imbalance of industrial data is solved. The classification method couples the training processes of the generator and the multi-classification discriminator together, and more effectively utilizes computing resources.

Description

Confrontation enhancement fault classification method for industrial unbalanced data

Technical Field

The invention belongs to the field of industrial process fault diagnosis and classification, and particularly relates to an anti-enhancement fault classification method for industrial unbalanced data.

Background

With the development of modern industry, industrial data is accumulated in large quantities, and a basis is provided for data-driven process analysis. Among them, the fault diagnosis problem is a typical application of industrial data. Many data-driven methods, such as support vector machines, back-propagation neural networks, etc. algorithms have been widely used for fault classification in some industrial processes.

However, since the fault conditions occurring in the industry are rare, the collected fault data is very limited. Compared with a large amount of non-fault data, namely data under a normal condition, the proportion of fault data is low. In addition, there is an imbalance in the number of faults of different probabilities. This presents difficulties for algorithmic classification based on balanced data distribution. Therefore, the fault diagnosis in the industry is essentially a multi-classification problem of unbalanced data, and needs to be solved urgently.

Supplementing data from the data plane is the most direct way to solve the imbalance problem. The generation of the countermeasure network is a promising generation model at present, and is composed of a generator and a discriminator. Through the countertraining between the generator and the arbiter, the generator can generate data that spoofs the arbiter. Thus, generating data that is generated against the network can be applied to the complement of small samples. However, the training process for generating the countermeasure network is extremely unstable, and noise points deviating from real data or pattern collapse problems are easily generated, so that the authenticity of the generated data is influenced. And the generation of the confrontation network and the training of the classifier are two independent processes, so that the complexity of the model is increased and the waste of computing resources is reduced.

Disclosure of Invention

Aiming at the classification problem of the industrial unbalanced data, the invention provides an confrontation enhancement fault classification method for the industrial unbalanced data, which realizes accurate classification by utilizing a confrontation generation network structure of a small sample generator and a multi-classification discriminator.

The purpose of the invention is realized by the following technical scheme: a confrontation enhancement fault classification method for industrial unbalance data specifically comprises the following steps:

(1) collecting offline data in a historical industrial process as an original data set X, wherein the X comprises m kinds of large sample data X_bigAnd n kinds of small sample data X_smallI.e. X ═ X_big，X_small}＝{(xⁱ，yⁱ) In which y isⁱE {1, 2., m + n }, i represents the number of samples, and the number of samples of m large samples is the same, while the number of samples of small sample data is less than that of large samples; the original data set X is preprocessed and converted into 0, 1 by linearizing the original data through a linear function]Range, resulting in a training data set

(2) The data generation stage specifically comprises the following substeps:

(2.1) the small sample generators construct structures of n generators for n kinds of small sample data of unbalanced data, each small sample generator is a fully-connected neural network with the same structure, the input of the fully-connected neural network is Gaussian noise z, and the dimension of the fully-connected neural network is p; the output is the characteristic of the generated data, and the dimension is q; the n generators are mutually independent to form a parallel multi-generator structure; before Gaussian noise z is input into a generator, initializing a hyper-parameter of each small sample generator network;

(2.2) Gaussian noise z with a mean value of 0 and a variance of 0.1 is generated by a random function. Inputting Gaussian noise z into the small sample generator, and outputting n groups of generated data G (z) { G } by the generator₁，G₂，…，G_n}；

(3) The data screening stage specifically comprises the following steps:

(3.1) For the training data set

Performing principal component analysis to obtain a reference principal component matrix T₁Respectively projecting the generated data G (z) to a reference pivot space to obtain a corresponding pivot matrix T₂(ii) a Reference pivot matrix T₁Determining a reference pivot matrix T by accumulating the variance percentage to 98%₁The number of the main elements in (1); corresponding principal component matrix T₂The number of principal elements in (1) and a reference principal element matrix T₁The number of the main elements in the system is equal.

(3.2) separately aligning the reference pivot matrices T₁Solving the Mahalanobis distance MD by the inner small sample data centroid_1maxTo obtain the farthest distances under different categories

By said MD_1maxDetermining the upper threshold value of data screening to be kMD_1maxAnd k is a screening coefficient.

(3.3) for the corresponding principal component matrix T₂And a reference pivot matrix T₁The corresponding essence center of the small sample calculates the mahalanobis distance

Will MD₂And kMD_1maxMaking a comparison if MD₂＜kMD_1maxThen the generated data G (z) is considered to be close to the training data set

Is the effective point G_valid(ii) a If MD₂＞kMD_1maxWhen the generated data G deviates from the training data set

Is an outlier noise G_invalid。

(3.4) bringing the effective Point G_validExtracting from the generated data G (z) set, and giving the corresponding class labels y to the generated data G (z) set to remove outlier noise G_invalid。

(4) The dynamic table stage of the supplementary database specifically comprises the following steps:

(4.1) adding the effective point G_validImporting a dynamic table L of a supplementary database, wherein the dynamic table L is a sample sequence distributed to the generated data of each small sample class, namely a sample sequences; the length of each sample sequence in the supplementary database plus the number of real samples of the corresponding category small samples is equal to the number of real samples of each category large samples;

(4.2) when the accumulated number of the generated samples is smaller than the sequence length, continuously writing the generated samples into a dynamic table L of a supplementary database in the iteration process; and when the accumulated number of the generated samples is greater than or equal to the length of the sample sequence, eliminating the generated data at the end, writing the generated data into new generated data, and obtaining an updated supplementary data set X'.

(5) The classifier training stage specifically comprises the following steps:

(5.1) constructing a neural network multi-classification discriminator D (x) combined by a multi-hidden layer and a softmax network layer, inputting p-dimensional data x, and outputting m + n +1 sample class labels y; wherein m items are large sample class labels, n items are small sample class labels, and the m + n +1 item is a generated pseudo data label;

(5.2) applying the training data set

Mixed with the supplementary data set X' and used as real data X-P_dataThe generated data are represented as x to P_GAnd inputting x into the multi-classification discriminator to obtain the probability value p (y | x) of softmax output corresponding to each class.

(5.3) constructing a loss function of the classification discriminator:

(5.4) updating network parameters through an error back propagation algorithm, and optimizing a classification discriminator model until the loss function of the discriminator is converged;

(6) the generator training stage specifically comprises the following steps:

(6.1) constructing loss functions for n independent generators in the small sample generator respectively, wherein the loss function of the ith generator is as follows:

and (6.2) updating network parameters through an error back propagation algorithm, and optimizing a generator model until the generated data can cheat the authenticity judgment of the discriminator, namely the loss function of the discriminator is converged.

(6.3) repeating the steps (2.1) - (6.2) until each sample sequence corresponding to the dynamic table of the supplementary data set is filled, and finishing training the resistance enhanced fault classifier.

(7) When new data needs to be subjected to fault classification, the data is input into a trained countermeasure enhancement fault classifier, the probability of the item y of the softmax network layer which is m + n +1 is ignored, the posterior probability of each fault category is obtained, and the data is matched with the category according to the maximum posterior probability to realize the fault classification of the data.

Compared with the prior art, the invention has the following beneficial effects: through continuous iteration of the process, the data generated by the generator gradually approaches to the real samples, the sample sequence corresponding to the dynamic table of the supplementary data set is updated by the generated data, and the small category eliminates the imbalance of the original data set through the supplement of the generated data. Meanwhile, high-quality data in the generated data of the generator is reserved through data screening, and the performance of the classifier is further improved. The countermeasure enhancement classification method provided by the invention is an end-to-end model and is a data enhancement method which more conveniently utilizes generated data.

Drawings

FIG. 1 is a flow chart of a training of a challenge enhanced fault classifier for industrial imbalance data;

FIG. 2 is a flow chart of the Tennessee Eastman (TE) process;

FIG. 3 is a comparison of classification results against robust fault classifiers and other oversampling methods.

Detailed Description

The present invention is further described in detail with reference to the accompanying drawings.

The countermeasure-enhancing fault classifier adopted by the invention is structurally divided into four parts, wherein the first part is a small sample generator: the second part is a data filter: screening the generated data based on the Mahalanobis distance of the principal component space, quantitatively storing screened data by a third part which is a dynamic table of a supplementary database and mixing the screened data with real data, and a fourth part which is a multi-classification discriminator: the method is formed by combining a multi-hidden-layer neural network and a softmax network layer, wherein the output of the last layer of the neural network and the output of the softmax network layer are m + n +1 items, wherein m is a large sample class number, and n is a small sample class number.

A confrontation enhancement fault classification method for industrial unbalance data specifically comprises the following steps:

The training of the countermeasure enhancement fault classifier is a game countermeasure process and needs to be iterated circularly. An iteration cycle can be divided into 5 stages, and the specific flow is shown in fig. 1:

(2) the data generation stage specifically comprises the following substeps:

(3) The data screening stage specifically comprises the following steps:

(3.1) for the training data set

(3.3) for the corresponding principal component matrix T₂Root of Henan ginsengCooperation element matrix T₁The corresponding essence center of the small sample calculates the mahalanobis distance

Will MD₂And kMD_1maxMake a comparison if

The generated data g (z) is considered to be close to the training data set

Is the effective point G_valid(ii) a If it is

At this time, the generated data G is considered to deviate from the training data set

Is an outlier noise G_invalid。

(5) The classifier training stage specifically comprises the following steps:

(5.2) applying the training data set

(5.3) constructing a loss function of the classification discriminator:

wherein

X from n generators.

(6) the generator training stage specifically comprises the following steps:

(6.3) repeating the steps (2.1) - (6.2) until each sample sequence corresponding to the dynamic table of the supplementary data set is filled, and finishing training the resistance enhanced fault classifier. Through continuous iteration of the process, the data generated by the generator gradually approaches to the real samples, the sample sequence corresponding to the dynamic table of the supplementary data set is updated by the generated data, the imbalance of the data set is solved through effective generated data supplement, and the classification performance of the multi-classification discriminator is improved in confrontation and training.

Examples

The performance of the industrial imbalance data countermeasure enhancement fault classification discriminator is described below in conjunction with a specific TE process example. The TE process is a standard data set commonly used in the field of fault diagnosis and fault classification, and the whole data set includes 53 process variables, and the process flow thereof is shown in fig. 2. The process consists of 5 operation units, namely a gas-liquid separation tower, a continuous stirring type reaction kettle, a dephlegmator, a centrifugal compressor, a reboiler and the like, can be expressed by a plurality of algebraic and differential equations, and is mainly characterized by nonlinearity and strong coupling of the process sensing data.

The TE process sets 21 types of faults, wherein the 21 types of faults include 16 types of known faults and 5 types of unknown faults, the types of faults include step change of flow, slow ramp increase, viscosity of a valve and the like, and typical nonlinear faults and dynamic faults are included, normal data and five fault states are selected for research in the embodiment, and descriptions and corresponding ratios of different states are shown in table 1.

Table 1: fault list of the present embodiment

Numbering	Type (B)	State description	Number of
				0	Is normal	Is free of	1000
1	Step fault	The A/C feed flow ratio was varied, the content of component B being kept constant (stream 4)	50
				2	Step fault	The content of component B was varied and the A/C feed flow ratio was constant (stream 4)	50
3	Step fault	Loss of Material A (stream 1)	50
				4	Random variable fault	The temperature of the cooling water inlet of the condenser changes	20
5	Unknown fault	Is unknown	20

In this example, a total of 16 variables were selected for analysis, as shown in table 2.

Table 2: variable list of the present embodiment

Numbering	Measuring variable	Numbering	Measuring variable
				1	A feed rate	9	Product separator temperature
2	D amount of feed	10	Humidity of product classifier
				3	E amount of feed	11	Product separator bottoms flow
4	Total amount of feed	12	Pressure of stripper
				5	Flow rate of recirculation	13	Stripper temperature
6	Reactor feed	14	Flow rate of gas stripper
				7	Reactor temperature	15	Reactor cooling water outlet temperature
8	Discharge velocity	16	Separator cooling water outlet temperature

In this example, the number of generators of the small sample generator is 5, the number of hidden layers per generator is 2, the number of hidden layer nodes is 32 and 64, respectively, the optimizer used is ADAM, and the learning rate is 0.01. The number of hidden layers of the multi-classification discriminator is 2, the number of hidden layer nodes is 100 and 200 respectively, and the classification discriminator adopts an optimizer SGD to learn 0.1. Each time, a batch of data is selected for training, the batch size is 60, all samples are traversed in each period, and 100 periods are iterated.

100 samples from each state were selected as test data. Fig. 3 is a graph showing that the countermeasure enhancement discriminator and a common data oversampling method are counted and compared with the classification result (classification accuracy) of the neural network classifier, and it can be seen from the graph that the method has higher classification accuracy than the method of SMOTE + BPNN, smoteemann + SMOTE, and the superiority of the method is proved.

Claims

1. The method for classifying the confrontation enhancement faults facing the industrial unbalance data is characterized by comprising the following steps:

(2) The data generation stage specifically comprises the following substeps:

(2.2) generating a Gaussian noise z with a mean value of 0 and a variance of 0.1 by a random function; inputting Gaussian noise z into the small sample generator, and outputting n groups of generated data G (z) { G } by the generator₁，G₂，…，G_n}；

(3) The data screening stage specifically comprises the following steps:

(3.1) for the training data set

Performing principal component analysis to obtain a reference principal component matrix T₁Respectively projecting the generated data G (z) to a reference pivot space to obtain a corresponding pivot matrix T₂(ii) a Reference pivot matrix T₁Determining a reference pivot matrix T by accumulating the variance percentage to 98%₁The number of the main elements in (1); corresponding principal component matrix T₂The number of principal elements in (1) and a reference principal element matrix T₁The number of the main elements in the tree is equal;

By said MD_1maxDetermining the upper threshold value of data screening to be kMD_1maxK is a screening coefficient;

Will MD₂And kMD_1maxMake a comparison if

The generated data g (z) is considered to be close to the training data set

Is the effective point G_valid(ii) a If it is

At this time, the generated data G (z) is considered to deviate from the training data set

Is an outlier noise G_invalid；

(3.4) bringing the effective Point G_validExtracting from the generated data G (z) set, and giving the corresponding class labels y to the generated data G (z) set to remove outlier noise G_invalid；

(4.2) when the accumulated number of the generated samples is smaller than the sequence length, continuously writing the generated samples into a dynamic table L of a supplementary database in the iteration process; when the accumulated number of the generated samples is larger than or equal to the length of the sample sequence, eliminating the generated data at the tail, and writing the generated data into new generated data to obtain an updated supplementary data set X';

(5) the classifier training stage specifically comprises the following steps:

(5.2) applying the training data set

Mixed with the supplementary data set X' and used as real data X-P_dataThe generated data are represented as x to P_GInputting x into a multi-classification discriminator to obtain a probability value p (y | x) of softmax output corresponding to each class;

(5.3) constructing a loss function of the classification discriminator:

(6) the generator training stage specifically comprises the following steps:

(6.2) updating network parameters through an error back propagation algorithm, and optimizing a generator model until the generated data can trick the authenticity judgment of the discriminator, namely the loss function of the generator is converged;

(6.3) repeating the steps (2.1) - (6.2) until each sample sequence corresponding to the dynamic table of the supplementary data set is filled, and finishing training the anti-enhancement fault classifier;