CN112598022A

CN112598022A - Improved FDA process industrial fault diagnosis method based on ensemble learning method

Info

Publication number: CN112598022A
Application number: CN202011374338.5A
Authority: CN
Inventors: 贺彦林; 赵阳; 朱群雄; 徐圆; 张洋
Original assignee: Beijing University of Chemical Technology
Current assignee: Beijing University of Chemical Technology
Priority date: 2020-11-30
Filing date: 2020-11-30
Publication date: 2021-04-02

Abstract

The invention discloses an improved FDA process industrial fault diagnosis method based on an ensemble learning method, which fully retains the time sequence relevance and more useful information of an industrial process among data after the data are subjected to dynamic processing, and is convenient for feature extraction of an FDA method of L2 norm normalization. According to the technical scheme provided by the invention, under the action of the integrated learning method, the time for establishing the comprehensive fault diagnosis model is shortened, and the fault diagnosis efficiency is improved. Compared with a fault diagnosis model which is not subjected to dynamic processing and is based on a Bayesian method, the technical scheme provided by the invention has obvious advantages through experimental simulation results.

Description

Improved FDA process industrial fault diagnosis method based on ensemble learning method

Technical Field

The invention relates to the technical field of process industrial fault diagnosis, in particular to an improved FDA (food and drug administration) process industrial fault diagnosis method based on an integrated learning method.

Background

The process industry related to the industries of petroleum, chemical industry, metallurgy, paper making and the like is taken as an important prop industry of national economy in China, and the safe production of the process industry is an important concern. The development of the process industry gradually shows the characteristics of large scale, nonlinearity and strong coupling of production equipment so far due to the continuous development and updating of discipline fusion and computer technology, and the production process also has high complexity and high risk correspondingly, so that various faults frequently occur, and the economic loss and even the loss of life of people are caused. Therefore, fault diagnosis in the process industry is always an important research subject of the production process, is a key technology of industrial safety, and is important for the safe operation of an industrial control system. Finding and solving faults in time is an important means for improving production safety and economic benefit. Therefore, the method has important practical significance for researching a fault diagnosis model and improving the fault diagnosis accuracy.

In recent years, data-driven fault diagnosis techniques based on multivariate statistical techniques have been widely studied and applied, such as principal component analysis methods, local preserving projection methods, canonical variable analysis methods, and the like. Principal Component Analysis (PCA) was once the most widely used data-driven technique in industrial monitoring systems, the classical method for data dimensionality reduction. Extracting the structural features of the data using PCA techniques can eliminate the correlation of the variables while ensuring that the sample data has the greatest variance. Due to the advantages of the PCA method in data structure feature extraction, further studies on PCA have not been interrupted. For example, the scholars propose a novel pattern recognition framework based on multi-scale pca (mspca) which incorporates an adaptive neuro-fuzzy inference system to creatively divide the state space into a score space and a residual space after extracting data features, which allows fault diagnosis to be performed by multiple classifiers. However, in the fault diagnosis process, when the dimension of the data is reduced, the data information among all the mode types is also an object needing important attention, and based on the object, the dimension of the data is reduced by adopting Fisher discriminant analysis. Fisher Discriminant Analysis (FDA) is a data dimension reduction method widely used in pattern classification, and this technique can fully consider data information between pattern classes. Particularly in the case of a relatively small number of process fault observations affecting production and a relatively large number of variables contained in the fault mode class, the FDA is more suitable for use in fault diagnosis when determining a representation of lower dimensionality due to its ability to focus on inter-class information.

As a supervised learning method, the FDA has been extensively studied with functions of feature extraction and pattern classification. As one method of data feature selection, FDA outperforms PCA in fault diagnosis. The FDA may also perform pre-classification at the same time as feature extraction, which can reduce the performance requirements on the classifier when performing failure mode recognition compared to other data-driven methods.

When the invention is used for fault mode classification, an algorithm which is widely applied in the field of machine learning and data mining is selected: the Adaboost algorithm. The Adaboost algorithm is an integrated algorithm and is applied to the fields of two classes, namely the Adaboost M1 variant, and is applied to the fields of multiple classes, namely the Adaboost M2 variant. Considering that the flow industry has more fault types and the fault diagnosis belongs to a multi-classification task, the Adaboost M2 variant is selected by the invention. In the invention, the algorithm is aimed at multi-classification tasks, and a special combination strategy is applied to collect a plurality of homogeneous weak classifiers (basic classifiers) with poor classification performance, so that the homogeneous weak classifiers are promoted to be strong classifiers and obtain excellent classification performance. The working mechanism of Adaboost M2 is: firstly, training a first basic classifier 1 by using initial weight from a training set, then classifying the patterns, further updating the weight of samples in the training set according to the learning error rate of the basic classifier 1 to the fault patterns, increasing the weight of classified error samples and reducing the weight of classified correct samples, so that each time of classified error samples is paid more attention. And after the training samples with updated weights are obtained, training the basic classifier 2, iterating for a plurality of times in this way, terminating the training until the number of the basic classifiers reaches the preset number T, combining the T basic classifiers through a set strategy, and finally upgrading the T basic classifiers into a strong classifier.

In order to verify the feasibility of the method, the fault type in the Tennessee Eastman Process (TEP) is taken as a research object, and fault diagnosis model establishment is carried out based on an FDA method, so that the method aims to diagnose the fault of the Process industry in time, reduce economic loss caused by the fault and realize efficient and safe production. TEP is a realistic industrial process simulation created by eastman chemicals, usa, and provides a model paradigm for scientific research of process monitoring. The simulation process comprises five core composition units of a reactor, a condenser, a compressor, a separator and a stripping tower, and material flow relates to eight chemical components. The experimental result of the fault diagnosis model based on the TEP shows that compared with other models, the improved FDA fault diagnosis model based on the integrated learning method can realize higher-precision fault diagnosis.

Disclosure of Invention

In order to solve the limitations and defects of the prior art, the invention provides an improved FDA process industrial fault diagnosis method based on an ensemble learning method, which comprises the steps of preprocessing the obtained fault data, constructing a dynamic input process, reducing the FDA data dimension and extracting the features, classifying the mode of the Adaboost M2 method by the ensemble learning method, and constructing a fault diagnosis model;

the method for classifying patterns by the ensemble learning method Adaboost M2 comprises the following steps:

given a training set S and a base classifier space Ψ, the expression is as follows:

wherein z is_iIs the row vector of data Z, y is the class label, c is the fault class number, phi represents the basic classifier;

training set weights are initialized, and the expression is as follows:

for each weak classifier, dividing the sample space Z to obtain Z₁，Z₂，...，Z_mIn aProbability distribution D_fThe sample class attribute probability is calculated as follows:

setting the output of a basic classifier in the divided sample space Z, wherein the expression is as follows:

calculating to obtain a normalization factor, wherein the expression is as follows:

wherein there are F features, F1, 2,., F;

the basic classifier is chosen such that the normalization factor is minimal, the expression is as follows:

calculating error fraction, and expressing the following expression:

wherein, T_m(z) is the sample label predicted by the base classifier;

calculating the weight of each basic classifier, and expressing the following expression:

and updating the sample weight, wherein the expression is as follows:

obtaining a combined classifier, wherein the expression is as follows:

wherein the content of the first and second substances,

optionally, the step of preprocessing the obtained fault data includes: selecting 52 process variables of the Tennessee-Ismann process as input, and carrying out fault diagnosis on 21 faults of the Tennessee-Ismann process;

obtaining a normalized data set X of a failed original training sample₀The expression is as follows:

wherein n represents the number of samples, and m represents the number of variables;

the step of constructing a dynamic input process comprises: for the normalized data set X₀Performing dynamic processing, and selecting a time lag constant L to be 2 to obtain an augmentation matrix X, wherein the expression is as follows:

the FDA data dimension reduction and feature extraction steps comprise: and reducing the dimension of the augmentation matrix X, calculating an FDA feature vector, and performing L2 norm normalization.

Optionally, the step of performing dimension reduction on the augmented matrix X, calculating an FDA eigenvector, and performing norm normalization by using L2 includes:

and constructing an overall dispersion matrix, wherein the expression is as follows:

S_t＝S_w+S_b (3)

wherein S is_wRepresenting an intra-class dispersion matrix, S_bRepresenting an inter-class dispersion matrix;

calculating an FDA vector, and maximizing the inter-class dispersion while minimizing the intra-class dispersion, wherein the expression is as follows:

using lagrange multiplier method, obtain FDA vector wk by solving generalized eigenvalues, the expression is as follows:

S_bw_k＝λ_pS_ww_k (5)

for the FDA vector w_kL2 norm normalization is performed, and the expression is as follows:

r FDA vectors are selected to construct a linear transformation matrix W_newThe expression is as follows:

W_new＝(w² ₁，w² ₂，...，，w² _r)^T (7)

the process industry data dimension is carried out, and the expression is as follows:

Z＝W^T _new×X (8)。

the invention has the following beneficial effects:

according to the improved FDA process industrial fault diagnosis method based on the ensemble learning method, provided by the invention, after data are subjected to dynamic processing, the time sequence relevance and more useful information of an industrial process are fully reserved among the data, and the FDA method of L2 norm normalization is convenient for feature extraction. According to the technical scheme provided by the invention, under the action of the integrated learning method, the time for establishing the comprehensive fault diagnosis model is shortened, and the fault diagnosis efficiency is improved. Compared with a fault diagnosis model which is not subjected to dynamic processing and is based on a Bayesian method, the fault diagnosis method has the obvious advantages.

Drawings

Fig. 1 is a flowchart of a TEP according to an embodiment of the present invention.

Fig. 2 is a schematic diagram of an average classification accuracy curve in a training process according to an embodiment of the present invention.

Fig. 3 is a diagram illustrating a sample point distribution effect after dimensionality reduction of a data set according to an embodiment of the present invention.

Fig. 4 is a schematic diagram of a confusion matrix of the fault diagnosis result according to an embodiment of the present invention.

Fig. 5 is a comparison graph of fault diagnosis results of different methods according to the first embodiment of the present invention.

Detailed Description

In order to make those skilled in the art better understand the technical solution of the present invention, the improved FDA process industrial fault diagnosis method based on the ensemble learning method provided by the present invention is described in detail below with reference to the accompanying drawings.

Example one

The embodiment provides a novel fault diagnosis method based on Adaboost M2 dynamic L2 norm normalized Fisher discriminant analysis (D-L2-FDA), and a fault data set is dynamically processed in consideration of time sequence characteristics of collected process data in a constructed D-L2-FDA model. The L2 norm normalized FDA is used to extract data features and reduce the dimensionality of the fault data. In addition, in order to classify different types of failure modes, the present embodiment adopts an ensemble learning method such as Adaboost M2. In order to verify the effectiveness of the proposed D-L2-FDA model based on Adaboost M2 in fault diagnosis, the present example was subjected to case study based on Tennessman process. Simulation results show that the diagnostic performance of the D-L2-FDA model based on Adaboost M2 is superior to other methods.

Fig. 1 is a flowchart of a TEP according to an embodiment of the present invention. In order to improve the fault diagnosis accuracy, the embodiment provides an improved FDA process industrial fault diagnosis method based on an integrated learning method, so as to accurately diagnose a process fault and ensure production safety. The embodiment comprises the following steps: the method comprises the steps of obtaining data, conducting data preprocessing, constructing a dynamic input process, FDA data dimension reduction and feature extraction, conducting mode classification by an integrated learning method Adaboost M2, and constructing a fault diagnosis model. According to the technical scheme provided by the embodiment, the building time of the comprehensive fault diagnosis model is shortened under the action of the integrated learning method, and the fault diagnosis efficiency is improved.

The embodiment is based on TEP simulation, actual factory production is truly simulated, an obtained data set is divided into a training set and a testing set, the training set is composed of 480 samples, and the testing set is composed of 960 samples. The data in the training set and the test set contain information of 52 variables in total, namely the control variable and the measurement variable, at different sampling moments, and the variable information is shown in table 1. All data sets cover 22 different conditions, including 1 fault-free set and 21 fault sets, and the fault conditions are described and their types are shown in table 2.

TABLE 1 Process variables and their description

TABLE 2 Fault and description thereof

Fig. 2 is a schematic diagram of an average classification accuracy curve in a training process according to an embodiment of the present invention. In this embodiment, data preprocessing is performed, and in order to consider continuity and dynamic property of the process, dynamic processing is performed on the original data set.

Fig. 3 is a diagram illustrating a sample point distribution effect after dimensionality reduction of a data set according to an embodiment of the present invention. In this embodiment, the dimension reduction and feature extraction are performed on the fault data set, an improved FDA method is adopted, after an FDA feature vector is solved, L2 norm normalization is performed to obtain a feature vector direction, while the magnitude of the feature vector is ignored, so that the model calculation complexity is simplified.

In this embodiment, after the fault mode classification is performed to obtain the specification data after the dimension reduction, the fault mode classification is performed by using an ensemble learning method Adaboost M2 based on a decision tree to complete fault diagnosis.

Fig. 4 is a schematic diagram of a confusion matrix of the fault diagnosis result according to an embodiment of the present invention. Fig. 5 is a comparison graph of fault diagnosis results of different methods according to the first embodiment of the present invention. The embodiment is an improved FDA fault diagnosis model based on an ensemble learning method, which is used for diagnosing process faults of the process industry, the initial data set is dynamically processed, L2 norm normalization is carried out on the obtained FDA feature vector to obtain a fault feature extraction model, fault mode classification is carried out by combining an Adaboost M2 algorithm based on decision tree integration, and finally fault diagnosis of the process industry is realized.

This embodiment obtains a normalized data set X of the original training sample of the failure₀：

Where n represents the number of samples and m represents the number of variables.

This example is for X₀Performing dynamic processing, and selecting a time lag constant L to be 2 to obtain an augmentation matrix X:

in this embodiment, dimension reduction is performed on the augmented matrix, FDA eigenvectors are calculated, and L2 norm normalization is performed. This embodiment constructs the total dispersion matrix:

S_t＝S_w+S_b (3)

wherein S is_wRepresenting an intra-class dispersion matrix, S_bRepresenting an inter-class dispersion matrix.

The present embodiment calculates the FDA vector, and maximizes the inter-class dispersion while minimizing the intra-class dispersion:

the present embodiment applies the lagrangian multiplier method to obtain the FDA vector w by solving the generalized eigenvalue of the following equation_k：

S_bw_k＝λ_pS_ww_k (5)

In this embodiment, a series of FDA vectors are normalized by an L2 norm:

in the embodiment, r FDA vectors are selected to construct a linear transformation matrix W_new：

W_new＝(w² _I，w² ₂，...，w² _r)^T (7)

In this embodiment, the process industry data dimension reduction is performed:

Z＝W^T _new×X (8)

in this embodiment, an ensemble learning method Adaboost M2 is applied to classify the failure modes, and the generation process of the combination classifier is as follows:

this embodiment gives a training set S and a base classifier space Ψ:

wherein z is_iIs a row vector of data Z, representing one sample. y is the class label, c is the number of failed classes, and φ represents the basic classifier.

This embodiment initializes the training set weights:

this embodiment assumes that a total of F features need to be obtained, for F ═ 1,2, …, F:

for each weak classifier, the present embodiment performs the following operations:

this example divides the sample space Z to obtain Z₁，Z₂，...，Z_m；

Under probability distribution, the present embodiment calculates a sample class attribute probability:

the present embodiment sets the output of the basic classifier in the above division:

in this embodiment, a normalization factor is calculated:

the embodiment selects a basic classifier such that the normalization factor is minimal:

this embodiment calculates the error fraction:

wherein, T_m(z) is the sample label predicted by the basic classifier.

The present embodiment calculates the weight of each basic classifier:

the present embodiment updates the sample weight:

this embodiment results in a combined classifier:

wherein the content of the first and second substances,

the combined classifier obtained by the expression (18) integrates a plurality of basic classifiers with relatively weak classification capability into a strong classifier, which is called as an ensemble learning method in the embodiment. Adaboost M2, as an ensemble learning method, provides a novel pattern classifier that provides a new idea for fault diagnosis by predicting the labels of test set data based on extracted process data features to implement pattern classification. The idea of the Adaboost M2 method is to construct a first simple classifier based on the original data distribution. Then, bootstrap generates T weak classifiers, and finally, the different classifiers are combined together to construct a strong classifier with better performance. As can be seen from the algorithm flow, in the weight updating process, the weight of the misclassified sample is increased, and the weight of the correctly classified sample is decreased. Thus, Adaboost M2 enhances learning of misclassified samples.

An improved FDA fault diagnosis model based on an ensemble learning method Adaboost M2 is finally designed, and from the experimental result of TEP fault diagnosis of the model, the embodiment has high fault diagnosis accuracy and certain practical application value.

According to the improved FDA process industrial fault diagnosis method based on the ensemble learning method, after data are subjected to dynamic processing, time sequence relevance and more useful information of an industrial process are fully reserved among the data, and feature extraction is conveniently performed by the FDA method of L2 norm normalization. According to the technical scheme provided by the embodiment, the building time of the comprehensive fault diagnosis model is shortened under the action of the integrated learning method, and the fault diagnosis efficiency is improved. Compared with a fault diagnosis model which is not subjected to dynamic processing and is based on a Bayesian method, the technical scheme provided by the embodiment has significant advantages through experimental simulation results.

It will be understood that the above embodiments are merely exemplary embodiments taken to illustrate the principles of the present invention, which is not limited thereto. It will be apparent to those skilled in the art that various modifications and improvements can be made without departing from the spirit and substance of the invention, and these modifications and improvements are also considered to be within the scope of the invention.

Claims

1. An improved FDA process industrial fault diagnosis method based on an ensemble learning method is characterized by comprising the steps of preprocessing obtained fault data, constructing a dynamic input process, FDA data dimension reduction and feature extraction, carrying out mode classification by using an ensemble learning method Adaboost M2 method, and constructing a fault diagnosis model;

training set weights are initialized, and the expression is as follows:

for each weak classifier, dividing the sample space Z to obtain Z₁,Z₂,...,Z_mIn the probability distribution D_fThe sample class attribute probability is calculated as follows:

wherein there are F features, F is 1,2, …, F;

calculating error fraction, and expressing the following expression:

wherein, T_m(z) is the sample label predicted by the base classifier;

and updating the sample weight, wherein the expression is as follows:

obtaining a combined classifier, wherein the expression is as follows:

wherein the content of the first and second substances,

2. the improved FDA process industry fault diagnosis method based on integrated learning method according to claim 1, wherein the step of preprocessing the obtained fault data comprises: selecting 52 process variables of the Tennessee-Ismann process as input, and carrying out fault diagnosis on 21 faults of the Tennessee-Ismann process;

3. The improved FDA process industrial fault diagnosis method based on ensemble learning method according to claim 2, wherein the step of performing dimension reduction on the augmented matrix X, calculating FDA feature vector, and performing L2 norm normalization comprises:

S_t＝S_w+S_b (3)

obtaining the FDA vector w by solving for the generalized eigenvalues using the Lagrangian multiplier method_kThe expression is as follows:

S_bw_k＝λ_pS_ww_k (5)

W_new＝(w² _I，w² ₂，...，w² _r)^T (7)

and (3) performing process industrial data dimension reduction, wherein the expression is as follows:

Z＝W^T _new×X (8)。