CN107688825B - Improved integrated weighted extreme learning machine sewage treatment fault diagnosis method
- Publication number: CN107688825B
- Authority: CN (China)
- Legal status: Expired - Fee Related
Classifications
- G06F18/214 - Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G06F18/24 - Classification techniques
Abstract
The invention discloses an improved integrated weighted extreme learning machine sewage treatment fault diagnosis method, which comprises the following steps: S1, assign the initial weights of the weighted extreme learning machine base classifier using an assignment formula biased toward minority-class samples; S2, train the base classifiers; S3, propose a novel base-classifier weight update formula for the ensemble algorithm, integrate a plurality of base classifiers with the weighted extreme learning machine as base classifier using the Adaboost iteration method, and establish an improved sewage fault diagnosis model; S4, input sample data generated in the sewage treatment process, set the number T of base classifiers of the ensemble algorithm, the optimal kernel width γ of the base classifiers and the corresponding optimal regularization coefficient C, establish the fault diagnosis model of the sewage treatment system and carry out a performance test. The invention can classify multi-class imbalanced data, improves the classification performance on imbalanced data, in particular the classification accuracy on minority classes, and effectively improves the accuracy of fault diagnosis in the sewage treatment process.
Description
Technical Field
The invention relates to the technical field of sewage treatment fault diagnosis, in particular to an improved sewage treatment fault diagnosis method of an integrated weighted extreme learning machine.
Background
Sewage treatment is a complex biochemical process with a great number of influencing factors, and a sewage treatment plant is difficult to keep in long-term stable operation. Faults easily cause serious problems such as substandard effluent quality, increased operating cost and secondary environmental pollution, so the operating state of the plant must be monitored and operating faults diagnosed and handled in time.
Fault diagnosis of the sewage treatment process is in essence a pattern recognition problem, and imbalanced class distributions are frequently encountered when classifying sewage data sets. Traditional machine learning methods tend to bias classification accuracy toward the majority classes, whereas in practice the accuracy on the minority classes, i.e. the fault classes, matters more. Finding faults promptly and accurately can greatly reduce the losses of a sewage treatment plant and improve its working efficiency.
Disclosure of Invention
Aiming at the fault diagnosis problem of sewage treatment plants, the invention provides an improved integrated weighted extreme learning machine sewage treatment fault diagnosis method. The method introduces the imbalanced-classification evaluation index G-mean into an Adaboost ensemble classification algorithm that uses a weighted extreme learning machine as base classifier. Applied to fault diagnosis of the sewage treatment process, it can classify multi-class imbalanced data, improves classification performance on imbalanced data, in particular the classification accuracy on minority classes, and effectively improves fault diagnosis accuracy.
In order to achieve the purpose, the technical scheme provided by the invention is as follows: an improved integrated weighted extreme learning machine sewage treatment fault diagnosis method comprises the following steps:
s1, assign the initial weights of the weighted extreme learning machine base classifier using an assignment formula biased toward minority-class samples;
s2, train the base classifiers: compute the recall and the performance evaluation index G-mean of the previous base classifier, and, using a G-mean-based initial weight matrix update formula, adjust the weight matrix of the next weighted extreme learning machine base classifier and establish the base classifier model, as follows:
s2.1, given a sewage sample set {(x_1,y_1),(x_2,y_2),…,(x_i,y_i),…,(x_N,y_N)}, where x_i ∈ X denotes the attribute vector of the i-th sample, y_i denotes the class label of the i-th sample, N is the total number of samples, and y_i ∈ Y = {1,2,…,k,…,K}, where k denotes the k-th class and K the total number of classes; set the number of base classifiers of the ensemble algorithm, denoted T;
s2.2, train on the training samples with a weighted kernel extreme learning machine as base classifier to obtain the training model h_t. For the t-th base classifier h_t, first obtain the recall R_1, R_2, …, R_k, …, R_K of each class (k is the k-th class, K is the total number of classes); then compute the sample count n_k of each class and the classification result A(x_i) of each sample: if sample x_i is classified correctly, A(x_i) = 1, and if it is misclassified, A(x_i) = -1; finally obtain G_mean = (R_1·R_2·…·R_K)^{1/K};
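The per-class recall and G-mean computation of step S2.2 can be sketched as follows (a minimal illustration; the function name and the 1..K label convention are assumptions, not part of the patent text):

```python
import numpy as np

def per_class_recall_and_gmean(y_true, y_pred, num_classes):
    """Compute per-class recalls R_1..R_K and the geometric mean
    G_mean = (R_1 * R_2 * ... * R_K)^(1/K), as in step S2.2.
    Class labels are assumed to be 1..K."""
    recalls = []
    for k in range(1, num_classes + 1):
        mask = (y_true == k)
        n_k = mask.sum()                      # number of samples of class k
        correct = (y_pred[mask] == k).sum()   # correctly classified samples
        recalls.append(correct / n_k if n_k > 0 else 0.0)
    g_mean = float(np.prod(recalls) ** (1.0 / num_classes))
    return recalls, g_mean
```

For example, with true labels [1,1,1,2,2] and predictions [1,1,2,2,2], the recalls are 2/3 and 1, giving G_mean = sqrt(2/3).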
S2.3, if G_mean is less than or equal to 0.5, exit the iteration;
s2.4, compute the weight λ_t of the t-th base classifier according to the weight calculation formula of base classifier h_t; the smaller G_mean is (i.e. the larger the training error), the smaller λ_t and the smaller the proportion of the t-th base classifier in the whole ensemble algorithm, and vice versa;
s2.5, adjust the sample weight distribution D_{t+1} for the next iteration; the adjustment rule of D_{t+1} is as follows:
s2.6, let t = t + 1; if t < T, return to S2.2, otherwise end;
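The loop S2.1-S2.6 can be sketched as below. The patent gives the λ_t and D_{t+1} formulas only as images, so the Adaboost-style expressions used here are illustrative assumptions, as is the user-supplied `fit_classifier` hook:

```python
import numpy as np

def g_mean(y_true, y_pred):
    """Geometric mean of per-class recalls."""
    classes = np.unique(y_true)
    recalls = [(y_pred[y_true == k] == k).mean() for k in classes]
    return float(np.prod(recalls) ** (1.0 / len(classes)))

def train_ensemble(fit_classifier, X, y, T):
    """Sketch of steps S2.1-S2.6. `fit_classifier(X, y, D)` trains one
    weighted-ELM base classifier under sample weight distribution D and
    returns an object with a .predict method; the exact lambda_t and
    D_{t+1} update rules are assumptions (Adaboost-style stand-ins)."""
    N = len(y)
    D = np.full(N, 1.0 / N)                 # initial weight distribution D_1
    models, lambdas = [], []
    for t in range(T):
        h_t = fit_classifier(X, y, D)       # step S2.2: train base classifier
        y_pred = h_t.predict(X)
        g = g_mean(y, y_pred)
        if g <= 0.5:                        # step S2.3: exit the iteration
            break
        lam = 0.5 * np.log(g / (1.0 - g))   # step S2.4 (assumed form)
        A = np.where(y_pred == y, 1.0, -1.0)  # A(x_i) = +1 correct, -1 wrong
        D = D * np.exp(-lam * A)            # step S2.5 (assumed form)
        D = D / D.sum()                     # renormalize to a distribution
        models.append(h_t)
        lambdas.append(lam)                 # lambda_t: classifier weight
    return models, lambdas
```

A classifier with G_mean close to 1 thus receives a large λ_t, and misclassified samples gain weight for the next round.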
s3, propose a novel base-classifier weight update formula for the ensemble algorithm, integrate a plurality of base classifiers with the weighted extreme learning machine as base classifier using the Adaboost iteration method, and establish the improved sewage fault diagnosis model, where the weight update proceeds as follows:
s3.1, setting the number of the base classifiers of the integration algorithm and recording as T;
s3.2, determine the initial weight distribution D_1(i), i = 1,2,…,N, of the samples x_i according to the weight initialization method;
S3.3, train T base classifiers according to the method of S2, and compute the weight of each base classifier according to the base-classifier weight update formula;
s4, input sample data generated in the sewage treatment process, set the number T of base classifiers of the ensemble algorithm, the optimal kernel width γ of the base classifiers and the corresponding optimal regularization coefficient C, establish the fault diagnosis model of the sewage treatment system and carry out a performance test.
In step S1, one of two weight initialization schemes is selected. The first is an automatic weighting scheme: W1_ii = 1/n_k, where W_1 denotes the first weighting scheme and n_k is the number of training samples belonging to the class k of sample i.
The other weight initialization idea is to push the weight ratio of minority classes to majority classes toward 0.618:1, which essentially trades some classification accuracy on the majority classes for higher recognition accuracy on the minority classes: W2_ii = 0.618/n_k when class k is a majority class (n_k larger than the average class size) and W2_ii = 1/n_k otherwise, where W_2 denotes the second weighting scheme.
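The two initialization schemes can be sketched as follows. The patent's exact formulas appear only as images, so the expressions below follow the standard weighted-ELM W1/W2 golden-ratio schemes, which match the 0.618:1 description; treat them as an assumption:

```python
import numpy as np

def initial_sample_weights(y, scheme="W1"):
    """Diagonal of the N x N weighting matrix W (step S1).
    W1: automatic weighting, W_ii = 1 / n_k (n_k = size of sample i's class).
    W2: golden-ratio weighting, down-weighting majority classes so the
    minority/majority weight ratio moves toward 0.618:1 (assumed form)."""
    classes, counts = np.unique(y, return_counts=True)
    count_of = dict(zip(classes, counts))
    avg = counts.mean()
    w = np.empty(len(y))
    for i, label in enumerate(y):
        n_k = count_of[label]
        if scheme == "W1":
            w[i] = 1.0 / n_k
        else:  # W2: classes larger than average are further discounted
            w[i] = (0.618 / n_k) if n_k > avg else (1.0 / n_k)
    return w
```

For labels [1,1,1,2], W1 gives weights [1/3, 1/3, 1/3, 1], while W2 discounts the majority class to [0.618/3, 0.618/3, 0.618/3, 1].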
In step S2.2, the modeling of the weighted kernel extreme learning machine is specifically as follows:
the extreme learning machine adopts the framework of a single-hidden-layer feedforward neural network (SLFN). Given N sewage treatment fault diagnosis training samples (x_1,y_1),(x_2,y_2),…,(x_N,y_N), the standard SLFN output model with L hidden nodes is represented as:
o_j = Σ_{i=1}^{L} β_i G(w_i · x_j + b_i), j = 1,2,…,N
where β_i denotes the output weight connecting the i-th hidden neuron to the output neurons, G is the hidden-layer neuron activation function, w_i denotes the input weights between the input layer and the i-th hidden neuron, b_i denotes the bias of the i-th hidden neuron, and o_j is the actual output at the output neurons for the j-th sample;
for the N sewage treatment fault diagnosis samples there exist (w_i,b_i) and β_i such that Σ_{i=1}^{L} β_i G(w_i · x_j + b_i) = y_j, j = 1,2,…,N, i.e. the single-hidden-layer feedforward network can fit the sample set with zero error; in matrix form this is written H β = T, where H is the hidden layer output matrix, β is the output weight matrix, and T is the target output matrix;
when the activation function G is differentiable, the SLFN parameters need not all be adjusted: the input weights w_i and hidden-layer biases b_i are selected randomly during network initialization and kept fixed during training, so training the SLFN is equivalent to finding the least-squares solution of the linear system H β = T, which can be cast as the optimization problem:
Minimize: ||H β - T||^2 and ||β||
The optimization problem is mathematically expressed as:
Minimize: (1/2)||β||^2 + (C/2) Σ_{i=1}^{N} ||ξ_i||^2, subject to h(x_i) β = y_i^T - ξ_i^T, i = 1,2,…,N
where ξ_i = [ξ_{i,1},…,ξ_{i,K}]^T is the error vector between the output values of the output nodes for sewage treatment fault diagnosis training sample x_i and the true values; using the Moore-Penrose generalized inverse H^+ of the hidden layer output matrix, the least-squares solution is obtained as β = H^+ T:
the orthogonal projection method can be used to solve H^+ efficiently: when H^T H is nonsingular, H^+ = (H^T H)^{-1} H^T, and when H H^T is nonsingular, H^+ = H^T (H H^T)^{-1}. To give the resulting model better stability and generalization performance, a positive value 1/C is added to the diagonal of H^T H or H H^T when solving, obtaining:
β = H^T (I/C + H H^T)^{-1} T
where I denotes the identity matrix, and the corresponding output function is:
f(x) = h(x) H^T (I/C + H H^T)^{-1} T
or, when the diagonal term is added to H^T H:
β = (I/C + H^T H)^{-1} H^T T
and the corresponding ELM output function is:
f(x) = h(x) (I/C + H^T H)^{-1} H^T T
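The randomized hidden layer and the regularized solve β = (I/C + H^T H)^{-1} H^T T can be illustrated by a minimal sketch; the sigmoid activation, one-hot targets, and function names are assumptions for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

def elm_train(X, T_targets, L, C):
    """Minimal regularized ELM: random input weights and biases stay
    fixed, and the output weights solve
    beta = (I/C + H^T H)^{-1} H^T T."""
    n_features = X.shape[1]
    w = rng.standard_normal((n_features, L))   # random input weights w_i
    b = rng.standard_normal(L)                 # random hidden biases b_i
    H = 1.0 / (1.0 + np.exp(-(X @ w + b)))     # hidden layer output matrix
    beta = np.linalg.solve(np.eye(L) / C + H.T @ H, H.T @ T_targets)
    return w, b, beta

def elm_predict(X, w, b, beta):
    H = 1.0 / (1.0 + np.exp(-(X @ w + b)))
    return H @ beta                            # network outputs o_j
```

Using `np.linalg.solve` on the regularized normal equations avoids forming an explicit inverse and reflects the stability role of the 1/C diagonal term.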
for better handling of imbalanced data, each sample is weighted so that samples belonging to different classes receive different weights, and the mathematical form of the above optimization problem is rewritten as:
Minimize: (1/2)||β||^2 + (C/2) Σ_{i=1}^{N} W_ii ||ξ_i||^2
Subject to: h(x_i) β = y_i^T - ξ_i^T, i = 1,2,…,N
where W is an N x N diagonal matrix in which each main diagonal element W_ii corresponds to one sample x_i, so samples of different classes are automatically assigned different weights, and C is the regularization coefficient;
a Lagrange function is defined to solve this quadratic programming problem under the KKT optimality conditions, which is equivalent to solving:
L = (1/2)||β||^2 + (C/2) Σ_{i=1}^{N} W_ii ||ξ_i||^2 - Σ_{i=1}^{N} α_i (h(x_i) β - y_i^T + ξ_i^T)
where the α_i are Lagrange multipliers, all non-negative;
the corresponding KKT optimality conditions follow by setting the partial derivatives of L with respect to β, ξ_i and α_i to zero;
the algorithm then solves the hidden-layer output weights as:
β = H^T (I/C + W H H^T)^{-1} W T
the weighting scheme employs the sample weight distribution D_t from step S2.5;
When the hidden-layer feature mapping h(x) is unknown, the kernel matrix is defined as:
Ω_ELM = H H^T, with Ω_ELM(i,j) = h(x_i) · h(x_j) = K(x_i, x_j), i = 1,2,…,N; j = 1,2,…,N
this kernel function K(·,·) must satisfy Mercer's condition; the output expression is then written in terms of the kernel, replacing h(x) H^T by the row vector [K(x,x_1),…,K(x,x_N)] and H H^T by Ω_ELM:
therefore the hidden-layer feature mapping of the ELM can remain unknown, and the number L of hidden-layer neurons need not be set;
the final output equation of the kernel-based weighted extreme learning machine is:
f(x) = [K(x,x_1),…,K(x,x_N)] (I/C + W Ω_ELM)^{-1} W T
where I is the identity matrix, C is the regularization coefficient, W is the weighting matrix, T is the target output matrix, and Ω_ELM is the kernel matrix;
in summary, the kernel-based weighted extreme learning machine training algorithm proceeds as follows:
S2.2.1, assign each sample its weight according to the weighting scheme and form the weighting matrix W;
s2.2.2, compute the kernel matrix Ω_ELM from the kernel function;
S2.2.3, compute the output result f(x) of the network.
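The three steps S2.2.1-S2.2.3 can be sketched with an RBF kernel (the kernel choice, one-hot targets, and function names are assumptions); the solve implements f(x) = [K(x,x_1),…,K(x,x_N)] (I/C + W Ω_ELM)^{-1} W T:

```python
import numpy as np

def rbf_kernel(A, B, gamma):
    """K(a, b) = exp(-gamma * ||a - b||^2)."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def wkelm_train(X, T_onehot, W_diag, C, gamma):
    """Weighted kernel ELM: solve alpha = (I/C + W Omega)^{-1} W T so
    that f(x) = [K(x, x_1), ..., K(x, x_N)] alpha."""
    N = len(X)
    Omega = rbf_kernel(X, X, gamma)         # kernel matrix Omega_ELM
    W = np.diag(W_diag)                     # N x N weighting matrix
    alpha = np.linalg.solve(np.eye(N) / C + W @ Omega, W @ T_onehot)
    return alpha

def wkelm_predict(X_new, X_train, alpha, gamma):
    return rbf_kernel(X_new, X_train, gamma) @ alpha  # class scores f(x)
```

The predicted class of a new sample is the argmax of its score vector, matching the one-output-node-per-class convention used above.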
In step S4, the number T of base classifiers of the ensemble classifier is set to 20, and the kernel width γ and regularization coefficient C of the base classifiers that give the best algorithm performance are found by grid parameter search: γ is searched over {2^{-18}, 2^{-18+step}, …, 2^{20}} with step = 0.5, and C over {2^{-18}, 2^{-18+step}, …, 2^{50}} with step = 0.5.
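The grid search over the stated γ and C ranges can be sketched as follows; `train_eval`, which should return the validation G-mean for one (γ, C) pair, is a hypothetical user-supplied hook:

```python
import numpy as np
from itertools import product

def grid_search(train_eval, step=0.5):
    """Exhaustive grid search: gamma in {2^-18, ..., 2^20} and
    C in {2^-18, ..., 2^50}, stepping the exponent by 0.5.
    `train_eval(gamma, C)` returns the score (e.g. G-mean) to maximize."""
    gammas = [2.0 ** e for e in np.arange(-18, 20 + step, step)]
    Cs = [2.0 ** e for e in np.arange(-18, 50 + step, step)]
    best = (None, None, -np.inf)
    for gamma, C in product(gammas, Cs):
        score = train_eval(gamma, C)
        if score > best[2]:
            best = (gamma, C, score)
    return best  # (best gamma, best C, best score)
```

With step = 0.5 this evaluates 77 x 137 parameter pairs, so the base-classifier training inside `train_eval` dominates the cost.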
Compared with the prior art, the invention has the following advantages and beneficial effects:
1. the method firstly introduces the unbalanced classification evaluation index G-mean into an Adaboost integrated classification algorithm which takes a weighted extreme learning machine as a base classifier, and provides a novel integrated algorithm base classifier weight value updating formula.
2. The method of the invention firstly provides an initial weight matrix updating formula based on G-mean, which is used for modeling a weighted extreme learning machine.
3. The invention adopts the classifier of the weighted extreme learning machine as the base classifier of the integrated learning algorithm, and can improve the learning speed of the classifier, thereby realizing the real-time and accurate monitoring of the running state of the sewage treatment plant.
4. The method can improve the overall classification accuracy of the sewage treatment fault diagnosis system, and especially the identification accuracy of the fault classes, which is of great significance for fault early warning and timely handling in the sewage treatment system.
5. The method can effectively ensure the stable operation of the sewage treatment plant and the sewage treatment quality, and reduce secondary pollution.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
Detailed Description
The present invention will be further described with reference to the following specific examples.
Referring to fig. 1, the integrated weighted extreme learning machine sewage treatment fault diagnosis method provided by this embodiment includes the following steps:
step S1, assign the initial weights of the weighted extreme learning machine base classifier. There are two weight initialization schemes; the first is an automatic weighting scheme: W1_ii = 1/n_k, where W_1 denotes the first weighting scheme and n_k is the number of training samples belonging to the class k of sample i.
The other weight initialization idea is to push the weight ratio of minority classes to majority classes toward 0.618:1, which essentially trades some classification accuracy on the majority classes for higher recognition accuracy on the minority classes: W2_ii = 0.618/n_k when class k is a majority class (n_k larger than the average class size) and W2_ii = 1/n_k otherwise, where W_2 denotes the second weighting scheme.
Step S2, training a base classifier:
s2.1, given a sewage sample set {(x_1,y_1),(x_2,y_2),…,(x_i,y_i),…,(x_N,y_N)}, where x_i ∈ X denotes the attribute vector of the i-th sample, y_i denotes the class label of the i-th sample, N is the total number of samples, and y_i ∈ Y = {1,2,…,k,…,K}, where k denotes the k-th class and K the total number of classes; set the number of base classifiers of the ensemble algorithm, denoted T;
s2.2, train on the training samples with a weighted kernel extreme learning machine as base classifier to obtain the training model h_t. For the t-th base classifier h_t, first obtain the recall R_1, R_2, …, R_k, …, R_K of each class (k is the k-th class, K is the total number of classes); then compute the sample count n_k of each class and the classification result A(x_i) of each sample: if sample x_i is classified correctly, A(x_i) = 1, and if it is misclassified, A(x_i) = -1; finally obtain G_mean = (R_1·R_2·…·R_K)^{1/K};
S2.3, if G_mean is less than or equal to 0.5, exit the iteration;
s2.4, compute the weight λ_t of the t-th base classifier according to the weight calculation formula of base classifier h_t; the smaller G_mean is (i.e. the larger the training error), the smaller λ_t and the smaller the proportion of the t-th base classifier in the whole ensemble algorithm, and vice versa;
s2.5, adjust the sample weight distribution D_{t+1} for the next iteration; the adjustment rule of D_{t+1} is as follows:
s2.6, let t = t + 1; if t < T, return to S2.2, otherwise end;
and finishing training the base classifier.
In the step S2.2, the modeling of the weighted kernel extreme learning machine specifically includes the following:
the extreme learning machine adopts the framework of a single-hidden-layer feedforward neural network (SLFN). Given N sewage treatment fault diagnosis training samples (x_1,y_1),(x_2,y_2),…,(x_N,y_N), the standard SLFN output model with L hidden nodes is represented as:
o_j = Σ_{i=1}^{L} β_i G(w_i · x_j + b_i), j = 1,2,…,N
where β_i denotes the output weight connecting the i-th hidden neuron to the output neurons, G is the hidden-layer neuron activation function, w_i denotes the input weights between the input layer and the i-th hidden neuron, b_i denotes the bias of the i-th hidden neuron, and o_j is the actual output at the output neurons for the j-th sample;
for the N sewage treatment fault diagnosis samples there exist (w_i,b_i) and β_i such that Σ_{i=1}^{L} β_i G(w_i · x_j + b_i) = y_j, j = 1,2,…,N, i.e. the single-hidden-layer feedforward network can fit the sample set with zero error; in matrix form this is written H β = T, where H is the hidden layer output matrix, β is the output weight matrix, and T is the target output matrix;
when the activation function G is differentiable, the SLFN parameters need not all be adjusted: the input weights w_i and hidden-layer biases b_i are selected randomly during network initialization and kept fixed during training, so training the SLFN is equivalent to finding the least-squares solution of the linear system H β = T, which can be cast as the optimization problem:
Minimize: ||H β - T||^2 and ||β||
The optimization problem is mathematically expressed as:
wherein, ξi=[ξi,1,…ξi,K]TIs a sewage treatment fault diagnosis training sample xiError between output value and true value of its corresponding output nodeDifference vector, Moore-Penrose generalized inverse matrix H output by hidden layer neurons+Can be solved to obtain:
the orthogonal projection method can be used to solve H^+ efficiently: when H^T H is nonsingular, H^+ = (H^T H)^{-1} H^T, and when H H^T is nonsingular, H^+ = H^T (H H^T)^{-1}. To give the resulting model better stability and generalization performance, a positive value 1/C is added to the diagonal of H^T H or H H^T when solving, obtaining:
β = H^T (I/C + H H^T)^{-1} T
where I denotes the identity matrix, and the corresponding output function is:
f(x) = h(x) H^T (I/C + H H^T)^{-1} T
or, when the diagonal term is added to H^T H:
β = (I/C + H^T H)^{-1} H^T T
and the corresponding ELM output function is:
f(x) = h(x) (I/C + H^T H)^{-1} H^T T
for better handling of imbalanced data, each sample is weighted so that samples belonging to different classes receive different weights, and the mathematical form of the above optimization problem is rewritten as:
Minimize: (1/2)||β||^2 + (C/2) Σ_{i=1}^{N} W_ii ||ξ_i||^2
Subject to: h(x_i) β = y_i^T - ξ_i^T, i = 1,2,…,N
where W is an N x N diagonal matrix in which each main diagonal element W_ii corresponds to one sample x_i, so samples of different classes are automatically assigned different weights, and C is the regularization coefficient;
a Lagrange function is defined to solve this quadratic programming problem under the KKT optimality conditions, which is equivalent to solving:
L = (1/2)||β||^2 + (C/2) Σ_{i=1}^{N} W_ii ||ξ_i||^2 - Σ_{i=1}^{N} α_i (h(x_i) β - y_i^T + ξ_i^T)
where the α_i are Lagrange multipliers, all non-negative;
the corresponding KKT optimality conditions follow by setting the partial derivatives of L with respect to β, ξ_i and α_i to zero;
the algorithm then solves the hidden-layer output weights as:
β = H^T (I/C + W H H^T)^{-1} W T
the weighting scheme employs the sample weight distribution D_t from step S2.5;
When the hidden-layer feature mapping h(x) is unknown, the kernel matrix is defined as:
Ω_ELM = H H^T, with Ω_ELM(i,j) = h(x_i) · h(x_j) = K(x_i, x_j), i = 1,2,…,N; j = 1,2,…,N
this kernel function K(·,·) must satisfy Mercer's condition; the output expression is then written in terms of the kernel, replacing h(x) H^T by the row vector [K(x,x_1),…,K(x,x_N)] and H H^T by Ω_ELM:
therefore the hidden-layer feature mapping of the ELM can remain unknown, and the number L of hidden-layer neurons need not be set;
the final output equation of the kernel-based weighted extreme learning machine is:
f(x) = [K(x,x_1),…,K(x,x_N)] (I/C + W Ω_ELM)^{-1} W T
where I is the identity matrix, C is the regularization coefficient, W is the weighting matrix, T is the target output matrix, and Ω_ELM is the kernel matrix;
in summary, the kernel-based weighted extreme learning machine training algorithm proceeds as follows:
s2.2.1, assign each sample its weight according to the weighting scheme and form the weighting matrix W;
s2.2.2, compute the kernel matrix Ω_ELM from the kernel function;
S2.2.3, compute the output result f(x) of the network.
Step S3, propose a novel base-classifier weight update formula for the ensemble algorithm, integrate a plurality of base classifiers with the weighted extreme learning machine as base classifier using the Adaboost iteration method, and establish the improved sewage fault diagnosis model; the steps and process are as follows:
s3.1, setting the number of the base classifiers of the integration algorithm and recording as T;
s3.2, determine the initial weight distribution D_1(i), i = 1,2,…,N, of the samples x_i according to the weight initialization method;
S3.3, train T base classifiers according to the method of S2, and compute the weight of each base classifier according to the base-classifier weight update formula;
s3.4, integrate the T base classifiers to obtain the sewage fault diagnosis model:
and finishing modeling of the sewage fault diagnosis model.
Step S4, set the number T of base classifiers of the ensemble classifier to 20, and find the kernel width γ and regularization coefficient C of the base classifiers that give the best algorithm performance by grid parameter search: γ is searched over {2^{-18}, 2^{-18+step}, …, 2^{20}} with step = 0.5, and C over {2^{-18}, 2^{-18+step}, …, 2^{50}} with step = 0.5.
The experimental simulation data come from the University of California Irvine (UCI) repository and are daily monitoring data of a sewage treatment plant. Each sample of the data set has 38 dimensions, 380 samples have all attribute values completely recorded, and 13 states of the monitored water body are distinguished in total, each state denoted by a number. To simplify the classification task, the samples are grouped into 4 broad classes according to their nature, as shown in table 1 below. In table 1, class 1 is the normal case, class 2 is the normal case with performance above average, class 3 is the normal case with low inflow, and class 4 is the fault case, covering secondary sedimentation tank failure, abnormal states caused by heavy rain, and solids overload. Class 1, the normal condition, has the most samples and constitutes the majority class, while classes 3 and 4, with few samples, are minority classes; after this simplification of the data categories the four classes are distributed in the ratio 39.6:14.6:8:1. Parameter optimization shows that the optimal parameters of the two weight initialization schemes adopted in this embodiment are: W1: (C = 2^{26.5}, γ = 2^{13}); W2: (C = 2^{27.5}, γ = 2^{13.5}).
Following the steps above, 3/4 of the sewage sample set, i.e. 285 groups of samples in total, is used as the training set in the simulation experiment; with the different weight initialization schemes, the final classification model is generated by ensemble iteration, and the remaining samples are used as the test set and fed into the model to obtain the final classification result, i.e. the sewage treatment fault diagnosis result. AdaG1WKELM denotes the algorithm with the W1 initial weight scheme, and AdaG2WKELM the algorithm with the W2 initial weight scheme.
TABLE 1. Sample category number distribution
TABLE 2. Comparison with traditional classification algorithms
TABLE 3. Comparison with current similar algorithms
Tables 2 and 3 compare the algorithms of the present invention (AdaG1WKELM and AdaG2WKELM) with traditional classification algorithms and with current similar research algorithms, respectively. The traditional classification algorithms comprise a back propagation neural network (BPNN), a support vector machine (SVM), a relevance vector machine (RVM), a fast relevance vector machine (Fast RVM), an extreme learning machine (ELM), and a kernel-based weighted extreme learning machine (K-WELM); the current similar research algorithms include B-PCA-CBPNN, WELM, and pre-processed Fast RVM. R1-acc, R2-acc, R3-acc and R4-acc denote the classification accuracy of each class, Total acc the overall classification accuracy, G-mean = (R1 × R2 × R3 × R4)^{1/4}, and Training time the model training time. The tables show that although AdaG1WKELM and AdaG2WKELM classify the majority-class samples somewhat less accurately than the other algorithms, their accuracy on the minority classes is higher, especially on the fourth class, i.e. the fault class, and their overall G-mean and overall accuracy are the highest. The algorithms are therefore well suited to classifying imbalanced data sets. In conclusion, the G-mean-based integrated extreme learning machine fault diagnosis method can accurately identify faults that may occur in the sewage treatment process and strengthens the fault handling capacity of the sewage treatment plant.
The above-mentioned embodiments are merely preferred embodiments of the present invention, and the scope of the invention is not limited thereto; changes made according to the shape and principle of the present invention shall be covered by the protection scope of the invention.
Claims (3)
1. An improved integrated weighted extreme learning machine sewage treatment fault diagnosis method, characterized by comprising the following steps:
s1, assign the initial weights of the weighted extreme learning machine base classifier using an assignment formula biased toward minority-class samples;
there are two weight initialization schemes; the first is an automatic weighting scheme: W1_ii = 1/n_k, where W_1 denotes the first weighting scheme and n_k is the number of training samples belonging to the class k of sample i;
the other weight initialization idea is to push the weight ratio of minority classes to majority classes toward 0.618:1, which essentially trades some classification accuracy on the majority classes for higher recognition accuracy on the minority classes: W2_ii = 0.618/n_k when class k is a majority class (n_k larger than the average class size) and W2_ii = 1/n_k otherwise, where W_2 denotes the second weighting scheme;
s2, train the base classifiers: compute the recall and the performance evaluation index G-mean of the previous base classifier, and, using a G-mean-based initial weight matrix update formula, adjust the weight matrix of the next weighted extreme learning machine base classifier and establish the base classifier model, as follows:
s2.1, given a sewage sample set {(x_1,y_1),(x_2,y_2),…,(x_i,y_i),…,(x_N,y_N)}, where x_i ∈ X denotes the attribute vector of the i-th sample, y_i denotes the class label of the i-th sample, N is the total number of samples, and y_i ∈ Y = {1,2,…,k,…,K}, where k denotes the k-th class and K the total number of classes; set the number of base classifiers of the ensemble algorithm, denoted T;
S2.2, training the training samples by using a weighted kernel extreme learning machine as the base classifier to obtain a training model ht(x). For the t-th base classifier ht(x), first obtain the recall rate of each class, R1, R2, …, Rk, …, RK, where k is the k-th class and K is the total number of classes; then record the number of samples nk of each class and the classification result A(xi) of each sample: if sample xi is classified correctly, A(xi) = 1; if incorrectly, A(xi) = -1; finally, compute G_mean = (R1·R2···RK)^(1/K);
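The per-class recall and G_mean computation of S2.2 can be sketched as follows — a minimal NumPy sketch; `g_mean` is a hypothetical helper name, not from the patent:

```python
import numpy as np

def g_mean(y_true, y_pred):
    """Per-class recall R_k, then G_mean = (R_1 * R_2 * ... * R_K) ** (1/K)."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    classes = np.unique(y_true)
    # R_k = fraction of class-k samples that were predicted as class k
    recalls = [np.mean(y_pred[y_true == k] == k) for k in classes]
    return float(np.prod(recalls) ** (1.0 / len(classes)))
```

Because G_mean is a geometric mean, a single class with zero recall drives it to zero — which is why it is a stricter criterion than overall accuracy on unbalanced data.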
S2.3, if G _ mean is less than or equal to 0.5, exiting iteration;
S2.4, computing the weight λt of the t-th base classifier according to the weight calculation formula of base classifier ht(x): the smaller G_mean is (i.e., the larger the training error), the smaller λt is, meaning the smaller the weight of the t-th base classifier in the whole integrated algorithm, and vice versa;
S2.5, adjusting weight distribution D of next iteration of samplet+1,Dt+1The adjustment rule of (2) is as follows:
S2.6, letting t = t + 1; if t < T, returning to S2.2; otherwise, ending;
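The exit test of S2.3 and the monotonic behaviour of λt described in S2.4 can be sketched together. The patent's exact formula for λt is not reproduced in this text, so the AdaBoost-style expression 0.5·ln(G/(1−G)) below is an assumed stand-in that has the stated behaviour (larger G_mean gives a larger classifier weight); `classifier_weight` is a hypothetical helper name:

```python
import math

def classifier_weight(g_mean_t):
    """Weight of the t-th base classifier, or None if iteration should stop."""
    if g_mean_t <= 0.5:
        return None                      # S2.3: exit the iteration
    # Assumed AdaBoost-style stand-in, NOT the patent's exact formula:
    # monotonically increasing in G_mean, zero at G_mean = 0.5.
    return 0.5 * math.log(g_mean_t / (1.0 - g_mean_t))
```

Any formula with the same monotonicity would serve the role described in S2.4; the ensemble's final decision is then a λ-weighted vote of the T base classifiers.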
S3, providing a classifier weight updating formula based on the integration algorithm, integrating a plurality of base classifiers by using the weighted extreme learning machine as the base classifier and the Adaboost iteration method, and establishing an improved sewage fault diagnosis model; the steps are as follows:
s3.1, setting the number of the base classifiers of the integration algorithm and recording as T;
S3.2, determining the initial weight distribution D1(i), i = 1, 2, …, N, of the samples xi according to the weight initialization method;
S3.3, training T base classifiers according to the method of S2, and calculating the weight of each base classifier according to the base classifier weight updating formula;
S4, inputting sample data generated in the sewage treatment process, setting the number T of base classifiers of the integrated algorithm, setting the optimal kernel width γ of the base classifiers and the corresponding optimal regularization coefficient C, establishing the fault diagnosis model of the sewage treatment system, and carrying out a performance test.
2. The improved integrated weighted extreme learning machine sewage treatment fault diagnosis method as claimed in claim 1, wherein in step S2.2 the modeling of the weighted kernel extreme learning machine is specifically as follows:
the extreme learning machine adopts the framework of a single-hidden-layer feedforward neural network (SLFN). Given N sewage treatment fault diagnosis training samples (x1,y1),(x2,y2),…,(xN,yN), the standard SLFN output model with L hidden nodes is represented as follows:

Σ_{i=1}^{L} βi G(wi·xj + bi) = oj, j = 1, 2, …, N
where βi represents the output weight connecting the i-th hidden neuron and the output neurons, G is the hidden layer neuron activation function, wi represents the input weight between the input layer and the i-th hidden neuron, bi represents the bias of the i-th hidden neuron, and oj is the actual output value at the output neurons for the j-th sample;
for the N sewage treatment fault diagnosis samples, there exist (wi, bi) and βi such that the SLFN model approximates the sample set with zero error, i.e., Σ_{j=1}^{N} ||oj − yj|| = 0; in other words, the single-hidden-layer feedforward neural network can fit the samples without error:

Σ_{i=1}^{L} βi G(wi·xj + bi) = yj, j = 1, 2, …, N

which is expressed compactly as Hβ = T, where:
where H is the hidden layer output matrix, β is the output weight matrix, and T is the output layer output matrix;
when the activation function G is a differentiable function, the SLFN parameters need not all be adjusted: the input weights wi and hidden layer biases bi are randomly selected during network parameter initialization and kept unchanged during training. Training the SLFN is then equivalent to finding the least-squares solution of the linear system Hβ = T, which can be converted into the following optimization problem:
Minimize: ||Hβ − T||² and ||β||
The optimization problem is mathematically expressed as:

Minimize: L = (1/2)||β||² + (C/2) Σ_{i=1}^{N} ||ξi||²

Subject to: h(xi)β = yiᵀ − ξiᵀ, i = 1, 2, …, N
where ξi = [ξi,1, …, ξi,K]ᵀ is the error vector between the output values of the corresponding output nodes and the true values for sewage treatment fault diagnosis training sample xi; the least-squares solution can be obtained through the Moore–Penrose generalized inverse H⁺ of the hidden layer output matrix, β = H⁺T:
the orthogonal projection method can be used to effectively solve H⁺: when HᵀH is a nonsingular matrix, H⁺ = (HᵀH)⁻¹Hᵀ; when HHᵀ is nonsingular, H⁺ = Hᵀ(HHᵀ)⁻¹. In order to obtain better stability and generalization performance of the model, a positive value I/C is added to the diagonal elements of HᵀH or HHᵀ when solving, obtaining:
β = Hᵀ(I/C + HHᵀ)⁻¹T, where I denotes the identity matrix, and the corresponding output function is:

f(x) = h(x)β = h(x)Hᵀ(I/C + HHᵀ)⁻¹T

or, when:

β = (I/C + HᵀH)⁻¹HᵀT

the corresponding ELM output function is:

f(x) = h(x)(I/C + HᵀH)⁻¹HᵀT
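The two regularized solutions above are algebraically equivalent (by the push-through identity Hᵀ(I/C + HHᵀ)⁻¹ = (I/C + HᵀH)⁻¹Hᵀ); a minimal numerical check of this, assuming nothing beyond NumPy and arbitrary small test matrices:

```python
import numpy as np

np.random.seed(1)
H = np.random.randn(6, 4)   # N = 6 samples, L = 4 hidden neurons
T = np.random.randn(6, 3)   # 3 output neurons
C = 10.0                    # regularization coefficient

# beta = (I/C + H^T H)^(-1) H^T T   (L x L system)
beta1 = np.linalg.solve(np.eye(4) / C + H.T @ H, H.T @ T)
# beta = H^T (I/C + H H^T)^(-1) T   (N x N system)
beta2 = H.T @ np.linalg.solve(np.eye(6) / C + H @ H.T, T)
```

In practice one solves whichever system is smaller: the L×L form when the hidden layer is narrow, the N×N form when there are few samples (and, as below, when only a kernel matrix is available).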
to better handle unbalanced data, each sample is weighted such that samples belonging to different classes receive different weights, so the mathematical form of the above optimization problem is rewritten as:

Minimize: L = (1/2)||β||² + (C/2) Σ_{i=1}^{N} Wii ||ξi||²

Subject to: h(xi)β = yiᵀ − ξiᵀ, i = 1, 2, …, N
where W is an N×N diagonal matrix; each main diagonal element Wii corresponds to one sample xi, so samples of different classes are automatically assigned different weights; C is the regularization coefficient;
according to the KKT optimality conditions, a Lagrange function is defined to solve the quadratic programming problem, which is equivalent to solving the following formula:

L = (1/2)||β||² + (C/2) Σ_{i=1}^{N} Wii ||ξi||² − Σ_{i=1}^{N} αi (h(xi)β − yiᵀ + ξiᵀ)
where the αi are the Lagrange multipliers, all non-negative;
the corresponding KKT optimality conditions are as follows:

∂L/∂β = 0 → β = Hᵀα; ∂L/∂ξi = 0 → αi = C·Wii·ξi; ∂L/∂αi = 0 → h(xi)β − yiᵀ + ξiᵀ = 0, i = 1, 2, …, N
the algorithm solves the hidden layer output weights as:

β = Hᵀ(I/C + WHHᵀ)⁻¹WT
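This solve can be sketched directly, assuming a dense hidden-layer output matrix H; `welm_beta` is a hypothetical helper name, not from the patent:

```python
import numpy as np

def welm_beta(H, T, W, C):
    """Hidden-layer output weights: beta = H^T (I/C + W H H^T)^(-1) W T."""
    N = H.shape[0]
    # Solve the N x N linear system instead of forming an explicit inverse.
    return H.T @ np.linalg.solve(np.eye(N) / C + W @ H @ H.T, W @ T)
```

As C grows, the regularization term I/C vanishes and the weighted model interpolates the training targets (when HHᵀ is nonsingular); smaller C trades training error for smoother output weights.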
the weighting scheme employs the sample weight distribution Dt from step S2.5;
When the hidden layer feature map h (x) is unknown, the kernel matrix is defined as follows:
Ω_ELM = HHᵀ: Ω_ELM(i,j) = h(xi)·h(xj) = K(xi,xj), i = 1, 2, …, N; j = 1, 2, …, N
this kernel function K(·,·) needs to satisfy the Mercer condition, and the output expression is then written as:

f(x) = [K(x,x1), …, K(x,xN)] (I/C + Ω_ELM)⁻¹T
therefore, the hidden layer feature mapping of the ELM can be kept unknown, and meanwhile, the number L of the hidden layer neurons does not need to be set;
the final output equation of the weighted extreme learning machine based on the kernel function is as follows:

f(x) = [K(x,x1), …, K(x,xN)] (I/C + WΩ_ELM)⁻¹WT
where I is the identity matrix, C is the regularization coefficient, W is the weighting matrix, T is the output layer matrix, and Ω_ELM is the kernel matrix;
in summary, the process of the weighted extreme learning machine training algorithm based on the kernel function is as follows:
S2.2.1, assigning each sample a weight according to the weighting scheme, and calculating the weighting matrix W;
S2.2.2, calculating the kernel matrix Ω_ELM according to the kernel function;
S2.2.3, calculating the output result f(x) of the network.
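The three-step training process above can be sketched end to end. The RBF kernel below is an assumption (suggested by the "kernel width γ" in claims 1 and 3 but not fixed by this passage), and `kwelm_fit_predict` is a hypothetical helper name:

```python
import numpy as np

def rbf_kernel(X1, X2, gamma):
    """K(x, x') = exp(-gamma * ||x - x'||^2)."""
    d2 = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def kwelm_fit_predict(X, T, w_diag, C, gamma, X_test):
    """f(x) = [K(x,x_1), ..., K(x,x_N)] (I/C + W * Omega_ELM)^(-1) W T."""
    N = X.shape[0]
    W = np.diag(w_diag)                          # S2.2.1: weighting matrix
    omega = rbf_kernel(X, X, gamma)              # S2.2.2: kernel matrix
    alpha = np.linalg.solve(np.eye(N) / C + W @ omega, W @ T)
    return rbf_kernel(X_test, X, gamma) @ alpha  # S2.2.3: network output
```

With one-hot targets T, the predicted class of a test sample is the argmax over the output neurons; note that neither an explicit feature map h(x) nor the hidden-layer size L appears anywhere in the computation, matching the remark above.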
3. The improved integrated weighted extreme learning machine sewage treatment fault diagnosis method according to claim 1, characterized in that: in step S4, the number T of base classifiers of the ensemble classifier is set to 20, and the kernel width γ and regularization coefficient C of the base classifiers that give the optimal algorithm performance are found by grid parameter optimization, where the optimization range of γ is {2^-18, 2^(-18+step), …, 2^20}, step = 0.5, and the optimization range of C is {2^-18, 2^(-18+step), …, 2^50}, step = 0.5.
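The grid parameter optimization of claim 3 can be sketched as follows; `evaluate` is a hypothetical stand-in for training the ensemble with a given (γ, C) and returning its score (e.g. G-mean), and is not part of the patent:

```python
import numpy as np

def grid_search(evaluate):
    """Exhaustive search over gamma in {2^-18 .. 2^20}, C in {2^-18 .. 2^50}, step 0.5."""
    gammas = 2.0 ** np.arange(-18, 20 + 0.5, 0.5)   # 77 candidate kernel widths
    Cs = 2.0 ** np.arange(-18, 50 + 0.5, 0.5)       # 137 candidate coefficients
    best = (-np.inf, None, None)                    # (score, gamma, C)
    for g in gammas:
        for C in Cs:
            score = evaluate(g, C)
            if score > best[0]:
                best = (score, g, C)
    return best
```

In practice each `evaluate` call is a full cross-validated training run, so the 77×137 grid is the dominant cost of model selection; a coarser step can be used for a first pass.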
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710654311.3A CN107688825B (en) | 2017-08-03 | 2017-08-03 | Improved integrated weighted extreme learning machine sewage treatment fault diagnosis method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107688825A CN107688825A (en) | 2018-02-13 |
CN107688825B true CN107688825B (en) | 2020-02-18 |
Family
ID=61153142
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710654311.3A Expired - Fee Related CN107688825B (en) | 2017-08-03 | 2017-08-03 | Improved integrated weighted extreme learning machine sewage treatment fault diagnosis method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107688825B (en) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109190280A (en) * | 2018-09-18 | 2019-01-11 | 东北农业大学 | A kind of pollution source of groundwater inverting recognition methods based on core extreme learning machine alternative model |
CN109558893B (en) * | 2018-10-31 | 2022-12-16 | 华南理工大学 | Rapid integrated sewage treatment fault diagnosis method based on resampling pool |
CN109492710B (en) * | 2018-12-07 | 2021-07-13 | 天津智行瑞祥汽车科技有限公司 | New energy automobile fault detection auxiliary method |
CN109739209A (en) * | 2018-12-11 | 2019-05-10 | 深圳供电局有限公司 | A kind of electric network failure diagnosis method based on Classification Data Mining |
CN109858564B (en) * | 2019-02-21 | 2023-05-05 | 上海电力学院 | Improved Adaboost-SVM model generation method suitable for wind power converter fault diagnosis |
CN110084291B (en) * | 2019-04-12 | 2021-10-22 | 湖北工业大学 | Student behavior analysis method and device based on big data extreme learning |
CN110363230B (en) * | 2019-06-27 | 2021-07-20 | 华南理工大学 | Stacking integrated sewage treatment fault diagnosis method based on weighted base classifier |
CN111160457B (en) * | 2019-12-27 | 2023-07-11 | 南京航空航天大学 | Scroll engine fault detection method based on soft-class extreme learning machine |
CN112257942B (en) * | 2020-10-29 | 2023-11-14 | 中国特种设备检测研究院 | Stress corrosion cracking prediction method and system |
CN112183676A (en) * | 2020-11-10 | 2021-01-05 | 浙江大学 | Water quality soft measurement method based on mixed dimensionality reduction and kernel function extreme learning machine |
CN113323823B (en) * | 2021-06-08 | 2022-10-25 | 云南大学 | AWKELM-based fan blade icing fault detection method and system |
CN113551904B (en) * | 2021-06-29 | 2023-06-30 | 西北工业大学 | Gear box multi-type concurrent fault diagnosis method based on hierarchical machine learning |
CN113965449B (en) * | 2021-09-28 | 2023-04-18 | 南京航空航天大学 | Method for improving fault diagnosis accuracy rate of self-organizing cellular network based on evolution weighted width learning system |
CN114154701A (en) * | 2021-11-25 | 2022-03-08 | 南方电网数字电网研究院有限公司 | Power failure prediction method and device based on weighted extreme learning machine |
CN114492164A (en) * | 2021-12-24 | 2022-05-13 | 吉林大学 | Organic pollutant migration numerical model substitution method based on multi-core extreme learning machine |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103473598A (en) * | 2013-09-17 | 2013-12-25 | 山东大学 | Extreme learning machine based on length-changing particle swarm optimization algorithm |
KR20140127061A (en) * | 2013-04-24 | 2014-11-03 | 주식회사 지넬릭스 | Oral Hygiene functional composition and a method of manufacturing |
CN105631477A (en) * | 2015-12-25 | 2016-06-01 | 天津大学 | Traffic sign recognition method based on extreme learning machine and self-adaptive lifting |
CN105740619A (en) * | 2016-01-28 | 2016-07-06 | 华南理工大学 | On-line fault diagnosis method of weighted extreme learning machine sewage treatment on the basis of kernel function |
CN106874934A (en) * | 2017-01-12 | 2017-06-20 | 华南理工大学 | Sewage disposal method for diagnosing faults based on weighting extreme learning machine Integrated Algorithm |
Non-Patent Citations (2)
Title |
---|
"Boosting weighted ELM for imbalanced learning"; Li K et al.; Neurocomputing; 2013-10-25; full text *
"Research on imbalanced fuzzy weighted extreme learning machine and its ensemble methods"; Yao Qiaobing; China Master's Theses Full-text Database, Information Science and Technology; 2017-03-15 (No. 03, 2017); full text *
Also Published As
Publication number | Publication date |
---|---|
CN107688825A (en) | 2018-02-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107688825B (en) | Improved integrated weighted extreme learning machine sewage treatment fault diagnosis method | |
CN105740619B (en) | Weighting extreme learning machine sewage disposal on-line fault diagnosis method based on kernel function | |
CN106874581B (en) | Building air conditioner energy consumption prediction method based on BP neural network model | |
CN108228716B (en) | SMOTE _ Bagging integrated sewage treatment fault diagnosis method based on weighted extreme learning machine | |
CN106874934A (en) | Sewage disposal method for diagnosing faults based on weighting extreme learning machine Integrated Algorithm | |
CN108445752B (en) | Random weight neural network integrated modeling method for self-adaptively selecting depth features | |
CN106600059A (en) | Intelligent power grid short-term load predication method based on improved RBF neural network | |
CN111427750B (en) | GPU power consumption estimation method, system and medium of computer platform | |
CN108805193B (en) | Electric power missing data filling method based on hybrid strategy | |
CN110009030B (en) | Sewage treatment fault diagnosis method based on stacking meta-learning strategy | |
CN110363230B (en) | Stacking integrated sewage treatment fault diagnosis method based on weighted base classifier | |
CN106778838A (en) | A kind of method for predicting air quality | |
CN109284662B (en) | Underwater sound signal classification method based on transfer learning | |
CN111985845B (en) | Node priority optimization method of heterogeneous Spark cluster | |
CN106296434B (en) | Grain yield prediction method based on PSO-LSSVM algorithm | |
CN113379116A (en) | Cluster and convolutional neural network-based line loss prediction method for transformer area | |
CN108805206A (en) | A kind of modified LSSVM method for building up for analog circuit fault classification | |
CN108763418A (en) | A kind of sorting technique and device of text | |
CN108074011A (en) | The monitoring method and system of a kind of sludge discharge | |
CN107544447A (en) | A kind of chemical process Fault Classification based on core study | |
Tucci et al. | Adaptive FIR neural model for centroid learning in self-organizing maps | |
CN113296947B (en) | Resource demand prediction method based on improved XGBoost model | |
CN107766887A (en) | A kind of local weighted deficiency of data mixes clustering method | |
CN114814707A (en) | Intelligent ammeter stress error analysis method, equipment, terminal and readable medium | |
Yang et al. | An improved probabilistic neural network with ga optimization |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20200218 |