CN111898313B

CN111898313B - Fault detection method based on ICA and SVM integrated learning

Info

Publication number: CN111898313B
Application number: CN202010612207.XA
Authority: CN
Inventors: 凡时财; 邹见效; 张季阳; 徐红兵
Original assignee: University of Electronic Science and Technology of China
Current assignee: University of Electronic Science and Technology of China
Priority date: 2020-06-30
Filing date: 2020-06-30
Publication date: 2022-05-20
Anticipated expiration: 2040-06-30
Also published as: CN111898313A

Abstract

The invention discloses a fault detection method based on ICA and SVM integrated learning, which integrates an ICA model with characteristic extraction capability and an SVM model with classification capability by adopting Bayesian inference and utilizes different advantages of different models, thereby improving detection accuracy and ensuring more stable detection effect.

Description

Fault detection method based on ICA and SVM integrated learning

Technical Field

The invention relates to the field of fault detection in industrial processes, in particular to a fault detection method based on ICA and SVM integrated learning.

Background

Modern industrial production is more and more scaled and complicated, and if the production process breaks down, the product quality is influenced, and the life safety of people is threatened more easily. Fault detection techniques are therefore often employed to monitor industrial process conditions.

In the prior art, the measurement device is arranged to acquire the working data of the industrial production system to detect the fault in the industrial process, but the measurement device can only acquire the working data of the industrial production system and cannot directly draw a conclusion about whether the fault occurs, so how to perform fault detection according to the data acquired by the measurement device and improve the accuracy of fault detection become important research points in the field.

Disclosure of Invention

In order to overcome the defects in the prior art, the fault detection method based on ICA and SVM integrated learning provided by the invention can accurately and quickly detect faults through data acquired by measuring equipment.

In order to achieve the purpose of the invention, the invention adopts the technical scheme that:

a fault detection method based on ICA and SVM integrated learning is provided, which comprises the following steps:

s1, acquiring measurement data of the target industrial production system by the measurement equipment under different working conditions of the target industrial production system, listing the measurement data at different moments under the same working condition into a matrix, and taking each matrix as an initial sample set;

s2, standardizing the initial sample set under the normal working condition to obtain a standardized sample set under the normal working condition;

s3, constructing an ICA model based on the sample set under the standardized normal working condition;

s4, obtaining the number of SVM models to be constructed, randomly extracting samples from each initial sample set to form a sample subset of each working condition for each SVM model, and standardizing each sample subset to obtain a training subset corresponding to each SVM model;

s5, constructing a corresponding SVM model based on each training subset;

s6, acquiring measurement data of the current measurement equipment on the target industrial production system, and standardizing the measurement data to be used as a detection sample;

s7, respectively taking the detection samples as the input of all ICA models and SVM models, and correspondingly obtaining the output of each ICA model and the output of each SVM model;

s8, integrating the outputs of all ICA models and SVM models corresponding to the detection samples through Bayesian inference to calculate the integration probability value;

and S9, judging whether the integrated probability value is larger than or equal to a threshold value, if so, judging that the detection sample is a fault, otherwise, judging that the detection sample is normal, and finishing fault detection.

Further, the specific method of step S1 is:

the method comprises the steps that K measuring devices are adopted to simultaneously obtain measuring data of a target industrial production system under C working conditions, the measuring data corresponding to the K measuring devices at the same time are used as a sample, the measuring data at different times under the same working condition are listed into a matrix, and each matrix is used as an initial sample set to obtain C initial sample sets; i.e. each sample in each initial set of samples has K elements.

Further, the specific method of step S2 is:

for an initial sample set X under normal operating conditions₀＝[x₀(1),x₀(2),...,x₀(r),...,x₀(n₀)]According to the formula:

obtaining the l-th sample of the r-th sample_xNormalized value of individual element

Further obtaining the values of all elements after standardization, completing the standardization of the initial sample set under the normal working condition, and obtaining the standardized sample set under the normal working condition; wherein x₀(r)(l_x) For the l-th sample in the r-th sample_xA value of an element; mean (X)₀(l_x) Is the l-th sample of each sample in the initial sample set under normal conditions_xA mean of the individual elements; std (X)₀(l_x) Is the l-th sample of each sample in the initial sample set under normal conditions_xStandard deviation of individual elements; n is₀The total number of samples in the initial sample set under the normal working condition.

Further, the specific method of step S3 includes the following sub-steps:

s3-1, sample set under normal working condition after standardization

Whitening to obtain a whitening transformation matrix Q_PACAnd according to the formula:

obtaining a whitening matrix Z;

s3-2, construction of Q_ICAAn ICA model, for the ith ICA model, according to the formula:

S_i＝B_iZ

different random number seeds are set, and a first demixing matrix B of the ith ICA model is obtained by adopting a FastICA algorithm_iAnd independent vector matrix S_iFurther, a first unmixing matrix and an independent vector matrix of each ICA model are obtained;

s3-3, according to the formula:

obtaining a second unmixing matrix W of the ith ICA model_iFurther obtaining a second unmixing matrix of each ICA model; wherein (·)^TRepresents a transpose of a matrix;

s3-4, sequentially selecting d row vectors from the row vectors of the second unmixing matrix of the ith ICA model according to the sequence of vector norm from large to small to form a combined matrix W of the ith ICA model_d,iFurther obtaining a combination matrix of each ICA model;

s3-5, according to the formula:

obtaining a set of normalized samples under normal conditions

Middle (r) th sample

Based on the ithStatistic value of ICA model

Further obtaining a standardized sample set under normal working conditions

All the samples in the statistical quantity combination based on the ith ICA model

And obtaining a standardized sample set under normal working conditions

All samples in (a) are based on a combination of statistics for each ICA model; wherein n is₀The total number of samples in the initial sample set under the normal working condition;

s3-6, obtaining statistic combination of ith ICA model by adopting kernel density estimation method KDE

Probability density of

And according to the formula:

obtaining a solving interval [ start ] corresponding to the ith ICA model_i,end_i]Further obtaining probability density functions and solving intervals of all ICA model statistic combinations; where min (-) denotes taking the minimum value and max (-) denotes taking the maximum value;

s3-7, solving interval [ start ] corresponding to ith ICA model_i,end_i]Is equidistantly divided into num_iIndividual subareaAnd according to the formula:

acquiring the number k of accumulated subintervals corresponding to the ith ICA model_iFurther obtaining the cumulative number of subintervals corresponding to each ICA model; wherein Δ_iThe subinterval width corresponding to the ith ICA model;

statistical combination representing ith ICA model

In that

The probability density of (d); α is the confidence of the control limit; xi is a constant;

s3-8, according to the formula:

UCL_i＝start_i+k_iΔ_i

obtaining the control limit UCL of the ith ICA model_iFurther obtaining the control limit of each ICA model;

and S3-9, regarding any ICA model, taking the ratio of the statistic value of the input sample to the control limit as the output of the ICA model, and completing the construction of the ICA model.

Further, the specific method of step S4 includes the following sub-steps:

s4-1, obtaining the quantity Q of SVM models to be constructed_SVMFor the jth SVM model, m is randomly drawn from each initial sample set without replacement_c,jEach sample constitutes a sample subset Y for each condition_c,j(ii) a Wherein m is_c,j＝int(n_c,j×rate_j) Int (·) denotes that only the integer part of the computation result is retained, n_c,jIs the total number of samples in the initial sample set under the c-th working condition, when c is 0, the normal working condition is indicated, and rate_jFor the extraction ratio corresponding to the jth SVM model, 0.0<rate_j<1.0；

S4-2, according to the formula:

obtaining a sample subset Y_c,jIth sample of (e)_yNormalized value of individual element

Further obtaining a sample subset Y_c,jThe normalized values of all the elements in the sample subset Y are completed_c,jTo obtain a training subset corresponding to the jth SVM model

Further obtaining a training subset corresponding to each SVM model; wherein

As a subset of samples Y_c,jNormalized results of the e sample; y is_c,j(e)(l_y) As a subset of samples Y_c,jIth sample of (e)_yAn element; mean (Y)_0,j(l_y) ) is sample subset Y under normal operating conditions_0,jOf each sample_yA mean of the individual elements; std (Y)_0,j(l_y) ) is sample subset Y under normal operating conditions_0,jOf each sample_yStandard deviation of individual elements.

Further, the specific method of step S5 includes the following sub-steps:

s5-1, for the jth SVM model, setting the sample labels under the normal working condition in the training subset corresponding to the jth SVM model as-1, and setting the sample labels under the fault working condition in the training subset corresponding to the jth SVM model as 1;

s5-2, obtaining the intercept coefficient tau of the jth SVM model_jThe u th training subset corresponding to the j th SVM model_jOne sample h_j(u_j) Corresponding specific gravity coefficient phi_j(u_j)；

S5-3, according to the formula:

K(h_j(u_j),h)＝exp(-γ||h_j(u_j)-h||²)

acquiring a hyperplane equation of a jth SVM model; wherein hp_j(h) Representing the value of the hyperplane equation of the jth SVM model when the detection sample is a sample h;

m_c,jthe number of samples in the sample subset of the c working condition in the training subset corresponding to the jth SVM model is determined; label_j(u_j) For the u th training subset corresponding to the j th SVM model_jOne sample h_j(u_j) A tag value of (a); exp (·) is an exponential function; gamma is a hyperparameter, i.e., a constant; i | · | purple wind²Is the square of the vector two norm; k (-) is a radial basis function;

s5-4, according to the formula:

obtaining the output of the jth SVM model when the detection sample is the sample h

For each SVM model, twice of the value of the hyperplane equation corresponding to the detection sample after being activated by the sigmoid function is used as the output of the SVM model, and the construction of all SVM models is completed.

Further, the specific method of step S5-2 is:

for the jth SVM model, by solving an optimization problem:

0≤φ_j(u_j)≤ξ',u_j＝1,2,…,M_j

obtaining the u-th training subset corresponding to the training subset_jOne sample h_j(u_j) Corresponding specific gravity coefficient phi_j(u_j) (ii) a Wherein s.t. represents a constraint; h is_j(u_i) Representing the u th training subset corresponding to the j th SVM model_iA sample is obtained; label_j(u_i) Is a sample h_j(u_i) A tag value of (a); xi 'is a penalty parameter, xi'>0；

Randomly selecting a sample h corresponding to the specific gravity coefficient which is more than 0 and less than xi_j(u_m) And sample h_j(u_m) Label of (1)_j(u_m) And according to the formula:

obtaining the intercept coefficient tau of the jth SVM model_j。

Further, the specific method of step S8 includes the following sub-steps:

s8-1, obtaining a detection sample x_newCorresponding all ICA model outputs and SVM model outputs to obtain the detection sample x_newCorresponding output matrix

Wherein

Representing the test sample x_newOutput on the ith ICA model;

representing the test sample x_newIn the first placeOutputs on the j SVM models;

s8-2, according to the formula:

P_q(x_new|N)＝exp(-v_new(q))

P_q(x_new|F)＝exp(-1/v_new(q))

separately obtaining detection samples x_newConditional probability P under normal operating conditions_q(x_newN) and conditional probability P under fault conditions_q(x_new| F); wherein N refers to normal working conditions, and F refers to fault working conditions; exp (·) is an exponential function; v. of_new(q) is the output matrix v_newThe qth value of (1);

s8-3, according to the formula:

P_q(x_new)＝P_q(x_new|N)α+P_q(x_new|F)(1-α)

obtaining a test sample x_newThe total probability P corresponding to the q-th value in the sequence_q(x_new) (ii) a Wherein α is the confidence of the control limit in the ICA model;

s8-4, according to the formula:

separately obtaining detection samples x_newPosterior probability P of q-th value under normal working condition_q(N|x_new) And posterior probability P under fault conditions_q(F|x_new)；

S8-5, according to the formula:

obtaining a test sample x_newIntegrated probability value P of_new(ii) a Wherein Q_SIs a stand forWith the output of the ICA model and the number of outputs of the SVM model, i.e. the output matrix v_newTotal number of middle elements.

Further, the threshold in step S9 is 1- α, where α is the confidence of the control limit in the ICA model.

Further, the number of construction of the ICA model and the SVM model is 3.

The invention has the beneficial effects that: according to the method, an ICA model with a feature extraction capability and an SVM model with a classification capability are integrated by Bayesian inference, and different advantages of different models are utilized, so that the detection accuracy is improved, and the detection effect is more stable.

Drawings

FIG. 1 is a schematic flow chart of the present invention.

Detailed Description

The following description of the embodiments of the present invention is provided to facilitate the understanding of the present invention by those skilled in the art, but it should be understood that the present invention is not limited to the scope of the embodiments, and it will be apparent to those skilled in the art that various changes may be made without departing from the spirit and scope of the invention as defined and defined in the appended claims, and all matters produced by the invention using the inventive concept are protected.

As shown in fig. 1, the fault detection method based on the ICA and SVM ensemble learning includes the following steps:

s3, constructing an ICA model based on the standardized sample set under the normal working condition;

s5, constructing a corresponding SVM model based on each training subset;

The specific method of step S1 is: the method comprises the steps that K measuring devices are adopted to simultaneously obtain measuring data of a target industrial production system under C working conditions, the measuring data corresponding to the K measuring devices at the same time are used as a sample, the measuring data at different times under the same working condition are listed into a matrix, and each matrix is used as an initial sample set to obtain C initial sample sets; i.e. each sample in each initial set of samples has K elements.

The specific method of step S2 is: for an initial sample set X under normal operating conditions₀＝[x₀(1),x₀(2),...,x₀(r),...,x₀(n₀)]According to the formula:

obtaining the l-th sample of the r-th sample_xNormalized value of each element

Further obtaining the values of all elements after standardization, and completing the normal working conditionStandardizing the initial sample set to obtain a standardized sample set under a normal working condition; wherein x₀(r)(l_x) For the l-th sample in the r-th sample_xA value of an element; mean (X)₀(l_x) Is the l-th sample of each sample in the initial sample set under normal conditions_xA mean of the individual elements; std (X)₀(l_x) Is the l-th sample of each sample in the initial sample set under normal conditions_xStandard deviation of individual elements; n is₀The total number of samples in the initial sample set under the normal working condition.

The specific method of step S3 includes the following substeps:

s3-1, sample set under normal working condition after standardization

Whitening processing is carried out to obtain a whitening transformation matrix Q_PACAnd according to the formula:

obtaining a whitening matrix Z;

S_i＝B_iZ

different random number seeds are set, and a FastICA algorithm is adopted to solve a first unmixing matrix B of an ith ICA model_iAnd independent vector matrix S_iFurther, a first unmixing matrix and an independent vector matrix of each ICA model are obtained;

s3-3, according to the formula:

s3-5, according to the formula:

obtaining a set of normalized samples under normal conditions

Middle (r) th sample

Statistic value based on ith ICA model

Further obtaining a standardized sample set under normal working conditions

And obtaining a standardized sample set under normal working conditions

Based on the statistic combination of each ICA model; wherein n is₀The total number of samples in the initial sample set under the normal working condition;

Probability density of

And according to the formula:

s3-7, solving interval [ start ] corresponding to ith ICA model_i,end_i]Is equidistantly divided into num_iSub-intervals and according to the formula:

statistical combination representing ith ICA model

At start_i+ξΔ_iThe probability density of (d); α is the confidence of the control limit; xi is a constant;

s3-8, according to the formula:

UCL_i＝start_i+k_iΔ_i

The specific method of step S4 includes the following sub-steps:

S4-2, according to the formula:

Further obtaining a training subset corresponding to each SVM model; wherein

The specific method of step S5 includes the following substeps:

S5-3, according to the formula:

K(h_j(u_j),h)＝exp(-γ||h_j(u_j)-h||²)

s5-4, according to the formula:

The specific method of step S5-2 is:

for the jth SVM model, by solving an optimization problem:

0≤φ_j(u_j)≤ξ',u_j＝1,2,…,M_j

obtaining the u-th training subset corresponding to the u-th training subset_jOne sample h_j(u_j) Corresponding specific gravity coefficient phi_j(u_j) (ii) a Wherein s.t. represents a constraint; h is_j(u_i) Representing the u th training subset corresponding to the j th SVM model_iA sample is obtained; label_j(u_i) Is a sample h_j(u_i) A tag value of (a); xi 'is a penalty parameter, xi'>0；

Randomly selecting a sample h corresponding to the specific gravity coefficient larger than 0 and smaller than xi_j(u_m) And sample h_j(u_m) Label of (1)_j(u_m) And according to the formula:

obtaining an intercept coefficient tau of a jth SVM model_j。

The specific method of step S8 includes the following substeps:

Wherein

Representing the test sample x_newOutput on the ith ICA model;

representing the test sample x_newAn output on the jth SVM model;

s8-2, according to the formula:

P_q(x_new|N)＝exp(-v_new(q))

P_q(x_new|F)＝exp(-1/v_new(q))

s8-3, according to the formula:

P_q(x_new)＝P_q(x_new|N)α+P_q(x_new|F)(1-α)

s8-4, according to the formula:

S8-5, according to the formula:

obtaining a test sample x_newIntegrated probability value P of_new(ii) a Wherein Q_SFor all ICA model outputs and the number of SVM model outputs, i.e. the output matrix v_newTotal number of middle elements.

In a specific implementation, the threshold in step S9 is 1- α, where α is the confidence in the control limits in the ICA model. The number of constructed ICA models and SVM models is 3.

In one embodiment of the present invention, a model of the U.S. Tennessee Eastman (TE) chemical process is used, which is taken from a real chemical process. The TE chemical process comprises five main units: the reactor, the condenser, the compressor, the separator and the stripping tower are widely applied to the research of various fault detection and diagnosis methods due to the fact that internal mechanisms of the reactor, the condenser, the compressor, the separator and the stripping tower are complex. The whole TE chemical process mainly comprises 22 continuous process measurement variables, 19 composition measurement variables and 12 operation variables, and can simulate normal working conditions and 21 fault working conditions. The data set in this embodiment is a data set that is disclosed and simulated by the american Institute of Technology (MIT), and is divided into a training set and a test set, where the two portions each include a sample set for each operating condition, and each sample is characterized by 52 dimensions. In the training set, there are 500 samples in the normal case sample set and 480 samples in the fault case sample set. In the test set, there are 960 samples for each case of the sample set, but because the fault was introduced after 160 normal conditions, the first 160 samples in the sample set of fault conditions belong to normal condition samples. In the embodiment, the training set part in the data set is used as the training set of the embodiment to train the model. The test set part in the data set is selected as the test set of the embodiment to test the effect of the fault detection method based on the ICA and SVM integration method.

In this embodiment, 3 ICA models and 3 SVM models are constructed, and thus, their individual detection results will be used for comparison with the detection results of the ICA and SVM integration method. Table 1 is a failure detection result recall rate statistical table of each model in this embodiment. Wherein ICA₁、ICA₂、ICA₃Respectively representing No. 1, 2 and 3 ICA models, and judging in such a way that the statistic value of the sample exceeds the control limit, the model is regarded as a fault, and the SVM model₁、SVM₂、 SVM₃Respectively representing the 1 st SVM model, the 2 nd SVM model and the 3 rd SVM model, and judging the mode that the hyperplane equation value of the sample exceeds 0 to be regarded as a fault.

Table 1: statistical table of fault detection result recall rate of each model

As the higher the recall value of the fault indicates the better the detection effect on the fault, it can be seen from table 1 that the detection effect of the method on the fault detection of the industrial production system is better than that of the prior art. Table 2 is a statistical table of the comparison result of the false alarm rate between the method of the present embodiment and the prior art.

Table 2: false alarm rate comparison result statistical table of method

Method

ICA₁

ICA₂

ICA₃

SVM₁

SVM₂

SVM₃

Method for producing a composite material

False alarm rate

0.37％

0.67％

0.39％

2.31％

2.34％

2.13％

1.01％

The lower the false alarm rate is, the less possibility of false alarm to the normal working condition is. As can be seen from Table 2, the method better integrates the false alarm conditions of the ICA and SVM models, thereby obtaining a smaller and moderate false alarm value. The method has good identification effect on the normal working condition of the industrial production system, and the normal production process is not easily influenced. Therefore, the method achieves excellent effects in terms of both the false alarm rate and the fault detection rate.

In conclusion, the ICA model with the feature extraction capability and the SVM model with the classification capability are integrated by Bayesian inference, and different advantages of different models are utilized, so that the detection accuracy is improved, and the detection effect is more stable.

Claims

1. A fault detection method based on ICA and SVM integrated learning is characterized by comprising the following steps:

s5, constructing a corresponding SVM model based on each training subset;

2. The fault detection method based on the ICA and SVM ensemble learning of claim 1, wherein the specific method of step S1 is:

3. The fault detection method based on the ICA and SVM integrated learning of claim 1, wherein the specific method of the step S2 is:

4. The fault detection method based on the ICA and SVM ensemble learning of claim 1, wherein the specific method of the step S3 comprises the following sub-steps:

s3-1, sample set under normal working condition after standardization

obtaining a whitening matrix Z;

S_i＝B_iZ

s3-3, according to the formula:

s3-5, according to the formula:

obtaining a set of normalized samples under normal conditions

Middle (r) th sample

Statistic value based on ith ICA model

Further obtaining a standardized sample set under normal working conditions

All samples in the statistical quantity combination based on ith ICA model

And obtaining a standardized sample set under normal working conditions

Probability density of

And according to the formula:

statistical combination representing ith ICA model

s3-8, according to the formula:

UCL_i＝start_i+k_iΔ_i

5. The fault detection method based on the ICA and SVM ensemble learning of claim 1, wherein the specific method of the step S4 comprises the following sub-steps:

S4-2, according to the formula:

Further obtaining a training subset corresponding to each SVM model; wherein

As a subset of samples Y_c,jNormalized results of the e sample; y is_c,j(e)(l_y) As a subset of samples Y_c,jIth sample of (e)_yAn element; mean (Y)_0,j(l_y) ) is sample subset Y under normal operating conditions_0,jOf each sample_yOf a single elementMean value; std (Y)_0,j(l_y) ) is sample subset Y under normal operating conditions_0,jOf each sample_yStandard deviation of individual elements.

6. The fault detection method based on the ICA and SVM ensemble learning of claim 1, wherein the specific method of the step S5 comprises the following sub-steps:

S5-3, according to the formula:

K(h_j(u_j),h)＝exp(-γ||h_j(u_j)-h||²)

s5-4, according to the formula:

7. The fault detection method based on ICA and SVM ensemble learning of claim 6, wherein the specific method of step S5-2 is:

for the jth SVM model, by solving an optimization problem:

0≤φ_j(u_j)≤ξ',u_j＝1,2,…,M_j

obtaining the u-th training subset corresponding to the u-th training subset_jOne sample h_j(u_j) Corresponding specific gravity coefficient phi_j(u_j) (ii) a Wherein s.t. represents a constraint; h is_j(u_i) Representing the u th training subset corresponding to the j th SVM model_iA sample is obtained; label_j(u_i) Is a sample h_j(u_i) The tag value of (a); xi 'is a penalty parameter, xi'>0；

obtaining the intercept coefficient tau of the jth SVM model_j。

8. The fault detection method based on the ICA and SVM ensemble learning of claim 1, wherein the specific method of the step S8 comprises the following sub-steps:

s8-1, obtaining a detection sample x_newCorresponding all ICA model outputs and SVM model outputs to obtain and detect sample x_newCorresponding output matrix

Wherein

Represents the detection sample x_newOutput on the ith ICA model;

represents the detection sample x_newAn output on the jth SVM model;

s8-2, according to the formula:

P_q(x_new|N)＝exp(-v_new(q))

P_q(x_new|F)＝exp(-1/v_new(q))

s8-3, according to the formula:

P_q(x_new)＝P_q(x_new|N)α+P_q(x_new|F)(1-α)

s8-4, according to the formula:

respectively obtaining detection samples x_newPosterior probability P of q-th value under normal working condition_q(N|x_new) And posterior probability P under fault conditions_q(F|x_new)；

S8-5, according to the formula:

9. The fault detection method based on ICA and SVM integrated learning of claim 1, wherein the threshold in the step S9 is 1- α, where α is the confidence of the control limit in the ICA model.

10. The fault detection method based on ICA and SVM ensemble learning of claim 1, wherein the number of constructions of ICA models and SVM models is 3.