CN109407652B

CN109407652B - Multivariable industrial process fault detection method based on main and auxiliary PCA models

Info

Publication number: CN109407652B
Application number: CN201811503665.9A
Authority: CN
Inventors: 邓晓刚; 邓佳伟; 曹玉苹
Original assignee: China University of Petroleum East China
Current assignee: China University of Petroleum East China
Priority date: 2018-12-10
Filing date: 2018-12-10
Publication date: 2020-03-06
Anticipated expiration: 2038-12-10
Also published as: CN109407652A

Abstract

The invention relates to a multivariable industrial process fault detection method based on a main and auxiliary PCA model, which comprises the following steps: carrying out standardization processing on the normal data set and the prior fault data set; establishing a PCA model for a normal data set as a main monitoring model, calculating relative mutual information of prior faults and normal data, grouping variables by means of generalized Dice, establishing the PCA model for the grouped data set as an auxiliary monitoring model, standardizing a test data set, projecting the test data set to the main monitoring model and the auxiliary monitoring model respectively, calculating statistics of the test data set projected to the main monitoring model and the auxiliary monitoring model, integrating information of a variable group by applying Bayesian theory to obtain total monitoring statistics, and judging whether the test data set fails or not according to whether the monitoring statistics exceeds a control limit. The invention not only effectively reduces the omission and waste of part of important prior fault information, but also improves the fault detection rate and the fault detection performance by mining the variable local information in variable groups.

Description

Multivariable industrial process fault detection method based on main and auxiliary PCA models

Technical Field

The invention belongs to the technical field of industrial process fault detection, and relates to a multivariable industrial process fault detection method based on a Primary Assisted Principal Component Analysis (PA-PCA).

Background

Due to the increasing complexity of modern industrial systems, people pay more attention to process safety and product quality, and the position of fault diagnosis in industrial production is more and more important. With the development of storage technology, mass production process data is collected and recorded. Therefore, the data-driven fault diagnosis method is widely used. Classical fault detection methods include Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Fisher Discriminant Analysis (FDA). The PCA method has been a hot spot in the control field in recent years and is widely used by researchers, but the method still has some problems and is worthy of further study. The conventional PCA method only utilizes normal data when performing statistical modeling, ignores part of known prior fault information, causes omission and waste of part of important information, and accordingly causes reduction of fault detection performance. Therefore, how to effectively utilize the known prior fault data to mine effective information to improve the fault detection performance of the PCA has become a challenging subject.

Disclosure of Invention

The invention provides a multivariable industrial process fault detection method based on a main and auxiliary PCA model, aiming at the problems of low fault detection performance and the like caused by the fact that the traditional PCA method cannot deeply mine local information related to faults. The method can utilize prior fault information and deeply excavate variable local information, improve the fault detection rate and further improve the fault detection result.

In order to achieve the purpose, the invention provides a multivariable industrial process fault detection method based on a main and auxiliary PCA model, which comprises the following steps:

collecting normal data set X and known fault data set F in class C in historical database_cC1, 2, C as a training data set, and using the mean μ and standard deviation σ of the normal data set X for the training data sets X and F_cCarrying out standardization processing to obtain a standardized training data set

And

(II) pairs of datasets

Establishing a PCA model as a main monitoring model;

(III) calculating a relative mutual information matrix Delta R of the fault data set relative to the normal data set_c,c＝1,2,...,C；

(IV) pairs of relative mutual information matrix Delta R_cPerforming variable grouping on the process variable based on the generalized Dice coefficient to obtain a grouped data set

Wherein, B_cThe number of variable groups;

establishing a PCA model for the grouped data set as an auxiliary monitoring model;

(VI) collecting the test data set x_newTest data set X is paired with mean μ and standard deviation σ of normal data set X_newCarrying out standardization processing to obtain a standardized test data set

(VII) data set

Projecting to the main monitoring model and the auxiliary monitoring model respectively, and calculating a data set

Statistics T projected onto the master monitoring model²And SPE, data set

Statistics projected onto secondary monitoring model

And SPE_c,bStatistic T²Control limit of

Control limit SPE of statistic SPE_limStatistics of

Control limit of

And statistics SPE_c,bControl limit of [ SPE ]_c,b]_limAll calculated by nuclear density estimation;

(VIII) integrating all monitoring results to obtain total monitoring statistics

And BIC_SPEAccording to the statistics

Or statistic BIC_SPEDetermining whether a data set is exceeded by a control limit

Whether a failure has occurred.

Further, in the step (a), the training data sets X and F are processed by formula (1) using the mean μ and the standard deviation σ of the normal data set_cThe normalization process is performed, and the expression of formula (1) is:

training data sets X and F_cAfter the standardization treatment of the formula (1), a standardized training data set can be obtained

And

further, in the step (two), the training data set is subjected to

Carrying out PCA decomposition, and calculating a load matrix P of the training data set through a main monitoring model in formula (2), wherein the formula (2) is expressed as:

wherein T is a data set

E is a data set

Model residual moment ofAnd (5) arraying.

Further, in the step (III), the relative mutual information matrix Delta R_cThe calculation steps are as follows:

computing a data set by equation (3)

The mutual information matrix R, the data set is calculated by formula (4)

Mutual information matrix R_cThe formula (3) and the formula (4) are expressed as:

in the formula, m represents the number of variables, R_ijRepresenting a data set

Of the ith and jth columns, R_c,ijRepresenting a data set

The ith and jth columns of (1);

relative mutual information matrix DeltaR_cThen it is expressed as:

further, in the step (iv), the specific steps of grouping variables are as follows:

(1) defining the relative mutual information vector as:

r_i＝[ΔR_c,i1,ΔR_c,i2,…,ΔR_c,im]^T(6)

the similarity of the relative mutual information correlation degree between a certain variable and the rest variables is measured by using the generalized Dice coefficient, and is defined as follows:

in the formula, S is more than or equal to 0_i,j≤1；

Selecting to make r_iThe variable with the maximum | is taken as the first variable group and the number B of the variable groups is initialized_c＝1；

(2) Selecting the next vector r in order of variables_jWhere j ≠ i and j ≦ m, and calculates the vector r by equation (8)_jThe mean of the similarity to each vector in the set of known variables, equation (8), is expressed as: :

wherein b represents the b-th variable group, n_bRepresenting the number of variables in the b-th variable group;

(3) determining

The maximum value in the vector is judged whether the value exceeds the threshold value gamma, if the value exceeds the threshold value gamma, the variable x corresponding to the vector is judged_jIs divided into variable group b; conversely, variable x_jForm a new variable group, i.e. B_c＝B_c+1；

(4) Repeating the steps (2) and (3) until all variables are grouped, i.e.

Further, in the step (five), the data set after the variables are grouped

Carrying out PCA decomposition, and calculating a data set after variable grouping through an auxiliary monitoring model in a formula (9)

Load matrix P_c,bThe formula (9) is expressed as:

in the formula, T_c,bAs a data set

Score matrix of, E_c,bAs a data set

The model residual matrix of (2).

Further, in the step (six), the test data set X is processed by the formula (10) using the mean μ and the standard deviation σ of the normal data set X_newAnd (3) carrying out normalization processing, wherein the expression of the formula (10) is as follows:

test data set x_newAfter the standardization treatment of the formula (10), a standardized test data set can be obtained

Further, in step (seven), the data set is calculated by formula (11) and formula (12)

Statistics T projected onto the master monitoring model²And SPE, formula (11) and formula (12) are expressed as:

in the formula, sigma represents a diagonal matrix formed by characteristic values of a main monitoring model;

computing a data set by equation (13) and equation (14)

Statistics projected onto secondary monitoring model

And SPE_c,bEquation (13) and equation (14) are expressed as:

in the formula, sigma_c,bA diagonal matrix formed by characteristic values of the auxiliary monitoring model is represented,

indicating obtained from type c fault information

Group b variables.

Further, in the step (eight), a Bayesian reasoning is adopted to integrate all monitoring results, and the specific steps are as follows:

defining a sample

The probability of failure at the b-th statistic is:

in the formula, S represents a statistic T²Statistic SPE, statistic

And statistics SPE_c,b，

The posterior probability of a sample failure is represented,

representing the posterior probability under normal conditions, and respectively solving through a formula (16) and a formula (17)

Equations (16) and (17) are expressed as:

in the formula, S_limRepresentation statistic T²Statistic SPE, statistic

And statistics SPE_c,bIf p (f) is the confidence level α, then p (n) is 1- α, and the total monitoring statistic obtained by fusing all monitoring results is:

further, in the step (eight), the total monitoring statistics after fusion is used

Or total monitoring statistic BIC_SPEDetermining whether a data set exceeds a control limit

Whether it is failure data; when in use

Or BIC_SPEIf the value is more than 0.01, the process is considered to have a fault; otherwise, no fault is considered to occur in the process.

Compared with the prior art, the invention has the beneficial effects that:

according to the multivariate industrial process fault detection method provided by the invention, the difference of the structure change of the correlation relationship between the variables caused by the occurrence of the fault is measured by calculating the relative mutual information of the prior fault and the normal data, the variables are grouped by means of the generalized Dice, the known prior fault information can be fully utilized, the waste and omission of useful fault information can be avoided as much as possible, and the local information of the variables can be extracted by grouping the variables; on the basis, a PCA model is established for a normal data set containing all variables as a main monitoring model and a PCA sub-model is established for data sets of different variable groups as an auxiliary monitoring model, Bayesian reasoning is applied to integrate information of the variable groups to obtain total monitoring statistics, whether the test data set fails or not is judged according to whether the monitoring statistics exceeds a control limit, whether the test data set fails or not is judged according to the fused statistics, and then a failure detection result is improved, and the failure detection rate is improved.

Drawings

FIG. 1 is a flow chart of a multivariate industrial process fault detection method based on primary and secondary PCA models of the present invention;

FIG. 2 is a block diagram of a CSTR control system according to an embodiment of the present invention;

FIG. 3a is a mutual information comparison graph of normal test data and standard normal data in a CSTR control system by using the multi-variable industrial process fault detection method based on the primary and secondary PCA models according to the embodiment of the present invention;

FIG. 3b is a diagram showing the mutual information comparison between the fault 1 and the standard normal data in the CSTR control system by using the multivariate industrial process fault detection method based on the primary and secondary PCA models according to the embodiment of the present invention;

FIG. 3c is a diagram showing the mutual information comparison between the median fault 4 and the standard normal data in the multivariate industrial process fault detection method based on the primary and secondary PCA models according to the embodiment of the present invention;

FIG. 4a is a schematic diagram of a result of a prior fault information variable grouping for a CSTR control system using a fault 1 by using the multivariate industrial process fault detection method based on primary and secondary PCA models according to the embodiment of the present invention;

FIG. 4b is a schematic diagram showing a result of grouping prior fault information variables of a fault 4 for a CSTR control system by the multivariate industrial process fault detection method based on primary and secondary PCA models according to the embodiment of the present invention;

FIG. 5a is a schematic diagram of the monitoring result of CSTR control system fault 3 by using the existing PCA method according to the embodiment of the present invention;

FIG. 5b is a schematic diagram showing the monitoring result of the fault 3 of the CSTR control system by using the multivariate industrial process fault detection method based on the primary and secondary PCA models according to the embodiment of the present invention;

FIG. 6a is a schematic diagram of the monitoring result of CSTR control system fault 6 by using the existing PCA method according to the embodiment of the present invention;

fig. 6b is a schematic diagram of a monitoring result of a fault 6 of the CSTR control system by using the multivariate industrial process fault detection method based on the primary and secondary PCA models according to the embodiment of the present invention.

Detailed Description

The invention is described in detail below by way of exemplary embodiments. It should be understood, however, that elements, structures and features of one embodiment may be beneficially incorporated in other embodiments without further recitation.

Referring to fig. 1, the invention discloses a multivariate industrial process fault detection method based on primary and secondary PCA models, comprising the following steps:

collecting normal data set X and known fault data set F in class C in historical database_cC is taken as a training data set, and the training data sets X and F are subjected to formula (1) by using the mean value μ and the standard deviation σ of the normal data set_cThe normalization process is performed, and the expression of formula (1) is:

And

(II) pairs of datasets

Establishing a PCA model as a main monitoring model; the method specifically comprises the following steps: for data sets

wherein T is a data set

E is a data set

The model residual matrix of (2).

(III) calculating a relative mutual information matrix Delta R of the fault data set relative to the normal data set_cC is 1,2,. cndot.c; the method comprises the following specific steps of;

computing a data set by equation (3)

The mutual information matrix R, the data set is calculated by formula (4)

Of the ith and jth columns, R_c,ijRepresenting a data set

The ith and jth columns of (1);

relative mutual information matrix DeltaR_cThen it is expressed as:

since the mutual information of the variables caused by different faults is different, the difference between the mutual information of the fault data set and the reference is respectively measured by taking the mutual information matrix R of the normal data set as the reference, so that different variable grouping results can be obtained. In the relative mutual information, each row represents the difference of mutual information change between a certain variable and all variables, and if the change differences of the two variables are similar, the change of the correlation structure between the variables caused by the occurrence of a fault is similar, so that the two variables can be divided into the same variable group.

Wherein, B_cThe number of variable groups;

the specific steps for grouping variables are as follows:

(1) defining the relative mutual information vector as:

r_i＝[ΔR_c,i1,ΔR_c,i2,…,ΔR_c,im]^T(6)

in the formula, S is more than or equal to 0_i,j≤1；S_i,jThe closer the value is to 1, the stronger the similarity of the two vectors is, the similar the change of the correlation structure between the variables caused by the fault is, the two variables have a certain internal relationship, and the two variables are divided into the same variable group;

(3) determining

(4) Repeating the steps (2) and (3) until all variables are grouped, i.e.

According to the invention, the variables with the number less than or equal to 2 in the variable group are synthesized into one variable group in consideration of the complexity of operation. The variable grouping method can effectively utilize the known prior fault information, reduces the waste amount of the known fault information, can further mine the local information of the variables, and is more favorable for improving the detection performance of the fault. In the step, different variable grouping results can be obtained by using different prior fault information.

Establishing a PCA model for the grouped data set as an auxiliary monitoring model; the method specifically comprises the following steps: data set after grouping variables

Load matrix P_c,bThe formula (9) is expressed as:

in the formula, T_c,bAs a data set

Score matrix of, E_c,bAs a data set

The model residual matrix of (2).

(VI) collecting the test data set x_newThe test data set X is subjected to equation (10) using the mean μ and standard deviation σ of the normal data set X_newAnd (3) carrying out normalization processing, wherein the expression of the formula (10) is as follows:

(VII) data set

Respectively projecting to the main monitoring model and the auxiliary monitoring model; calculating a data set by equation (11) and equation (12)

computing a data set by equation (13) and equation (14)

Statistics projected onto secondary monitoring model

And SPE_c,bEquation (13) and equation (14) are expressed as:

indicating obtained from type c fault information

Group b variables;

computing a respective statistic T by kernel density estimation²Control limit of

Control limit SPE of statistic SPE_limStatistics of

Control limit of

And statistics SPE_c,bControl limit of [ SPE ]_c,b]_lim。

(VIII) integrating all monitoring results by adopting Bayesian inference to obtain total monitoring statistics

And BIC_SPEThe method comprises the following specific steps:

defining a sample

The probability of failure at the b-th statistic is:

in the formula, S represents a statistic T²Statistic SPE, statistic

And statistics SPE_c,b，

The posterior probability of a sample failure is represented,

And

equations (16) and (17) are expressed as:

in the formula, S_limRepresentation statistic T²Statistic SPE, statistic

according to the fused total monitoring statistics

Whether it is failure data; when in use

In the method, the steps (I) to (V) are off-line modeling stages, and the steps (VI) to (eighth) are on-line testing stages.

According to the fault detection method, on one hand, a PCA model is established by using normal process data and is used as a main monitoring model, on the other hand, variables are grouped according to relative mutual information between the normal process data and the fault data, then the PCA model is established aiming at prior fault information and is used as an auxiliary monitoring model, and the results of the main monitoring model and the auxiliary monitoring model are fused to monitor process changes. The prior fault information can be utilized, variable local information can be deeply excavated, waste and omission of useful fault information are reduced, the fault detection rate is improved, and the fault detection result is improved.

In order to more clearly illustrate the beneficial effects of the above-mentioned fault detection method of the present invention, the following further describes the above-mentioned fault detection method of the present invention with reference to the following embodiments.

Example (b): a control system of a Continuous Stirred Tank Reactor (CSTR) is used as a chemical reactor, has the advantages of low cost, strong heat exchange capability, stable product quality and the like, and is widely applied to industrial process reaction. During the reaction, the reactant A undergoes a first-order irreversible exothermic reaction in the reactor, with the formation of substance B. 10 variables were measured in the CSTR control system, including 4 state variables and 6 input variables, the variables are detailed in Table 1.

TABLE 1

Variables of	Description of the invention
		C_a	Concentration of reactant A flowing out of the reaction kettle
T	Temperature of the reaction vessel
		T_c	Temperature of jacket outlet coolant
h	Height of liquid level in reaction kettle
		Q	Concentration of the effluent from the reactor
Q_c	Flow of coolant in jacket
		Q_f	Flow rate of feed A
C_af	Concentration of feed A to the reactor
		T_f	Temperature of feed A
T_cf	Jacket inlet coolant temperature

In the above CSTR control system simulation, 1000 normal data are collected as a training set, and 6 kinds of fault data in table 2 are generated, each fault contains 1000 samples, and each fault is added from the 161 st sampling point.

TABLE 2

Fault of	Description of the invention
		1	Step change of feed flow
2	Feed concentration ramping
		3	Reduction of the activity of the catalyst
4	Decrease of heat exchange rate
		5	Deviation of the reactor temperature sensor
6	Deviation of cooling water temperature sensor

The CSTR control system of the embodiment is subjected to fault detection by adopting the fault detection method (hereinafter referred to as PA-PCA method). And after the fault is detected, comparing fault detection results of different methods through a fault detection rate FDR index in order to evaluate the fault detection performance of different fault detection methods. The failure detection rate FDR is defined as the percentage of the number of pieces of failure data that can be detected to the total number of pieces of failure data. Obviously, the larger the value of the FDR is, the better the fault detection effect of the fault detection method of the industrial process is; on the contrary, the worse the fault detection effect of the industrial process fault detection method.

In the simulation of the CSTR control system of this embodiment, the PCA method and the PA-PCA method of the present invention are used to monitor the process variation. Two different types of information, namely fault 1 (step fault) and fault 4 (slope fault), are selected as prior fault information. In both methods, the number of principal elements is selected according to the variance contribution rate of 80%, the threshold gamma for dividing the variable group is set to be 0.65, and the confidence of 99% is used for calculating the control limit of each method. The effect of fault detection is explained by taking fault 3 and fault 6 as examples.

Fig. 3a shows a mutual information comparison diagram of normal test data and standard normal data, fig. 3b shows a mutual information comparison diagram of a fault 1 in the CSTR control system and standard normal data, and fig. 3c shows a mutual information comparison diagram of a fault 4 in the CSTR control system and standard normal data. In fig. 3a-3c, the mutual information between the variable 1 and the remaining variables is shown. As can be seen from FIG. 3a, the mutual information of the two different normal data sets is substantially overlapped, which indicates that the structure of the correlation relationship between the variables in the process data is not substantially changed under normal operating conditions. It can be seen from fig. 3b and 3c that there is a large difference between the mutual information of two different faults and the mutual information of the standard normal data set, which indicates that the correlation structure between the variables in the process data changes under abnormal conditions, and this also verifies the necessity of the present invention in consideration of the prior fault information.

The malfunction 3 is caused by the change in the activity of the catalyst in the form of a ramp. Fig. 4a shows a diagram of the grouping result of the prior information variable using the fault 1, and fig. 4b shows a diagram of the grouping result of the prior information variable using the fault 4. As can be seen from fig. 4a and 4b, different variable grouping results can be obtained by using different a priori fault information. The PCA method and the PA-PCA method of the invention have a fault monitoring diagram as shown in fig. 5. According to FIG. 5a, T of the PCA method²And SPE statistics respectively give out alarm signals at 760 th sampling time and 639 th sampling time, the fault detection rates of the two statistics are respectively 32.02% and 39.88%, and the fault detection rate is low. In fig. 5b, the two statistics of the PA-PCA method can be alarmed 285 times and 106 times earlier than the conventional PCA method, and the failure detection rates are 46.43% and 58.81%, respectively, which improves the monitoring performance compared with the conventional PCA method.

The failure 6 is caused by a deviation of the cooling water temperature sensor. The monitoring of this fault by both methods is illustrated in fig. 6a and 6 b. As can be seen from fig. 6a, although the two statistics of the PCA method can detect the fault at the 413 th and 239 th sampling moments, the statistics fluctuate on the control line, which makes most of the statistics under the control line, and the fault detection rate is only 26.07% and 40.6%. In contrast, although the monitoring performance of the SPE statistic in the PA-PCA method is basically consistent with that of the traditional PCA method, the detection time is advanced by 1, and the fault detection rate is 43.45%, the T of the PA-PCA method is²The statistic can give an alarm signal at the 161 th sampling moment in time, compared with the T of the PCA method²The statistic is advanced 252 times, and the fault detection rate is high and is 77.5%, and the monitoring performance is improved, as shown in fig. 6 b. Therefore, the PA-PCA method provided by the invention can improve the fault detection performance of the CSTR control system fault 6.

Table 3 shows the failure detection rates for 6 failures of the CSTR control system for the PCA method and the PA-PCA method of the present invention.

TABLE 3

As can be seen from table 3, the PA-PCA method of the present invention has the best monitoring effect on 6 faults, has the highest average fault detection rate, and particularly, the monitoring performance on

faults

3 and 6 is improved more significantly. By combining the analysis, the fault detection effect of the PA-PCA method is superior to that of the traditional PCA method.

The above-mentioned embodiments are merely provided for the convenience of illustration of the present invention, and do not limit the scope of the present invention, and various simple modifications and modifications made by those skilled in the art within the technical scope of the present invention should be included in the above-mentioned claims.

Claims

1. A multivariable industrial process fault detection method based on a main and auxiliary PCA model comprises the following steps:

And

(II) pairs of datasets

Establishing a PCA model as a main monitoring model;

(III) calculating a relative mutual information matrix Delta R of the fault data set relative to the normal data set_cC is 1,2,. cndot.c; relative mutual information matrix DeltaR_cThe calculation steps are as follows:

computing a data set by equation (3)

The mutual information matrix R, the data set is calculated by formula (4)

Of the ith and jth columns, R_c,ijRepresenting a data set

The ith and jth columns of (1);

relative mutual information matrix DeltaR_cThen it is expressed as:

Wherein, B_cThe number of variable groups;

(VII) data set

Respectively projecting to the main monitoring model and the auxiliary monitoring model,and calculating a data set

Statistics T projected onto the master monitoring model²And SPE, data set

Statistics projected onto secondary monitoring model

And SPE_c,bSeparately calculating the statistic T by kernel density estimation²Control limit of

Control limit SPE of statistic SPE_limStatistics of

Control limit of

And statistics SPE_c,bControl limit of [ SPE ]_c,b]_lim；

And BIC_SPEAccording to the statistics

Whether a fault occurs; the specific steps of integrating all monitoring results by adopting Bayesian inference are as follows:

defining a sample

The probability of failure at the b-th statistic is:

in the formula, S represents a statistic T²Statistic SPE, statistic

And statistics SPE_c,b，

The posterior probability of a sample failure is represented,

And

equations (16) and (17) are expressed as:

in the formula, S_limRepresentation statistic T²Statistic SPE, statistic

And statistics SPE_c,bThe corresponding control limit, p (f) is confidence level α, then p (n) 1- α, and all of them are fusedThe total monitoring statistic obtained by the monitoring result is as follows:

2. the multivariate industrial process fault detection method based on primary and secondary PCA models as claimed in claim 1, wherein in the step (one), training data sets X and F are processed by formula (1) using the mean μ and standard deviation σ of the normal data set_cThe normalization process is performed, and the expression of formula (1) is:

And

3. the multivariate industrial process fault detection method based on primary and secondary PCA models as claimed in claim 2, wherein in step (two), the training data set is compared

wherein T is a data set

E is a data set

The model residual matrix of (2).

4. The multivariate industrial process fault detection method based on the primary and secondary PCA models as claimed in claim 1, wherein in the step (IV), the specific steps of performing variable grouping are as follows:

(1) defining the relative mutual information vector as:

r_i＝[ΔR_c,i1,ΔR_c,i2,…,ΔR_c,im]^T(6)

in the formula, S is more than or equal to 0_i,j≤1；

(2) Selecting the next vector r in order of variables_jWhere j ≠ i and j ≦ m, and calculates the vector r by equation (8)_jThe mean of the similarity to each vector in the set of known variables, equation (8), is expressed as:

(3) determining

(4) Repeating the steps (2) and (3) until all variables are grouped, i.e.

5. The multivariate industrial process fault detection method based on primary and secondary PCA models as claimed in claim 4, wherein in step (V), the data sets after variable grouping are performed

Load matrix P_c,bThe formula (9) is expressed as:

in the formula, T_c,bAs a data set

Score matrix of, E_c,bAs a data set

The model residual matrix of (2).

6. The multivariate industrial process fault detection method based on primary and secondary PCA models as claimed in claim 5, wherein in step (six), the test data set X is subjected to formula (10) by using the mean μ and standard deviation σ of the normal data set X_newAnd (3) carrying out normalization processing, wherein the expression of the formula (10) is as follows:

7. The multivariate industrial process fault detection method based on primary and secondary PCA models as claimed in claim 6, wherein in step (seventy), the dataset is calculated by formula (11) and formula (12)

computing a data set by equation (13) and equation (14)

Statistics projected onto secondary monitoring model

And SPE_c,bEquation (13) and equation (14) are expressed as:

indicating obtained from type c fault information

Group b variables.

8. The multivariate industrial process fault detection method based on primary and secondary PCA models as claimed in claim 1, wherein in step (eight), the total monitoring statistics after fusion are based on

Whether it is failure data; when in use