CN111880090A

CN111880090A - Distribution layered online fault detection method for million-kilowatt ultra-supercritical unit

Info

Publication number: CN111880090A
Application number: CN201910579518.8A
Authority: CN
Inventors: 赵春晖; 张淑美
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2019-06-28
Filing date: 2019-06-28
Publication date: 2020-11-03
Anticipated expiration: 2039-06-28
Also published as: CN111880090B

Abstract

本发明公开了一种面向百万千瓦超超临界机组的分布分层式在线故障检测方法。针对百万千瓦超超临界机组过程变量众多、变化工况复杂的问题，综合考虑变量间的相关关系以及变量在样本方向的分布情况，运用多层信息理论分解方法对变量进行分块，基于分块结果，结合高斯混合模型方法与贝叶斯理论，建立面向百万千万超超临界机组的多层分布式监测算法。该方法充分发掘了过程变量间的相关信息，有利于对百万千瓦超超临界机组复杂过程特性的了解，多层分布式的监测方法既可以有效挖掘过程的局部信息，又可以分析不同变量子块之间的相关关系，大大提高了百万千瓦超超临界机组这一复杂过程的故障检测性能，从而保证了大型燃煤发电机组的安全可靠运行。The invention discloses a distributed layered on-line fault detection method for a million-kilowatt ultra-supercritical unit. Aiming at the problems of numerous process variables and complex working conditions of mega-kilowatt ultra-supercritical units, comprehensively considering the correlation between variables and the distribution of variables in the sample direction, the multi-layer information theory decomposition method is used to divide the variables into blocks. Based on the block results, combined with the Gaussian mixture model method and Bayesian theory, a multi-layer distributed monitoring algorithm for millions of ultra-supercritical units is established. This method fully explores the relevant information among the process variables, which is beneficial to the understanding of the complex process characteristics of the million-kilowatt ultra-supercritical unit. The multi-layer distributed monitoring method can not only effectively mine the local information of the process, but also analyze the different variables. The correlation between the blocks greatly improves the fault detection performance of the complex process of the million-kilowatt ultra-supercritical unit, thereby ensuring the safe and reliable operation of large coal-fired generating units.

Description

A distributed layered online fault detection for million kilowatt ultra-supercritical units method

技术领域technical field

本发明属于火电过程故障检测领域，特别是针对以重面向变量众多且工况频繁波动的百万千瓦超超临界机组的分布分层式在线过程监测方法。The invention belongs to the field of thermal power process fault detection, in particular to a distributed layered on-line process monitoring method for a million-kilowatt ultra-supercritical unit with numerous variables and frequent fluctuations in operating conditions.

背景技术Background technique

电力工业是国民经济的重要基础产业，是国家经济发展战略中的重点项目。随着经济高速发展，电力需求也迅速增长。而煤炭资源是我国的主要能源，因此在未来相当长的时期内以煤为主的能源结构难以得到根本改变。作为中国的主力电源，燃煤发电装机容量始终在70％以上。据统计，在旺盛的用电需求推动下，2018年1-8月全社会用电量累计高达45296亿千瓦时，同比增长9.0％。其中，火电累计发电量为33103亿千瓦时，约占全国总发电量的73.1％，同比增长7.2％。近年来，为实现电力可持续发展，火力发电行业积极开展结构调整，“上大压小”，以大容量、低能耗的超(超)临界机组取代高能耗小火电机组，基本形成了以百万千瓦超超临界机组等大型燃煤发电机组为主体的电力能源结构。因此，针对百万千瓦超超临界机组的分析研究具有重大实际意义和应用价值。Electric power industry is an important basic industry of the national economy and a key project in the national economic development strategy. With the rapid economic development, the demand for electricity has also increased rapidly. Coal resources are the main energy in my country, so it is difficult to fundamentally change the coal-dominated energy structure for a long period of time in the future. As China's main power source, the installed capacity of coal-fired power generation has always been above 70%. According to statistics, driven by the strong demand for electricity, the total electricity consumption of the whole society from January to August 2018 reached 4,529.6 billion kWh, a year-on-year increase of 9.0%. Among them, the cumulative power generation of thermal power was 3,310.3 billion kWh, accounting for about 73.1% of the country's total power generation, a year-on-year increase of 7.2%. In recent years, in order to realize the sustainable development of electric power, the thermal power industry has actively carried out structural adjustment, "superpowering the large and suppressing the small", replacing the high-energy consumption small thermal power units with large-capacity, low-energy-consumption ultra (super) critical units, basically forming hundreds of The power energy structure is mainly composed of large-scale coal-fired generating units such as 10,000-kilowatt ultra-supercritical units. Therefore, the analysis and research on the million kilowatt ultra-supercritical unit has great practical significance and application value.

与传统发电机组相比，百万千瓦超超临界机组规模庞大、设备多样、参数众多且相互影响，而且整个发电过程工业流程长、单元装置多、空间分布广、安全要求高，这些都给百万千瓦超超临界机组的状态监测与故障诊断带来了困难。另外，由于环境条件、燃料特性和负荷大小等原因的不同，百万千瓦超超临界机组可能运行在不同的工况条件下。特别是近年来由于风电、光电等新能源并网造成的电网负荷波动、峰谷差加大以及用户侧需求的变化，导致机组出现频繁深度调峰等新常态，机组常处于不同工况切换的全工况运行模式。而且大型燃煤发电过程环境复杂，且工业流程长，即使在同一工况下，许多变量仍呈现出不同的数据分布特性。这些均为大型燃煤发电机组的故障检测与诊断提出了极大的挑战。Compared with traditional generator sets, the mega-kilowatt ultra-supercritical unit has a large scale, diverse equipment, numerous parameters and mutual influences, and the entire power generation process has a long industrial process, many unit devices, wide space distribution, and high safety requirements. Condition monitoring and fault diagnosis of 10,000-kilowatt ultra-supercritical units have brought difficulties. In addition, due to different environmental conditions, fuel characteristics and load sizes, the 1000MW ultra-supercritical unit may operate under different working conditions. Especially in recent years, due to grid load fluctuations, increased peak-to-valley difference, and changes in user-side demand caused by the integration of new energy sources such as wind power and photovoltaics, the new normals such as frequent and deep peak shaving have led to the occurrence of frequent and deep peak shaving. full operating mode. In addition, the large-scale coal-fired power generation process has a complex environment and a long industrial process. Even under the same working conditions, many variables still show different data distribution characteristics. All these pose great challenges to the fault detection and diagnosis of large coal-fired generating units.

针对火力发电机组故障检测的问题，前人对此已从不同的角度做了相应的研究与探讨，提出了相应的在线过程监测方法。然而，目前现有的方法大多主要是集中式单工况的监测方法。面对百万千瓦超超临界机组流程长、变量众多、相关关系复杂、动态工况的特点，集中式单工况的监测方法无法得到很好的监测效果。本发明的内容深入考虑了百万千瓦超超临界机组众多变量间的复杂相关关系以及变量在样本方向的多分布情况，提出了一种新的面向百万千瓦超超临界机组的多层分布式在线故障检测方法。Aiming at the problem of fault detection of thermal power generating units, predecessors have done corresponding researches and discussions from different angles, and put forward corresponding online process monitoring methods. However, most of the existing methods are mainly centralized monitoring methods for a single working condition. Faced with the characteristics of long process flow, numerous variables, complex correlations, and dynamic working conditions of mega-kilowatt ultra-supercritical units, the centralized monitoring method of single working condition cannot obtain a good monitoring effect. The content of the present invention deeply considers the complex correlation among many variables of the million-kilowatt ultra-supercritical unit and the multi-distribution of variables in the sample direction, and proposes a new multi-layer distributed system for the million-kilowatt ultra-supercritical unit. Online fault detection method.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于针对现有的百万千瓦超超临界机组故障检测方法无法准确描述局部信息的问题，提出一种面向百万千万超超临界机组的多层分布式监测算法。该方法综合考虑变量间的相关关系以及变量在样本方向的分布情况，运用多层信息理论分解方法对变量进行分块，充分发掘了过程变量间的过程信息，有利于对百万千瓦超超临界机组复杂过程特性的了解。多层分布式的监测方法既可以有效挖掘过程的局部信息，又可以分析不同变量子块之间的相关关系，大大提高了百万千瓦超超临界机组这一复杂过程的故障检测性能，从而保证了大型燃煤发电机组的安全可靠运行。The purpose of the present invention is to propose a multi-layer distributed monitoring algorithm for 10,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000 ultra-supercritical ultra-supercritical unit fault detection methods that the local information cannot be accurately described. The method comprehensively considers the correlation between variables and the distribution of variables in the sample direction, and uses the multi-layer information theory decomposition method to divide the variables into blocks, fully exploring the process information between the process variables, which is conducive to the analysis of the million kilowatt ultra-supercritical Knowledge of complex process characteristics of units. The multi-layer distributed monitoring method can not only effectively mine the local information of the process, but also analyze the correlation between different variable sub-blocks, which greatly improves the fault detection performance of the complex process of the million-kilowatt ultra-supercritical unit, thereby ensuring Safe and reliable operation of large coal-fired generating units.

本发明的目的通过以下技术方案实现：一种面向百万千瓦超超临界机组的分布分层式在线故障检测方法，该方法包括以下步骤：The object of the present invention is achieved through the following technical solutions: a distributed layered online fault detection method for a million kilowatt ultra-supercritical unit, the method comprises the following steps:

(1)获取正常待分析数据：设一个百万千瓦超超临界机组具有J个测量变量和操作变量，每一次采样可以得到一个1×J的向量，采样N次后获取的数据表述为一个二维矩阵X＝[X₁,X₂,...,X_J]∈R^N×J，其中所述测量变量为机组正常运行过程中可被测量的状态参数，包括流量、电压、电流、温度、速率等；所述操作变量包括进风量、给料量、阀门开度等；(1) Obtaining normal data to be analyzed: Suppose a million kilowatt ultra-supercritical unit has J measurement variables and operating variables, each sampling can obtain a 1 × J vector, and the data obtained after N sampling is expressed as a two Dimensional matrix X=[X ₁ , X ₂ ,...,X _J ]∈R ^N×J , wherein the measured variables are the state parameters that can be measured during the normal operation of the unit, including flow, voltage, current, temperature , speed, etc.; the operating variables include air inlet volume, feeding volume, valve opening, etc.;

(2)利用基于互信息的谱聚类方法将过程变量分为不同的子块，同一子块中的变量具有较强的相关关系，不同子块间相关关系较弱。该步骤由以下子步骤来实现：(2) Using the spectral clustering method based on mutual information, the process variables are divided into different sub-blocks, the variables in the same sub-block have strong correlation, and the correlation between different sub-blocks is weak. This step is implemented by the following sub-steps:

(2.1)求取变量间的互信息：(2.1) Find the mutual information between variables:

I(X_i,X_j)＝H(X_i)+H(X_j)-H(X_i,X_j) (1)I(X _i ,X _j )=H(X _i )+H(X _j )-H(X _i ,X _j ) (1)

其中，X_i(i＝1,2,...,J)表示第i个变量，H(X_i)为变量X_i的信息熵：Among them, X _i (i=1,2,...,J) represents the ith variable, and H(X _i ) is the information entropy of variable X _i :

H(X_i)＝-∫_xp(X_i)logp(X_i)dx (2)H(X _i )=-∫ _x p(X _i )logp(X _i )dx (2)

H(X_i,X_j)为变量X_i和X_j的联合信息熵：H(X _i , X _j ) is the joint information entropy of variables X _i and X _j :

p(X_i)与p(X_j)表示变量X_i和X_j的概率密度函数，p(X_i,X_j)为联合概率密度函数。p(X _i ) and p(X _j ) represent the probability density functions of variables X _i and X _j , and p(X _i , X _j ) is the joint probability density function.

(2.2)基于式(1)求取的互信息，求取两两变量之间的广义相关系数：(2.2) Based on the mutual information obtained by formula (1), obtain the generalized correlation coefficient between the two variables:

其中，r_ij∈[0,1]。where r _ij ∈ [0,1].

(2.3)基于式(4)，求取变量的相关矩阵：(2.3) Based on formula (4), obtain the correlation matrix of variables:

(2.4)基于式(5)定义的相关矩阵R，求取斜对角矩阵D：(2.4) Based on the correlation matrix R defined by formula (5), obtain the diagonal diagonal matrix D:

D＝diag{D_ii} (6)D=diag{ _Dii } (6)

其中，D_ii为式(5)中第i行所有元素的和：Among them, D _ii is the sum of all elements in the i-th row in formula (5):

(2.5)求取斜对角矩阵D的拉普拉斯矩阵(2.5) Find the Laplace matrix of the diagonal diagonal matrix D

L＝D^-1/2RD^-1/2 (8)L=D ^-1/2 RD ^-1/2 (8)

(2.6)将拉普拉斯矩阵进行谱分解(2.6) Spectral decomposition of the Laplace matrix

L＝PΛP^T (9)L=PΛP ^T (9)

其中，P＝[P₁,P₂,...,P_J]为正交特征向量。Among them, P=[P ₁ , P ₂ , . . . , P _J ] are orthogonal eigenvectors.

(2.7)选择k个最大特征值对应的特征向量组成矩阵E＝[P₁,P₂,...,P_k]∈R^J×k，对矩阵E中每一行进行归一化处理，得到矩阵Y(2.7) Select the eigenvectors corresponding to the k largest eigenvalues to form a matrix E=[P ₁ , P ₂ ,...,P _k ]∈R ^J×k , and normalize each row in the matrix E to obtain matrix Y

(2.8)利用kmeans聚类算法对Y进行聚类，如果第i行属于第b类，则变量X_i划分到第b子块X_b。这样，就将百万千瓦超超临界机组的众多操作变量根据相关程度分成B个变量块。(2.8) Use kmeans clustering algorithm to cluster Y, if the i-th row belongs to the b-th class, the variable X _i is divided into the b-th sub-block X _b . In this way, many operating variables of the million kilowatt ultra-supercritical unit are divided into B variable blocks according to the degree of correlation.

X＝[X₁ X₂ … X_B] (11)X=[X ₁ X ₂ … X _B ] (11)

其中，

是第b(b＝1,2,...,B)个变量块，J_b表示X_b中包含的变量个数。in,

is the bth (b=1,2,...,B) variable block, and J _b represents the number of variables contained in X _b .

(3)利用基于高斯混合模型的信息论分解方法将变量块中的变量根据样本方向上的分布情况进一步分解，该步骤通过以下子步骤来实现：(3) Use the information theory decomposition method based on Gaussian mixture model to further decompose the variables in the variable block according to the distribution in the sample direction. This step is realized by the following sub-steps:

(3.1)将变量块随机分成W_b个子块：(3.1) The variable block is randomly divided into W _b sub-blocks:

(3.2)利用高斯混合模型方法求取第w(w＝1,2,...,W_b)个变量子块的概率密度：(3.2) Use the Gaussian mixture model method to obtain the probability density of the wth (w=1, 2,...,W _b ) variable sub-block:

其中,

是子高斯成分的个数；

是第m个子高斯成分的先验概率，满足

以及

为包含子高斯成分的均值

和协方差矩阵

的参数集。

为多元高斯概率密度：in,

is the number of sub-Gaussian components;

is the prior probability of the mth sub-Gaussian component, satisfying

as well as

is the mean with sub-Gaussian components

and covariance matrix

parameter set.

is the multivariate Gaussian probability density:

其中，J_b,w为

中变量的个数。Among them, J _b,w is

The number of variables in .

(3.3)求取子块中的各个变量的概率密度分布函数：(3.3) Obtain the probability density distribution function of each variable in the sub-block:

其中变量

J_b,w是变量块

中变量的个数，

表示X_b,i属于子块

时X_b,i的条件概率密度。where the variable

J _{b, w} are variable blocks

the number of variables in ,

Indicates that X _b,i belongs to the sub-block

The conditional probability density of X _b,i when .

(3.4)求取变量子块

和

的KL散度(w,v∈[1,W_b])：(3.4) Obtain variable sub-block

and

The KL divergence of (w, _v∈ [1,Wb]):

其中，

和

分别是

和

的概率密度函数，可以利用公式(13)计算。in,

and

respectively

and

The probability density function of , can be calculated using formula (13).

(3.5)利用蚁群算法优化步骤(3.1)的随机分块，使得以下目标函数最大化：(3.5) Use the ant colony algorithm to optimize the random block of step (3.1) so that the following objective function is maximized:

(3.6)通过重复步骤(3.2)—(3.5)，每个变量块(b＝1,2,...,B)进一步划分为若干个子块。原始数据集X分成不同的子块：(3.6) By repeating steps (3.2)-(3.5), each variable block (b=1, 2, . . . , B) is further divided into several sub-blocks. The original dataset X is divided into different sub-blocks:

其中，变量块

中所有变量具有很强的相关关系，变量子块

中的变量既有很强相关关系且具有类似的数据分布。where the variable block

All variables in have a strong correlation, the variable sub-block

The variables in are both strongly correlated and have similar data distributions.

(4)基于步骤(2)与(3)得到的变量分块结果，首先利用主元分析方法(PCA)描述变量子块

中各个变量的相关关系(4) Based on the variable block results obtained in steps (2) and (3), first use Principal Component Analysis (PCA) to describe the variable sub-blocks

The relationship between the various variables in

其中，P_b,w是负载矩阵，T_b,w是主元矩阵。Among them, P _b,w is the load matrix, and T _b,w is the pivot matrix.

(5)利用高斯混合模型(GMM)方法建立主元矩阵T_b,w的分布情况：(5) Use the Gaussian Mixture Model (GMM) method to establish the distribution of the principal matrix T _{b, w} :

其中，

是高斯分量的个数；

表示第m个分量的权重，

为包含子高斯成分的均值

和协方差矩阵

的参数集。in,

is the number of Gaussian components;

represents the weight of the mth component,

is the mean with sub-Gaussian components

and covariance matrix

parameter set.

(6)针对每个变量子块

建立BIP统计量(6) For each variable sub-block

Build BIP Statistics

其中，

表示

属于第m个分量

的概率，

为主元矩阵T_b,w第n(n＝1,2,..,N)行向量。

是基于局部马氏距离的概率，其定义为in,

express

belongs to the mth component

The probability,

is the nth (n=1,2,..,N) row vector of the principal matrix T _b,w .

is the probability based on the local Mahalanobis distance, which is defined as

其中，

为

到第m个高斯分量的马氏距离，t为T_b,w中任意一行。in,

for

Mahalanobis distance to the mth Gaussian component, t is any row in T _b,w .

(7)利用高斯混合模型(GMM)方法监测每个变量块中各个子块之间的关系，该步骤通过以下子步骤来实现。(7) Using the Gaussian Mixture Model (GMM) method to monitor the relationship between each sub-block in each variable block, this step is realized by the following sub-steps.

(7.1)将每个变量块X_b中各个子块的主元矩阵的第一列组合到一起：(7.1) Combine the first columns of the pivot matrix of each sub-block in each variable block X _b :

其中，t_b,w(w＝1,2,...,W_b)为主元矩阵T_b,w第1个列向量。Among them, t _b,w (w=1,2,...,W _b ) is the first column vector of the principal element matrix T _b,w .

(7.2)利用GMM描述主元数据

的分布情况：(7.2) Using GMM to describe main metadata

The distribution of :

其中，

为第b个变量子块中高斯分量的个数；

表示第m个分量的权重，

为包含子高斯成分的均值

和协方差矩阵

的参数集。in,

is the number of Gaussian components in the b-th variable sub-block;

represents the weight of the mth component,

is the mean with sub-Gaussian components

and covariance matrix

parameter set.

(7.3)针对每个变量块的主元数据

建立BIP统计量：(7.3) Main metadata for each variable block

Build BIP statistics:

其中，

为主元矩阵

第n(n＝1,2,..,N)行向量。

是基于局部马氏距离的概率。

是基于局部马氏距离的概率，

为

到第m个高斯分量的马氏距离，

为

中任意一行。in,

pivot matrix

The nth (n=1,2,..,N) row vector.

is the probability based on the local Mahalanobis distance.

is the probability based on the local Mahalanobis distance,

for

Mahalanobis distance to the mth Gaussian component,

for

any line in the .

(8)在线故障检测时，从变量子块，变量块，整个机组三个层次对过程进行监测。该步骤通过以下子步骤来实现。(8) During online fault detection, the process is monitored from three levels of variable sub-block, variable block and the entire unit. This step is achieved by the following sub-steps.

(8.1)获取新数据：按照步骤(1)采集各测点变量的值，记为z(1×J)。(8.1) Acquiring new data: According to step (1), collect the value of each measuring point variable, and record it as z(1×J).

(8.2)按照步骤(2)与步骤(3)得到的变量分块结果，将新数据进行子块分解：(8.2) According to the variable block results obtained in steps (2) and (3), decompose the new data into sub-blocks:

z＝[z₁ z₂ … z_b … z_B] (26)z=[z ₁ z ₂ … z _b … z _B ] (26)

其中z_b(b＝1,2,...,B)为第b个变量子块。where z _b (b=1,2,...,B) is the bth variable sub-block.

(8.3)在最底层，即变量子块层，将每个子块中

的数据向对应子块的主元方向进行投影：(8.3) At the bottom layer, that is, the variable sub-block layer, put each sub-block in

The data is projected to the pivot direction of the corresponding sub-block:

其中

是负载矩阵。in

is the load matrix.

(8.4)求取各个子块

的在线统计量指标：(8.4) Find each sub-block

Online statistic indicators for :

其中，上式各参数含义与公式(22)中类似。

表示

属于第m个分量

的概率。

是基于局部马氏距离的概率，

为

到第m个高斯分量的马氏距离，t为T_b,w中任意一行。The meanings of the parameters in the above formula are similar to those in formula (22).

express

belongs to the mth component

The probability.

is the probability based on the local Mahalanobis distance,

for

Mahalanobis distance to the mth Gaussian component, t is any row in T _b,w .

(8.5)在变量块层，首先将z_b中各个变量子块的主元组合到一起：(8.5) At the variable block layer, first combine the pivots of each variable sub-block in z _b :

(8.6)求取各个变量块z_b的在线统计量指标：(8.6) Obtain the online statistics index of each variable block z _b :

其中，上式各参数含义与公式(25)中类似。

表示

属于第m个分量

的概率。

是基于局部马氏距离的概率，

为

到第m个高斯分量的马氏距离，

为

中任意一行。The meanings of the parameters in the above formula are similar to those in formula (25).

express

belongs to the mth component

The probability.

is the probability based on the local Mahalanobis distance,

for

Mahalanobis distance to the mth Gaussian component,

for

any line in the .

(8.7)为了分析不同变量子块之间的关系，从整个机组层面对百万千瓦超超临界机组的运行状况进行监测，首先将各个变量块的BIP指标转换为正常(标记为‘N’)与故障(标记为‘F’)的概率：(8.7) In order to analyze the relationship between different variable sub-blocks, to monitor the operation status of the million kilowatt ultra-supercritical unit from the whole unit level, first convert the BIP index of each variable block to normal (marked as 'N') with the probability of failure (marked 'F'):

其中，BIP_b,lmt为统计量BIP指标的控制限；

表示第b个变量块正常的条件概率；

表示第b个变量块发生故障的条件概率。Among them, BIP _{b, lmt} is the control limit of the statistic BIP indicator;

represents the normal conditional probability of the b-th variable block;

represents the conditional probability of failure of the bth variable block.

(8.8)通过贝叶斯规则，计算第b个变量块发生故障的后验概率(8.8) Calculate the posterior probability of failure of the bth variable block by Bayesian rule

其中，P_b(F)＝α；P_b(N)＝1-α分别表示在显著性水平为α下过程发生故障或正常的先验概率。Among them, P _b (F) = α; P _b (N) = 1-α represent the prior probability of failure or normality of the process at the significance level of α, respectively.

(8.9)综合考虑所有变量块的故障概率，计算全局监测统计量(8.9) Comprehensively consider the failure probability of all variable blocks, and calculate the global monitoring statistics

(9)判断过程运行状态：从变量子块、变量块、整个机组三个层次对过程状态进行分析。实时比较三个层次的统计量与控制限：(9) Judging the process running state: analyze the process state from three levels of variable sub-block, variable block and the whole unit. Compare statistics and control limits at three levels in real time:

(a)在每个变量子块

中，如果BIP_b,w＞1-α，说明在子块

中的变量发生了故障，否则认为子块中的变量运行在正常范围内。(a) in each variable subblock

, if BIP _b,w > 1-α, it means that in the sub-block

The variables in the sub-block are considered faulty, otherwise the variables in the sub-block are considered to be operating within the normal range.

(b)在变量块层面，如果BIP_b＞1-α，说明该变量块中各个变量子块的相关关系发生了异常，否则说明第b个子块中的所有变量都运行正常。(b) At the variable block level, if BIP _b > 1-α, it means that the correlation of each variable sub-block in the variable block is abnormal; otherwise, it means that all variables in the bth sub-block are running normally.

(c)在机组层面，如果PF_z＞α，说明在百万千瓦超超临界机组运行过程中发生了异常或故障，否则说明机组整体正常运行。(c) At the unit level, if PF _z >α, it means that an abnormality or failure has occurred during the operation of the million kilowatt ultra-supercritical unit; otherwise, it means that the unit is operating normally as a whole.

与现有技术相比，本发明的有益效果在于：本发明的目的在于针对百万千瓦超超临界机组规模庞大、设备多样、参数众多且相互影响，而且整个发电过程工业流程长、单元装置多、空间分布广、工况频繁切换的特点，提出一种面向百万千万超超临界机组的多层分布式监测算法。该方法综合考虑变量间的相关关系以及变量在样本方向的分布情况，运用多层信息理论分解方法对变量进行分块，充分发掘了过程变量间的过程信息，有利于对百万千瓦超超临界机组复杂过程特性的了解。多层分布式的监测方法既可以有效挖掘过程的局部信息，又可以分析不同变量子块之间的相关关系，大大提高了百万千瓦超超临界机组这一复杂过程的故障检测性能，从而保证了大型燃煤发电机组的安全可靠运行。Compared with the prior art, the beneficial effects of the present invention are as follows: the purpose of the present invention is to aim at the large scale of the million kilowatt ultra-supercritical unit, various equipment, numerous parameters and mutual influence, and the entire power generation process has a long industrial process and many unit devices. , wide spatial distribution and frequent switching of operating conditions, a multi-layer distributed monitoring algorithm for millions of ultra-supercritical units is proposed. The method comprehensively considers the correlation between variables and the distribution of variables in the sample direction, and uses the multi-layer information theory decomposition method to divide the variables into blocks, fully exploring the process information between the process variables, which is conducive to the analysis of the million kilowatt ultra-supercritical Knowledge of complex process characteristics of units. The multi-layer distributed monitoring method can not only effectively mine the local information of the process, but also analyze the correlation between different variable sub-blocks, which greatly improves the fault detection performance of the complex process of the million-kilowatt ultra-supercritical unit, thereby ensuring Safe and reliable operation of large coal-fired generating units.

附图说明：Description of drawings:

图1是本发明所述的面向百万千瓦超超临界机组的分布分层式在线故障检测方法的说明图；Fig. 1 is the explanatory diagram of the distributed layered on-line fault detection method of the present invention facing one million kilowatts of ultra-supercritical units;

图2是本发明方法在具体实施例中的变量子块中的监测结果，(a)为在第5个变量块中的两个变量子块的监测结果，(b)为在第7个变量块中的三个变量子块的监测结果，(c)为在第9个变量块中的三个变量子块的监测结果。2 is the monitoring result of the method of the present invention in the variable sub-block in the specific embodiment, (a) is the monitoring result of the two variable sub-blocks in the fifth variable block, (b) is the monitoring result in the seventh variable block The monitoring results of the three variable sub-blocks in the block, (c) is the monitoring results of the three variable sub-blocks in the ninth variable block.

图3是本发明方法在具体实施例中的变量层中的监测结果，(a)为在第5个变量块的监测结果，(b)为在第7个变量块的监测结果，(c)为在第9个变量块的监测结果。Fig. 3 is the monitoring result in the variable layer of the method of the present invention in the specific embodiment, (a) is the monitoring result in the 5th variable block, (b) is the monitoring result in the 7th variable block, (c) For the monitoring results in the 9th variable block.

图4是本发明方法在具体实施例中的机组层面的监测结果。FIG. 4 is a monitoring result at the unit level of the method of the present invention in a specific embodiment.

具体实施方式Detailed ways

下面结合附图及具体实例，对本发明作进一步详细说明。The present invention will be described in further detail below in conjunction with the accompanying drawings and specific examples.

本发明以浙能集团下属嘉华电厂3号机组为例，该机组为百万千瓦超超临界机组，其功率为600MW，共包括154个过程变量，这些变量涉及到压力、温度、流量、流速等。The present invention takes the No. 3 unit of Jiahua Power Plant under Zheneng Group as an example. This unit is a million kilowatt ultra-supercritical unit with a power of 600 MW and includes a total of 154 process variables. These variables involve pressure, temperature, flow, and flow rate. Wait.

如图1所示，本发明是一种面向百万千瓦超超临界机组的动静特征协同分析的在线监测方法，包括以下步骤：As shown in Figure 1, the present invention is a kind of on-line monitoring method for the collaborative analysis of dynamic and static characteristics of a million-kilowatt ultra-supercritical unit, comprising the following steps:

(1)获取正常待分析数据：设一个百万千瓦超超临界机组具有J个测量变量和操作变量，每一次采样可以得到一个1×J的向量，采样N次后获取的数据表述为一个二维矩阵X＝[X₁,X₂,...,X_J]∈R^N×J。本实例中，采样周期为1分钟，共采集火电机组正常运行过程中2940个样本数据用于变量分块以及离线建模，154个过程变量，即建模数据为X(2940×154)。其中所述测量变量为机组正常运行过程中可被测量的状态参数，包括流量、电压、电流、温度、速率等；所述操作变量包括进风量、给料量、阀门开度等；(1) Obtaining normal data to be analyzed: Suppose a million kilowatt ultra-supercritical unit has J measurement variables and operating variables, each sampling can obtain a 1 × J vector, and the data obtained after N sampling is expressed as a two A dimensional matrix X=[X ₁ , X ₂ ,...,X _J ]∈R ^N×J . In this example, the sampling period is 1 minute, and a total of 2940 sample data are collected during the normal operation of the thermal power unit for variable block and offline modeling, and 154 process variables, that is, the modeling data is X (2940×154). Wherein, the measured variables are state parameters that can be measured during the normal operation of the unit, including flow, voltage, current, temperature, speed, etc.; the operating variables include air intake volume, feed volume, valve opening, etc.;

H(X_i)＝-∫_xp(X_i)logp(X_i)dx (2)H(X _i )=-∫ _x p(X _i )logp(X _i )dx (2)

(2.2)基于公式(1)求取的互信息，求取两两变量之间的广义相关系数：(2.2) Based on the mutual information obtained by formula (1), obtain the generalized correlation coefficient between the two variables:

其中，r_ij∈[0,1]。where r _ij ∈ [0,1].

(2.3)基于公式(4)，求取变量的相关矩阵：(2.3) Based on formula (4), obtain the correlation matrix of variables:

(2.4)基于公式(5)定义的相关矩阵R，求取斜对角矩阵D：(2.4) Based on the correlation matrix R defined by formula (5), obtain the diagonal diagonal matrix D:

D＝diag{D_ii} (6)D=diag{ _Dii } (6)

其中，D_ii为式(5)中第i行所有元素的和Among them, D _ii is the sum of all elements in the i-th row in formula (5)

L＝D^-1/2RD^-1/2 (8)L=D ^-1/2 RD ^-1/2 (8)

L＝PΛP^T (9)L=PΛP ^T (9)

X＝[X₁ X₂ ... X_B] (11)X=[X ₁ X ₂ ... X _B ] (11)

其中，

是第b(b＝1,2,...,B)个变量块，J_b表示X_b中包含的变量个数。in,

在本次实例中，根据相关关系，154个过程变量共分成了11个子块，如表1所示，每个子块中变量具有较强的相关关系，不同子块之间相关关系较弱。In this example, according to the correlation, the 154 process variables are divided into 11 sub-blocks. As shown in Table 1, the variables in each sub-block have strong correlation, and the correlation between different sub-blocks is weak.

表1.百万千瓦超超临界机组中变量分块情况Table 1. Variable block situation in 1 million kilowatt ultra-supercritical units

(3)利用基于高斯混合模型的信息论分解方法将上述11个变量块中的变量根据样本方向上的分布情况进一步分解，该步骤通过以下子步骤来实现(3) Use the information theory decomposition method based on Gaussian mixture model to further decompose the variables in the above 11 variable blocks according to the distribution in the sample direction. This step is realized by the following sub-steps

其中,

是子高斯成分的个数；

是第m个子高斯成分的先验概率，满足

以及

为包含子高斯成分的均值

和协方差矩阵

的参数集。

为多元高斯概率密度：in,

is the number of sub-Gaussian components;

is the prior probability of the mth sub-Gaussian component, satisfying

as well as

is the mean with sub-Gaussian components

and covariance matrix

parameter set.

is the multivariate Gaussian probability density:

其中，J_b,w为

中变量的个数。Among them, J _b,w is

The number of variables in .

其中变量

J_b,w是变量块

中变量的个数，

表示X_b,i属于子块

时X_bi的条件概率密度。where the variable

J _{b, w} are variable blocks

the number of variables in ,

Indicates that X _b,i belongs to the sub-block

The conditional probability density of X _bi when .

(3.4)求取变量子块

和

的KL散度(w,v∈[1,W_b])：(3.4) Obtain variable sub-block

and

The KL divergence of (w, _v∈ [1,Wb]):

其中，

和

分别是

和

的概率密度函数，可以利用公式(13)计算。in,

and

respectively

and

The probability density function of , can be calculated using formula (13).

其中，变量块

中所有变量具有很强的相关关系，变量子块

All variables in have a strong correlation, the variable sub-block

在本次实例中，根据变量的分布情况，将步骤(2)中得到的11个变量块进行进一步划分为27个变量子块，每个子块中的变量既具有较强的相关关系，又具有相同的分布。In this example, according to the distribution of variables, the 11 variable blocks obtained in step (2) are further divided into 27 variable sub-blocks. The variables in each sub-block have strong correlations and the same distribution.

表2.百万千瓦超超临界机组中变量子块的划分情况Table 2. The division of variable sub-blocks in the million kilowatt ultra-supercritical unit

The relationship between the various variables in

其中，

是高斯分量的个数；

表示第m个分量的权重，

为包含子高斯成分的均值

和协方差矩阵

的参数集。in,

is the number of Gaussian components;

represents the weight of the mth component,

is the mean with sub-Gaussian components

and covariance matrix

parameter set.

(6)针对每个变量子块

建立BIP统计量(6) For each variable sub-block

Build BIP Statistics

其中，

表示

属于第m个分量

的概率，

为主元矩阵T_b,w第n(n＝1,2,..,N)行向量。

是基于局部马氏距离的概率，其定义为in,

express

belongs to the mth component

The probability,

is the nth (n=1,2,..,N) row vector of the principal matrix T _b,w .

is the probability based on the local Mahalanobis distance, which is defined as

其中，

为

到第m个高斯分量的马氏距离，t为T_b,w中任意一行。in,

for

Mahalanobis distance to the mth Gaussian component, t is any row in T _b,w .

(7.2)利用GMM描述主元数据

的分布情况：(7.2) Using GMM to describe main metadata

The distribution of :

其中，

为第b个变量子块中高斯分量的个数；

表示第m个分量的权重，

为包含子高斯成分的均值

和协方差矩阵

的参数集。in,

is the number of Gaussian components in the b-th variable sub-block;

represents the weight of the mth component,

is the mean with sub-Gaussian components

and covariance matrix

parameter set.

(7.3)针对每个变量块的主元数据

建立BIP统计量：(7.3) Main metadata for each variable block

Build BIP statistics:

其中，

为主元矩阵

第n(n＝1,2,..,N)行向量。

是基于局部马氏距离的概率。

是基于局部马氏距离的概率，

为

到第m个高斯分量的马氏距离，

为

中任意一行。in,

pivot matrix

The nth (n=1,2,..,N) row vector.

is the probability based on the local Mahalanobis distance.

is the probability based on the local Mahalanobis distance,

for

Mahalanobis distance to the mth Gaussian component,

for

any line in the .

(8.1)故障数据准备：这里采集故障数据一共包含460个样本，数据记为Z(460×154)，该故障为循环水泵出口压力增大，该故障发生在第121个采样点。(8.1) Fault data preparation: The fault data collected here contains a total of 460 samples, and the data is recorded as Z (460×154). The fault is that the outlet pressure of the circulating water pump increases, and the fault occurs at the 121st sampling point.

(8.2)按照步骤(2)与步骤(3)得到的变量分块结果，将新数据记为z(1×154)进行子块分解，在本次实例中B＝11：(8.2) According to the variable block results obtained in steps (2) and (3), the new data is denoted as z(1×154) for sub-block decomposition. In this example, B=11:

z＝[z₁ z₂ ... z_b ... z_B] (26)z=[z ₁ z ₂ ... z _b ... z _B ] (26)

(8.3)在最底层，即变量子块层，将每个子块中

The data is projected to the pivot direction of the corresponding sub-block:

(8.4)求取各个子块

的在线统计量指标：(8.4) Find each sub-block

Online statistic indicators for :

其中，上式各参数含义与公式(22)中类似。

表示

属于第m个分量

的概率。

是基于局部马氏距离的概率，

为

express

belongs to the mth component

The probability.

is the probability based on the local Mahalanobis distance,

for

Mahalanobis distance to the mth Gaussian component, t is any row in T _b,w .

其中，上式各参数含义与公式(25)中类似。

表示

属于第m个分量

的概率。

是基于局部马氏距离的概率，

为

到第m个高斯分量的马氏距离，

为

express

belongs to the mth component

The probability.

is the probability based on the local Mahalanobis distance,

for

Mahalanobis distance to the mth Gaussian component,

for

any line in the .

其中，BIP_b,lmt为统计量BIP的控制限，在本具体实施例中取0.5；

表示第b个变量块正常的条件概率；

表示第b个变量块发生故障的条件概率。Wherein, BIP _{b, lmt} is the control limit of the statistic BIP, which is taken as 0.5 in this specific embodiment;

represents the normal conditional probability of the b-th variable block;

represents the conditional probability of failure of the bth variable block.

其中，P_b(F)＝α；P_b(N)＝1-α分别表示在显著性水平为α下过程发生故障或正常的先验概率，在本实例中，显著性水平α＝0.5。Among them, P _b (F)=α; P _b (N)=1-α respectively represent the prior probability of failure or normality of the process at the significance level α, in this example, the significance level α=0.5.

(a)在每个变量子块

中，如果BIP_b,w＞1-α，说明在子块

, if BIP _b,w > 1-α, it means that in the sub-block

利用本发明的监测方法对火电过程进行在线过程监测，结果如图2-图4所示。图2显示了本发明方法在8个变量子块的监测结果。从图2(a)中可以看出，第五个变量块的两个变量子块的统计量基本都在控制限以下，说明当前故障并没有影响到变量块#5中的各个变量，这些变量均正常运行。从图2(b)中可以看出，在前120个样本中，变量块#7的前两个子块的统计量基本都在控制限以下，显示过程正常运行。从第121个样本开始，变量子块统计量BIP_sub7,1和BIP_sub7,2迅速超出控制限，检测到故障发生，这说明该故障明显影响到这两个子块中的变量。同样，分析图2(c)的监测结果，可以发现变量块#9的前两个子块中的变量基本正常运行，但是BIP_sub9,3有效检测到了故障的发生。Using the monitoring method of the present invention to monitor the thermal power process on-line, the results are shown in Figures 2-4. Figure 2 shows the monitoring results of the method of the present invention in 8 variable sub-blocks. As can be seen from Figure 2(a), the statistics of the two variable sub-blocks of the fifth variable block are basically below the control limit, indicating that the current fault does not affect each variable in variable block #5. are operating normally. As can be seen from Figure 2(b), in the first 120 samples, the statistics of the first two sub-blocks of variable block #7 are basically below the control limit, indicating that the process is running normally. Starting from the 121st sample, the variable subblock statistics BIP _sub7,1 and BIP _sub7,2 rapidly exceed the control limits, and a fault is detected, indicating that the fault clearly affects the variables in these two subblocks. Similarly, analyzing the monitoring results in Figure 2(c), it can be found that the variables in the first two sub-blocks of variable block #9 are basically operating normally, but BIP _sub9,3 effectively detects the occurrence of faults.

图3显示了本发明方法在变量层中的部分监测结果。图3(a)中可以看出，第五个变量块基本都在控制限以下，表明变量块#5中的两个变量子块之间的相关关系没有受到故障的影响，变量块#5中的过程变量正常运行。图3(b)和图3(c)的监测结果分别显示变量块#7和变量块#9中的变量子块的相关关系受到了故障的影响，发生了异常。图4显示了发明方法在机组层的监测结果。可以看到，统计量在前120个样本基本运行在控制限以下，说明百万千瓦超超临界机组运行在正常工况。从第121个样本开始，统计量PF_z立即超出控制限，有效检测到故障的发生。总体来说，本发明的分层分布式故障检测方法在监测百万千瓦超超临界这一典型的大规模多工况过程时具有优越的故障检测性能，其分块结果有效分析众多变量之间复杂的相关关系，既可以加深操作人员对过程的理解，又可以为实际火电厂工业现场的技术管理部门提供高精度的在线过程监测结果，为实时判断过程运行状态，识别是否有故障发生提供可靠依据，进一步提高了百万千瓦超超临界机组运行的安全性、可靠性和有效性。Figure 3 shows part of the monitoring results of the method of the present invention in the variable layer. As can be seen in Figure 3(a), the fifth variable block is basically below the control limit, indicating that the correlation between the two variable sub-blocks in variable block #5 is not affected by the fault. The process variable is operating normally. The monitoring results of Fig. 3(b) and Fig. 3(c) respectively show that the correlation of variable sub-blocks in variable block #7 and variable block #9 is affected by the fault, and abnormality occurs. Figure 4 shows the monitoring results of the inventive method at the unit level. It can be seen that the statistics are basically operating below the control limit in the first 120 samples, indicating that the million-kilowatt ultra-supercritical unit is operating under normal conditions. Starting from the 121st sample, the statistic PF _z immediately exceeds the control limit, effectively detecting the occurrence of a fault. In general, the layered distributed fault detection method of the present invention has superior fault detection performance when monitoring a typical large-scale multi-working condition process of one million kilowatts of ultra-supercritical, and its block results can effectively analyze the relationship between many variables. The complex correlation can not only deepen the operator's understanding of the process, but also provide high-precision online process monitoring results for the technical management department of the actual thermal power plant industrial site, and provide reliable information for judging the process operating status in real time and identifying whether there is a fault. Based on this, the safety, reliability and effectiveness of the operation of the million-kilowatt ultra-supercritical unit are further improved.

Claims

1. A distribution layered online fault detection method for a million kilowatt ultra-supercritical unit is characterized by comprising the following steps:

(1) acquiring normal data to be analyzed: a million-kilowatt ultra-supercritical unit is provided with J measurement variables and operation variables, a vector of 1 XJ can be obtained by sampling every time, and data acquired after sampling N times is expressed as a two-dimensional matrix X ═ X₁,X₂,...,X_J]∈R^N×JThe measured variables are state parameters which can be measured in the normal operation process of the unit, and comprise flow, voltage, current, temperature, speed and the like; the operation variables comprise air intake, feeding amount, valve opening and the like;

(2) the process variable is divided into different sub-blocks by using a spectral clustering method based on mutual information, the variable in the same sub-block has stronger correlation, and the correlation between different sub-blocks is weaker. This step is realized by the following substeps:

(2.1) obtaining mutual information among variables:

I(X_i,X_j)＝H(X_i)+H(X_j)-H(X_i,X_j) (1)

wherein, X_i(i ═ 1, 2.., J) denotes the ith variable, H (X)_i) Is a variable X_iThe information entropy of (2):

H(X_i)＝-∫_xp(X_i)logp(X_i)dx (2)

H(X_i,X_j) Is a variable X_iAnd X_jJoint information entropy of (a):

p(X_i) And p (X)_j) Represents variable X_iAnd X_jP (X) is a probability density function of_i,X_j) Is a joint probability density function.

(2.2) solving the generalized correlation coefficient between every two variables based on the mutual information solved by the formula (1):

wherein r is_ij∈[0,1]。

(2.3) based on equation (4), a correlation matrix of the variables is found:

(2.4) solving a diagonal matrix D based on the correlation matrix R defined by the formula (5):

D＝diag{D_ii} (6)

wherein D is_iiIs the sum of all elements in row i in equation (5):

(2.5) solving a Laplace matrix of the diagonal matrix D:

L＝D^-1/2RD^-1/2(8)

(2.6) performing spectral decomposition on the Laplace matrix:

L＝PΛP^T(9)

wherein, P ═ P₁,P₂,...,P_J]Are orthogonal eigenvectors.

(2.7) selecting the eigenvectors corresponding to the k maximum eigenvalues to form a matrix E ═ P₁,P₂,...,P_k]∈R^J×kNormalizing each row in the matrix E to obtain a matrix Y:

(2.8) clustering Y by using a kmeans clustering algorithm, and if the ith row belongs to the b-th class, carrying out variable X_iDivision into sub-block b X_b. Thus, a plurality of operation variables of the million-kilowatt ultra-supercritical unit are divided into B variable blocks according to the correlation degree:

X＝[X₁X₂… X_B](11)

wherein,

is the (B ═ 1, 2.., B) th variable block, J_bRepresents X_bThe number of variables contained in (1).

(3) The variables in the variable block are further decomposed according to the distribution situation in the sample direction by using an information theory decomposition method based on a Gaussian mixture model, and the step is realized by the following substeps.

(3.1) randomly dividing the variable block into W_bIndividual block:

(3.2) using gaussian mixture model method to obtain W (W1, 2.., W)_b) Probability density of individual variable subblocks:

wherein,

is the number of sub-gaussian components;

is the prior probability of the mth sub-Gaussian component, satisfies

And

is a mean value containing sub-Gaussian components

Sum covariance matrix

Of the parameter set (c).

Is a multivariate gaussian probability density:

wherein, J_b,wIs composed of

The number of medium variables.

(3.3) solving probability density distribution functions of all variables in the sub-blocks:

wherein the variable

J_b,wIs a variable block

The number of the medium variables is equal to or greater than the total number of the medium variables,

represents X_b,iBelonging to sub-blocks

When X_b,iThe conditional probability density of (2).

(3.4) solving variable subblocks

And

KL divergence of (W, v ∈ [1, W ]_b])：

Wherein,

and

are respectively

And

the probability density function of (2) can be calculated by using the formula (13).

(3.5) optimizing the random partitions of step (3.1) using an ant colony algorithm such that the following objective function is maximized:

(3.6) each variable block (B ═ 1, 2.., B) is further divided into sub-blocks by repeating steps (3.2) - (3.5). The original data set X is divided into different sub-blocks:

wherein, the variable block

(4) Based on the variable block result obtained in the steps (2) and (3), firstly, describing the variable sub-block by using a Principal Component Analysis (PCA)

The correlation relationship of each variable in (1):

wherein, P_b,wIs a load matrix, T_b,wIs a principal component matrix.

(5) Principal component matrix T established by Gaussian Mixture Model (GMM) method_b,wThe distribution of (c):

wherein,

is the number of gaussian components;

the weight of the m-th component is represented,

is a mean value containing sub-Gaussian components

Sum covariance matrix

Of the parameter set (c).

(6) For each variable sub-block

Establishing BIP statistics

Wherein,

to represent

Belonging to the m-th component

The probability of (a) of (b) being,

as a principal component matrix T_b,wAn nth (N ═ 1, 2.., N) row vector.

Is based on the probability of local Mahalanobis distance, which is defined as

Wherein,

is composed of

Mahalanobis distance to mth gaussian component, T being T_b,wAny one of the rows.

(7) The relationship between each sub-block in each variable block is monitored by a Gaussian Mixture Model (GMM) method, which is implemented by the following sub-steps.

(7.1) blocking each variable block X_bThe first columns of the pivot matrices of the respective sub-blocks are combined together:

wherein, t_b,w(w＝1,2,...,W_b) As a principal component matrix T_b,wThe 1 st column vector.

(7.2) describing principal metadata with GMM

The distribution of (c):

wherein,

the number of Gaussian components in the b variable sub-block is shown;

the weight of the m-th component is represented,

is composed of sub-Gaussian componentsMean value of

Sum covariance matrix

Of the parameter set (c).

(7.3) Main metadata for each variable Block

Establishing BIP statistic:

wherein,

as a principal component matrix

An nth (N ═ 1, 2.., N) row vector.

To represent

Belonging to the m-th component

The probability of (c).

Is based on the probability of the local mahalanobis distance,

is composed of

Mahalanobis distance to the mth gaussian component,

is composed of

Any one of the rows.

(8) During online fault detection, the process is monitored from three levels of variable subblocks, variable blocks and the whole unit. This step is realized by the following substeps.

(8.1) acquiring new data: and (4) collecting the variable values of the measuring points according to the step (1) and recording as z (1 multiplied by J).

(8.2) according to the variable blocking results obtained in the step (2) and the step (3), carrying out sub-block decomposition on the new data:

z＝[z₁z₂... z_b... z_B](26)

wherein z is_b(B ═ 1, 2.., B) is the B variable sub-block.

(8.3) at the bottom layer, i.e., variable sub-block layer, each sub-block

The data of (2) are projected to the principal element direction of the corresponding sub-block:

wherein

Is a load matrix.

(8.4) obtaining each sub-block

The online statistic index of (1):

wherein, the meaning of each parameter in the above formula is similar to that in the formula (22).

To represent

Belonging to the m-th component

The probability of (c).

Is based on the probability of the local mahalanobis distance,

is composed of

(8.5) at the variable block level, z is first put_bThe main elements of each variable sub-block are combined together:

(8.6) obtaining each variable block z_bThe online statistic index of (1):

wherein, the meaning of each parameter of the above formula is similar to that in the formula (25).

To represent

Belonging to the m-th component

The probability of (c).

Is based on the probability of the local mahalanobis distance,

is composed of

Mahalanobis distance to the mth gaussian component,

is composed of

Any one of the rows.

(8.7) converting the BIP index of each variable block into the probability of normal (labeled 'N') and fault (labeled 'F'):

wherein BIP_b,lmtA control limit for the statistical BIP indicator;

representing the normal conditional probability of the b variable block;

indicating the conditional probability of the failure of the b-th variable block.

(8.8) calculating the posterior probability of the b variable block failing by the Bayes rule

Wherein, P_b(F)＝α；P_b(N) ═ 1- α represents the prior probability of the process failing or being normal, respectively, at a level of significance α.

(8.9) comprehensively considering the fault probability of all variable blocks and calculating the global monitoring statistic

(9) Judging the running state of the process: and analyzing the process state from three levels of the variable subblocks, the variable blocks and the whole unit. Three levels of statistics are compared with the control limits in real time:

(a) at each variable sub-block

In, if BIP_b,w1-alpha, in sub-blocks

The variable in (b) fails, otherwise the variable in the sub-block is considered to be operating within the normal range.

(b) At the variable block level, if BIP_bIf the value is more than 1-alpha, the related relation of each variable sub-block in the variable block is abnormal, otherwise, all the variables in the b-th sub-block are normally operated.

(c) At the unit level, if PF_zIf the measured value is more than alpha, the abnormal condition or the fault occurs in the running process of the million kilowatt ultra-supercritical unit, otherwise, the whole unit runs normally.