CN111880090A - Distribution layered online fault detection method for million-kilowatt ultra-supercritical unit - Google Patents

Distribution layered online fault detection method for million-kilowatt ultra-supercritical unit Download PDF

Info

Publication number
CN111880090A
CN111880090A CN201910579518.8A CN201910579518A CN111880090A CN 111880090 A CN111880090 A CN 111880090A CN 201910579518 A CN201910579518 A CN 201910579518A CN 111880090 A CN111880090 A CN 111880090A
Authority
CN
China
Prior art keywords
variable
block
sub
variables
matrix
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910579518.8A
Other languages
Chinese (zh)
Other versions
CN111880090B (en
Inventor
赵春晖
张淑美
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN201910579518.8A priority Critical patent/CN111880090B/en
Publication of CN111880090A publication Critical patent/CN111880090A/en
Application granted granted Critical
Publication of CN111880090B publication Critical patent/CN111880090B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01RMEASURING ELECTRIC VARIABLES; MEASURING MAGNETIC VARIABLES
    • G01R31/00Arrangements for testing electric properties; Arrangements for locating electric faults; Arrangements for electrical testing characterised by what is being tested not provided for elsewhere
    • G01R31/34Testing dynamo-electric machines
    • G01R31/343Testing dynamo-electric machines in operation
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01MTESTING STATIC OR DYNAMIC BALANCE OF MACHINES OR STRUCTURES; TESTING OF STRUCTURES OR APPARATUS, NOT OTHERWISE PROVIDED FOR
    • G01M99/00Subject matter not provided for in other groups of this subclass
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Probability & Statistics with Applications (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Testing Or Calibration Of Command Recording Devices (AREA)

Abstract

本发明公开了一种面向百万千瓦超超临界机组的分布分层式在线故障检测方法。针对百万千瓦超超临界机组过程变量众多、变化工况复杂的问题,综合考虑变量间的相关关系以及变量在样本方向的分布情况,运用多层信息理论分解方法对变量进行分块,基于分块结果,结合高斯混合模型方法与贝叶斯理论,建立面向百万千万超超临界机组的多层分布式监测算法。该方法充分发掘了过程变量间的相关信息,有利于对百万千瓦超超临界机组复杂过程特性的了解,多层分布式的监测方法既可以有效挖掘过程的局部信息,又可以分析不同变量子块之间的相关关系,大大提高了百万千瓦超超临界机组这一复杂过程的故障检测性能,从而保证了大型燃煤发电机组的安全可靠运行。The invention discloses a distributed layered on-line fault detection method for a million-kilowatt ultra-supercritical unit. Aiming at the problems of numerous process variables and complex working conditions of mega-kilowatt ultra-supercritical units, comprehensively considering the correlation between variables and the distribution of variables in the sample direction, the multi-layer information theory decomposition method is used to divide the variables into blocks. Based on the block results, combined with the Gaussian mixture model method and Bayesian theory, a multi-layer distributed monitoring algorithm for millions of ultra-supercritical units is established. This method fully explores the relevant information among the process variables, which is beneficial to the understanding of the complex process characteristics of the million-kilowatt ultra-supercritical unit. The multi-layer distributed monitoring method can not only effectively mine the local information of the process, but also analyze the different variables. The correlation between the blocks greatly improves the fault detection performance of the complex process of the million-kilowatt ultra-supercritical unit, thereby ensuring the safe and reliable operation of large coal-fired generating units.

Description

一种面向百万千瓦超超临界机组的分布分层式在线故障检测 方法A distributed layered online fault detection for million kilowatt ultra-supercritical units method

技术领域technical field

本发明属于火电过程故障检测领域,特别是针对以重面向变量众多且工况频繁波动的百万千瓦超超临界机组的分布分层式在线过程监测方法。The invention belongs to the field of thermal power process fault detection, in particular to a distributed layered on-line process monitoring method for a million-kilowatt ultra-supercritical unit with numerous variables and frequent fluctuations in operating conditions.

背景技术Background technique

电力工业是国民经济的重要基础产业,是国家经济发展战略中的重点项目。随着经济高速发展,电力需求也迅速增长。而煤炭资源是我国的主要能源,因此在未来相当长的时期内以煤为主的能源结构难以得到根本改变。作为中国的主力电源,燃煤发电装机容量始终在70%以上。据统计,在旺盛的用电需求推动下,2018年1-8月全社会用电量累计高达45296亿千瓦时,同比增长9.0%。其中,火电累计发电量为33103亿千瓦时,约占全国总发电量的73.1%,同比增长7.2%。近年来,为实现电力可持续发展,火力发电行业积极开展结构调整,“上大压小”,以大容量、低能耗的超(超)临界机组取代高能耗小火电机组,基本形成了以百万千瓦超超临界机组等大型燃煤发电机组为主体的电力能源结构。因此,针对百万千瓦超超临界机组的分析研究具有重大实际意义和应用价值。Electric power industry is an important basic industry of the national economy and a key project in the national economic development strategy. With the rapid economic development, the demand for electricity has also increased rapidly. Coal resources are the main energy in my country, so it is difficult to fundamentally change the coal-dominated energy structure for a long period of time in the future. As China's main power source, the installed capacity of coal-fired power generation has always been above 70%. According to statistics, driven by the strong demand for electricity, the total electricity consumption of the whole society from January to August 2018 reached 4,529.6 billion kWh, a year-on-year increase of 9.0%. Among them, the cumulative power generation of thermal power was 3,310.3 billion kWh, accounting for about 73.1% of the country's total power generation, a year-on-year increase of 7.2%. In recent years, in order to realize the sustainable development of electric power, the thermal power industry has actively carried out structural adjustment, "superpowering the large and suppressing the small", replacing the high-energy consumption small thermal power units with large-capacity, low-energy-consumption ultra (super) critical units, basically forming hundreds of The power energy structure is mainly composed of large-scale coal-fired generating units such as 10,000-kilowatt ultra-supercritical units. Therefore, the analysis and research on the million kilowatt ultra-supercritical unit has great practical significance and application value.

与传统发电机组相比,百万千瓦超超临界机组规模庞大、设备多样、参数众多且相互影响,而且整个发电过程工业流程长、单元装置多、空间分布广、安全要求高,这些都给百万千瓦超超临界机组的状态监测与故障诊断带来了困难。另外,由于环境条件、燃料特性和负荷大小等原因的不同,百万千瓦超超临界机组可能运行在不同的工况条件下。特别是近年来由于风电、光电等新能源并网造成的电网负荷波动、峰谷差加大以及用户侧需求的变化,导致机组出现频繁深度调峰等新常态,机组常处于不同工况切换的全工况运行模式。而且大型燃煤发电过程环境复杂,且工业流程长,即使在同一工况下,许多变量仍呈现出不同的数据分布特性。这些均为大型燃煤发电机组的故障检测与诊断提出了极大的挑战。Compared with traditional generator sets, the mega-kilowatt ultra-supercritical unit has a large scale, diverse equipment, numerous parameters and mutual influences, and the entire power generation process has a long industrial process, many unit devices, wide space distribution, and high safety requirements. Condition monitoring and fault diagnosis of 10,000-kilowatt ultra-supercritical units have brought difficulties. In addition, due to different environmental conditions, fuel characteristics and load sizes, the 1000MW ultra-supercritical unit may operate under different working conditions. Especially in recent years, due to grid load fluctuations, increased peak-to-valley difference, and changes in user-side demand caused by the integration of new energy sources such as wind power and photovoltaics, the new normals such as frequent and deep peak shaving have led to the occurrence of frequent and deep peak shaving. full operating mode. In addition, the large-scale coal-fired power generation process has a complex environment and a long industrial process. Even under the same working conditions, many variables still show different data distribution characteristics. All these pose great challenges to the fault detection and diagnosis of large coal-fired generating units.

针对火力发电机组故障检测的问题,前人对此已从不同的角度做了相应的研究与探讨,提出了相应的在线过程监测方法。然而,目前现有的方法大多主要是集中式单工况的监测方法。面对百万千瓦超超临界机组流程长、变量众多、相关关系复杂、动态工况的特点,集中式单工况的监测方法无法得到很好的监测效果。本发明的内容深入考虑了百万千瓦超超临界机组众多变量间的复杂相关关系以及变量在样本方向的多分布情况,提出了一种新的面向百万千瓦超超临界机组的多层分布式在线故障检测方法。Aiming at the problem of fault detection of thermal power generating units, predecessors have done corresponding researches and discussions from different angles, and put forward corresponding online process monitoring methods. However, most of the existing methods are mainly centralized monitoring methods for a single working condition. Faced with the characteristics of long process flow, numerous variables, complex correlations, and dynamic working conditions of mega-kilowatt ultra-supercritical units, the centralized monitoring method of single working condition cannot obtain a good monitoring effect. The content of the present invention deeply considers the complex correlation among many variables of the million-kilowatt ultra-supercritical unit and the multi-distribution of variables in the sample direction, and proposes a new multi-layer distributed system for the million-kilowatt ultra-supercritical unit. Online fault detection method.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于针对现有的百万千瓦超超临界机组故障检测方法无法准确描述局部信息的问题,提出一种面向百万千万超超临界机组的多层分布式监测算法。该方法综合考虑变量间的相关关系以及变量在样本方向的分布情况,运用多层信息理论分解方法对变量进行分块,充分发掘了过程变量间的过程信息,有利于对百万千瓦超超临界机组复杂过程特性的了解。多层分布式的监测方法既可以有效挖掘过程的局部信息,又可以分析不同变量子块之间的相关关系,大大提高了百万千瓦超超临界机组这一复杂过程的故障检测性能,从而保证了大型燃煤发电机组的安全可靠运行。The purpose of the present invention is to propose a multi-layer distributed monitoring algorithm for 10,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000 ultra-supercritical ultra-supercritical unit fault detection methods that the local information cannot be accurately described. The method comprehensively considers the correlation between variables and the distribution of variables in the sample direction, and uses the multi-layer information theory decomposition method to divide the variables into blocks, fully exploring the process information between the process variables, which is conducive to the analysis of the million kilowatt ultra-supercritical Knowledge of complex process characteristics of units. The multi-layer distributed monitoring method can not only effectively mine the local information of the process, but also analyze the correlation between different variable sub-blocks, which greatly improves the fault detection performance of the complex process of the million-kilowatt ultra-supercritical unit, thereby ensuring Safe and reliable operation of large coal-fired generating units.

本发明的目的通过以下技术方案实现:一种面向百万千瓦超超临界机组的分布分层式在线故障检测方法,该方法包括以下步骤:The object of the present invention is achieved through the following technical solutions: a distributed layered online fault detection method for a million kilowatt ultra-supercritical unit, the method comprises the following steps:

(1)获取正常待分析数据:设一个百万千瓦超超临界机组具有J个测量变量和操作变量,每一次采样可以得到一个1×J的向量,采样N次后获取的数据表述为一个二维矩阵X=[X1,X2,...,XJ]∈RN×J,其中所述测量变量为机组正常运行过程中可被测量的状态参数,包括流量、电压、电流、温度、速率等;所述操作变量包括进风量、给料量、阀门开度等;(1) Obtaining normal data to be analyzed: Suppose a million kilowatt ultra-supercritical unit has J measurement variables and operating variables, each sampling can obtain a 1 × J vector, and the data obtained after N sampling is expressed as a two Dimensional matrix X=[X 1 , X 2 ,...,X J ]∈R N×J , wherein the measured variables are the state parameters that can be measured during the normal operation of the unit, including flow, voltage, current, temperature , speed, etc.; the operating variables include air inlet volume, feeding volume, valve opening, etc.;

(2)利用基于互信息的谱聚类方法将过程变量分为不同的子块,同一子块中的变量具有较强的相关关系,不同子块间相关关系较弱。该步骤由以下子步骤来实现:(2) Using the spectral clustering method based on mutual information, the process variables are divided into different sub-blocks, the variables in the same sub-block have strong correlation, and the correlation between different sub-blocks is weak. This step is implemented by the following sub-steps:

(2.1)求取变量间的互信息:(2.1) Find the mutual information between variables:

I(Xi,Xj)=H(Xi)+H(Xj)-H(Xi,Xj) (1)I(X i ,X j )=H(X i )+H(X j )-H(X i ,X j ) (1)

其中,Xi(i=1,2,...,J)表示第i个变量,H(Xi)为变量Xi的信息熵:Among them, X i (i=1,2,...,J) represents the ith variable, and H(X i ) is the information entropy of variable X i :

H(Xi)=-∫xp(Xi)logp(Xi)dx (2)H(X i )=-∫ x p(X i )logp(X i )dx (2)

H(Xi,Xj)为变量Xi和Xj的联合信息熵:H(X i , X j ) is the joint information entropy of variables X i and X j :

Figure BDA0002112808330000031
Figure BDA0002112808330000031

p(Xi)与p(Xj)表示变量Xi和Xj的概率密度函数,p(Xi,Xj)为联合概率密度函数。p(X i ) and p(X j ) represent the probability density functions of variables X i and X j , and p(X i , X j ) is the joint probability density function.

(2.2)基于式(1)求取的互信息,求取两两变量之间的广义相关系数:(2.2) Based on the mutual information obtained by formula (1), obtain the generalized correlation coefficient between the two variables:

Figure BDA0002112808330000032
Figure BDA0002112808330000032

其中,rij∈[0,1]。where r ij ∈ [0,1].

(2.3)基于式(4),求取变量的相关矩阵:(2.3) Based on formula (4), obtain the correlation matrix of variables:

Figure BDA0002112808330000033
Figure BDA0002112808330000033

(2.4)基于式(5)定义的相关矩阵R,求取斜对角矩阵D:(2.4) Based on the correlation matrix R defined by formula (5), obtain the diagonal diagonal matrix D:

D=diag{Dii} (6)D=diag{ Dii } (6)

其中,Dii为式(5)中第i行所有元素的和:Among them, D ii is the sum of all elements in the i-th row in formula (5):

Figure BDA0002112808330000034
Figure BDA0002112808330000034

(2.5)求取斜对角矩阵D的拉普拉斯矩阵(2.5) Find the Laplace matrix of the diagonal diagonal matrix D

L=D-1/2RD-1/2 (8)L=D -1/2 RD -1/2 (8)

(2.6)将拉普拉斯矩阵进行谱分解(2.6) Spectral decomposition of the Laplace matrix

L=PΛPT (9)L=PΛP T (9)

其中,P=[P1,P2,...,PJ]为正交特征向量。Among them, P=[P 1 , P 2 , . . . , P J ] are orthogonal eigenvectors.

(2.7)选择k个最大特征值对应的特征向量组成矩阵E=[P1,P2,...,Pk]∈RJ×k,对矩阵E中每一行进行归一化处理,得到矩阵Y(2.7) Select the eigenvectors corresponding to the k largest eigenvalues to form a matrix E=[P 1 , P 2 ,...,P k ]∈R J×k , and normalize each row in the matrix E to obtain matrix Y

Figure BDA0002112808330000041
Figure BDA0002112808330000041

(2.8)利用kmeans聚类算法对Y进行聚类,如果第i行属于第b类,则变量Xi划分到第b子块Xb。这样,就将百万千瓦超超临界机组的众多操作变量根据相关程度分成B个变量块。(2.8) Use kmeans clustering algorithm to cluster Y, if the i-th row belongs to the b-th class, the variable X i is divided into the b-th sub-block X b . In this way, many operating variables of the million kilowatt ultra-supercritical unit are divided into B variable blocks according to the degree of correlation.

X=[X1 X2 … XB] (11)X=[X 1 X 2 … X B ] (11)

其中,

Figure BDA0002112808330000042
是第b(b=1,2,...,B)个变量块,Jb表示Xb中包含的变量个数。in,
Figure BDA0002112808330000042
is the bth (b=1,2,...,B) variable block, and J b represents the number of variables contained in X b .

(3)利用基于高斯混合模型的信息论分解方法将变量块中的变量根据样本方向上的分布情况进一步分解,该步骤通过以下子步骤来实现:(3) Use the information theory decomposition method based on Gaussian mixture model to further decompose the variables in the variable block according to the distribution in the sample direction. This step is realized by the following sub-steps:

(3.1)将变量块随机分成Wb个子块:(3.1) The variable block is randomly divided into W b sub-blocks:

Figure BDA0002112808330000043
Figure BDA0002112808330000043

(3.2)利用高斯混合模型方法求取第w(w=1,2,...,Wb)个变量子块的概率密度:(3.2) Use the Gaussian mixture model method to obtain the probability density of the wth (w=1, 2,...,W b ) variable sub-block:

Figure BDA0002112808330000044
Figure BDA0002112808330000044

其中,

Figure BDA0002112808330000045
是子高斯成分的个数;
Figure BDA0002112808330000046
是第m个子高斯成分的先验概率,满足
Figure BDA0002112808330000047
以及
Figure BDA0002112808330000048
为包含子高斯成分的均值
Figure BDA0002112808330000049
和协方差矩阵
Figure BDA00021128083300000410
的参数集。
Figure BDA00021128083300000411
为多元高斯概率密度:in,
Figure BDA0002112808330000045
is the number of sub-Gaussian components;
Figure BDA0002112808330000046
is the prior probability of the mth sub-Gaussian component, satisfying
Figure BDA0002112808330000047
as well as
Figure BDA0002112808330000048
is the mean with sub-Gaussian components
Figure BDA0002112808330000049
and covariance matrix
Figure BDA00021128083300000410
parameter set.
Figure BDA00021128083300000411
is the multivariate Gaussian probability density:

Figure BDA00021128083300000412
Figure BDA00021128083300000412

其中,Jb,w

Figure BDA0002112808330000051
中变量的个数。Among them, J b,w is
Figure BDA0002112808330000051
The number of variables in .

(3.3)求取子块中的各个变量的概率密度分布函数:(3.3) Obtain the probability density distribution function of each variable in the sub-block:

Figure BDA0002112808330000052
Figure BDA0002112808330000052

其中变量

Figure BDA0002112808330000053
Jb,w是变量块
Figure BDA0002112808330000054
中变量的个数,
Figure BDA0002112808330000055
表示Xb,i属于子块
Figure BDA0002112808330000056
时Xb,i的条件概率密度。where the variable
Figure BDA0002112808330000053
J b, w are variable blocks
Figure BDA0002112808330000054
the number of variables in ,
Figure BDA0002112808330000055
Indicates that X b,i belongs to the sub-block
Figure BDA0002112808330000056
The conditional probability density of X b,i when .

(3.4)求取变量子块

Figure BDA0002112808330000057
Figure BDA0002112808330000058
的KL散度(w,v∈[1,Wb]):(3.4) Obtain variable sub-block
Figure BDA0002112808330000057
and
Figure BDA0002112808330000058
The KL divergence of (w, v∈ [1,Wb]):

Figure BDA0002112808330000059
Figure BDA0002112808330000059

其中,

Figure BDA00021128083300000510
Figure BDA00021128083300000511
分别是
Figure BDA00021128083300000512
Figure BDA00021128083300000513
的概率密度函数,可以利用公式(13)计算。in,
Figure BDA00021128083300000510
and
Figure BDA00021128083300000511
respectively
Figure BDA00021128083300000512
and
Figure BDA00021128083300000513
The probability density function of , can be calculated using formula (13).

(3.5)利用蚁群算法优化步骤(3.1)的随机分块,使得以下目标函数最大化:(3.5) Use the ant colony algorithm to optimize the random block of step (3.1) so that the following objective function is maximized:

Figure BDA00021128083300000514
Figure BDA00021128083300000514

(3.6)通过重复步骤(3.2)—(3.5),每个变量块(b=1,2,...,B)进一步划分为若干个子块。原始数据集X分成不同的子块:(3.6) By repeating steps (3.2)-(3.5), each variable block (b=1, 2, . . . , B) is further divided into several sub-blocks. The original dataset X is divided into different sub-blocks:

Figure BDA00021128083300000515
Figure BDA00021128083300000515

其中,变量块

Figure BDA00021128083300000516
中所有变量具有很强的相关关系,变量子块
Figure BDA00021128083300000517
中的变量既有很强相关关系且具有类似的数据分布。where the variable block
Figure BDA00021128083300000516
All variables in have a strong correlation, the variable sub-block
Figure BDA00021128083300000517
The variables in are both strongly correlated and have similar data distributions.

(4)基于步骤(2)与(3)得到的变量分块结果,首先利用主元分析方法(PCA)描述变量子块

Figure BDA00021128083300000518
中各个变量的相关关系(4) Based on the variable block results obtained in steps (2) and (3), first use Principal Component Analysis (PCA) to describe the variable sub-blocks
Figure BDA00021128083300000518
The relationship between the various variables in

Figure BDA00021128083300000519
Figure BDA00021128083300000519

其中,Pb,w是负载矩阵,Tb,w是主元矩阵。Among them, P b,w is the load matrix, and T b,w is the pivot matrix.

(5)利用高斯混合模型(GMM)方法建立主元矩阵Tb,w的分布情况:(5) Use the Gaussian Mixture Model (GMM) method to establish the distribution of the principal matrix T b, w :

Figure BDA0002112808330000061
Figure BDA0002112808330000061

其中,

Figure BDA0002112808330000062
是高斯分量的个数;
Figure BDA0002112808330000063
表示第m个分量的权重,
Figure BDA0002112808330000064
为包含子高斯成分的均值
Figure BDA0002112808330000065
和协方差矩阵
Figure BDA0002112808330000066
的参数集。in,
Figure BDA0002112808330000062
is the number of Gaussian components;
Figure BDA0002112808330000063
represents the weight of the mth component,
Figure BDA0002112808330000064
is the mean with sub-Gaussian components
Figure BDA0002112808330000065
and covariance matrix
Figure BDA0002112808330000066
parameter set.

(6)针对每个变量子块

Figure BDA0002112808330000067
建立BIP统计量(6) For each variable sub-block
Figure BDA0002112808330000067
Build BIP Statistics

Figure BDA0002112808330000068
Figure BDA0002112808330000068

其中,

Figure BDA0002112808330000069
表示
Figure BDA00021128083300000610
属于第m个分量
Figure BDA00021128083300000611
的概率,
Figure BDA00021128083300000612
为主元矩阵Tb,w第n(n=1,2,..,N)行向量。
Figure BDA00021128083300000613
是基于局部马氏距离的概率,其定义为in,
Figure BDA0002112808330000069
express
Figure BDA00021128083300000610
belongs to the mth component
Figure BDA00021128083300000611
The probability,
Figure BDA00021128083300000612
is the nth (n=1,2,..,N) row vector of the principal matrix T b,w .
Figure BDA00021128083300000613
is the probability based on the local Mahalanobis distance, which is defined as

Figure BDA00021128083300000614
Figure BDA00021128083300000614

其中,

Figure BDA00021128083300000615
Figure BDA00021128083300000616
到第m个高斯分量的马氏距离,t为Tb,w中任意一行。in,
Figure BDA00021128083300000615
for
Figure BDA00021128083300000616
Mahalanobis distance to the mth Gaussian component, t is any row in T b,w .

(7)利用高斯混合模型(GMM)方法监测每个变量块中各个子块之间的关系,该步骤通过以下子步骤来实现。(7) Using the Gaussian Mixture Model (GMM) method to monitor the relationship between each sub-block in each variable block, this step is realized by the following sub-steps.

(7.1)将每个变量块Xb中各个子块的主元矩阵的第一列组合到一起:(7.1) Combine the first columns of the pivot matrix of each sub-block in each variable block X b :

Figure BDA00021128083300000617
Figure BDA00021128083300000617

其中,tb,w(w=1,2,...,Wb)为主元矩阵Tb,w第1个列向量。Among them, t b,w (w=1,2,...,W b ) is the first column vector of the principal element matrix T b,w .

(7.2)利用GMM描述主元数据

Figure BDA00021128083300000618
的分布情况:(7.2) Using GMM to describe main metadata
Figure BDA00021128083300000618
The distribution of :

Figure BDA00021128083300000619
Figure BDA00021128083300000619

其中,

Figure BDA00021128083300000620
为第b个变量子块中高斯分量的个数;
Figure BDA00021128083300000621
表示第m个分量的权重,
Figure BDA00021128083300000622
为包含子高斯成分的均值
Figure BDA00021128083300000623
和协方差矩阵
Figure BDA00021128083300000624
的参数集。in,
Figure BDA00021128083300000620
is the number of Gaussian components in the b-th variable sub-block;
Figure BDA00021128083300000621
represents the weight of the mth component,
Figure BDA00021128083300000622
is the mean with sub-Gaussian components
Figure BDA00021128083300000623
and covariance matrix
Figure BDA00021128083300000624
parameter set.

(7.3)针对每个变量块的主元数据

Figure BDA00021128083300000625
建立BIP统计量:(7.3) Main metadata for each variable block
Figure BDA00021128083300000625
Build BIP statistics:

Figure BDA0002112808330000071
Figure BDA0002112808330000071

其中,

Figure BDA0002112808330000072
为主元矩阵
Figure BDA0002112808330000073
第n(n=1,2,..,N)行向量。
Figure BDA0002112808330000074
是基于局部马氏距离的概率。
Figure BDA0002112808330000075
是基于局部马氏距离的概率,
Figure BDA0002112808330000076
Figure BDA0002112808330000077
到第m个高斯分量的马氏距离,
Figure BDA0002112808330000078
Figure BDA0002112808330000079
中任意一行。in,
Figure BDA0002112808330000072
pivot matrix
Figure BDA0002112808330000073
The nth (n=1,2,..,N) row vector.
Figure BDA0002112808330000074
is the probability based on the local Mahalanobis distance.
Figure BDA0002112808330000075
is the probability based on the local Mahalanobis distance,
Figure BDA0002112808330000076
for
Figure BDA0002112808330000077
Mahalanobis distance to the mth Gaussian component,
Figure BDA0002112808330000078
for
Figure BDA0002112808330000079
any line in the .

(8)在线故障检测时,从变量子块,变量块,整个机组三个层次对过程进行监测。该步骤通过以下子步骤来实现。(8) During online fault detection, the process is monitored from three levels of variable sub-block, variable block and the entire unit. This step is achieved by the following sub-steps.

(8.1)获取新数据:按照步骤(1)采集各测点变量的值,记为z(1×J)。(8.1) Acquiring new data: According to step (1), collect the value of each measuring point variable, and record it as z(1×J).

(8.2)按照步骤(2)与步骤(3)得到的变量分块结果,将新数据进行子块分解:(8.2) According to the variable block results obtained in steps (2) and (3), decompose the new data into sub-blocks:

z=[z1 z2 … zb … zB] (26)z=[z 1 z 2 … z b … z B ] (26)

Figure BDA00021128083300000710
Figure BDA00021128083300000710

其中zb(b=1,2,...,B)为第b个变量子块。where z b (b=1,2,...,B) is the bth variable sub-block.

(8.3)在最底层,即变量子块层,将每个子块中

Figure BDA00021128083300000711
Figure BDA00021128083300000712
的数据向对应子块的主元方向进行投影:(8.3) At the bottom layer, that is, the variable sub-block layer, put each sub-block in
Figure BDA00021128083300000711
Figure BDA00021128083300000712
The data is projected to the pivot direction of the corresponding sub-block:

Figure BDA00021128083300000713
Figure BDA00021128083300000713

其中

Figure BDA00021128083300000714
是负载矩阵。in
Figure BDA00021128083300000714
is the load matrix.

(8.4)求取各个子块

Figure BDA00021128083300000715
的在线统计量指标:(8.4) Find each sub-block
Figure BDA00021128083300000715
Online statistic indicators for :

Figure BDA00021128083300000716
Figure BDA00021128083300000716

其中,上式各参数含义与公式(22)中类似。

Figure BDA00021128083300000717
表示
Figure BDA00021128083300000718
属于第m个分量
Figure BDA00021128083300000719
的概率。
Figure BDA00021128083300000720
是基于局部马氏距离的概率,
Figure BDA00021128083300000721
Figure BDA00021128083300000722
到第m个高斯分量的马氏距离,t为Tb,w中任意一行。The meanings of the parameters in the above formula are similar to those in formula (22).
Figure BDA00021128083300000717
express
Figure BDA00021128083300000718
belongs to the mth component
Figure BDA00021128083300000719
The probability.
Figure BDA00021128083300000720
is the probability based on the local Mahalanobis distance,
Figure BDA00021128083300000721
for
Figure BDA00021128083300000722
Mahalanobis distance to the mth Gaussian component, t is any row in T b,w .

(8.5)在变量块层,首先将zb中各个变量子块的主元组合到一起:(8.5) At the variable block layer, first combine the pivots of each variable sub-block in z b :

Figure BDA0002112808330000081
Figure BDA0002112808330000081

(8.6)求取各个变量块zb的在线统计量指标:(8.6) Obtain the online statistics index of each variable block z b :

Figure BDA0002112808330000082
Figure BDA0002112808330000082

其中,上式各参数含义与公式(25)中类似。

Figure BDA0002112808330000083
表示
Figure BDA0002112808330000084
属于第m个分量
Figure BDA0002112808330000085
的概率。
Figure BDA0002112808330000086
是基于局部马氏距离的概率,
Figure BDA0002112808330000087
Figure BDA0002112808330000088
到第m个高斯分量的马氏距离,
Figure BDA0002112808330000089
Figure BDA00021128083300000810
中任意一行。The meanings of the parameters in the above formula are similar to those in formula (25).
Figure BDA0002112808330000083
express
Figure BDA0002112808330000084
belongs to the mth component
Figure BDA0002112808330000085
The probability.
Figure BDA0002112808330000086
is the probability based on the local Mahalanobis distance,
Figure BDA0002112808330000087
for
Figure BDA0002112808330000088
Mahalanobis distance to the mth Gaussian component,
Figure BDA0002112808330000089
for
Figure BDA00021128083300000810
any line in the .

(8.7)为了分析不同变量子块之间的关系,从整个机组层面对百万千瓦超超临界机组的运行状况进行监测,首先将各个变量块的BIP指标转换为正常(标记为‘N’)与故障(标记为‘F’)的概率:(8.7) In order to analyze the relationship between different variable sub-blocks, to monitor the operation status of the million kilowatt ultra-supercritical unit from the whole unit level, first convert the BIP index of each variable block to normal (marked as 'N') with the probability of failure (marked 'F'):

Figure BDA00021128083300000811
Figure BDA00021128083300000811

Figure BDA00021128083300000812
Figure BDA00021128083300000812

其中,BIPb,lmt为统计量BIP指标的控制限;

Figure BDA00021128083300000813
表示第b个变量块正常的条件概率;
Figure BDA00021128083300000814
表示第b个变量块发生故障的条件概率。Among them, BIP b, lmt is the control limit of the statistic BIP indicator;
Figure BDA00021128083300000813
represents the normal conditional probability of the b-th variable block;
Figure BDA00021128083300000814
represents the conditional probability of failure of the bth variable block.

(8.8)通过贝叶斯规则,计算第b个变量块发生故障的后验概率(8.8) Calculate the posterior probability of failure of the bth variable block by Bayesian rule

Figure BDA00021128083300000815
Figure BDA00021128083300000815

其中,Pb(F)=α;Pb(N)=1-α分别表示在显著性水平为α下过程发生故障或正常的先验概率。Among them, P b (F) = α; P b (N) = 1-α represent the prior probability of failure or normality of the process at the significance level of α, respectively.

(8.9)综合考虑所有变量块的故障概率,计算全局监测统计量(8.9) Comprehensively consider the failure probability of all variable blocks, and calculate the global monitoring statistics

Figure BDA00021128083300000816
Figure BDA00021128083300000816

(9)判断过程运行状态:从变量子块、变量块、整个机组三个层次对过程状态进行分析。实时比较三个层次的统计量与控制限:(9) Judging the process running state: analyze the process state from three levels of variable sub-block, variable block and the whole unit. Compare statistics and control limits at three levels in real time:

(a)在每个变量子块

Figure BDA0002112808330000091
中,如果BIPb,w>1-α,说明在子块
Figure BDA0002112808330000092
中的变量发生了故障,否则认为子块中的变量运行在正常范围内。(a) in each variable subblock
Figure BDA0002112808330000091
, if BIP b,w > 1-α, it means that in the sub-block
Figure BDA0002112808330000092
The variables in the sub-block are considered faulty, otherwise the variables in the sub-block are considered to be operating within the normal range.

(b)在变量块层面,如果BIPb>1-α,说明该变量块中各个变量子块的相关关系发生了异常,否则说明第b个子块中的所有变量都运行正常。(b) At the variable block level, if BIP b > 1-α, it means that the correlation of each variable sub-block in the variable block is abnormal; otherwise, it means that all variables in the bth sub-block are running normally.

(c)在机组层面,如果PFz>α,说明在百万千瓦超超临界机组运行过程中发生了异常或故障,否则说明机组整体正常运行。(c) At the unit level, if PF z >α, it means that an abnormality or failure has occurred during the operation of the million kilowatt ultra-supercritical unit; otherwise, it means that the unit is operating normally as a whole.

与现有技术相比,本发明的有益效果在于:本发明的目的在于针对百万千瓦超超临界机组规模庞大、设备多样、参数众多且相互影响,而且整个发电过程工业流程长、单元装置多、空间分布广、工况频繁切换的特点,提出一种面向百万千万超超临界机组的多层分布式监测算法。该方法综合考虑变量间的相关关系以及变量在样本方向的分布情况,运用多层信息理论分解方法对变量进行分块,充分发掘了过程变量间的过程信息,有利于对百万千瓦超超临界机组复杂过程特性的了解。多层分布式的监测方法既可以有效挖掘过程的局部信息,又可以分析不同变量子块之间的相关关系,大大提高了百万千瓦超超临界机组这一复杂过程的故障检测性能,从而保证了大型燃煤发电机组的安全可靠运行。Compared with the prior art, the beneficial effects of the present invention are as follows: the purpose of the present invention is to aim at the large scale of the million kilowatt ultra-supercritical unit, various equipment, numerous parameters and mutual influence, and the entire power generation process has a long industrial process and many unit devices. , wide spatial distribution and frequent switching of operating conditions, a multi-layer distributed monitoring algorithm for millions of ultra-supercritical units is proposed. The method comprehensively considers the correlation between variables and the distribution of variables in the sample direction, and uses the multi-layer information theory decomposition method to divide the variables into blocks, fully exploring the process information between the process variables, which is conducive to the analysis of the million kilowatt ultra-supercritical Knowledge of complex process characteristics of units. The multi-layer distributed monitoring method can not only effectively mine the local information of the process, but also analyze the correlation between different variable sub-blocks, which greatly improves the fault detection performance of the complex process of the million-kilowatt ultra-supercritical unit, thereby ensuring Safe and reliable operation of large coal-fired generating units.

附图说明:Description of drawings:

图1是本发明所述的面向百万千瓦超超临界机组的分布分层式在线故障检测方法的说明图;Fig. 1 is the explanatory diagram of the distributed layered on-line fault detection method of the present invention facing one million kilowatts of ultra-supercritical units;

图2是本发明方法在具体实施例中的变量子块中的监测结果,(a)为在第5个变量块中的两个变量子块的监测结果,(b)为在第7个变量块中的三个变量子块的监测结果,(c)为在第9个变量块中的三个变量子块的监测结果。2 is the monitoring result of the method of the present invention in the variable sub-block in the specific embodiment, (a) is the monitoring result of the two variable sub-blocks in the fifth variable block, (b) is the monitoring result in the seventh variable block The monitoring results of the three variable sub-blocks in the block, (c) is the monitoring results of the three variable sub-blocks in the ninth variable block.

图3是本发明方法在具体实施例中的变量层中的监测结果,(a)为在第5个变量块的监测结果,(b)为在第7个变量块的监测结果,(c)为在第9个变量块的监测结果。Fig. 3 is the monitoring result in the variable layer of the method of the present invention in the specific embodiment, (a) is the monitoring result in the 5th variable block, (b) is the monitoring result in the 7th variable block, (c) For the monitoring results in the 9th variable block.

图4是本发明方法在具体实施例中的机组层面的监测结果。FIG. 4 is a monitoring result at the unit level of the method of the present invention in a specific embodiment.

具体实施方式Detailed ways

下面结合附图及具体实例,对本发明作进一步详细说明。The present invention will be described in further detail below in conjunction with the accompanying drawings and specific examples.

本发明以浙能集团下属嘉华电厂3号机组为例,该机组为百万千瓦超超临界机组,其功率为600MW,共包括154个过程变量,这些变量涉及到压力、温度、流量、流速等。The present invention takes the No. 3 unit of Jiahua Power Plant under Zheneng Group as an example. This unit is a million kilowatt ultra-supercritical unit with a power of 600 MW and includes a total of 154 process variables. These variables involve pressure, temperature, flow, and flow rate. Wait.

如图1所示,本发明是一种面向百万千瓦超超临界机组的动静特征协同分析的在线监测方法,包括以下步骤:As shown in Figure 1, the present invention is a kind of on-line monitoring method for the collaborative analysis of dynamic and static characteristics of a million-kilowatt ultra-supercritical unit, comprising the following steps:

(1)获取正常待分析数据:设一个百万千瓦超超临界机组具有J个测量变量和操作变量,每一次采样可以得到一个1×J的向量,采样N次后获取的数据表述为一个二维矩阵X=[X1,X2,...,XJ]∈RN×J。本实例中,采样周期为1分钟,共采集火电机组正常运行过程中2940个样本数据用于变量分块以及离线建模,154个过程变量,即建模数据为X(2940×154)。其中所述测量变量为机组正常运行过程中可被测量的状态参数,包括流量、电压、电流、温度、速率等;所述操作变量包括进风量、给料量、阀门开度等;(1) Obtaining normal data to be analyzed: Suppose a million kilowatt ultra-supercritical unit has J measurement variables and operating variables, each sampling can obtain a 1 × J vector, and the data obtained after N sampling is expressed as a two A dimensional matrix X=[X 1 , X 2 ,...,X J ]∈R N×J . In this example, the sampling period is 1 minute, and a total of 2940 sample data are collected during the normal operation of the thermal power unit for variable block and offline modeling, and 154 process variables, that is, the modeling data is X (2940×154). Wherein, the measured variables are state parameters that can be measured during the normal operation of the unit, including flow, voltage, current, temperature, speed, etc.; the operating variables include air intake volume, feed volume, valve opening, etc.;

(2)利用基于互信息的谱聚类方法将过程变量分为不同的子块,同一子块中的变量具有较强的相关关系,不同子块间相关关系较弱。该步骤由以下子步骤来实现:(2) Using the spectral clustering method based on mutual information, the process variables are divided into different sub-blocks, the variables in the same sub-block have strong correlation, and the correlation between different sub-blocks is weak. This step is implemented by the following sub-steps:

(2.1)求取变量间的互信息:(2.1) Find the mutual information between variables:

I(Xi,Xj)=H(Xi)+H(Xj)-H(Xi,Xj) (1)I(X i ,X j )=H(X i )+H(X j )-H(X i ,X j ) (1)

其中,Xi(i=1,2,...,J)表示第i个变量,H(Xi)为变量Xi的信息熵:Among them, X i (i=1,2,...,J) represents the ith variable, and H(X i ) is the information entropy of variable X i :

H(Xi)=-∫xp(Xi)logp(Xi)dx (2)H(X i )=-∫ x p(X i )logp(X i )dx (2)

H(Xi,Xj)为变量Xi和Xj的联合信息熵:H(X i , X j ) is the joint information entropy of variables X i and X j :

Figure BDA0002112808330000101
Figure BDA0002112808330000101

p(Xi)与p(Xj)表示变量Xi和Xj的概率密度函数,p(Xi,Xj)为联合概率密度函数。p(X i ) and p(X j ) represent the probability density functions of variables X i and X j , and p(X i , X j ) is the joint probability density function.

(2.2)基于公式(1)求取的互信息,求取两两变量之间的广义相关系数:(2.2) Based on the mutual information obtained by formula (1), obtain the generalized correlation coefficient between the two variables:

Figure BDA0002112808330000111
Figure BDA0002112808330000111

其中,rij∈[0,1]。where r ij ∈ [0,1].

(2.3)基于公式(4),求取变量的相关矩阵:(2.3) Based on formula (4), obtain the correlation matrix of variables:

Figure BDA0002112808330000112
Figure BDA0002112808330000112

(2.4)基于公式(5)定义的相关矩阵R,求取斜对角矩阵D:(2.4) Based on the correlation matrix R defined by formula (5), obtain the diagonal diagonal matrix D:

D=diag{Dii} (6)D=diag{ Dii } (6)

其中,Dii为式(5)中第i行所有元素的和Among them, D ii is the sum of all elements in the i-th row in formula (5)

Figure BDA0002112808330000113
Figure BDA0002112808330000113

(2.5)求取斜对角矩阵D的拉普拉斯矩阵(2.5) Find the Laplace matrix of the diagonal diagonal matrix D

L=D-1/2RD-1/2 (8)L=D -1/2 RD -1/2 (8)

(2.6)将拉普拉斯矩阵进行谱分解(2.6) Spectral decomposition of the Laplace matrix

L=PΛPT (9)L=PΛP T (9)

其中,P=[P1,P2,...,PJ]为正交特征向量。Among them, P=[P 1 , P 2 , . . . , P J ] are orthogonal eigenvectors.

(2.7)选择k个最大特征值对应的特征向量组成矩阵E=[P1,P2,...,Pk]∈RJ×k,对矩阵E中每一行进行归一化处理,得到矩阵Y(2.7) Select the eigenvectors corresponding to the k largest eigenvalues to form a matrix E=[P 1 , P 2 ,...,P k ]∈R J×k , and normalize each row in the matrix E to obtain matrix Y

Figure BDA0002112808330000114
Figure BDA0002112808330000114

(2.8)利用kmeans聚类算法对Y进行聚类,如果第i行属于第b类,则变量Xi划分到第b子块Xb。这样,就将百万千瓦超超临界机组的众多操作变量根据相关程度分成B个变量块。(2.8) Use kmeans clustering algorithm to cluster Y, if the i-th row belongs to the b-th class, the variable X i is divided into the b-th sub-block X b . In this way, many operating variables of the million kilowatt ultra-supercritical unit are divided into B variable blocks according to the degree of correlation.

X=[X1 X2 ... XB] (11)X=[X 1 X 2 ... X B ] (11)

其中,

Figure BDA0002112808330000121
是第b(b=1,2,...,B)个变量块,Jb表示Xb中包含的变量个数。in,
Figure BDA0002112808330000121
is the bth (b=1,2,...,B) variable block, and J b represents the number of variables contained in X b .

在本次实例中,根据相关关系,154个过程变量共分成了11个子块,如表1所示,每个子块中变量具有较强的相关关系,不同子块之间相关关系较弱。In this example, according to the correlation, the 154 process variables are divided into 11 sub-blocks. As shown in Table 1, the variables in each sub-block have strong correlation, and the correlation between different sub-blocks is weak.

表1.百万千瓦超超临界机组中变量分块情况Table 1. Variable block situation in 1 million kilowatt ultra-supercritical units

Figure BDA0002112808330000122
Figure BDA0002112808330000122

(3)利用基于高斯混合模型的信息论分解方法将上述11个变量块中的变量根据样本方向上的分布情况进一步分解,该步骤通过以下子步骤来实现(3) Use the information theory decomposition method based on Gaussian mixture model to further decompose the variables in the above 11 variable blocks according to the distribution in the sample direction. This step is realized by the following sub-steps

(3.1)将变量块随机分成Wb个子块:(3.1) The variable block is randomly divided into W b sub-blocks:

Figure BDA0002112808330000123
Figure BDA0002112808330000123

(3.2)利用高斯混合模型方法求取第w(w=1,2,...,Wb)个变量子块的概率密度:(3.2) Use the Gaussian mixture model method to obtain the probability density of the wth (w=1, 2,...,W b ) variable sub-block:

Figure BDA0002112808330000124
Figure BDA0002112808330000124

其中,

Figure BDA0002112808330000131
是子高斯成分的个数;
Figure BDA0002112808330000132
是第m个子高斯成分的先验概率,满足
Figure BDA0002112808330000133
以及
Figure BDA0002112808330000134
为包含子高斯成分的均值
Figure BDA0002112808330000135
和协方差矩阵
Figure BDA0002112808330000136
的参数集。
Figure BDA0002112808330000137
为多元高斯概率密度:in,
Figure BDA0002112808330000131
is the number of sub-Gaussian components;
Figure BDA0002112808330000132
is the prior probability of the mth sub-Gaussian component, satisfying
Figure BDA0002112808330000133
as well as
Figure BDA0002112808330000134
is the mean with sub-Gaussian components
Figure BDA0002112808330000135
and covariance matrix
Figure BDA0002112808330000136
parameter set.
Figure BDA0002112808330000137
is the multivariate Gaussian probability density:

Figure BDA0002112808330000138
Figure BDA0002112808330000138

其中,Jb,w

Figure BDA0002112808330000139
中变量的个数。Among them, J b,w is
Figure BDA0002112808330000139
The number of variables in .

(3.3)求取子块中的各个变量的概率密度分布函数:(3.3) Obtain the probability density distribution function of each variable in the sub-block:

Figure BDA00021128083300001310
Figure BDA00021128083300001310

其中变量

Figure BDA00021128083300001311
Jb,w是变量块
Figure BDA00021128083300001312
中变量的个数,
Figure BDA00021128083300001313
表示Xb,i属于子块
Figure BDA00021128083300001314
时Xbi的条件概率密度。where the variable
Figure BDA00021128083300001311
J b, w are variable blocks
Figure BDA00021128083300001312
the number of variables in ,
Figure BDA00021128083300001313
Indicates that X b,i belongs to the sub-block
Figure BDA00021128083300001314
The conditional probability density of X bi when .

(3.4)求取变量子块

Figure BDA00021128083300001315
Figure BDA00021128083300001316
的KL散度(w,v∈[1,Wb]):(3.4) Obtain variable sub-block
Figure BDA00021128083300001315
and
Figure BDA00021128083300001316
The KL divergence of (w, v∈ [1,Wb]):

Figure BDA00021128083300001317
Figure BDA00021128083300001317

其中,

Figure BDA00021128083300001318
Figure BDA00021128083300001319
分别是
Figure BDA00021128083300001320
Figure BDA00021128083300001321
的概率密度函数,可以利用公式(13)计算。in,
Figure BDA00021128083300001318
and
Figure BDA00021128083300001319
respectively
Figure BDA00021128083300001320
and
Figure BDA00021128083300001321
The probability density function of , can be calculated using formula (13).

(3.5)利用蚁群算法优化步骤(3.1)的随机分块,使得以下目标函数最大化:(3.5) Use the ant colony algorithm to optimize the random block of step (3.1) so that the following objective function is maximized:

Figure BDA00021128083300001322
Figure BDA00021128083300001322

(3.6)通过重复步骤(3.2)—(3.5),每个变量块(b=1,2,...,B)进一步划分为若干个子块。原始数据集X分成不同的子块:(3.6) By repeating steps (3.2)-(3.5), each variable block (b=1, 2, . . . , B) is further divided into several sub-blocks. The original dataset X is divided into different sub-blocks:

Figure BDA00021128083300001323
Figure BDA00021128083300001323

其中,变量块

Figure BDA00021128083300001324
中所有变量具有很强的相关关系,变量子块
Figure BDA00021128083300001325
中的变量既有很强相关关系且具有类似的数据分布。where the variable block
Figure BDA00021128083300001324
All variables in have a strong correlation, the variable sub-block
Figure BDA00021128083300001325
The variables in are both strongly correlated and have similar data distributions.

在本次实例中,根据变量的分布情况,将步骤(2)中得到的11个变量块进行进一步划分为27个变量子块,每个子块中的变量既具有较强的相关关系,又具有相同的分布。In this example, according to the distribution of variables, the 11 variable blocks obtained in step (2) are further divided into 27 variable sub-blocks. The variables in each sub-block have strong correlations and the same distribution.

表2.百万千瓦超超临界机组中变量子块的划分情况Table 2. The division of variable sub-blocks in the million kilowatt ultra-supercritical unit

Figure BDA0002112808330000141
Figure BDA0002112808330000141

(4)基于步骤(2)与(3)得到的变量分块结果,首先利用主元分析方法(PCA)描述变量子块

Figure BDA0002112808330000142
中各个变量的相关关系(4) Based on the variable block results obtained in steps (2) and (3), first use Principal Component Analysis (PCA) to describe the variable sub-blocks
Figure BDA0002112808330000142
The relationship between the various variables in

Figure BDA0002112808330000143
Figure BDA0002112808330000143

其中,Pb,w是负载矩阵,Tb,w是主元矩阵。Among them, P b,w is the load matrix, and T b,w is the pivot matrix.

(5)利用高斯混合模型(GMM)方法建立主元矩阵Tb,w的分布情况:(5) Use the Gaussian Mixture Model (GMM) method to establish the distribution of the principal matrix T b, w :

Figure BDA0002112808330000144
Figure BDA0002112808330000144

其中,

Figure BDA0002112808330000145
是高斯分量的个数;
Figure BDA0002112808330000146
表示第m个分量的权重,
Figure BDA0002112808330000147
为包含子高斯成分的均值
Figure BDA0002112808330000151
和协方差矩阵
Figure BDA0002112808330000152
的参数集。in,
Figure BDA0002112808330000145
is the number of Gaussian components;
Figure BDA0002112808330000146
represents the weight of the mth component,
Figure BDA0002112808330000147
is the mean with sub-Gaussian components
Figure BDA0002112808330000151
and covariance matrix
Figure BDA0002112808330000152
parameter set.

(6)针对每个变量子块

Figure BDA0002112808330000153
建立BIP统计量(6) For each variable sub-block
Figure BDA0002112808330000153
Build BIP Statistics

Figure BDA0002112808330000154
Figure BDA0002112808330000154

其中,

Figure BDA0002112808330000155
表示
Figure BDA0002112808330000156
属于第m个分量
Figure BDA0002112808330000157
的概率,
Figure BDA0002112808330000158
为主元矩阵Tb,w第n(n=1,2,..,N)行向量。
Figure BDA0002112808330000159
是基于局部马氏距离的概率,其定义为in,
Figure BDA0002112808330000155
express
Figure BDA0002112808330000156
belongs to the mth component
Figure BDA0002112808330000157
The probability,
Figure BDA0002112808330000158
is the nth (n=1,2,..,N) row vector of the principal matrix T b,w .
Figure BDA0002112808330000159
is the probability based on the local Mahalanobis distance, which is defined as

Figure BDA00021128083300001510
Figure BDA00021128083300001510

其中,

Figure BDA00021128083300001511
Figure BDA00021128083300001512
到第m个高斯分量的马氏距离,t为Tb,w中任意一行。in,
Figure BDA00021128083300001511
for
Figure BDA00021128083300001512
Mahalanobis distance to the mth Gaussian component, t is any row in T b,w .

(7)利用高斯混合模型(GMM)方法监测每个变量块中各个子块之间的关系,该步骤通过以下子步骤来实现。(7) Using the Gaussian Mixture Model (GMM) method to monitor the relationship between each sub-block in each variable block, this step is realized by the following sub-steps.

(7.1)将每个变量块Xb中各个子块的主元矩阵的第一列组合到一起:(7.1) Combine the first columns of the pivot matrix of each sub-block in each variable block X b :

Figure BDA00021128083300001513
Figure BDA00021128083300001513

其中,tb,w(w=1,2,...,Wb)为主元矩阵Tb,w第1个列向量。Among them, t b,w (w=1,2,...,W b ) is the first column vector of the principal element matrix T b,w .

(7.2)利用GMM描述主元数据

Figure BDA00021128083300001514
的分布情况:(7.2) Using GMM to describe main metadata
Figure BDA00021128083300001514
The distribution of :

Figure BDA00021128083300001515
Figure BDA00021128083300001515

其中,

Figure BDA00021128083300001516
为第b个变量子块中高斯分量的个数;
Figure BDA00021128083300001517
表示第m个分量的权重,
Figure BDA00021128083300001518
为包含子高斯成分的均值
Figure BDA00021128083300001519
和协方差矩阵
Figure BDA00021128083300001520
的参数集。in,
Figure BDA00021128083300001516
is the number of Gaussian components in the b-th variable sub-block;
Figure BDA00021128083300001517
represents the weight of the mth component,
Figure BDA00021128083300001518
is the mean with sub-Gaussian components
Figure BDA00021128083300001519
and covariance matrix
Figure BDA00021128083300001520
parameter set.

(7.3)针对每个变量块的主元数据

Figure BDA00021128083300001521
建立BIP统计量:(7.3) Main metadata for each variable block
Figure BDA00021128083300001521
Build BIP statistics:

Figure BDA00021128083300001522
Figure BDA00021128083300001522

其中,

Figure BDA00021128083300001523
为主元矩阵
Figure BDA00021128083300001524
第n(n=1,2,..,N)行向量。
Figure BDA00021128083300001525
是基于局部马氏距离的概率。
Figure BDA0002112808330000161
是基于局部马氏距离的概率,
Figure BDA0002112808330000162
Figure BDA0002112808330000163
到第m个高斯分量的马氏距离,
Figure BDA0002112808330000164
Figure BDA0002112808330000165
中任意一行。in,
Figure BDA00021128083300001523
pivot matrix
Figure BDA00021128083300001524
The nth (n=1,2,..,N) row vector.
Figure BDA00021128083300001525
is the probability based on the local Mahalanobis distance.
Figure BDA0002112808330000161
is the probability based on the local Mahalanobis distance,
Figure BDA0002112808330000162
for
Figure BDA0002112808330000163
Mahalanobis distance to the mth Gaussian component,
Figure BDA0002112808330000164
for
Figure BDA0002112808330000165
any line in the .

(8.1)故障数据准备:这里采集故障数据一共包含460个样本,数据记为Z(460×154),该故障为循环水泵出口压力增大,该故障发生在第121个采样点。(8.1) Fault data preparation: The fault data collected here contains a total of 460 samples, and the data is recorded as Z (460×154). The fault is that the outlet pressure of the circulating water pump increases, and the fault occurs at the 121st sampling point.

(8.2)按照步骤(2)与步骤(3)得到的变量分块结果,将新数据记为z(1×154)进行子块分解,在本次实例中B=11:(8.2) According to the variable block results obtained in steps (2) and (3), the new data is denoted as z(1×154) for sub-block decomposition. In this example, B=11:

z=[z1 z2 ... zb ... zB] (26)z=[z 1 z 2 ... z b ... z B ] (26)

Figure BDA0002112808330000166
Figure BDA0002112808330000166

其中zb(b=1,2,...,B)为第b个变量子块。where z b (b=1,2,...,B) is the bth variable sub-block.

(8.3)在最底层,即变量子块层,将每个子块中

Figure BDA0002112808330000167
Figure BDA0002112808330000168
的数据向对应子块的主元方向进行投影:(8.3) At the bottom layer, that is, the variable sub-block layer, put each sub-block in
Figure BDA0002112808330000167
Figure BDA0002112808330000168
The data is projected to the pivot direction of the corresponding sub-block:

Figure BDA0002112808330000169
Figure BDA0002112808330000169

(8.4)求取各个子块

Figure BDA00021128083300001610
的在线统计量指标:(8.4) Find each sub-block
Figure BDA00021128083300001610
Online statistic indicators for :

Figure BDA00021128083300001611
Figure BDA00021128083300001611

其中,上式各参数含义与公式(22)中类似。

Figure BDA00021128083300001612
表示
Figure BDA00021128083300001613
属于第m个分量
Figure BDA00021128083300001614
的概率。
Figure BDA00021128083300001615
是基于局部马氏距离的概率,
Figure BDA00021128083300001616
Figure BDA00021128083300001617
到第m个高斯分量的马氏距离,t为Tb,w中任意一行。The meanings of the parameters in the above formula are similar to those in formula (22).
Figure BDA00021128083300001612
express
Figure BDA00021128083300001613
belongs to the mth component
Figure BDA00021128083300001614
The probability.
Figure BDA00021128083300001615
is the probability based on the local Mahalanobis distance,
Figure BDA00021128083300001616
for
Figure BDA00021128083300001617
Mahalanobis distance to the mth Gaussian component, t is any row in T b,w .

(8.5)在变量块层,首先将zb中各个变量子块的主元组合到一起:(8.5) At the variable block layer, first combine the pivots of each variable sub-block in z b :

Figure BDA00021128083300001618
Figure BDA00021128083300001618

(8.6)求取各个变量块zb的在线统计量指标:(8.6) Obtain the online statistics index of each variable block z b :

Figure BDA00021128083300001619
Figure BDA00021128083300001619

其中,上式各参数含义与公式(25)中类似。

Figure BDA00021128083300001620
表示
Figure BDA00021128083300001621
属于第m个分量
Figure BDA0002112808330000171
的概率。
Figure BDA0002112808330000172
是基于局部马氏距离的概率,
Figure BDA0002112808330000173
Figure BDA0002112808330000174
到第m个高斯分量的马氏距离,
Figure BDA0002112808330000175
Figure BDA0002112808330000176
中任意一行。The meanings of the parameters in the above formula are similar to those in formula (25).
Figure BDA00021128083300001620
express
Figure BDA00021128083300001621
belongs to the mth component
Figure BDA0002112808330000171
The probability.
Figure BDA0002112808330000172
is the probability based on the local Mahalanobis distance,
Figure BDA0002112808330000173
for
Figure BDA0002112808330000174
Mahalanobis distance to the mth Gaussian component,
Figure BDA0002112808330000175
for
Figure BDA0002112808330000176
any line in the .

(8.7)为了分析不同变量子块之间的关系,从整个机组层面对百万千瓦超超临界机组的运行状况进行监测,首先将各个变量块的BIP指标转换为正常(标记为‘N’)与故障(标记为‘F’)的概率:(8.7) In order to analyze the relationship between different variable sub-blocks, to monitor the operation status of the million kilowatt ultra-supercritical unit from the whole unit level, first convert the BIP index of each variable block to normal (marked as 'N') with the probability of failure (marked 'F'):

Figure BDA0002112808330000177
Figure BDA0002112808330000177

Figure BDA0002112808330000178
Figure BDA0002112808330000178

其中,BIPb,lmt为统计量BIP的控制限,在本具体实施例中取0.5;

Figure BDA0002112808330000179
表示第b个变量块正常的条件概率;
Figure BDA00021128083300001710
表示第b个变量块发生故障的条件概率。Wherein, BIP b, lmt is the control limit of the statistic BIP, which is taken as 0.5 in this specific embodiment;
Figure BDA0002112808330000179
represents the normal conditional probability of the b-th variable block;
Figure BDA00021128083300001710
represents the conditional probability of failure of the bth variable block.

(8.8)通过贝叶斯规则,计算第b个变量块发生故障的后验概率(8.8) Calculate the posterior probability of failure of the bth variable block by Bayesian rule

Figure BDA00021128083300001711
Figure BDA00021128083300001711

其中,Pb(F)=α;Pb(N)=1-α分别表示在显著性水平为α下过程发生故障或正常的先验概率,在本实例中,显著性水平α=0.5。Among them, P b (F)=α; P b (N)=1-α respectively represent the prior probability of failure or normality of the process at the significance level α, in this example, the significance level α=0.5.

(8.9)综合考虑所有变量块的故障概率,计算全局监测统计量(8.9) Comprehensively consider the failure probability of all variable blocks, and calculate the global monitoring statistics

Figure BDA00021128083300001712
Figure BDA00021128083300001712

(9)判断过程运行状态:从变量子块、变量块、整个机组三个层次对过程状态进行分析。实时比较三个层次的统计量与控制限:(9) Judging the process running state: analyze the process state from three levels of variable sub-block, variable block and the whole unit. Compare statistics and control limits at three levels in real time:

(a)在每个变量子块

Figure BDA00021128083300001713
中,如果BIPb,w>1-α,说明在子块
Figure BDA00021128083300001714
中的变量发生了故障,否则认为子块中的变量运行在正常范围内。(a) in each variable subblock
Figure BDA00021128083300001713
, if BIP b,w > 1-α, it means that in the sub-block
Figure BDA00021128083300001714
The variables in the sub-block are considered faulty, otherwise the variables in the sub-block are considered to be operating within the normal range.

(b)在变量块层面,如果BIPb>1-α,说明该变量块中各个变量子块的相关关系发生了异常,否则说明第b个子块中的所有变量都运行正常。(b) At the variable block level, if BIP b > 1-α, it means that the correlation of each variable sub-block in the variable block is abnormal; otherwise, it means that all variables in the bth sub-block are running normally.

(c)在机组层面,如果PFz>α,说明在百万千瓦超超临界机组运行过程中发生了异常或故障,否则说明机组整体正常运行。(c) At the unit level, if PF z >α, it means that an abnormality or failure has occurred during the operation of the million kilowatt ultra-supercritical unit; otherwise, it means that the unit is operating normally as a whole.

利用本发明的监测方法对火电过程进行在线过程监测,结果如图2-图4所示。图2显示了本发明方法在8个变量子块的监测结果。从图2(a)中可以看出,第五个变量块的两个变量子块的统计量基本都在控制限以下,说明当前故障并没有影响到变量块#5中的各个变量,这些变量均正常运行。从图2(b)中可以看出,在前120个样本中,变量块#7的前两个子块的统计量基本都在控制限以下,显示过程正常运行。从第121个样本开始,变量子块统计量BIPsub7,1和BIPsub7,2迅速超出控制限,检测到故障发生,这说明该故障明显影响到这两个子块中的变量。同样,分析图2(c)的监测结果,可以发现变量块#9的前两个子块中的变量基本正常运行,但是BIPsub9,3有效检测到了故障的发生。Using the monitoring method of the present invention to monitor the thermal power process on-line, the results are shown in Figures 2-4. Figure 2 shows the monitoring results of the method of the present invention in 8 variable sub-blocks. As can be seen from Figure 2(a), the statistics of the two variable sub-blocks of the fifth variable block are basically below the control limit, indicating that the current fault does not affect each variable in variable block #5. are operating normally. As can be seen from Figure 2(b), in the first 120 samples, the statistics of the first two sub-blocks of variable block #7 are basically below the control limit, indicating that the process is running normally. Starting from the 121st sample, the variable subblock statistics BIP sub7,1 and BIP sub7,2 rapidly exceed the control limits, and a fault is detected, indicating that the fault clearly affects the variables in these two subblocks. Similarly, analyzing the monitoring results in Figure 2(c), it can be found that the variables in the first two sub-blocks of variable block #9 are basically operating normally, but BIP sub9,3 effectively detects the occurrence of faults.

图3显示了本发明方法在变量层中的部分监测结果。图3(a)中可以看出,第五个变量块基本都在控制限以下,表明变量块#5中的两个变量子块之间的相关关系没有受到故障的影响,变量块#5中的过程变量正常运行。图3(b)和图3(c)的监测结果分别显示变量块#7和变量块#9中的变量子块的相关关系受到了故障的影响,发生了异常。图4显示了发明方法在机组层的监测结果。可以看到,统计量在前120个样本基本运行在控制限以下,说明百万千瓦超超临界机组运行在正常工况。从第121个样本开始,统计量PFz立即超出控制限,有效检测到故障的发生。总体来说,本发明的分层分布式故障检测方法在监测百万千瓦超超临界这一典型的大规模多工况过程时具有优越的故障检测性能,其分块结果有效分析众多变量之间复杂的相关关系,既可以加深操作人员对过程的理解,又可以为实际火电厂工业现场的技术管理部门提供高精度的在线过程监测结果,为实时判断过程运行状态,识别是否有故障发生提供可靠依据,进一步提高了百万千瓦超超临界机组运行的安全性、可靠性和有效性。Figure 3 shows part of the monitoring results of the method of the present invention in the variable layer. As can be seen in Figure 3(a), the fifth variable block is basically below the control limit, indicating that the correlation between the two variable sub-blocks in variable block #5 is not affected by the fault. The process variable is operating normally. The monitoring results of Fig. 3(b) and Fig. 3(c) respectively show that the correlation of variable sub-blocks in variable block #7 and variable block #9 is affected by the fault, and abnormality occurs. Figure 4 shows the monitoring results of the inventive method at the unit level. It can be seen that the statistics are basically operating below the control limit in the first 120 samples, indicating that the million-kilowatt ultra-supercritical unit is operating under normal conditions. Starting from the 121st sample, the statistic PF z immediately exceeds the control limit, effectively detecting the occurrence of a fault. In general, the layered distributed fault detection method of the present invention has superior fault detection performance when monitoring a typical large-scale multi-working condition process of one million kilowatts of ultra-supercritical, and its block results can effectively analyze the relationship between many variables. The complex correlation can not only deepen the operator's understanding of the process, but also provide high-precision online process monitoring results for the technical management department of the actual thermal power plant industrial site, and provide reliable information for judging the process operating status in real time and identifying whether there is a fault. Based on this, the safety, reliability and effectiveness of the operation of the million-kilowatt ultra-supercritical unit are further improved.

Claims (1)

1. A distribution layered online fault detection method for a million kilowatt ultra-supercritical unit is characterized by comprising the following steps:
(1) acquiring normal data to be analyzed: a million-kilowatt ultra-supercritical unit is provided with J measurement variables and operation variables, a vector of 1 XJ can be obtained by sampling every time, and data acquired after sampling N times is expressed as a two-dimensional matrix X ═ X1,X2,...,XJ]∈RN×JThe measured variables are state parameters which can be measured in the normal operation process of the unit, and comprise flow, voltage, current, temperature, speed and the like; the operation variables comprise air intake, feeding amount, valve opening and the like;
(2) the process variable is divided into different sub-blocks by using a spectral clustering method based on mutual information, the variable in the same sub-block has stronger correlation, and the correlation between different sub-blocks is weaker. This step is realized by the following substeps:
(2.1) obtaining mutual information among variables:
I(Xi,Xj)=H(Xi)+H(Xj)-H(Xi,Xj) (1)
wherein, Xi(i ═ 1, 2.., J) denotes the ith variable, H (X)i) Is a variable XiThe information entropy of (2):
H(Xi)=-∫xp(Xi)logp(Xi)dx (2)
H(Xi,Xj) Is a variable XiAnd XjJoint information entropy of (a):
Figure FDA0002112808320000011
p(Xi) And p (X)j) Represents variable XiAnd XjP (X) is a probability density function ofi,Xj) Is a joint probability density function.
(2.2) solving the generalized correlation coefficient between every two variables based on the mutual information solved by the formula (1):
Figure FDA0002112808320000012
wherein r isij∈[0,1]。
(2.3) based on equation (4), a correlation matrix of the variables is found:
Figure FDA0002112808320000021
(2.4) solving a diagonal matrix D based on the correlation matrix R defined by the formula (5):
D=diag{Dii} (6)
wherein D isiiIs the sum of all elements in row i in equation (5):
Figure FDA0002112808320000022
(2.5) solving a Laplace matrix of the diagonal matrix D:
L=D-1/2RD-1/2(8)
(2.6) performing spectral decomposition on the Laplace matrix:
L=PΛPT(9)
wherein, P ═ P1,P2,...,PJ]Are orthogonal eigenvectors.
(2.7) selecting the eigenvectors corresponding to the k maximum eigenvalues to form a matrix E ═ P1,P2,...,Pk]∈RJ×kNormalizing each row in the matrix E to obtain a matrix Y:
Figure FDA0002112808320000023
(2.8) clustering Y by using a kmeans clustering algorithm, and if the ith row belongs to the b-th class, carrying out variable XiDivision into sub-block b Xb. Thus, a plurality of operation variables of the million-kilowatt ultra-supercritical unit are divided into B variable blocks according to the correlation degree:
X=[X1X2… XB](11)
wherein,
Figure FDA0002112808320000024
is the (B ═ 1, 2.., B) th variable block, JbRepresents XbThe number of variables contained in (1).
(3) The variables in the variable block are further decomposed according to the distribution situation in the sample direction by using an information theory decomposition method based on a Gaussian mixture model, and the step is realized by the following substeps.
(3.1) randomly dividing the variable block into WbIndividual block:
Figure FDA0002112808320000031
(3.2) using gaussian mixture model method to obtain W (W1, 2.., W)b) Probability density of individual variable subblocks:
Figure FDA0002112808320000032
wherein,
Figure FDA0002112808320000033
is the number of sub-gaussian components;
Figure FDA0002112808320000034
is the prior probability of the mth sub-Gaussian component, satisfies
Figure FDA0002112808320000035
And
Figure FDA0002112808320000036
is a mean value containing sub-Gaussian components
Figure FDA0002112808320000037
Sum covariance matrix
Figure FDA0002112808320000038
Of the parameter set (c).
Figure FDA0002112808320000039
Is a multivariate gaussian probability density:
Figure FDA00021128083200000310
wherein, Jb,wIs composed of
Figure FDA00021128083200000311
The number of medium variables.
(3.3) solving probability density distribution functions of all variables in the sub-blocks:
Figure FDA00021128083200000312
wherein the variable
Figure FDA00021128083200000313
Jb,wIs a variable block
Figure FDA00021128083200000314
The number of the medium variables is equal to or greater than the total number of the medium variables,
Figure FDA00021128083200000315
represents Xb,iBelonging to sub-blocks
Figure FDA00021128083200000316
When Xb,iThe conditional probability density of (2).
(3.4) solving variable subblocks
Figure FDA00021128083200000317
And
Figure FDA00021128083200000318
KL divergence of (W, v ∈ [1, W ]b]):
Figure FDA00021128083200000319
Wherein,
Figure FDA00021128083200000320
and
Figure FDA00021128083200000321
are respectively
Figure FDA00021128083200000322
And
Figure FDA00021128083200000323
the probability density function of (2) can be calculated by using the formula (13).
(3.5) optimizing the random partitions of step (3.1) using an ant colony algorithm such that the following objective function is maximized:
Figure FDA00021128083200000324
(3.6) each variable block (B ═ 1, 2.., B) is further divided into sub-blocks by repeating steps (3.2) - (3.5). The original data set X is divided into different sub-blocks:
Figure FDA0002112808320000041
wherein, the variable block
Figure FDA0002112808320000042
(4) Based on the variable block result obtained in the steps (2) and (3), firstly, describing the variable sub-block by using a Principal Component Analysis (PCA)
Figure FDA0002112808320000043
The correlation relationship of each variable in (1):
Figure FDA0002112808320000044
wherein, Pb,wIs a load matrix, Tb,wIs a principal component matrix.
(5) Principal component matrix T established by Gaussian Mixture Model (GMM) methodb,wThe distribution of (c):
Figure FDA0002112808320000045
wherein,
Figure FDA0002112808320000046
is the number of gaussian components;
Figure FDA0002112808320000047
the weight of the m-th component is represented,
Figure FDA0002112808320000048
is a mean value containing sub-Gaussian components
Figure FDA0002112808320000049
Sum covariance matrix
Figure FDA00021128083200000410
Of the parameter set (c).
(6) For each variable sub-block
Figure FDA00021128083200000411
Establishing BIP statistics
Figure FDA00021128083200000412
Wherein,
Figure FDA00021128083200000413
to represent
Figure FDA00021128083200000414
Belonging to the m-th component
Figure FDA00021128083200000415
The probability of (a) of (b) being,
Figure FDA00021128083200000416
as a principal component matrix Tb,wAn nth (N ═ 1, 2.., N) row vector.
Figure FDA00021128083200000417
Is based on the probability of local Mahalanobis distance, which is defined as
Figure FDA00021128083200000418
Wherein,
Figure FDA00021128083200000419
is composed of
Figure FDA00021128083200000420
Mahalanobis distance to mth gaussian component, T being Tb,wAny one of the rows.
(7) The relationship between each sub-block in each variable block is monitored by a Gaussian Mixture Model (GMM) method, which is implemented by the following sub-steps.
(7.1) blocking each variable block XbThe first columns of the pivot matrices of the respective sub-blocks are combined together:
Figure FDA0002112808320000051
wherein, tb,w(w=1,2,...,Wb) As a principal component matrix Tb,wThe 1 st column vector.
(7.2) describing principal metadata with GMM
Figure FDA0002112808320000052
The distribution of (c):
Figure FDA0002112808320000053
wherein,
Figure FDA0002112808320000054
the number of Gaussian components in the b variable sub-block is shown;
Figure FDA0002112808320000055
the weight of the m-th component is represented,
Figure FDA0002112808320000056
is composed of sub-Gaussian componentsMean value of
Figure FDA0002112808320000057
Sum covariance matrix
Figure FDA0002112808320000058
Of the parameter set (c).
(7.3) Main metadata for each variable Block
Figure FDA0002112808320000059
Establishing BIP statistic:
Figure FDA00021128083200000510
wherein,
Figure FDA00021128083200000511
as a principal component matrix
Figure FDA00021128083200000512
An nth (N ═ 1, 2.., N) row vector.
Figure FDA00021128083200000513
To represent
Figure FDA00021128083200000514
Belonging to the m-th component
Figure FDA00021128083200000515
The probability of (c).
Figure FDA00021128083200000516
Is based on the probability of the local mahalanobis distance,
Figure FDA00021128083200000517
is composed of
Figure FDA00021128083200000518
Mahalanobis distance to the mth gaussian component,
Figure FDA00021128083200000519
is composed of
Figure FDA00021128083200000520
Any one of the rows.
(8) During online fault detection, the process is monitored from three levels of variable subblocks, variable blocks and the whole unit. This step is realized by the following substeps.
(8.1) acquiring new data: and (4) collecting the variable values of the measuring points according to the step (1) and recording as z (1 multiplied by J).
(8.2) according to the variable blocking results obtained in the step (2) and the step (3), carrying out sub-block decomposition on the new data:
z=[z1z2... zb... zB](26)
Figure FDA00021128083200000521
wherein z isb(B ═ 1, 2.., B) is the B variable sub-block.
(8.3) at the bottom layer, i.e., variable sub-block layer, each sub-block
Figure FDA00021128083200000522
Figure FDA00021128083200000523
The data of (2) are projected to the principal element direction of the corresponding sub-block:
Figure FDA0002112808320000061
wherein
Figure FDA0002112808320000062
Is a load matrix.
(8.4) obtaining each sub-block
Figure FDA0002112808320000063
The online statistic index of (1):
Figure FDA0002112808320000064
wherein, the meaning of each parameter in the above formula is similar to that in the formula (22).
Figure FDA0002112808320000065
To represent
Figure FDA0002112808320000066
Belonging to the m-th component
Figure FDA0002112808320000067
The probability of (c).
Figure FDA0002112808320000068
Is based on the probability of the local mahalanobis distance,
Figure FDA0002112808320000069
is composed of
Figure FDA00021128083200000610
Mahalanobis distance to mth gaussian component, T being Tb,wAny one of the rows.
(8.5) at the variable block level, z is first putbThe main elements of each variable sub-block are combined together:
Figure FDA00021128083200000611
(8.6) obtaining each variable block zbThe online statistic index of (1):
Figure FDA00021128083200000612
wherein, the meaning of each parameter of the above formula is similar to that in the formula (25).
Figure FDA00021128083200000613
To represent
Figure FDA00021128083200000614
Belonging to the m-th component
Figure FDA00021128083200000615
The probability of (c).
Figure FDA00021128083200000616
Is based on the probability of the local mahalanobis distance,
Figure FDA00021128083200000617
is composed of
Figure FDA00021128083200000618
Mahalanobis distance to the mth gaussian component,
Figure FDA00021128083200000619
is composed of
Figure FDA00021128083200000620
Any one of the rows.
(8.7) converting the BIP index of each variable block into the probability of normal (labeled 'N') and fault (labeled 'F'):
Figure FDA00021128083200000621
Figure FDA00021128083200000622
wherein BIPb,lmtA control limit for the statistical BIP indicator;
Figure FDA00021128083200000623
representing the normal conditional probability of the b variable block;
Figure FDA00021128083200000624
indicating the conditional probability of the failure of the b-th variable block.
(8.8) calculating the posterior probability of the b variable block failing by the Bayes rule
Figure FDA0002112808320000071
Wherein, Pb(F)=α;Pb(N) ═ 1- α represents the prior probability of the process failing or being normal, respectively, at a level of significance α.
(8.9) comprehensively considering the fault probability of all variable blocks and calculating the global monitoring statistic
Figure FDA0002112808320000072
(9) Judging the running state of the process: and analyzing the process state from three levels of the variable subblocks, the variable blocks and the whole unit. Three levels of statistics are compared with the control limits in real time:
(a) at each variable sub-block
Figure FDA0002112808320000073
In, if BIPb,w1-alpha, in sub-blocks
Figure FDA0002112808320000074
The variable in (b) fails, otherwise the variable in the sub-block is considered to be operating within the normal range.
(b) At the variable block level, if BIPbIf the value is more than 1-alpha, the related relation of each variable sub-block in the variable block is abnormal, otherwise, all the variables in the b-th sub-block are normally operated.
(c) At the unit level, if PFzIf the measured value is more than alpha, the abnormal condition or the fault occurs in the running process of the million kilowatt ultra-supercritical unit, otherwise, the whole unit runs normally.
CN201910579518.8A 2019-06-28 2019-06-28 An online fault detection method for one million kilowatt ultra-supercritical units Active CN111880090B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910579518.8A CN111880090B (en) 2019-06-28 2019-06-28 An online fault detection method for one million kilowatt ultra-supercritical units

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910579518.8A CN111880090B (en) 2019-06-28 2019-06-28 An online fault detection method for one million kilowatt ultra-supercritical units

Publications (2)

Publication Number Publication Date
CN111880090A true CN111880090A (en) 2020-11-03
CN111880090B CN111880090B (en) 2021-07-06

Family

ID=73153874

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910579518.8A Active CN111880090B (en) 2019-06-28 2019-06-28 An online fault detection method for one million kilowatt ultra-supercritical units

Country Status (1)

Country Link
CN (1) CN111880090B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113092907A (en) * 2021-04-02 2021-07-09 长春工业大学 System fault detection method based on block slow characteristic analysis
CN113158769A (en) * 2021-03-03 2021-07-23 安徽大学 CEEMDAN and FastICA-based electromechanical device bearing vibration signal denoising method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5461329A (en) * 1992-01-21 1995-10-24 Martin Marietta Energy Systems, Inc. Method and apparatus for generating motor current spectra to enhance motor system fault detection
CN102086784A (en) * 2010-12-16 2011-06-08 浙江大学 Distributed remote vibration monitoring and fault diagnosis system of large steam turbine-generator
CN105425779A (en) * 2015-12-24 2016-03-23 江南大学 ICA-PCA multi-working condition fault diagnosis method based on local neighborhood standardization and Bayesian inference
CN108490908A (en) * 2018-02-11 2018-09-04 浙江大学 A kind of dynamic distributed monitoring method towards gigawatt extra-supercritical unit variable parameter operation
CN108508866A (en) * 2018-03-21 2018-09-07 浙江大学 A kind of gigawatt extra-supercritical unit failure identification variables method based on sparse opposite discriminant analysis
CN109491338A (en) * 2018-11-09 2019-03-19 南通大学 A kind of relevant method for diagnosing faults of multimode procedure quality based on sparse GMM

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5461329A (en) * 1992-01-21 1995-10-24 Martin Marietta Energy Systems, Inc. Method and apparatus for generating motor current spectra to enhance motor system fault detection
CN102086784A (en) * 2010-12-16 2011-06-08 浙江大学 Distributed remote vibration monitoring and fault diagnosis system of large steam turbine-generator
CN105425779A (en) * 2015-12-24 2016-03-23 江南大学 ICA-PCA multi-working condition fault diagnosis method based on local neighborhood standardization and Bayesian inference
CN108490908A (en) * 2018-02-11 2018-09-04 浙江大学 A kind of dynamic distributed monitoring method towards gigawatt extra-supercritical unit variable parameter operation
CN108508866A (en) * 2018-03-21 2018-09-07 浙江大学 A kind of gigawatt extra-supercritical unit failure identification variables method based on sparse opposite discriminant analysis
CN109491338A (en) * 2018-11-09 2019-03-19 南通大学 A kind of relevant method for diagnosing faults of multimode procedure quality based on sparse GMM

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
DONG WANG ET AL.: "A simple and fast guideline for generating enhanced/squared envelope spectra from spectral coherence for bearing fault diagnosis", 《MECHANICAL SYSTEMS AND SIGNAL PROCESSING》 *
范海东等: "基于稀疏故障演化判别分析的故障根源追溯", 《控制工程》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113158769A (en) * 2021-03-03 2021-07-23 安徽大学 CEEMDAN and FastICA-based electromechanical device bearing vibration signal denoising method
CN113092907A (en) * 2021-04-02 2021-07-09 长春工业大学 System fault detection method based on block slow characteristic analysis

Also Published As

Publication number Publication date
CN111880090B (en) 2021-07-06

Similar Documents

Publication Publication Date Title
US11740619B2 (en) Malfunction early-warning method for production logistics delivery equipment
CN110579709B (en) Fault diagnosis method for proton exchange membrane fuel cell for tramcar
CN110362045B (en) A fault discrimination method for offshore doubly-fed wind turbines considering marine meteorological factors
CN102981452B (en) Method for modeling and evaluating reliability of three types of functional components of numerical control machine tool
CN111537219B (en) A performance detection and health assessment method for fan gearboxes based on temperature parameters
CN108008332A (en) A kind of new energy Remote testing device method for diagnosing faults based on data mining
Mao et al. Anomaly detection for power consumption data based on isolated forest
CN115081795B (en) Method and system for analyzing causes of abnormal energy consumption in enterprises under multi-dimensional scenarios
CN109359662B (en) Non-stationary Analysis and Causal Diagnosis Method for One Million KW Ultra-Supercritical Units
CN110133410A (en) Transformer fault diagnosis method and system based on fuzzy C-means clustering algorithm
CN109491358B (en) Control performance monitoring method for boiler dynamic information of million-kilowatt ultra-supercritical unit
CN107037306A (en) Transformer fault dynamic early-warning method based on HMM
CN109978252B (en) A method for predicting and evaluating the operating state of a high-power integrated fuel cell system
CN111880090B (en) An online fault detection method for one million kilowatt ultra-supercritical units
CN116840722A (en) A method for performance degradation assessment and life prediction of proton exchange membrane fuel cells
CN110569888A (en) Transformer fault diagnosis method and device based on directed acyclic graph support vector machine
CN111709587B (en) State Probabilistic Evaluation Method of Distribution System Based on Probability-Sequence Uncertainty
Liu et al. Discriminative signal recognition for transient stability assessment via discrete mutual information approximation and Eigen decomposition of Laplacian matrix
CN112380763A (en) System and method for analyzing reliability of in-pile component based on data mining
CN118965247B (en) Power plant data management method and system based on multi-source data
Zhou et al. Abnormal data processing of wind turbine based on combined algorithm and class center imputation
CN112598030A (en) Non-stationary process monitoring method based on recursive covariance analysis and elastic weight consolidation
Chen et al. Consistency Evaluation for Lithium-Ion Battery Energy Storage Systems Based on Approximate Low-Rank Representation and Hypersphere Concentration
CN116629409A (en) SOFC system fault probability prediction method based on naive Bayes algorithm
CN116011982A (en) A method and system for on-line monitoring of coal mill grinding roller breakage

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant