CN111598220B

CN111598220B - Gas turbine fault prediction method based on correlation analysis

Info

Publication number: CN111598220B
Application number: CN202010404291.6A
Authority: CN
Inventors: 杨静; 方宝富; 沈安波; 樊高金; 江刘峰; 朱尤杰
Original assignee: Hefei University of Technology
Current assignee: Hefei University of Technology
Priority date: 2020-05-13
Filing date: 2020-05-13
Publication date: 2023-03-14
Anticipated expiration: 2040-05-13
Also published as: CN111598220A

Abstract

本发明公开了一种基于相关性分析的燃气轮机故障预测方法，其步骤包括：1、以流的方式逐个读入待处理的监测节点向量；2、对每个当前读入的监测节点向量，与系统已经进行读入的监测节点向量，进行相关性分析；3、对已经选择的相关监测节点进行冗余分析；4、对于新加入的监测节点进行定向，确定与其他监测节点间的因果关系，重复步骤1‑4，直至读入的监测节点向量的数量超过极限值，最终获得相应的监测系统因果结构图，并用于训练故障预测模型；从而得到故障预测模型，以实现对故障进行更加准确的预测。本发明能获得更加精准的故障预测模型，从而能对故障进行更加准确的预测。The invention discloses a gas turbine fault prediction method based on correlation analysis, the steps of which include: 1. Read in the monitoring node vectors to be processed one by one in a flow mode; 2. For each currently read monitoring node vector, and The system has already read in the monitoring node vector and performs correlation analysis; 3. Perform redundancy analysis on the selected related monitoring nodes; 4. Orientate the newly added monitoring nodes to determine the causal relationship with other monitoring nodes. Repeat steps 1-4 until the number of monitoring node vectors read in exceeds the limit value, and finally obtain the corresponding causal structure diagram of the monitoring system, and use it to train the fault prediction model; thus obtain the fault prediction model to achieve more accurate fault prediction predict. The present invention can obtain a more accurate fault prediction model, so that faults can be predicted more accurately.

Description

A Gas Turbine Fault Prediction Method Based on Correlation Analysis

技术领域technical field

本发明属于数据挖掘领域，具体地说是一种基于相关性分析的燃气轮机故障预测方法。The invention belongs to the field of data mining, in particular to a gas turbine fault prediction method based on correlation analysis.

背景技术Background technique

目前国内燃气轮机状态监测和故障诊断研究现状最近有很大进步,但是技术还相对比较落后,应用成果较少。随着大数据技术的兴起，如何将大数据相关技术应用于燃气轮机状态监测和故障诊断是一个值得研究的课题。燃气轮机机组在运行时不断地产生大量的监测数据，基于这些海量的运行监测数据，开展燃气轮机机组状态分析、性能监测和故障智能诊断预测研究，具有非常重要的现实意义。通过数据建模，可以对燃气轮机机组状态进行实时的健康评估，预测状态趋势，在没有发生重大故障前提前预警，可以早期发现燃气轮机故障，从而避免经济损失、提供维修建议、有助于燃机的安全可靠地运行。然而，这些数据的分布往往是任意的，彼此之间的关系往往具有非线性的特点，对于这种非线性数据的研究是具有一定的挑战。这些运行数据构成一个复杂的网络系统，辨识该复杂系统的网络节点间的联系，有助于燃气轮机的状态监测和故障预测。At present, the research status of domestic gas turbine condition monitoring and fault diagnosis has made great progress recently, but the technology is still relatively backward, and the application results are few. With the rise of big data technology, how to apply big data related technology to gas turbine condition monitoring and fault diagnosis is a topic worth studying. Gas turbine units continuously generate a large amount of monitoring data during operation. Based on these massive operating monitoring data, it is of great practical significance to carry out research on status analysis, performance monitoring, and fault intelligent diagnosis and prediction of gas turbine units. Through data modeling, real-time health assessment of gas turbine unit status can be performed, status trends can be predicted, early warnings can be made before major failures occur, and gas turbine failures can be detected early, thereby avoiding economic losses, providing maintenance suggestions, and contributing to the development of gas turbines. Operate safely and reliably. However, the distribution of these data is often arbitrary, and the relationship between each other is often nonlinear, so it is a certain challenge to study this nonlinear data. These operating data constitute a complex network system, and identifying the connections between the network nodes of this complex system is helpful for the state monitoring and failure prediction of gas turbines.

而描述复杂网络间关系的杰出模型就是由美国加州大学的Judea Pearl提出的基于概率论和图论的贝叶斯网络模型，并凭杰出的贡献获得2011年度图灵奖。Hoyer等对于贝叶斯网络因果模型进行更进一步扩展，提出了加性噪声模型，该模型可以建模非高斯非线性的数据。而燃气轮机机组的运行数据恰恰也是非高斯非线性的。所以，基于加性噪声模型对燃气轮机机组的运行数据进行分析是非常有意义的研究方向。至于加性噪声模型的结构学习，Hoyer等基于非线性回归和基于HSIC标准提出了识别因果结构的方法，Mooij等提出了基于HSIC回归的算法，Zhang等提出了两阶段的算法，Tillman等人提出了kPC算法，Yamada等提出了最小二乘独立性回归的方法，Mooij等提出了基于最大后验的方法，Zhang等提出一个基于核的条件独立测试，Peters等提出了基于后续独立测试的回归方法，Zhang等提出一个基于回归的条件独立型测试的方法等，Nowzohour等基于惩罚性的似然的方法等等。The outstanding model describing the relationship between complex networks is the Bayesian network model based on probability theory and graph theory proposed by Judea Pearl of the University of California, USA, and won the 2011 Turing Award for his outstanding contributions. Hoyer et al. further extended the Bayesian network causal model and proposed an additive noise model, which can model non-Gaussian nonlinear data. However, the operating data of the gas turbine unit is also non-Gaussian nonlinear. Therefore, it is a very meaningful research direction to analyze the operating data of the gas turbine unit based on the additive noise model. As for the structure learning of additive noise models, Hoyer et al. proposed a method for identifying causal structures based on nonlinear regression and HSIC criteria, Mooij et al. proposed an algorithm based on HSIC regression, Zhang et al. proposed a two-stage algorithm, Tillman et al. proposed the kPC algorithm, Yamada et al. proposed the method of least squares independence regression, Mooij et al. proposed a method based on maximum a posteriori, Zhang et al. proposed a kernel-based conditional independent test, Peters et al. proposed a regression method based on subsequent independent tests , Zhang et al. proposed a regression-based conditional independence test method, etc., Nowzohour et al. based on the penalty-likelihood method and so on.

燃气轮机机组数据还具有高维特性，处理高维数据的一种方法就是降维，通常采用主成分分析法、独立成分分析等方法，而这些方法都要事先知道所有数据维的信息并一次载入内存，但有时燃气轮机机组数据维数巨大无法一次载入内存。并且可能不断出现新的测点数据，造成数据的特征空间是动态的、未知的。处理动态高维的数据近年来兴起一种基于流特征的数据分析方法，目前是数据挖掘领域一个新兴的研究方向，可以有效的处理高维大数据。Gas turbine unit data also has high-dimensional characteristics. One method to deal with high-dimensional data is dimensionality reduction, usually using methods such as principal component analysis and independent component analysis, and these methods must know all data dimensions in advance and load them at once. memory, but sometimes the data dimension of the gas turbine unit is too large to be loaded into the memory at one time. And new measuring point data may appear continuously, resulting in a dynamic and unknown feature space of the data. Handling dynamic high-dimensional data In recent years, a data analysis method based on flow features has emerged. It is currently an emerging research direction in the field of data mining, which can effectively process high-dimensional big data.

目前这些方法的主要局限包括：The main limitations of these current approaches include:

(1)上述多数算法的计算复杂度比较大，满足不了燃气轮机机组运行数据的在线实时学习；(1) The computational complexity of most of the above algorithms is relatively large, which cannot meet the online real-time learning of the operating data of the gas turbine unit;

(2)燃气轮机机组数据维数巨大无法一次载入内存，并且可能不断出现新的测点数据，现存的方法不能处理这种情形。(2) The data dimension of the gas turbine unit is too large to be loaded into the memory at one time, and new measurement point data may appear continuously, and the existing methods cannot handle this situation.

发明内容Contents of the invention

本发明为克服现有技术存在的不足之处，提出了一种基于相关性分析的燃气轮机故障预测方法，以期能获得更加精准的故障预测模型，从而能对故障进行更加准确的预测。In order to overcome the deficiencies in the prior art, the present invention proposes a gas turbine fault prediction method based on correlation analysis, in order to obtain a more accurate fault prediction model, thereby enabling more accurate fault prediction.

本发明为解决技术问题采用如下技术方案：The present invention adopts following technical scheme for solving technical problems:

本发明一种基于相关性分析的燃气轮机故障预测方法，是应用于燃气轮机系统中，并每隔一段时间对所述燃气轮机系统中Z个监测节点的运行状态进行监测，从而得到m次监测值向量所组成的燃气轮机运行数据集D，记为D＝{sam₁,sam₂,...,sam_v,...,sam_m}，其中，sam_v表示第v次监测值向量，且

表示第v次监测值向量中第i个监测节点的监测值；1≤v≤m，1≤i≤Z；将m次监测下的第i个监测节点的监测值所组成的向量定义为X_i，其特点是，所述燃气轮机故障预测是按如下步骤进行：A gas turbine failure prediction method based on correlation analysis of the present invention is applied to a gas turbine system, and monitors the operating states of Z monitoring nodes in the gas turbine system at regular intervals, thereby obtaining m times of monitoring value vectors Composed gas turbine operation data set D, recorded as D={sam ₁ , sam ₂ ,...,sam _v ,...,sam _m }, where sam _v represents the vth monitoring value vector, and

Indicates the monitoring value of the i-th monitoring node in the v-th monitoring value vector; 1≤v≤m, 1≤i≤Z; the vector formed by the monitoring value of the i-th monitoring node under m monitoring is defined as X _i , which is characterized in that the gas turbine failure prediction is performed according to the following steps:

步骤1、定义时刻t，并初始化t＝0；Step 1, define time t, and initialize t=0;

定义监测节点个数Z的极限值为max，即Z≤max；Define the limit value of the number of monitoring nodes Z to be max, that is, Z≤max;

步骤2、定义所选择的监测节点向量集为EF，并初始化第t时刻所选择的监测节点向量集为

Step 2. Define the selected monitoring node vector set as EF, and initialize the monitoring node vector set selected at the tth moment as

步骤3、定义变量j，并初始化j＝1；Step 3, define variable j, and initialize j=1;

步骤4、判断j≤Z是否成立，若成立，从燃气轮机运行数据集D中读取具有m个取值的第j个监测节点向量X_j；并初始化第j个监测节点向量X_j的相关监测节点向量集MB(X_j)为空、初始化第j个监测节点向量X_j的新增监测节点向量集FA(X_j)为空、初始化第j个监测节点向量X_j的冗余监测节点向量集FD(X_j)为空；再执行步骤5；否则，表示获得最终的由监测节点构成的因果结构图，其中，每个监测节点的父节点和子节点都是与相应监测节点相关的监测节点，并执行步骤16；Step 4. Determine whether j≤Z is true, if true, read the j-th monitoring node vector X _j with m values from the gas turbine operation data set D; and initialize the related monitoring of the j-th monitoring node vector X _j The node vector set MB(X _j ) is empty, the newly added monitoring node vector set FA(X _j ) of the j-th monitoring node vector X _j is initialized to be empty, and the redundant monitoring node vector of the j-th monitoring node vector X _j is initialized The set FD(X _j ) is empty; go to step 5 again; otherwise, it means to obtain the final causal structure graph composed of monitoring nodes, where the parent node and child nodes of each monitoring node are monitoring nodes related to the corresponding monitoring node , and execute step 16;

步骤5、判断j＝1是否成立，若成立，则将所述第j个监测节点向量X_j加入所述第t时刻所选择的监测节点向量集EF_t中，从而获得第t+1时刻所述选择的监测节点向量集EF_t+1；再将t+1赋值给t、将j+1赋值给j后，返回步骤4；否则执行步骤6；Step 5, judging whether j=1 is established, if established, then adding the j monitoring node vector X _j to the monitoring node vector set EF _t selected at the t time, so as to obtain the t+1 time The monitoring node vector set EF _t+1 selected above; then assign t+1 to t and j+1 to j, then return to step 4; otherwise, execute step 6;

步骤6、对所述第j个监测节点向量X_j进行相关性分析；Step 6, performing a correlation analysis on the jth monitoring node vector X _j ;

步骤7、判断所述第j个监测节点向量X_j的相关监测节点向量集MB(X_j)是否为空集，若为空集，则返回步骤4；否则，将第j个监测节点向量X_j加入第t时刻所选择的监测节点向量集EF_t中，从而获得第t+1时刻所选择的监测节点向量集EF_t+1＝EF_t∪X_j；再将t+1赋值给t后，执行步骤8；Step 7, judging whether the related monitoring node vector set MB(X _j ) of the jth monitoring node vector X _j is an empty set, if it is an empty set, then return to step 4; otherwise, set the jth monitoring node vector X _j is added to the monitoring node vector set EF _t selected at the tth moment, so as to obtain the monitoring node vector set EF _t+1 =EF _t ∪X _j at the t+1th moment; and then assign t+1 to t , go to step 8;

步骤8、定义变量k，并初始化k＝1；Step 8, define variable k, and initialize k=1;

步骤9、对所述第t时刻所选择的监测节点向量集EF_t的第k个监测节点向量X_k进行冗余校验分析；Step 9, performing redundancy check analysis on the kth monitoring node vector X _k of the monitoring node vector set EF _t selected at the tth moment;

步骤10、将k+1赋值给k，并判断k＞j是否成立，若成立，则执行步骤11；否则返回步骤9执行；Step 10, assign k+1 to k, and judge whether k>j is true, if true, execute step 11; otherwise, return to step 9 for execution;

步骤11、定义变量count；并初始化count＝0；初始化k＝1；Step 11, define variable count; and initialize count=0; initialize k=1;

步骤12、判断所述第k个监测节点向量X_k的相关监测节点向量集MB(X_k)是否为空集，若为空集，则从所述第t时刻所选择的监测节点向量集EF_t中删除所述第k个监测节点向量X_k后，再将count+1赋值给count后，执行步骤13；否则直接执行步骤13；Step 12. Judging whether the related monitoring node vector set MB(X _k ) of the kth monitoring node vector X _k is an empty set, if it is an empty set, then from the monitoring node vector set EF selected at the tth moment After deleting the k-th monitoring node vector X _k in _t , assigning count+1 to count, execute step 13; otherwise, directly execute step 13;

步骤13、将k+1赋值给k后；判断k＞j是否成立，若成立，则将j-count赋值给j后，获得更新的第t时刻所选择的监测节点向量集EF_t′，记为EF_t′＝{X′₁,X′₂,...,X′_i,...X′_j-count}；X′_i表示更新的第t时刻的监测节点向量集EF_t′中第i个监测节点向量，并有，

表示第i个监测节点向量X′_i中第v个监测值；1≤i≤j-count；再将j-count赋值给j后，执行步骤14；否则返回步骤12执行；Step 13. After assigning k+1 to k; judge whether k>j is true, if it is true, assign j-count to j, and obtain the updated monitoring node vector set EF _t ' selected at the tth moment, record EF _t ′={X′ ₁ ,X′ ₂ ,...,X′ _i ,...X′ _j-count }; X′ _i represents the updated monitoring node vector set EF _t ′ at the tth moment The i-th monitoring node vector, and has,

Indicates the v-th monitoring value in the i-th monitoring node vector X′ _i ; 1≤i≤j-count; after assigning j-count to j, execute step 14; otherwise, return to step 12 for execution;

步骤14、对所述第t时刻所选择的监测节点向量集EF_t′的第j个监测节点向量X′_j进行在线的局部定向，从而得到第t时刻因果结构图；Step 14, performing online local orientation on the jth monitoring node vector _X'j of the monitoring node vector set EF _t ' selected at the tth moment, so as to obtain the causal structure diagram at the tth moment;

步骤15、将j+1赋值给j，返回步骤4；Step 15, assign j+1 to j, and return to step 4;

步骤16、任意选择一个监测节点的监测节点向量，并作为LSTM神经网络模型的输出，再将与所选择的监测节点相关的监测节点向量作为LSTM神经网络模型的输入，从而训练LSTM神经网络模型，从而得到故障预测模型；Step 16, arbitrarily select the monitoring node vector of a monitoring node, and use it as the output of the LSTM neural network model, and then use the monitoring node vector related to the selected monitoring node as the input of the LSTM neural network model, thereby training the LSTM neural network model, So as to get the fault prediction model;

步骤17、实时监测任意一个监测节点的运行状态并获得相应的燃气轮机运行数据集，并利用所述故障预测模型得到实时监测的监测节点的预测值，再将所述预测值与实时监测的监测节点的真实值进行比较，当超过所设定的阈值，则表示相应监测节点可能发生故障，并给出预警提示。Step 17. Real-time monitor the operating status of any monitoring node and obtain the corresponding gas turbine operating data set, and use the fault prediction model to obtain the predicted value of the real-time monitored monitoring node, and then compare the predicted value with the real-time monitored monitoring node Compared with the real value of , when it exceeds the set threshold, it means that the corresponding monitoring node may fail, and an early warning prompt is given.

本发明所述的基于相关性分析的燃气轮机故障预测方法的特点也在于，所述步骤6是按如下步骤进行：The gas turbine failure prediction method based on correlation analysis of the present invention is also characterized in that said step 6 is carried out as follows:

步骤6.1、设置相关性阈值为α；Step 6.1, setting the correlation threshold to α;

步骤6.2、定义变量w；并初始化w＝1；定义变量θ；Step 6.2, define variable w; and initialize w=1; define variable θ;

步骤6.3、用希尔伯特-施密特独立性准则计算第j个监测节点向量X_j和第w个监测节点向量X_w的相关程度HSIC_jw；Step 6.3, using the Hilbert-Schmidt independence criterion to calculate the correlation degree HSIC _jw between the jth monitoring node vector X _j and the wth monitoring node vector _Xw ;

步骤6.4、将相关程度HSIC_jw赋值给θ，并判断θ≥α是否成立，若成立，则表示第j个监测节点向量X_j与第w个监测节点向量X_w相关，并执行步骤6.5；否则，表示第j个监测节点向量X_j与第w个监测节点向量X_w独立，并执行步骤6.6；Step 6.4. Assign the degree of correlation HSIC _jw to θ, and judge whether θ≥α is established. If it is established, it means that the j-th monitoring node vector X _j is related to the w-th monitoring node vector X _w , and perform step 6.5; otherwise , indicating that the j-th monitoring node vector X _j is independent from the w-th monitoring node vector X _w , and perform step 6.6;

步骤6.5、将第j个监测节点向量X_j加到第w个监测节点向量X_w的相关监测节点向量集MB(X_w)中，即MB(X_w)＝MB(X_w)∪X_j，将第j个监测节点向量X_j加入新增监测节点向量集FA(X_w)中，即FA(X_w)＝FA(X_w)∪{X_j}，从而更新第w个监测节点向量X_w的相关监测节点向量集MB(X_w)和新增监测节点向量集FA(X_w)；同时将第w个监测节点向量X_w加到第j个监测节点向量X_j的相关监测节点向量集MB(X_j)，即MB(X_j)＝MB(X_j)∪X_w，将第w个监测节点向量X_w加入新增监测节点向量集FA(X_j)中，即FA(X_j)＝FA(X_j)∪{X_w}，从而更新第j个监测节点向量X_j的相关监测节点向量集MB(X_j)和新增监测节点向量集FA(X_j)；再执行步骤6.6；Step 6.5. Add the j-th monitoring node vector X _j to the related monitoring node vector set MB(X _w ) of the w-th monitoring node vector X _w , that is, MB(X _w )=MB(X _w )∪X _j , add the j-th monitoring node vector X _j to the new monitoring node vector set FA(X _w ), that is, FA(X _w )=FA(X _w )∪{X _j }, thereby updating the w-th monitoring node vector The related monitoring node vector set MB(X _w ) of X _w and the new monitoring node vector set FA(X _w ); at the same time, add the wth monitoring node vector X _w to the related monitoring node of the jth monitoring node vector X _j Vector set MB(X _j ), that is, MB(X _j )=MB(X _j )∪X _w , add the wth monitoring node vector X _w to the new monitoring node vector set FA(X _j ), that is, FA( X _j )=FA(X _j )∪{X _w }, thus updating the related monitoring node vector set MB(X _j ) of the jth monitoring node vector X _j and the new monitoring node vector set FA(X _j ); Execute step 6.6;

步骤6.6、将k+1赋值给k，并判断k＞j-1是否成立，若成立，则执行步骤7；否则返回步骤6.3执行。Step 6.6. Assign k+1 to k, and judge whether k>j-1 is true, and if so, execute step 7; otherwise, return to step 6.3 for execution.

步骤9在线的冗余校验分析是按如下步骤进行：The online redundancy check analysis of step 9 is carried out according to the following steps:

步骤9.1、设置冗余度阈值β；计算第k个监测节点向量X_k的相关监测节点向量集MB(X_k)中的监测节点向量个数，记为S_k；Step 9.1, setting the redundancy threshold β; calculating the number of monitoring node vectors in the related monitoring node vector set MB(X _k ) of the kth monitoring node vector X _k , denoted as S _k ;

步骤9.2、定义变量s；并初始化s＝1；定义变量σ；Step 9.2, define variable s; and initialize s=1; define variable σ;

步骤9.3、获取所述相关监测节点向量集MB(X_k)中第s个监测节点向量的下标记为τ_s；Step 9.3, obtaining the subscript of the sth monitoring node vector in the related monitoring node vector set MB(X _k ) as τ _s ;

步骤9.4、用希尔伯特-施密特独立性准则计算第τ_s个监测节点向量

和第k个监测节点向量X_k的相关程度

Step 9.4, use the Hilbert-Schmidt independence criterion to calculate the τ _sth monitoring node vector

The degree of correlation with the kth monitoring node vector X _k

步骤9.5、将所述相关程度

赋值给σ，判断σ≤β是否成立，若成立，则表示第τ_s个监测节点向量

和第k个监测节点向量X_k不相关，即为冗余监测节点，并执行步骤9.6；否则，表明第τ_s个监测节点向量

和第k个监测节点向量X_k相关，并执行步骤9.7；Step 9.5, the correlation degree

Assign a value to σ to judge whether σ≤β is established, and if it is established, it means that the τ _sth monitoring node vector

It is irrelevant to the k-th monitoring node vector X _k , that is, it is a redundant monitoring node, and execute step 9.6; otherwise, it indicates that the τ _s -th monitoring node vector

Correlate with the kth monitoring node vector X _k , and execute step 9.7;

步骤9.6、从所述第k个监测节点向量X_k的相关监测节点向量集MB(X_k)中删除所述第τ_s个监测节点向量

即

并将第τ_s个监测节点向量

加入所述第k个监测节点向量X_k的冗余监测节点向量集FD(X_k)中，即

从所述第τ_s个监测节点向量

的相关监测节点向量集

中删除所述第k个监测节点向量X_k，即MB(X_τs)＝MB(X_τs)-X_k，并将第k个监测节点向量X_k加入所述第τ_s个监测节点向量

的冗余监测节点向量集

即

Step 9.6. Delete the τ s monitoring node vector from the related monitoring node vector set MB(X _k ) of the _k monitoring node vector X _k

Right now

and the τ _s monitoring node vector

Add to the redundant monitoring node vector set FD(X _k ) of the kth monitoring node vector X _k , namely

From the τ _s monitoring node vector

The related monitoring node vector set of

Delete the k-th monitoring node vector X _k in , that is, MB(X _τs )=MB(X _τs )-X _k , and add the k-th monitoring node vector X _k to the τ _s -th monitoring node vector

Redundancy monitoring node vector set of

Right now

步骤9.7、将s+1赋值给s；并判断s＞S_k是否成立，若成立，则执行步骤10；否则返回步骤9.3执行。Step 9.7. Assign s+1 to s; and judge whether s>S _k is true, and if so, execute step 10; otherwise, return to step 9.3 for execution.

所述步骤14是按如下步骤进行：Described step 14 is to carry out as follows:

步骤14.1、设置方向支持度阈值为γ；Step 14.1, set the direction support threshold to γ;

步骤14.2、从第j个监测节点向量X_j的相关监测节点向量集MB(X_j)任选一个监测节点向量X_g，并将第g个监测节点向量X_g从相关监测节点向量集MB(X_i)中删除；Step 14.2. Select a monitoring node vector X _g from the related monitoring node vector set MB(X _j ) of the j-th monitoring node vector X _j , and select the g-th monitoring node vector X _g from the related monitoring node vector set MB( X _i );

步骤14.3、当第g个监测节点向量X_g作为第j个监测节点向量X_j的父监测节点向量，即X_g→X_j时，利用最小二乘互信息方法的p-value值计算X_g→X_j方向的支持度，并记为p-value(X_g,X_j)；Step 14.3. When the g-th monitoring node vector X _g is used as the parent monitoring node vector of the j-th monitoring node vector X _j , that is, X _g → X _j , use the p-value value of the least squares mutual information method to calculate X _g →The support degree in the X _j direction, and recorded as p-value(X _g ,X _j );

步骤14.4、当第j个监测节点向量X_j作为第g个监测节点向量X_g的父监测节点向量，即X_j→X_g时，利用最小二乘互信息方法的的p-value值计算X_j→X_g方向的支持度，并记为p-value(X_j,X_g)；Step 14.4. When the j-th monitoring node vector X _j is used as the parent monitoring node vector of the g-th monitoring node vector X _g , that is, X _j → X _g , use the p-value value of the least squares mutual information method to calculate X The support degree of _j →X _g direction, and recorded as p-value(X _j ,X _g );

步骤14.5、如果p-value(X_g,X_j)＞γ或p-value(X_j,X_g)≤γ，则表示X_g→X_j方向的支持度较大，并定向为X_g→X_j；Step 14.5. If p-value(X _g ,X _j )>γ or p-value(X _j ,X _g )≤γ, it means that the support degree of X _g →X _j direction is large, and it is oriented as X _g → _Xj ;

如果p-value(X_j,X_g)＞γ或p-value(X_g,X_j)≤γ，则表示X_j→X_g方向的支持度较大，并定向为X_j→X_g；If p-value(X _j ,X _g )＞γ or p-value(X _g ,X _j )≤γ, it means that the support degree in the direction of X _j →X _g is large, and it is oriented as X _j →X _g ;

如果p-value(X_j,X_g)≤γ或p-value(X_g,X_j)≤γ或p-value(X_g,X_j)＞γ或p-value(X_j,X_g)＞γ，则表示两监测节点向量间没有因果关系，不用定向；If p-value(X _j ,X _g )≤γ or p-value(X _g ,X _j )≤γ or p-value(X _g ,X _j )＞γ or p-value(X _j ,X _g ) >γ, it means that there is no causal relationship between the two monitoring node vectors, and no orientation is required;

步骤14.6、如果第j个监测节点向量X_j的相关监测节点向量集MB(X_j)为空，则执行步骤15，否则，则返回步骤14.2执行。Step 14.6. If the related monitoring node vector set MB(X _j ) of the jth monitoring node vector X _j is empty, execute step 15; otherwise, return to step 14.2 for execution.

与已有技术相比，本发明的有益效果体现在：Compared with the prior art, the beneficial effects of the present invention are reflected in:

1、针对燃气轮机机组运行监测数据分布往往是任意的，彼此之间的关系往往具有非线性的特点，本发明基于希尔伯特-施密特独立性准则对监测节点的相关性进行研究，是一个新研究，结合局部学习策略实现监测节点相关监测节点的学习，显著的降低了学习的复杂度，从而满足了燃气轮机状态实时监测的需要。1. The distribution of monitoring data for the operation of gas turbine units is often arbitrary, and the relationship between them often has nonlinear characteristics. The present invention studies the correlation of monitoring nodes based on the Hilbert-Schmidt independence criterion. A new study, combined with local learning strategies to realize the learning of monitoring nodes related to monitoring nodes, significantly reduces the complexity of learning, thus meeting the needs of real-time monitoring of gas turbine status.

2、针对燃气轮机机组运行监测数据，研究监测节点彼此之间的因果联系，通常的贪婪搜索的方法复杂度较大，满足不了在线学习的时效性，本发明采用最小二乘互信息的方式进行定向，计算复杂度显著的降低，满足了在线的燃气轮机状态实时监测。2. For the monitoring data of the gas turbine unit operation, the causal relationship between the monitoring nodes is studied. The usual greedy search method is complex and cannot meet the timeliness of online learning. The present invention adopts the least square mutual information method for orientation , the computational complexity is significantly reduced, and the online real-time monitoring of gas turbine status is satisfied.

3、本发明针对燃气轮机机组运行数据的动态、高维性，以流的方式进行处理，可以处理高维、动态的燃气轮机机组运行监测数据。基于相关性分析和冗余校验从而实现了监测节点的相关相关监测节点集的在线更新，融入实时定向，实现了局部因果结构的快速在线调整，流的处理方式可以降低学习的时间复杂度，从而满足了在线学习的时效性要求，适用于高维动态的燃气轮机机组运行数据。3. The present invention aims at the dynamic and high-dimensional nature of the gas turbine unit operation data, and processes it in a flow manner, and can process high-dimensional and dynamic gas turbine unit operation monitoring data. Based on correlation analysis and redundancy check, the online update of related monitoring node sets of monitoring nodes is realized, and real-time orientation is integrated to realize fast online adjustment of local causal structure. The stream processing method can reduce the time complexity of learning, Therefore, the timeliness requirement of online learning is met, and it is suitable for high-dimensional dynamic gas turbine unit operation data.

具体实施方式Detailed ways

本实施例中，一种基于相关性分析的燃气轮机故障预测方法，是应用于燃气轮机系统中，并每隔一段时间对燃气轮机系统中Z个监测节点的运行状态进行监测，持续记录一段时间，假设共监测m次，从而得到m次监测值向量所组成的燃气轮机运行数据集D，记为D＝{sam₁,sam₂,...,sam_v,...,sam_m}，其中，sam_v表示第v次监测值向量，且

表示第v次监测值向量中第i个监测节点的监测值；1≤v≤m，1≤i≤Z；将m次监测下的第i个监测节点的监测值所组成的向量定义为X_i，

表示第i个监测节点向量X_i具有m个取值，因为对第i个监测节点进行了m次监测，就有m个监测值。该燃气轮机故障预测方法目的是为了找出监测节点间的关系，找到与任意监测节点相关性较强的监测节点，并在该方法的基础上，使用神经网络的方法对于监测节点的未来趋势进行预测，从而对为燃气轮机的运行状态进行监测和故障预警。具体的说，该燃气轮机故障预测是按如下步骤进行：In this embodiment, a gas turbine failure prediction method based on correlation analysis is applied to a gas turbine system, and monitors the operating status of Z monitoring nodes in the gas turbine system at regular intervals, and keeps recording for a period of time, assuming a total of Monitor m times, so as to obtain the gas turbine operation data set D composed of m monitoring value vectors, recorded as D={sam ₁ ,sam ₂ ,...,sam _v ,...,sam _m }, where, sam _v Indicates the vth monitoring value vector, and

Indicates the monitoring value of the i-th monitoring node in the v-th monitoring value vector; 1≤v≤m, 1≤i≤Z; the vector formed by the monitoring value of the i-th monitoring node under m monitoring is defined as X _i ,

Indicates that the i-th monitoring node vector X _i has m values, because the i-th monitoring node has been monitored for m times, and there are m monitoring values. The purpose of this gas turbine fault prediction method is to find out the relationship between the monitoring nodes, find the monitoring node with strong correlation with any monitoring node, and on the basis of this method, use the neural network method to predict the future trend of the monitoring node , so as to monitor the operating status of the gas turbine and provide early warning of failures. Specifically, the fault prediction of the gas turbine is carried out as follows:

步骤4、判断j≤Z是否成立，若成立，从燃气轮机运行数据集D中读取具有m个取值的第j个监测节点向量X_j；并初始化第j个监测节点向量X_j的相关监测节点向量集MB(X_j)为空、初始化第j个监测节点向量X_j的新增监测节点向量集FA(X_j)为空、初始化第j个监测节点向量X_j的冗余监测节点向量集FD(X_j)为空；再执行步骤5；否则，表示获得最终的由监测节点构成的因果结构图，且每个监测节点的父节点和子节点都是与相应监测节点相关的监测节点，并执行步骤16；Step 4. Determine whether j≤Z is true, if true, read the j-th monitoring node vector X _j with m values from the gas turbine operation data set D; and initialize the related monitoring of the j-th monitoring node vector X _j The node vector set MB(X _j ) is empty, the newly added monitoring node vector set FA(X _j ) of the j-th monitoring node vector X _j is initialized to be empty, and the redundant monitoring node vector of the j-th monitoring node vector X _j is initialized The set FD(X _j ) is empty; go to step 5 again; otherwise, it means that the final causal structure diagram composed of monitoring nodes is obtained, and the parent node and child node of each monitoring node are monitoring nodes related to the corresponding monitoring node, And execute step 16;

步骤5、判断j＝1是否成立，若成立，则将第j个监测节点向量X_j加入第t时刻所选择的监测节点向量集EF_t中，从而获得第t+1时刻选择的监测节点向量集EF_t+1；再将t+1赋值给t、将j+1赋值给j后，返回步骤4；否则执行步骤6；Step 5. Judging whether j=1 is established, if it is established, add the j-th monitoring node vector X _j to the monitoring node vector set EF _t selected at the t-th moment, so as to obtain the monitoring node vector selected at the t+1-th moment Set EF _t+1 ; then assign t+1 to t and j+1 to j, then return to step 4; otherwise, go to step 6;

步骤6、对第j个监测节点向量X_j进行相关性分析；Step 6, performing a correlation analysis on the jth monitoring node vector X _j ;

依据公式(1)计算独立性准则HSIC_jw的值：Calculate the value of the independence criterion HSIC _jw according to the formula (1):

n是向量的X_j和X_w维度数，H，K，L都是n行n列的矩阵，K_ij＝k(x_i,x_j)，L_ij＝l(x_i,x_j)，K_ij和L_ij是映射的核函数，H＝I-n^-111^T，1是n×1的全1向量，trace是矩阵的迹运算。n is the number of X _j and X _w dimensions of the vector, H, K, and L are all matrixes of n rows and n columns, K _ij =k( _xi ,x _j ), L _ij =l( _xi ,x _j ), K _ij and L _ij are the kernel functions of the mapping, H=In ^-1 11 ^T , 1 is an n×1 full 1 vector, and trace is the trace operation of the matrix.

步骤7、判断第j个监测节点向量X_j的相关监测节点向量集MB(X_j)是否为空集，若为空集，则返回步骤4；否则，将第j个监测节点向量X_j加入第t时刻所选择的监测节点向量集EF_t中，从而获得第t+1时刻所选择的监测节点向量集EF_t+1＝EF_t∪X_j；再将t+1赋值给t后，执行步骤8；Step 7. Determine whether the related monitoring node vector set MB(X _j ) of the j-th monitoring node vector X _j is an empty set, and if it is an empty set, return to step 4; otherwise, add the j-th monitoring node vector X _j to In the monitoring node vector set EF _t selected at the tth moment, the monitoring node vector set EF t+1 selected at the t+1st moment is obtained EF _t+1 = EF _t ∪X _j ; after assigning t+1 to t, execute Step 8;

步骤9、对第t时刻所选择的监测节点向量集EF_t的第k个监测节点向量X_k进行冗余校验分析；Step 9, performing redundancy check analysis on the kth monitoring node vector X _k of the monitoring node vector set EF _t selected at the tth moment;

步骤9.3、获取相关监测节点向量集MB(X_k)中第s个监测节点向量的下标记为τ_s；Step 9.3, obtaining the subscript of the sth monitoring node vector in the relevant monitoring node vector set MB(X _k ) is τ _s ;

和第k个监测节点向量X_k的相关程度

The degree of correlation with the kth monitoring node vector X _k

步骤9.5、将相关程度

和第k个监测节点向量X_k相关，并执行步骤9.7；Step 9.5, the degree of correlation

Correlate with the kth monitoring node vector X _k , and execute step 9.7;

步骤9.6、从第k个监测节点向量X_k的相关监测节点向量集MB(X_k)中删除第τ_s个监测节点向量

即

并将第τ_s个监测节点向量

加入第k个监测节点向量X_k的冗余监测节点向量集FD(X_k)中，即

从第τ_s个监测节点向量

的相关监测节点向量集

中删除第k个监测节点向量X_k，即

并将第k个监测节点向量X_k加入第τ_s个监测节点向量

的冗余监测节点向量集

即

Step 9.6. Delete the τ s monitoring node vector from the related monitoring node vector set MB(X _k ) of _the _k monitoring node vector X k

Right now

and the τ _s monitoring node vector

From the τ _sth monitoring node vector

The related monitoring node vector set of

Delete the kth monitoring node vector X _k in , namely

And add the k-th monitoring node vector X _k to the τ _s -th monitoring node vector

Redundancy monitoring node vector set of

Right now

步骤12、判断第k个监测节点向量X_k的相关监测节点向量集MB(X_k)是否为空集，若为空集，则从第t时刻所选择的监测节点向量集EF_t中删除第k个监测节点向量X_k后，再将count+1赋值给count后，执行步骤13；否则直接执行步骤13；Step 12. Determine whether the related monitoring node vector set MB(X _k ) of the kth monitoring node vector X _k is an empty set, and if it is an empty set, delete the monitoring node vector set EF _t selected at the tth moment After k monitoring node vectors X _k , assign count+1 to count, then execute step 13; otherwise, directly execute step 13;

步骤14、对第t时刻所选择的监测节点向量集EF_t′的第j个监测节点向量X′_j进行在线的局部定向，从而得到第t时刻因果结构图；Step 14. Carry out online local orientation to the jth monitoring node vector _X'j of the monitoring node vector set EF _t ' selected at the tth moment, so as to obtain the causal structure diagram at the tth moment;

步骤14.3、当第g个监测节点向量X_g作为第j个监测节点向量X_j的父监测节点向量，即X_g→X_j时，利用最小二乘互信息方法的p-value值计算X_g→X_j方向的支持度，并记为p-value(X_g,X_j)；采用最小二乘互信息方法的p-value表示方向的支持度原因在于，如MakotoYamada和Masashi Sugiyama所著的文献《Dependence Minimizing Regression withModel Selection forNon-Linear Causal Inference underNon-GaussianNoise》，对于非线性非高斯的数据，最小二乘互信息方法的p-value可以衡量方向的支持度，计算方法见上述论文。Step 14.3. When the g-th monitoring node vector X _g is used as the parent monitoring node vector of the j-th monitoring node vector X _j , that is, X _g → X _j , use the p-value value of the least squares mutual information method to calculate X _g → The support degree in the X _j direction, and recorded as p-value(X _g , X _j ); the p-value using the least squares mutual information method indicates the support degree of the direction. The reason is that, such as the literature written by MakotoYamada and Masashi Sugiyama "Dependence Minimizing Regression with Model Selection forNon-Linear Causal Inference underNon-GaussianNoise", for nonlinear and non-Gaussian data, the p-value of the least squares mutual information method can measure the support of the direction, and the calculation method is shown in the above paper.

步骤17、实时监测任意一个监测节点的运行状态并获得相应的燃气轮机运行数据集，并利用故障预测模型得到实时监测的监测节点的预测值，再将预测值与实时监测的监测节点的真实值进行比较，当超过所设定的阈值，则表示相应监测节点可能发生故障，并给出预警提示。Step 17. Monitor the operation status of any monitoring node in real time and obtain the corresponding gas turbine operation data set, and use the fault prediction model to obtain the predicted value of the real-time monitored monitoring node, and then compare the predicted value with the real value of the real-time monitored monitoring node In comparison, when the set threshold is exceeded, it indicates that the corresponding monitoring node may fail, and an early warning prompt is given.

Claims

1. A gas turbine fault prediction method based on correlation analysis is applied to a gas turbine system, and the operation states of Z monitoring nodes in the gas turbine system are monitored at intervals, so that a gas turbine operation data set D consisting of m monitoring value vectors is obtained and is recorded as D = { sam ₁ ,sam ₂ ,...,sam _v ,...,sam _m Wherein, sam _v Represents a vector of the v-th monitored value, and

representing the monitoring value of the ith monitoring node in the v-th monitoring value vector; v is more than or equal to 1 and less than or equal to m, i is more than or equal to 1 and less than or equal to Z; defining a vector formed by monitoring values of the ith monitoring node under m-time monitoring as X _i The method is characterized in that the fault prediction of the gas turbine is carried out according to the following steps:

step 1, defining a time t, and initializing t =0;

defining the limit value of the number Z of the monitoring nodes as max, namely Z is less than or equal to max;

step 2, defining the selected monitoring node vector set as EF, and initializing the selected monitoring node vector set at the t-th moment as

Step 3, defining a variable j, and initializing j =1;

step 4, judging whether the j is less than or equal to Z, if so, reading a jth monitoring node vector X with m values from a gas turbine operation data set D _j (ii) a And initializing the jth monitor node vector X _j Relevant monitoring node vector set MB (X) _j ) Initializing jth monitoring node vector X for null _j Newly added monitor node vector set FA (X) _j ) Initializing jth monitoring node vector X for null _j Redundant monitor node vector set FD (X) _j ) Is empty; then step 5 is executed; otherwise, the final causal structure diagram formed by the monitoring nodes is obtained, wherein the father node and the child node of each monitoring node are the monitoring nodes related to the corresponding monitoring node, and step 16 is executed;

step 5, judging whether j =1 is true, if so, determining the jth monitoring node vector X _j Adding the selected monitoring node vector set EF at the t moment _t Thereby obtaining the selected monitoring node vector set EF at the t +1 th moment _t+1 (ii) a Assigning t +1 to t and j +1 to j, and returning to the step 4; otherwise, executing step 6;

step 6, for the jth monitoring node vector X _j Performing correlation analysis;

step 7, judging the jth monitoring node vector X _j Of the relevant monitoring node vector set MB (X) _j ) If the set is an empty set, returning to the step 4; otherwise, the jth monitoring node vector X is used _j Adding the monitoring node vector set EF selected at the t moment _t FromAnd obtaining a monitoring node vector set EF selected at the t +1 th moment _t+1 ＝EF _t ∪X _j (ii) a Assigning t +1 to t, and executing the step 8;

step 8, defining a variable k, and initializing k =1;

step 9, for the monitoring node vector set EF selected at the t-th moment _t Of the kth monitoring node vector X _k Performing redundancy check analysis;

step 10, assigning k +1 to k, judging whether k is greater than j, and executing step 11 if k is greater than j; otherwise, returning to the step 9 for execution;

step 11, defining a variable count; and initializing count =0; initializing k =1;

step 12, judging the k-th monitoring node vector X _k Relevant monitoring node vector set MB (X) _k ) Whether the vector set is an empty set or not, if the vector set is the empty set, selecting a monitoring node vector set EF from the t-th moment _t Deleting the k-th monitoring node vector X _k Then, after assigning the count +1 to the count, executing step 13; otherwise, directly executing step 13;

step 13, assigning k +1 to k; judging whether k is more than j, if so, assigning j-count to j, and then obtaining an updated monitoring node vector set EF selected at the t-th moment _t ', is marked as EF _t ′＝{X′ ₁ ,X′ ₂ ,...,X′ _i ,...X′ _j-count }；X′ _i Set of monitoring node vectors EF representing the updated t-th moment _t The ith monitor node vector in' and, in combination,

representing the ith monitor node vector X' _i The middle-th monitoring value; i is more than or equal to 1 and less than or equal to j-count; assigning j-count to j, and executing step 14; otherwise, returning to the step 12 for execution;

step 14, for the monitoring node vector set EF selected at the t-th time _t 'j-th monitor node vector X' _j Carrying out on-line local orientation to obtain a t-time cause and effect structure diagram;

step 15, assigning j +1 to j, and returning to the step 4;

step 16, selecting a monitoring node vector of a monitoring node at will, and using the monitoring node vector as the output of the LSTM neural network model, and using the monitoring node vector related to the selected monitoring node as the input of the LSTM neural network model, so as to train the LSTM neural network model, thereby obtaining a fault prediction model;

and step 17, monitoring the operation state of any monitoring node in real time, obtaining a corresponding gas turbine operation data set, obtaining a predicted value of the monitoring node monitored in real time by using the fault prediction model, comparing the predicted value with a real value of the monitoring node monitored in real time, indicating that the corresponding monitoring node is likely to have a fault when the predicted value exceeds a set threshold value, and giving an early warning prompt.

2. The correlation analysis-based gas turbine engine failure prediction method of claim 1, wherein the step 6 is performed as follows:

step 6.1, setting a correlation threshold value as alpha;

step 6.2, defining a variable w; and initializing w =1; defining a variable theta;

step 6.3, calculating the jth monitoring node vector X by using a Hilbert-Schmidt independence criterion _j And w-th monitor node vector X _w Degree of correlation HSIC _jw ；

Step 6.4, correlating degree HSIC _jw Assigning to theta, judging whether theta is more than or equal to alpha, and if so, representing the jth monitoring node vector X _j And w-th monitor node vector X _w Correlating and executing step 6.5; otherwise, the j monitoring node vector X is represented _j And the w-th monitor node vector X _w Independently, and perform step 6.6;

step 6.5, monitoring the jth node vector X _j Vector X added to the w-th monitor node _w In a related monitorVector set MB (X) of measured nodes _w ) In (1), i.e. MB (X) _w )＝MB(X _w )∪X _j The jth monitor node vector X _j Adding newly added monitoring node vector set FA (X) _w ) In (1), namely FA (X) _w )＝FA(X _w )∪{X _j H, thereby updating the w-th monitor node vector X _w Of the relevant monitoring node vector set MB (X) _w ) And a newly added monitoring node vector set FA (X) _w ) (ii) a At the same time, the w-th monitoring node vector X _w Vector X added to jth monitor node _j Of the relevant monitoring node vector set MB (X) _j ) I.e. MB (X) _j )＝MB(X _j )∪X _w The w-th monitor node vector X _w Adding newly added monitoring node vector set FA (X) _j ) In (1), namely FA (X) _j )＝FA(X _j )∪{X _w H, thereby updating the jth monitor node vector X _j Relevant monitoring node vector set MB (X) _j ) And a newly added monitoring node vector set FA (X) _j ) (ii) a Then step 6.6 is executed;

6.6, assigning k +1 to k, judging whether k is greater than j-1, and if so, executing step 7; otherwise, the step 6.3 is returned to for execution.

3. The correlation analysis-based gas turbine engine failure prediction method of claim 1, wherein the step 9 of on-line redundancy check analysis is performed by the steps of:

step 9.1, setting a redundancy threshold value beta; calculating the kth monitoring node vector X _k Of the relevant monitoring node vector set MB (X) _k ) The number of the monitoring node vectors in (1) is recorded as S _k ；

Step 9.2, defining a variable s; and initializing s =1; defining a variable sigma;

step 9.3, obtaining the relevant monitoring node vector set MB (X) _k ) The vector of the middle s-th monitoring node is denoted by tau _s ；

Step 9.4, calculating the t < th > by using Hilbert-Schmidt independence criterion _s Vector of monitoring nodes

And the kth monitoring node vector X _k Degree of correlation of

Step 9.5, correlating the degree of correlation

Assigning to sigma, judging whether sigma is less than or equal to beta, if yes, representing the tau _s Vector of monitoring nodes

And the kth monitoring node vector X _k Irrelevant, namely, the node is a redundant monitoring node, and step 9.6 is executed; otherwise, it indicates the τ th _s Vector of monitoring nodes

And the kth monitoring node vector X _k Correlation and step 9.7 is performed;

step 9.6, from the k-th monitoring node vector X _k Of the relevant monitoring node vector set MB (X) _k ) In which the τ th is deleted _s A vector of monitoring nodes

Namely that

And will be tau _s Vector of monitoring nodes

Adding the kth monitoring node vector X _k Redundant monitor node vector set FD (X) _k ) In, i.e.

From the th τ _s Vector of monitoring nodes

Relevant monitoring node vector set

Deleting the k-th monitoring node vector X _k I.e. by

And the kth monitoring node vector X _k Adding the τ th _s A vector of monitoring nodes

Redundant monitoring node vector set

Namely, it is

Step 9.7, assigning s +1 to s; and judging S > S _k If yes, executing step 10; otherwise, the step 9.3 is returned to for execution.

4. The correlation analysis-based gas turbine engine fault prediction method of claim 1, wherein said step 14 is performed by:

step 14.1, setting a direction support degree threshold value as gamma;

step 14.2 from the jth monitor node vector X _j Of the relevant monitoring node vector set MB (X) _j ) Optionally one monitoring node vector X _g And the g-th monitoring node vector X _g Vector set MB (X) from related monitoring nodes _i ) Deleting;

step 14.3, when the g-th monitoring node vector X _g As the jth monitor node vector X _j Of parent monitoring node vectors, i.e. X _g →X _j When usingP-value calculation X of least square mutual information method _g →X _j Support of orientation and is denoted as p-value (X) _g ,X _j )；

Step 14.4, when the jth monitoring node vector X _j As the g-th monitor node vector X _g Of parent monitoring node vectors, i.e. X _j →X _g Then, X is calculated by using the p-value of the least square mutual information method _j →X _g Support of orientation and is denoted as p-value (X) _j ,X _g )；

Step 14.5, if p-value (X) _g ,X _j ) Gamma or p-value (X) _j ,X _g ) Gamma is less than or equal to gamma, then X is represented _g →X _j The direction has larger support degree and is oriented as X _g →X _j ；

If p-value (X) _j ,X _g ) Gamma or p-value (X) _g ,X _j ) Gamma is less than or equal to gamma, then X is represented _j →X _g The direction has larger support degree and is oriented as X _j →X _g ；

If p-value (X) _j ,X _g ) Gamma or p-value (X) _g ,X _j ) Gamma or p-value (X) _g ,X _j ) Gamma or p-value (X) _j ,X _g ) If the value is more than gamma, no causal relationship exists between the two monitoring node vectors, and no orientation is needed;

step 14.6, if the jth monitoring node vector X _j Of the relevant monitoring node vector set MB (X) _j ) If it is empty, step 15 is executed, otherwise, step 14.2 is returned to.