CN112904810B

CN112904810B - Process industry nonlinear process monitoring method based on effective feature selection

Info

Publication number: CN112904810B
Application number: CN202110044130.5A
Authority: CN
Inventors: 袁小锋; 网文聪; 王雅琳; 王凯; 阳春华; 桂卫华
Original assignee: Central South University
Current assignee: Central South University
Priority date: 2021-01-13
Filing date: 2021-01-13
Publication date: 2022-07-01
Anticipated expiration: 2041-01-13
Also published as: CN112904810A

Abstract

The invention provides a process industry nonlinear process monitoring method based on effective feature selection, which comprises the following steps: step 1, acquiring process variables during normal operation of a process industry, constructing a normal data set according to the acquired process variables, and performing data preprocessing on the normal data set; and 2, inputting the preprocessed normal data sample into a preset stacking self-encoder model, extracting hidden layer characteristics of the preprocessed normal data sample, pre-training a plurality of self-encoders, sequentially connecting and stacking input layers and hidden layers of the pre-trained self-encoders, constructing the trained stacking self-encoder model, and finely adjusting parameters of the trained stacking self-encoder model to obtain the hyper-parameters of the trained stacking self-encoder model. According to the invention, the extracted hidden layer characteristics are selected through an effective characteristic selection strategy, and the characteristics more effective for fault monitoring are selected, so that the fault monitoring accuracy is greatly improved, and the fault monitoring effect is greatly improved.

Description

Process industry nonlinear process monitoring method based on effective feature selection

Technical Field

The invention relates to the technical field of process industry, in particular to a process industry nonlinear process monitoring method based on effective characteristic selection.

Background

The process industry refers to modern manufacturing industry that produces process-based products, primarily processing products based on continuous and batch material flow, energy flow. The processing method of the product mainly comprises chemical and physical reactions, separation, mixing and the like, and the process industry is always the prop industry for national economy and social development in China in recent decades. With the development of production capacity and science and technology, the Chinese process industry has been greatly improved and advanced. Nowadays, the industrial production process faces new situations and new problems, and automation, informatization and intellectualization become development directions of transformation and upgrading, safe production, energy conservation and emission reduction in the industrial production process. Therefore, the capacity and the modernization level of the system are gradually improved, the industrial scale is rapidly expanded, and for a large-scale production system, the occurrence of system abnormity can cause production loss, operation unit damage and even casualties. In order to ensure the normal operation and product quality of an industrial system and ensure the safety of personnel and property, the existence of abnormal working conditions needs to be identified in real time and the operation and production recovery conditions need to be given in time, namely, the industrial production process is monitored on line.

Computer technology and distributed control systems are widely used in the process industry today to allow large amounts of process variable data to be collected and utilized, however, many product manufacturing process failure events are still handled manually by operators. Due to the lack of experience and capability of operators, when the abnormal conditions of production are faced, the adoption of wrong coping modes is inevitable, so that the industrial production cannot run stably. Therefore, the key content of the research is to introduce a computer control system to monitor the process industrial production process accurately in real time and to guide and assist field operators when dealing with production faults.

In recent years, industrial data is collected more conveniently and rapidly, and in many process monitoring research methods, a qualitative analysis method and a quantitative analysis method are mostly used for implementing real-time monitoring of an industrial process. Qualitative analysis methods describe the state of a system by analyzing the relationships between system variables or parameters, and common methods include: an expert system method, a graph theory method and a qualitative simulation method. Quantitative analysis is a method of analyzing the quantitative characteristics, quantitative relationships and quantitative variations of industrial data using a mathematical module to reveal and describe the correlation and development trends among process variables. Generally, quantitative analysis methods are classified into analytical model-based methods and data-driven methods.

Analytical model based process monitoring methods were studied earlier. The method comprises hidden layer knowledge of the inside of the system, but the method is only suitable for the monitored object to have an accurate mathematical model, the modeling process is complex, and in practice, the accurate mathematical model of the industrial process is often difficult to obtain, and the method cannot obtain a satisfactory monitoring effect. Currently, in modern industrial plants, different operating units can measure a large number of variables that reflect the operating state of an industrial process, thereby providing opportunities for data-driven based methods. Data-driven based process monitoring is an effective technique that can extract information from process data that is readily available in modern process systems, thereby providing a better understanding of the process without the need to know the exact analytical model of the system. Due to the nature of this data-driven approach, the data-driven monitoring method is well suited for complex and large-scale flow systems. The data driving method uses historical data to establish a process monitoring model, and then obtains a monitoring index of fault detection.

However, the data-driven based process monitoring also has the following problems. The effective characteristic information extracted by the model is insufficient, how to remove redundant information, and extracting key information is also an important aspect for improving the monitoring performance of the system. However, the existing feature extraction method for process monitoring cannot extract key effective features from the highly nonlinear and dynamically changing data, and the monitoring performance is affected. The stack self-encoder is used as a method for extracting features, a plurality of self-encoder networks are trained through nonlinear mapping, high-dimensional data are converted into low-dimensional features, abstract key features of input data can be obtained, and nonlinear relations of variables can be extracted.

Disclosure of Invention

The invention provides a process industry nonlinear process monitoring method based on effective feature selection, and aims to solve the problems that a traditional self-encoder model only focuses on feature extraction for reconstructing original input as much as possible, and feature information for increasing the difference between fault data and normal data is extracted from the fault detection angle.

In order to achieve the above object, an embodiment of the present invention provides a process industry nonlinear process monitoring method based on effective feature selection, including:

step 1, acquiring process variables during normal operation of a process industry, constructing a normal data set according to the acquired process variables, and performing data preprocessing on the normal data set;

step 2, inputting the preprocessed normal data samples into a preset stacked self-encoder model, extracting hidden layer characteristics of the preprocessed normal data samples, pre-training a plurality of self-encoders, sequentially connecting and stacking input layers and hidden layers of the pre-trained self-encoders, constructing the trained stacked self-encoder model, and finely adjusting parameters of the trained stacked self-encoder model to obtain hyper-parameters of the trained stacked self-encoder model;

step 3, selecting effective characteristics in the hidden layer characteristics of the preprocessed normal data sample through an effective characteristic selection strategy, calculating the statistic of the normal data concentrated sample according to the effective characteristics of the hidden layer characteristics of the selected normal data sample, and calculating the control limit of the statistic through kernel density estimation;

step 4, acquiring real-time process variables in the process industrial operation process, constructing an online detection data set according to the acquired real-time process variables, and performing data preprocessing on the online detection data set;

step 5, inputting the preprocessed online detection data sample into the trained stacked self-encoder model, extracting hidden layer characteristics of the preprocessed online detection data sample, selecting effective characteristics in the hidden layer characteristics through an effective characteristic selection strategy, and calculating statistics of online detection data concentrated samples according to the selected effective characteristics of the hidden layer characteristics;

step 6, comparing the statistic of each sample in the online detection data set with the control limit of the statistic, judging whether the statistic of each sample in the online detection data set exceeds the control limit of the statistic, when the statistic of a certain online detection data sample in the online detection data set exceeds the control limit of the statistic, alarming and prompting are carried out, and the current online detection data sample is judged as the online detection data sample of the fault, the number of the online detection data samples judged as the fault is recorded, when the statistic of one online detection data sample in the online detection data set does not exceed the control limit of the statistic, the current online detection data sample is judged as a normal online detection data sample, and calculating the fault detection rate according to the number of the fault data samples in the online detection data set and the number of the online detection data samples which are judged to be faults in the online detection data set.

Wherein, the step 1 specifically comprises:

collecting process variables according to a certain time sequence based on normal process industry, and constructing a normal data set according to the collected process variables, wherein the normal data set is represented as X₁＝[x₁,x₂,...,x_n]^T∈R^n×mWherein n represents the number of normal data sets, and m represents the number of process variables;

to a normal data set X₁＝[x₁,x₂,...,x_n]^T∈R^n×mData preprocessing was performed as follows:

where i denotes a normal data sample, i 1,2., n,

the mth process variable representing the ith normal data sample,

represents the maximum value of the mth process variable in the normal data sample,

representing the minimum of the mth process variable in the normal data sample.

Wherein, the step 2 specifically comprises:

inputting the preprocessed normal data set into a first self-encoder of a preset stacked self-encoder model for model training, wherein the structures of the self-encoder and a decoder are as follows:

h_i＝f(x_i；θ₁)＝f(W₁x_i+b₁),θ₁＝{W₁,b₁} (2)

r_i＝g(h_i；θ₂)＝g(W₂h_i+b₂),θ₂＝{W₂,b₂} (3)

wherein, h ═ f (x) denotes an activation function of the hidden layer, and r ═ g (h) denotes an activation function of the output layer; theta₁Set of parameters, θ, representing the hidden layer₂A parameter set representing the output layer, k representing the number of nodes of the hidden layer, x_i∈R^mRepresents the ith normal data sample, h_i∈R^kRepresenting the hidden layer features extracted by the self-encoder, g (-) represents the decoding function, r_i∈R^mRepresenting the output vector, W₁Representing the weight of the encoder, W₁∈R^k×m，W₂Representing the weight of the decoder, W₂∈R^m×k，b₁Representing the bias parameters of the encoder, b₁∈R^k，b₂Representing the offset parameters of the decoder, b₂∈R^m。

Wherein, the step 2 further comprises:

inputting the normal data set into a first self-encoder of a preset stacked self-encoder model, and extracting hidden layer characteristics through the first self-encoder;

inputting the extracted hidden layer characteristics into a subsequent self-encoder, and extracting the hidden layer characteristics through the subsequent self-encoder;

repeatedly inputting the hidden layer characteristic data extracted by the previous self-encoder into the next self-encoder, extracting the hidden layer characteristic through the next self-encoder until no next self-encoder stops, and obtaining the weight of each self-encoder, the bias parameter of each self-encoder and the hidden layer characteristic of each self-encoder;

sequentially connecting the self-coders after pre-training according to an input layer and a hidden layer to construct a trained stacked self-coder model;

and carrying out global fine adjustment on the parameters of the trained stacked self-encoder model through a back propagation algorithm.

Wherein, the step 3 specifically comprises:

final extraction of stacked autocoder by active feature selection strategyThe feature of the hidden layer is extracted effectively, and statistic T constructed in the feature space²As follows:

defining a calculation formula of the contribution degree of each feature of the normal data set hidden layer according to the formula (4) and calculating the contribution degree of each feature of the normal data set hidden layer as follows:

wherein, c_jRepresenting the contribution of the jth feature of the hidden layer of the normal data set, f representing the self-encoder activation function, X₁Representing a normal data set, w_1jSelf-encoder weights representing the jth feature of the hidden layer of the normal data set,

a bias parameter representing the jth characteristic of the hidden layer of the normal data set,

means, σ, representing the jth feature in the hidden layer of normal data set extraction_1j ²Representing the variance of the jth feature in the hidden layer extracted from the normal data set.

Wherein, the step 3 further comprises:

and performing descending order on the calculated contribution degrees of the characteristics of the hidden layer, selecting d characteristics with the largest contribution degree, wherein the hyper-parameter d of the trained stacked self-encoder model meets the following conditions as follows:

C(d)≥C^* (6)

wherein, C^*Indicates a predetermined threshold value, C^*Set between 80% and 90%;

assuming that the hidden layer of the I-th self-encoder has y features, the contribution degrees of the y features are in descending orderIs arranged as c₁≥c₂≥...≥c_yThe minimum trained hyper-parameter d of the stacked self-coder model satisfying the condition of equation (6) is derived as follows:

wherein d represents a hyper-parameter of the trained stacked self-encoder model, d is more than or equal to 1 and less than or equal to y, y represents the number of features, k represents the kth hidden layer feature, c_kRepresenting the contribution degree of the characteristic of the kth hidden layer;

statistics of samples in the normal dataset were calculated as follows:

wherein, T₁ ²Statistics representing samples in the normal data set, H_1jRepresenting the value of the jth feature in the hidden layer extracted from the normal data set.

Wherein, the step 3 further comprises:

the control limit of the statistic is calculated by kernel density estimation as follows:

wherein p (X) denotes a normal data set X₁Of a probability distribution function of X_1iThe ith normal data sample of the normal data set is represented, n represents the number of the normal data samples, h represents a bandwidth parameter, and K (·) represents a kernel function;

the kernel function K (·) satisfies the following condition:

K(x)≥0 (11)。

wherein, the step 4 specifically comprises:

acquiring real-time process variables in the process industry, and constructing an online detection data set according to the acquired real-time process variables, wherein the online detection data set is represented as X₂＝[x₁,x₂,...,x_n]^T∈R^n×mThe online detection data set comprises normal online detection data samples and fault data samples, and the online detection data set X₂＝[x₁,x₂,...,x_n]^T∈R^n×mData preprocessing was performed as follows:

where θ represents an online detection data sample, θ ═ 1,2., n,

represents the mth process variable of the theta online test data sample.

Wherein, the step 5 specifically comprises:

inputting the preprocessed online detection data sample into the trained stacked self-encoder model, extracting the characteristics of the hidden layer of the online detection data sample, selecting the effective characteristics of the hidden layer of the online detection data sample through an effective characteristic selection strategy, and calculating the statistic of the online detection data concentrated sample according to the effective characteristics of the selected characteristics of the hidden layer of the online detection data sample, wherein the statistic is as follows:

wherein, T₂ ²Statistics representing samples in the on-line test dataset, H_2jAnd representing the value of the jth feature in the hidden layer extracted by the online detection data set.

Wherein, the step 6 specifically comprises:

the failure detection rate was calculated as follows:

wherein, fdr represents the fault detection rate, a represents the number of online detection data samples judged to be fault in the online detection data set, and s represents the number of fault data samples in the online detection data set.

The scheme of the invention has the following beneficial effects:

according to the method for monitoring the nonlinear process of the process industry based on the effective feature selection, the extracted original features are selected through the effective feature selection strategy, the features more effective for fault monitoring are selected, the fault monitoring accuracy is greatly improved, and the fault monitoring effect is greatly improved.

Drawings

FIG. 1 is a flow chart of the present invention;

FIG. 2 is a schematic diagram of the TE process of the present invention;

FIG. 3 is a comparison of the failure detection rates in feature space according to the present invention;

FIG. 4 is a comparison diagram of the fault detection rate in the residual error space according to the present invention.

Detailed Description

In order to make the technical problems, technical solutions and advantages of the present invention more apparent, the following detailed description is given with reference to the accompanying drawings and specific embodiments.

The invention provides a process industry nonlinear process monitoring method based on effective feature selection, aiming at the problems that the existing self-encoder model only focuses on feature extraction for reconstructing original input as much as possible and ignores feature information for increasing the difference between fault data and normal data extracted from the fault detection angle.

As shown in fig. 1 to 4, an embodiment of the present invention provides a process industry nonlinear process monitoring method based on active feature selection, including: step 1, acquiring process variables during normal operation of a process industry, constructing a normal data set according to the acquired process variables, and performing data preprocessing on the normal data set; step 2, inputting the preprocessed normal data sample into a preset stacking self-encoder model, extracting hidden layer characteristics of the preprocessed normal data sample, pre-training a plurality of self-encoders, sequentially connecting and stacking input layers and hidden layers of the pre-trained self-encoders, constructing the trained stacking self-encoder model, and finely adjusting parameters of the trained stacking self-encoder model to obtain hyper-parameters of the trained stacking self-encoder model; step 3, selecting effective characteristics in the hidden layer characteristics of the preprocessed normal data sample through an effective characteristic selection strategy, calculating the statistic of the normal data concentrated sample according to the effective characteristics of the hidden layer characteristics of the selected normal data sample, and calculating the control limit of the statistic through kernel density estimation; step 4, acquiring real-time process variables in the process industrial operation process, constructing an online detection data set according to the acquired real-time process variables, and performing data preprocessing on the online detection data set; step 5, inputting the preprocessed online detection data sample into the trained stacked self-encoder model, extracting hidden layer characteristics of the preprocessed online detection data sample, selecting effective characteristics in the hidden layer characteristics through an effective characteristic selection strategy, and calculating statistics of online detection data concentrated samples according to the selected effective characteristics of the hidden layer characteristics; step 6, comparing the statistic of each sample in the online detection data set with the control limit of the statistic, judging whether the statistic of each sample in the online detection data set exceeds the control limit of the statistic, when the statistic of a certain online detection data sample in the online detection data set exceeds the control limit of the statistic, alarming and prompting are carried out, and the current online detection data sample is judged as the online detection data sample of the fault, the number of the online detection data samples judged as the fault is recorded, when the statistic of one online detection data sample in the online detection data set does not exceed the control limit of the statistic, the current online detection data sample is judged as a normal online detection data sample, and calculating the fault detection rate according to the number of the fault data samples in the online detection data set and the number of the online detection data samples which are judged to be faults in the online detection data set.

Wherein, the step 1 specifically comprises: collecting process variables according to a certain time sequence based on normal process industry, and constructing a normal data set according to the collected process variables, wherein the normal data set is represented as X₁＝[x₁,x₂,...,x_n]^T∈R^n×mWherein n represents the number of normal data sets, and m represents the number of process variables;

for a normal data set X₁＝[x₁,x₂,...,x_n]^T∈R^n×mData preprocessing was performed as follows:

where i denotes a normal data sample, i 1,2., n,

the mth process variable representing the ith normal data sample,

represents the maximum of the mth process variable in the normal data sample,

representing the minimum of the mth process variable in the normal data sample.

Wherein, the step 2 specifically comprises: inputting the preprocessed normal data set into a first self-encoder of a preset stack self-encoder model for model training, wherein the structures of the self-encoder and a decoder are as follows:

h_i＝f(x_i；θ₁)＝f(W₁x_i+b₁),θ₁＝{W₁,b₁} (2)

r_i＝g(h_i；θ₂)＝g(W₂h_i+b₂),θ₂＝{W₂,b₂} (3)

Wherein, the step 2 further comprises: inputting the normal data set into a first self-encoder of a preset stacked self-encoder model, and extracting hidden layer characteristics through the first self-encoder; inputting the extracted hidden layer characteristics into a subsequent self-encoder, and extracting the hidden layer characteristics through the subsequent self-encoder; repeatedly inputting the hidden layer characteristic data extracted by the previous self-encoder into the next self-encoder, extracting the hidden layer characteristic through the next self-encoder until no next self-encoder stops, and obtaining the weight of each self-encoder, the bias parameter of each self-encoder and the hidden layer characteristic of each self-encoder; sequentially connecting the self-coders after pre-training according to an input layer and a hidden layer to construct a trained stacked self-coder model; and carrying out global fine adjustment on the parameters of the trained stacked self-encoder model through a back propagation algorithm.

In the method for monitoring a nonlinear process of process industry based on effective feature selection according to the embodiment of the invention, the principle of the self-encoder is to reconstruct the input at the output part through the learning of the hidden layer of the network, i.e. let r be_iApproximation x_iThen, through a back propagation algorithm, an error back propagation training between the two is utilized to obtain a pre-trained self-encoder, and the training of the l-1 self-encoder is completedAnd (3) taking the (l-1) th hidden layer characteristic as the input of the (l) th self-encoder, obtaining the weight and the bias parameter of the (l) th self-encoder, and extracting the (l) th hidden layer characteristic, wherein the activation function can be a sigmoid, a tanh hyperbolic tangent function or a ReLu linear rectification function.

Wherein, the step 3 specifically comprises: performing effective feature extraction on the features of the hidden layer finally extracted by the stacked self-encoder through an effective feature selection strategy, and constructing a statistic T in a feature space²As follows:

wherein, c_jRepresenting the contribution of the jth feature of the hidden layer of the normal data set, f representing the self-encoder activation function, X₁A normal data set is represented that is,

self-encoder weights representing the jth feature of the hidden layer of the normal data set,

Wherein, the step 3 further comprises: and performing descending order on the calculated contribution degrees of the characteristics of the hidden layer, selecting d characteristics with the largest contribution degree, wherein the hyper-parameter d of the trained stacked self-encoder model meets the following conditions as follows:

C(d)≥C^* (6)

assuming that the hidden layer of the I-th self-encoder has y features, the contribution degrees of the y features are arranged in a descending order as c₁≥c₂≥...≥c_yThe minimum trained hyper-parameter d of the stacked self-coder model satisfying the condition of equation (6) is derived as follows:

statistics of samples in the normal dataset were calculated as follows:

Wherein, the step 3 further comprises: the control limit of the statistic is calculated by kernel density estimation as follows:

wherein p (X) denotes a normal data set X₁Of a probability distribution function of X_1iI normal data sample of the normal data set, n normal data sample number, and h normal data sample numberThe bandwidth parameter, K (-) represents the kernel function;

the kernel function K (·) satisfies the following condition:

K(x)≥0 (11)。

wherein, the step 4 specifically comprises: acquiring real-time process variables in the process industry, and constructing an online detection data set according to the acquired real-time process variables, wherein the online detection data set is represented as X₂＝[x₁,x₂,...,x_n]^T∈R^n×mThe online detection data set comprises normal online detection data samples and fault data samples, and the online detection data set X₂＝[x₁,x₂,...,x_n]^T∈R^n×mData preprocessing was performed as follows:

where θ represents an online detection data sample, θ ═ 1,2., n,

represents the mth process variable of the theta online test data sample.

The method for monitoring the process industry nonlinear process based on the effective feature selection according to the embodiment of the invention verifies the prediction performance of the trained stacked self-coding model through the online detection data set.

Wherein, the step 5 specifically comprises: inputting the preprocessed online detection data sample into the trained stacked self-encoder model, extracting the characteristics of the hidden layer of the online detection data sample, selecting the effective characteristics of the hidden layer of the online detection data sample through an effective characteristic selection strategy, and calculating the statistic of the online detection data concentrated sample according to the effective characteristics of the selected characteristics of the hidden layer of the online detection data sample, wherein the statistic is as follows:

wherein, T₂ ²Statistics representing samples in an on-line test dataset, H_2jAnd representing the value of the jth feature in the hidden layer extracted by the online detection data set.

Wherein, the step 6 specifically comprises: the failure detection rate was calculated as follows:

wherein, fdr represents the fault detection rate, a represents the number of online detection data samples judged as faults in the online detection data set, and s represents the number of fault data samples in the online detection data set.

According to the method for monitoring the process industry nonlinear process based on the effective feature selection, the training process of the self-encoder model is divided into two steps of pre-training layer by layer and reverse fine tuning, each self-encoder is pre-trained layer by layer, a normal data set is input and passes through a first self-encoder, and first hidden layer feature data are extracted; taking the first hidden layer characteristic data as the input of a second self-encoder, and extracting second hidden layer characteristic data; repeating the steps of taking hidden layer feature data of a previous self-encoder as input of hidden layer feature data of a next self-encoder and extracting hidden layer feature data of the next self-encoder, wherein l represents the position of the self-encoder in the stacked self-encoder model, and l is 1,2, …, n; for the training of a subsequent self-encoder (i.e. when l > -2), taking l-1 hidden layer feature data as the input of the first self-encoder, further obtaining the weight and the bias parameter of the first self-encoder, and extracting the first hidden layer feature data; sequentially connecting the pre-trained self-coders according to the input layer and the hidden layer to form a stacked self-coder model, and finally performing global fine tuning by utilizing back propagation on the basis of the parameters obtained by pre-training to train the stacked self-coder model. In the reverse fine tuning step, parameters of the stacked self-encoder model are fine-tuned by constructing a loss function of the stacked self-encoder model by using a back propagation algorithm until the network converges to a target range.

In this embodiment, the method of monitoring the TE process in the process industry includes the following steps: the control architecture for the TE process (Tennessee Eastman, TE, Tennessee-Eastman) comprises five major operating units, a reactor, a condenser, a separator, a compressor, and a stripper. The method comprises 22 measurement variables, 19 component variables and 12 operation variables, 11 operation variables and 22 continuous variables are selected for process monitoring, 21 fault data sets are used as a test set in the TE process, each data set comprises 960 samples, faults are introduced from the 161 st sample, 500 samples are used as normal data sets, normal data are collected for the TE process, and a stacked self-encoder model is 33->30->27->30->33, i.e. the input dimension is 33, and the number of self-encoders is 2. The parameters from the input layer to the hidden layer of the two coders are w₁、w₂、b₁、b₂The hidden layer activation functions of the two autoencoders are g₁、g₂}。

As shown in table 1, the feature space and residual space fault detection rate of the three models, which are Principal Component Analysis (PCA), Stacked Auto Encoder (SAE), and active feature selection stacked auto encoder (SAE-CS), on the fault in the test set 21 is given.

TABLE 1 Fault detection Rate for three models in TE Process

It can be seen from the table that the fault detection rate of the stacked self-encoder selected by the effective features is very good in 21 types of faults of the TE data set, and the detection rate of the stacked self-encoder selected by the effective features is generally better than that of the original stacked self-encoder in both the feature space and the residual space, and the stacked self-encoder selected by the effective features has higher accuracy.

TABLE 2TE Process failure categories

The method for monitoring the nonlinear process of the process industry based on the effective feature selection according to the embodiment of the invention includes acquiring process variables when the process industry normally operates to form a normal data set, inputting the normal data set into a stack self-encoder model which is originally set and formed by sequentially stacking l self-encoders, wherein l is 1,2, n to obtain a plurality of hidden layer features, training the stack self-encoder model to obtain the trained stack self-encoder model, selecting effective features which can distinguish a fault sample from a normal sample from the plurality of hidden layer features from the fault detection angle through an effective feature selection strategy in the stack self-encoder model, calculating statistics of the samples in the normal data set according to the selected effective features, and calculating a control limit of sample control quantity in the normal data set through kernel density estimation; acquiring real-time process variables in the process industry, constructing an online detection data set according to the real-time process variables, preprocessing the online detection data set, inputting the preprocessed online detection data set into a trained stacked self-encoder to extract hidden layer characteristics, selecting effective characteristics in the hidden layer characteristics through an effective characteristic selection strategy, calculating statistics of samples in the online detection data set according to the effective characteristics of the selected hidden layer characteristics, comparing the statistics of each sample in the online detection data set with control limits of a normal data set, judging whether the current online detection data sample is a failed online detection data sample, recording the number of the online detection data samples judged to be failed, calculating a failure detection rate according to the number of the failed data samples in the online detection data set and the number of the online detection data samples judged to be failed, and judging the prediction performance of the trained stacked self-encoder model according to the fault detection rate.

According to the method for monitoring the process industry nonlinear process based on the effective feature selection, the extracted hidden layer features are selected through an effective feature selection strategy, the features more effective for fault monitoring are selected, the effective features for improving the fault detection rate can be selected in a self-adaptive mode, the monitoring effect of the stacked self-encoder model is greatly improved, and the fault monitoring accuracy is greatly improved.

While the foregoing is directed to the preferred embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention as defined in the appended claims.

Claims

1. A process industry nonlinear process monitoring method based on effective feature selection is characterized by comprising the following steps:

step 2, inputting the preprocessed normal data sample into a preset stacking self-encoder model, extracting hidden layer characteristics of the preprocessed normal data sample, pre-training a plurality of self-encoders, sequentially connecting and stacking input layers and hidden layers of the pre-trained self-encoders, constructing the trained stacking self-encoder model, and finely adjusting parameters of the trained stacking self-encoder model to obtain hyper-parameters of the trained stacking self-encoder model;

2. The method for monitoring the non-linear process of the process industry based on the effective characteristic selection according to claim 1, wherein the step 1 specifically comprises:

collecting process variables according to a certain time sequence based on normal process industry, and constructing a normal data set according to the collected process variables, wherein the normal data set is represented as X₁＝[x₁，x₂，...，x_n]^T∈R^n×mWherein n represents the number of normal data sets, and m represents the number of process variables;

for a normal data set X₁＝[x₁，x₂，...，x_n]^T∈R^n×mData preprocessing was performed as follows:

where i denotes a normal data sample, i 1,2., n,

the mth process variable representing the ith normal data sample,

representing the minimum of the mth process variable in the normal data sample.

3. The method for process industry nonlinear process monitoring based on active feature selection according to claim 2, wherein the step 2 specifically comprises:

h_i＝f(x_i；θ₁)＝f(W₁x_i+b₁)，θ₁＝{W₁，b₁} (2)

r_i＝g(h_i；θ₂)＝g(W₂h_i+b₂)，θ₂＝{W₂，b₂} (3)

4. The method of claim 3, wherein the step 2 further comprises:

5. The method for flow industrial nonlinear process monitoring based on active signature selection as recited in claim 4 wherein said step 3 specifically comprises:

carrying out effective feature extraction on the features of the hidden layer finally extracted by the stacked self-encoder through an effective feature selection strategy, and constructing a statistic T in a feature space²As follows:

represents the mean of the jth feature in the hidden layer extracted from the normal data set,

representing the variance of the jth feature in the hidden layer extracted from the normal data set.

6. The method of claim 5, wherein the step 3 further comprises:

and performing descending order arrangement on the calculated contribution degrees of the characteristics of the hidden layer, selecting d characteristics with the maximum contribution degree, and enabling the hyper-parameter d of the trained stacked self-encoder model to meet the following conditions as follows:

C(d)≥C^* (6)

statistics of samples in the normal dataset were calculated as follows:

wherein, T₁ ²Represents the statistics of the samples in the normal data set,

representing the value of the jth feature in the hidden layer extracted from the normal data set.

7. The method of claim 6, wherein the step 3 further comprises:

the kernel function K (·) satisfies the following condition:

K(x)≥0 (11)。

8. the method for process industry nonlinear process monitoring based on active feature selection according to claim 7, wherein the step 4 specifically comprises:

where θ represents an online detection data sample, θ ═ 1,2., n,

represents the mth process variable of the theta online test data sample.

9. The method for process industry nonlinear process monitoring based on active feature selection as recited in claim 8 wherein said step 5 specifically comprises:

wherein, T₂ ²A statistic representing a sample in the online test data set,

and representing the value of the jth feature in the hidden layer extracted by the online detection data set.

10. The method for process industry nonlinear process monitoring based on active signature selection as recited in claim 9 wherein said step 6 specifically comprises:

the failure detection rate was calculated as follows: