CN112904810B - Process industry nonlinear process monitoring method based on effective feature selection - Google Patents

Process industry nonlinear process monitoring method based on effective feature selection Download PDF

Info

Publication number
CN112904810B
CN112904810B CN202110044130.5A CN202110044130A CN112904810B CN 112904810 B CN112904810 B CN 112904810B CN 202110044130 A CN202110044130 A CN 202110044130A CN 112904810 B CN112904810 B CN 112904810B
Authority
CN
China
Prior art keywords
self
hidden layer
encoder
data set
online detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110044130.5A
Other languages
Chinese (zh)
Other versions
CN112904810A (en
Inventor
袁小锋
网文聪
王雅琳
王凯
阳春华
桂卫华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Central South University
Original Assignee
Central South University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Central South University filed Critical Central South University
Priority to CN202110044130.5A priority Critical patent/CN112904810B/en
Publication of CN112904810A publication Critical patent/CN112904810A/en
Application granted granted Critical
Publication of CN112904810B publication Critical patent/CN112904810B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/418Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM]
    • G05B19/41865Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM] characterised by job scheduling, process planning, material flow
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems
    • G05B2219/32Operator till task planning
    • G05B2219/32252Scheduling production, machining, job shop
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Manufacturing & Machinery (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Testing And Monitoring For Control Systems (AREA)

Abstract

The invention provides a process industry nonlinear process monitoring method based on effective feature selection, which comprises the following steps: step 1, acquiring process variables during normal operation of a process industry, constructing a normal data set according to the acquired process variables, and performing data preprocessing on the normal data set; and 2, inputting the preprocessed normal data sample into a preset stacking self-encoder model, extracting hidden layer characteristics of the preprocessed normal data sample, pre-training a plurality of self-encoders, sequentially connecting and stacking input layers and hidden layers of the pre-trained self-encoders, constructing the trained stacking self-encoder model, and finely adjusting parameters of the trained stacking self-encoder model to obtain the hyper-parameters of the trained stacking self-encoder model. According to the invention, the extracted hidden layer characteristics are selected through an effective characteristic selection strategy, and the characteristics more effective for fault monitoring are selected, so that the fault monitoring accuracy is greatly improved, and the fault monitoring effect is greatly improved.

Description

Process industry nonlinear process monitoring method based on effective feature selection
Technical Field
The invention relates to the technical field of process industry, in particular to a process industry nonlinear process monitoring method based on effective characteristic selection.
Background
The process industry refers to modern manufacturing industry that produces process-based products, primarily processing products based on continuous and batch material flow, energy flow. The processing method of the product mainly comprises chemical and physical reactions, separation, mixing and the like, and the process industry is always the prop industry for national economy and social development in China in recent decades. With the development of production capacity and science and technology, the Chinese process industry has been greatly improved and advanced. Nowadays, the industrial production process faces new situations and new problems, and automation, informatization and intellectualization become development directions of transformation and upgrading, safe production, energy conservation and emission reduction in the industrial production process. Therefore, the capacity and the modernization level of the system are gradually improved, the industrial scale is rapidly expanded, and for a large-scale production system, the occurrence of system abnormity can cause production loss, operation unit damage and even casualties. In order to ensure the normal operation and product quality of an industrial system and ensure the safety of personnel and property, the existence of abnormal working conditions needs to be identified in real time and the operation and production recovery conditions need to be given in time, namely, the industrial production process is monitored on line.
Computer technology and distributed control systems are widely used in the process industry today to allow large amounts of process variable data to be collected and utilized, however, many product manufacturing process failure events are still handled manually by operators. Due to the lack of experience and capability of operators, when the abnormal conditions of production are faced, the adoption of wrong coping modes is inevitable, so that the industrial production cannot run stably. Therefore, the key content of the research is to introduce a computer control system to monitor the process industrial production process accurately in real time and to guide and assist field operators when dealing with production faults.
In recent years, industrial data is collected more conveniently and rapidly, and in many process monitoring research methods, a qualitative analysis method and a quantitative analysis method are mostly used for implementing real-time monitoring of an industrial process. Qualitative analysis methods describe the state of a system by analyzing the relationships between system variables or parameters, and common methods include: an expert system method, a graph theory method and a qualitative simulation method. Quantitative analysis is a method of analyzing the quantitative characteristics, quantitative relationships and quantitative variations of industrial data using a mathematical module to reveal and describe the correlation and development trends among process variables. Generally, quantitative analysis methods are classified into analytical model-based methods and data-driven methods.
Analytical model based process monitoring methods were studied earlier. The method comprises hidden layer knowledge of the inside of the system, but the method is only suitable for the monitored object to have an accurate mathematical model, the modeling process is complex, and in practice, the accurate mathematical model of the industrial process is often difficult to obtain, and the method cannot obtain a satisfactory monitoring effect. Currently, in modern industrial plants, different operating units can measure a large number of variables that reflect the operating state of an industrial process, thereby providing opportunities for data-driven based methods. Data-driven based process monitoring is an effective technique that can extract information from process data that is readily available in modern process systems, thereby providing a better understanding of the process without the need to know the exact analytical model of the system. Due to the nature of this data-driven approach, the data-driven monitoring method is well suited for complex and large-scale flow systems. The data driving method uses historical data to establish a process monitoring model, and then obtains a monitoring index of fault detection.
However, the data-driven based process monitoring also has the following problems. The effective characteristic information extracted by the model is insufficient, how to remove redundant information, and extracting key information is also an important aspect for improving the monitoring performance of the system. However, the existing feature extraction method for process monitoring cannot extract key effective features from the highly nonlinear and dynamically changing data, and the monitoring performance is affected. The stack self-encoder is used as a method for extracting features, a plurality of self-encoder networks are trained through nonlinear mapping, high-dimensional data are converted into low-dimensional features, abstract key features of input data can be obtained, and nonlinear relations of variables can be extracted.
Disclosure of Invention
The invention provides a process industry nonlinear process monitoring method based on effective feature selection, and aims to solve the problems that a traditional self-encoder model only focuses on feature extraction for reconstructing original input as much as possible, and feature information for increasing the difference between fault data and normal data is extracted from the fault detection angle.
In order to achieve the above object, an embodiment of the present invention provides a process industry nonlinear process monitoring method based on effective feature selection, including:
step 1, acquiring process variables during normal operation of a process industry, constructing a normal data set according to the acquired process variables, and performing data preprocessing on the normal data set;
step 2, inputting the preprocessed normal data samples into a preset stacked self-encoder model, extracting hidden layer characteristics of the preprocessed normal data samples, pre-training a plurality of self-encoders, sequentially connecting and stacking input layers and hidden layers of the pre-trained self-encoders, constructing the trained stacked self-encoder model, and finely adjusting parameters of the trained stacked self-encoder model to obtain hyper-parameters of the trained stacked self-encoder model;
step 3, selecting effective characteristics in the hidden layer characteristics of the preprocessed normal data sample through an effective characteristic selection strategy, calculating the statistic of the normal data concentrated sample according to the effective characteristics of the hidden layer characteristics of the selected normal data sample, and calculating the control limit of the statistic through kernel density estimation;
step 4, acquiring real-time process variables in the process industrial operation process, constructing an online detection data set according to the acquired real-time process variables, and performing data preprocessing on the online detection data set;
step 5, inputting the preprocessed online detection data sample into the trained stacked self-encoder model, extracting hidden layer characteristics of the preprocessed online detection data sample, selecting effective characteristics in the hidden layer characteristics through an effective characteristic selection strategy, and calculating statistics of online detection data concentrated samples according to the selected effective characteristics of the hidden layer characteristics;
step 6, comparing the statistic of each sample in the online detection data set with the control limit of the statistic, judging whether the statistic of each sample in the online detection data set exceeds the control limit of the statistic, when the statistic of a certain online detection data sample in the online detection data set exceeds the control limit of the statistic, alarming and prompting are carried out, and the current online detection data sample is judged as the online detection data sample of the fault, the number of the online detection data samples judged as the fault is recorded, when the statistic of one online detection data sample in the online detection data set does not exceed the control limit of the statistic, the current online detection data sample is judged as a normal online detection data sample, and calculating the fault detection rate according to the number of the fault data samples in the online detection data set and the number of the online detection data samples which are judged to be faults in the online detection data set.
Wherein, the step 1 specifically comprises:
collecting process variables according to a certain time sequence based on normal process industry, and constructing a normal data set according to the collected process variables, wherein the normal data set is represented as X1=[x1,x2,...,xn]T∈Rn×mWherein n represents the number of normal data sets, and m represents the number of process variables;
to a normal data set X1=[x1,x2,...,xn]T∈Rn×mData preprocessing was performed as follows:
Figure BDA0002896914880000031
where i denotes a normal data sample, i 1,2., n,
Figure BDA0002896914880000032
the mth process variable representing the ith normal data sample,
Figure BDA0002896914880000041
represents the maximum value of the mth process variable in the normal data sample,
Figure BDA0002896914880000042
representing the minimum of the mth process variable in the normal data sample.
Wherein, the step 2 specifically comprises:
inputting the preprocessed normal data set into a first self-encoder of a preset stacked self-encoder model for model training, wherein the structures of the self-encoder and a decoder are as follows:
hi=f(xi;θ1)=f(W1xi+b1),θ1={W1,b1} (2)
ri=g(hi;θ2)=g(W2hi+b2),θ2={W2,b2} (3)
wherein, h ═ f (x) denotes an activation function of the hidden layer, and r ═ g (h) denotes an activation function of the output layer; theta1Set of parameters, θ, representing the hidden layer2A parameter set representing the output layer, k representing the number of nodes of the hidden layer, xi∈RmRepresents the ith normal data sample, hi∈RkRepresenting the hidden layer features extracted by the self-encoder, g (-) represents the decoding function, ri∈RmRepresenting the output vector, W1Representing the weight of the encoder, W1∈Rk×m,W2Representing the weight of the decoder, W2∈Rm×k,b1Representing the bias parameters of the encoder, b1∈Rk,b2Representing the offset parameters of the decoder, b2∈Rm
Wherein, the step 2 further comprises:
inputting the normal data set into a first self-encoder of a preset stacked self-encoder model, and extracting hidden layer characteristics through the first self-encoder;
inputting the extracted hidden layer characteristics into a subsequent self-encoder, and extracting the hidden layer characteristics through the subsequent self-encoder;
repeatedly inputting the hidden layer characteristic data extracted by the previous self-encoder into the next self-encoder, extracting the hidden layer characteristic through the next self-encoder until no next self-encoder stops, and obtaining the weight of each self-encoder, the bias parameter of each self-encoder and the hidden layer characteristic of each self-encoder;
sequentially connecting the self-coders after pre-training according to an input layer and a hidden layer to construct a trained stacked self-coder model;
and carrying out global fine adjustment on the parameters of the trained stacked self-encoder model through a back propagation algorithm.
Wherein, the step 3 specifically comprises:
final extraction of stacked autocoder by active feature selection strategyThe feature of the hidden layer is extracted effectively, and statistic T constructed in the feature space2As follows:
Figure BDA0002896914880000043
defining a calculation formula of the contribution degree of each feature of the normal data set hidden layer according to the formula (4) and calculating the contribution degree of each feature of the normal data set hidden layer as follows:
Figure BDA0002896914880000051
wherein, cjRepresenting the contribution of the jth feature of the hidden layer of the normal data set, f representing the self-encoder activation function, X1Representing a normal data set, w1jSelf-encoder weights representing the jth feature of the hidden layer of the normal data set,
Figure BDA0002896914880000052
a bias parameter representing the jth characteristic of the hidden layer of the normal data set,
Figure BDA0002896914880000053
means, σ, representing the jth feature in the hidden layer of normal data set extraction1j 2Representing the variance of the jth feature in the hidden layer extracted from the normal data set.
Wherein, the step 3 further comprises:
and performing descending order on the calculated contribution degrees of the characteristics of the hidden layer, selecting d characteristics with the largest contribution degree, wherein the hyper-parameter d of the trained stacked self-encoder model meets the following conditions as follows:
C(d)≥C* (6)
wherein, C*Indicates a predetermined threshold value, C*Set between 80% and 90%;
assuming that the hidden layer of the I-th self-encoder has y features, the contribution degrees of the y features are in descending orderIs arranged as c1≥c2≥...≥cyThe minimum trained hyper-parameter d of the stacked self-coder model satisfying the condition of equation (6) is derived as follows:
Figure BDA0002896914880000054
wherein d represents a hyper-parameter of the trained stacked self-encoder model, d is more than or equal to 1 and less than or equal to y, y represents the number of features, k represents the kth hidden layer feature, ckRepresenting the contribution degree of the characteristic of the kth hidden layer;
statistics of samples in the normal dataset were calculated as follows:
Figure BDA0002896914880000055
wherein, T1 2Statistics representing samples in the normal data set, H1jRepresenting the value of the jth feature in the hidden layer extracted from the normal data set.
Wherein, the step 3 further comprises:
the control limit of the statistic is calculated by kernel density estimation as follows:
Figure BDA0002896914880000061
wherein p (X) denotes a normal data set X1Of a probability distribution function of X1iThe ith normal data sample of the normal data set is represented, n represents the number of the normal data samples, h represents a bandwidth parameter, and K (·) represents a kernel function;
the kernel function K (·) satisfies the following condition:
Figure BDA0002896914880000062
K(x)≥0 (11)。
wherein, the step 4 specifically comprises:
acquiring real-time process variables in the process industry, and constructing an online detection data set according to the acquired real-time process variables, wherein the online detection data set is represented as X2=[x1,x2,...,xn]T∈Rn×mThe online detection data set comprises normal online detection data samples and fault data samples, and the online detection data set X2=[x1,x2,...,xn]T∈Rn×mData preprocessing was performed as follows:
Figure BDA0002896914880000063
where θ represents an online detection data sample, θ ═ 1,2., n,
Figure BDA0002896914880000064
represents the mth process variable of the theta online test data sample.
Wherein, the step 5 specifically comprises:
inputting the preprocessed online detection data sample into the trained stacked self-encoder model, extracting the characteristics of the hidden layer of the online detection data sample, selecting the effective characteristics of the hidden layer of the online detection data sample through an effective characteristic selection strategy, and calculating the statistic of the online detection data concentrated sample according to the effective characteristics of the selected characteristics of the hidden layer of the online detection data sample, wherein the statistic is as follows:
Figure BDA0002896914880000065
wherein, T2 2Statistics representing samples in the on-line test dataset, H2jAnd representing the value of the jth feature in the hidden layer extracted by the online detection data set.
Wherein, the step 6 specifically comprises:
the failure detection rate was calculated as follows:
Figure BDA0002896914880000066
wherein, fdr represents the fault detection rate, a represents the number of online detection data samples judged to be fault in the online detection data set, and s represents the number of fault data samples in the online detection data set.
The scheme of the invention has the following beneficial effects:
according to the method for monitoring the nonlinear process of the process industry based on the effective feature selection, the extracted original features are selected through the effective feature selection strategy, the features more effective for fault monitoring are selected, the fault monitoring accuracy is greatly improved, and the fault monitoring effect is greatly improved.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a schematic diagram of the TE process of the present invention;
FIG. 3 is a comparison of the failure detection rates in feature space according to the present invention;
FIG. 4 is a comparison diagram of the fault detection rate in the residual error space according to the present invention.
Detailed Description
In order to make the technical problems, technical solutions and advantages of the present invention more apparent, the following detailed description is given with reference to the accompanying drawings and specific embodiments.
The invention provides a process industry nonlinear process monitoring method based on effective feature selection, aiming at the problems that the existing self-encoder model only focuses on feature extraction for reconstructing original input as much as possible and ignores feature information for increasing the difference between fault data and normal data extracted from the fault detection angle.
As shown in fig. 1 to 4, an embodiment of the present invention provides a process industry nonlinear process monitoring method based on active feature selection, including: step 1, acquiring process variables during normal operation of a process industry, constructing a normal data set according to the acquired process variables, and performing data preprocessing on the normal data set; step 2, inputting the preprocessed normal data sample into a preset stacking self-encoder model, extracting hidden layer characteristics of the preprocessed normal data sample, pre-training a plurality of self-encoders, sequentially connecting and stacking input layers and hidden layers of the pre-trained self-encoders, constructing the trained stacking self-encoder model, and finely adjusting parameters of the trained stacking self-encoder model to obtain hyper-parameters of the trained stacking self-encoder model; step 3, selecting effective characteristics in the hidden layer characteristics of the preprocessed normal data sample through an effective characteristic selection strategy, calculating the statistic of the normal data concentrated sample according to the effective characteristics of the hidden layer characteristics of the selected normal data sample, and calculating the control limit of the statistic through kernel density estimation; step 4, acquiring real-time process variables in the process industrial operation process, constructing an online detection data set according to the acquired real-time process variables, and performing data preprocessing on the online detection data set; step 5, inputting the preprocessed online detection data sample into the trained stacked self-encoder model, extracting hidden layer characteristics of the preprocessed online detection data sample, selecting effective characteristics in the hidden layer characteristics through an effective characteristic selection strategy, and calculating statistics of online detection data concentrated samples according to the selected effective characteristics of the hidden layer characteristics; step 6, comparing the statistic of each sample in the online detection data set with the control limit of the statistic, judging whether the statistic of each sample in the online detection data set exceeds the control limit of the statistic, when the statistic of a certain online detection data sample in the online detection data set exceeds the control limit of the statistic, alarming and prompting are carried out, and the current online detection data sample is judged as the online detection data sample of the fault, the number of the online detection data samples judged as the fault is recorded, when the statistic of one online detection data sample in the online detection data set does not exceed the control limit of the statistic, the current online detection data sample is judged as a normal online detection data sample, and calculating the fault detection rate according to the number of the fault data samples in the online detection data set and the number of the online detection data samples which are judged to be faults in the online detection data set.
Wherein, the step 1 specifically comprises: collecting process variables according to a certain time sequence based on normal process industry, and constructing a normal data set according to the collected process variables, wherein the normal data set is represented as X1=[x1,x2,...,xn]T∈Rn×mWherein n represents the number of normal data sets, and m represents the number of process variables;
for a normal data set X1=[x1,x2,...,xn]T∈Rn×mData preprocessing was performed as follows:
Figure BDA0002896914880000081
where i denotes a normal data sample, i 1,2., n,
Figure BDA0002896914880000082
the mth process variable representing the ith normal data sample,
Figure BDA0002896914880000083
represents the maximum of the mth process variable in the normal data sample,
Figure BDA0002896914880000084
representing the minimum of the mth process variable in the normal data sample.
Wherein, the step 2 specifically comprises: inputting the preprocessed normal data set into a first self-encoder of a preset stack self-encoder model for model training, wherein the structures of the self-encoder and a decoder are as follows:
hi=f(xi;θ1)=f(W1xi+b1),θ1={W1,b1} (2)
ri=g(hi;θ2)=g(W2hi+b2),θ2={W2,b2} (3)
wherein, h ═ f (x) denotes an activation function of the hidden layer, and r ═ g (h) denotes an activation function of the output layer; theta1Set of parameters, θ, representing the hidden layer2A parameter set representing the output layer, k representing the number of nodes of the hidden layer, xi∈RmRepresents the ith normal data sample, hi∈RkRepresenting the hidden layer features extracted by the self-encoder, g (-) represents the decoding function, ri∈RmRepresenting the output vector, W1Representing the weight of the encoder, W1∈Rk×m,W2Representing the weight of the decoder, W2∈Rm×k,b1Representing the bias parameters of the encoder, b1∈Rk,b2Representing the offset parameters of the decoder, b2∈Rm
Wherein, the step 2 further comprises: inputting the normal data set into a first self-encoder of a preset stacked self-encoder model, and extracting hidden layer characteristics through the first self-encoder; inputting the extracted hidden layer characteristics into a subsequent self-encoder, and extracting the hidden layer characteristics through the subsequent self-encoder; repeatedly inputting the hidden layer characteristic data extracted by the previous self-encoder into the next self-encoder, extracting the hidden layer characteristic through the next self-encoder until no next self-encoder stops, and obtaining the weight of each self-encoder, the bias parameter of each self-encoder and the hidden layer characteristic of each self-encoder; sequentially connecting the self-coders after pre-training according to an input layer and a hidden layer to construct a trained stacked self-coder model; and carrying out global fine adjustment on the parameters of the trained stacked self-encoder model through a back propagation algorithm.
In the method for monitoring a nonlinear process of process industry based on effective feature selection according to the embodiment of the invention, the principle of the self-encoder is to reconstruct the input at the output part through the learning of the hidden layer of the network, i.e. let r beiApproximation xiThen, through a back propagation algorithm, an error back propagation training between the two is utilized to obtain a pre-trained self-encoder, and the training of the l-1 self-encoder is completedAnd (3) taking the (l-1) th hidden layer characteristic as the input of the (l) th self-encoder, obtaining the weight and the bias parameter of the (l) th self-encoder, and extracting the (l) th hidden layer characteristic, wherein the activation function can be a sigmoid, a tanh hyperbolic tangent function or a ReLu linear rectification function.
Wherein, the step 3 specifically comprises: performing effective feature extraction on the features of the hidden layer finally extracted by the stacked self-encoder through an effective feature selection strategy, and constructing a statistic T in a feature space2As follows:
Figure BDA0002896914880000093
defining a calculation formula of the contribution degree of each feature of the normal data set hidden layer according to the formula (4) and calculating the contribution degree of each feature of the normal data set hidden layer as follows:
Figure BDA0002896914880000091
wherein, cjRepresenting the contribution of the jth feature of the hidden layer of the normal data set, f representing the self-encoder activation function, X1A normal data set is represented that is,
Figure BDA0002896914880000092
self-encoder weights representing the jth feature of the hidden layer of the normal data set,
Figure BDA0002896914880000105
a bias parameter representing the jth characteristic of the hidden layer of the normal data set,
Figure BDA0002896914880000101
means, σ, representing the jth feature in the hidden layer of normal data set extraction1j 2Representing the variance of the jth feature in the hidden layer extracted from the normal data set.
Wherein, the step 3 further comprises: and performing descending order on the calculated contribution degrees of the characteristics of the hidden layer, selecting d characteristics with the largest contribution degree, wherein the hyper-parameter d of the trained stacked self-encoder model meets the following conditions as follows:
C(d)≥C* (6)
wherein, C*Indicates a predetermined threshold value, C*Set between 80% and 90%;
assuming that the hidden layer of the I-th self-encoder has y features, the contribution degrees of the y features are arranged in a descending order as c1≥c2≥...≥cyThe minimum trained hyper-parameter d of the stacked self-coder model satisfying the condition of equation (6) is derived as follows:
Figure BDA0002896914880000102
wherein d represents a hyper-parameter of the trained stacked self-encoder model, d is more than or equal to 1 and less than or equal to y, y represents the number of features, k represents the kth hidden layer feature, ckRepresenting the contribution degree of the characteristic of the kth hidden layer;
statistics of samples in the normal dataset were calculated as follows:
Figure BDA0002896914880000103
wherein, T1 2Statistics representing samples in the normal data set, H1jRepresenting the value of the jth feature in the hidden layer extracted from the normal data set.
Wherein, the step 3 further comprises: the control limit of the statistic is calculated by kernel density estimation as follows:
Figure BDA0002896914880000104
wherein p (X) denotes a normal data set X1Of a probability distribution function of X1iI normal data sample of the normal data set, n normal data sample number, and h normal data sample numberThe bandwidth parameter, K (-) represents the kernel function;
the kernel function K (·) satisfies the following condition:
Figure BDA0002896914880000111
K(x)≥0 (11)。
wherein, the step 4 specifically comprises: acquiring real-time process variables in the process industry, and constructing an online detection data set according to the acquired real-time process variables, wherein the online detection data set is represented as X2=[x1,x2,...,xn]T∈Rn×mThe online detection data set comprises normal online detection data samples and fault data samples, and the online detection data set X2=[x1,x2,...,xn]T∈Rn×mData preprocessing was performed as follows:
Figure BDA0002896914880000112
where θ represents an online detection data sample, θ ═ 1,2., n,
Figure BDA0002896914880000113
represents the mth process variable of the theta online test data sample.
The method for monitoring the process industry nonlinear process based on the effective feature selection according to the embodiment of the invention verifies the prediction performance of the trained stacked self-coding model through the online detection data set.
Wherein, the step 5 specifically comprises: inputting the preprocessed online detection data sample into the trained stacked self-encoder model, extracting the characteristics of the hidden layer of the online detection data sample, selecting the effective characteristics of the hidden layer of the online detection data sample through an effective characteristic selection strategy, and calculating the statistic of the online detection data concentrated sample according to the effective characteristics of the selected characteristics of the hidden layer of the online detection data sample, wherein the statistic is as follows:
Figure BDA0002896914880000114
wherein, T2 2Statistics representing samples in an on-line test dataset, H2jAnd representing the value of the jth feature in the hidden layer extracted by the online detection data set.
Wherein, the step 6 specifically comprises: the failure detection rate was calculated as follows:
Figure BDA0002896914880000115
wherein, fdr represents the fault detection rate, a represents the number of online detection data samples judged as faults in the online detection data set, and s represents the number of fault data samples in the online detection data set.
According to the method for monitoring the process industry nonlinear process based on the effective feature selection, the training process of the self-encoder model is divided into two steps of pre-training layer by layer and reverse fine tuning, each self-encoder is pre-trained layer by layer, a normal data set is input and passes through a first self-encoder, and first hidden layer feature data are extracted; taking the first hidden layer characteristic data as the input of a second self-encoder, and extracting second hidden layer characteristic data; repeating the steps of taking hidden layer feature data of a previous self-encoder as input of hidden layer feature data of a next self-encoder and extracting hidden layer feature data of the next self-encoder, wherein l represents the position of the self-encoder in the stacked self-encoder model, and l is 1,2, …, n; for the training of a subsequent self-encoder (i.e. when l > -2), taking l-1 hidden layer feature data as the input of the first self-encoder, further obtaining the weight and the bias parameter of the first self-encoder, and extracting the first hidden layer feature data; sequentially connecting the pre-trained self-coders according to the input layer and the hidden layer to form a stacked self-coder model, and finally performing global fine tuning by utilizing back propagation on the basis of the parameters obtained by pre-training to train the stacked self-coder model. In the reverse fine tuning step, parameters of the stacked self-encoder model are fine-tuned by constructing a loss function of the stacked self-encoder model by using a back propagation algorithm until the network converges to a target range.
In this embodiment, the method of monitoring the TE process in the process industry includes the following steps: the control architecture for the TE process (Tennessee Eastman, TE, Tennessee-Eastman) comprises five major operating units, a reactor, a condenser, a separator, a compressor, and a stripper. The method comprises 22 measurement variables, 19 component variables and 12 operation variables, 11 operation variables and 22 continuous variables are selected for process monitoring, 21 fault data sets are used as a test set in the TE process, each data set comprises 960 samples, faults are introduced from the 161 st sample, 500 samples are used as normal data sets, normal data are collected for the TE process, and a stacked self-encoder model is 33->30->27->30->33, i.e. the input dimension is 33, and the number of self-encoders is 2. The parameters from the input layer to the hidden layer of the two coders are w1、w2、b1、b2The hidden layer activation functions of the two autoencoders are g1、g2}。
As shown in table 1, the feature space and residual space fault detection rate of the three models, which are Principal Component Analysis (PCA), Stacked Auto Encoder (SAE), and active feature selection stacked auto encoder (SAE-CS), on the fault in the test set 21 is given.
TABLE 1 Fault detection Rate for three models in TE Process
Figure BDA0002896914880000121
Figure BDA0002896914880000131
It can be seen from the table that the fault detection rate of the stacked self-encoder selected by the effective features is very good in 21 types of faults of the TE data set, and the detection rate of the stacked self-encoder selected by the effective features is generally better than that of the original stacked self-encoder in both the feature space and the residual space, and the stacked self-encoder selected by the effective features has higher accuracy.
TABLE 2TE Process failure categories
Figure BDA0002896914880000132
Figure BDA0002896914880000141
The method for monitoring the nonlinear process of the process industry based on the effective feature selection according to the embodiment of the invention includes acquiring process variables when the process industry normally operates to form a normal data set, inputting the normal data set into a stack self-encoder model which is originally set and formed by sequentially stacking l self-encoders, wherein l is 1,2, n to obtain a plurality of hidden layer features, training the stack self-encoder model to obtain the trained stack self-encoder model, selecting effective features which can distinguish a fault sample from a normal sample from the plurality of hidden layer features from the fault detection angle through an effective feature selection strategy in the stack self-encoder model, calculating statistics of the samples in the normal data set according to the selected effective features, and calculating a control limit of sample control quantity in the normal data set through kernel density estimation; acquiring real-time process variables in the process industry, constructing an online detection data set according to the real-time process variables, preprocessing the online detection data set, inputting the preprocessed online detection data set into a trained stacked self-encoder to extract hidden layer characteristics, selecting effective characteristics in the hidden layer characteristics through an effective characteristic selection strategy, calculating statistics of samples in the online detection data set according to the effective characteristics of the selected hidden layer characteristics, comparing the statistics of each sample in the online detection data set with control limits of a normal data set, judging whether the current online detection data sample is a failed online detection data sample, recording the number of the online detection data samples judged to be failed, calculating a failure detection rate according to the number of the failed data samples in the online detection data set and the number of the online detection data samples judged to be failed, and judging the prediction performance of the trained stacked self-encoder model according to the fault detection rate.
According to the method for monitoring the process industry nonlinear process based on the effective feature selection, the extracted hidden layer features are selected through an effective feature selection strategy, the features more effective for fault monitoring are selected, the effective features for improving the fault detection rate can be selected in a self-adaptive mode, the monitoring effect of the stacked self-encoder model is greatly improved, and the fault monitoring accuracy is greatly improved.
While the foregoing is directed to the preferred embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (10)

1. A process industry nonlinear process monitoring method based on effective feature selection is characterized by comprising the following steps:
step 1, acquiring process variables during normal operation of a process industry, constructing a normal data set according to the acquired process variables, and performing data preprocessing on the normal data set;
step 2, inputting the preprocessed normal data sample into a preset stacking self-encoder model, extracting hidden layer characteristics of the preprocessed normal data sample, pre-training a plurality of self-encoders, sequentially connecting and stacking input layers and hidden layers of the pre-trained self-encoders, constructing the trained stacking self-encoder model, and finely adjusting parameters of the trained stacking self-encoder model to obtain hyper-parameters of the trained stacking self-encoder model;
step 3, selecting effective characteristics in the hidden layer characteristics of the preprocessed normal data sample through an effective characteristic selection strategy, calculating the statistic of the normal data concentrated sample according to the effective characteristics of the hidden layer characteristics of the selected normal data sample, and calculating the control limit of the statistic through kernel density estimation;
step 4, acquiring real-time process variables in the process industrial operation process, constructing an online detection data set according to the acquired real-time process variables, and performing data preprocessing on the online detection data set;
step 5, inputting the preprocessed online detection data sample into the trained stacked self-encoder model, extracting hidden layer characteristics of the preprocessed online detection data sample, selecting effective characteristics in the hidden layer characteristics through an effective characteristic selection strategy, and calculating statistics of online detection data concentrated samples according to the selected effective characteristics of the hidden layer characteristics;
step 6, comparing the statistic of each sample in the online detection data set with the control limit of the statistic, judging whether the statistic of each sample in the online detection data set exceeds the control limit of the statistic, when the statistic of a certain online detection data sample in the online detection data set exceeds the control limit of the statistic, alarming and prompting are carried out, and the current online detection data sample is judged as the online detection data sample of the fault, the number of the online detection data samples judged as the fault is recorded, when the statistic of one online detection data sample in the online detection data set does not exceed the control limit of the statistic, the current online detection data sample is judged as a normal online detection data sample, and calculating the fault detection rate according to the number of the fault data samples in the online detection data set and the number of the online detection data samples which are judged to be faults in the online detection data set.
2. The method for monitoring the non-linear process of the process industry based on the effective characteristic selection according to claim 1, wherein the step 1 specifically comprises:
collecting process variables according to a certain time sequence based on normal process industry, and constructing a normal data set according to the collected process variables, wherein the normal data set is represented as X1=[x1,x2,...,xn]T∈Rn×mWherein n represents the number of normal data sets, and m represents the number of process variables;
for a normal data set X1=[x1,x2,...,xn]T∈Rn×mData preprocessing was performed as follows:
Figure FDA0002896914870000021
where i denotes a normal data sample, i 1,2., n,
Figure FDA0002896914870000022
the mth process variable representing the ith normal data sample,
Figure FDA0002896914870000023
represents the maximum value of the mth process variable in the normal data sample,
Figure FDA0002896914870000024
representing the minimum of the mth process variable in the normal data sample.
3. The method for process industry nonlinear process monitoring based on active feature selection according to claim 2, wherein the step 2 specifically comprises:
inputting the preprocessed normal data set into a first self-encoder of a preset stacked self-encoder model for model training, wherein the structures of the self-encoder and a decoder are as follows:
hi=f(xi;θ1)=f(W1xi+b1),θ1={W1,b1} (2)
ri=g(hi;θ2)=g(W2hi+b2),θ2={W2,b2} (3)
wherein, h ═ f (x) denotes an activation function of the hidden layer, and r ═ g (h) denotes an activation function of the output layer; theta1Set of parameters, θ, representing the hidden layer2A parameter set representing the output layer, k representing the number of nodes of the hidden layer, xi∈RmRepresents the ith normal data sample, hi∈RkRepresenting the hidden layer features extracted by the self-encoder, g (-) represents the decoding function, ri∈RmRepresenting the output vector, W1Representing the weight of the encoder, W1∈Rk×m,W2Representing the weight of the decoder, W2∈Rm×k,b1Representing the bias parameters of the encoder, b1∈Rk,b2Representing the offset parameters of the decoder, b2∈Rm
4. The method of claim 3, wherein the step 2 further comprises:
inputting the normal data set into a first self-encoder of a preset stacked self-encoder model, and extracting hidden layer characteristics through the first self-encoder;
inputting the extracted hidden layer characteristics into a subsequent self-encoder, and extracting the hidden layer characteristics through the subsequent self-encoder;
repeatedly inputting the hidden layer characteristic data extracted by the previous self-encoder into the next self-encoder, extracting the hidden layer characteristic through the next self-encoder until no next self-encoder stops, and obtaining the weight of each self-encoder, the bias parameter of each self-encoder and the hidden layer characteristic of each self-encoder;
sequentially connecting the self-coders after pre-training according to an input layer and a hidden layer to construct a trained stacked self-coder model;
and carrying out global fine adjustment on the parameters of the trained stacked self-encoder model through a back propagation algorithm.
5. The method for flow industrial nonlinear process monitoring based on active signature selection as recited in claim 4 wherein said step 3 specifically comprises:
carrying out effective feature extraction on the features of the hidden layer finally extracted by the stacked self-encoder through an effective feature selection strategy, and constructing a statistic T in a feature space2As follows:
Figure FDA0002896914870000031
defining a calculation formula of the contribution degree of each feature of the normal data set hidden layer according to the formula (4) and calculating the contribution degree of each feature of the normal data set hidden layer as follows:
Figure FDA0002896914870000032
wherein, cjRepresenting the contribution of the jth feature of the hidden layer of the normal data set, f representing the self-encoder activation function, X1A normal data set is represented that is,
Figure FDA0002896914870000033
self-encoder weights representing the jth feature of the hidden layer of the normal data set,
Figure FDA0002896914870000034
a bias parameter representing the jth characteristic of the hidden layer of the normal data set,
Figure FDA0002896914870000035
represents the mean of the jth feature in the hidden layer extracted from the normal data set,
Figure FDA0002896914870000036
representing the variance of the jth feature in the hidden layer extracted from the normal data set.
6. The method of claim 5, wherein the step 3 further comprises:
and performing descending order arrangement on the calculated contribution degrees of the characteristics of the hidden layer, selecting d characteristics with the maximum contribution degree, and enabling the hyper-parameter d of the trained stacked self-encoder model to meet the following conditions as follows:
C(d)≥C* (6)
wherein, C*Indicates a predetermined threshold value, C*Set between 80% and 90%;
assuming that the hidden layer of the I-th self-encoder has y features, the contribution degrees of the y features are arranged in a descending order as c1≥c2≥...≥cyThe minimum trained hyper-parameter d of the stacked self-coder model satisfying the condition of equation (6) is derived as follows:
Figure FDA0002896914870000041
wherein d represents a hyper-parameter of the trained stacked self-encoder model, d is more than or equal to 1 and less than or equal to y, y represents the number of features, k represents the kth hidden layer feature, ckRepresenting the contribution degree of the characteristic of the kth hidden layer;
statistics of samples in the normal dataset were calculated as follows:
Figure FDA0002896914870000042
wherein, T1 2Represents the statistics of the samples in the normal data set,
Figure FDA0002896914870000043
representing the value of the jth feature in the hidden layer extracted from the normal data set.
7. The method of claim 6, wherein the step 3 further comprises:
the control limit of the statistic is calculated by kernel density estimation as follows:
Figure FDA0002896914870000044
wherein p (X) denotes a normal data set X1Of a probability distribution function of X1iThe ith normal data sample of the normal data set is represented, n represents the number of the normal data samples, h represents a bandwidth parameter, and K (·) represents a kernel function;
the kernel function K (·) satisfies the following condition:
Figure FDA0002896914870000045
K(x)≥0 (11)。
8. the method for process industry nonlinear process monitoring based on active feature selection according to claim 7, wherein the step 4 specifically comprises:
acquiring real-time process variables in the process industry, and constructing an online detection data set according to the acquired real-time process variables, wherein the online detection data set is represented as X2=[x1,x2,...,xn]T∈Rn×mThe online detection data set comprises normal online detection data samples and fault data samples, and the online detection data set X2=[x1,x2,...,xn]T∈Rn×mData preprocessing was performed as follows:
Figure FDA0002896914870000046
where θ represents an online detection data sample, θ ═ 1,2., n,
Figure FDA0002896914870000051
represents the mth process variable of the theta online test data sample.
9. The method for process industry nonlinear process monitoring based on active feature selection as recited in claim 8 wherein said step 5 specifically comprises:
inputting the preprocessed online detection data sample into the trained stacked self-encoder model, extracting the characteristics of the hidden layer of the online detection data sample, selecting the effective characteristics of the hidden layer of the online detection data sample through an effective characteristic selection strategy, and calculating the statistic of the online detection data concentrated sample according to the effective characteristics of the selected characteristics of the hidden layer of the online detection data sample, wherein the statistic is as follows:
Figure FDA0002896914870000052
wherein, T2 2A statistic representing a sample in the online test data set,
Figure FDA0002896914870000053
and representing the value of the jth feature in the hidden layer extracted by the online detection data set.
10. The method for process industry nonlinear process monitoring based on active signature selection as recited in claim 9 wherein said step 6 specifically comprises:
the failure detection rate was calculated as follows:
Figure FDA0002896914870000054
wherein, fdr represents the fault detection rate, a represents the number of online detection data samples judged to be fault in the online detection data set, and s represents the number of fault data samples in the online detection data set.
CN202110044130.5A 2021-01-13 2021-01-13 Process industry nonlinear process monitoring method based on effective feature selection Active CN112904810B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110044130.5A CN112904810B (en) 2021-01-13 2021-01-13 Process industry nonlinear process monitoring method based on effective feature selection

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110044130.5A CN112904810B (en) 2021-01-13 2021-01-13 Process industry nonlinear process monitoring method based on effective feature selection

Publications (2)

Publication Number Publication Date
CN112904810A CN112904810A (en) 2021-06-04
CN112904810B true CN112904810B (en) 2022-07-01

Family

ID=76112879

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110044130.5A Active CN112904810B (en) 2021-01-13 2021-01-13 Process industry nonlinear process monitoring method based on effective feature selection

Country Status (1)

Country Link
CN (1) CN112904810B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113468537B (en) * 2021-06-15 2024-04-09 江苏大学 Feature extraction and exploit attack detection method based on improved self-encoder
CN115877223B (en) * 2023-03-03 2023-07-04 浙江天能新材料有限公司 Lithium battery detection method, device, equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109978079A (en) * 2019-04-10 2019-07-05 东北电力大学 A kind of data cleaning method of improved storehouse noise reduction self-encoding encoder
CN110096810A (en) * 2019-05-05 2019-08-06 中南大学 A kind of industrial process flexible measurement method based on layer-by-layer data extending deep learning
CN111324110A (en) * 2020-03-20 2020-06-23 北京工业大学 Fermentation process fault monitoring method based on multiple shrinkage automatic encoders
CN112034793A (en) * 2020-08-10 2020-12-04 华东理工大学 Method for monitoring production running state space of oxidation working section in PTA process flow
CN112101445A (en) * 2020-09-09 2020-12-18 浙江大学 Method for forecasting continuous casting billet subsurface slag inclusion defect in real time based on supervised neural network

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7844558B2 (en) * 2006-10-05 2010-11-30 Siemens Corporation Incremental learning of nonlinear regression networks for machine condition monitoring
CN108227653B (en) * 2017-12-28 2021-02-05 湖州师范学院 Large-scale nonlinear process monitoring method based on randomized kernel principal component analysis
US11568198B2 (en) * 2018-09-12 2023-01-31 Applied Materials, Inc. Deep auto-encoder for equipment health monitoring and fault detection in semiconductor and display process equipment tools

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109978079A (en) * 2019-04-10 2019-07-05 东北电力大学 A kind of data cleaning method of improved storehouse noise reduction self-encoding encoder
CN110096810A (en) * 2019-05-05 2019-08-06 中南大学 A kind of industrial process flexible measurement method based on layer-by-layer data extending deep learning
CN111324110A (en) * 2020-03-20 2020-06-23 北京工业大学 Fermentation process fault monitoring method based on multiple shrinkage automatic encoders
CN112034793A (en) * 2020-08-10 2020-12-04 华东理工大学 Method for monitoring production running state space of oxidation working section in PTA process flow
CN112101445A (en) * 2020-09-09 2020-12-18 浙江大学 Method for forecasting continuous casting billet subsurface slag inclusion defect in real time based on supervised neural network

Also Published As

Publication number Publication date
CN112904810A (en) 2021-06-04

Similar Documents

Publication Publication Date Title
CN109146246B (en) Fault detection method based on automatic encoder and Bayesian network
Jang et al. Adversarial autoencoder based feature learning for fault detection in industrial processes
CN108762228B (en) Distributed PCA-based multi-working-condition fault monitoring method
CN113642754B (en) Complex industrial process fault prediction method based on RF noise reduction self-coding information reconstruction and time convolution network
CN108875772B (en) Fault classification model and method based on stacked sparse Gaussian Bernoulli limited Boltzmann machine and reinforcement learning
Tian et al. Identification of abnormal conditions in high-dimensional chemical process based on feature selection and deep learning
CN112904810B (en) Process industry nonlinear process monitoring method based on effective feature selection
CN111340110B (en) Fault early warning method based on industrial process running state trend analysis
CN105607631B (en) The weak fault model control limit method for building up of batch process and weak fault monitoring method
CN109675935A (en) A kind of IPCA operation of rolling on-line fault diagnosis method becoming control limit
CN112000081B (en) Fault monitoring method and system based on multi-block information extraction and Mahalanobis distance
CN111914897A (en) Fault diagnosis method based on twin long-short time memory network
CN108830006B (en) Linear-nonlinear industrial process fault detection method based on linear evaluation factor
CN111414943B (en) Anomaly detection method based on mixed hidden naive Bayes model
CN114611067A (en) Chemical process slow-change fault detection method based on typical variable dissimilarity analysis
CN113703422B (en) Gas turbine pneumatic actuator fault diagnosis method based on feature analysis processing
CN112947649A (en) Multivariate process monitoring method based on mutual information matrix projection
CN113159088A (en) Fault monitoring and diagnosis method based on multi-feature fusion and width learning
CN113253682B (en) Nonlinear chemical process fault detection method
CN115017978A (en) Fault classification method based on weighted probability neural network
CN110674461B (en) Chemical production process monitoring method based on multi-block projection non-negative matrix decomposition
CN109947082B (en) Process monitoring method based on integrated core independent component analysis model
CN109523195B (en) Sample multi-modal classification-based fault detection method for hierarchical non-Gaussian algorithm
Huang et al. A fault detection method based on stacking the SAE-SRBM for nonstationary and stationary hybrid processes
CN113485269B (en) Industrial process monitoring method based on hidden variable model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant