CN111367253B - Chemical system multi-working-condition fault detection method based on local adaptive standardization - Google Patents

Chemical system multi-working-condition fault detection method based on local adaptive standardization Download PDF

Info

Publication number
CN111367253B
CN111367253B CN202010098141.7A CN202010098141A CN111367253B CN 111367253 B CN111367253 B CN 111367253B CN 202010098141 A CN202010098141 A CN 202010098141A CN 111367253 B CN111367253 B CN 111367253B
Authority
CN
China
Prior art keywords
data
local
train
valid
feature vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010098141.7A
Other languages
Chinese (zh)
Other versions
CN111367253A (en
Inventor
赵劲松
吴昊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University filed Critical Tsinghua University
Priority to CN202010098141.7A priority Critical patent/CN111367253B/en
Publication of CN111367253A publication Critical patent/CN111367253A/en
Application granted granted Critical
Publication of CN111367253B publication Critical patent/CN111367253B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B23/00Testing or monitoring of control systems or parts thereof
    • G05B23/02Electric testing or monitoring
    • G05B23/0205Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults
    • G05B23/0259Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterized by the response to fault detection
    • G05B23/0262Confirmation of fault detection, e.g. extra checks to confirm that a failure has indeed occurred
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/20Pc systems
    • G05B2219/24Pc safety
    • G05B2219/24065Real time diagnostics

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Automation & Control Theory (AREA)
  • Testing Or Calibration Of Command Recording Devices (AREA)

Abstract

The invention relates to a chemical system multi-working-condition fault detection method based on local adaptive standardization, and belongs to the technical field of chemical process monitoring, industrial data processing and process system engineering. The method provides a local self-adaptive standardization method, applies a variation automatic encoder technology of a deep neural network, calculates the average value of data in a local moving window to serve as the average value parameter of the local self-adaptive standardization, uses different average values aiming at different data, and has self-adaptive capacity. The method utilizes local self-adaptive standardization processing to detect whether data in a local moving window deviates or not so as to detect the fault. The method can be suitable for any working condition, has higher accuracy and higher generalization capability, can meet the requirement of real-time detection, and avoids the occurrence of chemical accidents or reduces the harm brought by the accidents by early warning of faults.

Description

Chemical system multi-working-condition fault detection method based on local adaptive standardization
Technical Field
The invention relates to a chemical system multi-working-condition fault detection method based on local adaptive standardization, and belongs to the technical field of chemical process monitoring, industrial data processing and process system engineering.
Background
The safe production in the petrochemical industry relates to each link in the life cycle of chemicals, and because the production link has large chemical quantity and centralized personnel distribution, serious property loss, casualties and environmental damage are caused once an accident occurs. With the continuous progress, popularization and implementation of informatization technology, the chemical industry enters a big data era. The fault detection technology is a basic key technology in the field of chemical process safety, and aims to distinguish whether a chemical system is in a normal operation state or has a fault by collecting and analyzing real-time data of an industrial process.
With the continuous improvement of the automation degree of chemical plants, most of the chemical plants are provided with advanced process control systems and industrial large data storage platforms, so that in recent years, a data-driven chemical fault detection method becomes a research hotspot in academia and industry. Data-driven failure detection mainly includes two types of methods. The first category of methods is multivariate statistical process monitoring methods, including Principal Component Analysis (PCA) and Partial Least Squares (PLS). Because the chemical process has the characteristics of multivariable, dynamic property, nonlinearity and the like, researchers provide a dynamic method and a kernel method based on PCA and PLS in order to be more applied to the chemical process. The second category of methods is deep neural network based methods, including deep belief networks, convolutional neural networks, and Variational auto-encoders (VAEs). The VAE method can train and obtain a monitoring model for chemical engineering fault detection by using normal operation data only. Compared with a multivariate statistical process monitoring method, the deep neural network has higher accuracy, recall rate and higher generalization capability. In recent years, with the development of hardware computing capabilities such as CPUs, GPUs and the like, the computing speed of the method can meet the real-time requirement of industrial data monitoring, and the method has great advantages in practical application. However, with the influence of factors such as raw materials, markets, environment and the like, the chemical device needs to continuously adjust the operation conditions in the production link, namely, the multi-working-condition characteristic exists. The existing chemical fault detection method based on the deep neural network has the problems that normal process variable data are generally assumed to be subjected to normal distribution or unimodal distribution, and the data need to be standardized before being input into a model, so that the model can only be suitable for a single working condition. In the face of the characteristic of multiple working conditions of chemical industry, the existing deep neural network method cannot effectively deal with the problem and cannot complete the fault detection task of the chemical process.
For the multi-working-condition characteristics of chemical industry, the current research generally utilizes a local neighbor standardization combined multivariate statistical process monitoring method, the method uses local neighbor standardization to preprocess data, and then uses the standardized data to model a PCA (principal component analysis) or PLS (partial least squares) method. The original standardization method is to estimate the distribution of variables by using the average value and standard deviation of historical normal operation data, and when online data standardization is carried out, the fixed historical average value and standard deviation are used for calculation, but the differences between the average value and the standard deviation under different working conditions are huge, so that the method can only be used for a single working condition. Local neighbor normalization is performed by finding a local neighbor set of current data in historical normal operating data, and calculating with the mean and standard deviation of the local neighbor set. The local neighbor standardization can find that the current data belongs to a certain working condition, and then the data of the working condition is utilized to carry out standardization processing, so that the data of a plurality of working conditions can be mapped to approximate unimodal distribution, and further the fault detection of historical working conditions can be completed. The method has the problems that the local neighbor standardization still uses historical data to calculate the average value and the standard deviation, is highly dependent on historical working conditions and can only be applied to the historical working conditions. Once the chemical process runs under a new working condition, the fault detection task cannot be finished if neighbor data does not exist in the historical data. Up to now, no universal fault detection method capable of monitoring all working conditions of a chemical process has appeared.
Disclosure of Invention
The invention aims to provide a chemical system multi-working-condition fault detection method based on local adaptive standardization, which is used for overcoming the defects of the existing method, applies a variational automatic encoder technology of a deep neural network, processes data of a local moving window by utilizing the local adaptive standardization, inputs the window data into a variational automatic encoder model to detect whether the deviation trend exists or not, and judges whether the process data is in a normal running state or has a fault or not, so that early warning is carried out when the early data deviates, and the possibility of occurrence of chemical accidents is reduced to the maximum extent.
The invention provides a chemical system multi-working-condition fault detection method based on local adaptive standardization, which comprises the following steps of:
(1) obtaining normal operation data set D under N working conditions from historical database of chemical systemhistoryData set DhistoryThe method comprises the following steps of (1) totally m rows and n columns of data, wherein m represents a process variable of a chemical system, and n represents total operation time;
(2) setting the normal operation data set D in the step (1)historyInto training sets DtrainAnd a verification set DvalidTraining set DtrainComprising m rows ntrainColumn data, validation set DvalidComprising m rows nvalidColumn data, in which training set DtrainD of historical normal operation data sethistoryIn a ratio of
Figure BDA0002385908640000021
60%≤a≤90%;
(3) Training set D in step (2)trainAnd a verification set DvalidCarrying out local self-adaptive standardization processing to obtain a transformed training set TtrainAnd a verification set TvalidThe method comprises the following specific steps:
(3-1) Using the training set D of step (2)trainThe global mean standard deviation gmstd (D) of m process variables in the chemical system is calculated by using the following formulatrain) Including m numbers:
Figure BDA0002385908640000031
wherein i represents the working condition serial number of the chemical process, i is more than or equal to 1 and less than or equal to N, and Dtrain,iRepresentative training set DtrainNormal operating data of the ith operating mode, std (D)train,i) Representative training set DtrainThe standard deviation vector of the ith working condition comprises m numerical values, and std (D) is obtained by calculating the standard deviation of the corresponding variabletrain,i),ntrain,iRepresentative training set DtrainThe amount of normal operation data for the ith condition,
(3-2) training set D for step (2)trainThe k-th normal operation data xk, k in (b) represents the training set DtrainRun time number of (1), 2, ntrain,xkComprises m variable values with time sequence number k, and local moving window data w with time window t is selected forward by calculating timek,wkThe total m rows and t columns of data are provided, wherein t is a time window, t is more than or equal to 10 and less than or equal to 100:
Figure BDA0002385908640000032
utilizing local moving window datawkCalculating wkAverage value of m variables in (1) to obtain mean (w)k),mean(wk) Comprises m numbers;
(3-3) Using gmstd (D) in step (3-1)train) And mean (w) in step (3-2)k) For the local moving window data w of step (3-2)kPerforming local adaptive normalization to wkApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure BDA0002385908640000033
Figure BDA0002385908640000034
(3-4) repeating the step (3-2) and the step (3-3), and sequentially calculating D in the training settrainObtaining a local adaptive standardized training set T by each normal operation datatrain
(3-5) authentication set D for step (2)validP-th normal operation data x in (1)pAnd p represents the verification set DvalidOperating time number of (1), 2, nvalid,xpThe local moving window data w with time window t is selected forward in time and comprises m variable values with time sequence number pp,wpThere are m rows and t columns of data, where t is the time window in step (3-2):
Figure BDA0002385908640000035
using local moving window data wpCalculating wpAverage value of m variables in (1) to obtain mean (w)p),mean(wp) Comprises m numbers;
(3-6) Using gmstd (D) in step (3-1)train) And mean (w) in step (3-5)p) For the local moving window data w of step (3-5)pThe local self-adaptive standardization is carried out,let wpApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure BDA0002385908640000041
Figure BDA0002385908640000042
(3-7) repeating the steps (3-5) and (3-6), and sequentially calculating D in the verification setvalidObtaining a local self-adaptive standardized verification set T from each normal operation datavalid
(4) Constructing a variational automatic encoder which comprises an encoder part and a decoder part and utilizing the training set T obtained in the step (3-4)trainTraining the variational automatic encoder to obtain the trained variational automatic encoder, and specifically comprising the following steps:
(4-1) designing and constructing an encoder by using a convolutional neural network, a cyclic neural network or a deep belief network, and carrying out local adaptive normalization on the local moving window data obtained in the step (3-3)
Figure BDA0002385908640000043
As input to the encoder, the mapping is derived
Figure BDA0002385908640000044
Feature vector σ ofkAnd mukFeature vector σkAnd mukThere are l numbers, l represents the dimension of the feature vector, l is greater than or equal to m and less than or equal to 4 m:
Figure BDA0002385908640000045
Figure BDA0002385908640000046
(4-2) Using the feature vector of step (4-1)σkAnd mukCarrying out reparameterization to obtain
Figure BDA0002385908640000047
Hidden feature vector h ofk,hkIncludes l values:
hk=μkk⊙∈
where e is normally distributed from the standard
Figure BDA0002385908640000048
A random sample results,. indicates multiplication of corresponding elements of the vector;
(4-3) designing and constructing a decoder by using a convolutional neural network, a cyclic neural network or a deep belief network, and enabling the hidden feature vector h in the step (4-2)kAs input to the decoder, reconstructing to obtain the data corresponding to step (3-3)
Figure BDA0002385908640000049
Reconstructed data with the same dimensionality
Figure BDA00023859086400000410
There are m rows and t columns of data:
Figure BDA00023859086400000411
(4-4) utilizing the eigenvector σ of step (4-1) according to the following loss functionkAnd mukAnd the reconstructed data of step (4-3)
Figure BDA00023859086400000412
Calculating the locally adaptive normalized local moving window data of step (3-3)
Figure BDA00023859086400000413
Error of (2)
Figure BDA00023859086400000414
Figure BDA00023859086400000415
Figure BDA00023859086400000416
I.e. the loss function of the variational automatic encoder, the loss function
Figure BDA00023859086400000417
Including reconstruction losses
Figure BDA0002385908640000051
And KL divergence loss
Figure BDA0002385908640000052
λ is the weighting factor of KL divergence loss versus reconstruction loss, 103≤λ≤106The loss of the two parts is calculated as follows, wherein j represents the variable serial number of the chemical process, and j is more than or equal to 1 and less than or equal to m:
Figure BDA0002385908640000053
Figure BDA0002385908640000054
(4-5) repeating the step (4-1) to the step (4-4), and sequentially combining the training sets T in the step (3-4)trainEach data of
Figure BDA0002385908640000055
Inputting the variational automatic encoder to carry out error calculation, and training the variational automatic encoder through an error back propagation algorithm to obtain a trained variational automatic encoder;
(5) utilizing the trained variational automatic encoder obtained in the step (4) and the verification set T obtained in the step (3-7)validBy estimating the verification set TvalidTo obtain a variational automatic encoder for fault detection taskThe specific steps of the time monitoring threshold eta are as follows:
(5-1) local moving window data for local adaptive normalization of step (3-6)
Figure BDA0002385908640000056
Mapping the input of the variational automatic encoder trained in the step (4) to obtain
Figure BDA0002385908640000057
Feature vector σ ofpAnd mupFeature vector σpAnd mupThere are l values, respectively, where l represents the dimension of the feature vector:
Figure BDA0002385908640000058
Figure BDA0002385908640000059
(5-2) Using the feature vector σ of step (5-1)pAnd mupTo, for
Figure BDA00023859086400000510
Carrying out reparameterization to obtain
Figure BDA00023859086400000511
Hidden feature vector h ofp,hpIncludes l values:
hp=μpp⊙∈
where e is normally distributed from the standard
Figure BDA00023859086400000512
A random sample results,. indicates multiplication of corresponding elements of the vector;
(5-3) combining the hidden feature vector h of the step (5-2)pAs the input of the decoder in the variational automatic encoder trained in the step (4), reconstructing to obtain the result of the step (3-6)
Figure BDA00023859086400000513
Reconstructed data with the same dimensionality
Figure BDA00023859086400000514
Figure BDA00023859086400000515
There are m rows and t columns of data:
Figure BDA00023859086400000516
(5-4) utilizing the feature vector σ of step (5-1) according to the following abnormality score calculation formulapAnd mupAnd the reconstructed data of step (5-3)
Figure BDA00023859086400000517
Calculating the local adaptive standardized local moving window number in the step (3-6)
Figure BDA00023859086400000518
Is abnormal score of
Figure BDA00023859086400000519
Figure BDA0002385908640000061
Abnormal score
Figure BDA0002385908640000062
Including reconstruction losses
Figure BDA0002385908640000063
And KL divergence loss
Figure BDA0002385908640000064
λ is a weighting coefficient of KL divergence loss with respect to reconstruction loss, the same as λ of step (4-4); two-part loss calculationWherein j represents the variable serial number of the chemical process, j is more than or equal to 1 and less than or equal to m:
Figure BDA0002385908640000065
Figure BDA0002385908640000066
(5-5) repeating the steps (5-1) to (5-4), and sequentially adding the verification sets T of the steps (3-7)validEach data of
Figure BDA0002385908640000067
Input variable automatic encoder for calculating abnormal score
Figure BDA0002385908640000068
Get the verification set TvalidIs abnormal score data set Svalid
(5-6) abnormal score data set SvalidObtaining abnormal score data set S according to normal distributionvalidThe abnormal fraction with the normal distribution confidence coefficient alpha is used as a monitoring threshold eta of the chemical system, and alpha is more than or equal to 99% and less than or equal to 99.99%;
(6) and (3) carrying out online fault detection on the process data of the chemical system under different working conditions by using the variational automatic encoder trained in the step (4) and the monitoring threshold eta obtained in the step (5), wherein the method comprises the following steps:
(6-1) collecting process data from a real-time database of the chemical system at the current detection moment q, and selecting local moving window data w with a time window t from the time to the frontq,wqThere are m rows and t columns of data, where t is the time window in step (3-2):
Figure BDA0002385908640000069
using local moving window data wqCalculating wqAverage of m variables inValue, to obtain mean (w)q),mean(wq) Comprises m numbers;
(6-2) Using gmstd (Dtrain) in step (3-1) and mean (w) in step (6-1)q) For the local moving window data w of step (6-1)qPerforming local adaptive normalization to wqApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure BDA00023859086400000610
Figure BDA00023859086400000611
(6-3) local moving window data for local adaptive normalization of step (6-2)
Figure BDA00023859086400000612
Mapping the input of the encoder in the variational automatic encoder trained in the step (4) to obtain
Figure BDA0002385908640000071
Feature vector σ ofqAnd muqThe two feature vectors have values of l, wherein l represents the dimension of the feature vector and has the same size as l in the step (4-1):
Figure BDA0002385908640000072
Figure BDA0002385908640000073
(6-4) Using the feature vector σ of step (6-3)qAnd muqCarrying out reparameterization to obtain
Figure BDA0002385908640000074
Hidden feature vector h ofq,hqComprisingIndividual values:
hq=μqq⊙∈
where e is normally distributed from the standard
Figure BDA0002385908640000075
A random sample results,. indicates multiplication of corresponding elements of the vector;
(6-5) hiding the feature vector h of the step (6-4)qAs the input of the decoder in the variational automatic encoder trained in the step (4), reconstructing to obtain the result corresponding to the step (6-2)
Figure BDA0002385908640000076
Reconstructed data with the same dimensionality
Figure BDA0002385908640000077
There are m rows and t columns of data:
Figure BDA0002385908640000078
(6-6) utilizing the feature vector σ of step (6-3) according to the following abnormality score calculation formulaqAnd muqAnd the reconstructed data of step (6-5)
Figure BDA0002385908640000079
Calculating the locally adaptive normalized local moving window data of step (6-2)
Figure BDA00023859086400000710
Is abnormal score of
Figure BDA00023859086400000711
Figure BDA00023859086400000712
Abnormal score
Figure BDA00023859086400000713
Including reconstruction losses
Figure BDA00023859086400000714
And KL divergence loss
Figure BDA00023859086400000715
λ is a weighting coefficient of KL divergence loss with respect to reconstruction loss, the same as λ of step (4-4); the loss of the two parts is calculated as follows, wherein j represents the variable serial number of the chemical process, j is more than or equal to 1 and less than or equal to m:
Figure BDA00023859086400000716
Figure BDA00023859086400000717
(6-7) scoring the abnormality of step (6-6)
Figure BDA00023859086400000718
Comparing with the monitoring threshold eta obtained in the step (5), if so
Figure BDA00023859086400000719
The current chemical system is in a normal operation state, the step (6-1) is returned to continue monitoring the online real-time data, and if the online real-time data is not monitored, the chemical system is in a normal operation state
Figure BDA00023859086400000720
The system fault of the current chemical system is indicated, and fault warning is sent out, so that the multi-working-condition fault detection of the chemical system based on local adaptive standardization is realized.
The invention provides a chemical system multi-working-condition fault detection method based on local adaptive standardization, which has the advantages that:
the invention discloses a chemical system multi-working-condition fault detection method based on local adaptive standardization, which is different from the existing fault detection method by providing a local adaptive standardization method and applying a variational automatic encoder technology of a deep neural network. The method is different from other existing detection methods which detect the deviation degree of the current data and the normal operation data, and the fault detection is carried out by detecting whether the data in the local moving window deviates or not by utilizing local self-adaptive standardization processing. Therefore, the method can be suitable for any working condition, and not only can be applied to the historical existing working condition, but also can be applied to the historical non-occurring working condition. In addition, the invention combines and applies the variational automatic encoder to detect whether the current window data has the deviation trend, and has higher accuracy and stronger generalization capability compared with the traditional multivariate statistical method. The invention can meet the requirement of real-time detection, can be applied to the fault detection task of the chemical system under all working conditions of the chemical process, and avoids the occurrence of chemical accidents or reduces the harm brought by the accidents by early warning of the faults.
Drawings
FIG. 1 is a block diagram of the overall process of the method of the present invention.
Fig. 2 is a schematic diagram of a variational auto-encoder configuration in an embodiment of the present invention.
Fig. 3 is a schematic diagram of a fault detection result under different working conditions according to an embodiment of the present invention.
Detailed Description
The invention provides a chemical system multi-working-condition fault detection method based on local adaptive standardization, which has an overall flow diagram shown in figure 1 and comprises the following steps:
(1) obtaining normal operation data set D under N working conditions from historical database of chemical systemhistoryData set DhistoryThere are m rows and n columns of data, where m represents process variables of the chemical system, such as temperature, time, pressure, etc., and n represents total run time;
(2) setting the normal operation data set D in the step (1)historyInto training sets DtrainAnd a verification set DvalidTraining set DtrainComprising m rows ntrainColumn data, validation set DvalidComprising m rows nvalidColumn data, in which training set DtrainD of historical normal operation data sethistoryIn a ratio of
Figure BDA0002385908640000081
60%≤a≤90%;
(3) Training set D in step (2)trainAnd a verification set DvalidCarrying out local self-adaptive standardization processing to obtain a transformed training set TtrainAnd a verification set TvalidThe method comprises the following specific steps:
(3-1) Using the training set D of step (2)trainThe global mean standard deviation gmstd (D) of m process variables in the chemical system is calculated by using the following formulatrain) Including m numbers:
Figure BDA0002385908640000091
wherein i represents the working condition serial number of the chemical process, i is more than or equal to 1 and less than or equal to N, and Dtrain,iRepresentative training set DtrainNormal operating data of the ith operating mode, std (D)train,i) Representative training set DtrainThe standard deviation vector of the ith working condition comprises m numerical values, and std (D) is obtained by calculating the standard deviation of the corresponding variabletrain,i),ntrain,iRepresentative training set DtrainThe amount of normal operation data for the ith condition,
(3-2) training set D for step (2)trainThe k-th normal operation data xk, k in (b) represents the training set DtrainRun time number of (1), 2, nttain,xkComprises m variable values with time sequence number k, and local moving window data w with time window t is selected forward by calculating timek,wkThe total m rows and t columns of data are provided, wherein t is a time window, t is more than or equal to 10 and less than or equal to 100:
Figure BDA0002385908640000092
using local moving window data wkCalculating wkAverage value of m variables in (1) to obtain mean (w)k),mean(wk) Comprises m numbers;
(3-3) Using gmstd (D) in step (3-1)train) And mean (w) in step (3-2)k) For the local moving window data w of step (3-2)kPerforming local adaptive normalization to wkApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure BDA0002385908640000093
Figure BDA0002385908640000094
(3-4) repeating the step (3-2) and the step (3-3), and sequentially calculating D in the training settrainObtaining a local adaptive standardized training set T by each normal operation datatrain
(3-5) authentication set D for step (2)validP-th normal operation data x in (1)pAnd p represents the verification set DvalidOperating time number of (1), 2, nvalid,xpThe local moving window data w with time window t is selected forward in time and comprises m variable values with time sequence number pp,wpThere are m rows and t columns of data, where t is the time window in step (3-2):
Figure BDA0002385908640000101
using local moving window data wpCalculating wpAverage value of m variables in (1) to obtain mean (w)p),mean(wp) Comprises m numbers;
(3-6) Using gmstd (D) in step (3-1)train) And mean (w) in step (3-5)p) For the local moving window data w of step (3-5)pPerforming local adaptive normalization to wpApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure BDA0002385908640000102
Figure BDA0002385908640000103
(3-7) repeating the steps (3-5) and (3-6), and sequentially calculating D in the verification setvalidObtaining a local self-adaptive standardized verification set T from each normal operation datavalid
(4) Constructing a variational automatic encoder which comprises an encoder part and a decoder part and utilizing the training set T obtained in the step (3-4)trainTraining the variational automatic encoder to obtain the trained variational automatic encoder, and specifically comprising the following steps:
(4-1) designing and constructing an encoder by using a convolutional neural network, a cyclic neural network or a deep belief network, and carrying out local adaptive normalization on the local moving window data obtained in the step (3-3)
Figure BDA0002385908640000104
As input to the encoder, the mapping is derived
Figure BDA0002385908640000105
Feature vector σ ofkAnd mukFeature vector σkAnd mukThere are l numbers, l represents the dimension of the feature vector, l is greater than or equal to m and less than or equal to 4 m:
Figure BDA0002385908640000106
Figure BDA0002385908640000107
(4-2) Using the feature vector σ of step (4-1)kAnd mukCarrying out reparameterization to obtain
Figure BDA0002385908640000108
Hidden feature vector h ofk,hkIncludes l values:
hk=μkk⊙∈
where e is normally distributed from the standard
Figure BDA0002385908640000109
A random sample results,. indicates multiplication of corresponding elements of the vector;
(4-3) designing and constructing a decoder by using a convolutional neural network, a cyclic neural network or a deep belief network, and enabling the hidden feature vector h in the step (4-2)kAs input to the decoder, reconstructing to obtain the data corresponding to step (3-3)
Figure BDA00023859086400001010
Reconstructed data with the same dimensionality
Figure BDA00023859086400001011
Figure BDA00023859086400001012
There are m rows and t columns of data:
Figure BDA00023859086400001013
(4-4) utilizing the eigenvector σ of step (4-1) according to the following loss functionkAnd mukAnd the reconstructed data of step (4-3)
Figure BDA0002385908640000111
Calculating the locally adaptive normalized local moving window data of step (3-3)
Figure BDA0002385908640000112
Error of (2)
Figure BDA0002385908640000113
Figure BDA0002385908640000114
Figure BDA0002385908640000115
I.e. the loss function of the variational automatic encoder, the loss function
Figure BDA0002385908640000116
Including reconstruction losses
Figure BDA0002385908640000117
And KL divergence loss
Figure BDA0002385908640000118
λ is the weighting factor of KL divergence loss versus reconstruction loss, 103≤λ≤106The loss of the two parts is calculated as follows, wherein j represents the variable serial number of the chemical process, and j is more than or equal to 1 and less than or equal to m:
Figure BDA0002385908640000119
Figure BDA00023859086400001110
(4-5) repeating the step (4-1) to the step (4-4), and sequentially combining the training sets T in the step (3-4)trainEach data of
Figure BDA00023859086400001111
Inputting the variational automatic encoder to carry out error calculation, and training the variational automatic encoder through an error back propagation algorithm to obtain the variational automatic encoderThe trained variational automatic encoder is used for a fault detection task of the chemical system;
(5) utilizing the trained variational automatic encoder obtained in the step (4) and the verification set T obtained in the step (3-7)validBy estimating the verification set TvalidThe abnormal score confidence interval of the variable automatic encoder is used for obtaining a monitoring threshold eta when the variable automatic encoder is used for a fault detection task, and the method specifically comprises the following steps:
(5-1) local moving window data for local adaptive normalization of step (3-6)
Figure BDA00023859086400001112
Mapping the input of the variational automatic encoder trained in the step (4) to obtain
Figure BDA00023859086400001113
Feature vector σ ofpAnd mupFeature vector σpAnd mupThere are l numbers, respectively, l represents the dimension of the feature vector, and has the same size as l of step (4-1):
Figure BDA00023859086400001114
Figure BDA00023859086400001115
(5-2) Using the feature vector σ of step (5-1)pAnd mupTo, for
Figure BDA00023859086400001116
Carrying out reparameterization to obtain
Figure BDA00023859086400001117
Hidden feature vector h ofp,hpIncludes l values:
hp=μpp⊙∈
where e is normally distributed from the standard
Figure BDA00023859086400001118
A random sample results,. indicates multiplication of corresponding elements of the vector;
(5-3) combining the hidden feature vector h of the step (5-2)pAs the input of the decoder in the variational automatic encoder trained in the step (4), reconstructing to obtain the result of the step (3-6)
Figure BDA00023859086400001119
Reconstructed data with the same dimensionality
Figure BDA00023859086400001120
Figure BDA00023859086400001121
There are m rows and t columns of data:
Figure BDA00023859086400001122
(5-4) utilizing the feature vector σ of step (5-1) according to the following abnormality score calculation formulapAnd mupAnd the reconstructed data of step (5-3)
Figure BDA0002385908640000121
Calculating the local adaptive standardized local moving window number in the step (3-6)
Figure BDA0002385908640000122
Is abnormal score of
Figure BDA0002385908640000123
Figure BDA0002385908640000124
Abnormal score
Figure BDA0002385908640000125
Including reconstruction losses
Figure BDA0002385908640000126
And KL divergence loss
Figure BDA0002385908640000127
λ is a weighting coefficient of KL divergence loss with respect to reconstruction loss, the same as λ of step (4-4); the loss of the two parts is calculated as follows, wherein j represents the variable serial number of the chemical process, j is more than or equal to 1 and less than or equal to m:
Figure BDA0002385908640000128
Figure BDA0002385908640000129
(5-5) repeating the steps (5-1) to (5-4), and sequentially adding the verification sets T of the steps (3-7)validEach data of
Figure BDA00023859086400001210
Input variable automatic encoder for calculating abnormal score
Figure BDA00023859086400001211
Get the verification set TvalidIs abnormal score data set Svalid
(5-6) abnormal score data set SvalidObtaining abnormal score data set S according to normal distributionvalidThe abnormal fraction with the normal distribution confidence coefficient alpha is used as a monitoring threshold eta of the chemical system, and alpha is more than or equal to 99% and less than or equal to 99.99%;
(6) and (3) carrying out online fault detection on the process data of the chemical system under different working conditions by using the variational automatic encoder trained in the step (4) and the monitoring threshold eta obtained in the step (5), wherein the method comprises the following steps:
(6-1) collecting process data from a real-time database of the chemical system at the current detection moment q, and selecting local moving window data w with a time window t from the time to the frontq,wqThere are m rows and t columns of data, where t is the time window in step (3-2):
Figure BDA00023859086400001212
using local moving window data wqCalculating wqAverage value of m variables in (1) to obtain mean (w)q),mean(wq) Comprises m numbers;
(6-2) Using gmstd (Dtrain) in step (3-1) and mean (w) in step (6-1)q) For the local moving window data w of step (6-1)qPerforming local adaptive normalization to wqApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure BDA00023859086400001213
Figure BDA0002385908640000131
(6-3) local moving window data for local adaptive normalization of step (6-2)
Figure BDA0002385908640000132
Mapping the input of the encoder in the variational automatic encoder trained in the step (4) to obtain
Figure BDA0002385908640000133
Feature vector σ ofqAnd vqThe two feature vectors have values of l, wherein l represents the dimension of the feature vector and has the same size as l in the step (4-1):
Figure BDA0002385908640000134
Figure BDA0002385908640000135
(6-4) Using the feature vector σ of step (6-3)qAnd muqCarrying out reparameterization to obtain
Figure BDA0002385908640000136
Hidden feature vector hq, h ofqIncludes l values:
hq=μqq⊙∈
where e is normally distributed from the standard
Figure BDA0002385908640000137
A random sample results,. indicates multiplication of corresponding elements of the vector;
(6-5) hiding the feature vector h of the step (6-4)qAs the input of the decoder in the variational automatic encoder trained in the step (4), reconstructing to obtain the result corresponding to the step (6-2)
Figure BDA0002385908640000138
Reconstructed data with the same dimensionality
Figure BDA0002385908640000139
There are m rows and t columns of data:
Figure BDA00023859086400001310
(6-6) utilizing the feature vector σ of step (6-3) according to the following abnormality score calculation formulaqAnd muqAnd the reconstructed data of step (6-5)
Figure BDA00023859086400001311
Calculating the locally adaptive normalized local moving window data of step (6-2)
Figure BDA00023859086400001312
Is abnormal score of
Figure BDA00023859086400001313
Figure BDA00023859086400001314
Abnormal score
Figure BDA00023859086400001315
Including reconstruction losses
Figure BDA00023859086400001316
And KL divergence loss
Figure BDA00023859086400001317
λ is a weighting coefficient of KL divergence loss with respect to reconstruction loss, the same as λ of step (4-4); the loss of the two parts is calculated as follows, wherein j represents the variable serial number of the chemical process, j is more than or equal to 1 and less than or equal to m:
Figure BDA00023859086400001318
Figure BDA00023859086400001319
(6-7) scoring the abnormality of step (6-6)
Figure BDA0002385908640000141
Comparing with the monitoring threshold eta obtained in the step (5), if so
Figure BDA0002385908640000142
The current chemical system is in a normal operation state, the step (6-1) is returned to continue monitoring the online real-time data, and if the online real-time data is not monitored, the chemical system is in a normal operation state
Figure BDA0002385908640000143
The system fault of the current chemical system is indicated, and fault warning is sent out, so that the chemical system based on local adaptive standardization is moreAnd detecting working condition faults.
Embodiments of the method of the present invention are described below with reference to the accompanying drawings:
(1) acquiring a normal operation data set D under 2 working conditions from a historical database of a chemical systemhistoryData set DhistoryThe method comprises the following steps of (1) sharing m rows and n columns of data, wherein m is 42 to represent a process variable of a chemical system, n is 16000 to represent total operation time, and each working condition comprises 8000 operation times;
(2) setting the normal operation data set D in the step (1)historyInto training sets DtrainAnd a verification set DvalidTraining set DtrainComprising m rows ntrainColumn data, validation set DvalidComprising m rows nvalidColumn data, in which training set DtrainD of historical normal operation data sethistoryThe ratio of a to 75%, ntrain=12000,nvalid=4000;
(3) Training set D in step (2)trainAnd a verification set DvalidCarrying out local self-adaptive standardization processing to obtain a transformed training set TtrainAnd a verification set TvalidThe method comprises the following specific steps:
(3-1) Using the training set D of step (2)trainThe global mean standard deviation gmstd (D) of m process variables in the chemical system is calculated by using the following formulatrain) And m is 42 values:
Figure BDA0002385908640000144
wherein i represents the working condition serial number of the chemical process, i is more than or equal to 1 and less than or equal to N, and Dtrain,iRepresentative training set DtrainNormal operating data of the ith operating mode, std (D)train,i) Representative training set DtrainThe standard deviation vector of the ith working condition comprises m numerical values, and std (D) is obtained by calculating the standard deviation of the corresponding variabletrain,i),ntrain,iRepresentative training set DtrainNumber of normal operating data of the ith working condition, ntrain,i=6000;
(3-2) training set D for step (2)trainThe k-th normal operation data xk, k in (b) represents the training set DtrainRun time number of (1), 2, ntrain,xkComprises m variable values with time sequence number k, and local moving window data w with time window t is selected forward by calculating timek,wkThere are m rows and t columns of data, where t is the time window, m is 42, t is 30:
Figure BDA0002385908640000151
using local moving window data wkCalculating wkAverage value of m variables in (1) to obtain mean (w)k),mean(wk) Comprises m numbers;
(3-3) Using gmstd (D) in step (3-1)train) And mean (w) in step (3-2)k) For the local moving window data w of step (3-2)kPerforming local adaptive normalization to wkApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure BDA0002385908640000152
Figure BDA0002385908640000153
(3-4) repeating the step (3-2) and the step (3-3), and sequentially calculating D in the training settrainObtaining a local adaptive standardized training set T by each normal operation datatrain
(3-5) authentication set D for step (2)validP-th normal operation data x in (1)pAnd p represents the verification set DvalidOperating time number of (1), 2, nvalid,xpLocal movement comprising m variable values with time sequence number p, with time forward selecting time window tWindow data wp,wpThere are m rows and t columns of data, where t is the time window in step (3-2), m is 42, t is 30:
Figure BDA0002385908640000154
using local moving window data wpCalculating wpAverage value of m variables in (1) to obtain mean (w)p),mean(wp) Comprises m numbers;
(3-6) Using gmstd (D) in step (3-1)train) And mean (w) in step (3-5)p) For the local moving window data w of step (3-5)pPerforming local adaptive normalization to wpApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure BDA0002385908640000155
Figure BDA0002385908640000156
(3-7) repeating the steps (3-5) and (3-6), and sequentially calculating D in the verification setvalidObtaining a local self-adaptive standardized verification set T from each normal operation datavalid
(4) Constructing a variational automatic encoder which comprises an encoder part and a decoder part and utilizing the training set T obtained in the step (3-4)trainTraining the variational automatic encoder to obtain the trained variational automatic encoder, and specifically comprising the following steps:
(4-1) designing and constructing an encoder by using a bidirectional Long Short-term memory (BilSTM) and a linear layer, wherein the encoder has a structure shown in figure 2 and comprises two layers of BilSTM and the linear layer, and the local adaptive standardized local moving window data in the step (3-3) is processed
Figure BDA0002385908640000161
As input to the encoder, the mapping is derived
Figure BDA0002385908640000162
Feature vector σ ofkAnd mukFeature vector σkAnd mukThere are l values, l represents the dimension of the feature vector, l is 50:
Figure BDA0002385908640000163
Figure BDA0002385908640000164
(4-2) Using the feature vector σ of step (4-1)kAnd mukCarrying out reparameterization to obtain
Figure BDA0002385908640000165
Hidden feature vector h ofk,hkIncludes l values, l 50:
hk=μkk⊙∈
where e is normally distributed from the standard
Figure BDA0002385908640000166
A random sample results,. indicates multiplication of corresponding elements of the vector;
(4-3) designing and constructing a decoder by using a bidirectional Long Short-term Memory (BilSTM) and a linear layer, wherein the decoder has a structure shown in figure 2 and comprises two layers of BilSTM and the linear layer, and the hidden feature vector h in the step (4-2) is processedkAs input to the decoder, reconstructing to obtain the data corresponding to step (3-3)
Figure BDA0002385908640000167
Reconstructed data with the same dimensionality
Figure BDA0002385908640000168
Figure BDA0002385908640000169
There are m rows and t columns of data, m is 42, t is 30:
Figure BDA00023859086400001610
(4-4) utilizing the eigenvector σ of step (4-1) according to the following loss functionkAnd mukAnd the reconstructed data of step (4-3)
Figure BDA00023859086400001611
Calculating the locally adaptive normalized local moving window data of step (3-3)
Figure BDA00023859086400001612
Error of (2)
Figure BDA00023859086400001613
Figure BDA00023859086400001614
Figure BDA00023859086400001615
I.e. the loss function of the variational automatic encoder, the loss function
Figure BDA00023859086400001616
Including reconstruction losses
Figure BDA00023859086400001617
And KL divergence loss
Figure BDA00023859086400001618
λ is the weighting factor of KL divergence loss versus reconstruction loss, λ is 105The loss of the two parts is calculated as follows, wherein j represents the variable serial number of the chemical process, j is more than or equal to 1 and less than or equal to m, and m is 42:
Figure BDA00023859086400001619
Figure BDA00023859086400001620
(4-5) repeating the step (4-1) to the step (4-4), and sequentially combining the training sets T in the step (3-4)trainEach data of
Figure BDA00023859086400001621
Inputting the variational automatic encoder to carry out error calculation, and training the variational automatic encoder through an error back propagation algorithm to obtain a trained variational automatic encoder;
(5) utilizing the trained variational automatic encoder obtained in the step (4) and the verification set T obtained in the step (3-7)validBy estimating the verification set TvalidThe abnormal score confidence interval of the variable automatic encoder is used for obtaining a monitoring threshold eta when the variable automatic encoder is used for a fault detection task, and the method specifically comprises the following steps:
(5-1) local moving window data for local adaptive normalization of step (3-6)
Figure BDA0002385908640000171
Mapping the input of the variational automatic encoder trained in the step (4) to obtain
Figure BDA0002385908640000172
Feature vector σ ofpAnd mupFeature vector σpAnd mupThere are l values, l represents the dimension of the feature vector, l is 50:
Figure BDA0002385908640000173
Figure BDA0002385908640000174
(5-2) Using the feature vector σ of step (5-1)pAnd mupTo, for
Figure BDA0002385908640000175
Carrying out reparameterization to obtain
Figure BDA0002385908640000176
Hidden feature vector h ofp,hpIncludes l values, l 50:
hp=μpp⊙∈
where e is normally distributed from the standard
Figure BDA0002385908640000177
A random sample results,. indicates multiplication of corresponding elements of the vector;
(5-3) combining the hidden feature vector h of the step (5-2)pAs the input of the decoder in the variational automatic encoder trained in the step (4), reconstructing to obtain the result of the step (3-6)
Figure BDA0002385908640000178
Reconstructed data with the same dimensionality
Figure BDA0002385908640000179
Figure BDA00023859086400001710
There are m rows and t columns of data, m is 42, t is 30:
Figure BDA00023859086400001711
(5-4) utilizing the feature vector σ of step (5-1) according to the following abnormality score calculation formulapAnd mupAnd the reconstructed data of step (5-3)
Figure BDA00023859086400001712
Computing step (3-6) local adaptationNormalized local moving window data
Figure BDA00023859086400001713
Is abnormal score of
Figure BDA00023859086400001714
Figure BDA00023859086400001715
Abnormal score
Figure BDA00023859086400001716
Including reconstruction losses
Figure BDA00023859086400001717
And KL divergence loss
Figure BDA00023859086400001718
λ is a weighting coefficient of KL divergence loss with respect to reconstruction loss, and is the same as λ in step (4-4), where λ is 105(ii) a The loss of the two parts is calculated as follows, wherein j represents the variable serial number of the chemical process, j is more than or equal to 1 and less than or equal to m, and m is 42:
Figure BDA00023859086400001719
Figure BDA00023859086400001720
(5-5) repeating the steps (5-1) to (5-4), and sequentially adding the verification sets T of the steps (3-7)validEach data of
Figure BDA0002385908640000181
Input variable automatic encoder for calculating abnormal score
Figure BDA0002385908640000182
Get the verification set TvalidIs abnormal score data set Svalid
(5-6) abnormal score data set SvalidObtaining abnormal score data set S according to normal distributionvalidThe abnormal fraction with the normal distribution confidence coefficient alpha is used as a monitoring threshold eta of the chemical system, and the alpha is 99.9 percent;
(6) and (3) carrying out online fault detection on the process data of the chemical system under different working conditions by using the variational automatic encoder trained in the step (4) and the monitoring threshold eta obtained in the step (5), wherein the method comprises the following steps:
(6-1) collecting the process data of 4 working conditions from the database of the chemical system as a test set DtestIn total, m rows ntestColumn data, m 42, ntest4 (2000+4 × 1650), wherein the 4 working conditions include 2 historical working conditions and 2 new working conditions in the step (1), and are used for testing the fault detection effect of the invention under different working conditions. Each condition includes normal operating data and 4 types of failed operating data. Wherein, each operating mode includes 2000 normal operating data, and each operating mode of the operating data that breaks down includes 4 fault types, and each fault type includes 1650 operating data, and preceding 450 operating data still belongs to normal operating data, introduces the trouble from 450 operating data, and 1200 last operating data belong to trouble operating data, and 4 fault types are as shown in the following table:
table 1 4 fault types in test data
Figure BDA0002385908640000183
For test set DtestQ-th normal operation data x in (1)qAnd q represents test set DtestOperating time sequence number q 1, 2, ntest,xqComprises m variable values with time sequence number q, and local moving window data w with time window t is selected forward by calculating timeq,wqThere are m rows and t columns of data, where t is the time window in step (3-2), m is 42, t is 30:
Figure BDA0002385908640000184
using local moving window data wqCalculating wqAverage value of m variables in (1) to obtain mean (w)q),mean(wq) Comprises m numbers;
(6-2) Using gmstd (Dtrain) in step (3-1) and mean (w) in step (6-1)q) For the local moving window data w of step (6-1)qPerforming local adaptive normalization to wqApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure BDA0002385908640000191
Figure BDA0002385908640000192
(6-3) local moving window data for local adaptive normalization of step (6-2)
Figure BDA0002385908640000193
Mapping the input of the encoder in the variational automatic encoder trained in the step (4) to obtain
Figure BDA0002385908640000194
Feature vector σ ofqAnd muqThe two eigenvectors have values of l, l represents the dimension of the eigenvector and has the same size as l in step (4-1), and l is 50:
Figure BDA0002385908640000195
Figure BDA0002385908640000196
(6-4) advantageUsing the feature vector σ of step (6-3)qAnd muqCarrying out reparameterization to obtain
Figure BDA0002385908640000197
Hidden feature vector h ofq,hqIncludes l values, l 50:
hq=μqq⊙∈
where e is normally distributed from the standard
Figure BDA0002385908640000198
A random sample results,. indicates multiplication of corresponding elements of the vector;
(6-5) hiding the feature vector h of the step (6-4)qAs the input of the decoder in the variational automatic encoder trained in the step (4), reconstructing to obtain the result corresponding to the step (6-2)
Figure BDA0002385908640000199
Reconstructed data with the same dimensionality
Figure BDA00023859086400001910
There are m rows and t columns of data, m is 42, t is 30:
Figure BDA00023859086400001911
(6-6) utilizing the feature vector σ of step (6-3) according to the following abnormality score calculation formulaqAnd muqAnd the reconstructed data of step (6-5)
Figure BDA00023859086400001912
Calculating the locally adaptive normalized local moving window data of step (6-2)
Figure BDA00023859086400001913
Is abnormal score of
Figure BDA00023859086400001914
Figure BDA00023859086400001915
Abnormal score
Figure BDA00023859086400001916
Including reconstruction losses
Figure BDA00023859086400001917
And KL divergence loss.
Figure BDA00023859086400001918
λ is a weighting coefficient of KL divergence loss with respect to reconstruction loss, and is the same as λ in step (4-4), where λ is 105(ii) a The loss of the two parts is calculated as follows, wherein j represents the variable serial number of the chemical process, j is more than or equal to 1 and less than or equal to m, and m is 42:
Figure BDA0002385908640000201
Figure BDA0002385908640000202
(6-7) scoring the abnormality of step (6-6)
Figure BDA0002385908640000203
Comparing with the monitoring threshold eta obtained in the step (5), if so
Figure BDA0002385908640000204
The current chemical system is in a normal operation state, the step (6-1) is returned to continue monitoring the online real-time data, and if the online real-time data is not monitored, the chemical system is in a normal operation state
Figure BDA0002385908640000205
The system fault of the current chemical system is indicated, and fault warning is sent out, so that the multi-working-condition fault detection of the chemical system based on local adaptive standardization is realized.
According to the above determination rule, fig. 3 shows the effect of fault detection in the present embodiment under the 4 conditions of step (6-1). Wherein, the working condition 1 and the working condition 2 represent 2 historical working conditions of the step (1), and the working condition 3 and the working condition 4 represent new working conditions which are not used in the step (4) and the step (5). Fig. 3 (a) to (e) show the monitoring effects of the fault detection model on the normal operation data and the fault operation data of the faults 1 to 4, respectively. The abscissa of each sub-graph represents the running time, the ordinate represents the anomaly score, the abscissa represents the monitoring threshold η obtained in step (5), and the vertical dotted line represents the introduction of a fault starting from the 450 th running data in step (1). If the black solid line is lower than the monitoring threshold represented by the dotted line, the normal operation of the chemical system is indicated; if the solid black line is higher than the monitoring threshold represented by the horizontal dashed line, it indicates that the chemical system is malfunctioning. As shown in fig. 3, in (a), the black solid line of the normal operation data under 4 working conditions is below the monitoring threshold (horizontal dotted line), which proves that the method can correctly determine the normal operation of the chemical system; and (b) to (e), the black solid line of the 4 working condition fault operation data is higher than the monitoring threshold (horizontal dotted line) from 450 (vertical dotted line), so that the method can be used for correctly judging that the chemical system has faults. The invention has similar fault detection results for the operation data of the working conditions 1-4, and proves that the fault detection method based on the local adaptive standardization has better detection effect under all the working conditions.

Claims (1)

1. A chemical system multi-working-condition fault detection method based on local adaptive standardization is characterized by comprising the following steps:
(1) obtaining normal operation data set D under N working conditions from historical database of chemical systemhistoryData set DhistoryThe method comprises the following steps of (1) totally m rows and n columns of data, wherein m represents a process variable of a chemical system, and n represents total operation time;
(2) setting the normal operation data set D in the step (1)historyInto training sets DtrainAnd a verification set DvalidTraining set DtrainComprising m rows ntrainColumn data, validation set DvalidComprising m rows nvalidColumn data, in which training setDtrainD of historical normal operation data sethistoryIn a ratio of
Figure FDA0002385908630000011
60%≤a≤90%;
(3) Training set D in step (2)trainAnd a verification set DvalidCarrying out local self-adaptive standardization processing to obtain a transformed training set TtrainAnd a verification set TvalidThe method comprises the following specific steps:
(3-1) Using the training set D of step (2)trainThe global mean standard deviation gmstd (D) of m process variables in the chemical system is calculated by using the following formulatrain) Including m numbers:
Figure FDA0002385908630000012
wherein i represents the working condition serial number of the chemical process, i is more than or equal to 1 and less than or equal to N, and Dtrain,iRepresentative training set DtrainNormal operating data of the ith operating mode, std (D)train,i) Representative training set DtrainThe standard deviation vector of the ith working condition comprises m numerical values, and std (D) is obtained by calculating the standard deviation of the corresponding variabletrain,i),ntrain,iRepresentative training set DtrainThe amount of normal operation data for the ith condition,
(3-2) training set D for step (2)trainThe kth normal operation data x in (1)kK represents the training set DtrainRun time number of (1), 2, ntrain,xkComprises m variable values with time sequence number k, and local moving window data w with time window t is selected forward by calculating timek,wkThe total m rows and t columns of data are provided, wherein t is a time window, t is more than or equal to 10 and less than or equal to 100:
Figure FDA0002385908630000013
using local moving window data wkCalculating wkAverage value of m variables in (1) to obtain mean (w)k),mean(wk) Comprises m numbers;
(3-3) Using gmstd (D) in step (3-1)train) And mean (w) in step (3-2)k) For the local moving window data w of step (3-2)kPerforming local adaptive normalization to wkApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure FDA0002385908630000021
Figure FDA0002385908630000022
(3-4) repeating the step (3-2) and the step (3-3), and sequentially calculating D in the training settrainObtaining a local adaptive standardized training set T by each normal operation datatrain
(3-5) authentication set D for step (2)validP-th normal operation data x in (1)pAnd p represents the verification set DvalidOperating time number of (1), 2, nvalid,xpThe local moving window data w with time window t is selected forward in time and comprises m variable values with time sequence number pp,wpThere are m rows and t columns of data, where t is the time window in step (3-2):
Figure FDA0002385908630000023
using local moving window data wpCalculating wpAverage value of m variables in (1) to obtain mean (w)p),mean(wp) Comprises m numbers;
(3-6) Using gmstd (D) in step (3-1)train) And mean (w) in step (3-5)p) For the local moving window data w of step (3-5)pPerforming local adaptive normalization to wpApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure FDA0002385908630000024
Figure FDA0002385908630000025
(3-7) repeating the steps (3-5) and (3-6), and sequentially calculating D in the verification setvalidObtaining a local self-adaptive standardized verification set T from each normal operation datavalid
(4) Constructing a variational automatic encoder which comprises an encoder part and a decoder part and utilizing the training set T obtained in the step (3-4)trainTraining the variational automatic encoder to obtain the trained variational automatic encoder, and specifically comprising the following steps:
(4-1) designing and constructing an encoder by using a convolutional neural network, a cyclic neural network or a deep belief network, and carrying out local adaptive normalization on the local moving window data obtained in the step (3-3)
Figure FDA0002385908630000026
As input to the encoder, the mapping is derived
Figure FDA0002385908630000027
Feature vector σ ofkAnd mukFeature vector σkAnd mukThere are l numbers, l represents the dimension of the feature vector, l is greater than or equal to m and less than or equal to 4 m:
Figure FDA0002385908630000031
Figure FDA0002385908630000032
(4-2) Using the feature vector σ of step (4-1)kAnd mukCarrying out reparameterization to obtain
Figure FDA0002385908630000033
Hidden feature vector h ofk,hkIncludes l values:
hk=μkk⊙∈
where e is normally distributed from the standard
Figure FDA0002385908630000034
A random sample results,. indicates multiplication of corresponding elements of the vector;
(4-3) designing and constructing a decoder by using a convolutional neural network, a cyclic neural network or a deep belief network, and enabling the hidden feature vector h in the step (4-2)kAs input to the decoder, reconstructing to obtain the data corresponding to step (3-3)
Figure FDA0002385908630000035
Reconstructed data with the same dimensionality
Figure FDA0002385908630000036
Figure FDA0002385908630000037
There are m rows and t columns of data:
Figure FDA0002385908630000038
(4-4) utilizing the eigenvector σ of step (4-1) according to the following loss functionkAnd mukAnd the reconstructed data of step (4-3)
Figure FDA0002385908630000039
Calculating the locally adaptive normalized local moving window data of step (3-3)
Figure FDA00023859086300000310
Error of (2)
Figure FDA00023859086300000311
Figure FDA00023859086300000312
Figure FDA00023859086300000313
I.e. the loss function of the variational automatic encoder, the loss function
Figure FDA00023859086300000314
Including reconstruction losses
Figure FDA00023859086300000315
And KL divergence loss
Figure FDA00023859086300000316
λ is the weighting factor of KL divergence loss versus reconstruction loss, 103≤λ≤106The loss of the two parts is calculated as follows, wherein j represents the variable serial number of the chemical process, and j is more than or equal to 1 and less than or equal to m:
Figure FDA00023859086300000317
Figure FDA00023859086300000318
(4-5) repeating the step (4-1) to the step (4-4), and sequentially combining the training sets T in the step (3-4)trainEach data of
Figure FDA00023859086300000319
Inputting the variational automatic encoder to carry out error calculation, and training the variational automatic encoder through an error back propagation algorithm to obtain a trained variational automatic encoder;
(5) utilizing the trained variational automatic encoder obtained in the step (4) and the verification set T obtained in the step (3-7)validBy estimating the verification set TvalidThe abnormal score confidence interval of the variable automatic encoder is used for obtaining a monitoring threshold eta when the variable automatic encoder is used for a fault detection task, and the method specifically comprises the following steps:
(5-1) local moving window data for local adaptive normalization of step (3-6)
Figure FDA00023859086300000320
Mapping the input of the variational automatic encoder trained in the step (4) to obtain
Figure FDA00023859086300000321
Feature vector σ ofpAnd mupFeature vector σpAnd mupThere are l values, respectively, where l represents the dimension of the feature vector:
Figure FDA0002385908630000041
Figure FDA0002385908630000042
(5-2) Using the feature vector σ of step (5-1)pAnd mupTo, for
Figure FDA0002385908630000043
Carrying out reparameterization to obtain
Figure FDA0002385908630000044
Is hiddenFeature vector hp,hpIncludes l values:
hp=μpp⊙∈
where e is normally distributed from the standard
Figure FDA00023859086300000421
A random sample results,. indicates multiplication of corresponding elements of the vector;
(5-3) combining the hidden feature vector h of the step (5-2)pAs the input of the decoder in the variational automatic encoder trained in the step (4), reconstructing to obtain the result of the step (3-6)
Figure FDA0002385908630000045
Reconstructed data with the same dimensionality
Figure FDA0002385908630000046
Figure FDA0002385908630000047
There are m rows and t columns of data:
Figure FDA0002385908630000048
(5-4) utilizing the feature vector σ of step (5-1) according to the following abnormality score calculation formulapAnd mupAnd the reconstructed data of step (5-3)
Figure FDA0002385908630000049
Calculating the locally adaptive normalized local moving window data of the step (3-6)
Figure FDA00023859086300000410
Is abnormal score of
Figure FDA00023859086300000420
Figure FDA00023859086300000412
Abnormal score
Figure FDA00023859086300000413
Including reconstruction losses
Figure FDA00023859086300000414
And KL divergence loss
Figure FDA00023859086300000415
λ is a weighting coefficient of KL divergence loss with respect to reconstruction loss, the same as λ of step (4-4); the loss of the two parts is calculated as follows, wherein j represents the variable serial number of the chemical process, j is more than or equal to 1 and less than or equal to m:
Figure FDA00023859086300000416
Figure FDA00023859086300000417
(5-5) repeating the steps (5-1) to (5-4), and sequentially adding the verification sets T of the steps (3-7)validEach data of
Figure FDA00023859086300000418
Input variable automatic encoder for calculating abnormal score
Figure FDA00023859086300000419
Get the verification set TvalidIs abnormal score data set Svalid
(5-6) abnormal score data set SvalidObtaining abnormal score data set S according to normal distributionvalidThe abnormal fraction with the normal distribution confidence coefficient alpha is used as the monitoring threshold eta of the chemical system, and the alpha is more than or equal to 99 percent and less than or equal to 99.99 percent;
(6) And (3) carrying out online fault detection on the process data of the chemical system under different working conditions by using the variational automatic encoder trained in the step (4) and the monitoring threshold eta obtained in the step (5), wherein the method comprises the following steps:
(6-1) collecting process data from a real-time database of the chemical system at the current detection moment q, and selecting local moving window data w with a time window t from the time to the frontq,wqThere are m rows and t columns of data, where t is the time window in step (3-2):
Figure FDA0002385908630000051
using local moving window data wqCalculating wqAverage value of m variables in (1) to obtain mean (w)q),mean(wq) Comprises m numbers;
(6-2) Using gmstd (D) in step (3-1)train) And mean (w) in step (6-1)q) For the local moving window data w of step (6-1)qPerforming local adaptive normalization to wqApproximately converting m variables of the data into standard normal distribution to obtain local self-adaptive standardized local moving window data
Figure FDA0002385908630000052
Figure FDA0002385908630000053
(6-3) local moving window data for local adaptive normalization of step (6-2)
Figure FDA0002385908630000054
Mapping the input of the encoder in the variational automatic encoder trained in the step (4) to obtain
Figure FDA0002385908630000055
Feature vector σ ofqAnd muqThe two feature vectors have 1 value respectively, l represents the dimension of the feature vector and has the same size as l in the step (4-1):
Figure FDA0002385908630000056
Figure FDA0002385908630000057
(6-4) Using the feature vector σ of step (6-3)qAnd muqCarrying out reparameterization to obtain
Figure FDA0002385908630000058
Hidden feature vector h ofq,hqIncludes l values:
hq=μqq⊙∈
where e is normally distributed from the standard
Figure FDA0002385908630000059
A random sample results,. indicates multiplication of corresponding elements of the vector;
(6-5) hiding the feature vector h of the step (6-4)qAs the input of the decoder in the variational automatic encoder trained in the step (4), reconstructing to obtain the result corresponding to the step (6-2)
Figure FDA00023859086300000510
Reconstructed data with the same dimensionality
Figure FDA00023859086300000513
There are m rows and t columns of data:
Figure FDA00023859086300000512
(6-6) utilizing the feature vector σ of step (6-3) according to the following abnormality score calculation formulaqAnd vqAnd the reconstructed data of step (6-5)
Figure FDA0002385908630000061
Calculating the locally adaptive normalized local moving window data of step (6-2)
Figure FDA0002385908630000062
Is abnormal score of
Figure FDA0002385908630000063
Figure FDA0002385908630000064
Abnormal score
Figure FDA0002385908630000065
Including reconstruction losses
Figure FDA0002385908630000066
And KL divergence loss
Figure FDA0002385908630000067
λ is a weighting coefficient of KL divergence loss with respect to reconstruction loss, the same as λ of step (4-4); the loss of the two parts is calculated as follows, wherein j represents the variable serial number of the chemical process, j is more than or equal to 1 and less than or equal to m:
Figure FDA0002385908630000068
Figure FDA0002385908630000069
(6-7) scoring the abnormality of step (6-6)
Figure FDA00023859086300000610
Comparing with the monitoring threshold eta obtained in the step (5), if so
Figure FDA00023859086300000611
The current chemical system is in a normal operation state, the step (6-1) is returned to continue monitoring the online real-time data, and if the online real-time data is not monitored, the chemical system is in a normal operation state
Figure FDA00023859086300000612
The system fault of the current chemical system is indicated, and fault warning is sent out, so that the multi-working-condition fault detection of the chemical system based on local adaptive standardization is realized.
CN202010098141.7A 2020-02-18 2020-02-18 Chemical system multi-working-condition fault detection method based on local adaptive standardization Active CN111367253B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010098141.7A CN111367253B (en) 2020-02-18 2020-02-18 Chemical system multi-working-condition fault detection method based on local adaptive standardization

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010098141.7A CN111367253B (en) 2020-02-18 2020-02-18 Chemical system multi-working-condition fault detection method based on local adaptive standardization

Publications (2)

Publication Number Publication Date
CN111367253A CN111367253A (en) 2020-07-03
CN111367253B true CN111367253B (en) 2021-03-16

Family

ID=71208052

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010098141.7A Active CN111367253B (en) 2020-02-18 2020-02-18 Chemical system multi-working-condition fault detection method based on local adaptive standardization

Country Status (1)

Country Link
CN (1) CN111367253B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112857806B (en) * 2021-03-13 2022-05-31 宁波大学科学技术学院 Bearing fault detection method based on moving window time domain feature extraction
CN113254978B (en) * 2021-06-24 2021-09-21 国能大渡河大数据服务有限公司 Data security management system based on machine learning
CN115965915B (en) * 2022-11-01 2023-09-08 哈尔滨市科佳通用机电股份有限公司 Railway wagon connecting pull rod breaking fault identification method and system based on deep learning

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100532892C (en) * 2005-01-25 2009-08-26 格特拉格传动机构和齿轮工厂赫尔曼·哈根迈尔有限公司&两合公司 Automatic multi-step variable speed transmission and method for controlling the transmission to detect failure
CN103576594A (en) * 2013-11-11 2014-02-12 浙江工业大学 Intermittent process online monitoring method based on tensor overall-local preserving projection
CN103793601A (en) * 2014-01-20 2014-05-14 广东电网公司电力科学研究院 Turbine set online fault early warning method based on abnormality searching and combination forecasting
CN105955241A (en) * 2016-06-03 2016-09-21 北京科技大学 Quality fault locating method based on federated data driven production process
CN109299507A (en) * 2018-08-20 2019-02-01 浙江大学 A kind of different degrees of Analysis on Fault Diagnosis method under the coal pulverizer multi-state based on dynamic modeling
CN109726524A (en) * 2019-03-01 2019-05-07 哈尔滨理工大学 A kind of rolling bearing remaining life prediction technique based on CNN and LSTM

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07295819A (en) * 1994-04-27 1995-11-10 Fuji Heavy Ind Ltd Fault diagnostic device
US6757665B1 (en) * 1999-09-28 2004-06-29 Rockwell Automation Technologies, Inc. Detection of pump cavitation/blockage and seal failure via current signature analysis
US9366451B2 (en) * 2010-12-24 2016-06-14 Commonwealth Scientific And Industrial Research Organisation System and method for the detection of faults in a multi-variable system utilizing both a model for normal operation and a model for faulty operation
CN105258947B (en) * 2015-11-06 2017-06-23 北京航空航天大学 A kind of Fault Diagnosis of Roller Bearings under operating mode disturbed conditions based on compressed sensing
EP3662331A4 (en) * 2017-08-02 2021-04-28 Strong Force Iot Portfolio 2016, LLC Methods and systems for detection in an industrial internet of things data collection environment with large data sets
CN109555566B (en) * 2018-12-20 2021-04-20 西安交通大学 Steam turbine rotor fault diagnosis method based on LSTM
CN110210381B (en) * 2019-05-30 2023-08-25 盐城工学院 Domain separation self-adaptive one-dimensional convolutional neural network intelligent fault diagnosis method

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100532892C (en) * 2005-01-25 2009-08-26 格特拉格传动机构和齿轮工厂赫尔曼·哈根迈尔有限公司&两合公司 Automatic multi-step variable speed transmission and method for controlling the transmission to detect failure
CN103576594A (en) * 2013-11-11 2014-02-12 浙江工业大学 Intermittent process online monitoring method based on tensor overall-local preserving projection
CN103793601A (en) * 2014-01-20 2014-05-14 广东电网公司电力科学研究院 Turbine set online fault early warning method based on abnormality searching and combination forecasting
CN105955241A (en) * 2016-06-03 2016-09-21 北京科技大学 Quality fault locating method based on federated data driven production process
CN109299507A (en) * 2018-08-20 2019-02-01 浙江大学 A kind of different degrees of Analysis on Fault Diagnosis method under the coal pulverizer multi-state based on dynamic modeling
CN109726524A (en) * 2019-03-01 2019-05-07 哈尔滨理工大学 A kind of rolling bearing remaining life prediction technique based on CNN and LSTM

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
人工免疫系统在间歇化工过程故障诊断中的应用;戴一阳;《化工学报》;20090131;第60卷(第1期);第172-176页 *
基于Hopfield网络的时滞分析故障诊断策略;贺丁 等;《化工学报》;20130228;第64卷(第2期);第633-640页 *
基于深度学习的滚动轴承故障诊断方法研究;金余丰;《中国优秀硕士学位论文全文数据库 工程科技II辑》;20200215(第02期);第C029-157页 *

Also Published As

Publication number Publication date
CN111367253A (en) 2020-07-03

Similar Documents

Publication Publication Date Title
CN111367253B (en) Chemical system multi-working-condition fault detection method based on local adaptive standardization
CN109146246B (en) Fault detection method based on automatic encoder and Bayesian network
US11840998B2 (en) Hydraulic turbine cavitation acoustic signal identification method based on big data machine learning
CN110348150B (en) Fault detection method based on correlation probability model
CN110244692B (en) Chemical process micro-fault detection method
CN112904810B (en) Process industry nonlinear process monitoring method based on effective feature selection
CN112598172A (en) Wind turbine bearing temperature early warning method
Han et al. Fault detection of pneumatic control valves based on canonical variate analysis
CN115994337B (en) Method and device for detecting minor faults in non-stationary process of hot continuous rolling of strip steel
CN116292367B (en) Power plant fan system abnormal condition detection method based on one-dimensional convolution
CN108830006B (en) Linear-nonlinear industrial process fault detection method based on linear evaluation factor
CN111324110A (en) Fermentation process fault monitoring method based on multiple shrinkage automatic encoders
Li et al. Canonical variate residuals-based contribution map for slowly evolving faults
CN112132394A (en) Power plant circulating water pump prediction state assessment method and system
Shi et al. Intelligent fault diagnosis of rolling mills based on dual attention-guided deep learning method under imbalanced data conditions
CN114611067A (en) Chemical process slow-change fault detection method based on typical variable dissimilarity analysis
CN111241629B (en) Intelligent prediction method for performance change trend of hydraulic pump of airplane based on data driving
CN112633331A (en) Self-encoder-based thermal power plant auxiliary machine health state assessment method
CN116878902A (en) Ship diesel engine fault early warning method and system based on deep learning CNN-BiLSTM
CN112131516A (en) Anomaly detection method based on feature weight mixed naive Bayes model
CN113253682B (en) Nonlinear chemical process fault detection method
CN116522993A (en) Chemical process fault detection method based on countermeasure self-coding network
Yang et al. A local Mahalanobis distance analysis based methodology for incipient fault diagnosis
CN114200914A (en) MW-OCCA-based quality-related early fault detection method
CN114417704A (en) Wind turbine generator health assessment method based on improved stack type self-coding

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant