CN116125922A - Complex industrial process monitoring method and system based on parallel dictionary learning - Google Patents

Complex industrial process monitoring method and system based on parallel dictionary learning Download PDF

Info

Publication number
CN116125922A
CN116125922A CN202310023849.XA CN202310023849A CN116125922A CN 116125922 A CN116125922 A CN 116125922A CN 202310023849 A CN202310023849 A CN 202310023849A CN 116125922 A CN116125922 A CN 116125922A
Authority
CN
China
Prior art keywords
linear
monitoring
variable
variables
subset
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310023849.XA
Other languages
Chinese (zh)
Inventor
阳春华
张娇娇
黄科科
吴德浩
李勇刚
朱红求
桂卫华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Central South University
Original Assignee
Central South University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Central South University filed Critical Central South University
Priority to CN202310023849.XA priority Critical patent/CN116125922A/en
Publication of CN116125922A publication Critical patent/CN116125922A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/418Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM]
    • G05B19/41875Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM] characterised by quality surveillance of production
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems
    • G05B2219/32Operator till task planning
    • G05B2219/32252Scheduling production, machining, job shop
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Manufacturing & Machinery (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Testing And Monitoring For Control Systems (AREA)
  • Testing Or Calibration Of Command Recording Devices (AREA)

Abstract

The invention discloses a complex industrial process monitoring method and system based on parallel dictionary learning, wherein the method comprises the following steps: extracting a collinearity variable set of the industrial process according to the variance expansion factors of all the monitoring variables; dividing a linear variable subset from the co-linear variable set based on a linear maximization method, and forming a nonlinear variable subset by the residual monitoring variables; establishing a linear monitoring model based on dictionary learning for each linear variable subset, and establishing a nonlinear monitoring model based on kernel dictionary learning for the nonlinear variable subset; calculating a reconstruction error based on the established linear and nonlinear monitoring models, and calculating each error control limit; acquiring real-time monitoring sample data of a complex industrial process on line, and calculating reconstruction errors of variable subsets; based on each reconstruction error and control limit, the global index of the current monitoring sample data is fused and calculated, and whether the current industrial process fails or not is judged by the global index. The invention can realize the monitoring of complex industrial processes with coexisting linearity and nonlinearity.

Description

Complex industrial process monitoring method and system based on parallel dictionary learning
Technical Field
The invention belongs to the field of industrial process monitoring, and particularly relates to a complex industrial process monitoring method and system based on parallel dictionary learning.
Background
With the increasing level of integration and intelligence of modern industrial systems, any minor failure in the industrial process may cause immeasurable disruption to the production campaign. Therefore, in order to ensure safe and stable operation of industrial systems, a higher level of requirements are also put on process monitoring. By adopting a proper process monitoring method, operators can acquire the current running state of the industrial process in real time, and once faults are monitored, an alarm prompt is triggered, so that the operators can react and process in time, thereby avoiding causing larger economic loss and eliminating potential safety hazards in time.
Data-driven methods have demonstrated great potential in the field of industrial process monitoring, and have been widely studied. The multivariate statistical process monitoring method (MSPM) is one of the most typical data driven methods that capture important features of data mainly by way of dimension reduction, with PCA, PLS, ICA, etc. being widely used. With the development of artificial intelligence technology, some machine learning methods are also applied to the field of process monitoring, such as Support Vector Machines (SVMs) and random forest methods.
Dictionary learning is a common machine learning method, and the core idea is to directly learn from original data, train a group of dictionary atoms reflecting essential characteristics of the data, and reconstruct the data by linear combination of a small number of atoms in a sparse way. Dictionary learning is widely studied and applied in the field of industrial process monitoring by virtue of excellent data reconstruction capability. Meanwhile, in order to adapt to different requirements of industrial actual scenes, a series of improved dictionary learning methods are proposed. Huang proposes distributed dictionary learning for efficient monitoring of complex, high-dimensional modern industrial systems. Yang has proposed a robust dictionary learning method to monitor performance of industrial processes with multiple modalities.
Because of the complex mechanism process of an industrial system, measurement data often has nonlinear characteristics, which brings challenges to process monitoring tasks, and a dictionary learning method based on linear reconstruction assumption is difficult to fully capture the nonlinear characteristics of the data. To solve this non-linearity problem, nguyen proposes kernel dictionary learning based on kernel methods. The kernel dictionary learning maps nonlinear data to a high-dimensional feature space, makes it linearly separable, and then seeks a linear solution in the high-dimensional feature space. However, since kernel functions and parameters determine the nature of feature space, kernel dictionary learning typically requires the selection of appropriate parameters, which is sometimes difficult. Furthermore, after the original data is projected into the high-dimensional feature space, not only is the original data difficult to express and interpret mathematically, but also the linear feature variation is easy to ignore.
However, as modern industrial processes become more integrated and complex, it has become common for complex correlations between system process variables to exist both linearly and non-linearly, and creating only a single linear monitoring model or a non-linear monitoring model is not the most ideal option. The dictionary learning method is based on linear reconstruction assumption, a nonlinear system cannot be effectively monitored, and the kernel dictionary learning is sensitive to parameter selection and is easy to ignore the change of linear characteristics. Since the linear and nonlinear characteristics have the same significance for process monitoring, the linear and nonlinear characteristics of the system should be fully considered when the monitoring method is selected. However, most of the current researches only select a single linear or nonlinear monitoring method when a process monitoring model is established, which cannot fully extract data characteristics and achieve the optimal monitoring effect.
Disclosure of Invention
The invention provides a complex industrial process monitoring method based on parallel dictionary learning, which realizes the monitoring of complex industrial processes with linear and nonlinear coexistence.
In order to achieve the technical purpose, the invention adopts the following technical scheme:
a complex industrial process monitoring method based on parallel dictionary learning, comprising:
extracting a collinearity variable set from the monitored variable set according to variance expansion factors of all the monitored variables of the industrial process;
identifying and dividing a linear variable subset from the co-linear variable set based on a linear maximization method, wherein the rest monitoring variables in the monitoring variable set form a nonlinear variable subset;
establishing a linear monitoring model based on dictionary learning for each linear variable subset, and establishing a nonlinear monitoring model based on kernel dictionary learning for the nonlinear variable subset;
based on the established linear and nonlinear monitoring models, calculating the reconstruction errors of training data in the variable subset parts, and calculating the control limits of the linear and nonlinear monitoring models based on a kernel density estimation method;
acquiring real-time monitoring sample data of a complex industrial process on line, and calculating reconstruction errors of variable subsets by using each linear and nonlinear monitoring model; based on each reconstruction error and control limit, calculating the normal likelihood probability, the abnormal likelihood probability and the abnormal posterior probability of each variable subset part of the current monitoring sample data, fusing to obtain a global index of the current monitoring sample data, and judging whether the current complex industrial process is faulty or not by the global index.
Further, the calculation method of the variance expansion factor of each monitoring variable comprises the following steps:
Figure BDA0004043804190000021
wherein ,VIFj Representing any monitored variable x j And the variance expansion factor of (2), m is the number of the monitored variables, R j 2 To monitor variable x j Determining coefficients when regressing on all the remaining monitored variables.
Further, the method for extracting the co-linear variable set in the monitored variable set comprises the following steps: the monitored variables whose variance expansion factors are greater than a given variance expansion factor threshold are combined into a set of co-linear variables.
Further, the linear variable subset is identified and divided from the co-linear variable set based on the linear maximization method, specifically:
(1) Representing the collinearity set as X P =[x 1 ,x 2 ,...,x P ]∈R N×P P is the number of monitoring variables of which the variance expansion factor exceeds a threshold value, and x max Is co-linear set X P Has the greatest variance expansion factor;
(2) Maximizing x max And co-linearity set X P Correlation between linear combinations of the remaining monitored variables:
Figure BDA0004043804190000031
wherein ,XP-1 ∈R N×(P-1) Is co-linear set X P Removing x from the medium max a.epsilon.R P-1 Is X P-1 Is set in the order of (1), the combined coefficient vector of D (x) max ) And D (X) P-1 a) Variance of variables, cov (x max ,X p-1 a) Is covariance matrix;
(3) Sigma-shaped 11 ,Σ 12 ,Σ 21 ,Σ 22 Respectively cov (x) max ,x max ),cov(x max ,X p-1 )cov(X P-1 ,x max ),cov(X p-1 ,X p-1 ) The method comprises the following steps of:
Figure BDA0004043804190000032
(4) Substituting the formula (3) into the optimization problem (2), fixing denominators, and converting the optimization problem (2) into:
Figure BDA0004043804190000033
(5) Converting the optimization problem (4) into a eigenvalue decomposition problem by a Lagrangian multiplier method, and obtaining:
Σ 22 -1 Σ 21 Σ 11 -1 Σ 12 a=λ 2 a (5)
(6) Based on the number of non-zero value feature roots being min (1, P-1), solving the feature vector to obtain an optimal coefficient vector a;
(7) Screening all coefficients greater than a given weight threshold from the optimal coefficient vector, wherein the corresponding variable constitution corresponds to x max A subset of the variables that are linearly related, denoted a subset of linear variables;
(8) Extracting a subset of linear variables from the collinearity set X P If at least two variables still exist in the linear variable subset, repeating the steps to extract a new linear variable subset, otherwise, indicating that the linear variable subset is divided.
Further, a linear monitoring model is built for each linear variable subset based on dictionary learning, specifically:
let the raw data of the industrial process based on the monitoring variable set be y= [ Y ] 1 ,y 2 ,...,y N ]∈R m×N M is the number of variables in the monitored variable set, namely the dimension of the sample, and N is the number of samples; the original data block based on the nth linear variable subset is
Figure BDA0004043804190000034
m n The number of variables in the nth linear variable subset; then based on the original data sub-block Y n The dictionary learning optimization problem of establishing a linear monitoring model is expressed as follows:
Figure BDA0004043804190000041
in the formula, D represents a dictionary to be learned,
Figure BDA0004043804190000042
for the nth original data sub-block Y n The dictionary obtained by the training is composed of K dictionary atoms d 1 n ,...,d K n Composition; x is X n =[x 1 n ,x 2 n ,...,x N n ]∈R K×N For the corresponding sparse coding matrix, T is the sparse constraint,limiting sparse coding vector x i n The non-zero element in (2) is less than T; n=1, …, l, l is the number of linear variable subsets obtained by division;
and establishing a nonlinear monitoring model for the nonlinear variable subset based on kernel dictionary learning, wherein the nonlinear monitoring model comprises the following concrete steps:
let the original data blocks of the linear variable subset be expressed as
Figure BDA0004043804190000043
m l+1 The number of variables in the nonlinear variable subset; then based on the original data sub-block Y l+1 The problem of the learning optimization of the nuclear dictionary for establishing the nonlinear monitoring model is expressed as follows:
Figure BDA0004043804190000044
where Φ (·) is the mapping function.
Further, adopting alternate optimization to solve sparse coding matrixes and dictionaries in the optimization problem (6); when solving the optimization problem (7), converting the optimization problem (7) into:
Figure BDA0004043804190000045
wherein A is a pseudo dictionary, and replaces an original dictionary D together with a base dictionary phi (Y);
and then converting the optimization problem (8) into the following according to the relation between the matrix norms and the traces:
Figure BDA0004043804190000046
wherein tr (. Cndot.) represents the trace of the matrix, K (Y) l+1 ,Y l+1 )∈R N×N Is a kernel matrix whose elements k (y i ,y j )=Φ(y i ) T Φ(y j );
The sparse coding matrix X and the pseudo dictionary a in the optimization problem (8) are finally updated alternately.
Further, the calculation method of the normal likelihood probability, the abnormal likelihood probability and the abnormal posterior probability comprises the following steps:
Figure BDA0004043804190000047
Figure BDA0004043804190000048
Figure BDA0004043804190000051
wherein ,
Figure BDA0004043804190000052
for monitoring sample data y in real time new Part of the data belonging to the nth variable subset; />
Figure BDA0004043804190000053
and />
Figure BDA0004043804190000054
Respectively->
Figure BDA0004043804190000055
Where N and F represent normal and abnormal, respectively;
Figure BDA0004043804190000056
is->
Figure BDA0004043804190000057
Reconstruction errors, TH n A control limit for the nth variable subset; />
Figure BDA0004043804190000058
Is->
Figure BDA0004043804190000059
Is an anomaly posterior probability of (2); p (P) n(F) and Pn (N) represent the abnormal and normal prior probabilities of the sample, respectively, and they are determined by the confidence level alpha, namely P n (N)=1-α,P n (F)=α;
The global index of the current monitoring sample data obtained by fusion is as follows:
Figure BDA00040438041900000510
wherein ,GFInew For monitoring sample data y in real time new Global index of (2); here the likelihood of anomalies over subsets of variables
Figure BDA00040438041900000511
All variable subsets n (n=1, 2, the posterior probability of abnormalities on l, l+1)
Figure BDA00040438041900000512
Fusion to establish global monitoring index GFI new
The complex industrial process monitoring system based on parallel dictionary learning comprises a memory and a processor, wherein a computer program is stored in the memory, and the complex industrial process monitoring system based on parallel dictionary learning is characterized in that the processor realizes the complex industrial process monitoring method based on parallel dictionary learning according to any one of the technical schemes when the computer program is executed by the processor.
Advantageous effects
The complex industrial process monitoring method based on parallel dictionary learning can be applied to the industrial process with complex correlation of monitoring process variables. The method overcomes the blindness of the traditional selection monitoring method and the limitation of constructing a single monitoring model, namely, the method can divide a variable set by exploring the correlation relation of the system process variables, and select a proper linear or nonlinear monitoring method to establish a model in parallel according to the correlation of the variables, thereby effectively improving the fault monitoring performance of the complex industrial process.
Drawings
FIG. 1 is a general framework of the method of the present invention;
FIG. 2 is a comparison of the evaluation index of the variable division results of the present invention;
fig. 3 shows the effects of the parallel dictionary learning method according to the present invention compared with the conventional dictionary learning and the nuclear dictionary learning methods, wherein (a) - (c) are the monitoring effects of the conventional dictionary learning and the nuclear dictionary learning, respectively, and (d) - (f) are the monitoring effects of the three methods in the second scenario.
Detailed Description
The following describes in detail the embodiments of the present invention, which are developed based on the technical solution of the present invention, and provide detailed embodiments and specific operation procedures, and further explain the technical solution of the present invention.
The invention provides a complex industrial process monitoring method based on parallel dictionary learning, which comprises two stages of off-line modeling and on-line monitoring. In the offline learning stage, a linear maximization method for effectively identifying a subset of linear variables is adopted, the collinearity of the variables is evaluated by using a variance expansion factor, and the division of linear and nonlinear subsets is realized by maximizing the linear correlation between one variable and the rest of variables. And then, according to the characteristics of each variable subset, respectively selecting dictionary learning and kernel dictionary learning methods to establish a monitoring model in parallel, and fully extracting the linear and nonlinear characteristics of the data. Finally, the error control limit under each sub-monitoring model is obtained through analysis of the training data. In the online monitoring stage, the system acquires online data of the industrial process, calculates local monitoring statistics of all subsets of the test samples by utilizing a monitoring model trained in the offline stage, and establishes global monitoring statistics by Bayesian inference fusion, so that the real-time state of the industrial process is estimated, and the stable and healthy operation of the industrial process is ensured. The method of the invention realizes that a proper monitoring method can be selected according to the linear and nonlinear characteristics of the system by exploring the complex correlation among the process variables of the system, and simultaneously overcomes the limitation of the traditional construction of a single linear or nonlinear monitoring model, thereby having superiority when monitoring the industrial process with the complex correlation among the process variables.
1. Extraction of a set of collinearity variables based on a variance-expansion factor
At the off-line initial stage, normal training data Y of the complex industrial process is collected 1 =[y 1 ,y 2 ,…,y N ]∈R m×N M and N represent the dimension and number of samples, respectively. First, to evaluate the degree of collinearity between each process variable and the remaining variables, for all variables x= [ X ] 1 ,x 2 ,...,x m ]∈R N×m Variance expansion factor (VIF) was calculated:
Figure BDA0004043804190000061
wherein ,VIFj Representing the variance expansion factor of arbitrary variables, R j 2 As variable x j (j=1, 2, …, m) determination coefficients when regressing on all the remaining variables.
VIF j Reflecting the variable x j The greater the value of the degree of collinearity with the remaining variables, the more significant the collinearity. All VIFs are scaled by a variance expansion factor threshold θ j The variables > θ form a set of collinearity.
2. Identifying and partitioning subsets of linear variables based on a linear maximization method, and constructing subsets of nonlinear variables
In order to explore the correlation between process variables, the present invention proposes a linear maximization method to identify and separate subsets of linear variables. The core idea is to identify a subset of linear variables related to one variable based on the combination coefficients by maximizing the correlation between the linear combination of the variable and the remaining variables.
Let the collinearity set be X P =[x 1 ,x 2 ,...,x P ]∈R N×P P represents the number of variables for which all variance expansion factors exceed the threshold. X is x max As a variable with the largest variance expansion factor in the collinearity set, this indicates x max The most number of linear subset variables or linesThe correlation is the strongest. To identify x max The linear variable subset to which is belongs, maximizing x max Correlation between linear combinations of the remaining variables:
Figure BDA0004043804190000071
wherein ,XP-1 ∈R N×P-1 Is co-linear set X P Removing x from the medium max a.epsilon.R P-1 Is a coefficient vector, D (x max ) And D (X) P-1 a) Variance of variables, cov (x max ,X p-1 a) Is a covariance matrix. Sigma-shaped 11 ,Σ 12 ,Σ 21 ,Σ 22 Respectively cov (x) max ,x max ),cov(x max ,X p-1 )cov(X P-1 ,x max ),cov(X p-1 ,X p-1 ) The method is easy to obtain:
Figure BDA0004043804190000072
substituting (3) into (2), fixing denominator, and easily converting the optimization problem (2) into:
Figure BDA0004043804190000073
by the lagrangian multiplier method, (4) can be converted into eigenvalue decomposition problem and obtained:
Σ 22 -1 Σ 21 Σ 11 -1 Σ 12 a=λ 2 a (5)
here, the number of non-zero value feature roots is min (1, p-1), and then the optimal coefficient vector a can be obtained by solving the feature vector.
The optimization problem (2) is essentially to maximize x by looking for the optimal coefficient a max Linear dependence on the remaining variables. If x is to be made max Maximum linear correlation with the remaining variables, then in the combined systemIn the number a, x is max The coefficient weights of the linearly related variables should be as large as possible. Therefore, by setting the combination coefficient weight threshold value beta at this time, the variable set X can be obtained from the combination coefficient weight P-1 Middle division and x max A subset of the variables that are linearly related.
To further identify and partition all remaining linear variable subsets in the collinearity set, the partitioned linear variable subsets are removed from the collinearity set. Then, the collinearity of the collinearity centralized variable is estimated again by the variance expansion factor, if at least two variables VIF still exist j And > θ, indicating that there are still other different subsets of linear variables, the iterative linear maximization method continues to identify and separate. Otherwise, indicating that all the linear variable sets are divided, and forming a nonlinear variable set by the residual variables except all the divided linear variable subsets in the original variable set.
3. Establishing a monitoring model
3.1 Linear monitoring model establishment based on dictionary learning
Dictionary learning learns and trains a series of atoms containing data features from original data, and the reconstruction expression of the original data is realized through linear combination of a small number of dictionary atoms. Since the data points are distributed on one linear manifold in each linear variable set, the original data can be well reconstructed through a dictionary learning method. Thus, on each linear subset of the partitions, a linear monitoring model is built based on dictionary learning.
Assuming that all process variables are ultimately divided into l subsets of linear variables and 1 set of nonlinear variables, then the raw data y= [ Y ] 1 ,y 2 ,...,y N ]∈R m×N Correspondingly decomposed into l+1 sub-blocks, i.e. y= [ Y ] 1 T ,Y 2 T ,...,Y l T ,Y l+1 T ] T. wherein ,
Figure BDA0004043804190000081
represents a first linear sub-block, and contains m 1 A linear variable and the number of samples is still N.
Figure BDA0004043804190000082
Representing non-linear sub-blocks therein. Accordingly, the nth linear sub-block contains m in total n The variables, taken together, are m = Σ n m n And n=1, 2, once again, l, l+1.
At n (n=1, 2.,), l) building a monitoring model on the linear subblocks, dictionary learning optimization problems can be expressed as:
Figure BDA0004043804190000083
here the number of the elements is the number,
Figure BDA0004043804190000084
the dictionary trained under the nth sub-block consists of K dictionary atoms, X n =[x 1 n ,x 2 n ,...,x N n ]∈R K×N For the corresponding sparse coding matrix, T is a sparse constraint, and the vector x is limited i n Is less than T. The problem (6) can be solved by alternating optimization, and consists of two parts, namely sparse coding solution and dictionary updating.
First, fix D n Solving corresponding sparse codes X n
Figure BDA0004043804190000085
Here, y i n Is Y n Is defined by the i-th column vector (i=1, 2, n.), x i n Is y i n In dictionary D n And (5) sparse coding. The optimization problem (7) can be solved directly by OMP algorithm.
Once solved to obtain X n In turn fix X n Update D sequentially column by column n Dictionary atoms of (a):
Figure BDA0004043804190000086
here, d k n For D n The kth dictionary atom, x i n Is X n Is the i-th column vector in (b). The optimization problem (8) can be solved by the K-SVD algorithm. When D is n After updating, updating again to solve sparse coding X n And repeatedly and alternately optimizing the method until the iteration stop condition is met.
3.2 building nonlinear monitoring model based on kernel dictionary learning
On nonlinear sub-blocks, there is a serious nonlinear relation among variables, and original data is difficult to be directly expressed by dictionary atom linear reconstruction in an original space. Nuclear dictionary learning is a non-linear extension of dictionary learning based on the kernel, and linear solutions can be sought in the feature space after mapping non-linear data into the high-dimensional feature space. Therefore, a monitoring model is built on the nonlinear sub-blocks through kernel dictionary learning, and the optimization problem is expressed as follows:
Figure BDA0004043804190000091
wherein Φ (·) is a mapping function: r is R m →R n (n > m), n and m being the dimensions of the feature space and the original space, respectively, phi (Y l+1 )=[Φ(y 1 l+1 ),...,Φ(y N l+1 )]∈R n×N To be mapped to the original data in the feature space. Since the feature space dimension n is often unknown and may be infinite, conventional K-SVD cannot solve directly for (9), thus converting (9) to:
Figure BDA0004043804190000092
here, the original dictionary is replaced by the base dictionary Φ (Y) and the pseudo dictionary a. Based on the relationship of matrix norms to traces, (10) can be translated into:
Figure BDA0004043804190000093
here, tr (. Cndot.) represents the trace of the matrix, K (Y) l+1 ,Y l+1 )∈R N×N Is a kernel matrix whose elements k (y i ,y j )=Φ(y i ) T Φ(y j )。
Similar to the dictionary learning optimization solution process, the problem (11) needs to be solved optimally by updating the sparse codes and the dictionary alternately. Firstly, fixing a pseudo dictionary A, and updating a sparse coding matrix X column by column l+1 . According to (11), it is possible to obtain:
Figure BDA0004043804190000094
here, the problem (12) can be solved directly by the KOMP algorithm.
After obtaining a sparse matrix X l+1 Thereafter, X is fixed in reverse l+1 Updating the dictionary a column by column:
Figure BDA0004043804190000095
wherein ,ak Is the kth column atom, x in pseudo dictionary A T k Is X l+1 In row k. Optimization problem (13) can be solved by applying the method to a k The derivative solution can also be solved by adopting a KK-SVD algorithm.
4. Calculating an error control limit
After the monitoring model on each sub-block is established and trained, the reconstruction error of the training data on each sub-block is obtained. For any training sample y i =[y i,1 T ,y i,2 T ,...,y i,l T ,y i,l+1 T ] T ∈R m I=1, 2, N, at N (n=1, 2.,), l) reconstruction errors under linear sub-blocks are expressed as:
Figure BDA0004043804190000101
here, x i n For sample y i n In dictionary D n And solving the corresponding sparse codes through an OMP algorithm.
Sample y i The reconstruction error under the nonlinear subblocks is:
Figure BDA0004043804190000102
here, x i l+1 For sample nonlinear subblock y i l+1 Is solved by means of a KOMP algorithm.
Finally, calculating the error control limit of the sample under each sub-block based on a Kernel Density Estimation (KDE) method. KDE is one of the non-parametric methods of estimating an unknown density function in probability theory, whose function is expressed as:
Figure BDA0004043804190000103
here, r= [ R 1 n ,R 2 n ,...,R N n ]N=1, 2,.. is the reconstruction error of the training data on each sub-block. h is the bandwidth, N is the number of training samples, f (R) is the probability density function, and K (x) is the non-negative kernel function, which satisfies the following condition:
Figure BDA0004043804190000104
there are many kinds of kernel functions, and all kernel functions in the present invention are gaussian kernel functions which are most widely used. The error control limit TH of each sub-block under the given confidence level can be obtained through KDE function n And a decision basis is provided for online testing.
5. On-line real-time monitoring of industrial processes
When a new sample y to be measured new To be temporary, the same decomposition is performed on the offline variable partition first. Assume thatThere are l linear and 1 nonlinear sub-blocks altogether, and the test sample is decomposed into y new =[y new,1 T ,y new,2 T ,...,y new,l T ,y new,l+1 T ]∈R m . Then, according to the local dictionary D learned in the off-line stage n And A, the test sample y can be calculated by the formulas (14) and (15) new Reconstruction error DER under each sub-block n ,n=1,2,...,l,l+1。
However, establishing block-level monitoring statistics alone is insufficient to monitor the entire process, so bayesian reasoning is introduced, the monitoring results on the sub-blocks are fused, and global monitoring statistics are established. Firstly, the reconstruction errors of the samples on each sub-block are respectively converted into normal likelihood probability and abnormal likelihood probability:
Figure BDA0004043804190000111
Figure BDA0004043804190000112
then, according to bayesian reasoning, the abnormal posterior probability of the samples in each sub-block can be obtained according to the following formula:
Figure BDA0004043804190000113
wherein F and N represent abnormality and normality, respectively, P n(F) and Pn (N) represents the normal and abnormal prior probabilities of the sample, respectively, and they are determined by the confidence level alpha. I.e. there is P n (N)=1-α,P n (F)=α。
After the normal likelihood probability, the abnormal likelihood probability and the abnormal posterior probability of the sample under all the sub-blocks are obtained, the monitoring results of the sample under each sub-block are fused according to the following steps to establish a global index GFI new
Figure BDA0004043804190000114
Here, the likelihood of abnormality
Figure BDA0004043804190000115
As weights, sub-blocks n (n=1, 2), abnormality posterior probability on l, l+1)>
Figure BDA0004043804190000116
Thereby establishing global index GFI new . The state of the sample can be based on the global indicator GFI new And confidence α assessment. If GFI new < alpha, can be considered as the sample y to be measured new A normal sample, and otherwise, a fault sample.
6. Data of detection and experimental verification
In order to verify the effectiveness of the method provided by the invention, a numerical simulation system is designed to simulate real industrial process data according to the improvement of a typical three-variable nonlinear system. From the process variable relationship, the system can be decomposed into x= [ X ] 1 ,X 2 ,X 3 ,X 4 ]Four subsets. Wherein the first three subsets X m (m=1, 2, 3) the variables within the same subset are linearly related, while in the fourth subset X 4 The intermediate variables have only nonlinear relationships. The simulation system is represented as follows:
Figure BDA0004043804190000117
Figure BDA0004043804190000121
here, the variable factor u 1 、u 2 and u3 Independent variables, e, respectively, subject to uniform distribution U (0.01,2) 1 1 ~e 4 4 A total of 13 independent noise variables obeying a gaussian distribution N (0,0.01), the variables in each subset being denoted as x n m (n=1, 2,3, m=1, 2,3, 4), n and m minutesRespectively denoted as variable dimension and variable subset category.
In order to fully explain the effectiveness of the method, two different test scenes are designed in total:
scene 1: a total of 500 test samples, starting from sample 201, in the linear variable subset X 1 Variable x in (a) 2 1 Introducing a sensor constant deviation fault of 0.6;
scene 2: a total of 500 test samples, starting from sample 201, in nonlinear variable subset X 4 Variable x in (a) 1 4 And a sensor constant deviation fault of 0.8 is introduced.
In order to quantify the accuracy of the variable division provided by the invention, a variable division evaluation index p is constructed as follows:
Figure BDA0004043804190000122
Figure BDA0004043804190000123
where m represents the number of partitioned subsets of linear variables, n represents the number of variables in all sets of linear variables, and when the number of variables m is smaller and the number of linear variables n included is greater,
Figure BDA0004043804190000124
will approach 1.c i (i=1, 2,., m) is the first principal contribution of each subset of linear variables, and accordingly, r is the average of the principal contributions over all the linear subsets. c i Reflecting the degree of linear correlation between the variables in each linear subset, c when the linear variable subsets are properly partitioned i Will approach 1 and correspondingly r will approach 1. Thus, in general, when the variable division is appropriate, the evaluation index p will approach 1.
Meanwhile, for the monitoring effect of the quantization method, two widely used monitoring indexes are adopted: false positive rate (FAR) and Fault Detection Rate (FDR) are defined as follows:
Figure BDA0004043804190000125
Figure BDA0004043804190000126
here, # J > J th |f=0 } and # { J > J th The # { f +.0 } represents the number of false alarms of the normal sample and the number of detected failure samples, respectively, and the # { f=0 } and # { f +.0 } represent the total number of the normal sample and the failure samples, respectively.
First, to illustrate the effectiveness of the linear maximization analysis method proposed by the present invention, the VIF of all variables calculated from equation (1) is: vif= [3349,11773,12132,2847,9879,10492,3393,10358,12386,5,90,5,82 ]] T. wherein
Variable x 3 3 With the maximum VIF value, the variable x is maximized according to equation (2) 3 3 With the remaining variables x other The linear correlation among the two is solved to obtain the linear combination under the optimal coefficient variable as follows:
Figure BDA0004043804190000131
wherein, is equal to x 3 3 X in the same linear variable set 1 3 and x2 3 The above statement can be verified because the coefficient weights are significantly larger than the other variables.
Then, the invention sets the VIF threshold θ to 10, and iteratively divides the variable set by a linear maximization method under the coefficient weight β=0.05-0.5, wherein the evaluation index p of each variable division result is shown in fig. 2.
When β=0.05, 0.1, and 0.15, the evaluation index p is equal and maximum, and the variable division result coincides with the actual result. At the same time, when β >0.2, there is substantially no longer a change, which demonstrates the effectiveness and robustness of the proposed variable partitioning method of the present invention.
After the weight threshold beta is selected to be 0.1, the iterative linear maximization method divides the linear variable set, monitoring experiments are respectively carried out under two designed scenes, and the parallel dictionary learning method provided by the invention is compared with the traditional dictionary learning and nuclear dictionary learning methods. The experimental results are shown in figure 3.
In fig. 3, (a) - (c) are respectively the monitoring effects of the traditional dictionary learning, the kernel dictionary learning and the parallel dictionary learning method provided by the invention under the scene, and (d) - (f) are the monitoring effects of the three methods under the second scene. According to experimental results, it was found that in scenario one, when a fault occurs on the linear subset, the traditional dictionary learning method is more sensitive to the fault than the kernel dictionary learning method. This is because the kernel dictionary learning method is based on nonlinear mapping solution, and linear feature changes of the original data are easily ignored in the high-dimensional feature space. In the second scenario, when a fault occurs on the nonlinear subset, the traditional dictionary learning method cannot effectively monitor the nonlinear fault due to the assumption based on linear reconstruction, and the kernel dictionary learning method can monitor the nonlinear feature change of the original data in time by searching for a linear solution in a high-dimensional feature space. As shown in fig. 3 (c) and (f), the parallel dictionary learning method provided by the invention divides the variable set and constructs linear and nonlinear monitoring models on each subset in parallel by exploring the correlation among the process variables, fully combines the advantages of the two methods of dictionary learning and nuclear dictionary learning, and improves the monitoring rate of faults under different scenes.
Through the experiments, the monitoring superiority of the method provided by the invention on the industrial process with complex correlation of the process variable compared with the traditional method is verified.
The above embodiments are preferred embodiments of the present application, and various changes or modifications may be made on the basis thereof by those skilled in the art, and such changes or modifications should be included within the scope of the present application without departing from the general inventive concept.
Reference is made to:
[1]Aharon M,Elad M,Bruckstein A.K-SVD:An algorithm for designing overcomplete dictionaries for sparse representation[J].IEEE Transactions on signal processing,2006,54(11):4311-4322.
[2]Yang C,Zhou L,Huang K,et al.Multimode process monitoring based on robust dictionary learning with application to aluminium electrolysis process[J].Neurocomputing,2019,332:305-319.
[3]Huang K,Wu Y,Wen H,et al.Distributed dictionary learning for high-dimensional process monitoring[J].Control Engineering Practice,2020,98:104386.
[4]Pati Y C,Rezaiifar R,Krishnaprasad P S.Orthogonal matching pursuit:Recursive function approximation with applications to wavelet decomposition[C].Proceedings of 27th Asilomar conference on signals,systems and computers.IEEE,1993:40-44.
[5]Huang K,Wen H,Ji H,et al.Nonlinear process monitoring using kernel dictionary learning with application to aluminum electrolysis process[J].Control Engineering Practice,2019,89:94-102.
[6]Van Nguyen H,Patel V M,Nasrabadi N M,et al.Kernel dictionary learning[C]//2012 IEEE International Conference on Acoustics,Speech and Signal Processing(ICASSP).IEEE,2012:2021-2024.

Claims (8)

1. a complex industrial process monitoring method based on parallel dictionary learning, comprising:
extracting a collinearity variable set from the monitored variable set according to variance expansion factors of all the monitored variables of the industrial process;
identifying and dividing a linear variable subset from the co-linear variable set based on a linear maximization method, wherein the rest monitoring variables in the monitoring variable set form a nonlinear variable subset;
establishing a linear monitoring model based on dictionary learning for each linear variable subset, and establishing a nonlinear monitoring model based on kernel dictionary learning for the nonlinear variable subset;
based on the established linear and nonlinear monitoring models, calculating the reconstruction errors of training data in the variable subset parts, and calculating the control limits of the linear and nonlinear monitoring models based on a kernel density estimation method;
acquiring real-time monitoring sample data of a complex industrial process on line, and calculating reconstruction errors of variable subsets by using each linear and nonlinear monitoring model; based on each reconstruction error and control limit, calculating the normal likelihood probability, the abnormal likelihood probability and the abnormal posterior probability of each variable subset part of the current monitoring sample data, fusing to obtain a global index of the current monitoring sample data, and judging whether the current complex industrial process is faulty or not by the global index.
2. The method for monitoring a complex industrial process according to claim 1, wherein the method for calculating the variance expansion factor of each monitored variable is as follows:
Figure FDA0004043804180000011
wherein ,VIFj Representing any monitored variable x j And the variance expansion factor of (2), m is the number of the monitored variables, R j 2 To monitor variable x j Determining coefficients when regressing on all the remaining monitored variables.
3. The method for monitoring a complex industrial process according to claim 1, wherein the method for extracting the collinearity variable set from the monitored variable set is as follows: the monitored variables whose variance expansion factors are greater than a given variance expansion factor threshold are combined into a set of co-linear variables.
4. The complex industrial process monitoring method according to claim 1, wherein the linear-based maximization method identifies and divides subsets of linear variables from a set of co-linear variables, in particular:
(1) Representing the collinearity set as X P =[x 1 ,x 2 ,...,x P ]∈R N×P P is the number of monitoring variables of which the variance expansion factor exceeds a threshold value, and x max Is co-linear set X P Has the greatest variance expansion factor;
(2) Maximizing x max And co-linearity set X P Correlation between linear combinations of the remaining monitored variables:
Figure FDA0004043804180000012
wherein ,XP-1 ∈R N×(P-1) Is co-linear set X P Removing x from the medium max a.epsilon.R P-1 Is X P-1 Is set in the order of (1), the combined coefficient vector of D (x) max ) And D (X) P-1 a) Variance of variables, cov (x max ,X p-1 a) Is covariance matrix;
(3) Sigma-shaped 11 ,Σ 12 ,Σ 21 ,Σ 22 Respectively cov (x) max ,x max ),cov(x max ,X p-1 )cov(X P-1 ,x max ),cov(X p-1 ,X p-1 ) The method comprises the following steps of:
Figure FDA0004043804180000021
(4) Substituting the formula (3) into the optimization problem (2), fixing denominators, and converting the optimization problem (2) into:
Figure FDA0004043804180000022
(5) Converting the optimization problem (4) into a eigenvalue decomposition problem by a Lagrangian multiplier method, and obtaining:
Σ 22 -1 Σ 21 Σ 11 -1 Σ 12 a=λ 2 a(5)
(6) Based on the number of non-zero value feature roots being min (1, P-1), solving the feature vector to obtain an optimal coefficient vector a;
(7) Screening all coefficients greater than a given weight threshold from the optimal coefficient vector, wherein the corresponding variable constitution corresponds to x max A subset of the variables that are linearly related, denoted a subset of linear variables;
(8) Extracting a subset of linear variables from the collinearity set X P If at least two variables still exist in the linear variable subset, repeating the steps to extract a new linear variable subset, otherwise, indicating that the linear variable subset is divided.
5. The complex industrial process monitoring method of claim 1, wherein the linear monitoring model is built based on dictionary learning for each subset of linear variables, in particular:
let the raw data of the industrial process based on the monitoring variable set be y= [ Y ] 1 ,y 2 ,...,y N ]∈R m×N M is the number of variables in the monitored variable set, namely the dimension of the sample, and N is the number of samples; the original data block based on the nth linear variable subset is
Figure FDA0004043804180000023
m n The number of variables in the nth linear variable subset; then based on the original data sub-block Y n The dictionary learning optimization problem of establishing a linear monitoring model is expressed as follows:
Figure FDA0004043804180000024
in the formula, D represents a dictionary to be learned,
Figure FDA0004043804180000025
for the nth original data sub-block Y n The dictionary obtained by the training is composed of K dictionary atoms d 1 n ,...,d K n Composition; x is X n =[x 1 n ,x 2 n ,...,x N n ]∈R K×N For correspondingly thinSparse coding matrix, T is sparse constraint, and sparse coding vector x is limited i n The non-zero element in (2) is less than T; n=1, …, l, l is the number of linear variable subsets obtained by division;
and establishing a nonlinear monitoring model for the nonlinear variable subset based on kernel dictionary learning, wherein the nonlinear monitoring model comprises the following concrete steps:
let the original data blocks of the linear variable subset be expressed as
Figure FDA0004043804180000031
m l+1 The number of variables in the nonlinear variable subset; then based on the original data sub-block Y l+1 The problem of the learning optimization of the nuclear dictionary for establishing the nonlinear monitoring model is expressed as follows:
Figure FDA0004043804180000032
where Φ (·) is the mapping function.
6. The complex industrial process monitoring method according to claim 5, wherein the sparse coding matrix and dictionary in the optimization problem (6) are solved with alternating optimization;
when solving the optimization problem (7), converting the optimization problem (7) into:
Figure FDA0004043804180000033
wherein A is a pseudo dictionary, and replaces an original dictionary D together with a base dictionary phi (Y);
and then converting the optimization problem (8) into the following according to the relation between the matrix norms and the traces:
Figure FDA0004043804180000034
wherein tr (. Cndot.) represents the trace of the matrix, K (Y) l+1 ,Y l+1 )∈R N×N Is a kernel matrix whose elements k (y i ,y j )=Φ(y i ) T Φ(y j );
The sparse coding matrix X and the pseudo dictionary a in the optimization problem (8) are finally updated alternately.
7. The complex industrial process monitoring method of claim 5, wherein the calculation methods of the normal likelihood probability, the abnormal likelihood probability and the abnormal posterior probability are:
Figure FDA0004043804180000035
Figure FDA0004043804180000036
Figure FDA0004043804180000037
wherein ,
Figure FDA0004043804180000038
for monitoring sample data y in real time new Partial data belonging to the n (n=1, 2,., l, l+1) variable subset; />
Figure FDA0004043804180000039
and />
Figure FDA00040438041800000310
Respectively->
Figure FDA00040438041800000311
Where N and F represent normal and abnormal, respectively; />
Figure FDA0004043804180000041
Is->
Figure FDA0004043804180000042
Is a reconstruction error of (a); TH (TH) n A control limit for the nth variable subset; />
Figure FDA0004043804180000043
Is that
Figure FDA0004043804180000044
Is an anomaly posterior probability of (2); p (P) n(F) and Pn (N) represent the prior probabilities of the abnormal and normal samples, respectively, each of which is determined by the confidence level α, i.e., P n (N)=1-α,P n (F)=α;
The global index of the current monitoring sample data obtained by fusion is as follows:
Figure FDA0004043804180000045
wherein ,GFInew For monitoring sample data y in real time new Global index of (2); here, the likelihood of anomalies over subsets of variables
Figure FDA0004043804180000046
Acting as weights, all variable subsets n (n=1, 2, abnormal posterior probability on l, l+1)>
Figure FDA0004043804180000047
Fusion to establish global index GFI new
8. A complex industrial process monitoring system based on parallel dictionary learning, comprising a memory and a processor, wherein the memory stores a computer program which, when executed by the processor, causes the processor to implement the method of any one of claims 1 to 7.
CN202310023849.XA 2023-01-09 2023-01-09 Complex industrial process monitoring method and system based on parallel dictionary learning Pending CN116125922A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310023849.XA CN116125922A (en) 2023-01-09 2023-01-09 Complex industrial process monitoring method and system based on parallel dictionary learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310023849.XA CN116125922A (en) 2023-01-09 2023-01-09 Complex industrial process monitoring method and system based on parallel dictionary learning

Publications (1)

Publication Number Publication Date
CN116125922A true CN116125922A (en) 2023-05-16

Family

ID=86293991

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310023849.XA Pending CN116125922A (en) 2023-01-09 2023-01-09 Complex industrial process monitoring method and system based on parallel dictionary learning

Country Status (1)

Country Link
CN (1) CN116125922A (en)

Similar Documents

Publication Publication Date Title
CN111222290B (en) Multi-parameter feature fusion-based method for predicting residual service life of large-scale equipment
CN111914873A (en) Two-stage cloud server unsupervised anomaly prediction method
US20180046902A1 (en) Enhanced restricted boltzmann machine with prognosibility regularization for prognostics and health assessment
CN109376401B (en) Self-adaptive multi-source information optimization and fusion mechanical residual life prediction method
CN104504296A (en) Gaussian mixture hidden Markov model and regression analysis remaining life prediction method
CN113642754B (en) Complex industrial process fault prediction method based on RF noise reduction self-coding information reconstruction and time convolution network
CN111504676A (en) Equipment fault diagnosis method, device and system based on multi-source monitoring data fusion
CN111666982A (en) Electromechanical equipment fault diagnosis method based on deep neural network
CN114065862A (en) Multi-dimensional time sequence data anomaly detection method and system
CN108549908B (en) Chemical process fault detection method based on multi-sampling probability kernel principal component model
CN116226646B (en) Method, system, equipment and medium for predicting health state and residual life of bearing
CN111325403B (en) Method for predicting residual life of electromechanical equipment of highway tunnel
CN115293280A (en) Power equipment system anomaly detection method based on space-time feature segmentation reconstruction
CN111259953B (en) Equipment defect time prediction method based on capacitive equipment defect data
CN115563563A (en) Fault diagnosis method and device based on transformer oil chromatographic analysis
CN116451117A (en) Power data anomaly detection method based on federal learning
CN115510975A (en) Multivariable time sequence abnormality detection method and system based on parallel Transomer-GRU
CN115833937A (en) Optical module fault prediction method based on variational self-encoder and long-and-short-term memory network hybrid model
CN114169091A (en) Method for establishing prediction model of residual life of engineering mechanical part and prediction method
CN114297921A (en) AM-TCN-based fault diagnosis method
CN112488202A (en) Distributed process monitoring method based on mutual information and projection dictionary pair learning
Yue Data decomposition for analytics of engineering systems: Literature review, methodology formulation, and future trends
Xie et al. Adaptive symbolic transfer entropy and its applications in modeling for complex industrial systems
CN115564075B (en) Main and auxiliary integrated fault collaborative diagnosis method and system for urban power grid
CN117290685A (en) Power plant power equipment expert diagnosis system and method based on historical data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination