CN101403923A  Course monitoring method based on nongauss component extraction and support vector description  Google Patents
Course monitoring method based on nongauss component extraction and support vector description Download PDFInfo
 Publication number
 CN101403923A CN101403923A CNA200810122086XA CN200810122086A CN101403923A CN 101403923 A CN101403923 A CN 101403923A CN A200810122086X A CNA200810122086X A CN A200810122086XA CN 200810122086 A CN200810122086 A CN 200810122086A CN 101403923 A CN101403923 A CN 101403923A
 Authority
 CN
 China
 Prior art keywords
 gauss
 alpha
 overbar
 sigma
 data
 Prior art date
 Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
 Pending
Links
 238000000605 extraction Methods 0.000 title claims abstract description 14
 238000000034 method Methods 0.000 claims abstract description 39
 238000004458 analytical method Methods 0.000 claims abstract description 31
 238000004422 calculation algorithm Methods 0.000 claims abstract description 22
 239000002245 particle Substances 0.000 claims abstract description 18
 238000004519 manufacturing process Methods 0.000 claims abstract description 15
 238000005457 optimization Methods 0.000 claims abstract description 15
 238000004364 calculation method Methods 0.000 claims abstract description 7
 239000011159 matrix material Substances 0.000 claims description 20
 230000000875 corresponding Effects 0.000 claims description 11
 239000000284 extract Substances 0.000 claims description 7
 230000003044 adaptive Effects 0.000 claims description 6
 238000006243 chemical reaction Methods 0.000 claims description 6
 238000000926 separation method Methods 0.000 claims description 5
 230000001133 acceleration Effects 0.000 claims description 3
 239000008187 granular material Substances 0.000 claims description 3
 238000009826 distribution Methods 0.000 abstract description 9
 230000000694 effects Effects 0.000 abstract description 5
 238000001514 detection method Methods 0.000 abstract description 2
 238000000513 principal component analysis Methods 0.000 abstract description 2
 238000007781 preprocessing Methods 0.000 abstract 1
 238000010586 diagrams Methods 0.000 description 4
 239000000203 mixtures Substances 0.000 description 4
 238000003745 diagnosis Methods 0.000 description 3
 OWIKHYCFFJSOEHUHFFFAOYSAN isocyanate Chemical compound   N=C=O OWIKHYCFFJSOEHUHFFFAOYSAN 0.000 description 3
 238000007619 statistical methods Methods 0.000 description 3
 230000002159 abnormal effects Effects 0.000 description 2
 238000005516 engineering processes Methods 0.000 description 2
 238000005070 sampling Methods 0.000 description 2
 230000004913 activation Effects 0.000 description 1
 230000002547 anomalous Effects 0.000 description 1
 230000001808 coupling Effects 0.000 description 1
 238000010168 coupling process Methods 0.000 description 1
 238000005859 coupling reactions Methods 0.000 description 1
 238000000354 decomposition reactions Methods 0.000 description 1
 239000000686 essences Substances 0.000 description 1
 239000011521 glasses Substances 0.000 description 1
 239000000155 melts Substances 0.000 description 1
Abstract
The invention discloses a process monitoring method which is based on nonGaussian component extraction and support vector description. The method comprises the following steps: readin of training data and data to be diagnosed, data preprocessing, establishment of a principal component analysis model, particle swarm optimization algorithm, nonGaussian projection calculation, support vector data description, residual analysis, principal component estimation, fault detection and the model updating. By the method, the nonGaussian components can be automatically extracted from operating data of an industrial process, thus avoiding the disadvantage that the conventional statistical process monitoring method assumes that data is subject to normal distribution, and the nonGaussian projection algorithm based on the particle swarm optimization algorithm ensures the maximization of the nonGaussian properties of the extracted independent components, and avoids the problem that the independent component analysis method is easy to be involved in the locally optimal solution. Compared with the conventional statistical process monitoring method, the method can find abnormity in time, effectively reduce the rate of false alarm, and obtain better monitoring effect.
Description
Technical field
The present invention relates to the industrial process fault diagnosis field, especially, relate to a kind of based on the nonGaussian statistics monitoring of nongauss component extraction and support vector description and the method for fault detect.
Background technology
Along with developing rapidly of modern industry and science and technology, characteristics such as modern process industry presents that scale is big, strong coupling between the complex structure, productive unit, investment are big.Meanwhile, the possibility that breaks down of production run also increases thereupon.In a single day this type systematic breaks down, and not only can cause the massive losses of personnel and property, and also will cause irremediable influence to ecologic environment.In order to improve the security of industrial processes and control system, improve simultaneously product quality, reduce production costs, process monitoring and fault diagnosis have become a part indispensable in the IT application in enterprises.
In recent years, multivariate statistical analysis is applied to process monitoring and fault diagnosis has obtained broad research.Traditional multivariate statistics method for supervising adopts pivot analysis (PCA:Principal Component Analysis) more, minimum partially binary is analysed methods such as (PLS:Partial Least Square), these methods are in the hypothesis independent identically distributed while of variable, also require the variable Normal Distribution, and utilization only is secondorder statistic information.In industrial real process, the reasons such as fluctuation owing to measuring interference, production status can cause no longer Normal Distribution of variable, T usually
^{2}Do not satisfying F distribution and x with the Q statistic yet
^{2}Distribute.Therefore,, often be difficult to obtain monitoring effect preferably, fail to report, rate of false alarm is higher, and can't note abnormalities timely and effectively if only adopt traditional Multivariable Statistical Methods to monitor to this type of industrial process.
Independent pca method (ICA:Independent Component Analysis) is a kind of analytical approach based on signal higher order statistical characteristic, can be used for extracting nongauss component.The purpose of this method is that observable data are carried out certain linear decomposition, utilizes the independence and the nonGauss of source signal, it is resolved into add up independently composition.Use it for the process data analyzing and processing of process industry, the probabilistic statistical characteristics of energy more efficient use variable, can under the meaning observational variable be decomposed adding up independently, obtain the activation bit source of process inherence, thus more essential description process feature.The FastICA algorithm is an algorithms most in use of monitoring at present work based on ICA, and its weak point is that separating resulting depends on initial solution, can't guarantee global optimum's property of separating also to lack effective standard of selecting the pivot number in addition.
Summary of the invention
The objective of the invention is in order to overcome the deficiency that existing Multivariable Statistical Methods is not considered the nonGauss of process variable, is difficult to obtain better monitoring effect, a kind of statistic processes monitoring and fault detection method based on nongauss component extraction and Support Vector data description is provided.This method has been avoided the deficiency of conventional statistics course monitoring method tentation data Normal Distribution, the abnormal conditions that occur in the discovery procedure in time.
Technical solution of the present invention is: by PCA dimensionality reduction is carried out in the process variable data space, then principal component space and residual error space are adopted the independent component that extracts nonGauss based on particulate group's FastICA algorithm respectively.After the nonGauss's independent component of procurement process, utilize the support vector data to describe its distribution situation, construct new statistic, determine its statistics control limit.Concrete steps are as follows:
The data of key variables are as training sample TX when 1) reading production run and normally move;
2) training sample TX is carried out preservice, make that the average of each variable is 0, variance is 1, obtains input matrix X ∈ R
^{N * n}, step is:
(1) computation of mean values:
$\stackrel{\‾}{\mathrm{TX}}=\frac{1}{N}\underset{i=1}{\overset{N}{\mathrm{\Σ}}}{\mathrm{TX}}_{i}$
(2) calculate variance:
${\mathrm{\σ}}_{x}^{2}=\frac{1}{N1}\underset{i=1}{\overset{N}{\mathrm{\Σ}}}{({\mathrm{TX}}_{i}\stackrel{\‾}{\mathrm{TX}})}^{2}$
(3) albefaction is handled:
$X=\frac{\mathrm{TX}\stackrel{\‾}{\mathrm{TX}}}{{\mathrm{\σ}}_{x}^{2}}$
Wherein, TX is a training sample, and N is a number of training, and n is a variable number;
3) set up the pivot analysis model;
4) calculate based on the nonGauss projection of particle swarm optimization algorithm, extract the nongauss component in the data;
5), make up the statistical variable and the control limit of nonGaussian signal based on Support Vector data description; Ask for the hypersphere that nonGaussian signal distributes, find the solution following quadratic programming problem:
Obtain hyperspherical center
$a=\underset{i}{\mathrm{\Σ}}{\mathrm{\α}}_{i}{x}_{i}$ And radius:
${R}^{2}=<{x}_{k}\·{x}_{k}>2\underset{i}{\mathrm{\Σ}}{\mathrm{\α}}_{i}<{x}_{k}\·{x}_{i}>+\underset{i}{\mathrm{\Σ}}\underset{j}{\mathrm{\Σ}}{\mathrm{\α}}_{i}{\mathrm{\α}}_{j}<{x}_{i}\·{x}_{j}>,$ x
_{i}, x
_{j}Be the sample point of nongauss component, x
_{k}Be the borderline support vector of hypersphere;
6) pivot is estimated: the T that makes up the pivot gaussian signal
^{2}Statistic, the calculation control limit; When insolation level is α, the control limit is calculated as follows:
7) residual analysis: make up residual error gaussian signal Q statistic, the calculation control limit;
For arbitrary input residual error e
_{i}, the Q statistic is:
When insolation level is α, the control limit is calculated as follows:
Wherein
$g=\frac{{\mathrm{\ρ}}^{2}}{2\mathrm{\μ}},$ $h=\frac{{2\mathrm{\μ}}^{2}}{{\mathrm{\ρ}}^{2}},$ ρ and μ are respectively the variance and the average of Q statistic.
8) read variable data uptodate in the production run as diagnostic data VX;
9) fault detect;
10) regularly the normal point of process status is added among the training set TX, repeats 2)～7) training process so that models such as the support vector description that upgrades in time, residual analysis and pivot statistics.
The described pivot analysis model step of setting up:
(1) covariance matrix of calculating X is designated as ∑ x;
(2) ∑ x is carried out svd, obtain characteristic root λ
_{1}, λ
_{2}..., λ
_{n}, λ wherein
_{1}〉=λ
_{2}〉=... 〉=λ
_{n}, the characteristic of correspondence vector matrix is U;
(3) calculate population variance and each eigenwert corresponding variance contribution rate, adding up from big to small by the variance contribution ratio of each eigenwert reaches setpoint up to total variance contribution ratio, and it is r that note is chosen number;
(4) the preceding r row of selected characteristic vector matrix U constitute principal component space P ∈ R
^{N * r}, remaining columns constitutes the residual error space
$\stackrel{~}{P}\∈{R}^{n\×(nr)};$
(5) calculate respectively that PCA keeps variation per minute Z=XP and remain variation per minute
$\stackrel{~}{Z}=X\stackrel{~}{P};$ The described step of calculating based on the nonGauss projection of particle swarm optimization algorithm:
(1) makes Z
^{(1)}=Z ', i=1 asks for the strongest pairing separating vector b of nonGauss's independent component of following formula by adopting the particle swarm optimization algorithm
_{1}:
Wherein J () is nonGauss's metric function, its functional form be J (y) ≈ [E{G (y) }E{G (v) }]
^{2}, in the formula, v is zeromean, unit variance gaussian variable, and G () is a nonquadratic function, and first independent component is
${s}_{1}={b}_{1}^{T}{Z}^{\left(1\right)}.$
(2) check s
_{i}Gauss; Calculate nonGauss and measure J (s
_{i}) significance degree is the confidence limit J of α
_{α}If J is (s
_{i})≤J
_{α}, s then
_{i}Be gaussian signal, nonGaussian signal is counted m=i1, forwards (5) to, otherwise continues;
(3)i＝i+1，
${Z}^{\left(i\right)}=({I}_{r,r}{b}_{i1}{b}_{i1}^{T}){Z}^{(i1)}=({I}_{r,r}\underset{j=1}{\overset{i1}{\mathrm{\Σ}}}{b}_{j1}{b}_{j1}^{T}){Z}^{\left(1\right)},$ R in the formula is the dimension of input sample point;
(4) adopt the PSO algorithm to ask for the separating vector of i nongauss component:
In the formula,
$M=({I}_{r,r}\underset{j=1}{\overset{i1}{\mathrm{\Σ}}}{b}_{j1}{b}_{j1}^{T}).$ By the projection of M battle array, guaranteed the orthogonality between the separating vector.I independent component is
${s}_{i}={b}_{i}^{T}{Z}^{\left(i\right)},$ Return (2);
(5) output separation matrix B=(b
_{1}, b
_{2}..., b
_{m}), finish;
Described particle swarm optimization algorithm steps:
(1) initialization a group particulate comprises granule amount, particulate random site and speed;
(2) estimate the fitness of each particulate;
(3) to each particulate, if adaptive value is greater than its desired positions, then with it as current desired positions; If adaptive value is then reset call number greater than full group's desired positions;
(4) as not reaching termination condition, then revise i particle's velocity and position, return (2) by following formula; Otherwise, finish
In the formula,
${\stackrel{~}{a}}_{i}=[{\stackrel{~}{a}}_{i1},{\stackrel{~}{a}}_{i2},...,{\stackrel{~}{a}}_{\mathrm{ir}}]$ Represent i particulate, V
_{i}=[V
_{I1}, V
_{I2}..., V
_{Ir}] be the speed of particulate, p
_{i}=[p
_{I1}, p
_{I2}..., p
_{Ir}] be the optimum position of this particulate experience, p
_{g}=[p
_{G1}, p
_{G2}..., p
_{Gr}] be the desired positions of all particulate experience in the colony, r is equal to dimension to be found the solution; W represents inertia weight, c
_{1}And c
_{2}Be positive acceleration constant, r
_{1}, r
_{2}Be the random number that is evenly distributed on interval [0,1].
Described fault detect:
To data to be tested VX with when training the TX and the σ that obtain
_{x} ^{2}Carry out albefaction and handle, and with the input of the data after the albefaction as the pivot analysis model, the P that obtains with training with
It is divided into principal component space and residual error residual error space, matrix is input to nonGauss projection module respectively after the conversion, obtain nongauss component, the gauss component of principal component space, nongauss component and gauss component with the residual error space, nongauss component calculates corresponding statistic by support vector description, and gauss component calculates corresponding T by the pivot analysis of routine
^{2}Statistic and Q statistic are if all less than control limit separately, judge that then this sample point is normal; Otherwise, think that the sample point statistics is unusual, the process object may break down.
Beneficial effect of the present invention mainly shows: 1, by nonGauss projection, isolated the nongauss component in the process variable, and utilize the support vector data to describe its distribution situation, construct new statistic, determine its statistics control limit, avoided the deficiency of conventional statistics course monitoring method hypothesis Normal Distribution; Gaussian signal after the separation is more suitable for the monitoring of multivariates such as pivot analysis, residual analysis, thus the abnormal conditions in time in the discovery procedure; 2, based on the nonGauss projection algorithm of particle swarm optimization, overcome independent pivot analysis (ICA) method and easily be absorbed in the deficiency of local minimum, can guarantee the nonGauss's maximization of independent component of extraction, and need not to set the nongauss component number in advance.
Description of drawings
Fig. 1 is the theory diagram of course monitoring method proposed by the invention
Fig. 2 is the process flow diagram of nonGauss projection algorithm
Fig. 3 is that synoptic diagram is carried out in online monitoring
Fig. 4 is traditional pca method and the inventive method monitoring effect comparison diagram
Embodiment
Below in conjunction with accompanying drawing the present invention is further described.
With reference to Fig. 1, Fig. 2 and Fig. 3, a kind of course monitoring method based on nongauss component extraction and Support Vector data description, specific implementation method is as follows:
(1) offline modeling
Obtain a collection of measurement data of industrial process, set up each model, obtain corresponding projection matrix, detailed process is as follows:
The data of key variables are as training sample TX when 1) reading production run and normally move
_{N * n}, wherein, N is a number of training, n is a variable number;
2) training sample TX is carried out preservice, make that the average of each variable is 0, variance is 1, obtains input matrix X ∈ R
^{N * n}, step is:
(1) computation of mean values:
$\stackrel{\‾}{\mathrm{TX}}=\frac{1}{N}\underset{i=1}{\overset{N}{\mathrm{\Σ}}}{\mathrm{TX}}_{i}$
(2) calculate variance:
${\mathrm{\σ}}_{x}^{2}=\frac{1}{N1}\underset{i=1}{\overset{N}{\mathrm{\Σ}}}{({\mathrm{TX}}_{i}\stackrel{\‾}{\mathrm{TX}})}^{2}$
(3) albefaction is handled:
$X=\frac{\mathrm{TX}\stackrel{\‾}{\mathrm{TX}}}{{\mathrm{\σ}}_{x}^{2}}$
3) set up the pivot analysis model;
Pivot analysis is mainly used in dimensionality reduction, extracts the pivot composition, and measurement space is decomposed into principal component space and residual error space.Pivot variance extraction ratio is generally greater than 80%, and computation process adopts the method for covariance svd, and step is as follows:
(1) covariance matrix of calculating X is designated as ∑ x;
(2) ∑ x is carried out svd, obtain characteristic root λ
_{1}, λ
_{2}..., λ
_{n}, λ wherein
_{1}〉=λ
_{2}〉=... 〉=λ
_{n}, the characteristic of correspondence vector matrix is U;
(3) calculate population variance and each eigenwert corresponding variance contribution rate, adding up from big to small by the variance contribution ratio of each eigenwert reaches setpoint up to total variance contribution ratio, and it is r that note is chosen number;
(4) the preceding r row of selected characteristic vector matrix U constitute principal component space P ∈ R
^{N * r}, remaining columns constitutes the residual error space
$\stackrel{~}{P}\∈{R}^{n\×(nr)};$
(5) calculate respectively that PCA keeps variation per minute Z=XP and remain variation per minute
$\stackrel{~}{Z}=X\stackrel{~}{P};$
Pivot analysis is lost under the minimum principle making every effort to data message, to the variable space dimensionality reduction of higherdimension.In fact, essence is a few linear combination of research variable system, and the generalized variable that this several linear combination constituted will keep former variable information as much as possible.
4) calculate based on the nonGauss projection of particle swarm optimization algorithm, extract the nongauss component in the data;
It is the nongauss component that is used to extract the input data that nonGauss projection is calculated, and adopts based on particulate group's FastICA algorithm and realizes, can guarantee that the independent component that extracts is a global optimum, and provide the nongauss component number automatically, need not artificial setting.Supposing will be to data set Z ∈ R
^{N * r}Extract nongauss component, N is a sample number, and r is a variable number, and the specific implementation step is as follows:
(1) makes Z
^{(1)}=Z ', i=1 adopts the particle swarm optimization algorithm to ask for the strongest pairing separating vector b of nonGauss's independent component of following formula
_{1}, obtain first independent component and be
${s}_{1}={b}_{1}^{T}{Z}^{\left(1\right)};$
In the formula, J () is nonGauss's metric function, its functional form be J (y) ≈ [E{G (y) }E{G (v) }]
^{2}, in the formula, v is zeromean, unit variance gaussian variable, G () is a nonquadratic function, can select following form for use:
G
_{2}(u)＝exp(a
_{2}u
^{2}/2)，
G
_{3}(u)＝u
^{4}.
In the formula, 1≤a
_{1}≤ 2, a
_{2}≈ 1.
(2) check s
_{i}Gauss:
1. calculate nonGauss and measure J (s
_{i}) significance degree is the confidence limit J of α
_{α}(s
_{i}), can ask for by following theorem:
Suppose that it is y ∈ N (0,1) that y obeys the standard Gaussian distribution, y
_{1}, y
_{2}..., y
_{N}For the capacity of independent draws from overall y is the simple sample of N, then when N → ∞, according to sample y
_{1}, y
_{2}..., y
_{N}The nonGauss of the y that calculates measures J (y
_{1}, y
_{2}..., y
_{N}) meet the following conditions:
Promptly
$\frac{1}{D\left(G\left(v\right)\right)}N\·J({y}_{1},{y}_{2},...,{y}_{N})$ Progressively obey degree of freedom and be 1 x
^{2}Distribute, wherein J (), G () function definition are the same, and D () is a variance function.Given level of significance α, then
α generally gets 0.05 or 0.1;
If 2. J (s
_{i})≤J
_{α}, s then
_{i}Be gaussian signal, nonGaussian signal is counted m=i1, forwards (5) to, otherwise continues;
(3)i＝i+1，
${Z}^{\left(i\right)}=({I}_{r,r}{b}_{i1}{b}_{i1}^{T}){Z}^{(i1)}=({I}_{r,r}\underset{j=1}{\overset{i1}{\mathrm{\Σ}}}{b}_{j1}{b}_{j1}^{T}){Z}^{\left(1\right)};$
(4) adopt the PSO algorithm to ask for the separating vector of i nongauss component:
In the formula,
$M=({I}_{r,r}\underset{j=1}{\overset{i1}{\mathrm{\Σ}}}{b}_{j1}{b}_{j1}^{T}).$ By the projection of M battle array, guaranteed the orthogonality between the separating vector.I independent component is
${s}_{i}={b}_{i}^{T}{Z}^{\left(i\right)},$ Return (2);
(5) output separation matrix B=(b
_{1}, b
_{2}..., b
_{m}), finish;
Described particle swarm optimization algorithm is used to find the solution unconstrained optimization problem, obtains globally optimal solution, adopts following steps to realize:
(1) initialization a group particulate comprises granule amount, particulate random site and speed; Atomic dimension is equal to dimension to be found the solution, and particulate scale (number) is 10～15 times of particle dimension, and position initial value, speed initial value are random number;
(2) estimate the fitness of each particulate, promptly calculate corresponding target function value;
(3) to each particulate, if adaptive value is greater than its desired positions, then with it as current desired positions; If adaptive value is then reset call number greater than full group's desired positions;
(4) as not reaching termination condition, then revise i particle's velocity and position, return (2) by following formula; Otherwise, finish
In the formula,
${\stackrel{~}{a}}_{i}=[{\stackrel{~}{a}}_{i1},{\stackrel{~}{a}}_{i2},...,{\stackrel{~}{a}}_{\mathrm{ir}}]$ Represent i particulate, V
_{i}=[V
_{I1}, V
_{I2}..., V
_{Ir}] be the speed of particulate, p
_{i}=[p
_{I1}, p
_{I2}..., p
_{Ir}] be the optimum position of this particulate experience, p
_{g}=[p
_{G1}, p
_{G2}..., p
_{Gr}] be the desired positions of all particulate experience in the colony, r is equal to dimension to be found the solution; W represents inertia weight, c
_{1}And c
_{2}Be positive acceleration constant, r
_{1}, r
_{2}Be the random number that is evenly distributed on interval [0,1].
5), make up the statistical variable and the control limit of nonGaussian signal based on Support Vector data description; Ask for the hypersphere that nonGaussian signal distributes, find the solution following quadratic programming problem:
Obtain hyperspherical center
$a=\underset{i}{\mathrm{\Σ}}{\mathrm{\α}}_{i}{x}_{i}$ And radius:
${R}^{2}=<{x}_{k}\·{x}_{k}>2\underset{i}{\mathrm{\Σ}}{\mathrm{\α}}_{i}<{x}_{k}\·{x}_{i}>+\underset{i}{\mathrm{\Σ}}\underset{j}{\mathrm{\Σ}}{\mathrm{\α}}_{i}{\mathrm{\α}}_{j}<{x}_{i}\·{x}_{j}>,$ x
_{i}, x
_{j}Be the sample point of nongauss component, x
_{k}Be the borderline support vector of hypersphere;
6) pivot is estimated: the T that makes up the pivot gaussian signal
^{2}Statistic, the calculation control limit; When insolation level is α, the control limit is calculated as follows:
7) residual analysis: make up residual error gaussian signal Q statistic, the calculation control limit;
For arbitrary input residual error e
_{i}, the Q statistic is:
When insolation level is α, the control limit is calculated as follows:
Wherein
$g=\frac{{\mathrm{\ρ}}^{2}}{2\mathrm{\μ}},$ $h=\frac{{2\mathrm{\μ}}^{2}}{{\mathrm{\ρ}}^{2}},$ ρ and μ are respectively the variance and the average of Q statistic.
(2) online monitoring
Abovementioned steps is the process monitoring modeling process.After the modelling, obtain separation matrix and control limit, can realize online monitoring, may further comprise the steps:
8) read variable data uptodate in the production run as diagnostic data VX;
9) fault detect;
To data to be tested VX with when training the TX and the σ that obtain
_{x} ^{2}Carry out albefaction and handle, and with the input of the data after the albefaction as the pivot analysis model, the P that obtains with training with
It is divided into principal component space and residual error residual error space, matrix is input to nonGauss projection module respectively after the conversion, obtain nongauss component, the gauss component of principal component space, nongauss component and gauss component with the residual error space, nongauss component calculates corresponding statistic by support vector description, and gauss component calculates corresponding T by the pivot analysis of routine
^{2}Statistic and Q statistic are if all less than control limit separately, judge that then this sample point is normal; Otherwise, think that the sample point statistics is unusual, the process object may break down.
10) regularly the normal point of process status is added among the training set TX, repeats 2)～7) the training journey so that models such as the support vector description that upgrades in time, residual analysis and pivot statistics.
For actual industrial process, the present invention realizes that the specific implementation process of online monitoring is:
(1) sets time interval of each sampling with timer;
(2) each sampling period from the realtime data base of DCS, obtain uptodate variable data, as diagnostic data VX;
(3) data to be tested VX is with training TX and the σ that obtains
_{x} ^{2}Carry out albefaction and handle, and the data after will handling are as the input of pivot analysis model;
(4) with the P battle array that obtains of training conversion is carried out in input, obtain z and
The input of calculating respectively as nonGauss projection;
(5) during nonGauss projection is calculated, the B that z obtains by training
_{1}The battle array conversion obtains s
_{1}And τ
_{1}Signal;
The B that obtains by training
_{2}The battle array conversion obtains s
_{2}And τ
_{2}Signal is estimated as Support Vector data description, pivot respectively and the input of residual analysis.Here it should be noted that different according to process data X linear degree and nonGauss's degree, may can only the acquisition unit subsignal.
(6) in the Support Vector data description, to input data s
_{1}, adopt following formula to calculate the D statistic of input data:
If
${D}_{1}^{2}\≤{R}_{1}^{2},$ Illustrate that this sample point D statistics is normal, otherwise, illustrate that this sample point statistics is unusual.
In like manner, to input data s
_{2}, if the nongauss component in residual error space, adopt following formula to calculate the D statistic of input data:
If
${D}_{2}^{2}\≤{R}_{2}^{2},$ Illustrate that this sample point D statistics is normal, otherwise, illustrate that this sample point statistics is unusual.
(7) during pivot is estimated, adopt following formula to calculate the T of input data
^{2}Statistic:
If
${T}^{2}<{T}_{\mathrm{\α}}^{2},$ This sample point T is described
^{2}Statistics is normal, otherwise, this sample point T
^{2}Statistics is unusual.
(8) in the residual analysis, adopt following formula to calculate the Q statistic of input data:
If
$Q<{\mathrm{\δ}}_{\mathrm{\α}}^{2},$ Illustrate that this sample point Q statistics is normal, otherwise this sample point Q statistics is unusual, the process object breaks down;
(9) the process monitoring result is passed to DCS, by DCS system and fieldbus procedural information is delivered to operator station simultaneously and shows, make the executeinplace worker can in time handle anomalous event.
In the online monitoring process, regularly the normal point of process status to be added among the training set TX, the repetition training process is so that the model in the support vector description that upgrades in time, residual analysis and the pivot statistics keeps model to have better dynamic.
The course monitoring method based on nongauss component extraction and Support Vector data description in order to illustrate that better the present invention proposes utilizes industrial glass to melt process data, adds up method for supervising with traditional pivot analysis and compares.Fig. 4 has provided both monitored results.The result shows that the method that the present invention proposes can detect fault earlier, and is sensitiveer than pca method, and the alert rate of mistake is low.
Claims (5)
1, a kind of course monitoring method based on nongauss component extraction and support vector description is characterized in that may further comprise the steps:
The data of key variables are as training sample TX when 1) reading production run and normally move;
2) training sample TX is carried out preservice, make that the average of each variable is 0, variance is 1, obtains input matrix X ∈ R
^{N * n}, step is:
(1) computation of mean values:
$\stackrel{\‾}{\mathrm{TX}}=\frac{1}{N}\underset{i=1}{\overset{N}{\mathrm{\Σ}}}T{X}_{i}$
(2) calculate variance:
${\mathrm{\σ}}_{x}^{2}=\frac{1}{N1}\underset{i=1}{\overset{N}{\mathrm{\Σ}}}{({\mathrm{TX}}_{i}\stackrel{\‾}{\mathrm{TX}})}^{2}$
(3) albefaction is handled:
$X=\frac{\mathrm{TX}\stackrel{\‾}{\mathrm{TX}}}{{\mathrm{\σ}}_{x}^{2}}$
Wherein, TX is a training sample, and N is a number of training, and n is a variable number;
3) set up the pivot analysis model;
4) calculate based on the nonGauss projection of particle swarm optimization algorithm, extract the nongauss component in the data;
5), make up the statistical variable and the control limit of nonGaussian signal based on Support Vector data description; Ask for the hypersphere that nonGaussian signal distributes, find the solution following quadratic programming problem:
Obtain hyperspherical center
$a=\underset{i}{\mathrm{\Σ}}{\mathrm{\α}}_{i}{x}_{i}$ And radius:
${R}^{2}=<{x}_{k}\·{x}_{k}>2\underset{i}{\mathrm{\Σ}}{\mathrm{\α}}_{i}<{x}_{k}\·{x}_{i}>+\underset{i}{\mathrm{\Σ}}\underset{j}{\mathrm{\Σ}}{\mathrm{\α}}_{i}{\mathrm{\α}}_{j}<{x}_{i}\·{x}_{j}>,$ x
_{i}, x
_{j}Be the sample point of nongauss component, x
_{k}Be the borderline support vector of hypersphere;
6) pivot is estimated: the T that makes up the pivot gaussian signal
^{2}Statistic, the calculation control limit; When insolation level is α, the control limit is calculated as follows:
7) residual analysis: make up residual error gaussian signal Q statistic, the calculation control limit;
For arbitrary input residual error e
_{i}, the Q statistic is:
When insolation level is α, the control limit is calculated as follows:
Wherein
$g=\frac{{\mathrm{\ρ}}^{2}}{2\mathrm{\μ}},$ $h=\frac{{2\mathrm{\μ}}^{2}}{{\mathrm{\ρ}}^{2}},$ ρ and μ are respectively the variance and the average of Q statistic.
8) read variable data uptodate in the production run as diagnostic data VX;
9) fault detect;
10) regularly the normal point of process status is added among the training set TX, repeats 2)～7) training process so that models such as the support vector description that upgrades in time, residual analysis and pivot statistics.
2. a kind of course monitoring method based on nongauss component extraction and support vector description as claimed in claim 1 is characterized in that the described pivot analysis model step of setting up:
(1) covariance matrix of calculating X is designated as ∑ x;
(2) ∑ x is carried out svd, obtain characteristic root λ
_{1}, λ
_{2}..., λ
_{n}, λ wherein
_{1}〉=λ
_{2}〉=... 〉=λ
_{n}, the characteristic of correspondence vector matrix is U;
(3) calculate population variance and each eigenwert corresponding variance contribution rate, adding up from big to small by the variance contribution ratio of each eigenwert reaches setpoint up to total variance contribution ratio, and it is r that note is chosen number;
(4) the preceding r row of selected characteristic vector matrix U constitute principal component space P ∈ R
^{N * r}, remaining columns constitutes the residual error space
$\stackrel{~}{P}\∈{R}^{n\×(nr)};$
(5) calculate respectively that PCA keeps variation per minute Z=XP and remain variation per minute
$\stackrel{~}{Z}=X\stackrel{~}{P};$
3. a kind of course monitoring method based on nongauss component extraction and support vector description as claimed in claim 1 is characterized in that the described step of calculating based on the nonGauss projection of particle swarm optimization algorithm:
(1) makes Z
^{(1)}=Z ', i=1 asks for the strongest pairing separating vector b of nonGauss's independent component of following formula by adopting the particle swarm optimization algorithm
_{1}:
Wherein J () is nonGauss's metric function, its functional form be J (y) ≈ [E{G (y) }E{G (v) }]
^{2}, in the formula, v is zeromean, unit variance gaussian variable, and G () is a nonquadratic function, and first independent component is
${s}_{1}={b}_{1}^{T}{Z}^{\left(1\right)};$
(2) check s
_{i}Gauss; Calculate nonGauss and measure J (s
_{i}) significance degree is the confidence limit J of α
_{α}If J is (s
_{i})≤J
_{α}, s then
_{i}Be gaussian signal, nonGaussian signal is counted m=i1, forwards (5) to, otherwise continues;
(3)i＝i+1，
${Z}^{\left(i\right)}=({I}_{r,r}{b}_{i1}{b}_{i1}^{T}){Z}^{(i1)}=({I}_{r,r}\underset{j=1}{\overset{i1}{\mathrm{\Σ}}}{b}_{j1}{b}_{j1}^{T}){Z}^{\left(1\right)},$ R in the formula is the dimension of input sample point;
(4) adopt the PSO algorithm to ask for the separating vector of i nongauss component:
In the formula,
$M=({I}_{r,r}\underset{j=1}{\overset{i1}{\mathrm{\Σ}}}{b}_{j1}{b}_{j1}^{T}).$ By the projection of M battle array, guaranteed the orthogonality between the separating vector.I independent component is
${s}_{i}={b}_{i}^{T}{Z}^{\left(i\right)},$ Return (2);
(5) output separation matrix B=(b
_{1}, b
_{2}..., b
_{m}), finish;
4, a kind of course monitoring method based on nongauss component extraction and support vector description as claimed in claim 3 is characterized in that described particle swarm optimization algorithm steps:
(1) initialization a group particulate comprises granule amount, particulate random site and speed;
(2) estimate the fitness of each particulate;
(3) to each particulate, if adaptive value is greater than its desired positions, then with it as current desired positions; If adaptive value is then reset call number greater than full group's desired positions;
(4) as not reaching termination condition, then revise i particle's velocity and position, return (2) by following formula; Otherwise, finish
In the formula,
${\stackrel{~}{a}}_{i}=[{\stackrel{~}{a}}_{i1},{\stackrel{~}{a}}_{i2},...,{\stackrel{~}{a}}_{\mathrm{ir}}]$ Represent i particulate, V
_{i}=[V
_{I1}, V
_{I2}..., V
_{Ir}] be the speed of particulate, p
_{i}=[p
_{I1}, p
_{I2}..., p
_{Ir}] be the optimum position of this particulate experience, p
_{g}=[p
_{G1}, p
_{G2}..., p
_{Gr}] be the desired positions of all particulate experience in the colony, r is equal to dimension to be found the solution; W represents inertia weight, c
_{1}And c
_{2}Be positive acceleration constant, r
_{1}, r
_{2}Be the random number that is evenly distributed on interval [0,1].
5, a kind of course monitoring method based on nongauss component extraction and support vector description as claimed in claim 1 is characterized in that described fault detect:
To data to be tested VX with when training the TX and the σ that obtain
_{x} ^{2}Carry out albefaction and handle, and with the input of the data after the albefaction as the pivot analysis model, the P that obtains with training with
It is divided into principal component space and residual error residual error space, matrix is input to nonGauss projection module respectively after the conversion, obtain nongauss component, the gauss component of principal component space, nongauss component and gauss component with the residual error space, nongauss component calculates corresponding statistic by support vector description, and gauss component calculates corresponding T by the pivot analysis of routine
^{2}Statistic and Q statistic are if all less than control limit separately, judge that then this sample point is normal; Otherwise, think that the sample point statistics is unusual, the process object may break down.
Priority Applications (1)
Application Number  Priority Date  Filing Date  Title 

CNA200810122086XA CN101403923A (en)  20081031  20081031  Course monitoring method based on nongauss component extraction and support vector description 
Applications Claiming Priority (1)
Application Number  Priority Date  Filing Date  Title 

CNA200810122086XA CN101403923A (en)  20081031  20081031  Course monitoring method based on nongauss component extraction and support vector description 
Publications (1)
Publication Number  Publication Date 

CN101403923A true CN101403923A (en)  20090408 
Family
ID=40537956
Family Applications (1)
Application Number  Title  Priority Date  Filing Date 

CNA200810122086XA Pending CN101403923A (en)  20081031  20081031  Course monitoring method based on nongauss component extraction and support vector description 
Country Status (1)
Country  Link 

CN (1)  CN101403923A (en) 
Cited By (19)
Publication number  Priority date  Publication date  Assignee  Title 

CN102331772A (en) *  20110330  20120125  浙江省电力试验研究院  Method for carrying out early warning of abnormal superheated steam temperature and fault diagnosis on direct current megawatt unit 
CN103377316A (en) *  20130715  20131030  浙江大学  Penicillin production process monitoring method based on statistical analysis and Bayesian ensemble 
CN103488091A (en) *  20130927  20140101  上海交通大学  Datadriving control process monitoring method based on dynamic component analysis 
CN103761445A (en) *  20140218  20140430  苏州大学  Medical diagnosis method and system based on density induction oneclass support vector machine 
CN104503436A (en) *  20141208  20150408  浙江大学  Quick fault detection method based on random projection and knearest neighbor method 
CN104657574A (en) *  20140613  20150527  苏州大学  Building method and device for medical diagnosis models 
CN104765022A (en) *  20150318  20150708  中船重工鹏力（南京）大气海洋信息系统有限公司  Methods for characteristic value probability statistics model establishment and antenna selfcheck based on echo spectra 
CN105739489A (en) *  20160512  20160706  电子科技大学  Batch process fault detecting method based on ICAKNN 
CN106444665A (en) *  20160922  20170222  宁波大学  Fault classification diagnosis method based on nonGaussian similarity matching 
CN106897509A (en) *  20170216  20170627  大连理工大学  A kind of dynamic NonGaussian structures Monitoring Data abnormality recognition method 
CN106897505A (en) *  20170213  20170627  大连理工大学  A kind of structure monitoring data exception recognition methods for considering temporal correlation 
CN107065842A (en) *  20170526  20170818  宁波大学  A kind of fault detection method based on particle group optimizing core independent component analysis model 
CN107103162A (en) *  20170526  20170829  中国人民解放军国防科学技术大学  A kind of vibration accelerated test method and system based on Theory of The Cumulative Fatigue Damage 
CN107298485A (en) *  20170727  20171027  华东理工大学  It is a kind of based on method of the data model to the fault detection and diagnosis of During Industrial Wastewater Treatment Process 
CN109240276A (en) *  20181109  20190118  江南大学  Mutipiece PCA fault monitoring method based on FaultSensitive Principal variables selection 
WO2019019429A1 (en) *  20170728  20190131  上海中兴软件有限责任公司  Anomaly detection method, device and apparatus for virtual machine, and storage medium 
CN110187206A (en) *  20190522  20190830  中国人民解放军国防科技大学  The fault detection method of the suspension system of nongausian process under a kind of complex working condition 
CN110701487A (en) *  20190918  20200117  浙江工业大学  KPCA and CasSVDDbased multiworkingcondition pipeline leakage detection method 
CN111177970A (en) *  20191210  20200519  浙江大学  Multistage semiconductor process virtual metering method based on Gaussian process and convolutional neural network 

2008
 20081031 CN CNA200810122086XA patent/CN101403923A/en active Pending
Cited By (27)
Publication number  Priority date  Publication date  Assignee  Title 

CN102331772A (en) *  20110330  20120125  浙江省电力试验研究院  Method for carrying out early warning of abnormal superheated steam temperature and fault diagnosis on direct current megawatt unit 
CN102331772B (en) *  20110330  20130327  浙江省电力试验研究院  Method for carrying out early warning of abnormal superheated steam temperature and fault diagnosis on direct current megawatt unit 
CN103377316A (en) *  20130715  20131030  浙江大学  Penicillin production process monitoring method based on statistical analysis and Bayesian ensemble 
CN103488091A (en) *  20130927  20140101  上海交通大学  Datadriving control process monitoring method based on dynamic component analysis 
CN103761445A (en) *  20140218  20140430  苏州大学  Medical diagnosis method and system based on density induction oneclass support vector machine 
CN104657574B (en) *  20140613  20171031  苏州大学  The method for building up and device of a kind of medical diagnosismode 
CN104657574A (en) *  20140613  20150527  苏州大学  Building method and device for medical diagnosis models 
CN104503436A (en) *  20141208  20150408  浙江大学  Quick fault detection method based on random projection and knearest neighbor method 
CN104765022A (en) *  20150318  20150708  中船重工鹏力（南京）大气海洋信息系统有限公司  Methods for characteristic value probability statistics model establishment and antenna selfcheck based on echo spectra 
CN104765022B (en) *  20150318  20170301  中船重工鹏力（南京）大气海洋信息系统有限公司  The method that eigenvalue probability statistics model and antenna selfinspection are set up based on echo spectrum 
CN105739489A (en) *  20160512  20160706  电子科技大学  Batch process fault detecting method based on ICAKNN 
CN105739489B (en) *  20160512  20180413  电子科技大学  A kind of batch process fault detection method based on ICA KNN 
CN106444665A (en) *  20160922  20170222  宁波大学  Fault classification diagnosis method based on nonGaussian similarity matching 
CN106897505A (en) *  20170213  20170627  大连理工大学  A kind of structure monitoring data exception recognition methods for considering temporal correlation 
CN106897509B (en) *  20170216  20200616  大连理工大学  Dynamic nonGaussian structure monitoring data anomaly identification method 
CN106897509A (en) *  20170216  20170627  大连理工大学  A kind of dynamic NonGaussian structures Monitoring Data abnormality recognition method 
CN107103162A (en) *  20170526  20170829  中国人民解放军国防科学技术大学  A kind of vibration accelerated test method and system based on Theory of The Cumulative Fatigue Damage 
CN107065842B (en) *  20170526  20190426  宁波大学  A kind of fault detection method based on particle group optimizing core independent component analysis model 
CN107065842A (en) *  20170526  20170818  宁波大学  A kind of fault detection method based on particle group optimizing core independent component analysis model 
CN107298485A (en) *  20170727  20171027  华东理工大学  It is a kind of based on method of the data model to the fault detection and diagnosis of During Industrial Wastewater Treatment Process 
WO2019019429A1 (en) *  20170728  20190131  上海中兴软件有限责任公司  Anomaly detection method, device and apparatus for virtual machine, and storage medium 
CN109240276A (en) *  20181109  20190118  江南大学  Mutipiece PCA fault monitoring method based on FaultSensitive Principal variables selection 
CN110187206A (en) *  20190522  20190830  中国人民解放军国防科技大学  The fault detection method of the suspension system of nongausian process under a kind of complex working condition 
CN110701487A (en) *  20190918  20200117  浙江工业大学  KPCA and CasSVDDbased multiworkingcondition pipeline leakage detection method 
CN110701487B (en) *  20190918  20210824  浙江工业大学  KPCA and CasSVDDbased multiworkingcondition pipeline leakage detection method 
CN111177970A (en) *  20191210  20200519  浙江大学  Multistage semiconductor process virtual metering method based on Gaussian process and convolutional neural network 
CN111177970B (en) *  20191210  20211119  浙江大学  Multistage semiconductor process virtual metering method based on Gaussian process and convolutional neural network 
Similar Documents
Publication  Publication Date  Title 

CN101403923A (en)  Course monitoring method based on nongauss component extraction and support vector description  
Jiang et al.  Plantwide process monitoring based on mutual information–multiblock principal component analysis  
CN101169623B (en)  Nonlinear procedure fault identification method based on kernel principal component analysis contribution plot  
Wang et al.  Multivariate statistical process monitoring using an improved independent component analysis  
CN102361014B (en)  State monitoring and fault diagnosis method for largescale semiconductor manufacture process  
Ge et al.  Online monitoring of nonlinear multiple mode processes based on adaptive local model approach  
Ge et al.  Improved kernel PCAbased monitoring approach for nonlinear processes  
Zhang et al.  Fault detection of nonGaussian processes based on modified independent component analysis  
CN100470417C (en)  Fault diagnostic system and method for under industrial producing process small sample condition  
CN101446831B (en)  Decentralized process monitoring method  
Jiang et al.  Weighted kernel principal component analysis based on probability density estimation and moving window and its application in nonlinear chemical process monitoring  
CN103488091A (en)  Datadriving control process monitoring method based on dynamic component analysis  
CN104062968A (en)  Continuous chemical process fault detection method  
CN104714537A (en)  Fault prediction method based on joint relative change analysis and autoregression model  
CN109459993B (en)  Online adaptive fault monitoring and diagnosing method for process industrial process  
BernaldeLázaro et al.  Enhanced dynamic approach to improve the detection of smallmagnitude faults  
Deng et al.  Multimode process fault detection using local neighborhood similarity analysis  
CN104808648A (en)  Online and realtime batch process monitoring method based on k nearest neighbor  
RuizCárcel et al.  Canonical variate analysis for performance degradation under faulty conditions  
Ge  Improved twolevel monitoring system for plantwide processes  
CN103926919A (en)  Industrial process fault detection method based on wavelet transform and Lasso function  
CN201017232Y (en)  Industry process nonlinearity failure diagnosis device based on fisher  
Lindner et al.  Datadriven fault detection with process topology for fault identification  
Maestri et al.  Kernel PCA performance in processes with multiple operation modes  
CN201035376Y (en)  Failure diagnosis device under small sample conditional in the process of manufacturing production 
Legal Events
Date  Code  Title  Description 

PB01  Publication  
C06  Publication  
SE01  Entry into force of request for substantive examination  
C10  Entry into substantive examination  
RJ01  Rejection of invention patent application after publication 
Open date: 20090408 

C12  Rejection of a patent application after its publication 