CN109491338B - Multimode process quality-related fault diagnosis method based on sparse GMM - Google Patents
Multimode process quality-related fault diagnosis method based on sparse GMM Download PDFInfo
- Publication number
- CN109491338B CN109491338B CN201811331420.2A CN201811331420A CN109491338B CN 109491338 B CN109491338 B CN 109491338B CN 201811331420 A CN201811331420 A CN 201811331420A CN 109491338 B CN109491338 B CN 109491338B
- Authority
- CN
- China
- Prior art keywords
- sparse
- sample
- fault
- variable
- gmm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 134
- 238000003745 diagnosis Methods 0.000 title claims abstract description 29
- 238000012544 monitoring process Methods 0.000 claims abstract description 40
- 239000011159 matrix material Substances 0.000 claims abstract description 20
- 239000000203 mixture Substances 0.000 claims abstract description 19
- 238000009826 distribution Methods 0.000 claims abstract description 17
- 238000001514 detection method Methods 0.000 claims abstract description 15
- 230000006870 function Effects 0.000 claims description 27
- 238000012549 training Methods 0.000 claims description 27
- 238000005457 optimization Methods 0.000 claims description 10
- 238000005259 measurement Methods 0.000 claims description 7
- 150000001875 compounds Chemical class 0.000 claims description 6
- 238000012360 testing method Methods 0.000 claims description 6
- 238000005516 engineering process Methods 0.000 claims description 3
- 238000002955 isolation Methods 0.000 claims description 3
- 238000013139 quantization Methods 0.000 claims description 3
- 238000003860 storage Methods 0.000 claims description 3
- 230000017105 transposition Effects 0.000 claims description 3
- 230000000694 effects Effects 0.000 abstract description 2
- 238000004519 manufacturing process Methods 0.000 description 9
- 238000003070 Statistical process control Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000010978 in-process monitoring Methods 0.000 description 1
- 230000007786 learning performance Effects 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 230000001932 seasonal effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/418—Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM]
- G05B19/41875—Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM] characterised by quality surveillance of production
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G05B2219/32—Operator till task planning
- G05B2219/32015—Optimize, process management, optimize production line
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/02—Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Manufacturing & Machinery (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Automation & Control Theory (AREA)
- Complex Calculations (AREA)
- Testing And Monitoring For Control Systems (AREA)
Abstract
The invention discloses a multimode process quality related fault diagnosis method based on sparse GMM, which is characterized in that a high-quality coefficient weight matrix is obtained by utilizing sparse representation, manifold structure information is fused, a sparse Gaussian mixture model is constructed, probability distribution of Gaussian components smoothly changes along a data manifold structure, local neighbor samples of the Gaussian components are similar, the number of the Gaussian components is automatically obtained, robustness on noise and outliers is achieved, quality related fault detection is obtained, and meanwhile, a root-cause variable of fault occurrence is located according to controlled neighbor of detected faults. Compared with a Gaussian mixture model monitoring method, the method disclosed by the invention represents the sparse relation between the local manifold structure of the process data and the data, obtains the local similarity relation between samples, and reflects the change condition of the multi-modal process. Therefore, the sparse GMM method can obtain better fault detection effect and accurately position the root variable of the fault.
Description
Technical Field
The invention relates to the technical field of industrial process monitoring, in particular to a multimode process quality-related fault diagnosis method based on sparse GMM.
Background
The process monitoring in modern industry plays a key role in ensuring production safety, improving yield and the like. With the development of distributed control systems, the production scale and the operational complexity increase dramatically, and a large amount of high-dimensional data is collected in the process. Moreover, as the grade and yield of the produced products can be continuously adjusted according to market demands and seasonal effects, the process parameters such as product components, process set values, feed proportions and the like can fluctuate, and modern industrial processes can be switched among a plurality of different operation modes. These random variations in the production process cause the process data to exhibit non-linear, multi-modal, etc. characteristics. Although the data-driven Multivariate Statistical Process Control (MSPC) method is successfully applied to Process monitoring, the mean value and the covariance of multimode nonlinear data are significantly changed, and the traditional MSPC method ignores the nonlinear and multimode relations between different Process variables, which may cause the degradation of the monitoring result. Moreover, in the actual production process, the yield and the product quality are usually difficult to measure directly on-line, and measurement is required after the production is completed. Therefore, building a relational model between product variables and quality variables is particularly important for quality-related multi-modal process monitoring. A Gaussian Mixture Model (GMM) is used for multi-modal process monitoring, a series of Gaussian components are used for estimating complex data distribution in the multi-modal process, and statistical indexes based on Mahalanobis distance and likelihood probability are constructed to implement process monitoring.
However, GMM assumes that each single mode of a multi-modal process is Gaussian distributed, and it is possible for actual process data to be centrally distributed over a low-dimensional sub-manifold structure. Therefore, the neighborhood samples of each manifold are probably distributed in the same Gaussian component, and the manifold geometric information of the process can be fused to construct the manifold GMM monitoring model. Furthermore, it is difficult to measure these key variables, both the yield of the production process and the product quality, directly on-line, but after the production is complete. Therefore, building a relational model between process variables and quality variables is particularly important for quality-related fault detection and diagnosis.
Recently, horses, etc. (neuron computing,2015(285)) proposed a Robust Gaussian Mixture Model (RGMM) that automatically acquires Gaussian component scores for quality-related fault detection and diagnosis. However, in each gaussian component in the multi-modal process, the similarity of the distribution of neighboring data also needs to be grasped during modeling, the manifold discrimination capability of the multi-modal is fully mined, and the local geometric structural features in the gaussian are maintained. In addition, the weight matrix in the local manifold structure plays an important role in describing the geometry of data, and conventional manifold learning generally adopts a k-nearest neighbor or epsilon-sphere method to determine a neighborhood value given parameters. These methods are sensitive to data noise and outliers.
Disclosure of Invention
The invention provides a multimode process quality-related fault diagnosis method based on sparse GMM, aiming at overcoming the defects of the prior art, the sparse GMM method provided by the invention can fully find the internal change of the multimode process, and selects important fault variables related to quality according to the controlled neighbor of the detected fault; compared with the traditional GMM monitoring method, the method can obtain higher diagnosis precision and stronger fault discrimination capability.
The invention adopts the following technical scheme for solving the technical problems:
the invention provides a multimode process quality-related fault diagnosis method based on sparse GMM, which comprises the following steps:
step A, establishing a sparse representation model between a process variable and a quality variable to obtain a sparse reconstruction weight matrix S of a sample;
b, constructing a sparse Gaussian mixture model: forming a constraint condition of the similarity of neighboring samples around Gaussian components according to a sparse coefficient in a sparse reconstruction weight matrix S obtained by a sparse representation model, adaptively selecting a neighborhood range of a training sample, obtaining the conditional probability distribution of the similarity of the Gaussian components, keeping the locality and the sparsity of a data manifold structure, and automatically identifying the number of the Gaussian components;
and step C, constructing a sparse GMM monitoring index, designing a fault detection and diagnosis index by using an output result of a sparse Gaussian mixture model and the local Mahalanobis distance of each mode, fusing the overall output and the local information of the process, and evaluating the running state of the multi-mode process.
As a further optimization scheme of the multimode process quality-related fault diagnosis method based on the sparse GMM, the step a is specifically as follows:
given N samples from a multimodal process, each sample contains a process variable x ∈ RmAnd the quality variable y ∈ R, the historical data set D is represented as:
wherein R ismRepresents an m-dimensional variable, R represents a 1-dimensional variable, [ y (1) x (1)]TRepresenting quality variable and 1 st process variable, abbreviated to d1By analogy, [ y (1) x (N)]TRepresenting quality variable and Nth process variable, abbreviated to dNThe superscript T is transposition;
sparse representation the sparse representation of the training sample is calculated by jointly optimizing an objective function through the following formula:
in the formula, L1-normal form represents the sum of absolute values of matrix elements, lambda is a regularization parameter positive value, E is a sparse error, and E is a column vector with 1 element; solving sparse reconstruction weight matrix S ═ S by convex optimization method1,s2,…,sN]∈RN×N,RN ×NRepresenting storage spaces with rows and columns of N, where sn=[sn1,sn2,…,snn-1,0,snn+1,…,snN]T,n=1,2...,N,snFor the nth sample data dnOf each element s of the sparse coefficient variablenjDenotes dnFor reconstructed j sample djThe degree of contribution of (c).
As a further optimization scheme of the multimode process quality-related fault diagnosis method based on the sparse GMM, the method specifically comprises the following steps of:
step a, constructing a Gaussian mixture model related to quality, wherein a probability density function p (d | Θ) of each sample is represented as:
where d is the sample data, C is the number of Gaussian components, ωkIs the posterior probability of the kth Gaussian component, satisfiesGauss parameter thetak={μk,Σk},μkIs a k-th class mean, ΣkIs a covariance matrix of class k, theta ═ theta1,θ2,...,θCIs the set of global Gaussian model parameters, θ1,θ2,...,θCRespectively represent the 1 st, 2 nd, … th and C th local Gaussian components; for the k-th Gaussian component, its profileRate density function g (d θ)k) Is shown as
Let Z be { Z ═ Z1,z2,…,zNDenotes missing data of a category, and zi∈{1,2,...,C},ziRepresents a category to which the ith sample belongs; if z isiK, i-th sample belongs to k-th Gaussian component, and latent variable zki1, otherwise zki0; m is the dimension of the sample, and the parameters Θ of the quality-related gaussian mixture model are solved by maximizing the following likelihood functions:
wherein L (omega, theta; D, Z) is a likelihood function, omega is a posterior probability;
b, fusing geometric structure knowledge of conditional probability distribution in a GMM target function based on the principle that similar samples in the sparse representation and manifold structures are still close to each other in an embedding space, and constructing a sparse GMM; and realizing the minimized sparse regular term for the pairwise similarity constraint on the Gaussian probability distribution, wherein the constraint term is expressed as:
wherein, γn=p(zn|dn) Denotes the nth sample dnBelong to znProbability of class, γj=p(zj|dj) Denotes the jth sample djBelong to zjThe probability of the category, H (| ·), measures the similarity of the two distributions, and by adopting KL distance measurement, after minimizing the constraint term, the data are smoothly distributed along manifold geometric distance measurement; fusing the sparse regular term with the likelihood function of the GMM to obtain an objective function J of the sparse GMM:
wherein δ >0 is a regularization parameter; solving the parameters of the sparse GMM by adopting an expected value maximization method;
step c, calculating latent variable z according to original GMM and Bayes lawki:
Wherein, ω islIs the posterior probability of the Gaussian component l, g (d)i|θk) Is a sample diIn the Gaussian component thetakDensity probability of (1), g (d)i|θl) Is a sample diIn the Gaussian component thetal(ii) a density probability of;
step d, updating model parameters:
in the formula (I), the compound is shown in the specification,is the sum of classes, Ω, of samples of the k-th Gaussian componentnk=(dn-μk)(dn-μk)TAnd is the sample covariance of the kth gaussian component.
As a further optimization scheme of the multimode process quality-related fault diagnosis method based on the sparse GMM, the steps of constructing the sparse GMM monitoring index are as follows:
step I, for each monitoring sample dtAccording to the output result of the global structure information represented by the sparse GMM, the model is used for outputting JoutMeasuring the degree of deviation of the monitoring sample from the controlled training baseline, and measuring the degree of deviation J of the monitoring sample from the controlled training baselineoutExpressed as:
Jout=-ln p(dt|Θ) (12)
in the formula, p (d)t| Θ) represents a monitoring sample dtBased on the probability density of the training model;
counting the probability that the output value of the monitoring sample is smaller than the output value of the training sample, and designing LotIndex quantization process state:
Lot(dt)=Pr(Jout(dt)≤Jout(dtrain)) (13)
in the formula, Jout(dtrain) Represents the output of the training set, Jout(dt) Represents the output of the test sample, Lot(dt) Is a global information index; pr (J)out(dt)≤Jout(dtrain) Represents the probability that the test sample output is lower than the training set output;
mahalanobis distance is used to measure quality-related fault detection within a single modality, calculating a monitoring sample dtTo each modality CkThe mahalanobis distance MD of (a):
By fusing the global information index Lot(dt) And local information index MD(k)(dt|dtE k), designing a synthetic fault detection index GL, reducing monitoring cost and operation load, wherein GL is expressed as:
given control limit ηαWhen GL of the monitoring sample exceeds the control limit, detecting a fault related to the quality, otherwise, indicating that no fault occurs;
secondly, after detecting the quality-related fault, further determining a root variable of the fault; defining the contribution degree of the process variable as the distance between a fault sample and a controlled neighbor, obtaining the direction of the fault deviating from normal operation by solving the virtual controlled neighbor of the fault, and determining a root variable generated by the fault;
the optimization function that defines the controlled vicinity of the fault is:
wherein d isfaultIs a detected fault sample, dnicnThe method is characterized in that the method is a virtual controlled neighbor sample of a fault, the sample is randomly selected from training samples to serve as initialization input of a nonlinear programming technology, and the controlled neighbor sample is solved, so that the fault sample is further expressed as:
dfault=dnicn+f (17)
in the formula, f represents an estimated fault vector and is used for evaluating the minimum value of the fault sample adjusted to a normal state; the degree of contribution of each variable is defined as dfaultAnd dnicnIn euclidean distance CON betweenq:
In the formula (I), the compound is shown in the specification,and fqRespectively represents dnicn,dfaultAnd the qth element of f;
redefining the degree of contribution of a variable CONMD using Mahalanobis distancesq:
In the formula, xiqIs the unit vector with the qth element being 1, will have the maximum CONMDqIsolation of variable sets of values, setting threshold τ as:
if the variable of the fault exceeds this threshold τ, it is determined that the change in the variable caused a process anomaly, triggering the fault.
As a further optimization scheme of the multimode process quality-related fault diagnosis method based on the sparse GMM, α is 0.05.
Compared with the prior art, the invention adopting the technical scheme has the following technical effects: the method utilizes the weight matrix of the sparse representation manifold structure to keep the local geometric structure characteristics of the data manifold in the Gaussian components, automatically obtains the quantity of the Gaussian components, and has robustness on noise and outliers in the process; the method is based on GMM, combines sparsity and locality of manifold geometry structure, constructs sparse GMM, captures process variable characteristics related to quality variables, reflects state change conditions of the multimode process, and identifies root cause variables of faults according to the detected controlled neighbors of the faults; the method is suitable for quality-related fault detection and diagnosis in the multimode process.
Drawings
FIG. 1 is a flow chart of an embodiment of the method of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in detail with reference to the accompanying drawings and specific embodiments.
The invention adopts a sparse representation method without parameter robustness to construct an intrinsic matrix of a manifold structure and automatically decide the neighborhood range of a sample.
In view of the advantages of the sparse representation and quality-related GMM model in process monitoring, the multimode process quality-related fault diagnosis method based on the sparse GMM is provided, the local similarity and sparsity among samples on the manifold geometry of Gaussian components are kept, the learning performance of the GMM model is enhanced, and the diagnosis capability of the model is improved.
On the basis of a Gaussian mixture model, aiming at potential complex structural characteristics of a multi-mode process related to quality, a weight matrix of a manifold structure is represented in a parameter-free robust mode by sparse representation, a sparse Gaussian mixture model is constructed on the basis of similarity between manifold data samples on Gaussian components, and local features and sparse features of the multi-mode process data are kept. The proposed sparse GMM method can fully discover the intrinsic changes of the multimode process, and select important fault variables related to quality according to the controlled neighbors of the detected faults. Compared with the traditional GMM monitoring method, the method can obtain higher diagnosis precision and stronger fault discrimination capability.
The multimode process quality-related fault diagnosis method based on the sparse GMM utilizes the sparse representation to represent the geometric information distribution condition of Gaussian components, constructs the sparse GMM, analyzes the data distribution characteristics of the multimode process, extracts the process variable characteristics related to the quality, captures the internal change of the multimode process, analyzes the root variable of the fault occurrence and enhances the fault diagnosis capability of the sparse GMM method.
As shown in fig. 1, the present invention relates to a sparse GMM-based multimode process quality-related fault diagnosis method, which comprises the following specific implementation steps:
(1) and establishing a sparse representation model between the process variable and the quality variable to obtain a sparse reconstruction weight matrix S of the sample.
Given N samples from a multimodal process, each sample contains a process variable x ∈RmAnd the quality variable y ∈ R, the historical data set D is represented as:
wherein R ismRepresents an m-dimensional variable, R represents a 1-dimensional variable, [ y (1) x (1)]TRepresenting quality variable and 1 st process variable, abbreviated to d1By analogy, [ y (1) x (N)]TRepresenting quality variable and Nth process variable, abbreviated to dNThe superscript T is transposition;
sparse representation the sparse representation of the training sample is calculated by jointly optimizing an objective function through the following formula:
in the formula, L1-normal form represents the sum of absolute values of matrix elements, lambda is a regularization parameter positive value, E is a sparse error, and E is a column vector with 1 element; solving sparse reconstruction weight matrix S ═ S by convex optimization method1,s2,…,sN]∈RN×N,RN ×NRepresenting storage spaces with rows and columns of N, where sn=[sn1,sn2,…,snn-1,0,snn+1,…,snN]T,n=1,2...,N,snFor the nth sample data dnOf each element s of the sparse coefficient variablenjDenotes dnFor reconstructed j sample djThe degree of contribution of (c).
(2) And constructing a sparse Gaussian mixture model. By increasing the constraint condition of the similarity of the neighbor samples around the Gaussian components, the neighborhood range of the training samples is selected in a self-adaptive manner, the conditional probability distribution of the similarity of the Gaussian components is obtained, the locality and the sparsity of a data manifold structure are maintained, the number of the Gaussian components is automatically identified, and the robustness is provided for data noise and outliers. The substeps of solving the sparse gaussian mixture model are as follows:
step a, constructing a Gaussian mixture model related to quality, wherein a probability density function p (d | Θ) of each sample is represented as:
where d is the sample data, C is the number of Gaussian components, ωkIs the posterior probability of the kth Gaussian component, satisfiesGauss parameter thetak={μk,Σk},μkIs a k-th class mean, ΣkIs a covariance matrix of class k, theta ═ theta1,θ2,...,θCIs the set of global Gaussian model parameters, θ1,θ2,...,θCRespectively represent the 1 st, 2 nd, … th and C th local Gaussian components; for the kth Gaussian component, its probability density function g (d θ)k) Is shown as
Let Z be { Z ═ Z1,z2,…,zNDenotes missing data of a category, and zi∈{1,2,...,C},ziRepresents a category to which the ith sample belongs; if z isiK, i-th sample belongs to k-th Gaussian component, and latent variable zki1, otherwise zki0; m is the dimension of the sample, and the parameters Θ of the quality-related gaussian mixture model are solved by maximizing the following likelihood functions:
wherein L (omega, theta; D, Z) is a likelihood function, omega is a posterior probability;
b, fusing geometric structure knowledge of conditional probability distribution in a GMM target function based on the principle that similar samples in the sparse representation and manifold structures are still close to each other in an embedding space, and constructing a sparse GMM; and realizing the minimized sparse regular term for the pairwise similarity constraint on the Gaussian probability distribution, wherein the constraint term is expressed as:
wherein, γn=p(zn|dn) Denotes the nth sample dnBelong to znProbability of class, γj=p(zj|dj) Denotes the jth sample djBelong to zjThe probability of the category, H (| ·), measures the similarity of the two distributions, and by adopting KL distance measurement, after minimizing the constraint term, the data are smoothly distributed along manifold geometric distance measurement; fusing the sparse regular term with the likelihood function of the GMM to obtain an objective function J of the sparse GMM:
wherein δ >0 is a regularization parameter; solving the parameters of the sparse GMM by adopting an expected value maximization method;
step c, calculating latent variable z according to original GMM and Bayes lawki:
Wherein, ω islIs the posterior probability of the Gaussian component l, g (d)i|θk) Is a sample diIn the Gaussian component thetakDensity probability of (1), g (d)i|θl) Is a sample diIn the Gaussian component thetal(ii) a density probability of;
step d, updating model parameters:
in the formula (I), the compound is shown in the specification,is the sum of classes, Ω, of samples of the k-th Gaussian componentnk=(dn-μk)(dn-μk)TAnd is the sample covariance of the kth gaussian component.
(3) Constructing a sparse GMM monitoring index, designing a reasonable fault detection and diagnosis index by using the output result of the whole model and the local Mahalanobis distance of each mode, fusing the global output and the local information of the process, and evaluating the running state of the multi-mode process. The substep of solving the sparse GMM monitoring index is represented as:
step I, for each monitoring sample dtAccording to the output result of the global structure information represented by the sparse GMM, the model is used for outputting JoutMeasuring the degree of deviation of the monitoring sample from the controlled training baseline, and measuring the degree of deviation J of the monitoring sample from the controlled training baselineoutExpressed as:
Jout=-ln p(dt|Θ) (12)
in the formula, p (d)t| Θ) represents a monitoring sample dtBased on the probability density of the training model;
counting the probability that the output value of the monitoring sample is smaller than the output value of the training sample, and designing LotIndex quantization process state:
Lot(dt)=Pr(Jout(dt)≤Jout(dtrain)) (13)
in the formula, Jout(dtrain) Represents the output of the training set, Jout(dt) Presentation measurementOutput of the test sample, Lot(dt) Is a global information index; pr (J)out(dt)≤Jout(dtrain) Represents the probability that the test sample output is lower than the training set output;
mahalanobis distance is used to measure quality-related fault detection within a single modality, calculating a monitoring sample dtTo each modality CkThe mahalanobis distance MD of (a):
By fusing global information indicators Lot(dt) And local information index MD(k)(dt|dtE k), designing a synthetic fault detection index GL, reducing monitoring cost and operation load, wherein GL is expressed as:
given control limit ηα(α ═ 0.05), when GL of the monitored sample exceeds the control limit, a fault related to quality is detected, otherwise, no fault occurs;
secondly, after detecting the quality-related fault, further determining a root variable of the fault; defining the contribution degree of the process variable as the distance between a fault sample and a controlled neighbor, obtaining the direction of the fault deviating from normal operation by solving the virtual controlled neighbor of the fault, and determining a root variable generated by the fault;
the optimization function that defines the controlled vicinity of the fault is:
wherein d isfaultIs a detected fault sample, dnicnThe method is characterized in that the method is a virtual controlled neighbor sample of a fault, the sample is randomly selected from training samples to serve as initialization input of a nonlinear programming technology, and the controlled neighbor sample is solved, so that the fault sample is further expressed as:
dfault=dnicn+f (17)
in the formula, f represents an estimated fault vector and is used for evaluating the minimum value of the fault sample adjusted to a normal state; the degree of contribution of each variable is defined as dfaultAnd dnicnIn euclidean distance CON betweenq:
In the formula (I), the compound is shown in the specification,and fqRespectively represents dnicn,dfaultAnd the qth element of f;
redefining the degree of contribution of a variable CONMD using Mahalanobis distancesq:
In the formula, xiqIs the unit vector with the qth element being 1, will have the maximum CONMDqIsolation of variable sets of values, setting threshold τ as:
if the variable of the fault exceeds this threshold τ, it is determined that the change in the variable caused a process anomaly, triggering the fault.
The method utilizes the weight matrix of the sparse representation manifold structure to keep the local geometric structure characteristics of the data manifold in the Gaussian components, automatically obtains the number of the Gaussian components, and has robustness to noise and outliers in the process. The method is based on GMM, combines sparsity and locality of manifold geometry structure, constructs sparse GMM, captures process variable characteristics related to quality variables, reflects state change conditions of the multi-mode process, and identifies root cause variables of faults according to the detected controlled neighbors of the faults. Therefore, the method is suitable for quality-related fault detection and diagnosis in a multimode process.
The above description is only for the specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention.
Claims (5)
1. A multimode process quality related fault diagnosis method based on sparse GMM is characterized by comprising the following steps:
step A, establishing a sparse representation model between a process variable and a quality variable to obtain a sparse reconstruction weight matrix S of a sample;
b, constructing a sparse Gaussian mixture model: forming a constraint condition of the similarity of neighboring samples around Gaussian components according to a sparse coefficient in a sparse reconstruction weight matrix S obtained by a sparse representation model, adaptively selecting a neighborhood range of a training sample, obtaining the conditional probability distribution of the similarity of the Gaussian components, keeping the locality and the sparsity of a data manifold structure, and automatically identifying the number of the Gaussian components;
and step C, constructing a sparse GMM monitoring index, designing a fault detection and diagnosis index by using an output result of a sparse Gaussian mixture model and the local Mahalanobis distance of each mode, fusing the overall output and the local information of the process, and evaluating the running state of the multi-mode process.
2. The sparse GMM-based multimode process quality-related fault diagnosis method according to claim 1, wherein the step A specifically comprises the following steps:
given N samples from a multimodal process, each sample contains a process variable x ∈ RmAnd the quality variable y ∈ R, the historical data set D is represented as:
wherein R ismRepresents an m-dimensional variable, R represents a 1-dimensional variable, [ y (1) x (1)]TRepresenting quality variable and 1 st process variable, abbreviated to d1By analogy, [ y (1) x (N)]TRepresenting quality variable and Nth process variable, abbreviated to dNThe superscript T is transposition;
sparse representation the sparse representation of the training sample is calculated by jointly optimizing an objective function through the following formula:
in the formula, L1-normal form represents the sum of absolute values of matrix elements, lambda is a regularization parameter positive value, E is a sparse error, and E is a column vector with 1 element; solving sparse reconstruction weight matrix S ═ S by convex optimization method1,s2,…,sN]∈RN×N,RN×NRepresenting storage spaces with rows and columns of N, where sn=[sn1,sn2,…,snn-1,0,snn+1,…,snN]T,n=1,2...,N,snFor the nth sample data dnOf each element s of the sparse coefficient variablenjDenotes dnFor reconstructed j sample djThe degree of contribution of (c).
3. The sparse GMM-based multimode process quality-related fault diagnosis method according to claim 2, wherein the sparse Gaussian mixture model is constructed by the following specific steps:
step a, constructing a Gaussian mixture model related to quality, wherein a probability density function p (d | Θ) of each sample is represented as:
where d is the sample data, C is the number of Gaussian components, ωkIs the posterior probability of the kth Gaussian component, satisfiesGauss parameter thetak={μk,∑k},μkIs a class k mean, ΣkIs a covariance matrix of class k, theta ═ theta1,θ2,...,θCIs the set of global Gaussian model parameters, θ1,θ2,...,θCRespectively represent the 1 st, 2 nd, … th and C th local Gaussian components; for the kth Gaussian component, its probability density function g (d | θ)k) Is shown as
Let Z be { Z ═ Z1,z2,…,zNDenotes missing data of a category, and zi∈{1,2,...,C},ziRepresents a category to which the ith sample belongs; if z isiK, i-th sample belongs to k-th Gaussian component, and latent variable zki1, otherwise zki0; m is the dimension of the sample, and the parameters Θ of the quality-related gaussian mixture model are solved by maximizing the following likelihood functions:
wherein L (omega, theta; D, Z) is a likelihood function, omega is a posterior probability;
b, fusing geometric structure knowledge of conditional probability distribution in a GMM target function based on the principle that similar samples in the sparse representation and manifold structures are still close to each other in an embedding space, and constructing a sparse GMM; and realizing the minimized sparse regular term for the pairwise similarity constraint on the Gaussian probability distribution, wherein the constraint term is expressed as:
wherein, γn=p(zn|dn) Denotes the nth sample dnBelong to znProbability of class, γj=p(zj|dj) Denotes the jth sample djBelong to zjThe probability of the category, H (| ·), measures the similarity of the two distributions, and by adopting KL distance measurement, after minimizing the constraint term, the data are smoothly distributed along manifold geometric distance measurement; fusing the sparse regular term with the likelihood function of the GMM to obtain an objective function J of the sparse GMM:
wherein δ >0 is a regularization parameter; solving the parameters of the sparse GMM by adopting an expected value maximization method;
step c, calculating latent variable z according to original GMM and Bayes lawki:
Wherein, ω islIs the posterior probability of the Gaussian component l, g (d)i|θk) Is a sample diIn the Gaussian component thetakDensity probability of (1), g (d)i|θl) Is a sample diIn the Gaussian component thetal(ii) a density probability of;
step d, updating model parameters:
4. The sparse GMM-based multimode process quality-related fault diagnosis method according to claim 3, wherein the step of constructing the sparse GMM monitoring index is as follows:
step I, for each monitoring sample dtAccording to the output result of the global structure information represented by the sparse GMM, the model is used for outputting JoutMeasuring the degree of deviation of the monitoring sample from the controlled training baseline, and measuring the degree of deviation J of the monitoring sample from the controlled training baselineoutExpressed as:
Jout=-ln p(dt|Θ) (12)
in the formula, p (d)t| Θ) represents a monitoring sample dtBased on the probability density of the training model;
counting the probability that the output value of the monitoring sample is smaller than the output value of the training sample, and designing LotIndex quantization process state:
Lot(dt)=Pr(Jout(dt)≤Jout(dtrain)) (13)
in the formula, Jout(dtrain) Represents the output of the training set, Jout(dt) Represents the output of the test sample, Lot(dt) Is a global information index; pr (J)out(dt)≤Jout(dtrain) Represents the probability that the test sample output is lower than the training set output;
mahalanobis distance is used to measure quality-related fault detection within a single modality, calculating a monitoring sample dtTo each modality CkThe mahalanobis distance MD of (a):
By fusing the global information index Lot(dt) And local information index MD(k)(dt|dtE k), designing a synthetic fault detection index GL, reducing monitoring cost and operation load, wherein GL is expressed as:
given control limit ηαWhen GL of the monitoring sample exceeds the control limit, detecting a fault related to the quality, otherwise, indicating that no fault occurs;
secondly, after detecting the quality-related fault, further determining a root variable of the fault; defining the contribution degree of the process variable as the distance between a fault sample and a controlled neighbor, obtaining the direction of the fault deviating from normal operation by solving the virtual controlled neighbor of the fault, and determining a root variable generated by the fault;
the optimization function that defines the controlled vicinity of the fault is:
wherein d isfaultIs a detected fault sample, dnicnThe method is characterized in that the method is a virtual controlled neighbor sample of a fault, the sample is randomly selected from training samples to serve as initialization input of a nonlinear programming technology, and the controlled neighbor sample is solved, so that the fault sample is further expressed as:
dfault=dnicn+f (17)
in the formula, f represents an estimated fault vector and is used for evaluating the minimum value of the fault sample adjusted to a normal state; the degree of contribution of each variable is defined as dfaultAnd dnicnIn euclidean distance CON betweenq:
In the formula (I), the compound is shown in the specification,and fqRespectively represents dnicn,dfaultAnd the qth element of f;
redefining the degree of contribution of a variable CONMD using Mahalanobis distancesq:
In the formula, xiqIs the unit vector with the qth element being 1, will have the maximum CONMDqIsolation of variable sets of values, setting threshold τ as:
if the variable of the fault exceeds this threshold τ, it is determined that the change in the variable caused a process anomaly, triggering the fault.
5. The sparse GMM-based multimode process quality-related fault diagnosis method according to claim 4, wherein α is 0.05.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811331420.2A CN109491338B (en) | 2018-11-09 | 2018-11-09 | Multimode process quality-related fault diagnosis method based on sparse GMM |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811331420.2A CN109491338B (en) | 2018-11-09 | 2018-11-09 | Multimode process quality-related fault diagnosis method based on sparse GMM |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109491338A CN109491338A (en) | 2019-03-19 |
CN109491338B true CN109491338B (en) | 2021-05-11 |
Family
ID=65694254
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811331420.2A Active CN109491338B (en) | 2018-11-09 | 2018-11-09 | Multimode process quality-related fault diagnosis method based on sparse GMM |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109491338B (en) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110135492B (en) * | 2019-05-13 | 2020-12-22 | 山东大学 | Equipment fault diagnosis and abnormality detection method and system based on multiple Gaussian models |
CN110308713A (en) * | 2019-06-03 | 2019-10-08 | 湖州师范学院 | A kind of industrial process failure identification variables method based on k neighbour reconstruct |
CN111880090B (en) * | 2019-06-28 | 2021-07-06 | 浙江大学 | Online fault detection method for million-kilowatt ultra-supercritical unit |
CN111913461B (en) * | 2019-09-07 | 2022-03-18 | 宁波大学 | Distributed chemical process monitoring method based on regularization GCCA model |
CN113162787B (en) * | 2020-01-23 | 2023-09-29 | 华为技术有限公司 | Method for fault location in a telecommunication network, node classification method and related devices |
CN111291319B (en) * | 2020-03-24 | 2023-07-07 | 广东海洋大学深圳研究院 | Mobile robot state estimation method applied to non-Gaussian noise environment |
CN112068065B (en) * | 2020-10-12 | 2022-06-21 | 武汉格蓝若智能技术有限公司 | Voltage transformer state early warning method and device and storage medium |
CN113314142B (en) * | 2021-05-26 | 2022-04-01 | 山东大学 | Engine fault detection method, system, storage medium and equipment |
CN115310561B (en) * | 2022-09-29 | 2022-12-20 | 中国空气动力研究与发展中心设备设计与测试技术研究所 | Electromagnetic valve fault monitoring method based on integrated instant learning |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103456302B (en) * | 2013-09-02 | 2016-04-20 | 浙江大学 | A kind of emotional speaker recognition method based on the synthesis of emotion GMM Model Weight |
CN103631145B (en) * | 2013-12-11 | 2016-08-17 | 清华大学 | Multiple operating modes process monitoring method and system based on monitor control index switching |
US10553218B2 (en) * | 2016-09-19 | 2020-02-04 | Pindrop Security, Inc. | Dimensionality reduction of baum-welch statistics for speaker recognition |
CN106600557B (en) * | 2016-12-19 | 2019-09-17 | 辽宁工程技术大学 | PSF estimation method based on mixed Gauss model and sparse constraint |
-
2018
- 2018-11-09 CN CN201811331420.2A patent/CN109491338B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN109491338A (en) | 2019-03-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109491338B (en) | Multimode process quality-related fault diagnosis method based on sparse GMM | |
Tang et al. | Convolutional neural network‐based data anomaly detection method using multiple information for structural health monitoring | |
CN109766930B (en) | Method for predicting residual life of mine mechanical equipment based on DCNN model | |
US12001949B2 (en) | Computer-implemented method, computer program product and system for data analysis | |
EP3620983B1 (en) | Computer-implemented method, computer program product and system for data analysis | |
CN112580263B (en) | Turbofan engine residual service life prediction method based on space-time feature fusion | |
CN110940523B (en) | Unsupervised domain adaptive fault diagnosis method | |
Wang et al. | Random convolutional neural network structure: An intelligent health monitoring scheme for diesel engines | |
Yu et al. | Challenges and opportunities of deep learning-based process fault detection and diagnosis: a review | |
Cheng et al. | Monitoring influent conditions of wastewater treatment plants by nonlinear data-based techniques | |
CN111353373A (en) | Correlation alignment domain adaptive fault diagnosis method | |
CN111665819A (en) | Deep learning multi-model fusion-based complex chemical process fault diagnosis method | |
CN112557034B (en) | Bearing fault diagnosis method based on PCA _ CNNS | |
CN111768000A (en) | Industrial process data modeling method for online adaptive fine-tuning deep learning | |
CN113221441B (en) | Method and device for health assessment of power plant equipment | |
Pantazi et al. | Intelligent data mining and fusion systems in agriculture | |
Peng et al. | Towards robust and understandable fault detection and diagnosis using denoising sparse autoencoder and smooth integrated gradients | |
CN110222387B (en) | Multi-element drilling time sequence prediction method based on mixed leaky integration CRJ network | |
Yu et al. | Control chart recognition based on the parallel model of CNN and LSTM with GA optimization | |
Gao et al. | A process fault diagnosis method using multi‐time scale dynamic feature extraction based on convolutional neural network | |
Yu et al. | Wafer map defect recognition based on deep transfer learning-based densely connected convolutional network and deep forest | |
CN115062272A (en) | Water quality monitoring data abnormity identification and early warning method | |
CN116701992A (en) | Multi-mode anomaly detection method based on graph attention network and time convolution network | |
Dong et al. | Quality monitoring and root cause diagnosis for industrial processes based on Lasso-SAE-CCA | |
Wang et al. | An intelligent process fault diagnosis system based on Andrews plot and convolutional neural network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |