CN109491338B - Multimode process quality-related fault diagnosis method based on sparse GMM - Google Patents

Multimode process quality-related fault diagnosis method based on sparse GMM Download PDF

Info

Publication number
CN109491338B
CN109491338B CN201811331420.2A CN201811331420A CN109491338B CN 109491338 B CN109491338 B CN 109491338B CN 201811331420 A CN201811331420 A CN 201811331420A CN 109491338 B CN109491338 B CN 109491338B
Authority
CN
China
Prior art keywords
sparse
sample
fault
variable
gmm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811331420.2A
Other languages
Chinese (zh)
Other versions
CN109491338A (en
Inventor
卢春红
王杰华
商亮亮
陈晓红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nantong University
Original Assignee
Nantong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nantong University filed Critical Nantong University
Priority to CN201811331420.2A priority Critical patent/CN109491338B/en
Publication of CN109491338A publication Critical patent/CN109491338A/en
Application granted granted Critical
Publication of CN109491338B publication Critical patent/CN109491338B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/418Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM]
    • G05B19/41875Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM] characterised by quality surveillance of production
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems
    • G05B2219/32Operator till task planning
    • G05B2219/32015Optimize, process management, optimize production line
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Manufacturing & Machinery (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Complex Calculations (AREA)
  • Testing And Monitoring For Control Systems (AREA)

Abstract

The invention discloses a multimode process quality related fault diagnosis method based on sparse GMM, which is characterized in that a high-quality coefficient weight matrix is obtained by utilizing sparse representation, manifold structure information is fused, a sparse Gaussian mixture model is constructed, probability distribution of Gaussian components smoothly changes along a data manifold structure, local neighbor samples of the Gaussian components are similar, the number of the Gaussian components is automatically obtained, robustness on noise and outliers is achieved, quality related fault detection is obtained, and meanwhile, a root-cause variable of fault occurrence is located according to controlled neighbor of detected faults. Compared with a Gaussian mixture model monitoring method, the method disclosed by the invention represents the sparse relation between the local manifold structure of the process data and the data, obtains the local similarity relation between samples, and reflects the change condition of the multi-modal process. Therefore, the sparse GMM method can obtain better fault detection effect and accurately position the root variable of the fault.

Description

Multimode process quality-related fault diagnosis method based on sparse GMM
Technical Field
The invention relates to the technical field of industrial process monitoring, in particular to a multimode process quality-related fault diagnosis method based on sparse GMM.
Background
The process monitoring in modern industry plays a key role in ensuring production safety, improving yield and the like. With the development of distributed control systems, the production scale and the operational complexity increase dramatically, and a large amount of high-dimensional data is collected in the process. Moreover, as the grade and yield of the produced products can be continuously adjusted according to market demands and seasonal effects, the process parameters such as product components, process set values, feed proportions and the like can fluctuate, and modern industrial processes can be switched among a plurality of different operation modes. These random variations in the production process cause the process data to exhibit non-linear, multi-modal, etc. characteristics. Although the data-driven Multivariate Statistical Process Control (MSPC) method is successfully applied to Process monitoring, the mean value and the covariance of multimode nonlinear data are significantly changed, and the traditional MSPC method ignores the nonlinear and multimode relations between different Process variables, which may cause the degradation of the monitoring result. Moreover, in the actual production process, the yield and the product quality are usually difficult to measure directly on-line, and measurement is required after the production is completed. Therefore, building a relational model between product variables and quality variables is particularly important for quality-related multi-modal process monitoring. A Gaussian Mixture Model (GMM) is used for multi-modal process monitoring, a series of Gaussian components are used for estimating complex data distribution in the multi-modal process, and statistical indexes based on Mahalanobis distance and likelihood probability are constructed to implement process monitoring.
However, GMM assumes that each single mode of a multi-modal process is Gaussian distributed, and it is possible for actual process data to be centrally distributed over a low-dimensional sub-manifold structure. Therefore, the neighborhood samples of each manifold are probably distributed in the same Gaussian component, and the manifold geometric information of the process can be fused to construct the manifold GMM monitoring model. Furthermore, it is difficult to measure these key variables, both the yield of the production process and the product quality, directly on-line, but after the production is complete. Therefore, building a relational model between process variables and quality variables is particularly important for quality-related fault detection and diagnosis.
Recently, horses, etc. (neuron computing,2015(285)) proposed a Robust Gaussian Mixture Model (RGMM) that automatically acquires Gaussian component scores for quality-related fault detection and diagnosis. However, in each gaussian component in the multi-modal process, the similarity of the distribution of neighboring data also needs to be grasped during modeling, the manifold discrimination capability of the multi-modal is fully mined, and the local geometric structural features in the gaussian are maintained. In addition, the weight matrix in the local manifold structure plays an important role in describing the geometry of data, and conventional manifold learning generally adopts a k-nearest neighbor or epsilon-sphere method to determine a neighborhood value given parameters. These methods are sensitive to data noise and outliers.
Disclosure of Invention
The invention provides a multimode process quality-related fault diagnosis method based on sparse GMM, aiming at overcoming the defects of the prior art, the sparse GMM method provided by the invention can fully find the internal change of the multimode process, and selects important fault variables related to quality according to the controlled neighbor of the detected fault; compared with the traditional GMM monitoring method, the method can obtain higher diagnosis precision and stronger fault discrimination capability.
The invention adopts the following technical scheme for solving the technical problems:
the invention provides a multimode process quality-related fault diagnosis method based on sparse GMM, which comprises the following steps:
step A, establishing a sparse representation model between a process variable and a quality variable to obtain a sparse reconstruction weight matrix S of a sample;
b, constructing a sparse Gaussian mixture model: forming a constraint condition of the similarity of neighboring samples around Gaussian components according to a sparse coefficient in a sparse reconstruction weight matrix S obtained by a sparse representation model, adaptively selecting a neighborhood range of a training sample, obtaining the conditional probability distribution of the similarity of the Gaussian components, keeping the locality and the sparsity of a data manifold structure, and automatically identifying the number of the Gaussian components;
and step C, constructing a sparse GMM monitoring index, designing a fault detection and diagnosis index by using an output result of a sparse Gaussian mixture model and the local Mahalanobis distance of each mode, fusing the overall output and the local information of the process, and evaluating the running state of the multi-mode process.
As a further optimization scheme of the multimode process quality-related fault diagnosis method based on the sparse GMM, the step a is specifically as follows:
given N samples from a multimodal process, each sample contains a process variable x ∈ RmAnd the quality variable y ∈ R, the historical data set D is represented as:
Figure BDA0001860104610000021
wherein R ismRepresents an m-dimensional variable, R represents a 1-dimensional variable, [ y (1) x (1)]TRepresenting quality variable and 1 st process variable, abbreviated to d1By analogy, [ y (1) x (N)]TRepresenting quality variable and Nth process variable, abbreviated to dNThe superscript T is transposition;
sparse representation the sparse representation of the training sample is calculated by jointly optimizing an objective function through the following formula:
Figure BDA0001860104610000035
in the formula, L1-normal form represents the sum of absolute values of matrix elements, lambda is a regularization parameter positive value, E is a sparse error, and E is a column vector with 1 element; solving sparse reconstruction weight matrix S ═ S by convex optimization method1,s2,…,sN]∈RN×N,RN ×NRepresenting storage spaces with rows and columns of N, where sn=[sn1,sn2,…,snn-1,0,snn+1,…,snN]T,n=1,2...,N,snFor the nth sample data dnOf each element s of the sparse coefficient variablenjDenotes dnFor reconstructed j sample djThe degree of contribution of (c).
As a further optimization scheme of the multimode process quality-related fault diagnosis method based on the sparse GMM, the method specifically comprises the following steps of:
step a, constructing a Gaussian mixture model related to quality, wherein a probability density function p (d | Θ) of each sample is represented as:
Figure BDA0001860104610000031
where d is the sample data, C is the number of Gaussian components, ωkIs the posterior probability of the kth Gaussian component, satisfies
Figure BDA0001860104610000032
Gauss parameter thetak={μkk},μkIs a k-th class mean, ΣkIs a covariance matrix of class k, theta ═ theta12,...,θCIs the set of global Gaussian model parameters, θ12,...,θCRespectively represent the 1 st, 2 nd, … th and C th local Gaussian components; for the k-th Gaussian component, its profileRate density function g (d θ)k) Is shown as
Figure BDA0001860104610000033
Let Z be { Z ═ Z1,z2,…,zNDenotes missing data of a category, and zi∈{1,2,...,C},ziRepresents a category to which the ith sample belongs; if z isiK, i-th sample belongs to k-th Gaussian component, and latent variable zki1, otherwise zki0; m is the dimension of the sample, and the parameters Θ of the quality-related gaussian mixture model are solved by maximizing the following likelihood functions:
Figure BDA0001860104610000034
wherein L (omega, theta; D, Z) is a likelihood function, omega is a posterior probability;
b, fusing geometric structure knowledge of conditional probability distribution in a GMM target function based on the principle that similar samples in the sparse representation and manifold structures are still close to each other in an embedding space, and constructing a sparse GMM; and realizing the minimized sparse regular term for the pairwise similarity constraint on the Gaussian probability distribution, wherein the constraint term is expressed as:
Figure BDA0001860104610000041
wherein, γn=p(zn|dn) Denotes the nth sample dnBelong to znProbability of class, γj=p(zj|dj) Denotes the jth sample djBelong to zjThe probability of the category, H (| ·), measures the similarity of the two distributions, and by adopting KL distance measurement, after minimizing the constraint term, the data are smoothly distributed along manifold geometric distance measurement; fusing the sparse regular term with the likelihood function of the GMM to obtain an objective function J of the sparse GMM:
Figure BDA0001860104610000042
wherein δ >0 is a regularization parameter; solving the parameters of the sparse GMM by adopting an expected value maximization method;
step c, calculating latent variable z according to original GMM and Bayes lawki
Figure BDA0001860104610000043
Wherein, ω islIs the posterior probability of the Gaussian component l, g (d)ik) Is a sample diIn the Gaussian component thetakDensity probability of (1), g (d)il) Is a sample diIn the Gaussian component thetal(ii) a density probability of;
step d, updating model parameters:
Figure BDA0001860104610000044
Figure BDA0001860104610000045
Figure BDA0001860104610000046
in the formula (I), the compound is shown in the specification,
Figure BDA0001860104610000047
is the sum of classes, Ω, of samples of the k-th Gaussian componentnk=(dnk)(dnk)TAnd is the sample covariance of the kth gaussian component.
As a further optimization scheme of the multimode process quality-related fault diagnosis method based on the sparse GMM, the steps of constructing the sparse GMM monitoring index are as follows:
step I, for each monitoring sample dtAccording to the output result of the global structure information represented by the sparse GMM, the model is used for outputting JoutMeasuring the degree of deviation of the monitoring sample from the controlled training baseline, and measuring the degree of deviation J of the monitoring sample from the controlled training baselineoutExpressed as:
Jout=-ln p(dt|Θ) (12)
in the formula, p (d)t| Θ) represents a monitoring sample dtBased on the probability density of the training model;
counting the probability that the output value of the monitoring sample is smaller than the output value of the training sample, and designing LotIndex quantization process state:
Lot(dt)=Pr(Jout(dt)≤Jout(dtrain)) (13)
in the formula, Jout(dtrain) Represents the output of the training set, Jout(dt) Represents the output of the test sample, Lot(dt) Is a global information index; pr (J)out(dt)≤Jout(dtrain) Represents the probability that the test sample output is lower than the training set output;
mahalanobis distance is used to measure quality-related fault detection within a single modality, calculating a monitoring sample dtTo each modality CkThe mahalanobis distance MD of (a):
Figure BDA0001860104610000051
mahalanobis distance follows the approximated χ2Is distributed, i.e.
Figure BDA0001860104610000052
And is
Figure BDA0001860104610000053
By fusing the global information index Lot(dt) And local information index MD(k)(dt|dtE k), designing a synthetic fault detection index GL, reducing monitoring cost and operation load, wherein GL is expressed as:
Figure BDA0001860104610000054
given control limit ηαWhen GL of the monitoring sample exceeds the control limit, detecting a fault related to the quality, otherwise, indicating that no fault occurs;
secondly, after detecting the quality-related fault, further determining a root variable of the fault; defining the contribution degree of the process variable as the distance between a fault sample and a controlled neighbor, obtaining the direction of the fault deviating from normal operation by solving the virtual controlled neighbor of the fault, and determining a root variable generated by the fault;
the optimization function that defines the controlled vicinity of the fault is:
Figure BDA0001860104610000055
wherein d isfaultIs a detected fault sample, dnicnThe method is characterized in that the method is a virtual controlled neighbor sample of a fault, the sample is randomly selected from training samples to serve as initialization input of a nonlinear programming technology, and the controlled neighbor sample is solved, so that the fault sample is further expressed as:
dfault=dnicn+f (17)
in the formula, f represents an estimated fault vector and is used for evaluating the minimum value of the fault sample adjusted to a normal state; the degree of contribution of each variable is defined as dfaultAnd dnicnIn euclidean distance CON betweenq
Figure BDA0001860104610000061
In the formula (I), the compound is shown in the specification,
Figure BDA0001860104610000062
and fqRespectively represents dnicn,dfaultAnd the qth element of f;
redefining the degree of contribution of a variable CONMD using Mahalanobis distancesq
Figure BDA0001860104610000063
In the formula, xiqIs the unit vector with the qth element being 1, will have the maximum CONMDqIsolation of variable sets of values, setting threshold τ as:
Figure BDA0001860104610000064
if the variable of the fault exceeds this threshold τ, it is determined that the change in the variable caused a process anomaly, triggering the fault.
As a further optimization scheme of the multimode process quality-related fault diagnosis method based on the sparse GMM, α is 0.05.
Compared with the prior art, the invention adopting the technical scheme has the following technical effects: the method utilizes the weight matrix of the sparse representation manifold structure to keep the local geometric structure characteristics of the data manifold in the Gaussian components, automatically obtains the quantity of the Gaussian components, and has robustness on noise and outliers in the process; the method is based on GMM, combines sparsity and locality of manifold geometry structure, constructs sparse GMM, captures process variable characteristics related to quality variables, reflects state change conditions of the multimode process, and identifies root cause variables of faults according to the detected controlled neighbors of the faults; the method is suitable for quality-related fault detection and diagnosis in the multimode process.
Drawings
FIG. 1 is a flow chart of an embodiment of the method of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in detail with reference to the accompanying drawings and specific embodiments.
The invention adopts a sparse representation method without parameter robustness to construct an intrinsic matrix of a manifold structure and automatically decide the neighborhood range of a sample.
In view of the advantages of the sparse representation and quality-related GMM model in process monitoring, the multimode process quality-related fault diagnosis method based on the sparse GMM is provided, the local similarity and sparsity among samples on the manifold geometry of Gaussian components are kept, the learning performance of the GMM model is enhanced, and the diagnosis capability of the model is improved.
On the basis of a Gaussian mixture model, aiming at potential complex structural characteristics of a multi-mode process related to quality, a weight matrix of a manifold structure is represented in a parameter-free robust mode by sparse representation, a sparse Gaussian mixture model is constructed on the basis of similarity between manifold data samples on Gaussian components, and local features and sparse features of the multi-mode process data are kept. The proposed sparse GMM method can fully discover the intrinsic changes of the multimode process, and select important fault variables related to quality according to the controlled neighbors of the detected faults. Compared with the traditional GMM monitoring method, the method can obtain higher diagnosis precision and stronger fault discrimination capability.
The multimode process quality-related fault diagnosis method based on the sparse GMM utilizes the sparse representation to represent the geometric information distribution condition of Gaussian components, constructs the sparse GMM, analyzes the data distribution characteristics of the multimode process, extracts the process variable characteristics related to the quality, captures the internal change of the multimode process, analyzes the root variable of the fault occurrence and enhances the fault diagnosis capability of the sparse GMM method.
As shown in fig. 1, the present invention relates to a sparse GMM-based multimode process quality-related fault diagnosis method, which comprises the following specific implementation steps:
(1) and establishing a sparse representation model between the process variable and the quality variable to obtain a sparse reconstruction weight matrix S of the sample.
Given N samples from a multimodal process, each sample contains a process variable x ∈RmAnd the quality variable y ∈ R, the historical data set D is represented as:
Figure BDA0001860104610000071
wherein R ismRepresents an m-dimensional variable, R represents a 1-dimensional variable, [ y (1) x (1)]TRepresenting quality variable and 1 st process variable, abbreviated to d1By analogy, [ y (1) x (N)]TRepresenting quality variable and Nth process variable, abbreviated to dNThe superscript T is transposition;
sparse representation the sparse representation of the training sample is calculated by jointly optimizing an objective function through the following formula:
Figure BDA0001860104610000072
in the formula, L1-normal form represents the sum of absolute values of matrix elements, lambda is a regularization parameter positive value, E is a sparse error, and E is a column vector with 1 element; solving sparse reconstruction weight matrix S ═ S by convex optimization method1,s2,…,sN]∈RN×N,RN ×NRepresenting storage spaces with rows and columns of N, where sn=[sn1,sn2,…,snn-1,0,snn+1,…,snN]T,n=1,2...,N,snFor the nth sample data dnOf each element s of the sparse coefficient variablenjDenotes dnFor reconstructed j sample djThe degree of contribution of (c).
(2) And constructing a sparse Gaussian mixture model. By increasing the constraint condition of the similarity of the neighbor samples around the Gaussian components, the neighborhood range of the training samples is selected in a self-adaptive manner, the conditional probability distribution of the similarity of the Gaussian components is obtained, the locality and the sparsity of a data manifold structure are maintained, the number of the Gaussian components is automatically identified, and the robustness is provided for data noise and outliers. The substeps of solving the sparse gaussian mixture model are as follows:
step a, constructing a Gaussian mixture model related to quality, wherein a probability density function p (d | Θ) of each sample is represented as:
Figure BDA0001860104610000081
where d is the sample data, C is the number of Gaussian components, ωkIs the posterior probability of the kth Gaussian component, satisfies
Figure BDA0001860104610000082
Gauss parameter thetak={μkk},μkIs a k-th class mean, ΣkIs a covariance matrix of class k, theta ═ theta12,...,θCIs the set of global Gaussian model parameters, θ12,...,θCRespectively represent the 1 st, 2 nd, … th and C th local Gaussian components; for the kth Gaussian component, its probability density function g (d θ)k) Is shown as
Figure BDA0001860104610000083
Let Z be { Z ═ Z1,z2,…,zNDenotes missing data of a category, and zi∈{1,2,...,C},ziRepresents a category to which the ith sample belongs; if z isiK, i-th sample belongs to k-th Gaussian component, and latent variable zki1, otherwise zki0; m is the dimension of the sample, and the parameters Θ of the quality-related gaussian mixture model are solved by maximizing the following likelihood functions:
Figure BDA0001860104610000084
wherein L (omega, theta; D, Z) is a likelihood function, omega is a posterior probability;
b, fusing geometric structure knowledge of conditional probability distribution in a GMM target function based on the principle that similar samples in the sparse representation and manifold structures are still close to each other in an embedding space, and constructing a sparse GMM; and realizing the minimized sparse regular term for the pairwise similarity constraint on the Gaussian probability distribution, wherein the constraint term is expressed as:
Figure BDA0001860104610000085
wherein, γn=p(zn|dn) Denotes the nth sample dnBelong to znProbability of class, γj=p(zj|dj) Denotes the jth sample djBelong to zjThe probability of the category, H (| ·), measures the similarity of the two distributions, and by adopting KL distance measurement, after minimizing the constraint term, the data are smoothly distributed along manifold geometric distance measurement; fusing the sparse regular term with the likelihood function of the GMM to obtain an objective function J of the sparse GMM:
Figure BDA0001860104610000086
wherein δ >0 is a regularization parameter; solving the parameters of the sparse GMM by adopting an expected value maximization method;
step c, calculating latent variable z according to original GMM and Bayes lawki
Figure BDA0001860104610000091
Wherein, ω islIs the posterior probability of the Gaussian component l, g (d)ik) Is a sample diIn the Gaussian component thetakDensity probability of (1), g (d)il) Is a sample diIn the Gaussian component thetal(ii) a density probability of;
step d, updating model parameters:
Figure BDA0001860104610000092
Figure BDA0001860104610000093
Figure BDA0001860104610000094
in the formula (I), the compound is shown in the specification,
Figure BDA0001860104610000095
is the sum of classes, Ω, of samples of the k-th Gaussian componentnk=(dnk)(dnk)TAnd is the sample covariance of the kth gaussian component.
(3) Constructing a sparse GMM monitoring index, designing a reasonable fault detection and diagnosis index by using the output result of the whole model and the local Mahalanobis distance of each mode, fusing the global output and the local information of the process, and evaluating the running state of the multi-mode process. The substep of solving the sparse GMM monitoring index is represented as:
step I, for each monitoring sample dtAccording to the output result of the global structure information represented by the sparse GMM, the model is used for outputting JoutMeasuring the degree of deviation of the monitoring sample from the controlled training baseline, and measuring the degree of deviation J of the monitoring sample from the controlled training baselineoutExpressed as:
Jout=-ln p(dt|Θ) (12)
in the formula, p (d)t| Θ) represents a monitoring sample dtBased on the probability density of the training model;
counting the probability that the output value of the monitoring sample is smaller than the output value of the training sample, and designing LotIndex quantization process state:
Lot(dt)=Pr(Jout(dt)≤Jout(dtrain)) (13)
in the formula, Jout(dtrain) Represents the output of the training set, Jout(dt) Presentation measurementOutput of the test sample, Lot(dt) Is a global information index; pr (J)out(dt)≤Jout(dtrain) Represents the probability that the test sample output is lower than the training set output;
mahalanobis distance is used to measure quality-related fault detection within a single modality, calculating a monitoring sample dtTo each modality CkThe mahalanobis distance MD of (a):
Figure BDA0001860104610000101
mahalanobis distance follows the approximated χ2Is distributed, i.e.
Figure BDA0001860104610000102
And is
Figure BDA0001860104610000103
By fusing global information indicators Lot(dt) And local information index MD(k)(dt|dtE k), designing a synthetic fault detection index GL, reducing monitoring cost and operation load, wherein GL is expressed as:
Figure BDA0001860104610000104
given control limit ηα(α ═ 0.05), when GL of the monitored sample exceeds the control limit, a fault related to quality is detected, otherwise, no fault occurs;
secondly, after detecting the quality-related fault, further determining a root variable of the fault; defining the contribution degree of the process variable as the distance between a fault sample and a controlled neighbor, obtaining the direction of the fault deviating from normal operation by solving the virtual controlled neighbor of the fault, and determining a root variable generated by the fault;
the optimization function that defines the controlled vicinity of the fault is:
Figure BDA0001860104610000105
wherein d isfaultIs a detected fault sample, dnicnThe method is characterized in that the method is a virtual controlled neighbor sample of a fault, the sample is randomly selected from training samples to serve as initialization input of a nonlinear programming technology, and the controlled neighbor sample is solved, so that the fault sample is further expressed as:
dfault=dnicn+f (17)
in the formula, f represents an estimated fault vector and is used for evaluating the minimum value of the fault sample adjusted to a normal state; the degree of contribution of each variable is defined as dfaultAnd dnicnIn euclidean distance CON betweenq
Figure BDA0001860104610000106
In the formula (I), the compound is shown in the specification,
Figure BDA0001860104610000107
and fqRespectively represents dnicn,dfaultAnd the qth element of f;
redefining the degree of contribution of a variable CONMD using Mahalanobis distancesq
Figure BDA0001860104610000108
In the formula, xiqIs the unit vector with the qth element being 1, will have the maximum CONMDqIsolation of variable sets of values, setting threshold τ as:
Figure BDA0001860104610000111
if the variable of the fault exceeds this threshold τ, it is determined that the change in the variable caused a process anomaly, triggering the fault.
The method utilizes the weight matrix of the sparse representation manifold structure to keep the local geometric structure characteristics of the data manifold in the Gaussian components, automatically obtains the number of the Gaussian components, and has robustness to noise and outliers in the process. The method is based on GMM, combines sparsity and locality of manifold geometry structure, constructs sparse GMM, captures process variable characteristics related to quality variables, reflects state change conditions of the multi-mode process, and identifies root cause variables of faults according to the detected controlled neighbors of the faults. Therefore, the method is suitable for quality-related fault detection and diagnosis in a multimode process.
The above description is only for the specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention.

Claims (5)

1. A multimode process quality related fault diagnosis method based on sparse GMM is characterized by comprising the following steps:
step A, establishing a sparse representation model between a process variable and a quality variable to obtain a sparse reconstruction weight matrix S of a sample;
b, constructing a sparse Gaussian mixture model: forming a constraint condition of the similarity of neighboring samples around Gaussian components according to a sparse coefficient in a sparse reconstruction weight matrix S obtained by a sparse representation model, adaptively selecting a neighborhood range of a training sample, obtaining the conditional probability distribution of the similarity of the Gaussian components, keeping the locality and the sparsity of a data manifold structure, and automatically identifying the number of the Gaussian components;
and step C, constructing a sparse GMM monitoring index, designing a fault detection and diagnosis index by using an output result of a sparse Gaussian mixture model and the local Mahalanobis distance of each mode, fusing the overall output and the local information of the process, and evaluating the running state of the multi-mode process.
2. The sparse GMM-based multimode process quality-related fault diagnosis method according to claim 1, wherein the step A specifically comprises the following steps:
given N samples from a multimodal process, each sample contains a process variable x ∈ RmAnd the quality variable y ∈ R, the historical data set D is represented as:
Figure FDA0002921561830000011
wherein R ismRepresents an m-dimensional variable, R represents a 1-dimensional variable, [ y (1) x (1)]TRepresenting quality variable and 1 st process variable, abbreviated to d1By analogy, [ y (1) x (N)]TRepresenting quality variable and Nth process variable, abbreviated to dNThe superscript T is transposition;
sparse representation the sparse representation of the training sample is calculated by jointly optimizing an objective function through the following formula:
Figure FDA0002921561830000012
in the formula, L1-normal form represents the sum of absolute values of matrix elements, lambda is a regularization parameter positive value, E is a sparse error, and E is a column vector with 1 element; solving sparse reconstruction weight matrix S ═ S by convex optimization method1,s2,…,sN]∈RN×N,RN×NRepresenting storage spaces with rows and columns of N, where sn=[sn1,sn2,…,snn-1,0,snn+1,…,snN]T,n=1,2...,N,snFor the nth sample data dnOf each element s of the sparse coefficient variablenjDenotes dnFor reconstructed j sample djThe degree of contribution of (c).
3. The sparse GMM-based multimode process quality-related fault diagnosis method according to claim 2, wherein the sparse Gaussian mixture model is constructed by the following specific steps:
step a, constructing a Gaussian mixture model related to quality, wherein a probability density function p (d | Θ) of each sample is represented as:
Figure FDA0002921561830000021
where d is the sample data, C is the number of Gaussian components, ωkIs the posterior probability of the kth Gaussian component, satisfies
Figure FDA0002921561830000022
Gauss parameter thetak={μk,∑k},μkIs a class k mean, ΣkIs a covariance matrix of class k, theta ═ theta12,...,θCIs the set of global Gaussian model parameters, θ12,...,θCRespectively represent the 1 st, 2 nd, … th and C th local Gaussian components; for the kth Gaussian component, its probability density function g (d | θ)k) Is shown as
Figure FDA0002921561830000023
Let Z be { Z ═ Z1,z2,…,zNDenotes missing data of a category, and zi∈{1,2,...,C},ziRepresents a category to which the ith sample belongs; if z isiK, i-th sample belongs to k-th Gaussian component, and latent variable zki1, otherwise zki0; m is the dimension of the sample, and the parameters Θ of the quality-related gaussian mixture model are solved by maximizing the following likelihood functions:
Figure FDA0002921561830000024
wherein L (omega, theta; D, Z) is a likelihood function, omega is a posterior probability;
b, fusing geometric structure knowledge of conditional probability distribution in a GMM target function based on the principle that similar samples in the sparse representation and manifold structures are still close to each other in an embedding space, and constructing a sparse GMM; and realizing the minimized sparse regular term for the pairwise similarity constraint on the Gaussian probability distribution, wherein the constraint term is expressed as:
Figure FDA0002921561830000025
wherein, γn=p(zn|dn) Denotes the nth sample dnBelong to znProbability of class, γj=p(zj|dj) Denotes the jth sample djBelong to zjThe probability of the category, H (| ·), measures the similarity of the two distributions, and by adopting KL distance measurement, after minimizing the constraint term, the data are smoothly distributed along manifold geometric distance measurement; fusing the sparse regular term with the likelihood function of the GMM to obtain an objective function J of the sparse GMM:
Figure FDA0002921561830000031
wherein δ >0 is a regularization parameter; solving the parameters of the sparse GMM by adopting an expected value maximization method;
step c, calculating latent variable z according to original GMM and Bayes lawki
Figure FDA0002921561830000032
Wherein, ω islIs the posterior probability of the Gaussian component l, g (d)ik) Is a sample diIn the Gaussian component thetakDensity probability of (1), g (d)il) Is a sample diIn the Gaussian component thetal(ii) a density probability of;
step d, updating model parameters:
Figure FDA0002921561830000033
Figure FDA0002921561830000034
Figure FDA0002921561830000035
in the formula (I), the compound is shown in the specification,
Figure FDA0002921561830000036
is the sum of classes, Ω, of samples of the k-th Gaussian componentnk=(dnk)(dnk)TAnd is the sample covariance of the kth gaussian component.
4. The sparse GMM-based multimode process quality-related fault diagnosis method according to claim 3, wherein the step of constructing the sparse GMM monitoring index is as follows:
step I, for each monitoring sample dtAccording to the output result of the global structure information represented by the sparse GMM, the model is used for outputting JoutMeasuring the degree of deviation of the monitoring sample from the controlled training baseline, and measuring the degree of deviation J of the monitoring sample from the controlled training baselineoutExpressed as:
Jout=-ln p(dt|Θ) (12)
in the formula, p (d)t| Θ) represents a monitoring sample dtBased on the probability density of the training model;
counting the probability that the output value of the monitoring sample is smaller than the output value of the training sample, and designing LotIndex quantization process state:
Lot(dt)=Pr(Jout(dt)≤Jout(dtrain)) (13)
in the formula, Jout(dtrain) Represents the output of the training set, Jout(dt) Represents the output of the test sample, Lot(dt) Is a global information index; pr (J)out(dt)≤Jout(dtrain) Represents the probability that the test sample output is lower than the training set output;
mahalanobis distance is used to measure quality-related fault detection within a single modality, calculating a monitoring sample dtTo each modality CkThe mahalanobis distance MD of (a):
Figure FDA0002921561830000041
mahalanobis distance follows the approximated χ2Is distributed, i.e.
Figure FDA0002921561830000042
And is
Figure FDA0002921561830000043
By fusing the global information index Lot(dt) And local information index MD(k)(dt|dtE k), designing a synthetic fault detection index GL, reducing monitoring cost and operation load, wherein GL is expressed as:
Figure FDA0002921561830000044
given control limit ηαWhen GL of the monitoring sample exceeds the control limit, detecting a fault related to the quality, otherwise, indicating that no fault occurs;
secondly, after detecting the quality-related fault, further determining a root variable of the fault; defining the contribution degree of the process variable as the distance between a fault sample and a controlled neighbor, obtaining the direction of the fault deviating from normal operation by solving the virtual controlled neighbor of the fault, and determining a root variable generated by the fault;
the optimization function that defines the controlled vicinity of the fault is:
Figure FDA0002921561830000045
wherein d isfaultIs a detected fault sample, dnicnThe method is characterized in that the method is a virtual controlled neighbor sample of a fault, the sample is randomly selected from training samples to serve as initialization input of a nonlinear programming technology, and the controlled neighbor sample is solved, so that the fault sample is further expressed as:
dfault=dnicn+f (17)
in the formula, f represents an estimated fault vector and is used for evaluating the minimum value of the fault sample adjusted to a normal state; the degree of contribution of each variable is defined as dfaultAnd dnicnIn euclidean distance CON betweenq
Figure FDA0002921561830000046
In the formula (I), the compound is shown in the specification,
Figure FDA0002921561830000047
and fqRespectively represents dnicn,dfaultAnd the qth element of f;
redefining the degree of contribution of a variable CONMD using Mahalanobis distancesq
Figure FDA0002921561830000051
In the formula, xiqIs the unit vector with the qth element being 1, will have the maximum CONMDqIsolation of variable sets of values, setting threshold τ as:
Figure FDA0002921561830000052
if the variable of the fault exceeds this threshold τ, it is determined that the change in the variable caused a process anomaly, triggering the fault.
5. The sparse GMM-based multimode process quality-related fault diagnosis method according to claim 4, wherein α is 0.05.
CN201811331420.2A 2018-11-09 2018-11-09 Multimode process quality-related fault diagnosis method based on sparse GMM Active CN109491338B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811331420.2A CN109491338B (en) 2018-11-09 2018-11-09 Multimode process quality-related fault diagnosis method based on sparse GMM

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811331420.2A CN109491338B (en) 2018-11-09 2018-11-09 Multimode process quality-related fault diagnosis method based on sparse GMM

Publications (2)

Publication Number Publication Date
CN109491338A CN109491338A (en) 2019-03-19
CN109491338B true CN109491338B (en) 2021-05-11

Family

ID=65694254

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811331420.2A Active CN109491338B (en) 2018-11-09 2018-11-09 Multimode process quality-related fault diagnosis method based on sparse GMM

Country Status (1)

Country Link
CN (1) CN109491338B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110135492B (en) * 2019-05-13 2020-12-22 山东大学 Equipment fault diagnosis and abnormality detection method and system based on multiple Gaussian models
CN110308713A (en) * 2019-06-03 2019-10-08 湖州师范学院 A kind of industrial process failure identification variables method based on k neighbour reconstruct
CN111880090B (en) * 2019-06-28 2021-07-06 浙江大学 Online fault detection method for million-kilowatt ultra-supercritical unit
CN111913461B (en) * 2019-09-07 2022-03-18 宁波大学 Distributed chemical process monitoring method based on regularization GCCA model
CN113162787B (en) * 2020-01-23 2023-09-29 华为技术有限公司 Method for fault location in a telecommunication network, node classification method and related devices
CN111291319B (en) * 2020-03-24 2023-07-07 广东海洋大学深圳研究院 Mobile robot state estimation method applied to non-Gaussian noise environment
CN112068065B (en) * 2020-10-12 2022-06-21 武汉格蓝若智能技术有限公司 Voltage transformer state early warning method and device and storage medium
CN113314142B (en) * 2021-05-26 2022-04-01 山东大学 Engine fault detection method, system, storage medium and equipment
CN115310561B (en) * 2022-09-29 2022-12-20 中国空气动力研究与发展中心设备设计与测试技术研究所 Electromagnetic valve fault monitoring method based on integrated instant learning

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103456302B (en) * 2013-09-02 2016-04-20 浙江大学 A kind of emotional speaker recognition method based on the synthesis of emotion GMM Model Weight
CN103631145B (en) * 2013-12-11 2016-08-17 清华大学 Multiple operating modes process monitoring method and system based on monitor control index switching
US10553218B2 (en) * 2016-09-19 2020-02-04 Pindrop Security, Inc. Dimensionality reduction of baum-welch statistics for speaker recognition
CN106600557B (en) * 2016-12-19 2019-09-17 辽宁工程技术大学 PSF estimation method based on mixed Gauss model and sparse constraint

Also Published As

Publication number Publication date
CN109491338A (en) 2019-03-19

Similar Documents

Publication Publication Date Title
CN109491338B (en) Multimode process quality-related fault diagnosis method based on sparse GMM
Tang et al. Convolutional neural network‐based data anomaly detection method using multiple information for structural health monitoring
CN109766930B (en) Method for predicting residual life of mine mechanical equipment based on DCNN model
US12001949B2 (en) Computer-implemented method, computer program product and system for data analysis
EP3620983B1 (en) Computer-implemented method, computer program product and system for data analysis
CN112580263B (en) Turbofan engine residual service life prediction method based on space-time feature fusion
CN110940523B (en) Unsupervised domain adaptive fault diagnosis method
Wang et al. Random convolutional neural network structure: An intelligent health monitoring scheme for diesel engines
Yu et al. Challenges and opportunities of deep learning-based process fault detection and diagnosis: a review
Cheng et al. Monitoring influent conditions of wastewater treatment plants by nonlinear data-based techniques
CN111353373A (en) Correlation alignment domain adaptive fault diagnosis method
CN111665819A (en) Deep learning multi-model fusion-based complex chemical process fault diagnosis method
CN112557034B (en) Bearing fault diagnosis method based on PCA _ CNNS
CN111768000A (en) Industrial process data modeling method for online adaptive fine-tuning deep learning
CN113221441B (en) Method and device for health assessment of power plant equipment
Pantazi et al. Intelligent data mining and fusion systems in agriculture
Peng et al. Towards robust and understandable fault detection and diagnosis using denoising sparse autoencoder and smooth integrated gradients
CN110222387B (en) Multi-element drilling time sequence prediction method based on mixed leaky integration CRJ network
Yu et al. Control chart recognition based on the parallel model of CNN and LSTM with GA optimization
Gao et al. A process fault diagnosis method using multi‐time scale dynamic feature extraction based on convolutional neural network
Yu et al. Wafer map defect recognition based on deep transfer learning-based densely connected convolutional network and deep forest
CN115062272A (en) Water quality monitoring data abnormity identification and early warning method
CN116701992A (en) Multi-mode anomaly detection method based on graph attention network and time convolution network
Dong et al. Quality monitoring and root cause diagnosis for industrial processes based on Lasso-SAE-CCA
Wang et al. An intelligent process fault diagnosis system based on Andrews plot and convolutional neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant