CN110647922A - Layered non-Gaussian process monitoring method based on public and special feature extraction - Google Patents

Layered non-Gaussian process monitoring method based on public and special feature extraction Download PDF

Info

Publication number
CN110647922A
CN110647922A CN201910828558.1A CN201910828558A CN110647922A CN 110647922 A CN110647922 A CN 110647922A CN 201910828558 A CN201910828558 A CN 201910828558A CN 110647922 A CN110647922 A CN 110647922A
Authority
CN
China
Prior art keywords
data
gaussian
detected
input
output
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910828558.1A
Other languages
Chinese (zh)
Other versions
CN110647922B (en
Inventor
何雨辰
韩丽黎
王云
宋执环
曾九孙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Metrology
Original Assignee
China University of Metrology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Metrology filed Critical China University of Metrology
Priority to CN201910828558.1A priority Critical patent/CN110647922B/en
Publication of CN110647922A publication Critical patent/CN110647922A/en
Application granted granted Critical
Publication of CN110647922B publication Critical patent/CN110647922B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211Selection of the most significant subset of features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Testing Or Calibration Of Command Recording Devices (AREA)
  • Testing And Monitoring For Control Systems (AREA)

Abstract

The invention discloses a layered non-Gaussian process monitoring method based on public and special feature extraction. Dividing training data into a plurality of modes, and obtaining high-order public characteristics of the training data by applying a plurality of weight vectors, component vectors and other characteristic quantities in each mode; obtaining low-order public features of the training data by applying a plurality of weight vectors, component vectors and other feature quantities in the remaining low-order modes; constructing statistical limits and statistics in the public subspace according to the public characteristics, and carrying out fault detection; and constructing statistical limits and statistics in the remaining unique subspaces to detect faults. The method is superior to other traditional methods in multi-modal non-Gaussian process fault detection, can extract the special characteristics of the multi-modal and can also extract the common characteristics, and considers the mutual connection of the characteristics and the commonality among different modes, so that the multi-modal process monitoring is more effective.

Description

Layered non-Gaussian process monitoring method based on public and special feature extraction
Technical Field
The invention belongs to the field of industrial process system engineering, and relates to a multi-mode fault monitoring method applied to complex industrial processes such as chemical production and the like.
Background
The complexity of the complex industrial process is reflected in that the process of the modern industrial production generates a large amount of data, the process data generally have different process characteristics, and the distribution characteristics are unknown; the industrial process has a plurality of units, and the data has a mechanism relation; moreover, the traditional method mostly assumes that the chemical process is operated under a single and stable operation condition, but actually the plant-level data is multi-modal, so how to better monitor the multi-modal problem is very worthy of study.
In the past, a great deal of research has been conducted by researchers on advanced data classification and information fusion methods, so that the new measurement data can be accurately distributed to actual operation modes, and the monitoring results can be displayed more accurately. It is noted that there are some process monitoring methods that study global process monitoring methods by using different patterns of data simultaneously, such as support vector data description, k-nearest neighbor based methods, multi-block partial least squares. However, most of these methods related to multi-modal process monitoring can only acquire the unique features of each mode data, and ignore the common features among each mode data. In actual monitoring, it would be beneficial to monitor multimodal processes if the correlation of characteristics and commonalities between different modes could be obtained in advance. Some researchers have proposed a two-step model framework, the first step being the need to develop separate process monitoring models, and the second step being the examination of similarities between them to obtain models representing common features. There have also been some studies attempting to divide the data of each mode into common and unique parts by using the data of multiple modes simultaneously, and although this method has been further extended to the monitoring of the transition process between modes, this method can be used simply for the case of calculating both modes. Moreover, the common features and characteristic features between different modes are not clearly explained in the geometrical relationship, so that the method has no way to carry out deeper research on the multimode process. A two-step multi-set principal component analysis method has also been proposed to obtain common basis vectors for multiple sets of data. The scholars propose a multimode process monitoring method based on a least square subspace based on the obtained common base vector, although the multimode process monitoring method also successfully extends to the conversion process between monitoring modes, the common base vector does not span any subspace of subset data and only serves as an auxiliary vector to improve the correlation of a cross set to the maximum extent, and therefore, the common characteristics of multimode data cannot be accurately obtained by directly mapping the data to the subspace. Therefore, how to extract the relationship among the multi-modal data and apply different models to the closely-connected multi-modal data becomes an important problem for fault monitoring.
Disclosure of Invention
In order to overcome the problems that in the multi-modal monitoring process, the traditional multi-modal process fault monitoring is carried out, only the specific characteristics of each mode data are developed, and the mutual connection of the characteristics and the commonality among different modes is ignored, the invention aims to provide a layered non-Gaussian process monitoring method based on common and specific characteristic extraction, the data collected by a sensor in industrial production is utilized to carry out real-time analysis and processing to obtain the detection and monitoring of real-time fault states, and the technical problem that the multi-modal process monitoring effect is poor in the prior art is solved.
The invention seeks to simultaneously construct process monitoring models from all mode data, obtain a common subspace by capturing common characteristics behind different modes, obtain an independent subspace reflecting the characteristic characteristics of each mode, and then carry out fault detection method research on each subspace.
The purpose of the invention is realized by the following technical scheme:
firstly, the classified multi-modal data are classified to obtain public features through weight vector parameters and weight vector scores, and fault monitoring is carried out by utilizing the public features. And introducing a double-layer non-Gaussian monitoring method aiming at the residual characteristic features to monitor the faults.
The specific method comprises the following steps:
1) acquiring data of an input variable x and an output variable y of the industrial production process as training data through a sensor under the condition that no fault is known in the industrial production process;
in industryAcquiring an input variable x of the industrial production process on line through a sensor under the condition that a fault needs to be detected in the production processtestAnd an output variable ytestThe data to be detected is used as the data to be detected;
2) classifying the training data by adopting a fault detection method of a layered non-Gaussian algorithm based on sample multi-modal classification to obtain classified training data;
3) aiming at the classified training data, each classification corresponds to one mode, an optimization objective function is established in each mode by using a plurality of weight vectors, component vectors and other characteristic quantities to obtain the classified training data and the common characteristics of the data to be detected, the step iii) in the fault monitoring method of the industrial production process based on the hierarchical non-Gaussian monitoring algorithm is adopted according to the common characteristics to extract the training data of the non-Gaussian part with the common characteristics in each mode, and the statistical limit of the training data of the non-Gaussian part with the common characteristics and the statistical quantity of the data to be detected are processed;
4) after training data of a non-Gaussian part with common characteristics are obtained, extracting the common characteristics of the remaining Gaussian part data, establishing an optimization objective function by using a plurality of weight vectors, component vectors and other characteristic quantities in each mode for the Gaussian part data to obtain the common characteristics of the Gaussian part data, and combining a partial least square method according to the common characteristics to obtain the statistical limit of the training data of the Gaussian part with the common characteristics and the statistical quantity of data to be detected;
5) after Gaussian part data with common characteristics and non-Gaussian part data with common characteristics are extracted, the rest data are data with characteristic characteristics, and according to the data with characteristic characteristics, the iv) step in the industrial production process fault monitoring method based on the hierarchical non-Gaussian monitoring algorithm is adopted to obtain the statistical limit of training data with characteristic characteristics and the statistical quantity of data to be detected;
6) and classifying the data to be detected in an online identification mode to obtain the monitored and classified data to be detected.
The input variable x is, for example, a process variable in an industrial process.
The output variable y is, for example, a quality variable in an industrial process.
In the step 3), the concrete steps are as follows:
a) forming all input variables X of the industrial process into an input variable data set X, wherein X belongs to Rn×M·mCarrying out standardization processing on each input variable X in an input variable data set X, wherein n is the number of sampling values in the input variable X, the input variable X is composed of a series of sampling values, M is the number of the input variables, and M is the classification number of training data, namely the mode number; outputting the variable data set as Y in the same way;
3, b) the classified input variable data is expressed as: x ═ X(1) X(2) ... X(M)]The output variable data is expressed as: y ═ Y(1) Y(2) ... Y(M)];X(1) X(2) ... X(M)Input variable data respectively expressed as first to Mth classifications, Y(1) Y(2) ... Y(M)Output variable data respectively expressed as first to Mth classifications;
3, c) thus, the composition variable t of the input variable x for each modality in the data is obtained using the following formulacAnd a component vector u of an output variable y of each modalityc
Figure BDA0002189898800000031
Figure BDA0002189898800000032
Wherein p iscAnd q iscRespectively single input and output common weight vectors; α, β are input and output component weight vector parameters for combining M pieces of modal data, α ═ α(1) α(2) ... α(M)],α(1) α(2) ... α(M)Expressed as input component weight vector parameters of first to mth classes, respectively, β ═ β(1) β(2) ... β(M)],β(1) β(2) ... β(M)Output component weight vector parameters of a first classification to an Mth classification are respectively expressed, wherein i represents an ith mode;
establishing a first objective function as:
max{I(tc,uc)}
s.t.||pc||=||qc||=||α||=||β||=1
wherein, I (t)c,uc) Component variable t being input variable xcAnd component vector u of output variable ycThe mutual information value between, | | | | represents the modulus;
3, d) solving by adopting an iterative algorithm to obtain input and output public weight vectors pc、qcAnd input and output component weight vector parameters alpha and beta, and then processing to obtain non-Gaussian partial data with common features, which is expressed as
Figure BDA0002189898800000041
And is obtained by processing training data of non-Gaussian parts with common features
Figure BDA0002189898800000042
Statistical limit of statistics
Figure BDA0002189898800000043
3.e) pre-multiplying the input variable of the data to be detected by pcObtaining a first fault estimation vector of the data to be detected, and obtaining the data to be detected by using the first fault estimation vector
Figure BDA0002189898800000044
Statistics are obtained.
In the step 3.d), the following iterative algorithm is used for solving and obtaining the input and output public weight vector pc、qcAnd input and output component weight vector parameters α, β:
(3.d.1) randomly initializing input and output component weight vector parameters alpha and beta;
(3.d.2) will loseThe input and output component weight vector parameters alpha and beta are substituted into the classified input variable data and the classified output variable data to obtain the weighted input data
Figure BDA0002189898800000045
And output data
Figure BDA0002189898800000046
(3.d.3) solving an input and output public weight vector by adopting an industrial production process fault monitoring method based on a layered non-Gaussian monitoring algorithm;
(3.d.4) inputting and outputting the public weight vector p obtained in the last stepcAnd q iscIs brought into a first objective function;
(3, d.5) solving input and output component weight vector parameters alpha and beta by using a particle swarm algorithm;
(3.d.6) repeating the steps (2) - (6) until α, β and pc,qcConvergence, i.e. the four values are all less than the respective preset threshold.
Step 4), removing the data of the non-gaussian part obtained in step 3) from the training data, and remaining the training data of the gaussian part, and then the specific steps are as follows:
a) Gaussian data of the classified input variables are represented as: xG=[XG(1) XG(2) ... XG(M)]The gaussian data for the output variable is expressed as: y isG=[YG(1) YG(2) ... YG(M)];XG(1) XG(2) ... XG(M)Gaussian data, Y, respectively expressed as input variables of the first to Mth classesG(1) YG(2) ... YG(M)Gaussian data respectively expressed as output variables of the first to mth classifications;
4.b) thus, the component variable t of the input variable x for each modality in the Gaussian partial data is obtained using the following formulac,GAnd a component vector u of an output variable y of each modalityc,G
Figure BDA0002189898800000051
Figure BDA0002189898800000052
Wherein p isc,GAnd q isc,GA single input, output common weight vector of the gaussian partial data, respectively; alpha is alphaGAnd betaGIs the input and output component weight vector parameter, alpha, of the combined M modal data of the Gaussian partial dataG=[αG(1) αG(2) ... αG(M)],αG(1) αG(2) ... αG(M)Expressed as the input component weight vector parameter, beta, of the first to Mth classes, respectivelyG=[βG(1)βG(2) ... βG(M)],βG(1) βG(2) ... βG(M)Output component weight vector parameters of a first classification to an Mth classification are respectively expressed, wherein i represents an ith mode;
establishing a second objective function as:
max{I(tc,G,uc,G)}
s.t.||pc,G||=||qc,G||=||αG||=||βG||=1
wherein, I (t)c,G,uc,G) Component variable t of input variable x in Gaussian part datac,GAnd component vector u of output variable yc,GThe mutual information value between, | | | | represents the modulus;
4, c) solving and obtaining the input and output public weight vector p in the Gaussian part data by adopting an iterative algorithmc,G、qc,GAnd input and output component weight vector parameter alphaG、βGThen processing to obtain training data of a Gaussian part with common characteristics;
and T is obtained by processing the training data of Gaussian part with common characteristicsc 2Statistical limit of statistics
Figure BDA0002189898800000055
4, d) pre-multiplying the input variable of the data to be detected by pc,GObtaining a second fault estimation vector of the data to be detected, and obtaining T of the data to be detected by using the second fault estimation vector in combination with a partial least square methodc 2Statistics are obtained.
In the step 4.c), the following iterative algorithm is used for solving and obtaining the input and output public weight vector p in the Gaussian partial datac,G、qc,GAnd input and output component weight vector parameter alphaG、βG
(4.c.1) randomly initializing input and output component weight vector parameter alpha in Gaussian partial dataG、βG
(4.c.2) inputting and outputting weight vector parameter alpha of input and output components in Gaussian part dataG、βGThe classified input variable data and the classified output variable data are brought in to obtain weighted input data
Figure BDA0002189898800000053
And output data
(4, c.3) solving input and output public weight vectors p in Gaussian part data by adopting industrial production process fault monitoring method based on hierarchical non-Gaussian monitoring algorithmc,GAnd q isc,G
(4.c.4) inputting and outputting the public weight vector p in the Gaussian part data obtained in the last stepc,GAnd q isc,GIs brought into a second objective function;
(4, c.5) solving the weight vector parameter alpha of the input and output components in the Gaussian part data by using a particle swarm algorithmG、βG
(4, c.6) repeating the processes of steps (2) - (6) until alphaG、βGAnd pc,G、qc,GConvergence, i.e. the four values are all less than the respective preset threshold.
In the step 5), the concrete steps are as follows:
a) when the public weight vectorpcAfter the determination, the training data of each modality with the characteristic features remaining is obtained by processing in the following way:
XI(i)=X(i)-X(i)*pcpc T
Figure BDA0002189898800000061
wherein, XI(i)Input variable data set representing training data of the i-th modality with characteristic features, YI(i)An output variable data set representing training data of the ith modality with characteristic features, X(i)Input variable data set, Y, representing training data for the ith modality(i)An output variable data set representing training data for an ith modality;
5, b) aiming at the training data of each mode with the characteristic features, respectively obtaining a load matrix aiming at a non-Gaussian part and a Gaussian part of the training data by adopting step iii) to step iv) of the fault monitoring method of the industrial production process based on the layered non-Gaussian monitoring algorithm, and obtaining I by utilizing the load matrix to processi 2Statistical limit corresponding to statistical quantity
Figure BDA0002189898800000062
C) removing the obtained data of the non-Gaussian part from the training data with the characteristic features, remaining the training data of the Gaussian part with the characteristic features, adopting step iii to step iv in the fault monitoring method of the industrial production process based on the hierarchical non-Gaussian monitoring algorithm to process the training data of the Gaussian part with the characteristic features to respectively obtain load matrixes of the Gaussian part and the Gaussian part of the training data, and obtaining T by utilizing the load matrix processingi 2Statistical limits corresponding to the statistics and Q statisticsAnd a statistical limit Q_limit
5, d) pre-multiplying the data to be detected with the Gaussian part with the characteristic by the training data of the Gaussian part with the characteristic to obtain a load matrix, and pre-multiplying the input variable of the data to be detected with the Gaussian part by the load matrix PGObtaining a fourth fault estimation vector of the data to be detected, and obtaining the data to be detected T by using the fourth fault estimation vectori 2Statistics and Q statistics.
In the step 6), the concrete steps are as follows:
the data to be detected starts to operate, but the types of modal working conditions are unknown, one mode corresponds to one modal working condition, the industrial production process has modal working conditions under M modes in total, the data to be detected of each known mode is monitored to obtain each statistical limit, the data to be detected of each current unknown mode is monitored to obtain each statistical quantity, and the following judgment is carried out:
if the current unknown mode of the data to be detected
Figure BDA0002189898800000071
statistic/Tc 2The statistics being obtained while being higher than the corresponding training data of the first modality representing common features
Figure BDA0002189898800000072
Statistical limit of statistics
Figure BDA0002189898800000073
Or Tc 2Statistical limit of statisticsThe data to be detected in the current unknown mode is fault data;
if the current unknown mode of the data to be detected
Figure BDA0002189898800000075
statistic/Tc 2Obtained with statistics lower than corresponding training data of the first modality representing common features
Figure BDA0002189898800000076
Statistical limit of statistics
Figure BDA0002189898800000077
Or Tc 2Statistical limit of statistics
Figure BDA0002189898800000078
Then judge
Figure BDA00021898988000000720
statistic/Ti 2Whether or not the statistic/Q statistic is simultaneously higher than the corresponding representative characteristic
Figure BDA0002189898800000079
Statistical limit of statistics
Figure BDA00021898988000000710
Ti 2Statistical limit of statistics
Figure BDA00021898988000000711
And statistical limit Q of Q statistic_limit
If not, the data to be detected of the current unknown mode belongs to a first mode;
if the current mode is higher than the preset threshold, the training data of the second mode is called for judgment, and the data to be detected of the current unknown mode is judged
Figure BDA00021898988000000718
statistic/Ti 2Whether the statistic/Q statistic is simultaneously higher than that obtained by training data of a second modality representing characteristic features
Figure BDA00021898988000000719
Statistical limit of statistics
Figure BDA00021898988000000712
Ti 2Statistical limit of
Figure BDA00021898988000000713
And statistical limit of Q_limit
If not, the data to be detected in the current unknown mode belongs to a second mode;
if the current modal is higher than the preset threshold, training data of a third modal is called for judgment, and the current unknown modal to-be-detected data is judged
Figure BDA00021898988000000721
statistic/Ti 2Whether the statistic/Q statistic is simultaneously higher than the I obtained by the corresponding training data of the third modality representing the characteristic featurei 2Statistical limit of statistics
Figure BDA00021898988000000714
Ti 2Statistical limit of statistics
Figure BDA00021898988000000715
And statistical limit Q of Q statistic_limit
If not, the data to be detected in the current unknown mode belongs to a third mode;
if the current modal is higher than the first modal, calling the training data of the fourth modal for judgment, and so on until the current unknown modal to be detected data Ii 2statistic/Ti 2The statistic/Q statistic being higher than that obtained by modal training data
Figure BDA00021898988000000722
Statistical limit of statistics
Figure BDA00021898988000000716
Ti 2Statistical limit of statistics
Figure BDA00021898988000000717
And statistical limit Q of Q statistic_limitConsider the current unknown modality to be detected asAnd (4) fault data, and finishing monitoring and classification of the data to be detected.
And step 7) is also included after the step 6), specifically, the fault detection is carried out by utilizing the statistical limits and statistics in the step 3), the step 4) and the step 5).
The step 7), specifically, the statistical limit and the statistical quantity in the step 3), 4), 5) meet the data to be detected corresponding to any one of the following five conditions, namely, the industrial production process is considered to have a process fault:
for data of non-Gaussian part with common features in the data to be detected, the data is above the statistical limit
Figure BDA00021898988000000723
Data to be detected corresponding to the statistic;
for the data of Gaussian part with common features in the data to be detected, the data is positioned at T above the statistical limitc 2Data to be detected corresponding to the statistic;
for data of non-Gaussian part with characteristic features in the data to be detected, the data is located above the statistical limitData to be detected corresponding to the statistic;
for the data of Gaussian part with characteristic features in the data to be detected, the data is positioned at T above the statistical limiti 2And the statistic and the data to be detected corresponding to the Q statistic.
The invention has the beneficial effects that:
and extracting multi-modal common features and special features for tracking the running performance of the process and the quality of the process. The method seeks to simultaneously construct process monitoring models from all mode data, obtain a common subspace by capturing common features behind different modes, obtain an independent subspace reflecting the characteristic features of each mode, and then carry out fault detection method research on each subspace.
The method is superior to other traditional methods in fault detection of the non-Gaussian multi-modal process, can fully consider the highly complex coupling relation among variables, can extract the non-Gaussian part in the data with unknown distribution characteristics, and can effectively extract the common characteristics and the unique characteristics of the multi-modal data, so that the fault monitoring of the multi-modal process is more efficient and accurate.
The method is superior to other traditional methods in multi-modal non-Gaussian process fault detection, can extract the special characteristics of the multi-modal and can also extract the common characteristics, and considers the mutual connection of the characteristics and the commonality among different modes, so that the multi-modal process monitoring is more effective.
Drawings
FIG. 1 is a diagram of the classification results of the multi-modal process of the present invention.
Fig. 2 is a diagram of the first failure detection result of the present invention.
Fig. 3 is a diagram of a second failure detection result of the present invention.
Detailed Description
The method of the present invention is described in detail below with reference to the accompanying drawings and specific embodiments.
The specific implementation case adopted by the invention is a Tennessee-Emament (TE) process, which comprises five main units: a reactor, a condenser, a compressor, a separator, and a stripper.
The product stream from the reactor was cooled by a condenser and then sent to a vapor/liquid separator. The vapor from the separator is recycled to the reactor by means of a compressor. To prevent the accumulation of inert components and reaction by-products in the process, a portion of the recycle stream must be vented. The condensed components from the separator (stream 10) are pumped to the stripper. Stream 4 is used to strip the remaining reactants in stream 10 which are combined with the recycle stream via stream 5 for further reaction.
The Tennessee-Emersmann (TE) process has 41 measurement variables (including 22 process measurement variables and 19 component measurement variables) and 12 control variables, and in order to monitor the faults of the TE process, ensure the safety of the actual chemical production process and improve the economic benefit, 31 variables are selected from the 22 process measurement variables and the 12 control variables as input variables shown in Table 1 and 6 component measurement variables, namely quality variables, are selected from the 19 measurement variables as output variables when the process monitoring variables are determined, as shown in Table 2. The tennessee-eastman process also simulated 21 faults as shown in table 3.
TABLE 1
Figure BDA0002189898800000091
TABLE 2
Figure BDA0002189898800000092
TABLE 3
Figure BDA0002189898800000101
In order to demonstrate the superiority of the proposed processing method, the present invention is embodied as a multi-modal process case including different stable models, from which 3000 samples are collected in total, and a data matrix of 50 variables is used as a training data set, and the multi-modal process is divided into three modes, wherein the first mode includes 1000 samples, the second mode includes 1000 samples, and the third mode includes 1000 samples, as shown in fig. 1. In order to achieve the purpose of fault detection, two faults are designed, wherein the fault 1 is that a step fault is added to a sample from 300 to 1000 of a 22 th variable of a first modality, and the fault 2 is that a slope fault is added to a sample from 1500 to 2000 of the 22 th variable of a second modality, and the fault detection is respectively carried out as a data set to be detected, and the specific steps are as follows:
1) under the condition that no fault is known in the industrial production process, acquiring data of an input variable x and an output variable y in the industrial production process as training data through a sensor, and carrying out standardization processing to enable the mean value of the data to be 0 and the variance to be 1;
under the condition that the industrial production process needs to detect faults, the industrial production process is acquired on line through a sensorInput variable x of the programtestAnd an output variable ytestThe two fault data are used as data to be detected, and standardized processing is carried out, so that the mean value of the data is 0, and the variance is 1, and the data are used as the data to be detected;
2) classifying the training data by adopting a fault detection method of a layered non-Gaussian algorithm based on sample multi-modal classification to obtain classified training data;
3) aiming at the classified training data, each classification corresponds to one mode, an optimization objective function is established in each mode by using a plurality of weight vectors, component vectors and other characteristic quantities to obtain the classified training data and the common characteristics of the data to be detected, the step iii) in the fault monitoring method of the industrial production process based on the hierarchical non-Gaussian monitoring algorithm is adopted according to the common characteristics to extract the training data of the non-Gaussian part with the common characteristics in each mode, and the statistical limit of the training data of the non-Gaussian part with the common characteristics and the statistical quantity of the data to be detected are processed;
4) after training data of a non-Gaussian part with common characteristics are obtained, extracting the common characteristics of the remaining Gaussian part data, establishing an optimization objective function by using a plurality of weight vectors, component vectors and other characteristic quantities in each mode for the Gaussian part data to obtain the common characteristics of the Gaussian part data, and combining a partial least square method according to the common characteristics to obtain the statistical limit of the training data of the Gaussian part with the common characteristics and the statistical quantity of data to be detected;
5) after Gaussian part data with common characteristics and non-Gaussian part data with common characteristics are extracted, the rest data are data with characteristic characteristics, and according to the data with characteristic characteristics, the iv) step in the industrial production process fault monitoring method based on the hierarchical non-Gaussian monitoring algorithm is adopted to obtain the statistical limit of training data with characteristic characteristics and the statistical quantity of data to be detected;
6) and classifying the data to be detected in an online identification mode to obtain the monitored and classified data to be detected.
The invention can simultaneously construct the process monitoring model from all mode data, obtain the public subspace by capturing the public characteristics behind different modes, and obtain the independent subspace reflecting the characteristic characteristics of each mode, and then carry out the fault detection method research on each subspace, thereby improving the accuracy and effectiveness of fault monitoring.
The above embodiments are disclosed to illustrate the present invention, not to limit the present invention, and any modifications and changes made within the spirit of the present invention and the scope of the claims fall within the scope of the present invention.

Claims (9)

1. A layered non-Gaussian process monitoring method based on public and unique feature extraction is characterized by comprising the following steps:
1) acquiring data of an input variable x and an output variable y of the industrial production process as training data through a sensor under the condition that no fault is known in the industrial production process;
under the condition that the industrial production process needs to detect faults, acquiring an input variable x of the industrial production process on line through a sensortestAnd an output variable ytestThe data to be detected is used as the data to be detected;
2) classifying the training data to obtain classified training data;
3) aiming at the classified training data, each classification corresponds to one mode, an optimization objective function is established in each mode by using a plurality of weight vectors, component vectors and other characteristic quantities to obtain the classified training data and the common characteristics of the data to be detected, the training data of the non-Gaussian part with the common characteristics in each mode are extracted according to the common characteristics, and the statistical limit of the training data of the non-Gaussian part with the common characteristics and the statistical quantity of the data to be detected are processed;
4) after training data of a non-Gaussian part with common characteristics are obtained, extracting the common characteristics of the remaining Gaussian part data, establishing an optimization objective function by using a plurality of weight vectors, component vectors and other characteristic quantities in each mode for the Gaussian part data to obtain the common characteristics of the Gaussian part data, and combining a partial least square method according to the common characteristics to obtain the statistical limit of the training data of the Gaussian part with the common characteristics and the statistical quantity of data to be detected;
5) after Gaussian part data with common characteristics and non-Gaussian part data with common characteristics are extracted, the rest data are data with characteristic characteristics, and statistical limits of training data with characteristic characteristics and statistics of data to be detected are obtained according to the data with characteristic characteristics;
6) and classifying the data to be detected in an online identification mode to obtain the monitored and classified data to be detected.
2. The hierarchical non-gaussian process monitoring method based on common and unique feature extraction as claimed in claim 1, wherein:
in the step 3), the concrete steps are as follows:
a) forming all input variables X of the industrial process into an input variable data set X, wherein X belongs to Rn×M·mStandardizing each input variable X in an input variable data set X, wherein n is the number of sampling values in the input variable X, M is the number of input variables, and M is the classification number of training data, namely a modal number;
3, b) the classified input variable data is expressed as: x ═ X(1) X(2) ... X(M)]The output variable data is expressed as: y ═ Y(1) Y(2) ... Y(M)];X(1) X(2) ... X(M)Input variable data respectively expressed as first to Mth classifications, Y(1) Y(2) ... Y(M)Output variable data respectively expressed as first to Mth classifications;
3, c) thus, the composition variable t of the input variable x for each modality in the data is obtained using the following formulacAnd a component vector u of an output variable y of each modalityc
Figure FDA0002189898790000021
Figure FDA0002189898790000022
Wherein p iscAnd q iscRespectively single input and output common weight vectors; α, β are input and output component weight vector parameters for combining M pieces of modal data, α ═ α(1) α(2) ... α(M)],α(1) α(2) ... α(M)Expressed as input component weight vector parameters of first to mth classes, respectively, β ═ β(1) β(2) ... β(M)],β(1) β(2) ... β(M)Output component weight vector parameters of a first classification to an Mth classification are respectively expressed, wherein i represents an ith mode;
establishing a first objective function as:
max{I(tc,uc)}
s.t.||pc||=||qc||=||α||=||β||=1
wherein, I (t)c,uc) Component variable t being input variable xcAnd component vector u of output variable ycThe mutual information value between, | | | | represents the modulus;
3, d) solving by adopting an iterative algorithm to obtain input and output public weight vectors pc、qcAnd inputting and outputting the component weight vector parameters alpha and beta, then processing to obtain non-Gaussian part data with common characteristics, and processing the non-Gaussian part data with common characteristics to obtain the non-Gaussian part data
Figure FDA0002189898790000023
Statistical limit of statistics
Figure FDA0002189898790000024
3.e) pre-multiplying the input variable of the data to be detected by pcObtaining a first fault estimation vector of the data to be detectedObtaining data to be detected using the first fault estimate vector
Figure FDA0002189898790000025
Statistics are obtained.
3. The hierarchical non-gaussian process monitoring method based on common denominator feature extraction as claimed in claim 2, wherein:
in the step 3.d), the following iterative algorithm is used for solving and obtaining the input and output public weight vector pc、qcAnd input and output component weight vector parameters α, β:
(3.d.1) randomly initializing input and output component weight vector parameters alpha and beta;
(3.d.2) substituting the input and output component weight vector parameters alpha, beta into the classified input variable data and the classified output variable data to obtain weighted input data
Figure FDA0002189898790000026
And output data
Figure FDA0002189898790000027
(3.d.3) solving an input and output public weight vector;
(3.d.4) inputting and outputting the public weight vector p obtained in the last stepcAnd q iscIs brought into a first objective function;
(3, d.5) solving input and output component weight vector parameters alpha and beta by using a particle swarm algorithm;
(3.d.6) repeating the steps (2) - (6) until α, β and pc,qcAnd (6) converging.
4. The hierarchical non-gaussian process monitoring method based on common denominator feature extraction as claimed in claim 1, wherein:
step 4), removing the data of the non-gaussian part obtained in step 3) from the training data, and remaining the training data of the gaussian part, and then the specific steps are as follows:
a) Gaussian data of the classified input variables are represented as: xG=[XG(1) XG(2) ... XG(M)]The gaussian data for the output variable is expressed as: y isG=[YG(1) YG(2) ... YG(M)];XG(1) XG(2) ... XG(M)Gaussian data, Y, respectively expressed as input variables of the first to Mth classesG(1) YG(2) ... YG(M)Gaussian data respectively expressed as output variables of the first to mth classifications;
4.b) thus, the component variable t of the input variable x for each modality in the Gaussian partial data is obtained using the following formulac,GAnd a component vector u of an output variable y of each modalityc,G
Figure FDA0002189898790000031
Figure FDA0002189898790000032
Wherein p isc,GAnd q isc,GA single input, output common weight vector of the gaussian partial data, respectively; alpha is alphaGAnd betaGIs the input and output component weight vector parameter, alpha, of the combined M modal data of the Gaussian partial dataG=[αG(1) αG(2) ... αG(M)],αG(1)αG(2) ... αG(M)Expressed as the input component weight vector parameter, beta, of the first to Mth classes, respectivelyG=[βG(1) βG(2) ... βG(M)],βG(1) βG(2) ... βG(M)Output component weight vector parameters of a first classification to an Mth classification are respectively expressed, wherein i represents an ith mode;
establishing a second objective function as:
max{I(tc,G,uc,G)}
s.t.||pc,G||=||qc,G||=||αG||=||βG||=1
wherein, I (t)c,G,uc,G) Component variable t of input variable x in Gaussian part datac,GAnd component vector u of output variable yc,GThe mutual information value between, | | | | represents the modulus;
4, c) solving and obtaining the input and output public weight vector p in the Gaussian part data by adopting an iterative algorithmc,G、qc,GAnd input and output component weight vector parameter alphaG、βGThen processing to obtain training data of a Gaussian part with common characteristics;
and T is obtained by processing the training data of Gaussian part with common characteristicsc 2Statistical limit of statistics
Figure FDA0002189898790000041
4, d) pre-multiplying the input variable of the data to be detected by pc,GObtaining a second fault estimation vector of the data to be detected, and obtaining T of the data to be detected by using the second fault estimation vector in combination with a partial least square methodc 2Statistics are obtained.
5. The hierarchical non-Gaussian process monitoring method based on common denominator feature extraction as claimed in claim 4, characterized in that:
in the step 4.c), the following iterative algorithm is used for solving and obtaining the input and output public weight vector p in the Gaussian partial datac,G、qc,GAnd input and output component weight vector parameter alphaG、βG
(4.c.1) randomly initializing input and output component weight vector parameter alpha in Gaussian partial dataG、βG
(4.c.2) inputting and outputting weight vector parameter alpha of input and output components in Gaussian part dataG、βGThe classified input variable data and the classified output variable data are brought in to obtain weighted input data
Figure FDA0002189898790000042
And output data
Figure FDA0002189898790000043
(4, c.3) solving input and output public weight vectors p in Gaussian part data by adopting industrial production process fault monitoring method based on hierarchical non-Gaussian monitoring algorithmc,GAnd q isc,G
(4.c.4) inputting and outputting the public weight vector p in the Gaussian part data obtained in the last stepc,GAnd q isc,GIs brought into a second objective function;
(4, c.5) solving the weight vector parameter alpha of the input and output components in the Gaussian part data by using a particle swarm algorithmG、βG
(4, c.6) repeating the processes of steps (2) - (6) until alphaG、βGAnd pc,G、qc,GAnd (6) converging.
6. The hierarchical non-gaussian process monitoring method based on common and unique feature extraction as claimed in claim 1, wherein:
in the step 5), the concrete steps are as follows:
a) when the common weight vector pcAfter the determination, the training data of each modality with the characteristic features remaining is obtained by processing in the following way:
XI(i)=X(i)-X(i)*pcpc T
Figure FDA0002189898790000044
wherein, XI(i)Input variable data set representing training data of the i-th modality with characteristic features, YI(i)An output variable data set representing training data of the ith modality with characteristic features, X(i)Input variable data set, Y, representing training data for the ith modality(i)Representing the ith modalityAn output variable data set of training data;
5, b) aiming at the training data of each mode with the characteristic features, processing the training data by adopting an industrial production process fault monitoring method based on a layered non-Gaussian monitoring algorithm to respectively obtain load matrixes aiming at the non-Gaussian part and the Gaussian part of the training data, and processing the load matrixes to obtain the load matrixes
Figure FDA0002189898790000051
Statistical limit corresponding to statistical quantity
Figure FDA0002189898790000052
C) removing the obtained data of the non-Gaussian part from the training data with the characteristic features, remaining the training data of the Gaussian part with the characteristic features, processing the training data of the Gaussian part with the characteristic features by adopting an industrial production process fault monitoring method based on a hierarchical non-Gaussian monitoring algorithm to respectively obtain load matrixes of the Gaussian part and the Gaussian part of the training data, and processing the load matrixes to obtain Ti 2Statistical limits corresponding to the statistics and Q statistics
Figure FDA0002189898790000053
And a statistical limit Q_limit
5, d) pre-multiplying the data to be detected with the Gaussian part with the characteristic by the training data of the Gaussian part with the characteristic to obtain a load matrix, and pre-multiplying the input variable of the data to be detected with the Gaussian part by the load matrix PGObtaining a fourth fault estimation vector of the data to be detected, and obtaining the data to be detected T by using the fourth fault estimation vectori 2Statistics and Q statistics.
7. The hierarchical non-gaussian process monitoring method based on common and unique feature extraction as claimed in claim 1, wherein:
in the step 6), the concrete steps are as follows:
the data to be detected starts to operate, modal conditions under M modals are totally generated in the industrial production process, the data to be detected of each known modality are monitored to obtain each statistical limit, the data to be detected of each current unknown modality are monitored to obtain each statistic, and the following judgment is carried out:
if the current unknown mode of the data to be detected
Figure FDA0002189898790000054
Statistic-
Figure FDA0002189898790000055
The statistics being obtained while being higher than the corresponding training data of the first modality
Figure FDA0002189898790000056
Statistical limit of statistics
Figure FDA0002189898790000057
Or
Figure FDA0002189898790000058
Statistical limit of statistics
Figure FDA0002189898790000059
The data to be detected in the current unknown mode is fault data;
if the current unknown mode of the data to be detected
Figure FDA00021898987900000510
Statistic-
Figure FDA00021898987900000511
Obtained with training data having statistics lower than the corresponding first modality
Figure FDA00021898987900000512
Statistical limit of statistics
Figure FDA00021898987900000513
Or
Figure FDA00021898987900000514
Statistical limit of statistics
Figure FDA00021898987900000515
Then judge
Figure FDA00021898987900000516
statistic/Ti 2Whether statistic/Q statistic is simultaneously higher than corresponding
Figure FDA00021898987900000517
Statistical limit of statisticsTi 2Statistical limit of statistics
Figure FDA00021898987900000519
And statistical limit Q of Q statistic_limit
If not, the data to be detected of the current unknown mode belongs to a first mode;
if the current mode is higher than the preset threshold, the training data of the second mode is called for judgment, and the data to be detected of the current unknown mode is judged
Figure FDA00021898987900000520
statistic/Ti 2Whether the statistic/Q statistic is simultaneously higher than the training data of the second modality
Figure FDA00021898987900000521
Statistical limit of statistics
Figure FDA00021898987900000522
Ti 2Statistical limit of
Figure FDA00021898987900000523
And statistical limit of Q_limit
If not, the data to be detected in the current unknown mode belongs to a second mode;
if the current modal is higher than the preset threshold, training data of a third modal is called for judgment, and the current unknown modal to-be-detected data is judged
Figure FDA00021898987900000524
statistic/Ti 2Whether the statistic/Q statistic is simultaneously higher than the corresponding training data of the third modality
Figure FDA0002189898790000061
Statistical limit of statistics
Figure FDA0002189898790000062
Ti 2Statistical limit of statistics
Figure FDA0002189898790000063
And statistical limit Q of Q statistic_limit
If not, the data to be detected in the current unknown mode belongs to a third mode;
if the current modal is higher than the preset threshold, the training data of the fourth modal is called for judgment, and the like is repeated until the current unknown modal to-be-detected data
Figure FDA0002189898790000064
statistic/Ti 2The statistic/Q statistic being higher than that obtained by modal training data
Figure FDA0002189898790000065
Statistical limit of statistics
Figure FDA0002189898790000066
Ti 2Statistical limit of statistics
Figure FDA0002189898790000067
And statistical limit Q of Q statistic_limitAnd considering the data to be detected in the current unknown mode as fault data.
8. The hierarchical non-gaussian process monitoring method based on common and unique feature extraction as claimed in claim 1, wherein:
and step 7) is also included after the step 6), specifically, the fault detection is carried out by utilizing the statistical limits and statistics in the step 3), the step 4) and the step 5).
9. The hierarchical non-gaussian process monitoring method based on common and unique feature extraction as claimed in claim 1, wherein:
the step 7), specifically, the statistical limit and the statistical quantity in the step 3), 4), 5) meet the data to be detected corresponding to any one of the following five conditions, namely, the industrial production process is considered to have a process fault:
for data of non-Gaussian part with common features in the data to be detected, the data is above the statistical limit
Figure FDA0002189898790000068
Data to be detected corresponding to the statistic;
for the data of Gaussian part with common features in the data to be detected, the data is above the statistical limit
Figure FDA0002189898790000069
Data to be detected corresponding to the statistic;
for data of non-Gaussian part with characteristic features in the data to be detected, the data is located above the statistical limit
Figure FDA00021898987900000610
Data to be detected corresponding to the statistic;
for the data of Gaussian part with characteristic features in the data to be detected, the data is positioned at T above the statistical limiti 2And the statistic and the data to be detected corresponding to the Q statistic.
CN201910828558.1A 2019-09-03 2019-09-03 Layered non-Gaussian process monitoring method based on public and special feature extraction Active CN110647922B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910828558.1A CN110647922B (en) 2019-09-03 2019-09-03 Layered non-Gaussian process monitoring method based on public and special feature extraction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910828558.1A CN110647922B (en) 2019-09-03 2019-09-03 Layered non-Gaussian process monitoring method based on public and special feature extraction

Publications (2)

Publication Number Publication Date
CN110647922A true CN110647922A (en) 2020-01-03
CN110647922B CN110647922B (en) 2023-03-24

Family

ID=69010156

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910828558.1A Active CN110647922B (en) 2019-09-03 2019-09-03 Layered non-Gaussian process monitoring method based on public and special feature extraction

Country Status (1)

Country Link
CN (1) CN110647922B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112307257A (en) * 2020-11-25 2021-02-02 中国计量大学 Short video click rate prediction method based on multi-information node graph network

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2437034A2 (en) * 2010-09-30 2012-04-04 Siemens Corporation System and method for conditional multi-output regression for machine condition monitoring
CN109507972A (en) * 2018-12-19 2019-03-22 中国计量大学 Industrial processes fault monitoring method based on layer-stepping non-gaussian monitoring algorithm
CN109523195A (en) * 2018-12-19 2019-03-26 中国计量大学 The fault detection method of layer-stepping non-gaussian algorithm based on the multi-modal classification of sample

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2437034A2 (en) * 2010-09-30 2012-04-04 Siemens Corporation System and method for conditional multi-output regression for machine condition monitoring
CN109507972A (en) * 2018-12-19 2019-03-22 中国计量大学 Industrial processes fault monitoring method based on layer-stepping non-gaussian monitoring algorithm
CN109523195A (en) * 2018-12-19 2019-03-26 中国计量大学 The fault detection method of layer-stepping non-gaussian algorithm based on the multi-modal classification of sample

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112307257A (en) * 2020-11-25 2021-02-02 中国计量大学 Short video click rate prediction method based on multi-information node graph network

Also Published As

Publication number Publication date
CN110647922B (en) 2023-03-24

Similar Documents

Publication Publication Date Title
WO2021004154A1 (en) Method for predicting remaining life of numerical control machine tool
CN106355030B (en) A kind of fault detection method based on analytic hierarchy process (AHP) and Nearest Neighbor with Weighted Voting Decision fusion
CN110543616B (en) SMT solder paste printing volume prediction method based on industrial big data
CN108960329B (en) Chemical process fault detection method containing missing data
CN104063719A (en) Method and device for pedestrian detection based on depth convolutional network
CN108345284B (en) Quality-related fault detection method based on two variable blocks
CN109409425B (en) Fault type identification method based on neighbor component analysis
CN103488091A (en) Data-driving control process monitoring method based on dynamic component analysis
CN109675935A (en) A kind of IPCA operation of rolling on-line fault diagnosis method becoming control limit
CN105955214B (en) Batch process fault detection method based on sample time-series and neighbour's affinity information
CN114429152A (en) Rolling bearing fault diagnosis method based on dynamic index antagonism self-adaption
CN114757269A (en) Complex process refined fault detection method based on local subspace-neighborhood preserving embedding
CN109901553B (en) Heterogeneous industrial big data collaborative modeling process fault monitoring method based on multiple visual angles
CN112214006A (en) Intermittent process fault detection method and system considering two-dimensional dynamic characteristics
CN115496108A (en) Fault monitoring method and system based on manifold learning and big data analysis
CN110647922B (en) Layered non-Gaussian process monitoring method based on public and special feature extraction
CN112947649B (en) Multivariate process monitoring method based on mutual information matrix projection
CN110673568A (en) Method and system for determining fault sequence of industrial equipment in glass fiber manufacturing industry
CN117407313A (en) Automatic quality testing method and system based on machine learning analysis
CN112183569A (en) FDA and SOM based intermittent industrial process reaction phase clustering and fault classification visualization
Yang et al. A novel decentralized weighted ReliefF-PCA method for fault detection
CN114647231A (en) Data-driven industrial process fault detection method
CN109523195B (en) Sample multi-modal classification-based fault detection method for hierarchical non-Gaussian algorithm
CN108596210B (en) Intelligent identification system and method for mechanical component butt joint state
CN109034172B (en) Product appearance defect detection method based on fuzzy relaxation constraint multi-core learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant