CN110647922B - Layered non-Gaussian process monitoring method based on public and special feature extraction - Google Patents

Layered non-Gaussian process monitoring method based on public and special feature extraction Download PDF

Info

Publication number
CN110647922B
CN110647922B CN201910828558.1A CN201910828558A CN110647922B CN 110647922 B CN110647922 B CN 110647922B CN 201910828558 A CN201910828558 A CN 201910828558A CN 110647922 B CN110647922 B CN 110647922B
Authority
CN
China
Prior art keywords
data
gaussian
detected
input
output
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910828558.1A
Other languages
Chinese (zh)
Other versions
CN110647922A (en
Inventor
何雨辰
韩丽黎
王云
宋执环
曾九孙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Jiliang University
Original Assignee
China Jiliang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Jiliang University filed Critical China Jiliang University
Priority to CN201910828558.1A priority Critical patent/CN110647922B/en
Publication of CN110647922A publication Critical patent/CN110647922A/en
Application granted granted Critical
Publication of CN110647922B publication Critical patent/CN110647922B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211Selection of the most significant subset of features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Testing Or Calibration Of Command Recording Devices (AREA)
  • Testing And Monitoring For Control Systems (AREA)

Abstract

The invention discloses a layered non-Gaussian process monitoring method based on public and special feature extraction. Dividing training data into a plurality of modes, and obtaining high-order public characteristics of the training data by applying a plurality of weight vectors, component vectors and other characteristic quantities in each mode; obtaining low-order public features of the training data by applying a plurality of weight vectors, component vectors and other feature quantities in the remaining low-order modes; constructing statistical limits and statistics in the public subspace according to the public characteristics, and carrying out fault detection; and constructing statistical limits and statistics in the remaining unique subspaces to detect faults. The method is superior to other traditional methods in multi-modal non-Gaussian process fault detection, can extract the special characteristics of the multi-modal and can also extract the common characteristics, and considers the mutual connection of the characteristics and the commonality among different modes, so that the multi-modal process monitoring is more effective.

Description

Layered non-Gaussian process monitoring method based on public and special feature extraction
Technical Field
The invention belongs to the field of industrial process system engineering, and relates to a multi-mode fault monitoring method applied to complex industrial processes such as chemical production and the like.
Background
The complexity of the complex industrial process is reflected in that the process of the modern industrial production generates a large amount of data, and the process data generally have different process characteristics and are not clear in distribution characteristics; the industrial process has a plurality of units, and the data has a mechanism relation; moreover, the traditional method mostly assumes that the chemical process is operated under a single and stable operation condition, but actually the plant-level data is multi-modal, so how to better monitor the multi-modal problem is very worthy of study.
In the past, a great deal of research has been conducted by researchers on advanced data classification and information fusion methods, so that the new measurement data can be accurately distributed to actual operation modes, and the monitoring results can be displayed more accurately. It is noted that there are some process monitoring methods that study global process monitoring methods by using different patterns of data simultaneously, such as support vector data description, k-nearest neighbor based methods, multi-block partial least squares. However, most of these methods related to multi-modal process monitoring can only acquire the unique features of each mode data, and ignore the common features among each mode data. In actual monitoring, it would be beneficial to monitor multimodal processes if the correlation of characteristics and commonalities between different modes could be obtained in advance. Some researchers have proposed a two-step model framework, the first step being the need to develop separate process monitoring models, and the second step being the examination of similarities between them to obtain models representing common features. There have also been some studies attempting to divide the data of each mode into common and unique parts by using the data of multiple modes simultaneously, and although this method has been further extended to the monitoring of the transition process between modes, this method can be used simply for the case of calculating both modes. Moreover, the common features and characteristic properties between different modes are not clearly explained in the geometrical relationship, so that the method has no way to carry out deeper research on the multimode process. A two-step multi-set principal component analysis method has also been proposed to obtain common basis vectors for multiple sets of data. The scholars propose a multimode process monitoring method based on a least square subspace based on the obtained common base vector, although the multimode process monitoring method also successfully extends to the conversion process between monitoring modes, the common base vector does not span any subspace of subset data and only serves as an auxiliary vector to improve the correlation of a cross set to the maximum extent, and therefore, the common characteristics of multimode data cannot be accurately obtained by directly mapping the data to the subspace. Therefore, how to extract the relationship among the multi-modal data and apply different models to the closely-connected multi-modal data becomes an important problem for fault monitoring.
Disclosure of Invention
In order to overcome the problems that in the multi-mode monitoring process, the traditional multi-mode process fault monitoring is carried out, only the specific characteristics of each mode data are developed, and the mutual connection of the characteristics and the commonality of different modes is neglected, the invention aims to provide a layered non-Gaussian process monitoring method based on common and specific characteristic extraction.
The invention seeks to simultaneously construct process monitoring models from all mode data, obtain a common subspace by capturing common characteristics behind different modes, obtain an independent subspace reflecting the characteristic characteristics of each mode, and then carry out fault detection method research on each subspace.
The purpose of the invention is realized by the following technical scheme:
firstly, the classified multi-modal data are classified to obtain common characteristics through weight vector parameters and weight vector scores, and the common characteristics are utilized to monitor faults. And introducing a double-layer non-Gaussian monitoring method aiming at the residual characteristic features to monitor the faults.
The specific method comprises the following steps:
1) Acquiring data of an input variable x and an output variable y of the industrial production process as training data through a sensor under the condition that no fault is known in the industrial production process;
under the condition that the industrial production process needs to detect faults, acquiring an input variable x of the industrial production process on line through a sensor test And an output variable y test The data to be detected is used as the data to be detected;
2) Classifying the training data by adopting a fault detection method of a layered non-Gaussian algorithm based on sample multi-modal classification to obtain classified training data;
3) Aiming at the classified training data, each classification corresponds to a mode, an optimization objective function is established in each mode by applying a plurality of weight vectors, component vectors and other characteristic quantities, common characteristics of the classified training data and the data to be detected are obtained, according to the common characteristics, the step iii) in the industrial production process fault monitoring method based on the hierarchical non-Gaussian monitoring algorithm is adopted to extract the training data of the non-Gaussian part with the common characteristics in each mode, and the statistical limit of the training data of the non-Gaussian part with the common characteristics and the statistical quantity of the data to be detected are processed;
4) After training data of a non-Gaussian part with common characteristics are obtained, extracting the common characteristics of the remaining Gaussian part data, establishing an optimization objective function by using a plurality of weight vectors, component vectors and other characteristic quantities in each mode for the Gaussian part data to obtain the common characteristics of the Gaussian part data, and combining a partial least square method according to the common characteristics to obtain the statistical limit of the training data of the Gaussian part with the common characteristics and the statistical quantity of data to be detected;
5) After Gaussian part data with common characteristics and non-Gaussian part data with common characteristics are extracted, the rest data are data with characteristic characteristics, and according to the data with characteristic characteristics, the iv) step in the industrial production process fault monitoring method based on the hierarchical non-Gaussian monitoring algorithm is adopted to obtain the statistical limit of training data with characteristic characteristics and the statistical quantity of data to be detected;
6) And classifying the data to be detected in an online identification mode to obtain the monitored and classified data to be detected.
The input variable x is, for example, a process variable in an industrial process.
The output variable y is, for example, a quality variable in an industrial process.
In the step 3), the concrete steps are as follows:
3.a) all the input variables X of the industrial production process are formed into an input variable data set X, wherein X belongs to R n×M·m Carrying out standardization processing on each input variable X in an input variable data set X, wherein n is the number of sampling values in the input variable X, the input variable X is composed of a series of sampling values, M is the number of the input variables, and M is the classification number of training data, namely the mode number; outputting the variable data set as Y in the same way;
3.b) the classified input variable data is expressed as: x = [ X = (1) X (2) ... X (M) ]The output variable data is expressed as:Y=[Y (1) Y (2) ... Y (M) ];X (1) X (2) ... X (M) Input variable data respectively expressed as first to Mth classifications, Y (1) Y (2) ... Y (M) Output variable data respectively expressed as first to Mth classifications;
3.c) in this way, the component variable t of the input variable x for each modality in the data is obtained using the following formula c And a component vector u of an output variable y of each modality c
Figure BDA0002189898800000031
Figure BDA0002189898800000032
Wherein p is c And q is c Respectively single input and output common weight vectors; α, β are input and output component weight vector parameters for combining M pieces of modal data, α = [ α ] (1) α (2) ... α (M) ],α (1) α (2) ... α (M) Expressed as input component weight vector parameters of first to Mth classes, respectively, β = [ β ] (1) β (2) ... β (M) ],β (1) β (2) ... β (M) Output component weight vector parameters of a first classification to an Mth classification are respectively expressed, wherein i represents an ith mode;
establishing a first objective function as:
max{I(t c ,u c )}
s.t.||p c ||=||q c ||=||α||=||β||=1
wherein, I (t) c ,u c ) Component variable t being input variable x c And component vector u of output variable y c Mutual information value between them, | | | | represents modulus;
3.d) using iterative algorithm to obtain input and output common weight vector p c 、q c And input and output component weight vector parameters alpha and beta, and then processing to obtain non-Gaussian partial data with common features, which is expressed as
Figure BDA0002189898800000041
And processed by training data for non-Gaussian portions having a common characteristic to result in->
Figure BDA0002189898800000042
Statistical limit for statistics>
Figure BDA0002189898800000043
3.e) pre-multiplying the input variable of the data to be detected by p c Obtaining a first fault estimation vector of the data to be detected, and obtaining the data to be detected by using the first fault estimation vector
Figure BDA0002189898800000044
Statistics are obtained.
The step 3.d) uses the following iterative algorithm to solve and obtain the input and output public weight vector p c 、q c And input and output component weight vector parameters α, β:
(3.d.1) randomly initializing input and output component weight vector parameters alpha and beta;
(3.d.2) substituting the input and output component weight vector parameters alpha and beta into the classified input variable data and the classified output variable data to obtain weighted input data
Figure BDA0002189898800000045
And output data->
Figure BDA0002189898800000046
(3.d.3) solving input and output public weight vectors by adopting a fault monitoring method in the industrial production process based on a layered non-Gaussian monitoring algorithm;
(3.d.4) inputting and outputting the public weight vector p obtained in the last step c And q is c Brought into the first object boxCounting;
(3.d.5) solving input and output component weight vector parameters alpha and beta by using a particle swarm algorithm;
(3.d.6) repeating the steps (2) - (6) until α, β and p c ,q c Convergence, i.e. the four values are all less than the respective preset threshold.
Step 4), removing the data of the non-gaussian part obtained in step 3) from the training data, and remaining the training data of the gaussian part, and then the specific steps are as follows:
4.a) the gaussian data for the classified input variables is represented as: x G =[X G(1) X G(2) ... X G(M) ]The gaussian data for the output variable is expressed as: y is G =[Y G(1) Y G(2) ... Y G(M) ];X G(1) X G(2) ... X G(M) Gaussian data, Y, respectively expressed as input variables of the first to Mth classifications G(1) Y G(2) ... Y G(M) Gaussian data respectively expressed as output variables of the first to mth classifications;
4.b) in this way, the component variable t of the input variable x for each modality in the gaussian partial data is obtained using the following formula c,G And a component vector u of an output variable y of each modality c,G
Figure BDA0002189898800000051
Figure BDA0002189898800000052
Wherein p is c,G And q is c,G A single input, output common weight vector of the gaussian partial data, respectively; alpha is alpha G And beta G Is the input and output component weight vector parameter, alpha, of the combined M modal data of the Gaussian partial data G =[α G(1) α G(2) ... α G(M) ],α G(1) α G(2) ... α G(M) Respectively expressed as the first to Mth classificationsInput component weight vector parameter, β, of a class G =[β G(1) β G(2) ... β G(M) ],β G(1) β G(2) ... β G(M) Output component weight vector parameters of a first classification to an Mth classification are respectively expressed, wherein i represents an ith mode;
establishing a second objective function as:
max{I(t c,G ,u c,G )}
s.t.||p c,G ||=||q c,G ||=||α G ||=||β G ||=1
wherein, I (t) c,G ,u c,G ) Component variable t of input variable x in Gaussian part data c,G And component vector u of output variable y c,G The mutual information value between, | | | | represents the modulus;
4.c) using iterative algorithm to solve and obtain input and output common weight vector p in Gaussian partial data c,G 、q c,G And the input and output component weight vector parameter alpha G 、β G Then processing to obtain training data of a Gaussian part with common characteristics;
and T is obtained by processing the training data of Gaussian part with common characteristics c 2 Statistical limit of statistics
Figure BDA0002189898800000055
4.d) pre-multiplying the input variable of the data to be detected by p c,G Obtaining a second fault estimation vector of the data to be detected, and obtaining T of the data to be detected by using the second fault estimation vector in combination with a partial least square method c 2 Statistics are obtained.
The step 4.c) uses the following iterative algorithm to solve and obtain the input and output public weight vector p in the Gaussian partial data c,G 、q c,G And input and output component weight vector parameter alpha G 、β G
(4.c.1) randomly initializing input and output component weight vector parameter alpha in Gaussian partial data G 、β G
(4.c.2) Weighting vector parameter alpha of input and output components in Gaussian partial data G 、β G The classified input variable data and the classified output variable data are brought in to obtain weighted input data
Figure BDA0002189898800000053
And output data
Figure BDA0002189898800000054
(4.c.3) solving input and output public weight vectors p in Gaussian part data by adopting industrial production process fault monitoring method based on hierarchical non-Gaussian monitoring algorithm c,G And q is c,G
(4.c.4) inputting and outputting the public weight vector p in the Gaussian part data obtained in the last step c,G And q is c,G Is brought into a second objective function;
(4.c.5) solving weight vector parameter alpha of input and output components in Gaussian partial data by using particle swarm optimization G 、β G
(4.c.6) repeating the processes of steps (2) - (6) until alpha G 、β G And p c,G 、q c,G Convergence, i.e. the four values are all less than the respective preset threshold.
In the step 5), the concrete steps are as follows:
5.a) as the common weight vector p c After the determination, the training data of each modality with the characteristic features remaining is obtained by processing in the following way:
X I(i) =X (i) -X (i) *p c p c T
Figure BDA0002189898800000061
wherein, X I(i) Input variable data set representing training data of the i-th modality with characteristic features, Y I(i) An output variable data set representing training data of the ith modality with characteristic features, X (i) Input variable data set, Y, representing training data for the ith modality (i) An output variable data set representing training data for an ith modality;
5.b) for training data of each mode with special characteristics, adopting step iii) to step iv) of fault monitoring method of industrial production process based on hierarchical non-Gaussian monitoring algorithm to process step by step to obtain load matrix for non-Gaussian part and Gaussian part of training data respectively, and obtaining I by processing the load matrix i 2 Statistical limit corresponding to statistical quantity
Figure BDA0002189898800000062
5.c) removing the obtained non-gaussian part data from the training data with characteristic features, and obtaining load matrixes for the gaussian part and the gaussian part of the training data by processing the training data with characteristic features in the steps from step iii to step iv in the fault monitoring method of the industrial production process based on the hierarchical non-gaussian monitoring algorithm according to the training data with characteristic features, and obtaining T by processing the load matrixes i 2 Statistical limits corresponding to the statistics and Q statistics
Figure BDA0002189898800000063
And statistical limit Q _limit
5.d) pre-multiplying the data to be detected with the gaussian part having the characteristic by the training data of the gaussian part having the characteristic to obtain a load matrix, pre-multiplying the input variable of the data to be detected with the gaussian part by the load matrix P G Obtaining a fourth fault estimation vector of the data to be detected, and obtaining the data to be detected T by using the fourth fault estimation vector i 2 Statistics and Q statistics.
In the step 6), the concrete steps are as follows:
the data to be detected starts to operate, but the types of modal working conditions are unknown, one mode corresponds to one modal working condition, the industrial production process has modal working conditions under M modes in total, the data to be detected of each known mode is monitored to obtain each statistical limit, the data to be detected of each current unknown mode is monitored to obtain each statistical quantity, and the following judgment is carried out:
if the current unknown mode of the data to be detected
Figure BDA0002189898800000071
statistic/T c 2 The statistic is simultaneously higher than that obtained by the corresponding training data of the first modality representing the common feature>
Figure BDA0002189898800000072
Statistical limit of statistic->
Figure BDA0002189898800000073
Or T c 2 Statistical limit of statistic->
Figure BDA0002189898800000074
The data to be detected in the current unknown mode is fault data;
if the current unknown mode of the data to be detected
Figure BDA0002189898800000075
statistic/T c 2 A statistic lower than that obtained by corresponding training data of the first modality representing the common signature>
Figure BDA0002189898800000076
Statistical limit of statistic->
Figure BDA0002189898800000077
Or T c 2 Statistical limit of statistic->
Figure BDA0002189898800000078
Then makes a judgment>
Figure BDA00021898988000000720
statistic/T i 2 statistics/Q systemWhether the measurement is simultaneously higher than the corresponding ^ representing characteristic feature>
Figure BDA0002189898800000079
Statistical limit of statistics
Figure BDA00021898988000000710
T i 2 Statistical limit of statistic->
Figure BDA00021898988000000711
And a statistical limit Q of the Q statistic _limit
If not, the data to be detected of the current unknown mode belongs to a first mode;
if the current modal is higher than the preset threshold, training data of a second modal is called to be judged, and the data to be detected of the current unknown modal is judged
Figure BDA00021898988000000718
statistic/T i 2 Whether the statistic/Q statistic is simultaneously greater than +obtained by training data representing a second modality of characteristic feature>
Figure BDA00021898988000000719
Statistical limit for statistics>
Figure BDA00021898988000000712
T i 2 Is greater than or equal to>
Figure BDA00021898988000000713
And statistical limit of Q _limit
If not, the data to be detected in the current unknown mode belongs to a second mode;
if the current modal is higher than the preset threshold, training data of a third modal is called for judgment, and the current unknown modal to-be-detected data is judged
Figure BDA00021898988000000721
statistic/T i 2 Statistics ofWhether the quantity/Q statistic is simultaneously higher than I obtained from corresponding training data of a third modality representing a characteristic feature i 2 Statistical limit of statistic->
Figure BDA00021898988000000714
T i 2 Statistical limit of statistic->
Figure BDA00021898988000000715
And statistical limit Q of Q statistic _limit
If not, the data to be detected of the current unknown mode belongs to a third mode;
if the current modal is higher than the first modal, calling the training data of the fourth modal for judgment, and so on until the current unknown modal to be detected data I i 2 statistic/T i 2 The statistic/Q statistic being higher than that obtained by modal training data
Figure BDA00021898988000000722
Statistical limit of statistic->
Figure BDA00021898988000000716
T i 2 Statistical limit of statistic->
Figure BDA00021898988000000717
And statistical limit Q of Q statistic _limit And considering the data to be detected in the current unknown mode as fault data, and finishing the monitoring and classification of the data to be detected.
Step 7) is also included after the step 6), specifically, the fault detection is carried out by using the statistical limit and the statistical quantity in the step 3), the step 4) and the step 5).
The step 7), specifically, the statistical limit and the statistical quantity in the step 3), 4), 5) meet the data to be detected corresponding to any one of the following five conditions, namely, the industrial production process is considered to have a process fault:
for data of non-Gaussian part with common characteristic in data to be detectedAbove statistical limits
Figure BDA00021898988000000723
Data to be detected corresponding to the statistic;
for the data of Gaussian part with common features in the data to be detected, the data is positioned at T above the statistical limit c 2 Data to be detected corresponding to the statistic;
for data of non-Gaussian part with characteristic features in the data to be detected, the data is located above the statistical limit
Figure BDA0002189898800000081
Data to be detected corresponding to the statistic;
for the data of Gaussian part with characteristic features in the data to be detected, the data is positioned at T above the statistical limit i 2 And the statistic and the data to be detected corresponding to the Q statistic.
The invention has the beneficial effects that:
and extracting multi-modal common features and special features for tracking the running performance of the process and the quality of the process. The method seeks to simultaneously construct process monitoring models from all mode data, obtain a common subspace by capturing common features behind different modes, obtain an independent subspace reflecting the characteristic features of each mode, and then carry out fault detection method research on each subspace.
The method is superior to other traditional methods in fault detection of the non-Gaussian multi-modal process, can fully consider the highly complex coupling relation among variables, can extract the non-Gaussian part in the data with unknown distribution characteristics, and can effectively extract the common characteristics and the unique characteristics of the multi-modal data, so that the fault monitoring of the multi-modal process is more efficient and accurate.
The method is superior to other traditional methods in multi-modal non-Gaussian process fault detection, can extract the special characteristics of the multi-modal and can also extract the common characteristics, and considers the mutual connection of the characteristics and the commonality among different modes, so that the multi-modal process monitoring is more effective.
Drawings
FIG. 1 is a graph of the classification results of the multi-modal process of the present invention.
Fig. 2 is a diagram of the first failure detection result of the present invention.
Fig. 3 is a diagram of a second failure detection result of the present invention.
Detailed Description
The method of the present invention is described in detail below with reference to the accompanying drawings and specific embodiments.
The specific implementation case adopted by the invention is a Tennessee-Emament (TE) process, which comprises five main units: a reactor, a condenser, a compressor, a separator, and a stripper.
The product stream from the reactor was cooled by a condenser and then sent to a vapor/liquid separator. The vapor from the separator is recycled to the reactor by means of a compressor. To prevent the accumulation of inert components and reaction by-products in the process, a portion of the recycle stream must be vented. The condensed components from the separator (stream 10) are pumped to the stripper. Stream 4 is used to strip the remaining reactants in stream 10 which are combined with the recycle stream via stream 5 for further reaction.
In order to monitor the faults of the TE process, ensure the safety of the actual chemical production process and improve the economic benefit, the Tennessee-Marshman (TE) process has 41 measurement variables (including 22 process measurement variables and 19 component measurement variables) and 12 control variables, 31 variables are selected from the 22 process measurement variables and the 12 control variables as input variables as shown in Table 1 and 6 component measurement variables are selected from the 19 measurement variables as output variables, namely quality variables as shown in Table 2, when the process monitoring variables are determined. The tennessee-eastman process also simulated 21 faults as shown in table 3.
TABLE 1
Figure BDA0002189898800000091
TABLE 2
Figure BDA0002189898800000092
TABLE 3
Figure BDA0002189898800000101
In order to demonstrate the superiority of the proposed processing method, the present invention is embodied as a multi-modal process case including different stable models, from which 3000 samples are collected in total, and a data matrix of 50 variables is used as a training data set, and the multi-modal process is divided into three modes, wherein the first mode includes 1000 samples, the second mode includes 1000 samples, and the third mode includes 1000 samples, as shown in fig. 1. In order to achieve the purpose of fault detection, two faults are designed, wherein the fault 1 is that a step fault is added to a sample from 300 to 1000 of a 22 th variable of a first modality, and the fault 2 is that a slope fault is added to a sample from 1500 to 2000 of the 22 th variable of a second modality, and the fault detection is respectively carried out as a data set to be detected, and the specific steps are as follows:
1) Under the condition that no fault is known in the industrial production process, acquiring data of an input variable x and an output variable y in the industrial production process as training data through a sensor, and carrying out standardization processing to enable the mean value of the data to be 0 and the variance to be 1;
under the condition that the industrial production process needs to detect faults, acquiring an input variable x of the industrial production process on line through a sensor test And an output variable y test The two fault data are used as data to be detected, and standardized processing is carried out, so that the mean value of the data is 0, and the variance is 1, and the data are used as the data to be detected;
2) Classifying the training data by adopting a fault detection method of a layered non-Gaussian algorithm based on sample multi-modal classification to obtain classified training data;
3) Aiming at the classified training data, each classification corresponds to one mode, an optimization objective function is established in each mode by using a plurality of weight vectors, component vectors and other characteristic quantities to obtain the classified training data and the common characteristics of the data to be detected, the step iii) in the fault monitoring method of the industrial production process based on the hierarchical non-Gaussian monitoring algorithm is adopted according to the common characteristics to extract the training data of the non-Gaussian part with the common characteristics in each mode, and the statistical limit of the training data of the non-Gaussian part with the common characteristics and the statistical quantity of the data to be detected are processed;
4) After training data of a non-Gaussian part with common characteristics are obtained, extracting the common characteristics of the rest Gaussian part data, establishing an optimized objective function by applying a plurality of weight vectors, component vectors and other characteristic quantities in each mode for the Gaussian part data to obtain the common characteristics of the Gaussian part data, and obtaining the statistical limit of the training data of the Gaussian part with the common characteristics and the statistical quantity of data to be detected according to the common characteristics by combining a partial least square method;
5) After Gaussian part data with common characteristics and non-Gaussian part data with common characteristics are extracted, the rest data are data with characteristic characteristics, and according to the data with characteristic characteristics, the iv) step in the industrial production process fault monitoring method based on the hierarchical non-Gaussian monitoring algorithm is adopted to obtain the statistical limit of training data with characteristic characteristics and the statistical quantity of data to be detected;
6) And classifying the data to be detected in an online identification mode to obtain the monitored and classified data to be detected.
The invention can simultaneously construct the process monitoring model from all mode data, obtain the public subspace by capturing the public characteristics behind different modes, and obtain the independent subspace reflecting the characteristic characteristics of each mode, and then carry out the fault detection method research on each subspace, thereby improving the accuracy and effectiveness of fault monitoring.
The above embodiments are disclosed to illustrate the present invention, not to limit the present invention, and any modifications and changes made within the spirit of the present invention and the scope of the claims fall within the scope of the present invention.

Claims (9)

1. A layered non-Gaussian process monitoring method based on public and unique feature extraction is characterized by comprising the following steps:
1) Acquiring data of an input variable x and an output variable y of the industrial production process as training data through a sensor under the condition that no fault is known in the industrial production process;
under the condition that the industrial production process needs to detect faults, acquiring an input variable x of the industrial production process on line through a sensor test And an output variable y test The data to be detected is used as the data to be detected;
2) Classifying the training data to obtain classified training data;
3) Aiming at the classified training data, each classification corresponds to one mode, an optimization objective function is established in each mode by using a plurality of weight vectors and feature quantities of component vectors to obtain the classified training data and the common features of the data to be detected, the training data of the non-Gaussian part with the common features in each mode are extracted according to the common features, and the statistical limit of the training data of the non-Gaussian part with the common features and the statistical quantity of the data to be detected are processed;
4) After training data of a non-Gaussian part with common characteristics are obtained, extracting the common characteristics of the remaining Gaussian part data, establishing an optimization objective function by using the characteristic quantities of a plurality of weight vectors and component vectors in each mode for the Gaussian part data to obtain the common characteristics of the Gaussian part data, and obtaining the statistical limit of the training data of the Gaussian part with the common characteristics and the statistical quantity of data to be detected according to the common characteristics by combining a partial least square method;
5) After Gaussian partial data with common characteristics and non-Gaussian partial data with common characteristics are extracted, the rest data are data with characteristic characteristics, and statistical limits of training data with characteristic characteristics and statistics of data to be detected are obtained according to the data with characteristic characteristics;
6) And classifying the data to be detected in an online identification mode to obtain the monitored and classified data to be detected.
2. The hierarchical non-gaussian process monitoring method based on common and unique feature extraction as claimed in claim 1, wherein:
in the step 3), the concrete steps are as follows:
3.a) all input variables X of industrial production process are formed into input variable data set X, X belongs to R n×M·m Standardizing each input variable X in an input variable data set X, wherein n is the number of sampling values in the input variable X, M is the number of input variables, and M is the classification number of training data, namely a modal number;
3.b) the classified input variable data is expressed as: x = [ X = (1) X (2) ... X (M) ]The output variable data is expressed as: y = [ Y) (1) Y (2) ... Y (M) ];X (1) X (2) ... X (M) Input variable data respectively expressed as first to Mth classifications, Y (1) Y (2) ... Y (M) Output variable data respectively expressed as first to Mth classifications;
3.c) in this way, the component variable t of the input variable x for each modality in the data is obtained using the following formula c And a component vector u of an output variable y of each modality c
Figure FDA0004002502170000021
Figure FDA0004002502170000022
Wherein p is c And q is c Respectively single input and output common weight vectors; alpha, beta is the input and output component weight vector of the combined M modal dataParameter, α = [ ] (1) α (2) ... α (M) ],α (1) α (2) ... α (M) Expressed as input component weight vector parameters of first to Mth classes, respectively, β = [ β ] (1) β (2) ... β (M) ],β (1) β (2) ... β (M) Output component weight vector parameters of a first classification to an Mth classification are respectively expressed, wherein i represents an ith mode;
establishing a first objective function as:
max{I(t c ,u c )}
s.t.||p c ||=||q c ||=||α||=||β||=1
wherein, I (t) c ,u c ) A component variable t being an input variable x c And component vector u of output variable y c The mutual information value between, | | | | represents the modulus;
3.d) using iterative algorithm to obtain input and output common weight vector p c 、q c And inputting and outputting component weight vector parameters alpha and beta, then processing to obtain non-Gaussian part data with common features, and processing the non-Gaussian part data with common features to obtain the non-Gaussian part data
Figure FDA0004002502170000023
Statistical limits of statistics
Figure FDA0004002502170000024
3.e) pre-multiplying an input variable of data to be detected by p c Obtaining a first fault estimation vector of the data to be detected, and obtaining the data to be detected by using the first fault estimation vector
Figure FDA0004002502170000025
Statistics are obtained.
3. The hierarchical non-gaussian process monitoring method based on common and unique feature extraction as claimed in claim 2, wherein:
the step 3.d) uses the following iterative algorithm to solve and obtain the input and output public weight vector p c 、q c And input and output component weight vector parameters α, β:
(3.d.1) randomly initializing input and output component weight vector parameters alpha and beta;
(3.d.2) substituting the input and output component weight vector parameters alpha and beta into the classified input variable data and the classified output variable data to obtain weighted input data
Figure FDA0004002502170000026
And output data
Figure FDA0004002502170000027
(3.d.3) solving an input and output public weight vector;
(3.d.4) inputting and outputting the public weight vector p obtained in the last step c And q is c Is brought into a first objective function;
(3.d.5) solving input and output component weight vector parameters alpha and beta by using a particle swarm algorithm;
(3.d.6) repeating the steps (2) - (6) until α, β and p c ,q c And (6) converging.
4. The hierarchical non-gaussian process monitoring method based on common and unique feature extraction as claimed in claim 1, wherein:
step 4), removing the data of the non-gaussian part obtained in step 3) from the training data, and remaining the training data of the gaussian part, and then the specific steps are as follows:
4.a) the gaussian data for the classified input variables is represented as: x G =[X G(1) X G(2) ... X G(M) ]The gaussian data for the output variable is expressed as: y is G =[Y G(1) Y G(2) ... Y G(M) ];X G(1) X G(2) ... X G(M) Gaussian data respectively expressed as input variables of first to Mth classifications,Y G(1) Y G(2) ... Y G(M) Gaussian data respectively expressed as output variables of the first to mth classifications;
4.b) in this way, the component variable t of the input variable x for each modality in the gaussian partial data is obtained using the following formula c,G And a component vector u of an output variable y of each modality c,G
Figure FDA0004002502170000031
Figure FDA0004002502170000032
Wherein p is c,G And q is c,G A single input, output common weight vector of the gaussian partial data, respectively; alpha is alpha G And beta G Is the input and output component weight vector parameter, alpha, of the combined M modal data of the Gaussian partial data G =[α G(1) α G(2) ... α G(M) ],α G(1) α G(2) ... α G(M) Expressed as the input component weight vector parameter, beta, of the first to Mth classes, respectively G =[β G(1) β G(2) ... β G(M) ],β G(1) β G(2) ... β G(M) Output component weight vector parameters of a first classification to an Mth classification are respectively expressed, wherein i represents an ith mode;
establishing a second objective function as:
max{I(t c,G ,u c,G )}
s.t.||p c,G ||=||q c,G ||=||α G ||=||β G ||=1
wherein, I (t) c,G ,u c,G ) Component variable t of input variable x in Gaussian part data c,G And component vector u of output variable y c,G The mutual information value between, | | | | represents the modulus;
4.c) using iterative algorithm to solve and obtain Gaussian partial dataInput and output common weight vector p c,G 、q c,G And input and output component weight vector parameter alpha G 、β G Then processing to obtain training data of a Gaussian part with common characteristics;
and T is obtained by processing the training data of Gaussian part with common characteristics c 2 Statistical limit of statistics
Figure FDA0004002502170000041
4.d) pre-multiplying the input variable of the data to be detected by p c,G Obtaining a second fault estimation vector of the data to be detected, and obtaining T of the data to be detected by using the second fault estimation vector in combination with a partial least square method c 2 Statistics are obtained.
5. The hierarchical non-Gaussian process monitoring method based on common and unique feature extraction as claimed in claim 4, wherein:
the step 4.c) uses the following iterative algorithm to solve and obtain the input and output public weight vector p in the Gaussian partial data c,G 、q c,G And input and output component weight vector parameter alpha G 、β G
(4.c.1) randomly initializing input and output component weight vector parameters alpha in Gaussian partial data G 、β G
(4.c.2) inputting and outputting the weight vector parameter alpha of the input and output components in the Gaussian part data G 、β G The classified input variable data and the classified output variable data are brought in to obtain weighted input data
Figure FDA0004002502170000042
And output data
Figure FDA0004002502170000043
(4.c.3) solving the Gaussian fraction by adopting the industrial production process fault monitoring method based on the layered non-Gaussian monitoring algorithmAccording to which the common weight vector p is input and output c,G And q is c,G
(4.c.4) inputting and outputting the public weight vector p in the Gaussian part data obtained in the last step c,G And q is c,G Is brought into a second objective function;
(4.c.5) solving weight vector parameter alpha of input and output components in Gaussian partial data by using particle swarm optimization G 、β G
(4.c.6) repeating the processes of steps (2) - (6) until alpha G 、β G And p c,G 、q c,G And (6) converging.
6. The hierarchical non-gaussian process monitoring method based on common and unique feature extraction as claimed in claim 1, wherein:
in the step 5), the concrete steps are as follows:
5.a) as the common weight vector p c After the determination, the training data of each modality with the characteristic features remaining is obtained by processing in the following way:
X I(i) =X (i) -X (i) *p c p c T
Figure FDA0004002502170000044
wherein, X I(i) Input variable data set representing training data of the i-th modality with characteristic features, Y I(i) Output variable data set representing training data of the ith modality with characteristic features, X (i) Input variable data set, Y, representing training data for the ith modality (i) An output variable data set representing training data for an ith modality;
5.b) for training data of each mode with specific characteristics, processing by industrial production process fault monitoring method based on hierarchical non-Gaussian monitoring algorithm to obtain load matrix for non-Gaussian part and Gaussian part of training data, and processing by using the load matrix to obtain load matrix
Figure FDA0004002502170000051
Statistical limit corresponding to statistical quantity
Figure FDA0004002502170000052
5.c) removing the above obtained non-gaussian part data from the training data with characteristic features, and processing the training data with gaussian part with characteristic features by using a fault monitoring method based on a hierarchical non-gaussian monitoring algorithm to obtain load matrices for gaussian part and gaussian part of the training data, and processing the load matrices to obtain T i 2 Statistical limits corresponding to the statistics and Q statistics
Figure FDA0004002502170000053
And statistical limit Q _limit
5.d) pre-multiplying the data to be detected with the gaussian part having the characteristic by the training data of the gaussian part having the characteristic to obtain a load matrix, pre-multiplying the input variable of the data to be detected with the gaussian part by the load matrix P G Obtaining a fourth fault estimation vector of the data to be detected, and obtaining the data to be detected T by using the fourth fault estimation vector i 2 Statistics and Q statistics.
7. The method of claim 1, wherein the method comprises the following steps:
in the step 6), the concrete steps are as follows:
the data to be detected starts to operate, modal conditions under M modals are totally generated in the industrial production process, the data to be detected of each known modality are monitored to obtain each statistical limit, the data to be detected of each current unknown modality are monitored to obtain each statistic, and the following judgment is carried out:
if when it is usedOf data to be detected in a previously unknown modality
Figure FDA0004002502170000054
statistic/T c 2 The statistics being obtained while being higher than the corresponding training data of the first modality
Figure FDA0004002502170000055
Statistical limit of statistics
Figure FDA0004002502170000056
Or T c 2 Statistical limits of statistics
Figure FDA00040025021700000517
The data to be detected in the current unknown mode is fault data;
if the current unknown mode of the data to be detected
Figure FDA0004002502170000057
statistic/T c 2 The statistics being obtained from training data lower than the corresponding first modality
Figure FDA0004002502170000058
Statistical limit of statistics
Figure FDA0004002502170000059
Or T c 2 Statistical limit of statistics
Figure FDA00040025021700000518
Then judge
Figure FDA00040025021700000510
statistic/T i 2 Whether statistic/Q statistic is simultaneously higher than corresponding
Figure FDA00040025021700000511
Statistical limit of statistics
Figure FDA00040025021700000512
T i 2 Statistical limit of statistics
Figure FDA00040025021700000519
And statistical limit Q of Q statistic _limit
If not, the data to be detected of the current unknown mode belongs to a first mode;
if the current mode is higher than the preset threshold, the training data of the second mode is called for judgment, and the data to be detected of the current unknown mode is judged
Figure FDA00040025021700000513
statistic/T i 2 Whether the statistic/Q statistic is simultaneously higher than the training data of the second modality
Figure FDA00040025021700000514
Statistical limit of statistics
Figure FDA00040025021700000515
T i 2 Statistical limit of
Figure FDA00040025021700000520
And statistical limit of Q _limit
If not, the data to be detected in the current unknown mode belongs to a second mode;
if the current modal is higher than the preset threshold, training data of a third modal is called for judgment, and the current unknown modal to-be-detected data is judged
Figure FDA00040025021700000516
statistic/T i 2 Whether the statistic/Q statistic is simultaneously higher than the corresponding training data of the third modality
Figure FDA0004002502170000061
Statistical limit of statistics
Figure FDA0004002502170000062
T i 2 Statistical limit of statistics
Figure FDA0004002502170000068
And statistical limit Q of Q statistic _limit
If not, the data to be detected in the current unknown mode belongs to a third mode;
if the current modal is higher than the preset threshold, the training data of the fourth modal is called for judgment, and the like is repeated until the current unknown modal to-be-detected data
Figure FDA0004002502170000063
statistic/T i 2 The statistic/Q statistic being higher than that obtained by modal training data
Figure FDA0004002502170000064
Statistical limit of statistics
Figure FDA0004002502170000065
T i 2 Statistical limit of statistics
Figure FDA0004002502170000069
And statistical limit Q of Q statistic _limit And considering the data to be detected in the current unknown mode as fault data.
8. The method of claim 1, wherein the method comprises the following steps:
and step 7) is also included after the step 6), specifically, the fault detection is carried out by utilizing the statistical limits and statistics in the step 3), the step 4) and the step 5).
9. The hierarchical non-gaussian process monitoring method based on common and unique feature extraction as claimed in claim 1, wherein:
the step 7), specifically, the statistical limit and the statistical quantity in the step 3), 4), 5) meet the data to be detected corresponding to any one of the following five conditions, namely, the industrial production process is considered to have a process fault:
for data of non-Gaussian part with common features in the data to be detected, the data is above the statistical limit
Figure FDA0004002502170000066
Data to be detected corresponding to the statistic;
for the data of Gaussian part with common features in the data to be detected, the data is positioned at T above the statistical limit c 2 Data to be detected corresponding to the statistic;
for data of non-Gaussian part with characteristic features in the data to be detected, the data is located above the statistical limit
Figure FDA0004002502170000067
Data to be detected corresponding to the statistic;
for the data of Gaussian part with characteristic features in the data to be detected, the data is positioned at T above the statistical limit i 2 And the statistic and the data to be detected corresponding to the Q statistic.
CN201910828558.1A 2019-09-03 2019-09-03 Layered non-Gaussian process monitoring method based on public and special feature extraction Active CN110647922B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910828558.1A CN110647922B (en) 2019-09-03 2019-09-03 Layered non-Gaussian process monitoring method based on public and special feature extraction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910828558.1A CN110647922B (en) 2019-09-03 2019-09-03 Layered non-Gaussian process monitoring method based on public and special feature extraction

Publications (2)

Publication Number Publication Date
CN110647922A CN110647922A (en) 2020-01-03
CN110647922B true CN110647922B (en) 2023-03-24

Family

ID=69010156

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910828558.1A Active CN110647922B (en) 2019-09-03 2019-09-03 Layered non-Gaussian process monitoring method based on public and special feature extraction

Country Status (1)

Country Link
CN (1) CN110647922B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112307257B (en) * 2020-11-25 2021-06-15 中国计量大学 Short video click rate prediction method based on multi-information node graph network

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2437034A2 (en) * 2010-09-30 2012-04-04 Siemens Corporation System and method for conditional multi-output regression for machine condition monitoring
CN109507972A (en) * 2018-12-19 2019-03-22 中国计量大学 Industrial processes fault monitoring method based on layer-stepping non-gaussian monitoring algorithm
CN109523195A (en) * 2018-12-19 2019-03-26 中国计量大学 The fault detection method of layer-stepping non-gaussian algorithm based on the multi-modal classification of sample

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2437034A2 (en) * 2010-09-30 2012-04-04 Siemens Corporation System and method for conditional multi-output regression for machine condition monitoring
CN109507972A (en) * 2018-12-19 2019-03-22 中国计量大学 Industrial processes fault monitoring method based on layer-stepping non-gaussian monitoring algorithm
CN109523195A (en) * 2018-12-19 2019-03-26 中国计量大学 The fault detection method of layer-stepping non-gaussian algorithm based on the multi-modal classification of sample

Also Published As

Publication number Publication date
CN110647922A (en) 2020-01-03

Similar Documents

Publication Publication Date Title
CN108960329B (en) Chemical process fault detection method containing missing data
CN104063719A (en) Method and device for pedestrian detection based on depth convolutional network
CN109409425B (en) Fault type identification method based on neighbor component analysis
CN108345284B (en) Quality-related fault detection method based on two variable blocks
CN105955214B (en) Batch process fault detection method based on sample time-series and neighbour's affinity information
CN111580506A (en) Industrial process fault diagnosis method based on information fusion
CN103488091A (en) Data-driving control process monitoring method based on dynamic component analysis
CN109675935A (en) A kind of IPCA operation of rolling on-line fault diagnosis method becoming control limit
CN114676742A (en) Power grid abnormal electricity utilization detection method based on attention mechanism and residual error network
CN110765587A (en) Complex petrochemical process fault diagnosis method based on dynamic regularization judgment local retention projection
CN114429152A (en) Rolling bearing fault diagnosis method based on dynamic index antagonism self-adaption
CN101738998A (en) System and method for monitoring industrial process based on local discriminatory analysis
CN112580693A (en) Petrochemical process fault diagnosis method based on self-help resampling neighborhood preserving embedding
AU2021335237A1 (en) Method for detecting abnormality of automatic verification system of smart watt-hour meter based on transductive support vector machine (TSVM) model
CN110647922B (en) Layered non-Gaussian process monitoring method based on public and special feature extraction
CN109901553B (en) Heterogeneous industrial big data collaborative modeling process fault monitoring method based on multiple visual angles
CN117516937A (en) Rolling bearing unknown fault detection method based on multi-mode feature fusion enhancement
CN110673568A (en) Method and system for determining fault sequence of industrial equipment in glass fiber manufacturing industry
CN117407313A (en) Automatic quality testing method and system based on machine learning analysis
CN112183569A (en) FDA and SOM based intermittent industrial process reaction phase clustering and fault classification visualization
Yang et al. A novel decentralized weighted ReliefF-PCA method for fault detection
CN109523195B (en) Sample multi-modal classification-based fault detection method for hierarchical non-Gaussian algorithm
CN114415609B (en) Dynamic process refinement monitoring method based on multi-subspace division
CN108596210A (en) A kind of intelligent identifying system and method for mechanical part mated condition
Gallup et al. Enhancing Fault Detection with Clustering and Covariance Analysis

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant