CN111160811A - Batch process fault monitoring method based on multi-stage FOM-SAE - Google Patents

Batch process fault monitoring method based on multi-stage FOM-SAE Download PDF

Info

Publication number
CN111160811A
CN111160811A CN202010057396.9A CN202010057396A CN111160811A CN 111160811 A CN111160811 A CN 111160811A CN 202010057396 A CN202010057396 A CN 202010057396A CN 111160811 A CN111160811 A CN 111160811A
Authority
CN
China
Prior art keywords
stage
monitoring
data
sae
fom
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010057396.9A
Other languages
Chinese (zh)
Other versions
CN111160811B (en
Inventor
金辰
王普
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Technology
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN202010057396.9A priority Critical patent/CN111160811B/en
Publication of CN111160811A publication Critical patent/CN111160811A/en
Application granted granted Critical
Publication of CN111160811B publication Critical patent/CN111160811B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/067Enterprise or organisation modelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0633Workflow analysis
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Educational Administration (AREA)
  • Development Economics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Testing And Monitoring For Control Systems (AREA)
  • Testing Or Calibration Of Command Recording Devices (AREA)

Abstract

The invention relates to a batch industrial process fault monitoring method based on multi-stage FOM-SAE, which is necessary for ensuring the safety and stability of a production process and improving the product quality. The specific method comprises two steps of off-line modeling and on-line monitoring. The off-line modeling firstly processes normal data, carries out stage division on the whole batch production process, then respectively establishes FOM-SAE monitoring models and constructs monitoring statistics in each stage, and determines the control limit of the statistics by using a nuclear density estimation method. And in the step of 'online monitoring', a corresponding sub-model is selected according to the current sampling moment, the statistic is calculated and compared with the control limit, and whether the current system is normal or not is judged. The method provided by the invention can simultaneously process the multi-stage, non-Gaussian and non-linear characteristics of batch data, effectively reduces the failure report rate of monitoring, greatly improves the accuracy rate of fault monitoring, and has higher practical application value.

Description

Batch process fault monitoring method based on multi-stage FOM-SAE
Technical Field
The invention relates to batch process fault diagnosis based on multi-stage FOM-SAE, and belongs to the field of industrial process fault diagnosis and soft measurement.
Background
In recent decades, batch processes have been widely used in biopharmaceutical, semiconductor processing, and other fields because they can meet the demand for producing high add-on products and can be quickly adjusted to produce in response to market demand. The industrial process has complex mechanism and high operation control difficulty, and the product quality and the production efficiency are easily influenced by uncertain factors. In order to ensure the safety and stability of the industrial system, it is necessary to establish an effective process monitoring means to find out abnormal conditions in time. At present, Multivariate Statistical Process Monitoring (MSPM) methods based on Principal Component Analysis (PCA), Partial Least Squares (PLS) and other methods are widely applied to batch industrial processes. However, the application of the above method is limited to two: firstly, the data must conform to the gaussian distribution, and secondly, the different variables must have linear relations.
To address the problem of non-gaussian characteristics of batch process data, Independent Component Analysis (ICA) was introduced into the process monitoring field. Although this method can handle non-gaussian data, ICA is unstable when solving the unmixing matrix, resulting in degraded monitoring performance. The related literature proves that the Fourth order distance (FOM) can effectively extract the non-gaussian information of the data. Furthermore, for non-linear relationships between batch process data variables, the introduction of Kernel Trick (Kernel lock) was introduced into traditional MSPM algorithms such as KPCA, KPLS, KICA. However, these non-linear monitoring methods require manual setting of the values of the nuclear parameters, which affects the monitoring effect.
The batch process has a plurality of operation stages, and monitoring the whole batch by adopting a single model inevitably leads to the reduction of the monitoring effect in different stages. In view of this feature, many scholars have made a lot of research, and the monitoring accuracy can be improved by reasonably dividing the batch data into stages and establishing a separate monitoring model at each sub-stage.
The purpose of process monitoring is to extract characteristic information from industrial data and realize the separation of normal data and fault data. An Auto Encoder (AE) algorithm is an unsupervised feature extraction method proposed by Hinton, and consists of an encoder and a decoder. The AE algorithm is used for projecting the normal samples to the low-dimensional specific space through nonlinear transformation, so that the nonlinear problem of the data can be effectively solved, and meanwhile, deep information of the data is extracted. While the stacked auto-encoder (SAE) has a stronger feature extraction capability compared to AE. SAE has been used in the fields of image processing, natural language processing, and the like, and application to the field of fault monitoring is also beginning to be emphasized.
A multi-stage batch process fault monitoring method based on four-step distance and stacking self-encoders can effectively improve the fault monitoring capability of a batch process. The method can simultaneously process the problems of non-Gaussian, non-linear and multi-stage process data, and has better fault monitoring performance.
Disclosure of Invention
Aiming at the problems of multiple stages, non-Gaussian and non-linearity of batch process data, the invention provides a batch industrial process fault monitoring method based on multi-stage FOM-SAE, and an overall flow chart of a model is shown in figure 1.
The invention adopts the following technical scheme and implementation steps:
A. an off-line modeling stage:
1) collecting multiple batches of historical data under a normal working condition to form a three-dimensional array X, wherein three dimensions are respectively production batch times I equal to 1 and …, I and process variables J equal to 1 and … and J, and sampling time K equal to 1 and … and K;
2) performing optimal stage division on multiple batches of historical data X by adopting a Fisher algorithm;
3) splitting the three-dimensional data X into I batch slice matrixes along the batch direction, wherein the matrix T of the ith batchiThe row represents the sampling time K equal to 1, …, K, and the column represents the process variable J equal to 1, …, J, as shown in fig. 3;
4) for each batch slice matrix TiCalculate FOM, I is 1, …, I, resulting in I FOM matrices, wherein the FOM matrix T of the ith batchFOMiThe form is as follows:
Figure BDA0002371760880000021
wherein, the element t of the kth row and the jth column in the ith batch of slicesj(k) Fourth step distance fj(k) Calculated by the following formula:
fj(k)=tj(k)tj(k-1)tj(k-2)tj(k-3)
for data at the first 3 moments, padding with data at the 4 th moment, i.e. fj(1)=fj(2)=fj(3)=fj(4);
5) Combining the data of the s stage in all batches of FOM matrixes into a new two-dimensional data matrix T according to the optimal stage division obtained in the step 2Stage sS1, …, S, whose row represents the sampling instant, has a total of I × (p)s+1-ps) Rows and columns represent the process variable J ═ 1, …, J, as shown in fig. 4;
6) for each phase data matrix TStage sS is 1, …, and is obtained by normalizing all elements in S
Figure BDA0002371760880000031
Wherein, the element f of the kth row and the jth column in the s-stage data matrixj(k) The normalization formula of (a) is as follows:
Figure BDA0002371760880000032
wherein f isj maxAnd fj minRespectively represent the current s-th stage data matrix TStage sThe maximum and minimum values in column j,
Figure BDA0002371760880000033
denotes fj(k) Normalized values;
7) setting SAE of phase ssSub-model hyper-parameters, utilization
Figure BDA0002371760880000034
Training SAE layer by layer as inputsSubmodel, minimizing the reconstruction error function for each layer:
Figure BDA0002371760880000035
wherein, thetaEnAnd thetaDeIs SAEsParameter of submodel, x(i)And s(i)Is SAEsInputting and outputting the ith layer of the submodel, wherein M is the total amount of samples; will SAEsThe output of the previous layer of the sub-model is used as the input of the next layer to be trained layer by layer until the last layer finishes SAEsTraining a model;
8) SAE will besThe reconstruction error function of the last layer of the model is used as RE statistic, RE statistic of all data in each stage is calculated, and the control limit of the RE statistic in the sub-stage s is determined by utilizing a nuclear density estimation method and is used for on-line monitoring of the stage s;
9) and (5) repeating the steps 5-8, respectively training the monitoring models of the S sub-stages, and respectively calculating the control limit of each sub-stage.
B. And (3) an online monitoring stage:
1) obtaining sampling data of current time
Figure BDA0002371760880000036
Selecting a corresponding stage monitoring model according to the stage to which the sampling value at the current k moment belongs;
2) calculating the current time
Figure BDA0002371760880000037
FOM matrix x off=[f1(k),f2(k),…,fJ(k)]Where the variable sample value x at time kj(k) FOM value f ofj(k) The following formula is adopted to obtain:
fj(k)=xj(k)xj(k-1)xj(k-2)xj(k-3)
for data at the first 3 moments, padding with data at the 4 th moment, i.e. fj(1)=fj(2)=fj(3)=fj(4)。
3) According to the maximum value and the minimum value obtained in the off-line modeling step 6), x obtained in the previous step is addedfIs subjected to standardization to obtain
Figure BDA0002371760880000041
Where the j variable k is the FOM value f at the momentj(k) The normalization formula of (a) is as follows:
Figure BDA0002371760880000042
wherein f isj maxAnd fj minRespectively represent the s-th stage off-line data TStage sThe maximum and minimum values in column j,
Figure BDA0002371760880000043
denotes fj(k) Normalized values;
4) obtained in the previous step
Figure BDA0002371760880000044
Inputting corresponding phase submodel SAEsAnd calculating the statistic RE;
5) comparing the monitoring statistic RE obtained in the previous step with the control limit of the corresponding stage obtained in the offline modeling step 8), and if the monitoring statistic RE exceeds the control limit, determining that a fault occurs, otherwise, determining that the fault occurs normally until the batch process is finished.
Advantageous effects
Compared with the prior art, the method divides the whole production batch into a plurality of operation stages, fully considers the stage characteristics of the batch process, respectively establishes the FOM-SAE model for each stage, and constructs the monitoring statistic for fault monitoring. Because the traditional FOM can effectively extract non-Gaussian information in data, the instability of solving the ICA model is avoided. The SAE can effectively extract deep information of the batch process while solving the nonlinearity of the batch data. The method can reduce the report missing in the monitoring process and improve the accuracy of fault monitoring.
Drawings
FIG. 1 is a flow chart of a method of the present invention;
FIG. 2 shows three-dimensional data X30×10×400Expanded into a two-dimensional time slice matrix xkA schematic diagram of (a);
FIG. 3 shows three-dimensional data X30×10×400Expanded into a two-dimensional batch sheet matrix XIA schematic diagram of (a);
FIG. 4 is a two-dimensional data matrix T for each stage of data processing reconstructionStage sA schematic diagram of (a);
FIG. 5 is a block diagram of a stacked self-encoder monitor model
FIG. 6 is a graph of the relationship between the number of stages of the penicillin batch fermentation process and α;
FIG. 7 shows the results of the staging of a penicillin batch fermentation process;
FIG. 8(a) is a diagram showing the monitoring effect of the SAE method on No. 5 fault
FIG. 8(b) is a diagram showing the monitoring effect of FOM-SAE method on No. 5 fault
FIG. 8(c) is a diagram showing the effect of the multi-stage SAE method on the monitoring of No. 5 fault
FIG. 8(d) is a diagram showing the effect of the multi-stage FOM-SAE method on the monitoring of No. 5 fault
FIG. 9(a) is a diagram showing the monitoring effect of the SAE method on No. 7 fault
FIG. 9(b) is a diagram showing the monitoring effect of the FOM-SAE method on No. 7 fault
FIG. 9(c) is a diagram showing the monitoring effect of the multi-stage SAE method on No. 7 fault
FIG. 9(d) is a diagram showing the effect of the multi-stage FOM-SAE method on the monitoring of No. 7 fault
Detailed Description
Penicillin fed-batch fermentation is a typical batch industrial process with obvious non-gaussian, non-linear, multi-stage characteristics. A Birol model improved based on a Bajpai mechanism model and Pensim2.0 software developed by the Ethino science and technology institute become a standard test platform for monitoring, fault diagnosis and control of the penicillin fermentation process. Relevant studies have demonstrated the effectiveness and utility of this simulation platform.
In the experiment, the Pensim2.0 simulation data is used as a research object, the sampling time is set to be 1 hour, the fermentation time of each batch is set to be 400 hours, and 10 main process variables are selected to monitor the process running condition, as shown in Table 1. Three-dimensional matrix X for generating 30 batches of normal data by adopting platform30×10×400For modeling, 10 additional batches of fault data were generated to test the effectiveness of the monitoring model. The set fault type, amplitude and start-stop time are shown in table 2.
TABLE 1 Process variables used for modeling
Figure BDA0002371760880000051
TABLE 2 failure batch data
Figure BDA0002371760880000052
Figure BDA0002371760880000061
The application process of the invention in the simulation platform for penicillin fermentation production is specifically stated as follows:
A. an off-line modeling stage:
1) the historical data X collected in the collection batch process under the normal working condition is a three-dimensional array, the three dimensions are respectively production batch times i of 1, … and 30, process variables j of 1, … and 10, and sampling points k of 1, … and 400.
2) The Fisher algorithm is adopted to carry out stage division on the time slice matrix, and the steps are as follows:
2.1) splitting the three-dimensional data into 400 time slice matrices along the time direction, wherein the time slice matrix x at the k-th time instantkThe row represents the production lot number i 1, …,30, the column represents the process variable j 1, …, 10;
2.2) time slice matrix is divided into S stages according to time sequence, and the initial time of each stage is arbitrarily set as psWhere S is 1, …, S, in this case p1=1<p2<…<pSK is less than or equal to K. Calculating the class inner diameter of each stage, wherein the class inner diameter is calculated for the stage S by using the following formula (when S is S, ps+1-1=K):
Figure BDA0002371760880000062
Wherein
Figure BDA0002371760880000063
Represents the average of all time slice matrices during this phase. At this time, the time slice matrix is dividedFor an error function of S stages, by the following formula:
Figure BDA0002371760880000064
2.3) in order to obtain the optimal division result quickly, the minimum value of the error function is preferably calculated by a recurrence formula, and the optimal division result divided into S stages can be obtained at the moment:
when the division into two stages is made,
Figure BDA0002371760880000071
when the division stage is greater than 2 a,
Figure BDA0002371760880000072
the minimum value of the error function is calculated by the above formula when S is 1,2, …, K.
2.4) when a recurrence formula is adopted, a discriminant function can be calculated
Figure BDA0002371760880000073
The more α is close to 1, the more accurate the staged result is, the combined stage characteristics of penicillin fermentation and α value determination process is divided into 5 stages, and α value change and division results are shown in fig. 6 and fig. 7.
3) Splitting three-dimensional data X into 30 batch slice matrixes according to batches, wherein the matrix T of the ith batchiThe row represents the sampling time k equal to 1, …,400, and the column represents the process variable j equal to 1, …, 10;
4) for each batch slice matrix TiCalculate FOM, I1, …,30, resulting in I FOM matrices, wherein the FOM matrix T of the ith batchFOMiThe form is as follows:
Figure BDA0002371760880000074
wherein, in the ith batch of slicesElement t of k row and j columnj(k) Fourth step distance fj(k) Calculated by the following formula:
fj(k)=tj(k)tj(k-1)tj(k-2)tj(k-3)
for data at the first 3 moments, padding with data at the 4 th moment, i.e. fj(1)=fj(2)=fj(3)=fj(4);
5) Combining the data of the s stage in all batches of FOM matrixes into a new two-dimensional data matrix T according to the optimal stage division obtained in the step 2Stage sS is 1, …, S, the row representing the sampling instant, for a total of 30 × (p)s+1-ps) Rows and columns represent process variables J ═ 1, …, J;
6) for s-phase data matrix TStage sAll elements in 1, …,5 are normalized to obtain
Figure BDA0002371760880000075
Wherein, the element f of the kth row and the jth column in the s-stage data matrixj(k) The normalization formula of (a) is as follows:
Figure BDA0002371760880000076
wherein f isj maxAnd fj minRespectively represent the current s-th stage data matrix TStage sThe maximum and minimum values in column j,
Figure BDA0002371760880000081
denotes fj(k) Normalized values;
7) setting SAE of phase ssSub-model hyper-parameters, utilization
Figure BDA0002371760880000082
Training SAE layer by layer as inputsSubmodel, minimizing the reconstruction error function for each layer:
Figure BDA0002371760880000083
wherein, thetaEnAnd thetaDeIs SAEsParameter of submodel, x(i)And s(i)Is SAEsInputting and outputting the ith layer of the submodel, wherein M is the total amount of samples; SAE will besThe output of the previous layer of the sub-model is used as the input of the next layer to be trained layer by layer until the last layer finishes SAEsModel training, SAE model is a well-known structure as shown in FIG. 5.
8) SAE will besThe reconstruction error function of the last layer of the model is used as RE statistic, RE statistic of all data in each stage is calculated, and the control limit of the RE statistic in the sub-stage s is determined by utilizing a nuclear density estimation method and is used for on-line monitoring of the stage s;
9) and (5) repeating the steps 5-8, respectively training the monitoring models of the 5 sub-stages, and respectively calculating the control limit of each sub-stage.
B. And (3) an online monitoring stage:
1) obtaining sampling data of current time
Figure BDA0002371760880000084
Selecting a corresponding stage monitoring sub-model according to the stage to which the sampling value at the current k moment belongs;
2) calculating the current time
Figure BDA0002371760880000085
FOM matrix x off=[f1(k),f2(k),…,fJ(k)]Where the variable sample value x at time kj(k) FOM value f ofj(k) The following formula is adopted to obtain:
fj(k)=xj(k)xj(k-1)xj(k-2)xj(k-3)
for data at the first 3 moments, padding with data at the 4 th moment, i.e. fj(1)=fj(2)=fj(3)=fj(4)。
3) According to the maximum value and the minimum value obtained in the off-line modeling step 6), x obtained in the previous step is addedfIs subjected to standardization to obtain
Figure BDA0002371760880000086
Where the j variable k is the FOM value f at the momentj(k) The normalization formula of (a) is as follows:
Figure BDA0002371760880000087
wherein f isj maxAnd fj minRespectively represent the s-th stage off-line data TStage sThe maximum and minimum values in column j,
Figure BDA0002371760880000088
denotes fj(k) Normalized values;
4) obtained in the previous step
Figure BDA0002371760880000091
Inputting corresponding phase submodel SAEsAnd calculating the statistic RE;
5) comparing the monitoring statistic RE obtained in the previous step with the control limit of the corresponding stage obtained in the offline modeling step 8), and if the monitoring statistic RE exceeds the control limit, determining that a fault occurs, otherwise, determining that the fault occurs normally until the batch process is finished.
The steps are the specific application of the invention on the penicillin fermentation simulation platform. To verify the effectiveness of the monitoring algorithm herein, two failures with failure numbers 5 and 7 were chosen for comparison with the SAE, FOM-SAE, multi-stage FOM-SAE algorithms, respectively, to show the experimental results in detail, see FIGS. 8 and 9. Each graph includes 5 straight lines parallel to the abscissa, i.e., control limits obtained through kernel density estimation, and the up-down fluctuation curves are values of the monitoring statistics. If the curve value is larger than the control limit, the fault occurs at the moment. For more intuitive comparison of monitoring performance of the method, the false alarm rate (MAR) and the accuracy rate (ACC) of the model for 10 faults are respectively calculated as monitoring indexes:
Figure BDA0002371760880000092
Figure BDA0002371760880000093
fig. 8 shows a graph of the monitoring results for fault No. 5, which is a ramp-type fault with an increase of 2W/h in the agitation rate fault, and which is introduced from time 70 and continues until time 150 is completed. For a ramped fault, the magnitude of the fault increases cumulatively over time, making it more difficult to detect the fault at an early time when the fault occurs. It can be seen from fig. 8(a) that the single-stage SAE model has a higher false alarm rate of 86.42%, while the multi-stage SAE model of fig. 8(c) better captures the stage characteristics, the false alarm rate is reduced to 67.9%, and the two models have obvious alarm lag. After FOM is added in the graph 8(b), the accuracy is obviously improved to 95%, and the model is proved to effectively improve the monitoring accuracy by extracting non-Gaussian information. The monitoring result of the present invention is shown in fig. 8(d), which has the highest accuracy of 95.25%.
Fig. 9 shows a graph of the monitoring results for fault No. 7, which is a step-type fault with a 3% increase in substrate feed rate, introducing the fault from time 58 to the end of time 130. As can be seen from fig. 9(a) and 9(b), the phase models have high false negative rate and have lost monitoring effect. Fig. 9(c) a multi-stage SAE with a slight improvement over a single-stage SAE, but still has a false negative rate of 87.67%. The method of the invention is shown in figure 9(d), and the method can simultaneously keep lower rate of missing report and rate of false report, and the accuracy rate reaches 97.75 percent.
The performance indexes of monitoring ten faults are shown in a table 3, the multi-stage FOM-SAE provided by the method has higher accuracy rate for various faults, and the fault monitoring effect in the penicillin fermentation process is improved.
TABLE 3 monitoring Performance indicators
Figure BDA0002371760880000101

Claims (2)

1. A multi-stage FOM-SAE based fault monitoring method for complex industrial process is realized by
The method comprises two steps of 'off-line modeling' and 'on-line monitoring', and the specific scheme is as follows:
the invention adopts the following technical scheme and implementation steps:
A. an off-line modeling stage:
1) collecting multiple batches of historical data under a normal working condition to form a three-dimensional array X, wherein three dimensions are respectively production batch times I equal to 1 and …, I and process variables J equal to 1 and … and J, and sampling time K equal to 1 and … and K;
2) performing optimal stage division on multiple batches of historical data X by adopting a Fisher algorithm;
3) splitting the three-dimensional data X into I batch slice matrixes along the batch direction, wherein the matrix T of the ith batchiThe row represents the sampling time K equal to 1, …, K, and the column represents the process variable J equal to 1, …, J;
4) for each batch slice matrix TiCalculate FOM, I is 1, …, I, resulting in I FOM matrices, wherein the FOM matrix T of the ith batchFOMiThe form is as follows:
Figure FDA0002371760870000011
wherein, the element t of the kth row and the jth column in the ith batch of slicesj(k) Fourth step distance fj(k) Calculated by the following formula:
fj(k)=tj(k)tj(k-1)tj(k-2)tj(k-3)
for data at the first 3 moments, padding with data at the 4 th moment, i.e. fj(1)=fj(2)=fj(3)=fj(4);
5) Combining the data of the s stage in all batches of FOM matrixes into a new two-dimensional data matrix T according to the optimal stage division obtained in the step 2StagesS1, …, S, whose row represents the sampling instant, has a total of I × (p)s+1-ps) Rows and columns represent process variables J ═ 1, …, J;
6) for each onePhase data matrix TStagesS is 1, …, and is obtained by normalizing all elements in S
Figure FDA0002371760870000012
Wherein, the element f of the kth row and the jth column in the s-stage data matrixj(k) The normalization formula of (a) is as follows:
Figure FDA0002371760870000013
wherein f isj maxAnd fj minRespectively represent the current s-th stage data matrix TStagesThe maximum and minimum values in column j,
Figure FDA0002371760870000021
denotes fj(k) Normalized values;
7) setting SAE of phase ssSub-model hyper-parameters, utilization
Figure FDA0002371760870000022
Training SAE layer by layer as inputsSubmodel, minimizing the reconstruction error function for each layer:
Figure FDA0002371760870000023
wherein, thetaEnAnd thetaDeIs SAEsParameter of submodel, x(i)And s(i)Is SAEsInputting and outputting the ith layer of the submodel, wherein M is the total amount of samples; SAE will besThe output of the previous layer of the sub-model is used as the input of the next layer to be trained layer by layer until the last layer finishes SAEsTraining a model;
8) SAE will besThe reconstruction error function of the last layer of the model is used as RE statistic, RE statistic of all data in each stage is calculated, and the control limit of the RE statistic in the sub-stage s is determined by utilizing a nuclear density estimation method and is used for on-line monitoring of the stage s;
9) and (5) repeating the steps 5-8, respectively training the monitoring models of the S sub-stages, and respectively calculating the control limit of each sub-stage.
B. And (3) an online monitoring stage:
1) obtaining sampling data of current time
Figure FDA0002371760870000024
Selecting a corresponding stage monitoring sub-model according to the stage to which the sampling value at the current k moment belongs;
2) calculating the current time
Figure FDA0002371760870000025
FOM matrix x off=[f1(k),f2(k),…,fJ(k)]Where the variable sample value x at time kj(k) FOM value f ofj(k) The following formula is adopted to obtain:
fj(k)=xj(k)xj(k-1)xj(k-2)xj(k-3)
for data at the first 3 moments, padding with data at the 4 th moment, i.e. fj(1)=fj(2)=fj(3)=fj(4);
3) According to the maximum value and the minimum value obtained in the off-line modeling step 6), x obtained in the previous step is addedfIs subjected to standardization to obtain
Figure FDA0002371760870000026
Where the j variable k is the FOM value f at the momentj(k) The normalization formula of (a) is as follows:
Figure FDA0002371760870000027
wherein f isj maxAnd fj minRespectively represent the s-th stage off-line data TStagesThe maximum and minimum values in column j,
Figure FDA0002371760870000028
to representfj(k) Normalized values;
4) obtained in the previous step
Figure FDA0002371760870000031
Inputting corresponding phase submodel SAEsAnd calculating the statistic RE;
5) comparing the monitoring statistic RE obtained in the previous step with the control limit of the corresponding stage obtained in the offline modeling step 8), and if the monitoring statistic RE exceeds the control limit, determining that a fault occurs, otherwise, determining that the fault occurs normally until the batch process is finished.
2. The multi-stage FOM-SAE-based fault monitoring method for complex industrial processes according to claim 1, characterized in that: the specific steps of the optimal phase division in the off-line modeling phase are as follows:
2.1) splitting the three-dimensional data X into K time slice matrices along the time direction, wherein XkA time slice matrix representing the sampling time of k, wherein the row of the time slice matrix represents the production batch number I equal to 1, …, and the column of the time slice matrix represents the process variable J equal to 1, …, J;
2.2) randomly dividing the multiple batches of historical data X into S stages, calculating the class inner diameter of each stage, and calculating the error function of the stage division according to the obtained class inner diameter, wherein the class inner diameter calculation formula of the stage S is as follows:
Figure FDA0002371760870000032
wherein p issDenotes the start of the s-th phase, ps+1Denotes the start of the s +1 th phase, ps+1-1 denotes the end of the s-th phase, i.e. ps+1The time of the previous sampling instant of (c),
Figure FDA0002371760870000033
represents the average of all time slice matrices within the phase s,
the error function for dividing the multi-batch historical data X into S stages is as follows:
Figure FDA0002371760870000034
and 2.3) the division mode with the minimum error function is the optimal division mode.
CN202010057396.9A 2020-01-17 2020-01-17 Batch process fault monitoring method based on multi-stage FOM-SAE Active CN111160811B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010057396.9A CN111160811B (en) 2020-01-17 2020-01-17 Batch process fault monitoring method based on multi-stage FOM-SAE

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010057396.9A CN111160811B (en) 2020-01-17 2020-01-17 Batch process fault monitoring method based on multi-stage FOM-SAE

Publications (2)

Publication Number Publication Date
CN111160811A true CN111160811A (en) 2020-05-15
CN111160811B CN111160811B (en) 2023-10-03

Family

ID=70564050

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010057396.9A Active CN111160811B (en) 2020-01-17 2020-01-17 Batch process fault monitoring method based on multi-stage FOM-SAE

Country Status (1)

Country Link
CN (1) CN111160811B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113420815A (en) * 2021-06-24 2021-09-21 江苏师范大学 Semi-supervised RSDAE nonlinear PLS intermittent process monitoring method
CN114167729A (en) * 2021-12-08 2022-03-11 北京工业大学 Distributed two-segment adaptive weighted stacking automatic encoder-based tiny fault detection method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106709214A (en) * 2017-02-20 2017-05-24 北京工业大学 Penicillin fermentation process fault monitoring method based on MLLE-OCSVM
CN107272655A (en) * 2017-07-21 2017-10-20 江南大学 Batch process fault monitoring method based on multistage ICA SVDD
CN109740687A (en) * 2019-01-09 2019-05-10 北京工业大学 A kind of fermentation process fault monitoring method based on DLAE
CN110245460A (en) * 2019-06-28 2019-09-17 北京工业大学 A kind of batch process fault monitoring method based on multistage OICA

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106709214A (en) * 2017-02-20 2017-05-24 北京工业大学 Penicillin fermentation process fault monitoring method based on MLLE-OCSVM
CN107272655A (en) * 2017-07-21 2017-10-20 江南大学 Batch process fault monitoring method based on multistage ICA SVDD
CN109740687A (en) * 2019-01-09 2019-05-10 北京工业大学 A kind of fermentation process fault monitoring method based on DLAE
CN110245460A (en) * 2019-06-28 2019-09-17 北京工业大学 A kind of batch process fault monitoring method based on multistage OICA

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李晴晴 等: "基于改进堆叠自编码器的滚动轴承故障诊断", 计算机工程与设计 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113420815A (en) * 2021-06-24 2021-09-21 江苏师范大学 Semi-supervised RSDAE nonlinear PLS intermittent process monitoring method
CN113420815B (en) * 2021-06-24 2024-04-30 江苏师范大学 Nonlinear PLS intermittent process monitoring method of semi-supervision RSDAE
CN114167729A (en) * 2021-12-08 2022-03-11 北京工业大学 Distributed two-segment adaptive weighted stacking automatic encoder-based tiny fault detection method and system

Also Published As

Publication number Publication date
CN111160811B (en) 2023-10-03

Similar Documents

Publication Publication Date Title
CN107895224B (en) MKECA fermentation process fault monitoring method based on extended nuclear entropy load matrix
CN108897286B (en) Fault detection method based on distributed nonlinear dynamic relation model
CN110929765B (en) Batch-imaging-based convolution self-coding fault monitoring method
CN109740687B (en) Fermentation process fault monitoring method based on DLAE
CN109407652B (en) Multivariable industrial process fault detection method based on main and auxiliary PCA models
CN110245460B (en) Intermittent process fault monitoring method based on multi-stage OICA
CN108469805B (en) Distributed dynamic process monitoring method based on dynamic optimal selection
CN108960309B (en) Dynamic process monitoring method based on RBF neural network autocorrelation elimination
CN108375965B (en) non-Gaussian process monitoring method based on multi-variable block cross correlation elimination
CN111160811A (en) Batch process fault monitoring method based on multi-stage FOM-SAE
CN108445867B (en) non-Gaussian process monitoring method based on distributed ICR model
CN112904810B (en) Process industry nonlinear process monitoring method based on effective feature selection
CN111324110A (en) Fermentation process fault monitoring method based on multiple shrinkage automatic encoders
CN105955214A (en) Batch process fault detection method based on sample timing sequence and neighborhood similarity information
CN110119579B (en) OICA-based complex industrial process fault monitoring method
CN108427398B (en) Dynamic process monitoring method based on distributed AR-PLS model
CN111913460B (en) Fault monitoring method based on sequence correlation local preserving projection algorithm
CN108572639B (en) Dynamic process monitoring method based on principal component autocorrelation elimination
CN114757269A (en) Complex process refined fault detection method based on local subspace-neighborhood preserving embedding
CN111188761B (en) Monitoring method for pump equipment based on Fourier-CVA model
CN112214006A (en) Intermittent process fault detection method and system considering two-dimensional dynamic characteristics
CN108491878B (en) Fault classification diagnosis method based on multiple error generation models
CN111126671A (en) Early warning method for tower flushing fault of primary distillation tower in oil refining production
CN110554667A (en) convolutional Neural Network (CNN) based intermittent industrial process fault diagnosis
CN111695581B (en) Nonlinear dynamic process monitoring method based on autoregressive extreme learning machine

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant