CN111291020A - Dynamic process soft measurement modeling method based on local weighted linear dynamic system - Google Patents

Dynamic process soft measurement modeling method based on local weighted linear dynamic system Download PDF

Info

Publication number
CN111291020A
CN111291020A CN201911094779.7A CN201911094779A CN111291020A CN 111291020 A CN111291020 A CN 111291020A CN 201911094779 A CN201911094779 A CN 201911094779A CN 111291020 A CN111291020 A CN 111291020A
Authority
CN
China
Prior art keywords
online
training
samples
window
hidden
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911094779.7A
Other languages
Chinese (zh)
Inventor
方靖云
何雨辰
张丽芳
曾九孙
严天宏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Jiliang University
Original Assignee
China Jiliang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Jiliang University filed Critical China Jiliang University
Priority to CN201911094779.7A priority Critical patent/CN111291020A/en
Publication of CN111291020A publication Critical patent/CN111291020A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/211Schema design and management
    • G06F16/212Schema design and management with details for data modelling support
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06395Quality analysis or management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/067Enterprise or organisation modelling

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Physics & Mathematics (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Databases & Information Systems (AREA)
  • Tourism & Hospitality (AREA)
  • Data Mining & Analysis (AREA)
  • General Business, Economics & Management (AREA)
  • Operations Research (AREA)
  • Marketing (AREA)
  • Probability & Statistics with Applications (AREA)
  • General Engineering & Computer Science (AREA)
  • Game Theory and Decision Science (AREA)
  • Quality & Reliability (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Fuzzy Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a dynamic process soft measurement modeling method based on a local weighted linear dynamic system. The invention introduces sliding windows, establishes a linear dynamic system model in each sliding window, obtains the original space weight and the hidden space weight of an online sample relative to all offline samples by calculating the similarity of the online sample and the offline samples in each window in an original space and a hidden space, and establishes a weighted linear dynamic system model based on the two weights to obtain the hidden variables of all the offline samples. And finally, calculating a butane content predicted value of the online sample by a local weighted regression method. The invention simultaneously considers the similarity relation between the online sample and the offline sample in the original space and the hidden space, and improves the accuracy of key variable prediction in the industrial process.

Description

Dynamic process soft measurement modeling method based on local weighted linear dynamic system
Technical Field
The invention belongs to the field of industrial and chemical process soft measurement modeling and application, and particularly relates to a dynamic process soft measurement modeling method based on a local weighted linear dynamic system.
Background
The industrial production process is very complex, the quality requirement of products is higher and higher, the production quality of the products can be ensured by effectively monitoring the production process, and the process monitoring is becoming more and more important. Typical process monitoring methods are knowledge-based methods and data-driven methods. The knowledge-based methods perform diagnosis through known knowledge, and can generally achieve good results under the condition of complete knowledge; however, the current industrial production has high complexity, so the data-driven method can avoid the defect that the knowledge-based method needs a large amount of comprehensive process knowledge, and the effective monitoring is carried out by analyzing the relation among data in the production process and modeling.
At present, key variables in a complex industrial production process are often difficult to be directly measured by a sensor, and a soft measurement model between a process variable which is easy to be measured and the key variable which is difficult to be directly measured needs to be established. The traditional soft measurement model such as probability principal component analysis is a static model, and dynamic characteristics and nonlinear characteristics before and after data are not considered; linear dynamic system models, while taking into account the dynamics between data, are difficult to directly describe for non-linearities in complex industrial processes. Therefore, the traditional linear dynamic system model is expanded into a linear dynamic system model based on sliding window weighting under a probabilistic modeling framework, and the dynamic and nonlinear characteristics among data are well explained, so that the problem of monitoring key variables in the industrial production process is solved.
Disclosure of Invention
The invention provides a dynamic process soft measurement modeling method based on a local weighted linear dynamic system, which is characterized by comprising the following steps of:
step 1: and collecting offline data of the debutanizer as a training sample set, wherein the training sample set comprises a plurality of groups of training samples, and each group of training samples comprises a plurality of process variables and a key variable which are observed at the same time. The process variables are flow values, pressure and temperature of different parts in the operation process of the debutanizer; the key variable is the butane content value at that time obtained by off-line assay analysis.
Step 2: and introducing sliding windows, traversing the training sample set by the sliding windows with a fixed step length to obtain a plurality of sliding windows, wherein each sliding window comprises process variables of a plurality of groups of training samples, the process variables of the plurality of groups of training samples contained in each sliding window are defined as sub-training sets of the sliding window, and the sub-training sets in different sliding windows are not identical. The window length and the step length of the sliding window are manually preset. And establishing a linear dynamic system model for the sub-training set under each window to obtain model parameters calculated by the sub-training set under each window and a plurality of hidden variable values of each group of training samples in a hidden space.
The window length and the step length of the sliding window are manually determined in advance, if the window step length is defined to be S, the length of the training sample set is N, the window length is T, the total number of windows is
Figure RE-GDA0002456128470000021
And the sub-training set in each window comprises T groups of training samples.
And step 3: online data of the debutanizer is collected as a test sample set, the test sample set including a plurality of sets of online samples, each set of online samples including a plurality of process variables at the same time. The process variable name and number of the online sample are consistent with the process variable of the offline sample. Calculating the similarity of each group of online samples in the original space relative to the training sample set; and calculating the similarity of each group of online samples in the hidden space relative to each sub training set. And finally, obtaining the global similarity of the online samples in the original space and the global weight of the hidden space relative to the training sample set through calculation.
And 4, step 4: establishing a local weighted linear dynamic system model, taking the process variables of the training sample set, the hidden space global weight of the online samples and the original space global weight of the online samples as the input of the model, and obtaining the parameters of the model under the model.
And 5: and extracting the training sample closest to the online sample at the sampling time in the training sample set, and calculating to obtain the hidden variable value of the online sample by a Kalman filtering method.
Step 6: and establishing a local weighted linear regression model according to the model parameters obtained by the local weighted linear dynamic system model and the training sample set to obtain model parameters and predict key variable values of the online sample, namely the butane content value.
The processing method of each sliding window in the step 2 is the same, and the processing method of the η th sliding window specifically includes:
2.1) carrying out normalization processing of subtracting the mean value and dividing by the square difference on the sub-training set in the window, and establishing a linear dynamic system model on the normalized sub-training set:
the linear dynamic system model is represented as follows:
Figure RE-GDA0002456128470000031
Figure RE-GDA0002456128470000032
Figure RE-GDA0002456128470000033
representing hidden variable values of a t group of training samples under the linear dynamic system model in an η th window in a hidden space;
Figure RE-GDA0002456128470000034
representing the process variable in the t-th set of training samples in the η -th window;
Aη∈RH×Hrepresenting a state transition probability matrix under the linear dynamic system model in the η th window;
Bη∈RV×Hrepresenting the emission probability matrix under the linear dynamic system model in the η th window;
Figure RE-GDA0002456128470000035
representing the noise of the t group of hidden variables under the linear dynamic system model in the η th window;
Figure RE-GDA0002456128470000036
noise representing the process variable of the tth set of training samples in the η th window;
v is the number of process variables in a group of training samples, H is the number of variables in a group of hidden variables, T is 1, 2.
Noise(s)
Figure RE-GDA0002456128470000037
And
Figure RE-GDA0002456128470000038
are subject to a gaussian distribution,
Figure RE-GDA0002456128470000039
and
Figure RE-GDA00024561284700000310
is the covariance of the implicit variable and the process variable in the η th window.
2.2) maximizing the likelihood function by building the likelihood function of the model and by Kalman filtering, Kalman smoothing and expectation maximization (EM algorithm), recording the parameters of the model in the η th window when the likelihood function converges
Figure RE-GDA00024561284700000311
Wherein
Figure RE-GDA00024561284700000312
And (4) the hidden variable values in the hidden space of each group of training samples in the window.
The processing method of the group of online samples in step 3 in each window is consistent, and the processing method of the online samples in the η th window specifically includes:
3.1) calculating the global weight of the process variables of the training sample set in relation to the online samples in the original space:
Figure RE-GDA00024561284700000313
wherein v isnProcess variables of an nth set of training samples in the set of training samples; v. ofnewIs a process variable of the online sample;
Figure RE-GDA00024561284700000314
representing the similarity of the process variables of the nth set of training samples in the original space with respect to the online samples; n is the number of training sample sets.
Secondly, the global weight of the training sample set in the original space relative to the online sample is calculated:
Figure RE-GDA00024561284700000315
wherein λ isnGlobal weights for the nth training sample in the original space with respect to the online samples; zetavThe weight control parameters of the original space are manually set;
3.2) calculating the global weight of the online samples in the hidden space relative to the process variable of the training sample set:
firstly, computing the hidden variables of the online samples in η th windows, wherein the computing method of the online samples in each window is the same:
Figure RE-GDA0002456128470000041
wherein v isnewIs an online sample; mu.sηIs the mean value of the sub-training set in the η th windowηThe variance of the sub-training set in the η th window, BηAnd
Figure RE-GDA0002456128470000042
model parameters of the linear dynamic system model in the η th window;
Figure RE-GDA0002456128470000043
and obtaining an implicit variable after the online sample in the η th window is subjected to projection.
Secondly, calculating the similarity of the process variables of the sub-training set in the η th window in the hidden space relative to the online sample:
Figure RE-GDA0002456128470000044
wherein,
Figure RE-GDA0002456128470000045
and (3) the similarity between the variable values of the t-th group of training samples in the hidden space in the η th window and the hidden variables obtained after the online samples are projected in the η th window.
Then summing the similarity obtained by each group of training samples in different windows to obtain the global similarity of the online samples in the hidden space relative to the training sample set:
Figure RE-GDA0002456128470000046
wherein, thetan,iSimilarity for the nth set of training samples with respect to the online samples in the ith window; Γ, i is the number of window sequences comprising the nth set of training samples, Γ is the total number of all sliding windows comprising the nth set of training samples,
Figure RE-GDA0002456128470000047
global similarity of the nth set of training samples in the set of training samples.
And finally, calculating the global weight of the training sample set in the hidden space relative to the online sample:
Figure RE-GDA0002456128470000048
therein, ζhThe weight control parameters are manually set; phi is anThe global weight of the nth training sample in the hidden space with respect to the online samples is used.
The sliding window weighted linear dynamic system model in the step 4 is as follows:
hn=Ahn-1+an
vn=Bhn+bn
whereinA and B are a state transition probability matrix and an emission probability matrix under a weighted linear dynamic system model; h isnHidden variable values of process variables of the nth group of training samples in the training sample set in a hidden space; v. ofnProcess variables of an nth set of training samples in the set of training samples; a isn,bnAre respectively hnAnd vnThe noise of (2).
Noise anAnd bnAll obey Gaussian distribution, an~N(0,Σh),bn~N(0,Σv);ΣhSum-sigmavIs the covariance of the hidden variable and the process variable in the model.
The input to the model is the process variable of a training sample set with respect to the weight λ ═ of the online samples in the original space (λ)12,...,λN) The weight of the training sample set in relation to the online samples in the hidden space is (phi)12,...,φN)。
By establishing a likelihood function for the model and maximizing the likelihood function by Kalman filtering, Kalman smoothing and expectation maximization (EM algorithm), the parameters of the model are recorded when the likelihood function converges
Figure RE-GDA0002456128470000051
Wherein
Figure RE-GDA0002456128470000052
Figure RE-GDA0002456128470000053
The variance of each set of hidden variables in the kalman filter.
In the step 5, the calculation methods of each group of online samples are consistent, wherein one group of online samples are processed, and the specific formula of the Kalman filtering in the step 5 is as follows:
V*=AF*A+Σh
K=V*BT(BV*BTv)-1
hnew=Ah*+K(vnew-BAh*)
wherein h is*And F*The training sample closest to the online sample at the sampling moment is the variable value and the variance in the hidden space; k is a Kalman gain matrix; h isnewCalculating a hidden variable value of the online sample in a hidden space by a Kalman filtering method for the online sample; v. ofnewIs the process variable of the on-line sample.
The specific steps for predicting the key variables of a group of online samples in the step 6 are as follows, and the calculation methods of each group of online samples are consistent:
first, a weighted average of the key variables of the online sample with respect to λ is calculated, and the weighted average is subtracted from each key variable of the training sample set:
Figure RE-GDA0002456128470000054
Figure RE-GDA0002456128470000055
wherein Y is (Y)1,y2,...yN)∈R1×NIs a key variable in the training sample set;
Figure RE-GDA0002456128470000056
is a weighted average.
Figure RE-GDA0002456128470000057
And subtracting the key variable of the training sample set after the weighted average value is subtracted from the nth group.
Secondly, predicting key variables of the online samples through local weighted linear regression:
b=(Yλ*hT)(hλ*hT)-1
Figure RE-GDA0002456128470000058
wherein b isRegression parameter, y, of a locally weighted linear regressionnewIs the key variable of the online sample obtained by final prediction. Lambda [ alpha ]*Is a matrix of lambda after diagonalization, lambda*∈RN×N
All the steps 2 to 6 are modeling a group of online samples, and online predicting the butane content value of the online samples. The butane content values are predicted for multiple sets of online samples by repeating steps 2 through 6.
Drawings
Fig. 1 shows soft measurement results of a linear dynamic system model based on local weighting.
Fig. 2 is a flow chart of the algorithm.
Detailed Description
The invention is further described with reference to the following figures and specific examples.
Aiming at the problem of online monitoring of the butane content in the debutanizer, the invention carries out soft measurement modeling on variables which are easy to directly measure, and estimates the butane content value in the chemical process on line.
The embodiment of the invention and the specific implementation process thereof are as follows:
the first step is as follows: and collecting the process variables of the operation of the debutanizer in the chemical process by using a distributed control system, and storing the process variables into a history database system.
The second step is that: and obtaining a key variable value under the process variable at each moment, namely a butane content value, through offline assay analysis, and storing the value into a history database system. This yields the process variables and the key variables. And extracting part of process variables and key variables thereof to form a training sample set.
In order to obtain an optimal parameter set, in the step E of the expectation maximum algorithm, Kalman filtering and smoothing operation are required, the calculation method of each window is consistent, and the step E in the η th window specifically comprises the following steps:
firstly, kalman filtering operation is required, and a specific calculation formula is as follows:
Figure RE-GDA0002456128470000061
Figure RE-GDA0002456128470000062
Figure RE-GDA0002456128470000063
Figure RE-GDA0002456128470000064
wherein,
Figure RE-GDA0002456128470000071
a Kalman gain matrix for the t-th set of training samples in the η th window;
Figure RE-GDA0002456128470000072
estimate the variance of the t-th group of hidden variables for the η th window;
Figure RE-GDA0002456128470000073
the t group hidden variable in the η th window obtained by final calculation;
Figure RE-GDA0002456128470000074
the variance of the t-th group of hidden variables in the η th window is finally calculated.
Next, kalman smoothing operation is performed, and a specific calculation formula is as follows:
Figure RE-GDA0002456128470000075
Figure RE-GDA0002456128470000076
Figure RE-GDA0002456128470000077
wherein,
Figure RE-GDA0002456128470000078
a gain matrix for the t-th set of training samples in the η -th window in Kalman smoothing;
Figure RE-GDA0002456128470000079
the t group of hidden variables in η th window after Kalman smoothing;
Figure RE-GDA00024561284700000710
is the variance of t group hidden variables in η th window after Kalman smoothing
Figure RE-GDA00024561284700000711
And finally, calculating the estimated values of the first medium and second medium statistics of the hidden variables:
Figure RE-GDA00024561284700000712
Figure RE-GDA00024561284700000713
Figure RE-GDA00024561284700000714
and (3) updating the parameters of the model in the step M of the η th window, wherein the specific steps are as follows:
Figure RE-GDA00024561284700000715
Figure RE-GDA00024561284700000716
Figure RE-GDA00024561284700000717
Figure RE-GDA00024561284700000718
whereby through successive iterations, eventually when the likelihood function converges, the optimal set of parameters for the window is recorded
Figure RE-GDA00024561284700000719
The fourth step: collecting easily measured process variables in the online operation process of the debutanizer to form an online sample, and calculating the global weight lambda of the online sample in an original space and the global weight phi of the online sample in a hidden space.
The fifth step: taking the process variable of the training sample set, the hidden space global weight phi of the online sample and the original space global weight lambda of the online sample as the input of a local weighted linear dynamic system model, and solving through an expectation maximization algorithm to obtain an optimal parameter set, wherein in the step E of the expectation maximization algorithm, the specific calculation steps are as follows:
firstly, Kalman filtering operation is required, and a specific calculation formula is as follows:
Vn=AFn-1ATh(30)
Kn=VnBT(BVnBTv)-1(31)
Figure RE-GDA0002456128470000081
Fn=(I-KnB)Vn(33)
wherein, KnA Kalman gain matrix for the nth set of training samples; vnIs an estimate of the nth set of latent variable variances; h isnTo finally countCalculating the nth group of hidden variables; fnThe variance of the nth group of hidden variables is obtained through final calculation. N, N is the total number of the training sample set, and then kalman smoothing operation is required, wherein a specific calculation formula is as follows:
Jn=(AFn)T·(AFnATh)-1(34)
Figure RE-GDA0002456128470000082
Figure RE-GDA0002456128470000083
wherein, JnA gain matrix for the nth set of hidden variables;
Figure RE-GDA0002456128470000084
the nth group of hidden variables is subjected to Kalman smoothing;
Figure RE-GDA0002456128470000085
is the variance of the nth group of hidden variables in the η th window after Kalman smoothing.
And in Kalman smoothing there are
Figure RE-GDA0002456128470000086
And finally, calculating the estimated values of the first medium and second medium statistics of the hidden variables:
Figure RE-GDA0002456128470000087
Figure RE-GDA0002456128470000088
Figure RE-GDA0002456128470000089
and M, updating the parameters of the model, and specifically comprising the following steps:
Figure RE-GDA00024561284700000810
Figure RE-GDA00024561284700000811
Figure RE-GDA00024561284700000812
Figure RE-GDA00024561284700000813
whereby through successive iterations, eventually when the likelihood function converges, an optimal set of parameters is recorded
Figure RE-GDA00024561284700000814
And a sixth step: inserting the online samples into the sampling time of the online samples according to the sequence of the sampling time; and after the training sample closest to the sampling instant of the online sample. And calculating to obtain the hidden variable value of the process variable of the online sample in the hidden space by a Kalman filtering method. The specific calculation formula is as follows:
V*=AF*A+Σh
K=V*BT(BV*BTv)-1
hnew=Ah*+K(vnew-BAh*)
the seventh step: the specific steps for predicting the key variables of the online samples are as follows, and the calculation methods of each group of online samples are consistent:
first, a weighted average of the key variables of the online sample with respect to λ is calculated, and the weighted average is subtracted from each key variable of the training sample set:
Figure RE-GDA0002456128470000091
Figure RE-GDA0002456128470000092
secondly, predicting key variables of the online samples through local weighted linear regression:
b=(Yλ*hT)(hλhT)-1
Figure RE-GDA0002456128470000093
where b is the regression parameter of the locally weighted linear regression, ynewIs the key variable of the online sample obtained by final prediction.
The effectiveness of the invention is illustrated below by a specific debutanizer example. For the process, a total of 2000 sets of process variables of the tobutane tower data are collected and the values of the key variables, namely the butane content values, are obtained through off-line analysis. 400 groups of data are selected for modeling, and 200 groups of data acquired additionally are used as online samples for verifying the effectiveness of the samples. In the process, 7 process variables are selected for soft measurement modeling, wherein the 7 process variables are tower top pressure, tower top temperature, sensitive plate temperature, next-stage flow, tower bottom temperature, tower bottom pressure and reflux flow. The following detailed description of the steps of the present invention is provided in conjunction with the specific process:
the 400 groups of training sample sets are divided into a plurality of sub-training sets through a sliding window, and normalization processing is carried out on each sub-training set.
And establishing a linear dynamic system model for each sub-training set, and recording parameters of the model.
1. According to the method given in the implementation step, the global weight of each group of online samples in the original space relative to the training samples and the weight of the original samples in the hidden space are calculated.
2. And establishing a local weighted linear dynamic system model for a group of online samples and two global weights of the online samples relative to the training samples, and recording parameters of the model.
3. Calculating the variable value of each group of online samples in the hidden space according to the model parameters recorded in the step 2 and the method given in the implementation step
4. And calculating the predicted value of the butane content of each group of online samples by a local weighted linear regression method.
5. Repeat 2 through 4 until the butane content values for all online samples are calculated.
Fig. 1 shows an online predicted value of butane content, a blue line is a value obtained by offline assay analysis of butane content, a red scatter point is an online estimated butane content value obtained by model prediction, the closer the red scatter point is to the blue line, the better the explanation effect is, and the predicted root mean square error is 0.0307. Compared with the traditional soft measurement method, the method has the advantages that the dynamic characteristic and the nonlinear characteristic of data are well considered through introducing the sliding window, calculating the weight and weighting the linear dynamic system model, online prediction is carried out, and the butane content value is difficult to measure, so that the soft measurement result is more reliable.
Fig. 2 is a flow chart of the method.
The above-described embodiments are intended to illustrate rather than to limit the invention, and any modifications and variations of the present invention are within the spirit of the invention and the scope of the appended claims.

Claims (6)

1. The dynamic process soft measurement modeling method based on the local weighted linear dynamic system is characterized by comprising the following steps of:
step 1: and collecting offline data of the debutanizer as a training sample set, wherein the training sample set comprises a plurality of groups of training samples, and each group of training samples comprises a plurality of process variables and a key variable which are observed at the same time. The process variables are flow values, pressure and temperature of different parts in the operation process of the debutanizer; the key variable is the butane content value at that time obtained by off-line assay analysis.
Step 2: and introducing sliding windows, traversing the training sample set by the sliding windows with a fixed step length to obtain a plurality of sliding windows, wherein each sliding window comprises process variables of a plurality of groups of training samples, the process variables of the plurality of groups of training samples contained in each sliding window are defined as sub-training sets of the sliding window, and the sub-training sets in different sliding windows are not identical. And establishing a linear dynamic system model for the sub-training set under each window to obtain model parameters calculated by the sub-training set under each window and a plurality of hidden variable values of each group of training samples in a hidden space.
And step 3: online data of the debutanizer is collected as a test sample set, the test sample set including a plurality of sets of online samples, each set of online samples including a plurality of process variables at the same time. The process variable name and number of the online sample are consistent with the process variable of the offline sample. Calculating the similarity of each group of online samples in the original space relative to the training sample set; and calculating the similarity of each group of online samples in the hidden space relative to each sub training set. And finally, calculating to obtain the global similarity of each group of online samples in the original space and the global weight of the hidden space relative to the training sample set.
And 4, step 4: establishing a local weighted linear dynamic system model, taking the process variables of the training sample set, the hidden space global weight of the online samples and the original space global weight of the online samples as the input of the model, and obtaining the parameters of the model under the model.
And 5: and extracting the training sample closest to the online sample at the sampling time in the training sample set, and calculating to obtain the hidden variable value of the online sample by a Kalman filtering method.
Step 6: and establishing a local weighted linear regression model according to the model parameters obtained by the local weighted linear dynamic system model and the training sample set to obtain model parameters and predict key variable values of the online sample, namely the butane content value.
2. The dynamic process soft measurement modeling method based on the local weighted linear dynamic system as claimed in claim 1, wherein the processing method of each sliding window in the step 2 is the same, and the processing method of the η th sliding window is specifically:
2.1) carrying out normalization processing on the sub-training set in the window, and establishing a linear dynamic system model for the normalized sub-training set:
the linear dynamic system model is represented as follows:
Figure RE-FDA0002456128460000021
Figure RE-FDA0002456128460000022
representing the t group of hidden variables under the linear dynamic system model in the η th window;
Figure RE-FDA0002456128460000023
representing the process variable of the t-th set of training samples in the η -th window;
Aη∈RH×Hrepresenting a state transition probability matrix under the linear dynamic system model in the η th window;
Bη∈RV×Hrepresenting the emission probability matrix under the linear dynamic system model in the η th window;
Figure RE-FDA0002456128460000024
representing the noise of the t group of hidden variables under the linear dynamic system model in the η th window;
Figure RE-FDA0002456128460000025
noise representing the process variable of the tth set of training samples in the η th window;
v is the number of process variables in a set of training samples, H is the number of variables in a set of hidden variables, T is 1, 2.
Noise(s)
Figure RE-FDA0002456128460000026
And
Figure RE-FDA0002456128460000027
are subject to a gaussian distribution,
Figure RE-FDA0002456128460000028
Figure RE-FDA0002456128460000029
and
Figure RE-FDA00024561284600000210
is the covariance of the implicit variable and the process variable in the η th window.
2.2) Linear dynamic System model by establishing the likelihood function of the model and maximizing the likelihood function by Kalman filtering, Kalman smoothing and expectation maximization (EM algorithm), when the likelihood function converges, recording the parameters of the model under the η th window
Figure RE-FDA00024561284600000211
Wherein
Figure RE-FDA00024561284600000212
Figure RE-FDA00024561284600000213
The optimal estimated value of the t group of hidden variables under the linear dynamic system model in the η th window is obtained.
3. The dynamic process soft measurement modeling method based on the local weighted linear dynamic system as claimed in claim 1, wherein the processing method of the group of online samples in step 3 in each window is consistent, and the processing method of the online samples in η th window is specifically:
3.1) calculating the similarity of the process variables of the training sample set in relation to the online samples in the original space:
Figure RE-FDA00024561284600000214
wherein v isnProcess variables of an nth set of training samples in the set of training samples; v. ofnewIs a process variable of the online sample;
Figure RE-FDA00024561284600000215
representing the similarity of the process variables of the nth set of training samples in the original space with respect to the online samples; n is the number of groups in the training sample set.
Secondly, the global weight of the training sample set in the original space relative to the online sample is calculated:
Figure RE-FDA0002456128460000031
wherein λ isnGlobal weights for the nth training sample in the original space with respect to the online samples; zetavControlling parameters for the weights of the original space;
3.2) calculating the global weight of the online samples in the hidden space relative to the process variable of the training sample set:
firstly, computing the hidden variable of the online sample in an η th window by a projection method, wherein the computing method of the online sample in each window is the same:
Figure RE-FDA0002456128460000032
wherein v isnewIs an online sample; mu.sηIs the mean value of the sub-training set in the η th windowηThe variance of the sub-training set in the η th window, BηAnd
Figure RE-FDA0002456128460000033
model parameters obtained by training a linear dynamic system model in the η th window;
Figure RE-FDA0002456128460000034
in the η th window, the online sample is projected to obtain a hidden variable.
Secondly, calculating the global weight of the process variable of the sub-training set in the η th window in the hidden space of the online sample:
Figure RE-FDA0002456128460000035
wherein,
Figure RE-FDA0002456128460000036
and (3) the similarity between the variable values of the t-th group of training samples in the hidden space in the η th window and the hidden variables obtained after the online samples are projected in the η th window.
Then summing the similarity obtained by each group of training samples in different windows to obtain the global similarity of the online samples in the hidden space relative to the training sample set:
Figure RE-FDA0002456128460000037
wherein, thetan,iSimilarity for the nth set of training samples with respect to the online samples in the ith window; i is 1,2,. Γ, i is the number of window sequences that contain the nth set of training samples, and Γ is the total number of all sliding windows that contain the nth set of training samples.
And finally, calculating the global weight of the training sample set in the hidden space relative to the online sample:
Figure RE-FDA0002456128460000038
therein, ζhA weight control parameter; phi is anThe global weight of the nth training sample in the hidden space with respect to the online samples is used.
4. The method of claim 1 for modeling dynamic process soft measurements based on a locally weighted linear dynamic system, wherein: the local weighted linear dynamic system model in step 4 is as follows:
Figure RE-FDA0002456128460000041
a and B are a state transition probability matrix and an emission probability matrix under a weighted linear dynamic system model; h isnHidden variable values of process variables of the nth group of training samples in the training sample set in a hidden space; v. ofnProcess variables of an nth set of training samples in the set of training samples; a isn,bnAre respectively hnAnd vnThe noise of (2).
Noise anAnd bnAll obey Gaussian distribution, an~N(0,Σh),bn~N(0,Σv);ΣhSum-sigmavIs the covariance of the hidden variable and the process variable in the model.
The input of the model is the process variable of the training sample set, and the global weight in the original space is (lambda)12,...,λN)∈R1×NGlobal weight in implicit space phi ═ phi (phi)12,...,φN)∈R1×N
By establishing a likelihood function for the model and maximizing the likelihood function by Kalman filtering, Kalman smoothing and expectation maximization (EM algorithm), the parameters of the model are recorded when the likelihood function converges
Figure RE-FDA0002456128460000042
Wherein
Figure RE-FDA0002456128460000043
Figure RE-FDA0002456128460000044
The variance for each set of hidden variables.
5. The method of claim 1 for modeling dynamic process soft measurements based on a locally weighted linear dynamic system, wherein: in step 5, a specific variable value formula of the online sample in the hidden space is obtained through Kalman filtering:
V*=AF*A+Σh(9)
K=V*BT(BV*BTv)-1(10)
hnew=Ah*+K(vnew-BAh*) (11)
wherein h is*And F*The training samples, which are closest to the online samples at the sampling instant, the variable values and variances in the implicit space,
Figure RE-FDA0002456128460000045
k is a Kalman gain matrix; h isnewAnd calculating the hidden variable value of the online sample in the hidden space for the online sample by a Kalman filtering method.
6. The method of claim 1 for modeling dynamic process soft measurements based on a locally weighted linear dynamic system, wherein: the specific steps for predicting the key variables of a group of online samples in the step 6 are as follows, and the calculation methods of each group of online samples are consistent:
first, a weighted average of the key variables of the online sample with respect to λ is calculated, and the weighted average is subtracted from each key variable of the training sample set:
Figure RE-FDA0002456128460000046
Figure RE-FDA0002456128460000047
wherein Y is (Y)1,y2,...yN)∈R1×NIs a key variable in the training sample set;
Figure RE-FDA0002456128460000051
is a weighted average.
Figure RE-FDA0002456128460000052
And subtracting the key variable of the training sample set after the weighted average value is subtracted from the nth group.
Secondly, predicting key variables of the online samples through local weighted linear regression:
b=(Yλ*hT)(hλ*hT)-1(14)
Figure RE-FDA0002456128460000053
where b is the regression parameter of the locally weighted linear regression, ynewIs the key variable of the online sample obtained by final prediction. Lambda [ alpha ]*Is a matrix of lambda after diagonalization, lambda*∈RN×N
CN201911094779.7A 2019-11-11 2019-11-11 Dynamic process soft measurement modeling method based on local weighted linear dynamic system Pending CN111291020A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911094779.7A CN111291020A (en) 2019-11-11 2019-11-11 Dynamic process soft measurement modeling method based on local weighted linear dynamic system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911094779.7A CN111291020A (en) 2019-11-11 2019-11-11 Dynamic process soft measurement modeling method based on local weighted linear dynamic system

Publications (1)

Publication Number Publication Date
CN111291020A true CN111291020A (en) 2020-06-16

Family

ID=71025669

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911094779.7A Pending CN111291020A (en) 2019-11-11 2019-11-11 Dynamic process soft measurement modeling method based on local weighted linear dynamic system

Country Status (1)

Country Link
CN (1) CN111291020A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112562797A (en) * 2020-11-30 2021-03-26 中南大学 Method and system for predicting outlet ions in iron precipitation process

Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060155486A1 (en) * 2004-10-07 2006-07-13 Walsh Alicia M Computer-implemented system and method for analyzing mixtures of gases
CN103440368A (en) * 2013-08-12 2013-12-11 上海交通大学 Multi-model dynamic soft measuring modeling method
CN103927412A (en) * 2014-04-01 2014-07-16 浙江大学 Real-time learning debutanizer soft measurement modeling method on basis of Gaussian mixture models
CN104239489A (en) * 2014-09-05 2014-12-24 河海大学 Method for predicting water level by similarity search and improved BP neural network
CN105868164A (en) * 2016-03-19 2016-08-17 浙江大学 Soft measurement modeling method based on monitored linear dynamic system model
CN106056127A (en) * 2016-04-07 2016-10-26 江南大学 GPR (gaussian process regression) online soft measurement method with model updating
CN106682312A (en) * 2016-12-28 2017-05-17 浙江大学 Industrial process soft-measurement modeling method of local weighing extreme learning machine model
CN107391851A (en) * 2017-07-26 2017-11-24 江南大学 A kind of glutamic acid fermentation process soft-measuring modeling method based on core ridge regression
CN108549732A (en) * 2017-12-19 2018-09-18 中南大学 Roller Conveying Kiln for Temperature soft-measuring modeling method based on local secondary Weighted Kernel principal component regression
CN108763362A (en) * 2018-05-17 2018-11-06 浙江工业大学 Method is recommended to the partial model Weighted Fusion Top-N films of selection based on random anchor point
CN108898220A (en) * 2018-06-11 2018-11-27 北京工业大学 Sewage treatment is discharged TP interval prediction method
CN108984851A (en) * 2018-06-22 2018-12-11 江南大学 A kind of Weighted Gauss model soft-measuring modeling method with time delay estimation
CN109255186A (en) * 2018-09-12 2019-01-22 浙江大学 A kind of industrial process flexible measurement method based on output constraint AP-XGBOOST model
CN109325065A (en) * 2018-12-04 2019-02-12 浙江科技学院 Multi-sampling rate flexible measurement method based on dynamic latent variable model
CN110110209A (en) * 2018-01-22 2019-08-09 青岛科技大学 A kind of intersection recommended method and system based on local weighted linear regression model (LRM)
CN110320806A (en) * 2019-07-24 2019-10-11 东北大学 Sewage disposal process adaptive prediction control method based on integrated instant learning

Patent Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060155486A1 (en) * 2004-10-07 2006-07-13 Walsh Alicia M Computer-implemented system and method for analyzing mixtures of gases
CN103440368A (en) * 2013-08-12 2013-12-11 上海交通大学 Multi-model dynamic soft measuring modeling method
CN103927412A (en) * 2014-04-01 2014-07-16 浙江大学 Real-time learning debutanizer soft measurement modeling method on basis of Gaussian mixture models
CN104239489A (en) * 2014-09-05 2014-12-24 河海大学 Method for predicting water level by similarity search and improved BP neural network
CN105868164A (en) * 2016-03-19 2016-08-17 浙江大学 Soft measurement modeling method based on monitored linear dynamic system model
CN106056127A (en) * 2016-04-07 2016-10-26 江南大学 GPR (gaussian process regression) online soft measurement method with model updating
CN106682312A (en) * 2016-12-28 2017-05-17 浙江大学 Industrial process soft-measurement modeling method of local weighing extreme learning machine model
CN107391851A (en) * 2017-07-26 2017-11-24 江南大学 A kind of glutamic acid fermentation process soft-measuring modeling method based on core ridge regression
CN108549732A (en) * 2017-12-19 2018-09-18 中南大学 Roller Conveying Kiln for Temperature soft-measuring modeling method based on local secondary Weighted Kernel principal component regression
CN110110209A (en) * 2018-01-22 2019-08-09 青岛科技大学 A kind of intersection recommended method and system based on local weighted linear regression model (LRM)
CN108763362A (en) * 2018-05-17 2018-11-06 浙江工业大学 Method is recommended to the partial model Weighted Fusion Top-N films of selection based on random anchor point
CN108898220A (en) * 2018-06-11 2018-11-27 北京工业大学 Sewage treatment is discharged TP interval prediction method
CN108984851A (en) * 2018-06-22 2018-12-11 江南大学 A kind of Weighted Gauss model soft-measuring modeling method with time delay estimation
CN109255186A (en) * 2018-09-12 2019-01-22 浙江大学 A kind of industrial process flexible measurement method based on output constraint AP-XGBOOST model
CN109325065A (en) * 2018-12-04 2019-02-12 浙江科技学院 Multi-sampling rate flexible measurement method based on dynamic latent variable model
CN110320806A (en) * 2019-07-24 2019-10-11 东北大学 Sewage disposal process adaptive prediction control method based on integrated instant learning

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
XIAOFENG YUAN: ""Deep learning based feature representation and its application for soft sensor modeling with variable-wise weighted SAE"", 《IEEE》, 28 February 2018 (2018-02-28) *
朱鹏飞: ""氯乙烯聚合过程建模与质量控制方法研究"", 《中国优秀博硕士学位论文全文数据库(博士) 信息科技辑》, 15 July 2016 (2016-07-15) *
袁小锋: ""基于即时学习的复杂非线性过程软测量建模及应用"", 《中国优秀博硕士学位论文全文数据库(博士) 信息科技辑》, 15 August 2017 (2017-08-15) *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112562797A (en) * 2020-11-30 2021-03-26 中南大学 Method and system for predicting outlet ions in iron precipitation process
CN112562797B (en) * 2020-11-30 2024-01-26 中南大学 Method and system for predicting outlet ions in iron precipitation process

Similar Documents

Publication Publication Date Title
CN106572493B (en) Rejecting outliers method and system in LTE network
CN105425779B (en) ICA-PCA multi-state method for diagnosing faults based on local neighborhood standardization and Bayesian inference
CN109829136B (en) Method and system for predicting residual life of degradation equipment with random jump
CN109409425B (en) Fault type identification method based on neighbor component analysis
CN109085805B (en) Industrial process fault detection method based on multi-sampling-rate factor analysis model
CN110757510B (en) Method and system for predicting remaining life of robot
CN107085750A (en) A kind of mixing dynamic fault Forecasting Methodology based on ARMA and ANN
CN111030889B (en) Network traffic prediction method based on GRU model
CN112487694B (en) Complex equipment residual life prediction method based on multiple degradation indexes
CN113012766A (en) Self-adaptive soft measurement modeling method based on online selective integration
CN107220500B (en) Bayesian reliability evaluation method for performance degradation test based on inverse Gaussian process
CN112163624A (en) Data abnormity judgment method and system based on deep learning and extreme value theory
CN113094893A (en) Wafer quality virtual measurement method and device, computer equipment and storage medium
CN112116002A (en) Determination method, verification method and device of detection model
CN111639304B (en) CSTR fault positioning method based on Xgboost regression model
WO2023236387A1 (en) Method and apparatus for predicting element information, and device and medium
CN111079348B (en) Method and device for detecting slowly-varying signal
CN105550457A (en) Dynamic evolution model correction method and system
CN111291020A (en) Dynamic process soft measurement modeling method based on local weighted linear dynamic system
CN103279030A (en) Bayesian framework-based dynamic soft measurement modeling method and device
CN108647897A (en) A kind of method and system of product reliability analysis
WO2024060287A1 (en) Blast furnace temperature prediction method, terminal device, and storage medium
US20220381832A1 (en) Production of a Quality Test System
CN110288724B (en) Batch process monitoring method based on wavelet function principal component analysis
CN111160464B (en) Industrial high-order dynamic process soft measurement method based on multi-hidden-layer weighted dynamic model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
AD01 Patent right deemed abandoned

Effective date of abandoning: 20221230

AD01 Patent right deemed abandoned