CN101872444A

CN101872444A - Batch-to-batch optimization method of batch process by combining medium-term correction strategy

Info

Publication number: CN101872444A
Application number: CN 201010180634
Authority: CN
Inventors: 葛铭; 李春富; 魏江; 郑小青; 郑松
Original assignee: Hangzhou Dianzi University
Current assignee: Hangzhou Kuntian Automation System Co ltd
Priority date: 2010-05-21
Filing date: 2010-05-21
Publication date: 2010-10-27
Anticipated expiration: 2030-05-21
Also published as: CN101872444B

Abstract

The invention relates to a batch-to-batch optimization method of a batch process by combining a medium-term correction strategy. The method comprises the following steps: firstly establishing a quality variable predictive model of an NLPLS, and carrying out prediction on final product quality according to control operation variables of the process; on the basis of the model, calculating an optimal control strategy and implementing the optimal control strategy on a practical device; adopting a recurrence algorithm to carry out updating on the original NLPLS model according the newly-obtained data and old model parameters after each batch is finished; then solving the optimal control strategy again and implementing the optimal control strategy on an object; generally, after several batches, leading the control strategy to converge a satisfactory solution; and simultaneously, in order to process the interference in batches, adopting the medium-term correction strategy, utilizing new information obtained by the current batch to carry out correction on the latter control strategy. The method combines the batch-to-batch optimization and the medium-term correction strategy, makes up the insufficiency that the traditional batch-to-batch optimization method can not process the interference in batches and improves the control performance.

Description

A kind of batch-to-batch optimization method of batch process in conjunction with the correction strategy in mid-term

Technical field

The invention belongs to areas of information technology, relate to a kind of batch-to-batch optimization method of batch process in conjunction with the correction strategy in mid-term.

Background technology

Along with the popularization of quick manufacturing technology, be applicable to that the batch process of production short run high value added product more and more comes into one's own.In order to obtain maximum economic interests, answer the optimizing process operation.In batch process, a lot of quality index can not on-line measurement, normally after a batch of end, judges last product quality quality according to product sampling analysis value, thereby next batch is adjusted, and the product quality of this batch can't change.In order better to control product quality, need set up mechanism or statistical model to batch process, according to the control operation variable of on-line measurement product quality is predicted.Set up detailed mechanism model and expend time in very much usually and energy, and need understand very much mechanism.In order to address this problem, more and more used based on the statistical model of data.When setting up statistical model, owing to the training data of gathering is limited or of low quality, there is change in the process operation condition simultaneously, has mismatch usually between the object of the model of foundation and reality.Therefore " optimum " control strategy that calculates from institute's established model often is not optimum when acting on practical object.Because the repeatability of batch process operation can be according to the operation that improved next batch in the past with current batch information.

Summary of the invention

Purpose of the present invention is exactly the weak point at existing batch process optimisation technique, a kind of batch-to-batch optimization method of batch process in conjunction with the correction strategy in mid-term is proposed, utilizing batch-to-batch optimization to obtain on the basis of current optimal control policy, utilize current batch of information that is obtained, adopt the correction strategy in mid-term, control strategy to the back is revised, thereby can handle current batch process interference, has improved control performance.

Technical scheme of the present invention is by means such as data acquisition, data-driven, process optimizations, at first, last product quality is predicted according to the control operation variable of process based on the quality variable forecast model of process database foundation based on non-linear partial least square (NLPLS).Based on this model, calculate optimal control policy and on actual device, implement.In order to solve model and object mismatch and to have the problem of unknown disturbance, adopt recursive algorithm, after each batch end, former NLPLS model is upgraded according to the data and the old model parameter that newly obtain.Then, find the solution optimal control policy and on object, implementing again.Usually through several batches, control strategy will converge to a satisfactory solution.Simultaneously, adopt the correction strategy in mid-term, utilize current batch of fresh information that is obtained that the control strategy of back is revised, thereby improved control performance for the interference in handling batch.

The concrete steps of the inventive method are:

Step (1) is set up based on non-linear partial least square (NLPLS) quality variable forecast model based on process database, and concrete grammar is:

A. by data collector gatherer process service data, with the process operation data of the gathering sample set as data-driven, as input, the end product quality variable is used to set up NLPLS quality variable forecast model as output with the control operation variable; The data of each batch are to being expressed as { x (k) } and { y (k) }, and x (k) represents k batch of control operation variable data, y (k) expression k batch products quality variable data; To import data constitutes input matrix X, output data is constituted output matrix Y;

B. set up NLPLS quality variable forecast model based on inputoutput data, method is:

Matrix X and Y are carried out normalized, and making it average is 0, and variance is 1; Then input matrix is listed as expansion, the expansion item is 1 column vector 1 entirely for the latent node output matrix G of radial basis function (RBF) neural network and element, the output g of the corresponding input vector effect of each row latent node down of G wherein, the bias term coefficient that conceals node is 1; Following augmentation input matrix and output matrix are carried out partial least square (PLS) recurrence:

The NLPLS quality variable forecast model that obtains is expressed as for [1 X G], Y}:

\hat{Y} = XA + GH + 1 b^{T} = [1 X G] [\begin{matrix} b^{T} \\ A \\ H \end{matrix}] = X_{E} β - - - (1)

In the formula (1), X _EExpression augmentation input matrix, A and H are respectively the weights matrix of coefficients of corresponding original input vector and the latent node output vector of corresponding RBF network, and b is the output offset vector, and T represents transposition.

Unknown parameter in the NLPLS quality variable forecast model is latent node center vector c, respective width vector σ, weights coefficient matrices A and H, model bias vector b, and these parameters are determined as follows:

1. with the k-means clustering algorithm input data are carried out cluster, obtain latent node center c; This algorithm can be determined optimum cluster centre number, simultaneously cluster centre reasonably is distributed in the data space.

2. adopt p neighbour rule to calculate latent node width:

σ_{j} = \sqrt{\frac{1}{p} Σ_{i = 1}^{p} {| | c_{i} - c_{j} | |}^{2}}, j = 1, \cdot \cdot \cdot, N - - - (2)

Wherein N is the number of latent node center, c _iBe p nearest latent node center of j latent node center of distance.

3. adopt PLS to return and determine weights coefficient matrices A, H and bias vector b:

Calculate latent node output matrix G according to latent node center that obtains and width, then input matrix is expanded, obtain augmentation input matrix [1 X G].To data to { [1 X G], Y} carry out PLS and return, and obtain PLS model parameter matrix { T, W, P, B, Q}.In order in the model modification of back, to keep all information, extract the order that the characteristic variable number equals augmentation input matrix [1 X G], and the proper vector number a that model kept that finally is used to predict adopts cross validation method to determine, the parameter matrix that obtains is designated as { Ta, Wa, Pa, Ba, Qa} calculates PLS regression coefficient matrix β by them, thereby obtain A, H and b.

The NLPLS model note of the quality index that the process control performance variable prediction that the application of above-mentioned foundation is all is last is made model I.If constantly can obtain the quality variable measured value in the middle of some in course of reaction, can utilize them to come the Correction and Control variable, to improve control performance: measurements moment θ in each _i, set up two other NLPLS model, note is made model II and model III respectively.Moment θ in the middle of model II is used for predicting _iQuality variable, input variable comprises from reaction and beginning to moment θ _iThe measured value and the θ constantly of all process variable _iPreceding quality variable measured value.Model III is used for the pre-measured reaction quality variable when finishing, and input variable comprises all control operation variablees and begins to moment θ from reaction _iAll quality variable measured values.

Step (2) calculates initial optimal control policy according to the model I that obtains in the step (1), and concrete grammar is:

The target of batch process optimization is normally sought one group of control variable makes certain objective function minimize, and this objective function adopts following mathematical form to describe normally about the function of the reaction quality variable finish time:

\min_{u_{k}} {[y_{sp} - {\hat{y}}_{k} (t_{f})]}^{T} Q_{1} [y_{sp} - {\hat{y}}_{k} (t_{f})] + {Δu}_{k}^{T} Q_{2} {Δu}_{k} - - - (3)

Wherein, t _fBe the reaction time, u _kBe the control variable that needs are optimized, Δ u _kVariable quantity for control variable is defined as: Δ u _k=u _k-u _K-1, y _SpBe the setting value of end product quality variable,

Be the predicted value of building soft-sensing model to the end product quality variable, soft-sensing model is with u _kInput as model.Q ₁And Q ₂Be the diagonal angle weighting matrix.Second in the following formula is in order to limit the change of control variable, too big thereby output variable can not fluctuate between former and later two batches.In order to reflect physical restriction, can introduce hard restriction: u to control variable _Min≤ u _k≤ u _MaxFormula (3) is found the solution the control variable that is optimized, concrete ripe optimization method, for example seqential quadratic programming (SQP) algorithm of adopting.

Step (3) is implemented the optimal control policy that obtains on new batch.When at θ _iWhen constantly obtaining the quality variable measured value, itself and the predicted value that is obtained by model II are compared:

If the predicated error of model II is greater than setting threshold, there is bigger variation in declarative procedure, and using original optimal control policy like this can not be less than the end product quality of optimum.At this moment, employing correction strategy in mid-term is adjusted performance variable, makes end product quality get back to desired value, and concrete grammar is: adopt the optimization method in the step (2), with model III substitution model I, recomputate θ _iOptimum control performance variable constantly applies it to current batch then.

If the predicated error of model II is smaller or equal to setting threshold, declarative procedure changes little, and model I just may provide accurately and predict the outcome, and needn't utilize model III that model is revised again.

Step (4) obtains actual end product quality variable after a batch of end.Utilize the new lot data that obtain in conjunction with original NLPLS model, adopt recursive algorithm to model I, II and III upgrade, and concrete grammar is:

If pass through in the NLPLS model that obtains after k-1 batch, the latent node center matrix of RBF network is

The corresponding center vector of each row; The respective width vector is

The width of the corresponding latent node of each element.{ W (k-1), P (k-1), B (k-1), Q (k-1) } is PLS model parameter matrix.After k batch end, obtain new input/output variable x (k) and y (k),

A. adopt with step (1) in the same method new data is carried out the data pre-service.Calculate the output vector of the latent node of former NLPLS model, be designated as g (k) for new samples x (k).

B. judge whether to increase new latent node:

If all elements of g (k) all less than setting value, then adds new latent node.New latent node center is taken as x (k), and corresponding width σ adopts the arest neighbors rule to calculate:

σ＝z _c-ησ _c (4)

Wherein, z _cBe the distance of x (k) to nearest latent node center, η is overlapping parameter, and span is [0,1], σ _cBe width from the nearest latent node of x (k).Thereby obtain new latent node center matrix and width vector:

C_{g}^{(k)} = [\begin{matrix} C_{g}^{(k - 1)} \\ x^{(k) T} \end{matrix}],

σ_{g}^{(k)} = [\begin{matrix} σ_{g}^{(k - 1)} \\ σ \end{matrix}]

As follows to parameter matrix P (k-1) and vectorial g (k) expansion simultaneously:

P (k - 1) = [\begin{matrix} P (k - 1) \\ 0 \end{matrix}],

g (k) = [\begin{matrix} g (k) \\ 1 \end{matrix}]

In the formula, 0 is that whole elements all are 0 row vector.

If all elements of g (k) all more than or equal to setting value, does not then need to increase latent node, C _g, σ _g, P, g remain unchanged.

C. x (k) is expanded, obtain augmentation input vector: x _E(k) ^T=[1x (k) ^TG (k) ^T].

D. with new data x _E(k) and y (k) combine with old PLS model parameter matrix, carry out PLS then and return, form is as follows:

X (k) = [\begin{matrix} {P (k - 1)}^{T} \\ x_{E} {(k)}^{T} \end{matrix}],

Y (k) = [\begin{matrix} {B (k - 1) Q (k - 1)}^{T} \\ {y (k)}^{T} \end{matrix}]

According to 3. method of step, calculate PLS regression parameter A (k), H (k) and b (k).Preserve new model parameter

For prediction with use during model modification next time.

Model after step (5) utilization is upgraded returns step (2), obtains new optimal control policy by finding the solution formula (3).

Above step batch between constantly repeat.Usually through several batches, control strategy will converge to optimum solution, and it is optimum that end product quality will reach.

The batch-to-batch optimization method of batch process in conjunction with the correction strategy in mid-term that the present invention proposes adopts recursive algorithm, according to the data and the old model parameter that newly obtain master mould is upgraded after each batch end.Then, find the solution optimal control policy and on object, implementing again.Solve model and object mismatch and had the problem of unknown disturbance.Utilize the current batch of fresh information that is obtained the control strategy of back is revised simultaneously, the interference in having overcome batch, thus improved control performance.

Embodiment

A kind of batch-to-batch optimization method of batch process in conjunction with the correction strategy in mid-term, concrete steps are:

\hat{Y} = XA + GH + 1 b^{T} = [1 X G] [\begin{matrix} b^{T} \\ A \\ H \end{matrix}] = X_{E} β - - - (1)

1. with the k-means clustering algorithm input data are carried out cluster, obtain latent node center c.

2. adopt p neighbour rule to calculate latent node width:

σ_{j} = \sqrt{\frac{1}{p} Σ_{i = 1}^{p} {| | c_{i} - c_{j} | |}^{2}}, j = 1, \cdot \cdot \cdot, N - - - (2)

Calculate latent node output matrix G according to latent node center that obtains and width, then input matrix is expanded, obtain augmentation input matrix [1 X G].To data to { [1 X G], Y} carry out PLS and return, and obtain PLS model parameter matrix { T, W, P, B, Q} extracts the order that the characteristic variable number equals augmentation input matrix [1 X G], and the proper vector number a that model kept that finally is used to predict adopts cross validation method to determine, the parameter matrix that obtains is designated as { Ta, Wa, Pa, Ba, Qa} calculates PLS regression coefficient matrix β by them, thereby obtain A, H and b.

\min_{u_{k}} {[y_{sp} - {\hat{y}}_{k} (t_{f})]}^{T} Q_{1} [y_{sp} - {\hat{y}}_{k} (t_{f})] + {Δu}_{k}^{T} Q_{2} {Δu}_{k} - - - (3)

Correction strategy is adjusted performance variable if the predicated error of model II, adopts mid-term greater than setting threshold, makes end product quality get back to desired value, and concrete grammar is: adopt the optimization method in the step (2), with model III substitution model I, recomputate θ _iOptimum control performance variable constantly applies it to current batch then.

The corresponding center vector of each row; The respective width vector is The width of the corresponding latent node of each element.{ W (k-1), P (k-1), B (k-1), Q (k-1) } is PLS model parameter matrix.After k batch end, obtain new input/output variable x (k) and y (k),

B. judge whether to increase new latent node:

σ＝z _c-ησ _c (4)

C_{g}^{(k)} = [\begin{matrix} C_{g}^{(k - 1)} \\ x^{(k) T} \end{matrix}],

σ_{g}^{(k)} = [\begin{matrix} σ_{g}^{(k - 1)} \\ σ \end{matrix}]

P (k - 1) = [\begin{matrix} P (k - 1) \\ 0 \end{matrix}],

g (k) = [\begin{matrix} g (k) \\ 1 \end{matrix}]

In the formula, 0 is that whole elements all are 0 row vector.

C. x (k) is expanded, obtain augmentation input vector: x _E(k) ^T=[1x (k) ^TG (k) _T].

X (k) = [\begin{matrix} {P (k - 1)}^{T} \\ x_{E} {(k)}^{T} \end{matrix}],

Y (k) = [\begin{matrix} {B (k - 1) Q (k - 1)}^{T} \\ {y (k)}^{T} \end{matrix}]

According to 3. method of step, calculate PLS regression parameter A (k), H (k) and b (k).Preserve new model parameter For prediction with use during model modification next time.

Claims

1. batch-to-batch optimization method of batch process in conjunction with the correction strategy in mid-term is characterized in that this method may further comprise the steps:

Step (1) is set up based on non-linear partial least square quality variable forecast model based on process database, and concrete grammar is:

A. by data collector gatherer process service data, with the process operation data of gathering sample set as data-driven, as input, the end product quality variable is used for setting up based on non-linear partial least square quality variable forecast model as output with the control operation variable; The data of each batch are to being expressed as { x (k) } and { y (k) }, and x (k) represents k batch of control operation variable data, y (k) expression k batch products quality variable data; To import data constitutes input matrix X, output data is constituted output matrix Y;

B. set up based on non-linear partial least square quality variable forecast model based on inputoutput data, method is:

Matrix X and Y are carried out normalized, and making it average is 0, and variance is 1; Then input matrix is listed as expansion, the expansion item is 1 column vector 1 entirely for the latent node output matrix G of radial basis function neural network and element, the output g of the corresponding input vector effect of each row latent node down of G wherein, the bias term coefficient that conceals node is 1; Following augmentation input matrix and output matrix are carried out the partial least square recurrence:

What obtain is expressed as based on non-linear partial least square quality variable forecast model for [1 X G], Y}:

\hat{Y} = XA + GH + 1 b^{T} = [\begin{matrix} 1 & X & G \end{matrix}] [\begin{matrix} b^{T} \\ A \\ H \end{matrix}] = X_{E} β - - - (1)

In the formula (1), X _EExpression augmentation input matrix, A and H are respectively the weights matrix of coefficients of corresponding original input vector and the latent node output vector of corresponding radial basis function neural network, and b is the output offset vector, and T represents transposition;

Be latent node center vector c, respective width vector σ, weights coefficient matrices A and H, model bias vector b based on the unknown parameter in the non-linear partial least square quality variable forecast model, these parameters are determined as follows:

1. with the k-means clustering algorithm input data are carried out cluster, obtain latent node center c;

2. adopt p neighbour rule to calculate latent node width:

σ_{i} = \sqrt{\frac{1}{p} Σ_{i = 1}^{p} {| | c_{i} - c_{j} | |}^{2}}, j = 1, \cdot \cdot \cdot, N - - - (2)

Wherein N is the number of latent node center, c _iBe p nearest latent node center of j latent node center of distance;

3. adopt partial least square to return and determine weights coefficient matrices A, H and bias vector b:

Calculate latent node output matrix G according to latent node center that obtains and width, then input matrix is expanded, obtain augmentation input matrix [1 X G]; To data to { [1 X G], Y} carry out partial least square and return, and obtain partial least square model parameter matrix { T, W, P, B, Q} extracts the order that the characteristic variable number equals augmentation input matrix [1 X G], and the proper vector number a that model kept that finally is used to predict adopts cross validation method to determine, the parameter matrix that obtains is designated as { Ta, Wa, Pa, Ba, Qa} calculates partial least square regression coefficient matrix β by them, thereby obtain A, H and b;

Model I is made in remembering based on non-linear partial least square quality variable forecast model of the quality index that the process control performance variable prediction that the application of above-mentioned foundation is all is last; Measurements moment θ in each _i, set up two other based on non-linear partial least square quality variable forecast model, note is made model II and model III respectively; Moment θ in the middle of model II is used for predicting _iQuality variable, input variable comprises from reaction and beginning to moment θ _iThe measured value and the θ constantly of all process variable _iPreceding quality variable measured value; Model III is used for the pre-measured reaction quality variable when finishing, and input variable comprises all control operation variablees and begins to moment θ from reaction _iAll quality variable measured values;

The target of batch process optimization is to seek one group of control variable to make certain objective function minimize, and adopts following mathematical form to describe:

\min_{u_{k}} {[y_{sp} - {\hat{y}}_{k} (t_{f})]}^{T} Q_{1} [y_{sp} - {\hat{y}}_{k} (t_{f})] + {Δu}_{k}^{T} Q_{2} Δ u_{k} - - - (3)

Be the predicted value of building soft-sensing model to the end product quality variable, soft-sensing model is with u _kInput as model; Q ₁And Q ₂Be the diagonal angle weighting matrix; Control variable is introduced hard restriction: u _Min≤ u _k≤ u _MaxFormula (3) is found the solution the control variable that is optimized;

Step (3) is implemented the optimal control policy that obtains on new batch; When at θ _iWhen constantly obtaining the quality variable measured value, itself and the predicted value that is obtained by model II are compared:

Correction strategy is adjusted performance variable if the predicated error of model II, adopts mid-term greater than setting threshold, makes end product quality get back to desired value, and concrete grammar is: adopt the optimization method in the step (2), with model III substitution model I, recomputate θ _iOptimum control performance variable constantly applies it to current batch then;

If the predicated error of model II is smaller or equal to setting threshold, model I promptly provides accurately and predicts the outcome;

Step (4) obtains actual end product quality variable after a batch of end; The new lot data that utilize to obtain based on non-linear partial least square quality variable forecast model, adopt recursive algorithm to model I in conjunction with original, and II and III upgrade, and concrete grammar is:

If through obtain after k-1 batch based in the non-linear partial least square quality variable forecast model, the latent node center matrix of radial basis function neural network is

The corresponding center vector of each row; The respective width vector is

The width of the corresponding latent node of each element; { W (k-1), P (k-1), B (k-1), Q (k-1) } is partial least square model parameter matrix; After k batch end, obtain new input/output variable x (k) and y (k);

A. adopt with step (1) in identical method new data is carried out the data pre-service; Calculate formerly, be designated as g (k) based on the output vector of the latent node of non-linear partial least square quality variable forecast model for new samples x (k);

B. judge whether to increase new latent node:

If all elements of g (k) all less than setting value, then adds new latent node; New latent node center is taken as x (k), and corresponding width σ adopts the arest neighbors rule to calculate:

σ＝z _c-ησ _c (4)

Wherein, z _cBe the distance of x (k) to nearest latent node center, η is overlapping parameter, and span is [0,1], σ _cBe width from the nearest latent node of x (k); Thereby obtain new latent node center matrix and width vector:

C_{g}^{(k)} = [\begin{matrix} C_{g}^{(k - 1)} \\ x^{(k) T} \end{matrix}], σ_{g}^{(k)} = [\begin{matrix} σ_{g}^{(k - 1)} \\ σ \end{matrix}]

P (k - 1) = [\begin{matrix} P (k - 1) \\ 0 \end{matrix}], g (k) = [\begin{matrix} g (k) \\ 1 \end{matrix}]

In the formula, 0 is that whole elements all are 0 row vector;

If all elements of g (k) all more than or equal to setting value, does not then need to increase latent node, C _g, σ _g, P, g remain unchanged;

C. x (k) is expanded, obtain augmentation input vector: x _E(k) ^T=[1x (k) ^TG (k) ^T];

D. with new data x _E(k) and y (k) divide least square model parameter matrix to combine with former subordinates, carry out partial least square then and return, form is as follows:

X (k) = [\begin{matrix} P {(k - 1)}^{T} \\ x_{E} {(k)}^{T} \end{matrix}], Y (k) = [\begin{matrix} B (k - 1) Q {(k - 1)}^{T} \\ y {(k)}^{T} \end{matrix}]

According to 3. method of step in the step (1), calculate partial least square regression parameter A (k), H (k) and b (k); Preserve new model parameter A (k), H (k), b (k), P (k), B (k), Q (k),

For prediction with use during model modification next time;