JP7154774B2

JP7154774B2 - Optimal control device, control method and computer program

Info

Publication number: JP7154774B2
Application number: JP2018033243A
Authority: JP
Inventors: 理山中; 祐太大西; 祐一中川
Original assignee: Toshiba Corp; Toshiba Infrastructure Systems and Solutions Corp
Current assignee: Toshiba Corp; Toshiba Infrastructure Systems and Solutions Corp
Priority date: 2018-02-27
Filing date: 2018-02-27
Publication date: 2022-10-18
Anticipated expiration: 2038-02-27
Also published as: JP2019148988A; WO2019167998A1; CN111788529A

Description

本発明の実施形態は、最適制御装置、制御方法及びコンピュータプログラムに関する。 Embodiments of the present invention relate to an optimum control device, control method, and computer program.

近年、プラント制御の方法として、極値制御と呼ばれる技術が注目されている。極値制御は、プラントの複雑なモデルを用いないモデルフリーのリアルタイム最適制御技術である。極値制御の概要は、操作量を強制的に変化させることにより、制御対象プロセスの制御量に基づく評価量が最適化される操作量を探索していくものである。このような極値制御をプラント制御に適用する場合、極値制御に係る各種のパラメータ（以下「制御パラメータ」という。）を制御対象プロセスの特性に応じて適切に設定する必要がある。従来、制御パラメータの設計に関する指針がいくつか示されているが、そのいずれも制御対象プロセスの時間的な変化（以下「ダイナミクス」という。）に適応して極値制御を安定的に動作させることができるまでには至っていない。 In recent years, a technique called extremum control has attracted attention as a method of plant control. Extremum control is a model-free real-time optimal control technique that does not use a complex model of the plant. The outline of extreme value control is to forcibly change the manipulated variable to search for the manipulated variable that optimizes the evaluation amount based on the controlled variable of the controlled process. When such extreme value control is applied to plant control, it is necessary to appropriately set various parameters (hereinafter referred to as "control parameters") related to extreme value control according to the characteristics of the process to be controlled. Conventionally, some guidelines for designing control parameters have been presented, but all of them are designed to adapt to temporal changes in the controlled process (hereinafter referred to as "dynamics") and to stably operate extremum control. It has not reached the point where it can be done.

特開２０１７－３３１０４号公報JP 2017-33104 A

D.Nesic et. al., ‘A Unifying Approach to Extremum Seeking: Adaptive Schemes Based on Estimation of Derivatives’, Proc. 49th IEEE Conference on Decision and Control, December 15-17, 2010D. Nesic et. al., 'A Unifying Approach to Extremum Seeking: Adaptive Schemes Based on Estimation of Derivatives', Proc. 49th IEEE Conference on Decision and Control, December 15-17, 2010 W.H.Moase et al, ‘Newton-Like Extremum-Seeking Part I: Theory’, Proc. Joint 48th IEEE Conference on Decision and Control and 28th Chinese Control Conference, December 16-18, 2009W.H.Moase et al, 'Newton-Like Extremum-Seeking Part I: Theory', Proc. Joint 48th IEEE Conference on Decision and Control and 28th Chinese Control Conference, December 16-18, 2009 Yan et al, On the choice of dither in extremum seeking systems:A case study, Automatica, 44, pp.1446-1450 (2008)Yan et al, On the choice of dither in extremum seeking systems: A case study, Automatica, 44, pp.1446-1450 (2008)

本発明が解決しようとする課題は、制御対象プロセスのダイナミクスに適応して極値制御をより安定的に動作させることができる最適制御装置、制御方法及びコンピュータプログラムを提供することである。 The problem to be solved by the present invention is to provide an optimum control device, a control method, and a computer program that can adapt to the dynamics of a process to be controlled and operate extremum control more stably.

実施形態の最適制御装置は、制御対象プロセスの操作量と、前記操作量に応じて変化する制御量に基づく前記制御対象プロセスの最適化に関する指標を示す評価量とに基づいて、前記評価量が最適値に向かうように前記操作量を変化させる極値制御を実行する制御装置である。最適制御装置は、勾配推定部と、補正部と、を持つ。勾配推定部は、前記制御対象プロセスに関して観測される前記評価量に基づいて、前記評価量を表す関数であって前記操作量に対して未知の関数である評価関数の変化率を示す勾配を推定する。補正部は、前記勾配推定部によって取得された前記勾配の推定値に基づいて、前記極値制御の実行に必要な制御パラメータ、前記操作量又は前記評価量を、前記評価関数の変化に適応して補正する。 The optimum control device according to the embodiment is configured such that the evaluation amount is determined based on the operation amount of the controlled process and the evaluation amount indicating the index related to the optimization of the controlled process based on the control amount that changes according to the operation amount. The control device executes extreme value control for changing the manipulated variable so as to move toward the optimum value. The optimum controller has a gradient estimator and a corrector. The gradient estimating unit estimates a gradient indicating a rate of change of an evaluation function, which is a function representing the evaluation quantity and unknown with respect to the manipulated variable, based on the evaluation quantity observed for the controlled process. do. The correction unit adapts the control parameter, the manipulated variable, or the evaluation amount necessary for executing the extreme value control to changes in the evaluation function, based on the estimated value of the gradient obtained by the gradient estimator. to correct.

極値制御の基本的な概念を説明する図。The figure explaining the basic concept of extreme value control. 極値制御を実現する極値制御システム９の基本的な構成例を示すブロック線図。1 is a block diagram showing a basic configuration example of an extreme value control system 9 that implements extreme value control; FIG. 第１の実施形態における最適制御装置２の機能構成の具体例を示すブロック図。2 is a block diagram showing a specific example of the functional configuration of an optimum control device 2 according to the first embodiment; FIG. 第１の実施形態におけるｎ階微分値の推定方法の一具体例を示す図。FIG. 5 is a diagram showing a specific example of an n-order differential value estimation method according to the first embodiment; 第１の実施形態における制御パラメータの決定方法の一例を示す図。The figure which shows an example of the determination method of the control parameter in 1st Embodiment. 第１の実施形態の最適制御装置２によって実現される極値制御システム１の構成例を示すブロック線図。1 is a block diagram showing a configuration example of an extreme value control system 1 realized by an optimum control device 2 of the first embodiment; FIG. 第１の実施形態におけるプラントＰの一例として、生物学的排水処理プロセスを実現する水処理プラント３の具体例を示す図。The figure which shows the specific example of the water treatment plant 3 which implement|achieves a biological waste water treatment process as an example of the plant P in 1st Embodiment. 第１の実施形態における最適制御装置２が制御対象プロセスを極値制御によって制御する処理の流れを示すフローチャート。4 is a flow chart showing the flow of processing in which the optimum control device 2 according to the first embodiment controls the process to be controlled by extreme value control; 第２の実施形態における最適制御装置２ａの機能構成の具体例を示すブロック図。The block diagram which shows the specific example of the functional structure of the optimal control apparatus 2a in 2nd Embodiment. 第２の実施形態の最適制御装置２ａによって実現される極値制御システム１ａの構成例を示すブロック線図。The block diagram which shows the structural example of the extreme value control system 1a implement|achieved by the optimal control apparatus 2a of 2nd Embodiment. 第３の実施形態における最適制御装置２ｂの機能構成の具体例を示すブロック図。The block diagram which shows the specific example of the functional structure of the optimal control apparatus 2b in 3rd Embodiment. 第３の実施形態におけるｎ階微分値の推定方法の一例を示す図。The figure which shows an example of the estimation method of the nth order differential value in 3rd Embodiment. 第３の実施形態の最適制御装置２ｂによって実現される極値制御システム１ｂの構成例を示すブロック線図。The block diagram which shows the structural example of the extreme value control system 1b implement|achieved by the optimal control apparatus 2b of 3rd Embodiment. 第１～第３の実施形態の最適制御装置によって得られる効果の具体例を示す図。FIG. 5 is a diagram showing a specific example of effects obtained by the optimum control devices of the first to third embodiments; 変形例の最適制御装置２において表示情報によって表示される画面の具体例を示す図。The figure which shows the specific example of the screen displayed by the display information in the optimal control apparatus 2 of a modification.

以下、実施形態の最適制御装置、制御方法及びコンピュータプログラムを、図面を参照して説明する。 BEST MODE FOR CARRYING OUT THE INVENTION An optimum control device, a control method, and a computer program according to embodiments will be described below with reference to the drawings.

（概略）
図１は、極値制御の基本的な概念を説明する図である。極値制御は、評価量の変化に基づいて操作量を更新していくことで評価量を最適値に近づけていく制御手法である。評価量は、制御対象となるプロセス（以下「制御対象プロセス」という。）についての最適化の指標となる値である。評価量は、制御対象プロセスの制御量に基づいて決定される指標値であり、評価量と制御量との関係は所定の評価関数によって表される。この評価関数は、制御量に基づくものであれば任意の評価基準に基づいて設定されてよい。また評価量は制御量そのものであってもよい。一般に、極値制御において、制御対象プロセスの評価関数は操作量に対して未知の関数である。 (outline)
FIG. 1 is a diagram for explaining the basic concept of extreme value control. Extreme value control is a control method that brings the evaluation amount closer to the optimum value by updating the manipulated variable based on the change in the evaluation amount. The evaluation amount is a value that serves as an optimization index for a process to be controlled (hereinafter referred to as "controlled process"). The evaluation amount is an index value determined based on the control amount of the process to be controlled, and the relationship between the evaluation amount and the control amount is represented by a predetermined evaluation function. This evaluation function may be set based on any evaluation criteria as long as it is based on the control amount. Also, the evaluation amount may be the control amount itself. Generally, in extreme value control, the evaluation function of the controlled process is an unknown function with respect to the manipulated variable.

極値制御ではディザー信号と呼ばれる周期的な信号によって操作量を変化させる。通常このディザー信号は、正弦波で与えられることが多い。極値制御では、まずディザー信号によって操作量を継続的に振動させ、それによって生じる評価量の変化（増減）を観測する。そして、観測された評価量の変化に基づいて、評価量を評価関数の最適値（最大値又は最小値）に近づけるような操作量を算出し、算出された操作量で現在の操作量を更新する。極値制御は、このような評価量の観測及び操作量の更新を繰り返すことによって評価関数の最適値を探索していく制御方法である。 In extreme value control, the manipulated variable is changed by a periodic signal called a dither signal. Usually, this dither signal is often given as a sine wave. In extreme value control, first, the operation amount is continuously oscillated by a dither signal, and the resulting change (increase or decrease) in the evaluation amount is observed. Then, based on the observed change in the evaluation amount, calculate the operation amount that brings the evaluation amount closer to the optimum value (maximum value or minimum value) of the evaluation function, and update the current operation amount with the calculated operation amount. do. The extreme value control is a control method that searches for the optimum value of the evaluation function by repeating such observation of the evaluation amount and updating of the manipulated variable.

図１（Ａ）の評価関数曲線ＥＶは、操作量に対して未知の評価関数を表す。ここでは、説明の便宜のため、未知の評価関数を下に凸の二次関数として想定する。図１（Ｂ）は、このような評価関数を持つ制御対象プロセスに対してディザー信号で操作量を変化させたときに、ディザー信号の位相と逆位相の評価量が得られた場合を示す。この場合、操作量の増加に対して評価量が減少しているため、動作点が評価関数曲線ＥＶの極小点Ｐｍｉｎより左側で変化したことが分かる。一方、図１（Ｃ）は、図１（Ｂ）と同様のディザー信号に対して、ディザー信号の位相と同位相の評価量が得られた場合を示す。この場合、操作量の増加に対して評価量も増加しているため、動作点が極小点Ｐｍｉｎより右側で変化したことが分かる。 An evaluation function curve EV in FIG. 1A represents an unknown evaluation function with respect to the manipulated variable. Here, for convenience of explanation, the unknown evaluation function is assumed to be a downwardly convex quadratic function. FIG. 1(B) shows a case where an evaluation amount opposite in phase to the phase of the dither signal is obtained when the manipulated variable is changed with the dither signal for the process to be controlled having such an evaluation function. In this case, since the evaluation amount decreases as the operation amount increases, it can be seen that the operating point has changed to the left of the minimum point Pmin of the evaluation function curve EV. On the other hand, FIG. 1(C) shows a case where an evaluation amount having the same phase as the dither signal is obtained for a dither signal similar to that of FIG. 1(B). In this case, since the evaluation amount also increases with an increase in the manipulated variable, it can be seen that the operating point has changed to the right of the minimum point Pmin.

したがって、操作量を周期的に増減させた結果、評価量の増減が操作量の増減と同位相の動きをする場合には操作量を減少させ、逆位相の動きをする場合には操作量を増加させることによって、評価量を最適値に近づけることができる。従来、産業用プラントの制御方式として一般的に用いられてきたＰＩＤ制御（Proportional-Integral-Derivative Control）は、制御量が予め設定された目標値に追従するように操作量を制御する目標値追従型の制御方式であった。これに対して、極値制御は、評価量が最適化されるような操作量を探索する最適値探索型の制御方式であるため、ＰＩＤ制御のように操作量と制御量との関係を表すプロセスモデルを予め必要としない。そのため、極値制御は、目標値を予め設定できないような制御対象プロセスについても有効な制御方式であり、今後広く普及する可能性を秘めている。このような原理で極値制御を行う極値制御コントローラは比較的簡単な構成で実現することができる。 Therefore, as a result of periodically increasing/decreasing the manipulated variable, if the increase/decrease in the evaluation amount moves in the same phase as the increase/decrease in the manipulated variable, the manipulated variable is decreased. By increasing the evaluation amount, the evaluation amount can be brought closer to the optimum value. Conventionally, PID control (Proportional-Integral-Derivative Control), which has been generally used as a control method for industrial plants, is a target value tracking system that controls the manipulated variable so that the controlled variable follows a preset target value. It was a mold control method. On the other hand, since extreme value control is an optimum value search type control method that searches for an operation amount that optimizes the evaluation amount, it expresses the relationship between the operation amount and the control amount like PID control. No process model is required in advance. Therefore, extreme value control is an effective control method even for a controlled process for which a target value cannot be set in advance, and has the potential to become widely used in the future. An extreme value control controller that performs extreme value control based on such a principle can be realized with a relatively simple configuration.

図２は、極値制御を実現する極値制御システム９の基本的な構成例を示すブロック線図である。図２の極値制御システム９（極値制御部）は、ハイパスフィルタ１１（ＨＰＦ:High-Pass Filter）、ディザー信号出力部１２、ローパスフィルタ１３（ＬＰＦ：Low-Pass Filter）及び推定器１４を備える。このように極値制御システム９の構成は、従来のＰＩＤ制御コントローラと比較しても同程度の複雑さである。そのため、極値制御システム９は、ＰＩＤ制御コントローラと同様に、ＰＬＣ（Programmable Logic Controller）等のハードウェアを用いて容易に実装可能である。以下、図２の極値制御システム９の動作の概要について説明する。なお、ここでは、最適値として評価関数の極小値を探索する場合を例に説明する。 FIG. 2 is a block diagram showing a basic configuration example of an extreme value control system 9 that implements extreme value control. The extremum control system 9 (extremum control unit) in FIG. Prepare. Thus, the configuration of the extreme value control system 9 is as complicated as a conventional PID controller. Therefore, the extreme value control system 9 can be easily implemented using hardware such as a PLC (Programmable Logic Controller), like the PID controller. An outline of the operation of the extreme value control system 9 of FIG. 2 will be described below. Here, an example of searching for the minimum value of the evaluation function as the optimum value will be described.

極値制御システム９は、周期的な変化を持つディザー信号Ｍ（ＭはModulationを意味する）を作用させることによって、制御対象プロセスＴＰの操作量を強制的に変化させる。以下、この操作をモジュレーション（Modulation：変調）と呼ぶ。このモジュレーションにより、制御対象プロセスＴＰの操作量が周期的に変化し、操作量の変化に応じて制御量が変化する。制御対象プロセスＴＰは、制御量に基づいて評価量を取得し、取得した評価量を極値制御システム９にフィードバックする。 The extremum control system 9 forcibly changes the manipulated variable of the controlled process TP by applying a dither signal M (M means Modulation) having periodic changes. This operation is hereinafter referred to as modulation. Due to this modulation, the manipulated variable of the controlled process TP changes periodically, and the controlled variable changes according to the change in the manipulated variable. The controlled process TP acquires an evaluation amount based on the control amount, and feeds back the acquired evaluation amount to the extreme value control system 9 .

なお、制御量に基づいて評価量を取得する機能（以下「評価量取得機能」という。）は、必ずしも制御対象プロセスＴＰに含まれる必要はない。例えば、評価量取得機能は極値制御システム９に含まれてもよいし、制御対象プロセスＴＰと極値制御システム９との間に評価量取得機能を有する他の装置が介在してもよい。 It should be noted that the function of acquiring the evaluation amount based on the control amount (hereinafter referred to as the "evaluation amount acquisition function") does not necessarily have to be included in the controlled process TP. For example, the evaluation value acquisition function may be included in the extreme value control system 9, or another device having the evaluation value acquisition function may be interposed between the controlled process TP and the extreme value control system 9.

通常、操作量の変化に対する評価量の変化はある程度の時間遅れを伴って現れる。上述したように、極値制御は操作量に対して未知の評価関数の極値を探索する制御方法である。そのため、制御対象プロセスＴＰの評価関数は極小値を持つことが前提であるが、その値は操作量に対して未知である。 Normally, the change in the evaluation amount with respect to the change in the manipulated variable appears with some time delay. As described above, extreme value control is a control method for searching for extreme values of unknown evaluation functions with respect to manipulated variables. Therefore, it is assumed that the evaluation function of the controlled process TP has a minimum value, but the value is unknown with respect to the manipulated variable.

ハイパスフィルタ１１は、フィードバックされた評価量から未知の極小値に応じた一定値のバイアスを除去する。この処理はすなわち、未知の極小値を常にゼロに調整するための処理であり、推定器１４が操作量に対して与える変化の方向（増加又は減少）を決定するために必要な前処理である。 A high-pass filter 11 removes a constant bias corresponding to an unknown minimum value from the feedback evaluation quantity. This processing is processing for always adjusting an unknown minimum value to zero, and is preprocessing necessary for determining the direction of change (increase or decrease) given by the estimator 14 to the manipulated variable. .

ディザー信号出力部１２は、このように調整された評価量に対してディザー信号Ｄ（ＤはDemodulationを意味する）を作用させる。これにより、操作量のモジュレーションに応じて変化した評価量からディザー信号Ｍと同じ周波数成分が抽出される。以下、この操作をデモジュレーション（Demodulation：復調）と呼ぶ。デモジュレーションの役割は次のとおりである。 The dither signal output unit 12 applies a dither signal D (D means demodulation) to the evaluation amount thus adjusted. As a result, the same frequency component as the dither signal M is extracted from the evaluation amount changed according to the modulation of the manipulated variable. This operation is hereinafter referred to as demodulation. The role of demodulation is as follows.

上述したとおり制御対象プロセスＴＰの操作量に対する評価関数は未知である。そのため、評価関数には非線形要素が含まれている場合がある。この場合、評価関数は下に凸（極大値探索の場合は上に凸）の非線形関数であると想定される。このような非線形要素に起因して、評価量にはディザー信号Ｍの周波数ωに応じた高調波成分や分調波成分が現れる可能性が高い。デモジュレーションは、このような高調波や分調波の影響を取り除くための処理である。このデモジュレーションによって、評価量に含まれる成分のうち、評価量を変化させたディザー信号Ｍと同じ周波数ωの成分が抽出される。 As described above, the evaluation function for the manipulated variable of the controlled process TP is unknown. Therefore, the evaluation function may contain nonlinear elements. In this case, the evaluation function is assumed to be a downwardly convex (or upwardly convex in case of local maximum search) nonlinear function. Due to such non-linear elements, there is a high possibility that harmonic components and subharmonic components corresponding to the frequency ω of the dither signal M appear in the evaluation quantity. Demodulation is processing for removing the effects of such harmonics and subharmonics. By this demodulation, among the components included in the evaluation amount, the component with the same frequency ω as that of the dither signal M whose evaluation amount is changed is extracted.

デモジュレーションされた評価量は、ローパスフィルタ１３に入力される。ローパスフィルタ１３によって、評価量から定常成分（低周波成分）が抽出される。定常成分は、ディザー信号Ｍを作用させたことによって評価量が増加方向に変化したのか、又は減少方向に変化したのかを表すと考えられる。 The demodulated evaluation quantity is input to the low-pass filter 13 . A low-pass filter 13 extracts a stationary component (low-frequency component) from the evaluation quantity. The stationary component is considered to represent whether the evaluation amount has changed in the increasing direction or the decreasing direction due to the application of the dither signal M. FIG.

推定器１４は、ローパスフィルタ１３によって抽出された定常成分を積分する積分器である。推定器１４は、定常成分の積分値に基づいて評価量を極小値に近づけるために動かすべき操作量の方向（以下「操作方向」という。）を推定する推定器として機能する。このような操作方向の推定方法は、適応制御系における操作方向の推定法として最も基本的な勾配法に基づくものである。推定器１４によって操作方向（以下「勾配」ともいう。）が決定されると、その勾配に応じて評価量を極小値に近づけるように操作量が調整される。このように調整された操作量は、再びディザー信号を印加されて制御対象プロセスＴＰに入力される。 The estimator 14 is an integrator that integrates stationary components extracted by the low-pass filter 13 . The estimator 14 functions as an estimator for estimating the direction of the manipulated variable (hereinafter referred to as "manipulated direction") to move the evaluated quantity closer to the minimum value based on the integral value of the stationary component. Such an operation direction estimation method is based on the most basic gradient method as an operation direction estimation method in an adaptive control system. When the estimator 14 determines the operation direction (hereinafter also referred to as "gradient"), the operation amount is adjusted according to the gradient so that the evaluation amount approaches the minimum value. The manipulated variable adjusted in this way is again applied with a dither signal and input to the controlled process TP.

なお、ここでは、極小値を探索する場合を想定して極値制御システム９の構成例を説明したが、極大値を探索する場合には、推定器１４が推定する勾配の符号を反転させればよい。 Here, an example of the configuration of the extreme value control system 9 has been described assuming a case of searching for a local minimum value. Just do it.

また、一般に積分器はローパス特性を有するため、推定器１４が十分なローパス特性を有する場合には、極値制御システム９はローパスフィルタ１３を備えなくてもよい。以下では、簡単のため、推定器１４は十分なローパス特性を有し、ローパスフィルタ１３の機能を包含するものとして説明する。 Also, since an integrator generally has a low-pass characteristic, the extremal value control system 9 need not include the low-pass filter 13 if the estimator 14 has a sufficiently low-pass characteristic. For simplicity, the estimator 14 will be described below as having sufficient low-pass characteristics and including the function of the low-pass filter 13 .

以下に説明する実施形態の最適制御装置は、上記の極値制御システム９を用いて構成され、制御対象プロセスを極値制御によって制御する装置として機能する。実施形態の最適制御装置は、操作量の入力に対して制御量を出力する任意のプロセスの制御に適用可能である。例えば、制御対象プロセスは、下水処理プロセスや燃焼プロセス、石油化学プロセスなどであってもよい。以下、生物学的排水処理プロセスを適宜例にとり実施形態の最適制御装置の詳細を説明する。 The optimum control device of the embodiment described below is configured using the above-described extreme value control system 9 and functions as a device that controls the controlled process by extreme value control. The optimum control device of the embodiment can be applied to control any process that outputs a controlled variable in response to an input of a manipulated variable. For example, the controlled process may be a sewage treatment process, a combustion process, a petrochemical process, or the like. Hereinafter, the details of the optimal control device of the embodiment will be described by appropriately taking the biological wastewater treatment process as an example.

（第１の実施形態）
図３は、第１の実施形態における最適制御装置２の機能構成の具体例を示すブロック図である。図３に示すプラントＰは制御対象プロセスを実現する手段の一例であり、例えば、生物学的排水処理プロセスを実現する水処理プラントである。プラントＰは、制御対象プロセスを実現するための各種機器を含み、最適制御装置２によって与えられる操作量に基づいて各種機器を動作させる。また、プラントＰは、操作量に対する制御対象プロセスの応答（すなわち制御量）を計測する各種の計測機器を含み、計測機器によって取得される計測データを制御対象プロセスの制御量を示す情報（以下「計測情報」という。）を最適制御装置２に出力する。最適制御装置２は、制御対象プロセスから取得される計測情報に基づいて、制御対象プロセスの評価量が最適値に近づくような操作方向で操作量を更新していく。このような動作は、最適制御装置２が以下のような構成を備えることによって実現される。 (First embodiment)
FIG. 3 is a block diagram showing a specific example of the functional configuration of the optimum control device 2 in the first embodiment. A plant P shown in FIG. 3 is an example of means for realizing a process to be controlled, and is, for example, a water treatment plant for realizing a biological wastewater treatment process. The plant P includes various equipment for realizing the controlled process, and operates the various equipment based on the manipulated variables given by the optimum control device 2 . In addition, the plant P includes various measuring instruments for measuring the response of the controlled process to the manipulated variable (that is, the controlled variable), and the measurement data acquired by the measuring instrument is used as information indicating the controlled variable of the controlled process (hereinafter referred to as " (referred to as "measurement information") to the optimum control device 2. Based on the measurement information acquired from the controlled process, the optimum control device 2 updates the manipulated variable in such an operating direction that the evaluation amount of the controlled process approaches the optimum value. Such operations are realized by the optimum control device 2 having the following configuration.

最適制御装置２は、バスで接続されたＣＰＵ（Central Processing Unit）やメモリや補助記憶装置などを備え、極値制御プログラムを実行する。最適制御装置２は、極値制御プログラムの実行によってディザー信号出力部２１、操作量出力部２２、計測情報取得部２３、評価量算出部２４、勾配推定部２５、パラメータ決定部２６及び極値制御部２７を備える装置として機能する。なお、最適制御装置２の各機能の全て又は一部は、ＡＳＩＣ（Application Specific Integrated Circuit）やＰＬＤ（Programmable Logic Device）やＦＰＧＡ（Field Programmable Gate Array）等のハードウェアを用いて実現されてもよい。制御プログラムは、コンピュータ読み取り可能な記録媒体に記録されてもよい。コンピュータ読み取り可能な記録媒体とは、例えばフレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ－ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置である。制御プログラムは、電気通信回線を介して送信されてもよい。 The optimum control device 2 includes a CPU (Central Processing Unit), a memory, an auxiliary storage device, etc. connected via a bus, and executes an extremum control program. Optimum control device 2 executes dither signal output unit 21, manipulated variable output unit 22, measurement information acquisition unit 23, evaluation amount calculation unit 24, gradient estimation unit 25, parameter determination unit 26, and extreme value control by executing the extreme value control program. It functions as a device having a portion 27 . All or part of each function of the optimum control device 2 may be realized using hardware such as an ASIC (Application Specific Integrated Circuit), a PLD (Programmable Logic Device), or an FPGA (Field Programmable Gate Array). . The control program may be recorded on a computer-readable recording medium. Computer-readable recording media include portable media such as flexible disks, magneto-optical disks, ROMs and CD-ROMs, and storage devices such as hard disks incorporated in computer systems. A control program may be transmitted via an electric communication line.

ディザー信号出力部２１は、ディザー信号を生成し、生成したディザー信号を極値制御部２７に出力する。具体的には、ディザー信号出力部２１は、操作量のモジュレーションのためにディザー信号Ｍを生成し、評価量のデモジュレーションのためにディザー信号Ｄを生成する。 The dither signal output section 21 generates a dither signal and outputs the generated dither signal to the extreme value control section 27 . Specifically, the dither signal output unit 21 generates a dither signal M for modulation of the manipulated variable and a dither signal D for demodulation of the evaluation amount.

操作量出力部２２及び計測情報取得部２３は、最適制御装置２とプラントＰとを通信可能に接続する通信インターフェースを含んで構成される。操作量出力部２２は、極値制御部２７から出力される操作量をプラントＰに送信する。また、計測情報取得部２３は、プラントＰから計測情報を取得し、取得した計測情報が示す制御量を評価量算出部２４に出力する。 The manipulated variable output unit 22 and the measurement information acquisition unit 23 are configured including a communication interface that connects the optimum control device 2 and the plant P so as to be able to communicate with each other. The manipulated variable output unit 22 transmits to the plant P the manipulated variable output from the extreme value control unit 27 . The measurement information acquisition unit 23 also acquires measurement information from the plant P, and outputs the control amount indicated by the acquired measurement information to the evaluation amount calculation unit 24 .

評価量算出部２４は、計測情報取得部２３から出力される制御量に基づいて極値制御に用いられる評価量を算出する。評価量算出部２４は、算出した評価量を勾配推定部２５及び極値制御部２７に出力する。 The evaluation amount calculator 24 calculates an evaluation amount used for extreme value control based on the control amount output from the measurement information acquisition unit 23 . The evaluation amount calculator 24 outputs the calculated evaluation amount to the gradient estimator 25 and the extreme value controller 27 .

勾配推定部２５は、評価量算出部２４から出力される評価量に基づいて、評価関数の勾配を推定する。具体的には、勾配推定部２５は、順次取得される評価量の変化に基づいて、操作量に対する１階からＮ階（Ｎは１以上の整数）までの勾配（すなわち微分値）を推定する。ここでは一例として１階微分値を推定する場合について説明する。ここで最適制御装置２の制御周期をＴとし、制御周期Ｔごとの制御が行われる時刻を制御時刻という。この場合、勾配推定部２５は、ある制御時刻ｔにおける評価量Ｊ（ｔ）と、その１制御周期前の制御時刻ｔ－Ｔにおける評価量Ｊ（ｔ－Ｔ）との差を、両時刻における操作量Ｕ（ｔ）及びＵ（ｔ－Ｔ）の差で除することによって、操作量に対する評価量の１階微分値の近似値を取得することができる。すなわち、評価量の１階微分値ｄＪ／ｄＵは、次の式（１）のように近似される。 The gradient estimator 25 estimates the gradient of the evaluation function based on the evaluation quantity output from the evaluation quantity calculator 24 . Specifically, the gradient estimating unit 25 estimates gradients (that is, differential values) from the first order to the Nth order (where N is an integer equal to or greater than 1) with respect to the manipulated variable, based on changes in the evaluation amounts that are sequentially acquired. . Here, as an example, a case of estimating a first-order differential value will be described. Here, the control cycle of the optimum control device 2 is T, and the time at which control is performed for each control cycle T is called control time. In this case, the gradient estimating unit 25 calculates the difference between the evaluation amount J(t) at a certain control time t and the evaluation amount J(tT) at the control time tT one control cycle earlier, at both times. By dividing by the difference between the manipulated variables U(t) and U(tT), it is possible to obtain an approximate value of the first derivative of the evaluated quantity with respect to the manipulated variable. That is, the first order differential value dJ/dU of the evaluation quantity is approximated by the following equation (1).

式（１）は、評価量の微分値を取得する方法の最も簡単な例を示したものであるが、実際には、このような方法で取得される１階微分値は評価関数や操作量の計測値又は算出値によるノイズの影響を受けやすい。また、２階以上の高階微分値を取得する場合にはノイズの影響が大きくなり、実質的に勾配を推定することができなくなる可能性が高い。このような問題に関して、以下に示す各文献には、ディザー信号が、通常、正弦波として与えられることに着目し、より精度良く勾配を推定する方法が提案されている。 Formula (1) shows the simplest example of a method of obtaining the differential value of the evaluation quantity. It is susceptible to noise due to measured or calculated values of In addition, when obtaining a high-order differential value of the second order or higher, the influence of noise increases, and there is a high possibility that the gradient cannot be estimated substantially. Regarding such a problem, the following documents propose a method of estimating the gradient with higher accuracy, focusing on the fact that the dither signal is usually given as a sine wave.

非特許文献１には、フィルタを用いた勾配推定法が記載されており、非特許文献２には、オブザーバの考え方を用いた勾配推定法が記載されている。本実施形態において、勾配推定部２５は、このような従来技術に基づいて評価関数の勾配を推定することが望ましい。ここで、非特許文献１に記載された勾配推定法の基本的な考え方を説明する。 Non-Patent Document 1 describes a gradient estimation method using a filter, and Non-Patent Document 2 describes a gradient estimation method using the observer concept. In this embodiment, the gradient estimator 25 preferably estimates the gradient of the evaluation function based on such conventional technology. Here, the basic idea of the gradient estimation method described in Non-Patent Document 1 will be described.

一般に、操作量には高調波成分や分調波成分が含まれる場合があるが、ディザー信号が正弦波で与えられる場合、操作量は概ねディザー信号と同じ周波数で正弦波状に変化する。そこで、操作量ＵがＵ（ｔ）＝Ｕ_０＋ａ×sinωtという正弦波状に変化すると仮定し、それによって得られる評価量が次の式（２）に示す評価関数Ｊで表されると仮定する。 In general, the manipulated variable may include harmonic components and subharmonic components, but when the dither signal is given as a sine wave, the manipulated variable changes sinusoidally at approximately the same frequency as the dither signal. Therefore, it is assumed that the manipulated variable U changes sinusoidally as U(t)=U ₀ +a×sinωt, and that the evaluation value obtained by this change is represented by the evaluation function J shown in the following equation (2). .

ここで、fは未知の関数である。実際には、ｆにはプラントのダイナミクスが含まれるため、正確には、ｆは動的システムの作用素（オペレータ）とみなされるべきである。ただし、ディザー信号の周波数ωがプラントのダイナミクスに対して十分に緩やかな変化をもたらす場合には、ｆを近似的に関数とみなすことができる。このような前提の下、ここではfを関数とみなす。この式（２）をテーラー展開することにより次の式（３）が得られる。 where f is an unknown function. In practice, f contains the dynamics of the plant, so rather f should be considered the operator of the dynamic system. However, if the frequency ω of the dither signal provides a sufficiently gradual change in the dynamics of the plant, then f can be approximated as a function. Under this premise, f is regarded as a function here. The following equation (3) is obtained by Taylor-expanding this equation (2).

ここで、Ｄ^ｋｆ（ｋは１以上の整数）は、関数fのＵに関するｋ階微分を意味する。この式（３）にｓｉｎ^ｎωｔ（ｎは１以上の整数）を掛けることにより次の式（４）が得られる。 Here, D ^k f (where k is an integer equal to or greater than 1) means the k-th differential of the function f with respect to U. The following equation (4) is obtained by multiplying this equation (3) by sin ⁿ ωt (where n is an integer of 1 or more).

ここで、式（４）に周期平均処理を行うと次の式（５）が得られる。 Here, the following equation (5) is obtained by performing the periodic averaging process on the equation (4).

ここで、Ａ_０は次の式（６）で定義される。 Here, A ₀ is defined by the following equation (6).

ディザー信号の振幅ａと冪数ｎは定数であることに着目し、ｎ階微分Ｄ^ｎｆの値が１制御周期で大きく変化しないと仮定しており、μ_ｎは次の式（７）で定義される。 Focusing on the fact that the amplitude a and the exponent n of the dither signal are constants, it is assumed that the value of the ⁿ -th order differential Dnf does not change significantly in one control cycle, and µn is _given by the following equation (7): Defined.

続いて、次の式（８）及び（９）を定義し、式（５）の関係を用いると、０からｎ階までの微分Ｄ^０ｆ～Ｄ^ｎｆは次の式（１０）のように表される。 Subsequently, by defining the following equations (8) and (9) and using the relationship of the equation (5), the differentials D ⁰ f to D ⁿ f from 0 to nth order are as shown in the following equation (10) is represented by

ここで、Ａ_ｎは次の式（１１）で定義される。 Here, _An is defined by the following equation (11).

したがって、式（１０）を用いることで任意の次数のｎ階微分値（或いは第０階～第ｎ階微分値）を推定することができる。さらに非特許文献１には、このような基本的な考え方に沿って、若干の修正を加えたｎ階微分値の推定方法が記載されている。 Therefore, by using equation (10), it is possible to estimate the n-th order differential value (or the 0th to n-th order differential values) of any order. Furthermore, Non-Patent Document 1 describes a method of estimating the n-th order differential value with some modifications along with such a basic idea.

図４は、第１の実施形態におけるｎ階微分値の推定方法の一具体例を示す図である。図４においてＧ（ｔ）は次の式（１２）で定義される。 FIG. 4 is a diagram showing a specific example of a method of estimating the n-order differential value in the first embodiment. In FIG. 4, G(t) is defined by the following equation (12).

ここで、Ｘ（ｔ）は、図４のｘ_１（ｔ）～ｘ_ｎ（ｔ）を並べたベクトル信号である。すなわち式（１２）は、式（５）及び（８）で定義した信号を、図４に示したＸ（ｔ）で近似（代用）したものと考えることができる。 Here, X(t) is a vector signal in which x ₁ (t) to x _n (t) in FIG. 4 are arranged. That is, equation (12) can be considered as approximating (substituting) the signals defined by equations (5) and (8) with X(t) shown in FIG.

このように、フィルタを用いた勾配推定器Ｇ（ｔ）で評価関数Ｊの勾配を推定することができる。本実施形態の最適制御装置２は、このような方法を用いて推定された勾配の推定値に基づいて極値制御の制御パラメータを決定する。勾配推定部２５は、このようにして取得した勾配推定値をパラメータ決定部２６に出力する。 Thus, the gradient of the evaluation function J can be estimated by the gradient estimator G(t) using a filter. The optimum control device 2 of the present embodiment determines control parameters for extreme value control based on the estimated value of the gradient estimated using such a method. The gradient estimating section 25 outputs the gradient estimated value obtained in this manner to the parameter determining section 26 .

図３の説明に戻る。パラメータ決定部２６は、勾配推定部２５によって取得された評価関数の勾配推定値に基づいて極値制御の制御パラメータを決定する。具体的には、パラメータ決定部２６は、ローパスフィルタの周波数、ハイパスフィルタの周波数、ディザー信号の周波数、ディザー信号の振幅及び積分ゲインの５つの制御パラメータを決定する。 Returning to the description of FIG. The parameter determining unit 26 determines control parameters for extreme value control based on the gradient estimation value of the evaluation function acquired by the gradient estimating unit 25 . More specifically, the parameter determination unit 26 determines five control parameters: low-pass filter frequency, high-pass filter frequency, dither signal frequency, dither signal amplitude, and integral gain.

図５は、第１の実施形態における制御パラメータの決定方法の一例を示す図である。具体的には、図５は、特許文献１に記載された制御パラメータの調整則を示す。この調整則は、基本的には、制御対象プロセスの制御に極値制御を適用する前の設計段階において制御パラメータを決定する際に用いられることを想定したものである。すなわち、特許文献１は、この調整則に基づいて決定された制御パラメータを極値制御の適用後に変更することを想定したものではない。 FIG. 5 is a diagram illustrating an example of a control parameter determination method according to the first embodiment. Specifically, FIG. 5 shows the control parameter adjustment rule described in Patent Document 1. As shown in FIG. This adjustment rule is basically assumed to be used when determining control parameters in the design stage before applying extreme value control to the control of the controlled process. That is, Patent Document 1 does not assume that the control parameters determined based on this adjustment rule are changed after the application of the extreme value control.

本実施形態の最適制御装置２において、パラメータ決定部２６は、図５に示す５つのパラメータのうち積分ゲイン以外のパラメータについては、図５に示すＮｏ．１～Ｎｏ．４の各調整則に基づいて決定する。一方、積分ゲインについては、パラメータ決定部２６は、勾配推定部２５によって取得された勾配推定値に基づいて、制御対象プロセスの状態に応じて適応的に決定し、極値制御に反映させる。積分ゲインの決定方法については後述する。 In the optimum control device 2 of this embodiment, the parameter determination unit 26 selects No. 1 shown in FIG. 5 for parameters other than the integral gain among the five parameters shown in FIG. 1 to No. 4 based on each adjustment rule. On the other hand, the parameter determining unit 26 adaptively determines the integral gain according to the state of the controlled process based on the gradient estimation value obtained by the gradient estimating unit 25, and reflects it in the extreme value control. A method of determining the integral gain will be described later.

図３の説明に戻る。極値制御部２７は、パラメータ決定部２６によって決定された制御パラメータで、制御対象プロセスの極値制御を行う。具体的には、まず、極値制御部２７は、プラントＰに与える操作量にディザー信号を印加し、それによって変化する評価量を観測する。そして、極値制御部２７は、評価量の観測値に基づいて、評価量を最適値に近づけるように操作量を更新する。極値制御部２７が、ディザー信号の印加、評価量の観測及び操作量の更新を繰り返し実行することで、制御対象プロセスの評価量が最適値に近づけられる。 Returning to the description of FIG. The extreme value control unit 27 performs extreme value control of the controlled process using the control parameters determined by the parameter determination unit 26 . Specifically, first, the extremum control unit 27 applies a dither signal to the manipulated variable given to the plant P, and observes the evaluation quantity that changes accordingly. Based on the observed value of the evaluation amount, the extreme value control unit 27 updates the operation amount so that the evaluation amount approaches the optimum value. The extreme value control unit 27 repeatedly applies the dither signal, observes the evaluation amount, and updates the manipulated variable, thereby bringing the evaluation amount of the controlled process closer to the optimum value.

図６は、第１の実施形態の最適制御装置２によって実現される極値制御システム１の構成例を示すブロック線図である。極値制御システム１が図２に示した基本的な構成の極値制御システム９と異なる点は、勾配推定部２５によって取得された評価関数の勾配推定値が推定器１４の動作に適応的に作用する点である。具体的には、パラメータ決定部２６（図示せず）が勾配推定値に基づいて算出した積分ゲインＫＩが適応的に推定器１４に反映される。これにより、最適制御装置２は、制御対象プロセスのダイナミクスに適応して極値制御をより安定的に動作させることが可能となる。積分ゲインの決定方法の詳細は後述する。 FIG. 6 is a block diagram showing a configuration example of the extreme value control system 1 realized by the optimum control device 2 of the first embodiment. The extreme value control system 1 differs from the extreme value control system 9 having the basic configuration shown in FIG. This is the point at which it works. Specifically, the integral gain KI calculated by the parameter determination unit 26 (not shown) based on the gradient estimation value is adaptively reflected in the estimator 14 . As a result, the optimum control device 2 can adapt to the dynamics of the process to be controlled and operate the extreme value control more stably. The details of the method of determining the integral gain will be described later.

なお、図６に示す極値制御システム１は、図３に示した最適制御装置２のディザー信号出力部２１及び極値制御部２７として機能する。また、極値制御システム１は、操作量出力部２２、計測情報取得部２３及び評価量算出部２４を含むように構成されてもよい。 The extreme value control system 1 shown in FIG. 6 functions as the dither signal output section 21 and the extreme value control section 27 of the optimum control device 2 shown in FIG. Also, the extreme value control system 1 may be configured to include the manipulated variable output section 22 , the measurement information acquisition section 23 and the evaluation amount calculation section 24 .

図７は、第１の実施形態におけるプラントＰの一例として、生物学的排水処理プロセスを実現する水処理プラント３の具体例を示す図である。例えば、図７に示す水処理プラント３は、嫌気槽３１、無酸素槽３２、好気槽３３及び最終沈澱池３４の各設備を備える。嫌気槽３１は、微生物を活性化させるための設備である。無酸素槽３２は、窒素を除去するための設備である。好気槽３３は有機物の分解やリンの除去、アンモニアの硝化を行うための設備である。最終沈澱池３４は、活性汚泥を沈殿させるための設備である。 FIG. 7 is a diagram showing a specific example of a water treatment plant 3 that implements a biological wastewater treatment process, as an example of the plant P in the first embodiment. For example, the water treatment plant 3 shown in FIG. The anaerobic tank 31 is equipment for activating microorganisms. The anoxic tank 32 is equipment for removing nitrogen. The aerobic tank 33 is equipment for decomposing organic matter, removing phosphorus, and nitrifying ammonia. The final sedimentation tank 34 is equipment for precipitating the activated sludge.

水処理プラント３には、上記設備間で水や汚泥を搬送するポンプや、槽内に空気を供給するブロワ、空気中又は水中の物質の濃度を計測するセンサー等の設備が設置される。薬品投入ポンプ３１１は、微生物を活性化させる炭素源等の薬品を嫌気槽３１に投入するポンプである。循環ポンプ３３１は、好気槽３３と無酸素槽３２との間で循環する被処理水の循環量を制御するポンプである。ブロワ３３２は、好気槽３３に空気を供給して曝気量を制御する。返送汚泥ポンプ３４１は、最終沈澱池３４から無酸素槽３２に汚泥を返送するポンプである。余剰汚泥引き抜きポンプ３４２は、最終沈澱池３４から過剰な汚泥を引き抜くポンプである。センサー３１２及びセンサー３４３は、それぞれ、嫌気槽３１及び最終沈澱池３４における放流水の水質を計測する。 The water treatment plant 3 is equipped with equipment such as pumps for conveying water and sludge between the above equipment, blowers for supplying air to tanks, and sensors for measuring concentrations of substances in the air or water. The chemical feed pump 311 is a pump that feeds a chemical such as a carbon source for activating microorganisms into the anaerobic tank 31 . The circulation pump 331 is a pump that controls the amount of water to be treated that circulates between the aerobic tank 33 and the anoxic tank 32 . The blower 332 supplies air to the aerobic tank 33 to control the amount of aeration. The return sludge pump 341 is a pump that returns sludge from the final sedimentation basin 34 to the anoxic tank 32 . The excess sludge extraction pump 342 is a pump that extracts excess sludge from the final sedimentation tank 34 . A sensor 312 and a sensor 343 measure the water quality of the effluent in the anaerobic tank 31 and the final sedimentation tank 34, respectively.

一般に、このような生物学的廃水処理プロセスでは、操作量は返送汚泥の返送率であり、制御量は放流水に含まれる窒素及びリンの濃度（以下それぞれを「放流窒素濃度」及び「放流リン濃度」という。）である。返送率は、返送汚泥ポンプ３４１の放流量を流入量で割ることによって得られる。放流窒素濃度及び放流リン濃度は、センサー３１２及びセンサー３４３によって取得される。なお、制御量を、放流水に含まれる窒素及びリンの量（以下それぞれを「放流窒素量」及び「放流リン量」という。）としてもよい。この場合、放流窒素量及び放流リン量は、それぞれ放流窒素濃度及び放流リン濃度に放流量を乗算することにより得られる。 In general, in such a biological wastewater treatment process, the manipulated variable is the return rate of the returned sludge, and the controlled variable is the concentration of nitrogen and phosphorus contained in the effluent (hereinafter referred to as "effluent nitrogen concentration" and "effluent phosphorus concentration", respectively). concentration”). The return rate is obtained by dividing the discharge rate of the return sludge pump 341 by the inflow rate. The effluent nitrogen concentration and the effluent phosphorus concentration are obtained by sensors 312 and 343 . The amount of control may be the amount of nitrogen and phosphorus contained in the discharged water (hereinafter referred to as the "discharged nitrogen amount" and the "discharged phosphorus amount", respectively). In this case, the effluent nitrogen amount and the effluent phosphorus amount are obtained by multiplying the effluent nitrogen concentration and the effluent phosphorus concentration by the effluent amount, respectively.

評価量算出部２４には、水処理プラント３から出力される制御量に基づいて評価量を取得するための評価関数を予め設定しておく。ここでいう評価関数は、操作量に対する未知の評価関数を、制御量の関数として定義したものである。例えば、評価関数は、放流窒素濃度及び放流リン濃度と評価量との関係を表す関数である。この評価関数は、操作量（返送率）上限での制御量と、操作量下限での制御量との間で極値をとるように設定される必要がある。このように評価関数を設定する方法の一例として、評価量を排水賦課金の考え方に基づく水質コストと、返送汚泥ポンプ３４１の電力コストとの総和（以下「総コスト」という。）として表す方法が考えられる。返送汚泥ポンプ３４１の電力コストは、返送汚泥流量と返送汚泥ポンプ３４１の定格電力などから算出することができる。一般に、排水賦課金の考え方では、水質コストは以下の式で表される。 An evaluation function for obtaining an evaluation amount based on the control amount output from the water treatment plant 3 is preset in the evaluation amount calculation unit 24 . The evaluation function here is defined as an unknown evaluation function for the manipulated variable as a function of the controlled variable. For example, the evaluation function is a function representing the relationship between the effluent nitrogen concentration and the effluent phosphorus concentration and the evaluation amount. This evaluation function must be set so as to take an extreme value between the controlled variable at the upper limit of the manipulated variable (return rate) and the controlled variable at the lower limit of the manipulated variable. As an example of the method of setting the evaluation function in this way, there is a method of expressing the evaluation amount as the sum of the water quality cost based on the concept of the wastewater levy and the power cost of the return sludge pump 341 (hereinafter referred to as "total cost"). Conceivable. The power cost of the return sludge pump 341 can be calculated from the return sludge flow rate, the rated power of the return sludge pump 341, and the like. In general, in the concept of wastewater levy, the water quality cost is expressed by the following formula.

式（１３）においてＣＯＤは化学的酸素要求量、ＢＯＤは生物化学的酸素要求量、ＴＮは放流窒素、ＴＰは放流リンを意味する。各コストの換算係数は、実際の排水賦課金に基づいて決定されても良いし、他の方法によって決定されてもよい。一般に、ＣＯＤ、ＢＯＤ、ＴＮ及びＴＰのうち、返送率を変えることによって大きく変化するものはＴＮ及びＴＰであることが知られている。そのためここでは、水質コストを次の式（１４）で表す。 In formula (13), COD means chemical oxygen demand, BOD means biochemical oxygen demand, TN means effluent nitrogen, and TP means effluent phosphorus. The conversion factor for each cost may be determined based on the actual waste water charge, or may be determined by other methods. Generally, among COD, BOD, TN and TP, it is known that TN and TP are greatly changed by changing the return rate. Therefore, here, the water quality cost is represented by the following equation (14).

なお、一般に、返送率を上げると窒素の除去率が高まりＴＮに関する水質コストが減少し、逆に返送率を下げるとリンの除去率が高まりＴＰに関する水質コストが減少することが知られている。このような場合、水質コストのみに基づいて評価関数が設定されても良い。ただし、このようなトレードオフの関係を持たない水質同士のコストを指標とする場合には、評価量を、運転コスト（電力コスト）を加味した総コストとして表すことにより、評価関数が、操作量（返送率）上限での制御量と操作量下限での制御量との間で極値をとるように設定する。 It is generally known that increasing the return rate increases the nitrogen removal rate and reduces the water quality cost related to TN, and conversely, decreasing the return rate increases the phosphorus removal rate and reduces the water quality cost related to TP. In such a case, the evaluation function may be set based only on the water quality cost. However, when the cost of water quality that does not have such a trade-off relationship is used as an index, by expressing the evaluation amount as a total cost that takes into account the operation cost (electricity cost), the evaluation function can be expressed as the operation amount (Return rate) Set so as to take an extreme value between the controlled variable at the upper limit and the controlled variable at the manipulated variable lower limit.

また、評価関数には、このような総コストではなく、直接的に水質の評価を表す関数が設定されてもよい。例えば、評価量は、次の式（１５）のように算出されてもよい。 Moreover, instead of such a total cost, a function that directly expresses the evaluation of water quality may be set as the evaluation function. For example, the evaluation amount may be calculated as in Equation (15) below.

式（１５）において、ＴＮ_ｌｉｍ及びＴＰ_ｌｉｍは、放流水質の規制値や管理値に相当するスレッシホールドレベルを表すパラメータである。このような評価関数を用いた場合、スレッシホールドレベルを超えると評価量が急上昇する。そのため、評価量をスレッシホールドレベル以内に抑えるように極値制御が機能することが期待できる。 In Equation (15), TN _lim and TP _lim are parameters that represent threshold levels corresponding to regulation values and management values for effluent water quality. When such an evaluation function is used, the evaluation amount rises sharply when the threshold level is exceeded. Therefore, it can be expected that extremal value control functions so as to keep the evaluation quantity within the threshold level.

以上、図４に示したような水処理プラント３を例として、極値制御に必要となる評価関数の設定方法について説明したが、制御対象とするプラントＰによっては評価関数の設定を必要としない場合もある。そのような例として、風力発電プラントにおける風車のブレードの制御が挙げられる。風車のブレードの向きを風向に併せて動かすことにより発電量を最大化するような制御に極値制御を適用する場合、評価量は発電量であり、操作量は風車のブレードの回転角となる。この場合、制御量がそのまま評価量となるため評価関数の設定を必要としない。このような場合、評価量算出部２４が設けられなくてもよい。その一方で、評価量を取得することによって、極値制御の適用が可能となる場合もある。 The method of setting the evaluation function required for extreme value control has been described above using the water treatment plant 3 as shown in FIG. 4 as an example. In some cases. One such example is the control of wind turbine blades in wind power plants. When extreme value control is applied to control that maximizes power generation by moving the direction of the wind turbine blades in accordance with the wind direction, the evaluation amount is the power generation amount, and the operation amount is the rotation angle of the wind turbine blades. . In this case, it is not necessary to set an evaluation function because the control amount becomes the evaluation amount as it is. In such a case, the evaluation amount calculation unit 24 may not be provided. On the other hand, it may be possible to apply extremum control by obtaining an evaluation quantity.

［積分ゲインの決定方法］
以下、パラメータ決定部２６が勾配推定値に基づいて積分ゲインを決定する方法について説明する。特許文献１で提案されている上記の調整則は、非特許文献３に記載されたアベレージシステムに基づくものである。アベレージシステムとは、あるシステムに周期的な入力が加えられたときに、その周期平均（アベレージ）をとったシステムの動的な挙動を表すシステムであり、極値制御系の安定解析に用いられる。 [Determination method of integral gain]
The method by which the parameter determination unit 26 determines the integral gain based on the gradient estimate will be described below. The above adjustment rule proposed in Patent Document 1 is based on the average system described in Non-Patent Document 3. An average system is a system that expresses the dynamic behavior of a system that takes the periodic average (average) when a periodic input is applied to a system, and is used for stability analysis of extreme control systems. .

特に非特許文献３には、ダイナミクスを持たないスタティックなプラントを制御対象とする極値制御系のアベレージシステムについて具体的に記載されている。そのアベレージシステムは、次の式（１６）で表される。 In particular, Non-Patent Document 3 specifically describes an averaging system of an extreme value control system that controls a static plant without dynamics. The average system is represented by the following equation (16).

ただし、ＤＪは評価関数Ｊの入力の周期平均Ｕ－Ｕ_＊に関する勾配を表す。Ｕ_＊はＵの平衡点である。τはディザー信号の周波数ωでスケール変換された時間関数であり、次の式（１７）によって表される値である。 where DJ represents the slope of the input of the evaluation function J with respect to the periodic mean UU _* . U _* is the equilibrium point of U. τ is a scaled function of time at the frequency ω of the dither signal and is a value represented by the following equation (17).

また、ＫＩ_０は時間軸τ上での積分ゲインであり、実際の時間軸ｔ上での積分ゲインＫＩは次の式（１８）によって変換される。 KI ₀ is the integral gain on the time axis τ, and the actual integral gain KI on the time axis t is converted by the following equation (18).

なお、式（１６）におけるＰはディザー信号のパワーを表す。非特許文献３に記載されているように、ディザー信号として正弦波を用いる場合にはＰ＝１／２であり、三角波の場合はＰ＝１／３であり、矩形波の場合はＰ＝１である。式（１６）が示すアベレージシステムは、ディザー信号で操作量を周期的に振動させながら、評価量を最小値（極小値）に収束させていくとき、評価量が周期的に振動しながらどのような速さで最小値（極小値）収束していくかという極値制御の収束のダイナミクスを表現したものである。 Note that P in Equation (16) represents the power of the dither signal. As described in Non-Patent Document 3, when using a sine wave as a dither signal, P=1/2, when using a triangular wave, P=1/3, and when using a square wave, P=1. is. In the average system shown by equation (16), when the evaluation amount converges to the minimum value (minimum value) while the operation amount is periodically oscillated by the dither signal, how does the evaluation amount oscillate periodically? It expresses the dynamics of convergence of extremum control that the minimum value (minimum value) converges at a high speed.

非特許文献３では、プラントがスタティックである場合を仮定し、ディザー信号の周期がプラントの時定数よりも十分に長く設定されている。これはすなわち、ディザー信号の周波数ωがプラントのカットオフ周波数２π／ωよりも十分に小さく設定されている場合である。このような場合には、プラントがダイナミクスを持っている場合であっても、これを近似的にスタティックであるとみなせる。このことは、極値制御の安定解析に用いられる特異摂動論によって裏付けられる。したがって、ここでは、ディザー信号の周波数ωが適切に設定されているという想定の下で式（１６）のアベレージシステムを用いて積分ゲインを決定する方法を示す。 In Non-Patent Document 3, it is assumed that the plant is static, and the period of the dither signal is set sufficiently longer than the time constant of the plant. This is the case when the frequency ω of the dither signal is set sufficiently lower than the cutoff frequency 2π/ω of the plant. In such cases, even though the plant has dynamics, it can be considered to be approximately static. This is supported by the singular perturbation theory used in the stability analysis of extremal control. Therefore, here we show how to determine the integral gain using the average system of equation (16) under the assumption that the frequency ω of the dither signal is set appropriately.

式（１６）は、ディザー信号の周波数でスケール変換された時間軸τ=ωtでの極値制御系の挙動を示すため、式（１６）の時定数は、極値制御が極値に収束するまでの時間軸τでの時定数に対応すると考えられる。したがって、式（１６）で表されるアベレージシステムの時定数Ｔ_ａｖｅが、ディザー信号の周期Ｔ＝２π／ωより十分長くなるようにパラメータω、ａ、ＫＩ_０を決定すれば、評価量はディザー信号による操作量の増減に応じて徐々に最小値（極小値）に収束していくと期待される。 Equation (16) shows the behavior of the extremal control system on the time axis τ=ωt scaled by the frequency of the dither signal. It is thought that it corresponds to the time constant on the time axis τ until . Therefore, if the parameters ω, a, and _KI0 are determined so that the time constant T _ave of the average system represented by Equation (16) is sufficiently longer than the dither signal period T=2π/ω, the evaluation amount is dither It is expected to gradually converge to the minimum value (minimum value) according to the increase/decrease of the manipulated variable by the signal.

ここで、ディザー信号の周波数ωと振幅ａは図５の調整則に基づいて決定されるため、アベレージシステムの時定数がディザー信号の周期Ｔ＝２π／ωより十分に長くなるようにするためにはＫＩ_０を調整することになる。しかしながら、式（１６）は一般的には非線形の微分方程式となるため、時定数という概念を直接的に定義することができない。そこで、特許文献１では、評価関数Ｊが二次関数であるとの想定の下で時定数を定義して制御パラメータを決定する調整則が提案されている。例えば、評価関数Ｊ（ｔ）が次の式（１９）で表される場合を想定する。 Here, since the frequency ω and the amplitude a of the dither signal are determined based on the adjustment rule of FIG. will adjust KI ₀ . However, since Equation (16) is generally a nonlinear differential equation, the concept of time constant cannot be defined directly. Therefore, Patent Document 1 proposes an adjustment rule for determining a control parameter by defining a time constant under the assumption that the evaluation function J is a quadratic function. For example, assume that the evaluation function J(t) is represented by the following equation (19).

この場合、ＤＪ（Ｕ＋Ｕ_＊）＝Ｇ×Ｕ（ｔ）となるため、式（１４）は次の式（２０）のように表される。 In this case, DJ(U+U _* )=G×U(t), so equation (14) is expressed as the following equation (20).

式（２０）の時定数Ｔ_ａｖｅは、１／（ＫＩ_０×ａ×Ｐ×Ｇ）となる。このＴ_ａｖｅは時間軸τ上での時定数であり、τ＝１は１／ωに相当する時間である。このことから、時定数Ｔ_ａｖｅに相当する時間をディザー信号の周期２π／ωの何倍にするかが決まればＫＩ_０の値を決定することができる。ここで、アベレージシステムの時定数がディザー信号の周期より十分に長くなるように調整される必要があることから、例えば、時定数に相当する時間を、ディザー信号の周期のｋ３（＝５～１０）倍程度とする。この場合、ｋ３×２π＝１／（ＫＩ_０×ａ×Ｐ×Ｇ）が成立するため、ＫＩ_０は次の式（２１）のように決定される。 The time constant T _ave of Equation (20) is 1/(KI ₀ ×a×P×G). This _Tave is a time constant on the time axis τ, and τ=1 is the time corresponding to 1/ω. Therefore, the value of KI ₀ can be determined by determining how many times the period 2π/ω of the dither signal should be set for the time corresponding to the time constant T _ave . Here, since the time constant of the average system needs to be adjusted to be sufficiently longer than the period of the dither signal, for example, the time corresponding to the time constant is set to k3 (=5 to 10) of the period of the dither signal. ). In this case, k3×2π=1/(KI ₀ ×a×P×G) holds, so KI ₀ is determined by the following equation (21).

特許文献１には、以上のような積分ゲインの調整則が提案されているが、この調整則は上述したとおり評価関数Ｊ（ｔ）が二次関数であるとの想定に基づくものである。しかしながら、現実の問題がこのような想定の範囲内にあることはほとんど期待できない。これに対して、特許文献１には評価関数を二次関数で近似できるように変換する方法も提案されているが、このような変換を行うためには予め操作量と評価量との関係性をある程度明らかにしておく必要がある。そして、このような関係性の取得には、制御対象プロセスについていくつかの動作点の観測が必要となり、多大なエンジニアリングコストを要する。 Patent Literature 1 proposes an adjustment rule for the integral gain as described above, but this adjustment rule is based on the assumption that the evaluation function J(t) is a quadratic function as described above. However, one can hardly expect real-life problems to lie within such assumptions. On the other hand, Patent Document 1 proposes a method of converting the evaluation function so that it can be approximated by a quadratic function. should be clarified to some extent. Acquisition of such a relationship requires observation of several operating points of the controlled process, which requires a large engineering cost.

このような課題に対して、本実施形態では式（１９）及び（２１）におけるＧが評価関数の二階微分値であることに着目し、パラメータ決定部２６が勾配推定部２５によって取得される二階微分の推定値を式（２１）に適用することで積分ゲインを決定する。なお、評価関数Ｊ（ｔ）が二次関数である場合にはＧは定数となるが、現実の問題のほとんどにおいて評価関数Ｊ（ｔ）は二次関数でない。このように評価関数Ｊ（ｔ）が二次関数でない場合、Ｇは時間とともに変化する関数となる。そのため、この場合、積分ゲインは次の式（２２）のように表される。 To address this problem, the present embodiment focuses on the fact that G in equations (19) and (21) is the second-order differential value of the evaluation function, and the parameter determination unit 26 uses the second-order Applying the derivative estimate to equation (21) determines the integral gain. Note that G is a constant when the evaluation function J(t) is a quadratic function, but in most real problems the evaluation function J(t) is not a quadratic function. When the evaluation function J(t) is not a quadratic function in this way, G is a function that changes with time. Therefore, in this case, the integral gain is represented by the following equation (22).

式（１９）において、Ｇ（ｔ）は勾配推定部２５によって推定された評価関数の操作量に対する二階微分値である。また、式（１９）は、積分ゲインが時刻ｔにおける二階微分値Ｇ（ｔ）によって時間ｔの関数となることを表している。すなわち、パラメータ決定部２６は、順次取得される勾配推定値を式（２２）に適用することにより、経時的に変化する制御対象プロセスのダイナミクスに適応して積分ゲインを更新することができる。 In Equation (19), G(t) is the second order differential value with respect to the manipulated variable of the evaluation function estimated by the gradient estimator 25 . Equation (19) expresses that the integral gain becomes a function of time t by the second-order differential value G(t) at time t. That is, the parameter determining unit 26 can update the integral gain by applying the sequentially acquired gradient estimation values to Equation (22), adapting to the dynamics of the controlled process that changes over time.

図８は、第１の実施形態における最適制御装置２が制御対象プロセスを極値制御によって制御する処理の流れを示すフローチャートである。なお、プラントＰの制御対象プロセスは、フローチャートの開始時点においてＰＩＤ制御等の極値制御以外の制御方法で制御されているものとする。まず、計測情報取得部２３は、プラントＰから計測情報を取得する（ステップＳ１０１）。計測情報取得部２３は、計測情報が示す制御量を評価量算出部２４に出力する。 FIG. 8 is a flow chart showing the flow of processing in which the optimum control device 2 according to the first embodiment controls the process to be controlled by extreme value control. It is assumed that the controlled process of the plant P is controlled by a control method other than extreme value control such as PID control at the start of the flowchart. First, the measurement information acquisition unit 23 acquires measurement information from the plant P (step S101). The measurement information acquisition unit 23 outputs the control amount indicated by the measurement information to the evaluation amount calculation unit 24 .

評価量算出部２４は、計測情報取得部２３から出力された制御量に基づいて、制御対象プロセスのその時点における評価量を算出する（ステップＳ１０２）。評価量算出部２４は、算出した評価量を勾配推定部２５及び極値制御部２７に出力する。 The evaluation amount calculation unit 24 calculates the evaluation amount of the controlled process at that time based on the control amount output from the measurement information acquisition unit 23 (step S102). The evaluation amount calculator 24 outputs the calculated evaluation amount to the gradient estimator 25 and the extreme value controller 27 .

勾配推定部２５は、評価量算出部２４から出力された評価量に基づいて評価関数の勾配を推定する（ステップＳ１０３）。勾配推定部２５は、取得した勾配推定値をパラメータ決定部２６に出力する。 The gradient estimation unit 25 estimates the gradient of the evaluation function based on the evaluation amount output from the evaluation amount calculation unit 24 (step S103). The gradient estimator 25 outputs the obtained gradient estimate to the parameter determiner 26 .

パラメータ決定部２６は、勾配推定部２５から出力された勾配推定値と、予め定められた制御パラメータの調整則とに基づいて制御パラメータを決定する（ステップＳ１０４）。具体的には、パラメータ決定部２６は、図５のＮｏ．１～Ｎｏ．４に示す調整則に基づいてハイパスフィルタ１１の周波数ω_１、ディザー信号出力部１２が出力するディザー信号の周波数ω及び振幅ａ、及びローパスフィルタ１３の周波数ω_２を決定する。なお、図５に示す調整則でこれらの制御パラメータを決定する際に必要な情報（例えば、制御対象プロセスの時定数やむだ時間など）は、計測情報に基づいて取得されてもよいし、予め最適制御装置２に記憶されていてもよい。 The parameter determining unit 26 determines the control parameters based on the slope estimation value output from the slope estimating unit 25 and a predetermined control parameter adjustment rule (step S104). Specifically, the parameter determining unit 26 uses No. 1 in FIG. 1 to No. 4, the frequency ω ₁ of the high-pass filter 11, the frequency ω and amplitude a of the dither signal output from the dither signal output unit 12, and the frequency ω ₂ of the low-pass filter 13 are determined. Information necessary for determining these control parameters according to the adjustment rule shown in FIG. It may be stored in the optimum control device 2 .

一方で、パラメータ決定部２６は、勾配推定部２５から出力された勾配推定値を式（２２）に適用して積分ゲインＫＩ_０を決定する。パラメータ決定部２６は、このように決定した制御パラメータの値を極値制御部２７に出力する。 On the other hand, the parameter determination unit 26 determines the integral gain KI ₀ by applying the slope estimation value output from the slope estimation unit 25 to Equation (22). The parameter determination unit 26 outputs the control parameter values thus determined to the extreme value control unit 27 .

続いて、極値制御部２７が、パラメータ決定部２６によって決定された各制御パラメータを用いて、制御対象プロセスの極値制御を開始する（ステップＳ１０５）。ここでは、パラメータ決定部２６によって制御パラメータが決定された後、所定のタイミングで、制御対象プロセスの制御方法が極値制御に切り替えられるものとする。このタイミングは予め定められたタイミングであってもよいし、ユーザの操作による任意のタイミングであってもよい。 Subsequently, the extreme value control unit 27 starts extreme value control of the controlled process using each control parameter determined by the parameter determination unit 26 (step S105). Here, it is assumed that the control method for the controlled process is switched to extreme value control at a predetermined timing after the control parameters are determined by the parameter determining unit 26 . This timing may be a predetermined timing or an arbitrary timing by user's operation.

極値制御部２７が極値制御を開始した後、最適制御装置２は、ステップＳ１０１、Ｓ１０２及びＳ１０３と同様の処理を繰り返し実行する（ステップＳ１０６、Ｓ１０７及びＳ１０８）とともに、ステップＳ１０４と同様の方法で積分ゲインの値を取得する（ステップＳ１０９）。そして、パラメータ決定部２６は、取得した積分ゲインの値で現在の積分ゲインの値を更新する（ステップＳ１１０）。 After the extreme value control unit 27 starts extreme value control, the optimum control device 2 repeatedly executes the same processes as steps S101, S102 and S103 (steps S106, S107 and S108), and performs the same method as step S104. to acquire the value of the integral gain (step S109). Then, the parameter determining unit 26 updates the current integral gain value with the acquired integral gain value (step S110).

このように構成された第１の実施形態の最適制御装置２は、取得される計測情報に基づいて評価関数の勾配を推定するとともに、取得した勾配推定値に基づいて積分ゲインを適応的に決定する機能を有する。このような最適制御装置２によれば、極値制御の安定性に大きく関係する積分ゲインを制御対象プロセスの状態に応じて適応的に更新することができるため、制御対象プロセスのダイナミクスに適応して極値制御をより安定的に動作させることが可能となる。 The optimal control device 2 of the first embodiment configured as described above estimates the gradient of the evaluation function based on the obtained measurement information, and adaptively determines the integral gain based on the obtained estimated gradient value. It has the function to According to such an optimum control device 2, since the integral gain, which is greatly related to the stability of the extreme value control, can be adaptively updated according to the state of the controlled process, it can adapt to the dynamics of the controlled process. Therefore, it becomes possible to operate the extreme value control more stably.

このように制御パラメータを調整することができる最適制御装置２によれば、例えば図７の水処理プラントにおける汚泥の返送量を操作量として、水処理プロセスのダイナミクスに適応しながら総コストを最小化するような極値制御を実現することが可能となる。 According to the optimum control device 2 that can adjust the control parameters in this way, for example, the total cost is minimized while adapting to the dynamics of the water treatment process, using the amount of sludge returned in the water treatment plant of FIG. 7 as the operation amount. It is possible to realize such extreme value control.

（第２の実施形態）
図９は、第２の実施形態における最適制御装置２ａの機能構成の具体例を示すブロック図である。最適制御装置２ａは、勾配推定部２５に代えて勾配推定部２５ａを備える点、パラメータ決定部２６に代えてパラメータ決定部２６ａを備える点、操作量変換部２８をさらに備える点で第１の実施形態における最適制御装置２と異なる。最適制御装置２ａのその他の構成は第１の実施形態における最適制御装置２と同様である。そのため、ここでは、それらの同様の構成には図３と同じ符号を付すことにより説明を省略する。 (Second embodiment)
FIG. 9 is a block diagram showing a specific example of the functional configuration of the optimum control device 2a in the second embodiment. The optimum control device 2a is different from the first embodiment in that it includes a gradient estimator 25a instead of the gradient estimator 25, a parameter determiner 26a instead of the parameter determiner 26, and a manipulated variable converter 28. It differs from the optimum control device 2 in form. Other configurations of the optimum control device 2a are the same as those of the optimum control device 2 in the first embodiment. Therefore, here, the same reference numerals as in FIG. 3 are assigned to those similar configurations, and the description thereof is omitted.

勾配推定部２５ａは、取得された評価量に基づいて評価関数の勾配を推定する点では第１の実施形態における勾配推定部２５と同様であるが、評価関数の勾配として二階微分値ではなく一階微分値を推定する点、取得した勾配推定値をパラメータ決定部２６ａではなく、操作量変換部２８に出力する点で勾配推定部２５と異なる。 The gradient estimating unit 25a is similar to the gradient estimating unit 25 in the first embodiment in that it estimates the gradient of the evaluation function based on the acquired evaluation amount, but the gradient of the evaluation function is not the second derivative value but the linear value. It differs from the gradient estimating section 25 in that it estimates the differential value and outputs the acquired gradient estimated value to the manipulated variable converting section 28 instead of the parameter determining section 26a.

パラメータ決定部２６ａは、ローパスフィルタの周波数、ハイパスフィルタの周波数、ディザー信号の周波数、ディザー信号の振幅及び積分ゲインの５つの制御パラメータを決定する点では第１の実施形態におけるパラメータ決定部２６と同様であるが、積分ゲインの決定に評価関数の勾配推定値を用いない点でパラメータ決定部２６と異なる。 The parameter determination unit 26a is the same as the parameter determination unit 26 in the first embodiment in that it determines five control parameters: low-pass filter frequency, high-pass filter frequency, dither signal frequency, dither signal amplitude, and integral gain. However, it is different from the parameter determining section 26 in that the slope estimation value of the evaluation function is not used to determine the integral gain.

操作量変換部２８は、勾配推定部２５ａから出力される評価関数の勾配推定値に基づいて極値制御部２７から出力される操作量を変換する。操作量変換部２８は、変換した操作量を操作量出力部２２に出力する。具体的には、操作量変換部２８は、以下のような方法で操作量を変換する。 The manipulated variable conversion unit 28 converts the manipulated variable outputted from the extreme value control unit 27 based on the gradient estimated value of the evaluation function outputted from the gradient estimating unit 25a. The manipulated variable conversion unit 28 outputs the converted manipulated variable to the manipulated variable output unit 22 . Specifically, the manipulated variable conversion unit 28 converts the manipulated variable by the following method.

まず、極値制御部２７から出力される操作量をＵとすると、Ｕを入力（操作量）とした場合の極値制御のアベレージシステムは上記の式（１６）で表される。以下、式（２３）として式（１６）を再掲する。 First, assuming that the manipulated variable output from the extreme value control unit 27 is U, the average system for the extreme value control when U is the input (manipulated variable) is expressed by the above equation (16). Equation (16) is re-cited below as Equation (23).

式（２３）において評価関数の勾配ＤＪ（Ｕ＋Ｕ_＊）は、一般的にはＵに関する非線形関数となるため、式（２３）の収束の速さを時定数の概念で表現することができなかった。そこで、第１の実施形態では、まず、評価関数Ｊ（Ｕ）が二次関数で表されると仮定した上で時定数の概念を定義し、定義した時定数が所望の値になるように積分ゲインＫＩ_０を決定した。しかしながら、実際には，評価関数Ｊ（Ｕ）は二次関数ではないため、その二階微分値が一定となるように積分ゲインＫＩ_０を適応的に調整した。 In equation (23), the gradient DJ (U+U _* ) of the evaluation function is generally a nonlinear function with respect to U, so the speed of convergence in equation (23) could not be expressed by the concept of time constant. . Therefore, in the first embodiment, first, the concept of the time constant is defined on the assumption that the evaluation function J(U) is represented by a quadratic function, and the defined time constant is set to a desired value. The integral gain KI ₀ was determined. However, since the evaluation function J(U) is not actually a quadratic function, the integral gain KI ₀ is adaptively adjusted so that the second derivative value is constant.

これに対して、本実施形態では、式（２３）によって表されるアベレージシステムが線形システムとなるように入力変数Ｕを変数変換することで時定数を定義する。すなわち、変換後の変数ｖに関するアベレージシステムが線形システムとなるように次の式（２４）に示す変数変換を行う。この変数変換により、変数ｖに関するアベレージシステムは次の式（２５）のように変換される。 On the other hand, in this embodiment, the time constant is defined by transforming the input variable U so that the average system expressed by Equation (23) becomes a linear system. That is, the variable conversion shown in the following equation (24) is performed so that the average system for the variable v after conversion becomes a linear system. By this variable transformation, the average system for the variable v is transformed as shown in the following equation (25).

ここで、式（２３）を式（２５）に変換する関数ｖ＝ｈ（Ｕ）は次の式（２６）に示す偏微分方程式を満たす必要がある。 Here, the function v=h(U) for converting equation (23) to equation (25) must satisfy the partial differential equation shown in equation (26) below.

この条件を満たす変換関数ｈは多数存在するが、評価関数の一階微分値ＤＪ（Ｕ）が既知であれば、どのような微分方程式であっても少なくとも近似的には解くことが可能である。例えば、式（２６）を次の式（２７）のように近似することができる。 There are many transformation functions h that satisfy this condition, but if the first derivative value DJ(U) of the evaluation function is known, any differential equation can be solved at least approximately. . For example, Equation (26) can be approximated by Equation (27) below.

このように近似した式（２７）を用いれば、例えば、初期値の操作量Ｕ_０に対して適当な変換ｖ_０＝ｈ（Ｕ_０）を与えることができれば、極値制御による操作量の変化に応じてｈ（Ｕ）を更新していくことができる。操作量変換部２８は、勾配推定部２５ａによって取得された勾配推定値（一階微分値）を式（２７）に適用することによって操作量を変換することができる。 Using equation (27) approximated in this way, for example, if an appropriate transformation v ₀ =h (U ₀ ) can be given to the initial value of the manipulated variable U ₀ , the change in the manipulated variable by extreme value control h(U) can be updated accordingly. The manipulated variable conversion unit 28 can convert the manipulated variable by applying the gradient estimated value (first order differential value) acquired by the gradient estimating unit 25a to Equation (27).

また、変数ｖの初期値ｖ_０については、ＤＪ（Ｕ）がＵの一次関数で近似できる場合には、次の式（２８）によって変換することで、近似的に式（２２）を成立させることができる。 Further, regarding the initial value v ₀ of the variable v, when DJ(U) can be approximated by a linear function of U, the following formula (28) is used to approximate the formula (22). be able to.

なお、式（２５）は、式（２０）が示すアベレージシステムにおいて評価関数の二階微分値ＧをＧ＝１としたものに相当する。そのため、積分ゲインは、式（２２）のＧをＧ＝１として算出されればよい。 Equation (25) corresponds to the average system represented by Equation (20) in which the second-order differential value G of the evaluation function is set to G=1. Therefore, the integral gain may be calculated by setting G in Equation (22) to G=1.

図１０は、第２の実施形態の最適制御装置２ａによって実現される極値制御システム１ａの構成例を示すブロック線図である。極値制御システム１ａが図２に示した基本的な構成の極値制御システム９と異なる点は、勾配推定部２５ａによって取得された評価関数の勾配推定値（一階微分値）（図中の（）^ｎ）が制御対象プロセスＴＰに与えられる操作量に適応的に作用する点である。具体的には、操作量変換部２８（図示せず）が勾配推定値に基づいて変換した操作量が制御対象プロセスＴＰに与えられる。これにより、最適制御装置２ａは、第１の実施形態の最適制御装置２と同様に、制御対象プロセスのダイナミクスに適応して極値制御をより安定的に動作させることが可能となる。 FIG. 10 is a block diagram showing a configuration example of an extreme value control system 1a realized by the optimum control device 2a of the second embodiment. The extreme value control system 1a differs from the extreme value control system 9 having the basic configuration shown in FIG. () ⁿ ) adaptively acts on the manipulated variable given to the controlled process TP. Specifically, the manipulated variable converted by the manipulated variable converter 28 (not shown) based on the estimated gradient value is given to the controlled process TP. As a result, the optimum controller 2a can adapt to the dynamics of the process to be controlled and operate the extreme value control more stably, like the optimum controller 2 of the first embodiment.

さらに、第２の実施形態では勾配推定値として評価関数の一階微分値を取得すればよいため、二階微分値を取得する第１の実施形態よりも、極値制御の処理を簡単にすることができる。具体的には、図４に示したようなフィルタを用いた勾配の推定において、フィルタの段数を少なくすることができるため、勾配推定部２５を第１の実施形態よりも簡単な回路構成で実現することができる。また、このことは、フィルタ後段のＧ（ｔ）をより小さい次元で実現できることを意味する。 Furthermore, in the second embodiment, it is sufficient to obtain the first derivative of the evaluation function as the gradient estimation value, so that the extreme value control processing can be made simpler than in the first embodiment in which the second derivative is obtained. can be done. Specifically, in estimating a gradient using a filter such as that shown in FIG. 4, the number of filter stages can be reduced. can do. This also means that G(t) after the filter can be realized in smaller dimensions.

また、その一方で、第２の実施形態では、操作量を変換する際の初期値を適切に設定するために多少の手間がかかる可能性がある。そのため、どちらの実施形態を用いるかは、適用する対象のプロセスの特性や制約事項等に応じて選択されるとよい。 On the other hand, in the second embodiment, it may take some time and effort to appropriately set the initial value when converting the manipulated variable. Therefore, which embodiment should be used should be selected according to the characteristics, restrictions, etc. of the target process to be applied.

（第３の実施形態）
図１１は、第３の実施形態における最適制御装置２ｂの機能構成の具体例を示すブロック図である。最適制御装置２ｂは、評価量算出部２４に代えて評価量算出部２４ｂを備える点、勾配推定部２５に代えて勾配推定部２５ｂを備える点、パラメータ決定部２６に代えてパラメータ決定部２６ｂを備える点、評価量変換部２９をさらに備える点で第１の実施形態における最適制御装置２と異なる。最適制御装置２ｂのその他の構成は第１の実施形態における最適制御装置２と同様である。そのため、ここでは、それらの同様の構成には図３と同じ符号を付すことにより説明を省略する。 (Third embodiment)
FIG. 11 is a block diagram showing a specific example of the functional configuration of the optimum control device 2b in the third embodiment. The optimum control device 2b includes an evaluation amount calculation unit 24b instead of the evaluation amount calculation unit 24, a gradient estimation unit 25b instead of the gradient estimation unit 25, and a parameter determination unit 26b instead of the parameter determination unit 26. It differs from the optimum control device 2 in the first embodiment in that it further includes an evaluation amount conversion unit 29 . Other configurations of the optimum control device 2b are the same as those of the optimum control device 2 in the first embodiment. Therefore, here, the same reference numerals as in FIG. 3 are assigned to those similar configurations, and the description thereof is omitted.

評価量算出部２４ｂは、計測情報取得部２３から出力される制御量に基づいて極値制御に用いられる評価量を算出する点では第１の実施形態における評価量算出部２４ｂと同様であるが、算出した評価量を勾配推定部２５ｂ及び評価量変換部２９に出力する点で評価量算出部２４と異なる。 The evaluation amount calculation unit 24b is similar to the evaluation amount calculation unit 24b in the first embodiment in that it calculates an evaluation amount used for extreme value control based on the control amount output from the measurement information acquisition unit 23. , differs from the evaluation amount calculation unit 24 in that the calculated evaluation amount is output to the gradient estimation unit 25 b and the evaluation amount conversion unit 29 .

勾配推定部２５ｂは、取得された評価量に基づいて評価関数の勾配を推定する点では第１の実施形態における勾配推定部２５と同様であるが、取得した勾配推定値をパラメータ決定部２６ｂではなく、評価量変換部２９に出力する点で勾配推定部２５と異なる。 The gradient estimating unit 25b is similar to the gradient estimating unit 25 in the first embodiment in that it estimates the gradient of the evaluation function based on the obtained evaluation amount. It is different from the gradient estimating unit 25 in that it is output to the evaluation amount conversion unit 29 .

評価量変換部２９は、勾配推定部２５ｂから出力される評価関数の勾配推定値に基づいて評価量算出部２４ｂから出力される評価量を変換する。評価量変換部２９は、変換した評価量を極値制御部２７に出力する。具体的には、評価量変換部２９は、以下のような方法で評価量を変換する。 The evaluation amount conversion unit 29 converts the evaluation amount output from the evaluation amount calculation unit 24b based on the gradient estimated value of the evaluation function output from the gradient estimation unit 25b. The evaluation amount conversion unit 29 outputs the converted evaluation amount to the extreme value control unit 27 . Specifically, the evaluation amount conversion unit 29 converts the evaluation amount by the following method.

図１２は、第３の実施形態におけるｎ階微分値の推定方法の一例を示す図である。まず、評価量変換部２９に対して、評価量を変換するための変換関数を予め決定しておく。ここでは、簡単のため、評価関数Ｊを冪乗変換することにより、変換後の評価関数Ｊを局所的に二次関数で近似する。ここで、変換後の評価関数をＪ_ｍ、変換に用いる冪数（以下「冪乗パラメータ」という。）をｎとするとＪ_ｍはＪ_ｍ＝Ｔ（Ｊ）＝Ｊ^ｎと表すことができる。この冪乗パラメータｎは以下のように推定することができる。 FIG. 12 is a diagram showing an example of a method of estimating the n-order differential value in the third embodiment. First, a conversion function for converting the evaluation amount is determined in advance for the evaluation amount conversion unit 29 . Here, for the sake of simplification, the evaluation function J is converted to a power, and the converted evaluation function J is locally approximated by a quadratic function. Here, J _m can be expressed as J _m =T(J)=J ⁿ , where J _m is the evaluation function after conversion, and n is the exponent used for conversion (hereinafter referred to as "power parameter"). This power parameter n can be estimated as follows.

Ｊ_ｍ＝Ｊ^ｎと変換する場合、操作量Ｕに関するＪの勾配も同じ変換関数で変換されるため、Ｄ^２Ｊ_ｍ＝（Ｄ^２Ｊ）^ｎとなる。このとき、Ｄ^２Ｊ_ｍが一定値となるように冪乗パラメータｎを定めれば、変換後の評価関数の二階微分値は一定値となるため、変換後の評価量は二次関数で近似されたとみなすことができる。そこで、評価量変換部２９は、入力された評価量に基づいて取得される評価関数の二階微分値が予め定められた所定の定数Ｃ＝１となる冪数を算出することによって冪乗パラメータｎを推定する。図１２は、このような評価量変換部２９の構成を示す概念図である。 When converting J _m =J ⁿ , the gradient of J with respect to the manipulated variable U is also converted by the same conversion function, so D ² J _m =(D ² J) ⁿ . At this time, if the power parameter n is determined so that D ² J _m is a constant value, the second derivative value of the evaluation function after conversion will be a constant value, so the evaluation amount after conversion is approximated by a quadratic function. can be considered to have been Therefore, the evaluation amount conversion unit 29 calculates the power parameter n by calculating a power such that the second-order differential value of the evaluation function obtained based on the input evaluation amount is a predetermined constant C=1. to estimate FIG. 12 is a conceptual diagram showing the configuration of such an evaluation amount conversion unit 29. As shown in FIG.

具体的には、評価量変換部２９は、推定器２９１及び変換部２９２を備える。推定器２９１は、定数Ｃ＝１を評価関数の二階微分値の目標値として、変換関数Ｄ^２Ｊ_ｍ＝（Ｄ^２Ｊ）^ｎを満たす冪数ｎを探索し、探索結果を最終的な冪乗パラメータの値として変換部２９２に出力する。変換部２９２は、推定器２９１によって推定された冪乗パラメータｎを評価量の変換関数に適用して評価量Ｊを変換する。変換部２９２は、変換後の評価量Ｊ_ｍを極値制御部２７に出力する。 Specifically, the evaluation amount conversion unit 29 includes an estimator 291 and a conversion unit 292 . The estimator 291 searches for a power n that satisfies the transformation function D ² J _m =(D ² J) ⁿ using the constant C=1 as the target value of the second derivative of the evaluation function, and uses the search result as the final power It is output to the conversion unit 292 as the value of the power parameter. The conversion unit 292 converts the evaluation amount J by applying the power parameter n estimated by the estimator 291 to the conversion function of the evaluation amount. The conversion unit 292 outputs the converted evaluation amount J _m to the extreme value control unit 27 .

例えば、冪乗パラメータの推定を最急降下法のような方法で推定する場合、推定器２９１を、積分器を用いて構成することができる。また、推定器２９１は、冪乗パラメータｎを仮想的な操作量とみなして、目標値である定数Ｃ＝１と二階微分値Ｄ^２Ｊ_ｍ＝（Ｄ^２Ｊ）^ｎとの誤差をゼロにするようなＰＩＤ制御器を用いて構成されてもよい。 For example, if the power parameter is estimated by a method such as the method of steepest descent, the estimator 291 can be constructed using an integrator. Also, the estimator 291 regards the power parameter n as a virtual manipulated variable, and sets the error between the constant C=1, which is the target value, and the second-order differential value D ² J _m =(D ² J) ⁿ to zero. It may be configured using a PID controller such as

なお、ここでは、勾配推定部２５ｂによって二階微分値が推定される場合について説明したが、勾配推定部２５ｂが一階微分値を推定する場合には、評価量変換部２９は、その勾配推定値が操作量に対して比例するように評価量を変換するように構成されてもよい。 Here, the case where the gradient estimating unit 25b estimates the second-order differential value has been described, but when the gradient estimating unit 25b estimates the first-order differential value, the evaluation amount conversion unit 29 calculates the gradient estimated value may be configured to transform the evaluation quantity so that is proportional to the manipulated variable.

ただし、推定器２９１の目標値を定数Ｃ＝１とした場合には、図５に示した調整則において評価関数の勾配Ｇ（二階微分値）をＧ＝１と仮定して積分ゲインを決定する。もし、推定器２９１の目標値を定数Ｃ＝Ｇとした場合には、定数Ｃの値を用いて積分ゲインを決定する。 However, when the target value of the estimator 291 is set to a constant C=1, the integral gain is determined by assuming that the gradient G (second derivative value) of the evaluation function is G=1 in the adjustment rule shown in FIG. . If the target value of estimator 291 is constant C=G, then the value of constant C is used to determine the integral gain.

図１１の説明に戻る。パラメータ決定部２６ｂは、ローパスフィルタの周波数、ハイパスフィルタの周波数、ディザー信号の周波数、ディザー信号の振幅及び積分ゲインの５つの制御パラメータを決定する点では第１の実施形態におけるパラメータ決定部２６と同様であるが、積分ゲインの決定に評価関数の勾配推定値を用いない点でパラメータ決定部２６と異なる。 Returning to the description of FIG. The parameter determination unit 26b is the same as the parameter determination unit 26 in the first embodiment in that it determines five control parameters: low-pass filter frequency, high-pass filter frequency, dither signal frequency, dither signal amplitude, and integral gain. However, it is different from the parameter determining section 26 in that the slope estimation value of the evaluation function is not used to determine the integral gain.

図１３は、第３の実施形態の最適制御装置２ｂによって実現される極値制御システム１ｂの構成例を示すブロック線図である。極値制御システム１ｂが図２に示した基本的な構成の極値制御システム９と異なる点は、勾配推定部２５ｂによって取得された評価関数の勾配推定値（一階微分値又は二階微分値）が制御対象プロセスＴＰの制御量に基づいて取得される評価量に適応的に作用する点である。具体的には、評価量変換部２９（図示せず）が勾配推定値に基づいて変換した評価量が極値制御システム１ｂに入力される。これにより、最適制御装置２ｂは、第１の実施形態の最適制御装置２と同様に、制御対象プロセスのダイナミクスに適応して極値制御をより安定的に動作させることが可能となる。 FIG. 13 is a block diagram showing a configuration example of an extreme value control system 1b realized by the optimum control device 2b of the third embodiment. The difference between the extreme value control system 1b and the extreme value control system 9 having the basic configuration shown in FIG. adaptively acts on the evaluation amount obtained based on the control amount of the controlled process TP. Specifically, the evaluation amount converted by the evaluation amount conversion unit 29 (not shown) based on the gradient estimated value is input to the extreme value control system 1b. As a result, the optimum controller 2b can adapt to the dynamics of the process to be controlled and operate the extreme value control more stably, like the optimum controller 2 of the first embodiment.

図１４は、第１～第３の実施形態の最適制御装置によって得られる効果の具体例を示す図である。図１４（Ａ）は、実際には未知の評価関数の形状が二次関数、三次関数及び０．５次関数であると仮定し、従来の調整則で調整した制御パラメータに基づく極値制御をシミュレーションした結果を示す。また、図１４（Ｂ）は、同仮定の下で、本実施形態の調整方法で調整された制御パラメータに基づく極値制御をシミュレーションした結果を示す。具体的には、図１４（Ａ）のシミュレーションにおいては、特許文献１に記載された調整則に基づいて制御パラメータを調整した。図１４（Ａ）を見ても分かるように、従来の調整方法では、評価関数の形状が二次関数や三次関数である場合には極値探索に成功しているものの、評価関数の形状が０．５次関数である場合には探索性能が著しく劣化している。一方、本実施形態の調整方法では、図１４（Ｂ）を見ても分かるように、評価関数の形状に関わらず極値の探索成功している。 FIG. 14 is a diagram showing a specific example of effects obtained by the optimum control devices of the first to third embodiments. FIG. 14(A) assumes that the shape of the unknown evaluation function is actually a quadratic function, a cubic function, and a 0.5th order function, and performs extreme value control based on the control parameters adjusted by the conventional adjustment rule. A simulation result is shown. Further, FIG. 14B shows the result of simulating extreme value control based on the control parameters adjusted by the adjustment method of the present embodiment under the same assumption. Specifically, in the simulation of FIG. 14A, the control parameters were adjusted based on the adjustment rule described in Patent Document 1. As can be seen from FIG. 14A, in the conventional adjustment method, the extremum search is successful when the shape of the evaluation function is a quadratic function or a cubic function. In the case of the 0.5th order function, the search performance is remarkably degraded. On the other hand, as can be seen from FIG. 14B, the adjustment method of the present embodiment succeeds in searching for extreme values regardless of the shape of the evaluation function.

実際の極値制御においては最適化したい評価関数の形状を予め知ることができない。そのため、従来の制御パラメータの調整方法では、評価関数の形状に依存して、極値（局所的な最適値）の探索性能が変化し、最悪の場合には制御が不安定になる可能性もあった。これに対して、本実施形態における制御パラメータの調整方法によれば、評価関数の形状がどのような形状であっても常に安定的に極値を探索することが可能になる。 In actual extreme value control, the shape of the evaluation function to be optimized cannot be known in advance. Therefore, in the conventional control parameter adjustment method, the search performance for extreme values (local optimum values) changes depending on the shape of the evaluation function, and in the worst case, the control may become unstable. there were. In contrast, according to the control parameter adjustment method of the present embodiment, it is possible to always stably search for the extreme value regardless of the shape of the evaluation function.

（変形例）
第１の実施形態の最適制御装置２は、勾配推定部２５によって推定された評価関数の勾配を示す勾配情報と、極値制御部２７によって決定された操作量を示す操作量情報と、を対応づけて、ＣＲＴ（Cathode Ray Tube）ディスプレイや液晶ディスプレイ、有機ＥＬ（Electro-Luminescence）ディスプレイ等の表示装置に表示させるための情報（以下「表示情報」という。）を生成する表示制御部（図示せず）を備えてもよい。図１５は、変形例の最適制御装置２において表示情報によって表示される画面の具体例を示す図である。表示情報は、図１５（Ａ）に示すように、時間軸に対して各値を別系列で示すものであってもよいし、各値を一つの系列で示すものであってもよい。また、表示情報は、図１５（Ｂ）に示すように、一方又は両方の値を相関する他の値に置き換えた形で各値を表示するものであってもよい。図１５（Ｂ）は、勾配情報を、評価量を示す情報で置き換えた例である。なお、図１５に示されるように、表示情報には、現在の動作点や現在時刻などを示す情報が含まれてもよい。 (Modification)
The optimum control device 2 of the first embodiment associates gradient information indicating the gradient of the evaluation function estimated by the gradient estimator 25 with manipulated variable information indicating the manipulated variable determined by the extreme value controller 27. Then, a display control unit (not shown) generates information (hereinafter referred to as "display information") for display on a display device such as a CRT (Cathode Ray Tube) display, a liquid crystal display, an organic EL (Electro-Luminescence) display, or the like. ) may be provided. FIG. 15 is a diagram showing a specific example of a screen displayed by the display information in the optimum control device 2 of the modified example. As shown in FIG. 15A, the display information may indicate each value in a separate series on the time axis, or may indicate each value in one series. Moreover, as shown in FIG. 15B, the display information may display each value in a form in which one or both of the values are replaced with another correlated value. FIG. 15B is an example in which gradient information is replaced with information indicating an evaluation amount. In addition, as shown in FIG. 15, the display information may include information indicating the current operating point, the current time, and the like.

なお、この場合、最適制御装置２は、表示情報を表示させるための表示装置を備えてもよいし、これらの表示装置を自装置に接続するインターフェースを備えてもよい。また、最適制御装置２は、表示情報を他の装置に送信するための通信インターフェースを備えてもよい。また、表示制御部は、第１の実施形態の最適制御装置２と同様に、第２又は第３の実施形態における最適制御装置２ａ又は２ｂに備えられてもよい。なお、第２の実施形態においては、表示される操作量は、極値制御部２７によって決定された操作量であってもよいし、操作量変換部２８によって変換された操作量であってもよい。 In this case, the optimum control device 2 may be provided with a display device for displaying display information, or may be provided with an interface for connecting these display devices to its own device. The optimum control device 2 may also have a communication interface for transmitting display information to other devices. Also, the display control unit may be provided in the optimum control device 2a or 2b in the second or third embodiment, like the optimum control device 2 in the first embodiment. In the second embodiment, the manipulated variable displayed may be the manipulated variable determined by the extreme value control unit 27, or may be the manipulated variable converted by the manipulated variable conversion unit 28. good.

以上説明した少なくともひとつの実施形態によれば、制御対象プロセスに関して観測される評価量に基づいて、操作量に対する評価量を示す未知の評価関数の変化率を示す勾配を推定する勾配推定部と、勾配推定部によって取得された勾配の推定値に基づいて、極値制御の実行に必要な制御パラメータ、操作量又は評価量を、評価関数の変化に適応して補正する補正部と、を持つことにより、制御対象プロセスのダイナミクスに適応して極値制御をより安定的に動作させることができる。 According to at least one embodiment described above, a gradient estimator for estimating a gradient indicating a rate of change of an unknown evaluation function indicating an evaluation amount with respect to a manipulated variable based on an evaluation amount observed for a controlled process; a correction unit that adapts to changes in the evaluation function and corrects the control parameter, the manipulated variable, or the evaluation amount necessary for executing extreme value control based on the estimated value of the gradient obtained by the gradient estimator; Therefore, extremal control can be operated more stably by adapting to the dynamics of the controlled process.

なお、第１の実施形態においては、勾配推定値に基づいて推定器１４の積分ゲインを適応的に更新するパラメータ決定部２６が上記補正部の一例である。また、第２の実施形態においては、勾配推定値に基づいて操作量を適応的に変換する操作量変換部２８が上記補正部の一例である。また、第３の実施形態においては、勾配推定値に基づいて評価量を適応的に変換する評価量変換部２９が上記補正部の一例である。 In the first embodiment, the parameter determination section 26 that adaptively updates the integral gain of the estimator 14 based on the gradient estimation value is an example of the correction section. Further, in the second embodiment, the manipulated variable conversion unit 28 that adaptively converts the manipulated variable based on the estimated gradient value is an example of the correction unit. Further, in the third embodiment, the evaluation amount conversion unit 29 that adaptively converts the evaluation amount based on the gradient estimation value is an example of the correction unit.

本発明のいくつかの実施形態を説明したが、これらの実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれると同様に、特許請求の範囲に記載された発明とその均等の範囲に含まれるものである。 While several embodiments of the invention have been described, these embodiments have been presented by way of example and are not intended to limit the scope of the invention. These embodiments can be implemented in various other forms, and various omissions, replacements, and modifications can be made without departing from the scope of the invention. These embodiments and their modifications are included in the scope and spirit of the invention, as well as the scope of the invention described in the claims and equivalents thereof.

１，１ａ，１ｂ…極値制御システム、１１…ハイパスフィルタ、１２…ディザー信号出力部、１３…ローパスフィルタ、１４…推定器、９…極値制御システム（基本的な構成）、２，２ａ，２ｂ…最適制御装置、２１…ディザー信号出力部、２２…操作量出力部、２３…計測情報取得部、２４，２４ｂ…評価量算出部、２５，２５ａ，２５ｂ…勾配推定部、２６，２６ａ，２６ｂ…パラメータ決定部、２７…極値制御部、２８…操作量変換部、２９…評価量変換部、２９１…推定器、２９２…変換部、３…水処理プラント、３１…嫌気槽、３１１…薬品投入ポンプ、３１２…センサー、３２…無酸素槽、３３…好気槽、３３１…循環ポンプ、３３２…ブロワ、３４…最終沈澱池、３４１…返送汚泥ポンプ、３４２…余剰汚泥引き抜きポンプ、３４３…センサー 1, 1a, 1b... extremum control system, 11...high pass filter, 12... dither signal output unit, 13... low pass filter, 14... estimator, 9... extremum control system (basic configuration), 2, 2a, 2b... optimum control device, 21... dither signal output unit, 22... operation amount output unit, 23... measurement information acquisition unit, 24, 24b... evaluation amount calculation unit, 25, 25a, 25b... gradient estimation unit, 26, 26a, 26b... parameter determination unit, 27... extreme value control unit, 28... operation amount conversion unit, 29... evaluation amount conversion unit, 291... estimator, 292... conversion unit, 3... water treatment plant, 31... anaerobic tank, 311... Chemical injection pump 312 Sensor 32 Anoxic tank 33 Aerobic tank 331 Circulation pump 332 Blower 34 Final sedimentation tank 341 Return sludge pump 342 Excess sludge extraction pump 343 sensor

Claims

Based on a manipulated variable of the controlled process and an evaluation quantity indicating an index related to optimization of the controlled process based on the controlled quantity that changes according to the manipulated variable, the operation is performed so that the evaluated quantity approaches an optimum value. A control device that performs extreme value control that changes the amount of
a gradient estimator for estimating a gradient indicating a rate of change of an evaluation function, which is a function representing the evaluation quantity and is unknown with respect to the manipulated variable, based on the evaluation quantity observed with respect to the controlled process; ,
Correction for adaptively correcting the control parameter, the manipulated variable, or the evaluation amount necessary for executing the extreme value control based on the estimated value of the gradient obtained by the gradient estimator, in accordance with changes in the evaluation function. Department and
with
The extreme value control unit that executes the extreme value control has an integrator for determining a manipulated variable to be given to the controlled process,
The gradient estimator estimates a second derivative value of the evaluation function as the gradient,
The correcting unit corrects the integral gain of the integrator based on the second order differential value of the evaluation function estimated by the gradient estimating unit.
optimum controller.

Based on a manipulated variable of the controlled process and an evaluation quantity indicating an index related to optimization of the controlled process based on the controlled quantity that changes according to the manipulated variable, the operation is performed so that the evaluated quantity approaches an optimum value. A control device that performs extreme value control that changes the amount of
a gradient estimator for estimating a gradient indicating a rate of change of an evaluation function, which is a function representing the evaluation quantity and is unknown with respect to the manipulated variable, based on the evaluation quantity observed with respect to the controlled process; ,
Correction for adaptively correcting the control parameter, the manipulated variable, or the evaluation amount necessary for executing the extreme value control based on the estimated value of the gradient obtained by the gradient estimator, in accordance with changes in the evaluation function. Department and
with
The extreme value control unit that executes the extreme value control is an integrator for determining the manipulated variable given to the controlled process, and the integral gain thereof is obtained by assuming that the evaluation function is a quadratic function. having an integrator determined based on the second derivative of the evaluation function;
The gradient estimator estimates a first derivative value of the evaluation function as the gradient,
The correcting unit corrects the manipulated variable determined by the extremal value control unit based on the integral gain based on the first derivative of the evaluation function estimated by the gradient estimating unit.
optimum controller.

Based on a manipulated variable of the controlled process and an evaluation quantity indicating an index related to optimization of the controlled process based on the controlled quantity that changes according to the manipulated variable, the operation is performed so that the evaluated quantity approaches an optimum value. A control device that performs extreme value control that changes the amount of
a gradient estimator for estimating a gradient indicating a rate of change of an evaluation function, which is a function representing the evaluation quantity and is unknown with respect to the manipulated variable, based on the evaluation quantity observed with respect to the controlled process; ,
Correction for adaptively correcting the control parameter, the manipulated variable, or the evaluation amount necessary for executing the extreme value control based on the estimated value of the gradient obtained by the gradient estimator, in accordance with changes in the evaluation function. Department and
with
The extreme value control unit that executes the extreme value control is an integrator for determining the manipulated variable given to the controlled process, and the integral gain thereof is obtained by assuming that the evaluation function is a quadratic function. having an integrator determined based on the second derivative of the evaluation function;
The gradient estimating unit estimates a first-order differential value or a second-order differential value of the evaluation function as the gradient,
The correction unit adjusts the evaluation function unknown with respect to the manipulated variable based on the first-order differential value or the second-order differential value of the evaluation function estimated by the gradient estimating unit. Transform variables so that they change linearly with respect to changes,
optimum controller.

Based on a manipulated variable of the controlled process and an evaluation quantity indicating an index related to optimization of the controlled process based on the controlled quantity that changes according to the manipulated variable, the operation is performed so that the evaluated quantity approaches an optimum value. A control device that performs extreme value control that changes the amount of
a gradient estimator for estimating a gradient indicating a rate of change of an evaluation function, which is a function representing the evaluation quantity and is unknown with respect to the manipulated variable, based on the evaluation quantity observed with respect to the controlled process; ,
Correction for adaptively correcting the control parameter, the manipulated variable, or the evaluation amount necessary for executing the extreme value control based on the estimated value of the gradient obtained by the gradient estimator, in accordance with changes in the evaluation function. Department and
with
When the dither signal used for the extreme value control is a sine wave, the gradient estimating unit estimates a differential value of the first or higher order of the evaluation function using a filter or an observer.
optimum controller.

Gradient information indicating the gradient of the evaluation function estimated by the gradient estimating unit, and a manipulated variable indicating the manipulated variable determined by the extreme value control unit that executes the extreme value control or the manipulated variable corrected by the correcting unit. Further comprising a display control unit that generates information to be displayed on the display unit in association with the information,
Optimal control device according to any one of claims 1 to 4 .

Based on a manipulated variable of the controlled process and an evaluation quantity indicating an index related to optimization of the controlled process based on the controlled quantity that changes according to the manipulated variable, the operation is performed so that the evaluated quantity approaches an optimum value. A control method for extreme value control that changes the amount of
a gradient estimation step of estimating a gradient indicating a rate of change of an evaluation function, which is a function representing the evaluation quantity and is unknown with respect to the manipulated variable, based on the evaluation quantity observed with respect to the controlled process; ,
Correction for adaptively correcting the control parameter, the manipulated variable, or the evaluation amount necessary for executing the extreme value control based on the estimated value of the gradient obtained in the gradient estimation step, in accordance with changes in the evaluation function. a step;
has
estimating a second derivative of the evaluation function as the gradient in the gradient estimation step;
In the correcting step, the integral gain of an integrator that the extreme value control unit that executes the extreme value control has for determining the manipulated variable given to the controlled process is calculated by the gradient estimating step. corrected based on the second derivative of the evaluation function,
control method.

Based on a manipulated variable of the controlled process and an evaluation quantity indicating an index related to optimization of the controlled process based on the controlled quantity that changes according to the manipulated variable, the operation is performed so that the evaluated quantity approaches an optimum value. A control method for extreme value control that changes the amount of
a gradient estimation step of estimating a gradient indicating a rate of change of an evaluation function, which is a function representing the evaluation quantity and is unknown with respect to the manipulated variable, based on the evaluation quantity observed with respect to the controlled process; ,
Correction for adaptively correcting the control parameter, the manipulated variable, or the evaluation amount necessary for executing the extreme value control based on the estimated value of the gradient obtained in the gradient estimation step, in accordance with changes in the evaluation function. a step;
has
The extreme value control unit that executes the extreme value control is an integrator for determining the manipulated variable given to the controlled process, and the integral gain thereof is obtained by assuming that the evaluation function is a quadratic function. having an integrator determined based on the second derivative of the evaluation function;
estimating a first derivative of the evaluation function as the gradient in the gradient estimation step;
In the correcting step, the manipulated variable determined by the extremal value control unit based on the integral gain is corrected based on the first derivative of the evaluation function estimated by the gradient estimating step.
control method.

Based on a manipulated variable of the controlled process and an evaluation quantity indicating an index related to optimization of the controlled process based on the controlled quantity that changes according to the manipulated variable, the operation is performed so that the evaluated quantity approaches an optimum value. A control method for extreme value control that changes the amount of
a gradient estimation step of estimating a gradient indicating a rate of change of an evaluation function, which is a function representing the evaluation quantity and is unknown with respect to the manipulated variable, based on the evaluation quantity observed with respect to the controlled process; ,
Correction for adaptively correcting the control parameter, the manipulated variable, or the evaluation amount necessary for executing the extreme value control based on the estimated value of the gradient obtained in the gradient estimation step, in accordance with changes in the evaluation function. a step;
has
The extreme value control unit that executes the extreme value control is an integrator for determining the manipulated variable given to the controlled process, and the integral gain thereof is obtained by assuming that the evaluation function is a quadratic function. having an integrator determined based on the second derivative of the evaluation function;
in the gradient estimation step, estimating a first-order differential value or a second-order differential value of the evaluation function as the gradient;
In the correcting step, the evaluation function, which is unknown with respect to the manipulated variable, is corrected based on the first-order differential value or the second-order differential value of the evaluation function estimated by the gradient estimating step. Transform variables so that they change linearly with respect to changes,
control method.

Based on a manipulated variable of the controlled process and an evaluation quantity indicating an index related to optimization of the controlled process based on the controlled quantity that changes according to the manipulated variable, the operation is performed so that the evaluated quantity approaches an optimum value. A control method for extreme value control that changes the amount of
a gradient estimation step of estimating a gradient indicating a rate of change of an evaluation function, which is a function representing the evaluation quantity and is unknown with respect to the manipulated variable, based on the evaluation quantity observed with respect to the controlled process; ,
Correction for adaptively correcting the control parameter, the manipulated variable, or the evaluation amount necessary for executing the extreme value control based on the estimated value of the gradient obtained in the gradient estimation step, in accordance with changes in the evaluation function. a step;
has
In the gradient estimation step, if the dither signal used for the extreme value control is a sine wave, a filter or an observer is used to estimate a differential value of the first order or higher of the evaluation function.
control method.

the computer,
A computer program for functioning as the optimum control device according to any one of claims 1 to 5 .