JPH09114503A

JPH09114503A - Controller

Info

Publication number: JPH09114503A
Application number: JP7265086A
Authority: JP
Inventors: Makoto Kano; 誠加納
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1995-10-13
Filing date: 1995-10-13
Publication date: 1997-05-02

Abstract

PROBLEM TO BE SOLVED: To perform accurate control in consideration of a temporal factor present in a loop for feedback control. SOLUTION: According to the response time pΔt from when a manipulated variable is inputted to a controlled system 1 until when the influence of the manipulated variable appears in a controlled variable, and observation data which is stored in an observation data storage part and consists of a controlled variable measured in past one-unit time and the manipulated variable inputted to the controlled system 1, a learning part 4 regulates the parameter W of a regulation part 2 by learning and outputs it to the regulation part 2.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、例えば下水処理施
設の曝気槽内の溶存酸素濃度を一定値に制御する制御装
置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a controller for controlling the dissolved oxygen concentration in an aeration tank of a sewage treatment facility to a constant value.

【０００２】[0002]

【従来の技術】この種のシステムを制御する制御装置
は、制御対象の緩やかな特性変化に適応すると共に、制
御量に影響を与える制御対象の状態の急激な変化に対応
できることが望まれる。しかし、下水処理施設の曝気槽
内の溶存酸素濃度制御は、観測された下水中の溶存酸素
濃度と目標溶存酸素濃度との誤差から曝気風量を決定す
るフィードバック制御を行っているが、降雨などにより
下水流入量が急激に変化すると、フィードバック制御で
は正確な制御ができなくなり、監視員が手作業で曝気風
量を調節しなければならない。また、下水流入量の変化
などによる溶存酸素濃度の急激な変化だけでなく、季節
によって下水処理プロセスの特性が大きく変化するた
め、フィードバック制御のゲイン定数を調整しなければ
ならないのが現状である。2. Description of the Related Art It is desirable that a control device for controlling a system of this type adapts to a gradual change in the characteristics of a controlled object and can cope with a sudden change in the state of the controlled object that affects the controlled variable. However, the dissolved oxygen concentration control in the aeration tank of the sewage treatment facility uses feedback control that determines the aeration air volume from the error between the observed dissolved oxygen concentration in the sewage and the target dissolved oxygen concentration. If the sewage inflow rate changes abruptly, feedback control will not be able to perform accurate control, and a monitor must manually adjust the aeration air volume. In addition, not only abrupt changes in dissolved oxygen concentration due to changes in sewage inflow, but also characteristics of the sewage treatment process change significantly depending on the season, so the current situation is that the feedback control gain constant must be adjusted.

【０００３】そこで、観測データを利用した学習により
調節部を調整することにより、制御対象の緩やかな特性
変化に適応させると共に、調節部において制御対象の制
御量の予測と操作量の修正とを繰り返すループを持つこ
とで、制御量に影響を与える制御対象の状態の急激な変
化に対応させることが考えられる。Therefore, the adjustment unit is adjusted by learning using the observation data to adapt to the gradual characteristic change of the control target, and the control unit repeats the prediction of the control amount of the control target and the correction of the operation amount. By having a loop, it is possible to deal with a sudden change in the state of the controlled object that affects the controlled variable.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、このよ
うな対策を講じたとしても、現実のシステムにおいては
フィードバック制御のループの中は様々な時間的な要因
が存在するため、正確な制御が行えない。すなわち、例
えば制御対象に操作量を与えた時刻と操作量による影響
が制御量に現れるまでの時刻とは実際には時間的な開き
がある。また、調節部で調節量が決定する時刻とこの調
節量に基づき操作部から制御対象に操作量が与えれる時
刻とは時間的な開きがある。さらに、制御対象で計測さ
れた制御量の計測値が調節部に伝送されるのに時間遅れ
がある。However, even if such measures are taken, accurate control cannot be performed in an actual system because various time factors exist in the feedback control loop. . That is, for example, there is actually a time difference between the time when the operation amount is given to the controlled object and the time when the influence of the operation amount appears in the control amount. Further, there is a temporal difference between the time when the adjustment amount is determined by the adjustment unit and the time when the operation amount is applied to the controlled object from the operation unit based on the adjustment amount. Furthermore, there is a time delay before the measured value of the controlled variable measured by the controlled object is transmitted to the adjustment unit.

【０００５】本発明は、かかる事情に基づき成されたも
ので、フィードバック制御のループの中に存在する時間
的な要因を考慮してより正確な制御を行うことができる
制御装置を提供することを目的としている。The present invention has been made in view of the above circumstances, and it is an object of the present invention to provide a control device capable of performing more accurate control in consideration of a time factor existing in a feedback control loop. Has an aim.

【０００６】[0006]

【課題を解決するための手段】上記目的を達成するた
め、請求項１記載の発明は、制御対象に入力される操作
量を変化させることで制御対象の制御量を制御する制御
装置において、前記制御対象で計測された制御量及び目
標制御量に基づき、調節信号を算出する調節部と、前記
調節部で算出された調節信号に基づき、前記制御対象に
操作量を出力する操作部と、過去に観測された制御量及
び操作量を有する観測データを記憶する観測データ記憶
部と、前記操作部から前記制御対象へ操作量を入力して
から当該操作量による影響が制御量に現れるまでの制御
対象の応答時間と前記観測データ記憶部に記憶された観
測データとに基づいて、前記調節部を学習する学習部と
を具備する。In order to achieve the above object, the invention according to claim 1 is a control device for controlling the controlled variable of a controlled object by changing the manipulated variable input to the controlled object. An adjusting unit that calculates an adjustment signal based on the control amount and the target control amount measured by the control target, and an operation unit that outputs the operation amount to the control target based on the adjustment signal calculated by the adjusting unit, and the past. An observation data storage unit that stores observation data having a control amount and an operation amount that are observed in, and control from input of the operation amount from the operation unit to the control target until the influence of the operation amount appears in the control amount. A learning unit that learns the adjustment unit based on the response time of the target and the observation data stored in the observation data storage unit.

【０００７】請求項２記載の発明は、制御対象に入力さ
れる操作量を変化させることで制御対象の制御量を制御
する制御装置において、前記制御対象で計測された制御
量及び目標制御量に基づき、調節信号を算出する調節部
と、前記調節部で算出された調節信号に基づき、前記制
御対象に操作量を出力する操作部と、過去に観測された
制御量及び操作量を有する観測データを記憶する観測デ
ータ記憶部と、前記操作部から出力された操作量と前記
制御対象で計測された制御量に基づき、前記操作部から
前記制御対象へ操作量を入力してから当該操作量による
影響が制御量に現れるまでの制御対象の応答時間を算出
する応答時間算出部と、前記応答時間算出部により算出
された応答時間と前記観測データ記憶部に記憶された観
測データとに基づいて、前記調節部を学習する学習部と
を具備する。According to a second aspect of the present invention, in a control device for controlling a control amount of a control target by changing an operation amount input to the control target, the control amount and the target control amount measured by the control target are set. Based on the adjustment signal calculated by the adjustment unit, based on the adjustment signal calculated by the adjustment unit, the operation unit for outputting the operation amount to the control target, the observation data having the control amount and the operation amount observed in the past Based on the operation amount output from the operation unit and the control amount measured by the control target, the observation data storage unit that stores the Based on the response time calculation unit that calculates the response time of the controlled object until the influence appears in the control amount, the response time calculated by the response time calculation unit, and the observation data stored in the observation data storage unit. Te comprises a learning unit for learning the adjustment portion.

【０００８】請求項３記載の発明は、請求項１または請
求項２記載の制御装置において、調節部が、前記制御対
象で計測された制御量及び目標制御量に基づき、仮想操
作量を算出する仮想操作量算出部と、前記制御対象で計
測された制御量及び仮想操作量に基づき、前記応答時間
後の制御量を予測する制御量予測部と、前記制御量予測
部により予測された応答時間後の予測制御量及び目標制
御量に基づき、仮想操作量を修正する操作量修正部と、
前記制御量予測部による制御量の予測と前記操作量修正
部による仮想操作量の修正を少なくとも１回行って得ら
れた仮想操作量を調節信号として前記操作部に入力する
手段とを具備する。According to a third aspect of the present invention, in the control device according to the first or second aspect, the adjusting section calculates the virtual operation amount based on the control amount and the target control amount measured by the controlled object. A virtual operation amount calculation unit, a control amount prediction unit that predicts a control amount after the response time based on the control amount and the virtual operation amount measured in the control target, and a response time predicted by the control amount prediction unit An operation amount correction unit that corrects the virtual operation amount based on the subsequent predicted control amount and the target control amount,
The control amount predicting unit predicts the control amount and the operation amount correcting unit corrects the virtual operation amount at least once, and inputs the virtual operation amount obtained as an adjustment signal to the operation unit.

【０００９】請求項４記載の発明は、請求項３記載の制
御装置において、前記制御量予測部が神経回路モデルか
ら構成され、前記操作量修正部の計算が神経回路モデル
の誤差逆伝播計算により実現され、前記学習部は、前記
応答時間と前記観測データ記憶部に記憶された観測デー
タとに基づいて、学習データを作成し、前記制御量予測
部を構成する神経回路モデルの学習を行う。According to a fourth aspect of the present invention, in the control device according to the third aspect, the control amount predicting unit is composed of a neural circuit model, and the calculation of the manipulated variable correcting unit is performed by an error back propagation calculation of the neural circuit model. Realized, the learning unit creates learning data based on the response time and the observation data stored in the observation data storage unit, and learns the neural circuit model that constitutes the control amount prediction unit.

【００１０】請求項５記載の発明は、請求項４記載の制
御装置において、前記学習部は、前記観測データ記憶部
に記憶された観測データに基づいて、学習データを作成
する際に、同じ時刻の制御量と操作量とを有する観測デ
ータを入力信号とし、前記応答時間後の制御量を教師信
号にして学習を行う。According to a fifth aspect of the present invention, in the control device according to the fourth aspect, the learning unit creates the learning data based on the observation data stored in the observation data storage unit at the same time. The observation data having the controlled variable and the manipulated variable of is used as an input signal, and the controlled variable after the response time is used as a teacher signal for learning.

【００１１】請求項６記載の発明は、制御対象に入力さ
れる操作量を変化させることで制御対象の制御量を制御
する制御装置において、前記制御対象で計測された制御
量及び目標制御量に基づき、調節信号を算出する調節部
と、前記調節部で算出された調節信号に基づき、前記制
御対象に操作量を出力する操作部と、過去に観測された
制御量及び操作量を有する観測データを記憶する観測デ
ータ記憶部と、前記操作部から前記制御対象へ操作量を
入力してから当該操作量による影響が制御量に現れるま
での制御対象の応答時間と前記調節部で算出された調節
信号が当該操作部を通して操作量として実現されるまで
の操作遅れ時間と前記観測データ記憶部に記憶された観
測データとに基づいて、前記調節部を学習する学習部と
を具備する。According to a sixth aspect of the present invention, in a control device for controlling the control amount of the control target by changing the operation amount input to the control target, the control amount and the target control amount measured by the control target are set. Based on the adjustment signal calculated by the adjustment unit, based on the adjustment signal calculated by the adjustment unit, the operation unit for outputting the operation amount to the control target, the observation data having the control amount and the operation amount observed in the past An observation data storage unit that stores the response time of the control target from the time when the operation amount is input from the operation unit to the control target until the influence of the operation amount appears on the control amount, and the adjustment calculated by the adjustment unit. The learning unit learns the adjustment unit based on the operation delay time until the signal is realized as an operation amount through the operation unit and the observation data stored in the observation data storage unit.

【００１２】請求項７記載の発明は、制御対象に入力さ
れる操作量を変化させることで制御対象の制御量を制御
する制御装置において、前記制御対象で計測された制御
量及び目標制御量に基づき、調節信号を算出する調節部
と、前記調節部で算出された調節信号に基づき、前記制
御対象に操作量を出力する操作部と、過去に観測された
制御量及び操作量を有する観測データを記憶する観測デ
ータ記憶部と、前記調節部で算出された調節信号と前記
操作部で出力された操作量に基づき、前記調節部で算出
された調節信号が当該操作部を通して操作量として実現
されるまでの操作遅れ時間を計測する操作遅れ時間計測
部と、前記操作遅れ時間計測部により計測された操作送
れ時間と前記操作部から前記制御対象へ操作量を入力し
てから当該操作量による影響が制御量に現れるまでの制
御対象の応答時間と前記観測データ記憶部に記憶された
観測データとに基づいて、前記調節部を学習する学習部
とを具備する。According to a seventh aspect of the present invention, in a control device for controlling the controlled variable of the controlled object by changing the manipulated variable input to the controlled object, the controlled variable and the target controlled variable measured by the controlled object are used. Based on the adjustment signal calculated by the adjustment unit, based on the adjustment signal calculated by the adjustment unit, the operation unit for outputting the operation amount to the control target, the observation data having the control amount and the operation amount observed in the past Based on the observation data storage unit that stores the adjustment signal calculated by the adjustment unit and the operation amount output by the operation unit, the adjustment signal calculated by the adjustment unit is realized as the operation amount through the operation unit. The operation delay time measuring unit that measures the operation delay time until the operation, the operation sending time measured by the operation delay time measuring unit, and the operation amount after the operation amount is input from the operation unit to the control target. According influence based on the stored observation data in the observation data memory unit and the response time of the control object to appear in the controlled variable comprises a learning unit for learning the adjustment portion.

【００１３】請求項８記載の発明は、請求項６または請
求項７記載の制御装置において、調節部が、前記制御対
象で計測された制御量及び目標制御量に基づき、仮想操
作量を算出する仮想操作量算出部と、前記制御対象で計
測された制御量及び仮想操作量に基づき、前記応答時間
後の制御量を予測する制御量予測部と、前記制御量予測
部により予測された応答時間後の予測制御量及び目標制
御量に基づき、仮想操作量を修正する操作量修正部と、
前記制御量予測部による制御量の予測と前記操作量修正
部による仮想操作量の修正を少なくとも１回行って得ら
れた仮想操作量を調節信号として前記操作部に入力する
手段とを具備する。According to an eighth aspect of the present invention, in the control device according to the sixth or seventh aspect, the adjusting section calculates the virtual operation amount based on the control amount measured by the control target and the target control amount. A virtual operation amount calculation unit, a control amount prediction unit that predicts a control amount after the response time based on the control amount and the virtual operation amount measured in the control target, and a response time predicted by the control amount prediction unit An operation amount correction unit that corrects the virtual operation amount based on the subsequent predicted control amount and the target control amount,
The control amount predicting unit predicts the control amount and the operation amount correcting unit corrects the virtual operation amount at least once, and inputs the virtual operation amount obtained as an adjustment signal to the operation unit.

【００１４】請求項９記載の発明は、請求項８記載の制
御装置において、前記制御量予測部が神経回路モデルか
ら構成され、前記操作量修正部の計算が神経回路モデル
の誤差逆伝播計算により実現され、前記学習部は、前記
応答時間と前記操作遅れ時間と前記観測データ記憶部に
記憶された観測データとに基づいて、学習データを作成
し、前記制御量予測部を構成する神経回路モデルの学習
を行う。According to a ninth aspect of the present invention, in the control device according to the eighth aspect, the control amount predicting unit is composed of a neural circuit model, and the calculation of the manipulated variable correcting unit is performed by an error back propagation calculation of the neural circuit model. Realized, the learning unit creates learning data based on the response time, the operation delay time, and the observation data stored in the observation data storage unit, and forms a neural circuit model that constitutes the control amount prediction unit. Learn.

【００１５】請求項１０記載の発明は、請求項９記載の
制御装置において、前記学習部は、前記観測データ記憶
部に記憶された観測データに基づいて、学習データを作
成する際に、当該時刻の制御量と当該時刻より操作遅れ
時間後の操作量とを有する観測データを入力信号とし、
前記応答時間後の制御量を教師信号にして学習を行う。
請求項１１記載の発明は、制御対象に入力される操作
量を変化させることで制御対象の制御量を制御する制御
装置において、前記制御対象で計測された制御量及び目
標制御量に基づき、調節信号を算出する調節部と、前記
調節部で算出された調節信号に基づき、前記制御対象に
操作量を出力する操作部と、過去に観測された制御量及
び操作量を有する観測データを記憶する観測データ記憶
部と、前記操作部から前記制御対象へ操作量を入力して
から当該操作量による影響が制御量に現れるまでの制御
対象の応答時間と前記制御対象で計測された制御量が前
記伝送部を通して調節部に入力されるまでの伝送むだ時
間と前記観測データ記憶部に記憶された観測データとに
基づいて、前記調節部を学習する学習部とを具備する。
請求項１２記載の発明は、制御対象に入力される操作
量を変化させることで制御対象の制御量を制御する制御
装置において、前記制御対象で計測された制御量及び目
標制御量に基づき、調節信号を算出する調節部と、前記
調節部で算出された調節信号に基づき、前記制御対象に
操作量を出力する操作部と、過去に観測された制御量及
び操作量を有する観測データを記憶する観測データ記憶
部と、前記制御対象で計測された制御量と前記伝送部を
通して調節部に入力された制御量とに基づき、前記伝送
部による伝送むだ時間を計測する伝送むだ時間計測部
と、前記伝送むだ時間計測部により計測された伝送むだ
時間と前記操作部から前記制御対象へ操作量を入力して
から当該操作量による影響が制御量に現れるまでの制御
対象の応答時間と前記観測データ記憶部に記憶された観
測データとに基づいて、前記調節部を学習する学習部と
を具備する。According to a tenth aspect of the present invention, in the control device according to the ninth aspect, when the learning unit creates learning data based on the observation data stored in the observation data storage unit, the time of day As the input signal, the observation data having the control amount of and the operation amount after the operation delay time from the time,
Learning is performed by using the control amount after the response time as a teacher signal.
According to an eleventh aspect of the present invention, in a control device that controls a controlled variable of a controlled object by changing an operation amount input to the controlled object, adjustment is made based on the controlled variable and the target controlled variable measured by the controlled object. An adjustment unit that calculates a signal, an operation unit that outputs an operation amount to the controlled object based on the adjustment signal calculated by the adjustment unit, and an observation data that has a control amount and an operation amount that have been observed in the past are stored. The observation data storage unit, the response time of the control target from the input of the operation amount from the operation unit to the control target until the influence of the operation amount appears in the control amount and the control amount measured by the control target are The learning unit learns the adjustment unit based on the transmission time delay until the data is input to the adjustment unit via the transmission unit and the observation data stored in the observation data storage unit.
According to a twelfth aspect of the present invention, in a control device for controlling the controlled variable of the controlled object by changing the manipulated variable input to the controlled object, adjustment is made based on the controlled variable and the target controlled variable measured by the controlled object. An adjustment unit that calculates a signal, an operation unit that outputs an operation amount to the controlled object based on the adjustment signal calculated by the adjustment unit, and an observation data that has a control amount and an operation amount that have been observed in the past are stored. An observation data storage unit, a transmission dead time measuring unit that measures a transmission dead time by the transmission unit based on a control amount measured by the control target and a control amount input to the adjustment unit through the transmission unit, and The transmission dead time measured by the transmission dead time measuring unit and the response time of the control object from the time when the operation amount is input from the operation unit to the control target until the influence of the operation amount appears in the control amount, and Based on the stored observation data in the measurement data storage unit comprises a learning unit for learning the adjustment portion.

【００１６】請求項１３記載の発明は、請求項１１また
は請求項１２記載の制御装置において、調節部が、前記
制御対象で計測された制御量及び目標制御量に基づき、
仮想操作量を算出する仮想操作量算出部と、前記制御対
象で計測された制御量及び仮想操作量に基づき、前記応
答時間後の制御量を予測する制御量予測部と、前記制御
量予測部により予測された応答時間後の予測制御量及び
目標制御量に基づき、仮想操作量を修正する操作量修正
部と、前記制御量予測部による制御量の予測と前記操作
量修正部による仮想操作量の修正を少なくとも１回行っ
て得られた仮想操作量を調節信号として前記操作部に入
力する手段とを具備する。According to a thirteenth aspect of the present invention, in the control device according to the eleventh aspect or the twelfth aspect, the adjusting unit is based on the control amount and the target control amount measured by the control target,
A virtual operation amount calculation unit that calculates a virtual operation amount, a control amount prediction unit that predicts the control amount after the response time based on the control amount and the virtual operation amount measured in the control target, and the control amount prediction unit Based on the predicted control amount and the target control amount after the response time predicted by the operation amount correction unit that corrects the virtual operation amount, the control amount prediction by the control amount prediction unit, and the virtual operation amount by the operation amount correction unit. Of the virtual operation amount obtained by performing the correction of 1. at least once, and inputting the virtual operation amount to the operation unit as an adjustment signal.

【００１７】請求項１４記載の発明は、請求項１３記載
の制御装置において、前記制御量予測部が神経回路モデ
ルから構成され、前記操作量修正部の計算が神経回路モ
デルの誤差逆伝播計算により実現され、前記学習部は、
前記応答時間と前記伝送むだ時間と前記観測データ記憶
部に記憶された観測データとに基づいて、学習データを
作成し、前記制御量予測部を構成する神経回路モデルの
学習を行う。According to a fourteenth aspect of the present invention, in the control device according to the thirteenth aspect, the control amount predicting unit is composed of a neural circuit model, and the calculation of the manipulated variable correcting unit is performed by an error back propagation calculation of the neural circuit model. Realized, the learning unit
Learning data is created based on the response time, the transmission dead time, and the observation data stored in the observation data storage unit, and the neural circuit model that constitutes the control amount prediction unit is learned.

【００１８】請求項１５記載の発明は、請求項１４記載
の制御装置において、前記学習部は、前記観測データ記
憶部に記憶された観測データに基づいて、学習データを
作成する際に、前記伝送むだ時間前の制御量と当該時刻
の操作量とを有する観測データを入力信号とし、前記応
答時間後の制御量を教師信号にして学習を行う。According to a fifteenth aspect of the present invention, in the control device according to the fourteenth aspect, when the learning section creates the learning data based on the observation data stored in the observation data storage section, the transmission is performed. Learning is performed by using the observation data having the control amount before the dead time and the operation amount at the time as the input signal and the control amount after the response time as the teacher signal.

【００１９】請求項１６記載の発明は、制御対象に入力
される操作量を変化させることで制御対象の制御量を制
御する制御装置において、前記制御対象で計測された制
御量及び目標制御量に基づき、調節信号を算出する調節
部と、前記調節部で算出された調節信号に基づき、前記
制御対象に操作量を出力する操作部と、過去に観測され
た制御量及び操作量を有する観測データを記憶する観測
データ記憶部と、前記操作部から前記制御対象へ操作量
を入力してから当該操作量による影響が制御量に現れる
までの制御対象の応答時間と前記制御対象で計測された
制御量が前記伝送部を通して調節部に入力されるまでの
伝送むだ時間と前記調節部で算出された調節信号が当該
操作部を通して操作量として実現されるまでの操作遅れ
時間と前記観測データ記憶部に記憶された観測データと
に基づいて、前記調節部を学習する学習部とを具備す
る。According to a sixteenth aspect of the present invention, in a control device for controlling the controlled variable of the controlled object by changing the manipulated variable input to the controlled object, the controlled variable and the target controlled variable measured by the controlled object are controlled. Based on the adjustment signal calculated by the adjustment unit, based on the adjustment signal calculated by the adjustment unit, the operation unit for outputting the operation amount to the control target, the observation data having the control amount and the operation amount observed in the past And an observation data storage unit that stores the response time of the control target from the input of the operation amount from the operation unit to the control target until the influence of the operation amount appears on the control amount and the control measured by the control target. Transmission delay time until the amount is input to the adjustment unit through the transmission unit, the operation delay time until the adjustment signal calculated by the adjustment unit is realized as the operation amount through the operation unit, and the observation data Based on the stored observation data in the data storage unit comprises a learning unit for learning the adjustment portion.

【００２０】請求項１７記載の発明は、制御対象に入力
される操作量を変化させることで制御対象の制御量を制
御する制御装置において、前記制御対象で計測された制
御量及び目標制御量に基づき、調節信号を算出する調節
部と、前記調節部で算出された調節信号に基づき、前記
制御対象に操作量を出力する操作部と、過去に観測され
た制御量及び操作量を有する観測データを記憶する観測
データ記憶部と、前記調節部で算出された調節信号と前
記操作部で出力された操作量に基づき、前記調節部で算
出された調節信号が当該操作部を通して操作量として実
現されるまでの操作遅れ時間を計測する操作遅れ時間計
測部と、前記制御対象で計測された制御量と前記伝送部
を通して調節部に入力された制御量とに基づき、前記伝
送部による伝送むだ時間を計測する伝送むだ時間計測部
と、前記操作遅れ時間計測部により計測された操作送れ
時間と前記伝送むだ時間計測部により計測された伝送む
だ時間と前記操作部から前記制御対象へ操作量を入力し
てから当該操作量による影響が制御量に現れるまでの制
御対象の応答時間と前記観測データ記憶部に記憶された
観測データとに基づいて、前記調節部を学習する学習部
とを具備する。According to a seventeenth aspect of the present invention, in a control device for controlling the controlled variable of the controlled object by changing the manipulated variable input to the controlled object, the controlled variable and the target controlled variable measured by the controlled object are set. Based on the adjustment signal calculated by the adjustment unit, based on the adjustment signal calculated by the adjustment unit, the operation unit for outputting the operation amount to the control target, the observation data having the control amount and the operation amount observed in the past Based on the observation data storage unit that stores the adjustment signal calculated by the adjustment unit and the operation amount output by the operation unit, the adjustment signal calculated by the adjustment unit is realized as the operation amount through the operation unit. Based on the control delay measured by the control target measured by the control target and the control delay measured by the control unit, the transmission delay by the transmission unit. A transmission dead time measuring unit that measures time, an operation delay time measured by the operation delay time measuring unit, a transmission dead time measured by the transmission dead time measuring unit, and an operation amount from the operation unit to the controlled object. A learning unit that learns the adjustment unit based on the response time of the controlled object from the input to the influence of the operation amount on the control amount and the observation data stored in the observation data storage unit. .

【００２１】請求項１８記載の発明は、請求項１６また
は請求項１７記載の制御装置において、調節部が、前記
制御対象で計測された制御量及び目標制御量に基づき、
仮想操作量を算出する仮想操作量算出部と、前記制御対
象で計測された制御量及び仮想操作量に基づき、前記応
答時間後の制御量を予測する制御量予測部と、前記制御
量予測部により予測された応答時間後の予測制御量及び
目標制御量に基づき、仮想操作量を修正する操作量修正
部と、前記制御量予測部による制御量の予測と前記操作
量修正部による仮想操作量の修正を少なくとも１回行っ
て得られた仮想操作量を調節信号として前記操作部に入
力する手段とを具備する。According to an eighteenth aspect of the present invention, in the control device according to the sixteenth aspect or the seventeenth aspect, the adjusting unit is based on the control amount and the target control amount measured by the control target,
A virtual operation amount calculation unit that calculates a virtual operation amount, a control amount prediction unit that predicts the control amount after the response time based on the control amount and the virtual operation amount measured in the control target, and the control amount prediction unit Based on the predicted control amount and the target control amount after the response time predicted by the operation amount correction unit that corrects the virtual operation amount, the control amount prediction by the control amount prediction unit, and the virtual operation amount by the operation amount correction unit. Of the virtual operation amount obtained by performing the correction of 1. at least once, and inputting the virtual operation amount to the operation unit as an adjustment signal.

【００２２】請求項１９記載の発明は、請求項１８記載
の制御装置において、前記制御量予測部が神経回路モデ
ルから構成され、前記操作量修正部の計算が神経回路モ
デルの誤差逆伝播計算により実現され、前記学習部は、
前記応答時間と前記操作遅れ時間と前記伝送むだ時間と
前記観測データ記憶部に記憶された観測データとに基づ
いて、学習データを作成し、前記制御量予測部を構成す
る神経回路モデルの学習を行う。According to a nineteenth aspect of the present invention, in the control device according to the eighteenth aspect, the control amount predicting unit is composed of a neural circuit model, and the calculation of the operation amount correcting unit is performed by an error back propagation calculation of the neural circuit model. Realized, the learning unit
Based on the response time, the operation delay time, the transmission dead time, and the observation data stored in the observation data storage unit, learning data is created to learn the neural circuit model that constitutes the control amount prediction unit. To do.

【００２３】請求項２０記載の発明は、請求項１９記載
の制御装置において、前記学習部は、前記観測データ記
憶部に記憶された観測データに基づいて、学習データを
作成する際に、前記伝送むだ時間前の制御量と前記操作
遅れ時間後の操作量とを有する観測データを入力信号と
し、前記応答時間後の制御量を教師信号にして学習を行
う。According to a twentieth aspect of the present invention, in the control device according to the nineteenth aspect, the learning section transmits the learning data based on the observation data stored in the observation data storage section. Learning is performed by using the observation data having the control amount before the dead time and the operation amount after the operation delay time as an input signal and the control amount after the response time as a teacher signal.

【００２４】[0024]

【発明の実施の形態】図１は本発明の原理を説明するた
めのブロック図であり、時刻ｎΔｔの信号の流れを表し
たものである。DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 is a block diagram for explaining the principle of the present invention and shows a signal flow at time nΔt.

【００２５】この制御装置は、図１の太線で示すように
制御対象１と調節部２と操作部３とでフィードバック制
御のループを構成する。また、学習部４と観測データ記
憶部５は、調節部２のパラメータを学習によって調節す
る。In this control device, as shown by the thick line in FIG. 1, the controlled object 1, the adjusting section 2 and the operating section 3 constitute a feedback control loop. The learning unit 4 and the observation data storage unit 5 adjust the parameters of the adjustment unit 2 by learning.

【００２６】制御対象１には単位時間Δｔ毎に操作量が
入力され、制御量が計測される。制御対象１で計測され
た制御量Ｘ_ｎは調節部２に入力される。調節部２は制御
対象１で計測された制御量Χ_ｎと予め設定された目標制
御量Χ^ｄに基づき、調節信号Ｃ_ｎを算出する。操作部３
では調節部２で算出された調節信号Ｃ_ｎに基づき操作量
Ｕ_ｎが出力される。制御対象１には操作部３が出力した
操作量Ｕ_ｎが入力され、その操作量Ｕ_ｎが変化すること
で制御される。記号の添字のｎは時刻を示し、例えばＸ
_ｎは時刻（ｎ−１）ΔｔからｎΔｔの間に計測された制
御量を意味するが、この説明では簡単に時刻ｎΔｔの制
御量ということにする。以後も同様の意味で添字を使用
する。A manipulated variable is input to the controlled object 1 every unit time Δt, and the controlled variable is measured. The controlled variable X _n measured by the controlled object 1 is input to the adjustment unit 2. The adjusting unit 2 calculates the adjusting signal C _n based on the control amount Χ _n measured for the controlled object 1 and the preset target control amount Χ ^d . Operation part 3
Then, the manipulated variable U _n is output based on the adjustment signal C _n calculated by the adjustment unit 2. The operation amount U _n output by the operation unit 3 is input to the controlled object 1 and is controlled by changing the operation amount U _n . The subscript n of the symbol indicates the time, for example, X
_n means the control amount measured from the time (n-1) [Delta] t to n [Delta] t, but in this description, it is simply referred to as the control amount at the time n [Delta] t. Hereinafter, the subscript will be used with the same meaning.

【００２７】学習部４は制御対象１に操作量が入力され
てからその操作量による影響が制御量に現れるまでの応
答時間ｐΔｔと、観測データ記憶部５に記憶された過去
ｌ単位時間に計測された制御量（Ｘ_n-l，Ｘ_n-l+1，・
・・，Ｘ_n-1）と制御対象１に入力された操作量（Ｕ
_n-l，Ｕ_n-l+1，・・・，Ｕ_n-1）（ｌは正の整数）か
らなる観測データに基づいて、調節部２のパラメータＷ
を学習によって調節し、調節部２へ出力する。これによ
り、本発明では、制御量に影響を与える制御対象１の特
性パラメータの緩やかな変化に対して、調節部２が適応
できる。上記応答時間ｐΔｔは、制御対象１に操作量が
入力されてからその操作量による影響が制御量に現れる
までの時間を予め計測あるいは算出して設定しておいて
もよいし、後述するように制御対象１の変動に応じて動
的に設定するようにしてもよい。The learning unit 4 measures the response time pΔt from the input of the operation amount to the controlled object 1 to the influence of the operation amount on the control amount, and the last 1 unit time stored in the observation data storage unit 5. The controlled _variable (X _nl , X _{n-l + 1} , _...
.., X _n-1 ) and the manipulated variable (U
_nl , U _{n-l + 1} , ..., U _n-1 ) (l is a positive integer) based on the observation data, the parameter W of the adjusting unit 2
Is adjusted by learning and output to the adjusting unit 2. As a result, in the present invention, the adjusting unit 2 can adapt to a gradual change in the characteristic parameter of the controlled object 1 that affects the controlled variable. The response time pΔt may be set by previously measuring or calculating the time from the input of the operation amount to the controlled object 1 until the influence of the operation amount appears in the control amount, as will be described later. You may make it set dynamically according to the change of the controlled object 1.

【００２８】図２は調節部２の構成の一例を示す図であ
る。FIG. 2 is a diagram showing an example of the configuration of the adjusting section 2.

【００２９】図２に示すように、仮想操作量算出部６は
制御対象１で計測された制御量Ｘ_ｎ及び目標制御量Ｘ^ｄ
に基づき、仮想操作量Ｕ（＾）_ｎを算出する。制御量予
測部７は仮想操作量Ｕ（＾）_ｎと制御対象１で計測され
た制御量Χ_ｎに基づき、仮想操作量Ｕ（＾）_ｎが調節信
号として出力された場合のｐΔｔ時間後の予測制御量Ｘ
（〜）_n+pを算出する。ｐΔｔは制御対象１の応答時間
である。操作量修正部８は、制御量予測部７で予測され
たｐΔｔ時間後の予測制御量Ｘ（〜）_n+p及び目標制御
量Ｘ^ｄに基づき、予測制御量Ｘ（〜）_n+pが目標制御量
Ｘ^ｄに近付くように仮想操作量Ｕ（＾）_ｎを修正する。
第１スイッチ９は制御対象１で制御量が計測されると下
側に閉じ、仮想操作量算出部６で算出された仮想操作量
Ｕ（＾）_ｎを制御量予測部７に入力し、その後、第１ス
イッチ９は次の計測時まで上側に閉じて、操作量修正部
８で修正された仮想操作量Ｕ（＾）_ｎを制御量予測部７
の入力にし、所定の回数、あるいは予測制御量Ｘ（〜）
_n+pと目標制御量Χ^ｄの差が所定の値以下になるまで、
制御量予測部７による制御量の予測と操作量修正部８に
よる操作量の修正が繰り返される。制御量予測部７によ
る制御量の予測と操作量修正部８による操作量の修正が
所定の回数まで、あるいは予測制御量Ｘ（〜）_n+pと目
標制御量Ｘ^ｄの差が所定の値以下になるまで繰り返され
ると、第２スイッチ１０が閉じ、仮想操作量Ｕ（＾）_ｎ
が調節信号Ｃ_ｎとして操作部３へ出力される。これによ
り、本発明では、制御量Ｘ_ｎに影響を与える制御対象１
の状態が急激に変化しても、操作量Ｕ_ｎを正確に修正
し、制御量Ｘ_ｎを目標制御量Ｘ^ｄの近くに制御すること
ができる。As shown in FIG. 2, the virtual manipulated variable calculator 6 controls the controlled variable X _n and the target controlled variable X ^{d for the} controlled object 1.
The virtual operation amount U (^) _n is calculated based on Based on the virtual operation amount U (^) _n and the control amount Χ _n measured by the controlled object 1, the control amount predicting unit 7 outputs pΔt time after the virtual operation amount U (^) _n is output as the adjustment signal. Predicted controlled variable X
(~) Calculate _{n + p} . pΔt is the response time of the controlled object 1. Manipulated variable correcting section 8, based on the control amount predicting control amount X of pΔt time after that is predicted by the prediction unit 7 (~) n + _p and the target control amount X ^d, the predicted controlled variable X (~) n + _p The virtual manipulated variable U (^) _n is corrected so as to approach the target controlled variable X ^d .
The first switch 9 closes downward when the controlled variable is measured by the controlled object 1, inputs the virtual manipulated variable U (^) _n calculated by the virtual manipulated variable calculating unit 6 into the controlled variable predicting unit 7, and thereafter. , The first switch 9 is closed to the upper side until the next measurement, and the virtual operation amount U (^) _n corrected by the operation amount correction unit 8 is set to the control amount prediction unit 7
For a predetermined number of times or the predicted controlled variable X (~)
until the difference _{n + p} and the target control amount chi ^d is below the predetermined value,
Prediction of the control amount by the control amount prediction unit 7 and correction of the operation amount by the operation amount correction unit 8 are repeated. The control amount prediction unit 7 predicts the control amount and the operation amount correction unit 8 corrects the operation amount up to a predetermined number of times, or the difference between the predicted control amount X (-) _{n + p} and the target control amount ^Xd is a predetermined value. When it is repeated until the following, the second switch 10 is closed and the virtual operation amount U (^) _n
Is output to the operation unit 3 as the adjustment signal C _n . Accordingly, in the present invention, the controlled object 1 that affects the controlled variable X _n
Even if the state of is suddenly changed, the manipulated variable U _n can be corrected accurately and the controlled variable X _n can be controlled to be close to the target controlled variable X ^d .

【００３０】図３は制御量予測部７の構成の一例を示す
図である。ここでは、制御量予測部７は神経回路モデル
から構成されている。図３は時刻ｎΔｔの仮想操作量Ｕ
（＾）_ｎ＝（Ｕ（＾）１_ｎ，Ｕ（＾）２_ｎ，・・・，Ｕ
（＾）ｇ_ｎ）^Ｔ（ｇは操作量の次元数を表す）と制御量
Χ_ｎ＝（Χ１_ｎ，Χ２_ｎ，・・・，Χｈ_ｎ）^Ｔ（ｈは制
御量の次元数を表す）が入力されると、ｐΔｔ時間後の
予測制御量Ｘ（〜）_n+ _p＝（Ｘ（〜）１_n+p，Ｘ（〜）
２_n+p，・・・，Χ（〜）ｈ_n+p）^Ｔが出力される神経
回路モデル１１の制御時の予測計算の信号の流れを示し
ている。この入力信号から出力信号を求めるのは神経回
路モデル１１の前向き計算によって行われる。神経回路
モデル１１の前向き計算については後で説明する。FIG. 3 is a diagram showing an example of the configuration of the control amount predicting section 7. Here, the control amount prediction unit 7 is composed of a neural circuit model. FIG. 3 shows a virtual operation amount U at time nΔt
(^) _N = (U (^) 1 _n , U (^) 2 _n , ..., U
(^) G _n ) ^T (g represents the number of dimensions of the manipulated variable) and the controlled variable Χ _n = (Χ ₁ _n , Χ 2 _n , ..., Χ h _n ) ^T (h represents the number of dimensions of the controlled variable) Is input, the predicted control amount X (-) _{n +} _p = (X (-) 1n _{+ p} , X (-) after _{p [} Delta] t time has elapsed.
2 _{n + p} , ..., Χ (∼) h _{n + p} ) ^T is shown, showing the signal flow of the prediction calculation when the neural circuit model 11 is controlled. The output signal is obtained from the input signal by the forward calculation of the neural circuit model 11. The forward calculation of the neural network model 11 will be described later.

【００３１】制御量予測部７が神経回路モデル１１から
構成される場合には、操作量修正部８は神経回路モデル
１１の誤差逆伝播計算によって仮想操作量Ｕ（＾）_ｎの
修正量を求め、仮想操作量を修正する。神経回路モデル
１１の誤差逆伝播計算については前向き計算とともに後
で説明する。When the control amount predicting unit 7 is composed of the neural circuit model 11, the operation amount correcting unit 8 obtains the correction amount of the virtual operation amount U (^) _n by the error back propagation calculation of the neural circuit model 11. , Correct the virtual operation amount. The error back propagation calculation of the neural network model 11 will be described later together with the forward calculation.

【００３２】ここで、制御量予測部７が神経回路モデル
１１から構成される場合には、学習部４は以下のように
構成される。Here, when the control amount predicting unit 7 is composed of the neural circuit model 11, the learning unit 4 is structured as follows.

【００３３】すなわち、学習部４は、制御対象１に対す
る操作量の影響が制御量に現れるまでの応答時間ｐΔｔ
と、観測データ記憶部５に記憶された過去の制御量及び
操作量からなる観測データとに基づいて学習データを作
成し、制御量予測部７を構成する神経回路モデル１１の
学習を行う。神経回路モデル１１が図３に示したように
現時刻の制御量と仮想操作量を入力すると応答時間ｐΔ
ｔ後の制御量を出力する構成の場合、入力信号（Ｕ_ｓ，
Ｘ_ｓ）、教師信号（Ｘ_s+p）（ｓ＝ｎ−ｌ，ｎ−ｌ＋
１，・・・，ｎ−ｐ−１）の学習データにより学習を行
う。That is, the learning section 4 has a response time pΔt until the influence of the manipulated variable on the controlled object 1 appears in the controlled variable.
And learning data are created based on the observation data including the past control amount and the manipulated variable stored in the observation data storage unit 5, and the neural circuit model 11 that constitutes the control amount prediction unit 7 is learned. When the neural network model 11 inputs the control amount and the virtual operation amount at the current time as shown in FIG. 3, the response time pΔ
In the case of the configuration that outputs the controlled variable after t, the input signal (U _s ,
X _s), a teacher signal _{(X s + p) (s} = n-l, n-l +
, ..., Np-1) learning data.

【００３４】次に、神経回路モデルの前向き計算、誤差
逆伝播計算、学習計算について説明する。Next, the forward calculation, the back propagation calculation, and the learning calculation of the neural network model will be described.

【００３５】前向き計算とは神経回路モデルに入力信号
が与えられた時に、出力信号を求める計算である。ここ
では図３に示した３層型神経回路モデルに基づいて前向
き計算を説明する。図中の円がユニットと呼ばれる神経
細胞モデルで、ユニット間を結び付けている線分はユニ
ット間の信号を伝達する結合である。ユニットは左から
入力層、中間層、出力層の３層に分かれて並べられてい
る。入力層のユニットと中間層のユニットの間、中間層
のユニットと出力層のユニットの間に結合が張られてい
る。ユニット間の結合は信号伝達に方向性を持ち、前向
き計算では入力層ユニットから中間層ユニットヘ、中間
層ユニットから出力層ユニットへの一方向にだけ信号は
伝達される。The forward calculation is a calculation for obtaining an output signal when an input signal is given to the neural network model. Here, the forward calculation will be described based on the three-layer neural circuit model shown in FIG. The circles in the figure are nerve cell models called units, and the line segments that connect the units are the connections that transmit signals between the units. The units are arranged in three layers, from the left, an input layer, an intermediate layer, and an output layer. Couplings are provided between the units of the input layer and the units of the intermediate layer, and between the units of the intermediate layer and the units of the output layer. The coupling between the units has a directivity in signal transmission, and in the forward calculation, the signal is transmitted only in one direction from the input layer unit to the intermediate layer unit and from the intermediate layer unit to the output layer unit.

【００３６】入力層のユニットには、外部からの入力信
号が１ユニットに１信号ずつ入力される。図３では、仮
想操作量Ｕ（＾）_ｎ＝（Ｕ（＾）１_ｎ，Ｕ（＾）２_ｎ，
…，Ｕ（＾）ｇ_ｎ）^Ｔ、制御量Ｘ_ｎ＝（Ｘ１_ｎ，Ｘ
２_ｎ，…，Ｘｈ_ｎ）^Ｔが１ユニットに１信号ずつ入力さ
れている。ｘｉ_ｉを入力層第ｉ番ユニットの入力値とす
ると、Input signals from the outside are input to the input layer units one by one. In FIG. 3, the virtual operation amount U (^) _n = (U (^) 1 _n , U (^) 2 _n ,
, U (^) g _n ) ^T , controlled variable X _n = (X1 _n , X
2 _n , ..., Xh _n ) ^T are input to each unit, one signal at a time. If xi _i is the input value of the input layer i-th unit,

【数１】入力層ユニットの入出力関数は恒等関数とする。入力層
ユニットの出力値ｙｉ_ｉ（ｉ＝１，２，…，Νｉ）は次
のように表される。(Equation 1) The input / output function of the input layer unit is the identity function. The output value yi _i (i = 1, 2, ..., Νi) of the input layer unit is expressed as follows.

【００３７】[0037]

【数２】 Νｉは入力層ユニット数である（Νｉ＝ｇ＋ｈ）。(Equation 2) Νi is the number of input layer units (Νi = g + h).

【００３８】次に、中間層ユニットの入力値は入力層ユ
ニットの出力値を入力層一中間層間の結合荷重値で重み
付けした合計から中間層ユニットの閾値を引いた値にな
る。Next, the input value of the intermediate layer unit is a value obtained by subtracting the threshold value of the intermediate layer unit from the sum of the output value of the input layer unit weighted by the coupling weight value between the input layer and the intermediate layer.

【数３】ｘｈ_ｊは中間層第ｊ番ユニットの入力値、ｗ１_j,iは入
力層第ｉ番ユニット−中間層第ｊ番ユニット間の結合荷
重値、ｔｈｈ_ｊは中間層第ｊ番ユニットの閾値、Ｎｈは
中間層のユニット数を表す。(Equation 3) xh _j is the input value of the middle layer jth unit, w1 _{j, i} is the combined load value between the input layer ith unit and the middle layer jth unit, thh _j is the threshold value of the middle layer jth unit, Nh Represents the number of units in the intermediate layer.

【００３９】中間層ユニットの入出力関数をシグモイド
型関数The input / output function of the middle layer unit is a sigmoid type function.

【数４】とすると、中間層第ｊ番ユニットの出力値ｙｈ_ｊは入力
値ｘｈ_ｊ，を使い、(Equation 4) Then, the output value yh _j of the j-th unit in the middle layer uses the input value xh _j ,

【数５】と表せる。(Equation 5) Can be expressed as

【００４０】出力層ユニットの入力値は中間層ユニット
の出力値を中間層一出力層間の結合荷重値で重み付けし
た合計から出力層ユニットの閾値を引いた値になる。The input value of the output layer unit is a value obtained by subtracting the threshold value of the output layer unit from the sum of the output value of the intermediate layer unit weighted by the coupling weight value between the intermediate layer and the output layer.

【００４１】[0041]

【数６】ｘｏ_ｋは出力層第ｋ番ユニットの入力値、ｗ２_k,jは中
間層第ｊ番ユニット−出力層第ｋ番ユニット間の結合荷
重値、ｔｈｏ_ｋは出力層第ｋ番ユニットの閾値、Ｎｏは
出力層のユニット数を表す。出力層ユニットの入出力関
数は恒等関数とする。(Equation 6) xo _k input value of the k-th unit output layer, w2 k, _j intermediate layer j-th unit - bond load value between the output layer the k-th unit, tho _k is a threshold of the k-th unit output layer, No Represents the number of units in the output layer. The input / output function of the output layer unit is an identity function.

【００４２】[0042]

【数７】この出力層ユニットの出力信号が外部に出力される。図
３では時刻（ｎ＋ｐ）Δｔの制御量の予測値Ｘ（〜）
_n+p＝（Ｘ１（〜）_n+p，Ｘ（〜）２_n+p…，Ｘ（〜）
ｈ_n+p）^Ｔになる。(Equation 7) The output signal of this output layer unit is output to the outside. In FIG. 3, the predicted value X (-) of the control amount at time (n + p) Δt
_{n + p} = (X1 (~) _{n + p} , X (~) _{2n + p} ..., X (~)
h _{n + p} ) ^T.

【００４３】[0043]

【数８】以上が神経回路モデル１１の前向き計算の計算式であ
る。(Equation 8) The above is the calculation formula for the forward calculation of the neural circuit model 11.

【００４４】次に、神経回路モデルの誤差逆伝播計算に
ついて説明する。Next, the error back propagation calculation of the neural network model will be described.

【００４５】制御量予測部７によって予測された制御量
Ｘ（〜）_n+pと、目標制御量Ｘ^ｄが操作量修正部８に入
力されると、操作量修正部８では、仮想操作量Ｕ（＾）
_ｎの修正量が計算され、仮想操作量Ｕ（＾）_ｎが修正さ
れる。制御量予測部７が神経回路モデル１１から構成さ
れる場合には、仮想操作量Ｕ（＾）_ｎの修正量は神経回
路モデル１１の誤差逆伝播計算によって計算できる。When the control amount X (-) _{n + p} predicted by the control amount predicting unit 7 and the target control amount ^Xd are input to the operation amount correcting unit 8, the operation amount correcting unit 8 calculates the virtual operation amount. U (^)
The correction amount of _n is calculated, and the virtual operation amount U (^) _n is corrected. When the control amount prediction unit 7 is composed of the neural circuit model 11, the correction amount of the virtual operation amount U (^) _n can be calculated by the error back propagation calculation of the neural circuit model 11.

【００４６】神経回路モデルの出力信号と望ましい出力
信号から誤差関数を定義する。図３の場合には、制御量
予測部７の神経回路モデル１１の前向き計算によって計
算された予測制御量Ｘ（〜）_n+pと、目標制御量Ｘ
（〜）^ｄ＝（Ｘｌ^ｄ，Ｘ２^ｄ，…，Ｘｈ^ｄ）^Ｔとの誤差
から誤差関数Ｅを次のように定義する。An error function is defined from the output signal of the neural network model and the desired output signal. In the case of FIG. 3, the predicted control amount X (-) _{n + p} calculated by the forward calculation of the neural circuit model 11 of the control amount prediction unit 7 and the target control amount X
^{^{^{(~) D = (Xl d}}} , X2 d, ..., Xh d) define the error between ^T error function E as follows.

【００４７】[0047]

【数９】次に、この誤差関数を減少させるユニット出力値の修正
量を、出力層から入力層へ層毎に順に求めていき、最終
的には入力信号の修正量を求める。まず、出力層ユニッ
トの出力値の修正量は、正の定数ηを使って(Equation 9) Next, the correction amount of the unit output value that reduces this error function is sequentially obtained for each layer from the output layer to the input layer, and finally the correction amount of the input signal is obtained. First, the correction amount of the output value of the output layer unit is calculated by using the positive constant η.

【数１０】と計算される。(Equation 10) Is calculated.

【００４８】次に、中間層ユニットの出力値の修正量
は、出力層ユニットの修正量を使って次のように表され
る。Next, the correction amount of the output value of the intermediate layer unit is expressed as follows using the correction amount of the output layer unit.

【００４９】[0049]

【数１１】同様に入力層ユニットの出力値の修正量は、中間層ユニ
ットの修正量を使って、[Equation 11] Similarly, the correction amount of the output value of the input layer unit, using the correction amount of the middle layer unit,

【数１２】と表される。ｆ′（ｘ）はシグモイド関数ｆ（ｘ）の導
関数を表していて、具体的には、(Equation 12) It is expressed as f ′ (x) represents the derivative of the sigmoid function f (x), and specifically,

【数１３】と表せる。入力層ユニットの入出力関数は恒等関数であ
るから、出力値の修正量はそのまま入力信号の修正量に
なる。(Equation 13) Can be expressed as Since the input / output function of the input layer unit is the identity function, the correction amount of the output value becomes the correction amount of the input signal as it is.

【００５０】神経回路モデルが図３の場合には、入力信
号である仮想操作量Ｕ（＾）_ｎの修正量ΔＵ_ｎは、When the neural circuit model is as shown in FIG. 3, the correction amount ΔU _n of the virtual operation amount U (^) _n which is the input signal is

【数１４】と表現される。この修正量が仮想操作量Ｕ（＾）_ｎに加
算される。[Equation 14] Is expressed as This correction amount is added to the virtual operation amount U (^) _n .

【００５１】以上が誤差逆伝播計算と、この計算によっ
て実現される仮想操作量の修正である。The above is the error backpropagation calculation and the correction of the virtual operation amount realized by this calculation.

【００５２】次に、神経回路モデルの学習計算について
図４を基に説明する。Next, the learning calculation of the neural circuit model will be described with reference to FIG.

【００５３】図４において、５は観測データ記憶部、１
１は神経回路モデル、１２は減算器、１３は誤差逆伝播
学習器、１４は神経回路モデル１１の入力層ユニットの
出力が誤差逆伝播学習器１３に入力されることを示す矢
印、１５は神経回路モデル１１の中間層ユニットの出力
が誤差逆伝播学習器１３に入力されることを示す矢印、
１６は神経回路モデル１１の中間層一出力層間の結合荷
重値が誤差逆伝播学習器１３に入力されることを示す矢
印、１７は誤差逆伝播学習器１３で計算された入力層一
中間層間の結合荷重値と中間層ユニットの閾値の修正量
を各結合荷重、中間層ユニットに出力し、各結合荷重値
と閾値の修正を行うことを示す矢印、１８は誤差逆伝播
学習器１３で計算された中間層一出力層間の結合荷重値
と出力層ユニットの閾値の修正量を各結合荷重、出力層
ユニットに出力し、各結合荷重値と閾値の修正を行うこ
とを示す矢印を表す。In FIG. 4, 5 is an observation data storage unit, 1
1 is a neural network model, 12 is a subtractor, 13 is an error backpropagation learning device, 14 is an arrow indicating that the output of the input layer unit of the neural circuit model 11 is input to the error backpropagation learning device 13, and 15 is a neural network. An arrow indicating that the output of the intermediate layer unit of the circuit model 11 is input to the error backpropagation learning device 13,
Reference numeral 16 denotes an arrow indicating that the coupling weight value between the intermediate layer and the output layer of the neural network model 11 is input to the error backpropagation learning device 13, and 17 indicates between the input layer and the intermediate layer calculated by the error backpropagation learning device 13. An arrow indicating that the combined weight value and the correction amount of the threshold value of the intermediate layer unit are output to each combined load and intermediate layer unit and the combined load value and the threshold value are corrected, 18 is calculated by the error backpropagation learning device 13. An arrow indicating that the combined weight value between the intermediate layer and the output layer and the correction amount of the threshold value of the output layer unit are output to each combined load and output layer unit, and each combined weight value and the threshold value are corrected is shown.

【００５４】過去の時刻ｓΔｔ（ｓ＝ｎ−ｌ，ｎ−ｌ＋
１，…，ｎ−ｐ−１）について、時刻ｓΔｔの操作量Ｕ
_ｓと制御量Ｘ_ｓが入力信号として、観測データ記憶部５
から神経回路モデル１１に出力される。神経回路モデル
１１では時刻ｓΔｔの操作量Ｕ_ｓと制御量Ｘ_ｓを入力信
号として前向き計算を行い、出力信号Ｘ（〜）_s+pを出
力する。減算器１２では観測データ記憶部５から教師信
号として出力された応答時間後の制御量Χ_s+pと神経回
路モデル１１の出力信号Ｘ（〜）_s+pとの差分Ｘ_s+p−
Χ（〜）_s+pが計算され、誤差逆伝播学習器１３に入力
される。誤差逆伝播学習器１３には差分の他に、神経回
路モデルの入力層ユニットの各出力値、中間層ユニット
の各出力値、中間層一出力層間の各結合荷重値が入力さ
れ、結合荷重値、閾値の修正量が計算される。Past time sΔt (s = n−1, n−1 +
1, ..., Np-1), the manipulated variable U at time sΔt
_s and the controlled variable X _s as input signals, the observation data storage unit 5
Is output to the neural circuit model 11. In the neural circuit model 11, forward calculation is performed using the manipulated variable U _s and the controlled variable X _s at time sΔt as input signals, and the output signal X (-) _{s + p} is output. In the subtractor 12, the difference X _{s + p} − between the control amount Χ _{s + p} after the response time output as the teacher signal from the observation data storage unit 5 and the output signal X (-) _{s + p} of the neural circuit model 11
Χ (∼) _{s + p} is calculated and input to the error backpropagation learning device 13. In addition to the difference, each output value of the input layer unit of the neural network model, each output value of the intermediate layer unit, and each coupling weight value between the intermediate layer and one output layer are input to the error back-propagation learning device 13, and the coupling weight value is input. , The threshold modification amount is calculated.

【００５５】誤差逆伝播学習器１３では、教師信号Ｘ
_s+pと神経回路モデル１１の出力信号Ｘ_s+pとの差分か
ら誤差関数Ｅが次のように定義される。In the error backpropagation learning device 13, the teacher signal X
_The error function E is defined as follows from the difference between _{s + p} and the output signal X _{s + p} of the neural circuit model 11.

【００５６】[0056]

【数１５】誤差逆伝播学習法に従つて、神経回路モデル１１の結合
荷重値、閾値の修正量は次のように計算される。(Equation 15) According to the error backpropagation learning method, the connection weight value of the neural network model 11 and the correction amount of the threshold value are calculated as follows.

【００５７】[0057]

【数１６】 Δｗ２_k,jは中間層第ｊ番ユニット−出力層第ｋ番ユニ
ット間の結合荷重値ｗ２_k,jの修正量、Δｔｈｏ_ｋは出
力層第ｋ番ユニットの閾値ｔｈｏ_ｋの修正量、Δｗ１
_j,iは入力層第ｉ番ユニット−中間層第ｊ番ユニット間
の結合荷重値ｗ１_j,iの修正量、Δｔｈｈ_ｊは中間層第
ｊ番ユニットの閾値ｔｈｈ_ｊの修正量、εは学習定数で
ある。ｙｏ^ｓ _ｋ時刻ｓΔｔのデータが入力された時の出
力層第ｋ番ユニットの出力値、ｘｏ^ｓ _ｋ時刻ｓΔｔのデ
ータが入力された時の出力層第ｋ番ユニットの入力値、
ｙｈ^ｓ _ｊは時刻ｓΔｔのデータが入力された時の中間層
第ｊ番ユニットの出力値、ｘｈ^ｓ _ｋは時刻ｓΔｔのデー
タが入力された時の中間層第ｊ番ユニットの入力値、ｙ
ｉ^ｓ _ｉは時刻ｓΔｔのデータが入力された時の中間層第
ｉ番ユニットの出力値である。これらの修正量を基に、
結合荷重値、閾値が修正される。(Equation 16) Derutadaburyu2 k, _j is the intermediate layer j-th unit - correction amount of the coupling weight value w2 _{k, j} between the output layer the k-th unit, Derutatho _k is the correction amount of the threshold tho _k of the k-th unit output layer, .DELTA.W1
_{j, i} is the correction amount of the coupling load value w1 _{j, i} between the input layer i-th unit and the intermediate layer j-th unit, Δthh _j is the correction amount of the threshold thh _j of the intermediate layer j-th unit, and ε is the learning value. It is a constant. yo ^s _k time output layer output values of the k-th unit when data is input in Esuderutati, the input values of the output layer the k th unit when data xo ^s _k time Esuderutati is input,
yh ^s _j is the output value of the middle-layer j-th unit when the data at time sΔt is input, and xh ^s _k is the input value of the middle-layer j-th unit when the data at time sΔt is input, y
i ^s _i is the output value of the i-th unit in the middle layer when the data at time sΔt is input. Based on these corrections
The connection weight value and the threshold value are modified.

【００５８】次に、本発明の他の実施の形態を図５に基
づき説明する。Next, another embodiment of the present invention will be described with reference to FIG.

【００５９】図５に示す制御装置では、図１に示した制
御装置に応答時間算出部１９が付加されている。応答時
間算出部１９は、制御対象１の応答時間を算出する。応
答時間算出部１９は過去ｍ単位時間に制御対象１に入力
された操作量の時系列Ｕ＝（Ｕ_n-m，Ｕ_n-m+1，…，Ｕ
_n-1）と制御対象１で計測された制御量の時系列Ｘ＝
（Ｘ_n-m，Ｘ_n-m+1，…，Ｘ_n-1）に基づき、制御対象
１に入力された操作量による影響が制御量に現れるまで
の制御対象１の応答時間を計算し、学習部４に出力する
（ｍは正の整数である）。In the control device shown in FIG. 5, a response time calculating section 19 is added to the control device shown in FIG. The response time calculation unit 19 calculates the response time of the controlled object 1. The response time calculation unit 19 is a time series U = (U _nm , U _{n-m + 1} , ..., U of the manipulated variables input to the controlled object 1 in the past m unit time.
_n-1 ) and the time series of the controlled variable X =
Based on (X _nm , X _{n-m + 1} , ..., X _n-1 ), the response time of the controlled object 1 until the influence of the manipulated variable input to the controlled object 1 appears in the controlled variable is calculated and learned. Output to the unit 4 (m is a positive integer).

【００６０】次に、本発明の他の実施の形態を図６に基
づき説明する。Next, another embodiment of the present invention will be described with reference to FIG.

【００６１】図６に示す制御装置では、操作部２が時間
遅れを持つことを考慮したものである。同図に示すよう
に、制御対象１で計測された制御量Ｘ_ｎは調節部２に入
力される。調節部２は制御対象１で計測された制御量Χ
_ｎと目標制御量Ｘ^ｄに基づき、調節信号Ｃ_ｎを算出す
る。操作部３では調節部２で算出された調節信号Ｃ_ｎに
基づき操作量Ｕ_ｎが出力されるが、操作部３は操作遅れ
時間を持ち、入力された調節信号は操作遅れ時間遅れて
操作量として出力される。制御対象１には操作部３が出
力した操作量Ｕ_ｎが入力され、その操作量Ｕ_ｎが変化す
ることで制御される。学習部４は制御対象１に操作量
が入力されてからその操作量による影響が制御量に現れ
るまでの応答時間ｐΔｔと、調節部２で算出された調節
信号が操作部３を通して操作量として実現されるまでの
操作遅れ時間ｑΔｔと、観測データ記憶部５に記憶され
た過去ｌ単位時間に計測された制御量（Ｘ_n-l，Ｘ
_n-l+1，…，Ｘ_n-1）と制御対象１に入力された操作量
（Ｕ_n-l，Ｕ_n-l+1，…，Ｕ_n-1）（ｌは正の整数）か
らなる観測データに基づいて、調節部２のパラメータＷ
を学習によって調節し、調節部２へ出力する。上記操作
遅れ時間ｑΔｔは、応答時間ｐΔｔと同様に、調節部２
で算出された調節信号が操作部３を通して操作量として
実現されるまでの時間を予め計測あるいは算出して設定
しておいてもよいし、後述するように調節部２の変動に
応じて動的に設定するようにしてもよい。The control device shown in FIG. 6 takes into consideration that the operating section 2 has a time delay. As shown in the figure, the controlled variable X _n measured by the controlled object 1 is input to the adjustment unit 2. The adjusting unit 2 controls the controlled variable Χ measured by the controlled object 1.
_The adjustment signal C _n is calculated based on _n and the target control amount X ^d . The operation unit 3 outputs the operation amount U _n based on the adjustment signal C _n calculated by the adjustment unit 2. However, the operation unit 3 has an operation delay time, and the input adjustment signal delays the operation delay time and the operation amount is delayed. Is output as. The operation amount U _n output by the operation unit 3 is input to the controlled object 1 and is controlled by changing the operation amount U _n . The learning unit 4 realizes the response time pΔt from when the operation amount is input to the controlled object 1 until the influence of the operation amount appears in the control amount and the adjustment signal calculated by the adjustment unit 2 as the operation amount through the operation unit 3. Operation delay time qΔt until the operation is performed, and the control amount (X _nl , X) measured in the past 1 unit time stored in the observation data storage unit 5.
_{n-l + 1} , ..., X _n-1 ) and the manipulated _variables (U _nl , U _{n-l + 1} , ..., U _n-1 ) input to the controlled object 1 (l is a positive integer) Based on the observation data, the parameter W of the adjusting unit 2
Is adjusted by learning and output to the adjusting unit 2. The operation delay time qΔt is the same as the response time pΔt.
The time until the adjustment signal calculated in step 3 is realized as an operation amount through the operation unit 3 may be measured or calculated in advance, and may be set in accordance with a change in the adjustment unit 2 as described later. It may be set to.

【００６２】図７は図６に示した調節部２の構成の一例
を示す図である。FIG. 7 is a diagram showing an example of the configuration of the adjusting section 2 shown in FIG.

【００６３】図７に示すように、仮想操作量算出部６は
制御対象１で計測された制御量Ｘ_ｎ及び目標制御量Ｘ^ｄ
に基づき、仮想操作量Ｕ（＾）_ｎを算出する。制御量予
測部７は仮想操作量Ｕ（＾）_ｎと制御対象１で計測され
た制御量Ｘ_ｎに基づき、仮想操作量Ｕ（＾）_ｎが調節信
号として出力された場合のｐΔｔ時間後の予測制御量Χ
（〜）_n+pを算出する。ｐΔｔは制御対象１の応答時間
である。操作量修正部８は、制御量予測部７で予測され
たｐΔｔ時間後の制御量Ｘ_n+p及び目標制御量Ｘ^ｄに基
づき、予測制御量Ｘ（〜）_n+pが目標制御量Ｘ^ｄに近付
くように仮想操作量Ｕ（＾）_ｎを修正する。第１スイッ
チ９は制御対象１で制御量が計測されると下側に閉じ、
仮想操作量算出部６で算出された仮想操作量Ｕ（＾）_ｎ
を制御量予測部７に入力し、その後、第１スイッチ９は
次の計測時まで上側に閉じて、操作量修正部８で修正さ
れた仮想操作量Ｕ（＾）_ｎを制御量予測部７の入力に
し、所定の回数、あるいは予測制御量Ｘ（〜）_n+pと目
標制御量Ｘ^ｄの差が所定の値以下になるまで、制御量予
測部７による制御量の予測と操作量修正部８による操作
量の修正が繰り返される。制御量予測部７による制御量
の予測と操作量修正部８による操作量の修正が所定の回
数まで、あるいは予測制御量Χ（〜）_n+pと目標制御量
Ｘ^ｄの差が所定の値以下になるまで繰り返されると、第
２スイッチ１０が閉じ、仮想操作量Ｕ（＾）_ｎが調節信
号Ｃ_ｎとして操作部３へ出力される。As shown in FIG. 7, the virtual manipulated variable calculator 6 controls the controlled variable X _n and the target controlled variable X ^d measured for the controlled object 1.
The virtual operation amount U (^) _n is calculated based on Based on the virtual operation amount U (^) _n and the control amount X _n measured by the controlled object 1, the control amount predicting unit 7 calculates the virtual operation amount U (^) _n as an adjustment signal after pΔt time. Predicted controlled variable Χ
(~) Calculate _{n + p} . pΔt is the response time of the controlled object 1. Manipulated variable correcting section 8, based on the control amount X _{n + p} and the target control amount X ^d after predicted pΔt time by the control amount prediction unit 7, the predicted controlled variable X (~) n + _p is the target control amount X ^The virtual manipulated variable U (^) _n is modified so as to approach ^d . The first switch 9 closes downward when the controlled variable is measured by the controlled object 1,
Virtual operation amount U (^) _n calculated by the virtual operation amount calculation unit 6
Is input to the control amount predicting unit 7, and then the first switch 9 is closed upward until the next measurement, and the virtual operation amount U (^) _n corrected by the operation amount correcting unit 8 is input to the control amount predicting unit 7 Input, and until the predetermined number of times, or until the difference between the predicted control amount X (-) _{n + p} and the target control amount ^Xd becomes a predetermined value or less, the control amount prediction unit 7 predicts the control amount and corrects the manipulated variable. The correction of the operation amount by the unit 8 is repeated. The control amount prediction unit 7 predicts the control amount and the operation amount correcting unit 8 corrects the operation amount up to a predetermined number of times, or the difference between the predicted control amount Χ (~) _{n + p} and the target control amount X ^d is a predetermined value. When it is repeated until the following, the second switch 10 is closed and the virtual operation amount U (^) _n is output to the operation unit 3 as the adjustment signal C _n .

【００６４】図８は図７に示した制御量予測部７の構成
の一例を示す図である。ここでも、制御量予測部７は神
経回路モデルから構成されている。図８では時刻ｎΔｔ
の仮想操作量Ｕ（＾）_ｎ＝（Ｕ（＾）１_ｎ，Ｕ（＾）２
_ｎ，…，Ｕ（＾）ｇ_ｎ）^Ｔ（ｇは操作量の次元数を表
す）と制御量Χ_ｎ＝（Χ１_ｎ，Χ２_ｎ，…，Χｈ_ｎ）^Ｔ
（ｈは制御量の次元数を表す）が入力されると、応答時
間ｐΔｔ後の予測制御量Ｘ（〜）_n+p＝（Ｘ（〜）１
_n+p，Χ（〜）２_n+p，…，Χ（〜）ｈ_n+p）^Ｔが出力
される神経回路モデル１１の制御時の予測計算の信号の
流れを示している。この入力信号から出力信号を求める
のは先に示した神経回路モデル１１の前向き計算によっ
て行われる。FIG. 8 is a diagram showing an example of the configuration of the control amount predicting section 7 shown in FIG. In this case as well, the control amount prediction unit 7 is composed of a neural circuit model. In FIG. 8, time nΔt
Virtual operation amount U (^) _n = (U (^) _1n , U (^) 2 of
_n , ..., U (^) g _n ) ^T (g represents the number of dimensions of the manipulated variable) and the controlled variable Χ _n = (Χ ₁ _n , Χ 2 _n , ..., Χ h _n ) ^T
When (h represents the number of dimensions of the controlled variable) is input, the predicted controlled variable X (~) _{n + p} = (X (~) 1 after the response time pΔt is reached.
_{n + p} , Χ (-) _{2n + p} , ..., Χ (-) h _{n + p} ) ^T is output, showing the signal flow of the prediction calculation during control of the neural circuit model 11. The output signal is obtained from the input signal by the forward calculation of the neural circuit model 11 described above.

【００６５】制御量予測部７が神経回路モデル１１から
構成される場合には、操作量修正部８は先に示した神経
回路モデル１１の誤差逆伝播計算によって仮想操作量Ｕ
（＾）_ｎの修正量を求め、仮想操作量が修正される。When the control amount predicting unit 7 is composed of the neural circuit model 11, the operation amount correcting unit 8 calculates the virtual operation amount U by the error backpropagation calculation of the neural circuit model 11 described above.
(^) The virtual operation amount is corrected by _obtaining the correction amount of _n .

【００６６】ここで、制御量予測部７が神経回路モデル
１１から構成される場合には、学習部４は以下のように
構成される。Here, when the control amount predicting unit 7 is composed of the neural circuit model 11, the learning unit 4 is structured as follows.

【００６７】すなわち、学習部４は、制御対象１の応答
時間ｐΔｔと、操作遅れ時間ｑΔｔと、観測データ記憶
部５に記憶された過去の制御量と操作量からなる観測デ
ータに基づいて学習データを作成し、制御量予測部７を
構成する神経回路モデル１１の学習を行う。That is, the learning unit 4 learns data based on the response time pΔt of the controlled object 1, the operation delay time qΔt, and the observation data that is stored in the observation data storage unit 5 and includes the past control amount and operation amount. Is created, and the neural circuit model 11 that constitutes the control amount prediction unit 7 is learned.

【００６８】観測データの時刻ｓΔｔの操作量Ｕ_ｓと制
御量Χ_ｓは時刻ｓΔｔに制御対象１に入力された操作量
と時刻ｓΔｔに制御対象１で計測された制御量である。
従って、操作部３が時間遅れを持つ場合に、入力信号
（Ｕ_ｓ，Ｘ_ｓ）、教師信号（Ｘ_s+p）（ｓ＝ｎ−ｌ，ｎ
−ｌ＋１，…，ｎ−ｐ−１）を学習データとして学習を
行うと、神経回路モデル１１は時刻ｓΔｔに計測された
制御量Ｘ_ｓと時刻ｓΔｔに制御対象１に入力された操作
量Ｕ_ｓからｐΔｔ時間後の制御量Ｘ_s+pを出力するよう
になる。この神経回路モデル１１を制御量予測部７とし
て制御を行うと、調節部２では予測制御量Ｘ（〜）_n+p
が目標制御量Ｘ^ｄに近付くための操作量Ｕ_ｎの値が求め
られてしまい、実際にその値が制御対象１に入力される
のは操作遅れ時間ｑΔｔ後になるので、正確な制御がで
きなくなる。[0068] an operation amount U _s and the controlled variable chi _s operation amount input to the time sΔt to the controlled object 1 and the time control amount measured by the control target 1 to sΔt time sΔt observation data.
Therefore, when the operation unit 3 has a time delay, the input signal (U _s , X _s ) and the teacher signal (X _{s + p} ) (s = n-1, n)
When learning is performed using −l + 1, ..., Np−1) as learning data, the neural circuit model 11 causes the controlled variable X _s measured at time sΔt and the manipulated variable U _s input to the controlled object 1 at time sΔt. Then, the control amount X _{s + p} after pΔt time is output. When this neural circuit model 11 is controlled as the control amount predictor 7, the controller 2 causes the predictive control amount X (-) _{n + p}
There will be a demand value of the operating amount U _n for approaching the target control amount X ^d, since the actual value is input to the control object 1 is after the operation delay time Qderutati, can not be accurately controlled .

【００６９】そこで、本発明では、神経回路モデル１１
に、制御対象１で計測された制御量Ｘ_ｎと調節信号Ｃ_ｎ
を入力すると、ｐΔｔ時間後の予測制御量Ｘ（〜）_n+p
が出力されるように、神経回路モデル１１の学習を行う
ために、図９に示すように、制御量Ｘｓ＝（Ｘ１_ｓ，Ｘ
２_ｓ，…，Ｘｈ_ｓ）と操作遅れ時間ｑΔｔ後に制御対象
１に入力される操作量Ｕ_s+q＝（Ｕ１_s+q，Ｕ２_s+q，
…，Ｕｇ_s+q）を入力信号、応答時間ｐΔｔ後の制御量
Χ_s+p＝（Χ１_s+p，Χ２_s+p，…Ｘｈ_s+p）を教師信
号にした学習データを作成する（ｓ＝ｎ−ｌ，ｎ−ｌ＋
１，…，ｎ−ｐ−１）。この学習データで学習ができれ
ば、制御時に調節部２で求められた調節信号Ｃ_ｎは、操
作遅れ時間ｑΔｔ後に制御対象１に入力された時に応答
時間ｐΔｔ後の制御量Ｘ_n+pが目標制御量Ｘ^ｄに近付く
操作量なので、正確な制御ができる。Therefore, in the present invention, the neural circuit model 11
In addition, the controlled variable X _n and the adjustment signal C _n measured in the controlled object 1
, The predicted controlled variable X (~) _{n + p} after pΔt time
In order to perform the learning of the neural circuit model 11 so as to output the control amount Xs = (X1 _s , X
2 _s , ..., Xh _s ) and the manipulated variable U _{s + q} = (U1 _{s + q} , U2 _{s + q} , which is input to the controlled object 1 after the operation delay time qΔt.
,, Ug _{s + q} ) as the input signal, and the control amount after the response time pΔt Χ _{s + p} = (Χ 1 _{s + p} , Χ 2 _{s + p} ,… Xh _{s + p} ) is used as the teacher signal to create learning data. (S = n-1, n-1 +
1, ..., Np-1). If learning can be performed with this learning data, the control signal C _n obtained by the adjusting unit 2 during control is the target control amount X _{n + p} after the response time pΔt when input to the control target 1 after the operation delay time qΔt. Since the operation amount approaches the amount ^Xd , accurate control can be performed.

【００７０】次に、本発明の他の実施の形態を図１０に
基づき説明する。Next, another embodiment of the present invention will be described with reference to FIG.

【００７１】図１０に示す制御装置では、図６に示した
制御装置に操作遅れ時間計測部２０が付加されている。
操作遅れ時間計測部２０は過去ｍ単位時間に調節部２で
算出された調節信号の時系列Ｃ＝（Ｃ_n-m，Ｃ_n-m+1，
…，Ｃ_n-1）と、操作部３から出力された操作量の時系
列Ｕ＝（Ｕ_n-m，Ｕ_n-m+1，…，Ｕ_n-1）に基づき、調
節信号が操作部３に入力されて操作量として出力される
までの操作遅れ時間ｑΔｔを計算し、学習部４に出力す
る。ｍは正の整数である。操作遅れ時間計測部２０が付
加されることにより、操作部３の特性変化によって操作
遅れ時間ｑΔｔが変化する場合でも、操作遅れ時間計測
部２０によって変化後の操作遅れ時間ｑΔｔが計算さ
れ、学習部４に出力される。学習部４では操作遅れ時間
計測部２０から入力された操作遅れ時間ｑΔｔに従い、
学習データを作成し、学習を行うので、操作遅れ時間が
変化する場合でも正しく制御し続けることができる。In the control device shown in FIG. 10, an operation delay time measuring section 20 is added to the control device shown in FIG.
The operation delay time measuring unit 20 has a time series C = (C _nm , C _{n-m + 1} , of the adjustment signal calculated by the adjusting unit 2 in the past m unit time,
, C _n-1 ) and the time series U = (U _nm , U _{n-m + 1} , ..., U _n-1 ) of the manipulated variables output from the operating unit 3, the adjustment signal The operation delay time qΔt from the input to the output to the output as the operation amount is calculated and output to the learning unit 4. m is a positive integer. By adding the operation delay time measuring unit 20, even if the operation delay time qΔt changes due to the characteristic change of the operation unit 3, the operation delay time measuring unit 20 calculates the changed operation delay time qΔt, and the learning unit 4 is output. In the learning unit 4, according to the operation delay time qΔt input from the operation delay time measuring unit 20,
Since learning data is created and learning is performed, correct control can be continued even if the operation delay time changes.

【００７２】次に、本発明の他の実施の形態を図１１に
基づき説明する。Next, another embodiment of the present invention will be described with reference to FIG.

【００７３】図１１に示す制御装置では、制御対象１で
計測された制御量を調節部２に伝送するのに、むだ時間
を持つ伝送部が存在することを考慮したものである。The control device shown in FIG. 11 takes into consideration that there is a transmission section having a dead time for transmitting the control amount measured by the controlled object 1 to the adjustment section 2.

【００７４】同図に示すように、時刻ｎΔｔに制御対象
１で計測された制御量Χ_ｎは、伝送部２１を通して調節
部２に入力されるが、伝送部２１は伝送むだ時間を持
ち、制御対象１で計測された制御量Χ_ｎは伝送むだ時間
遅れて調節部２に入力される。時刻ｎΔｔに調節部２に
入力される制御量をＸ′_ｎと表す。調節部２は制御量
Ｘ′_ｎと目標制御量Ｘ^ｄに基づき、調節信号Ｃ_ｎを算出
する。操作部３では調節部２で算出された調節信号Ｃ_ｎ
に基づき操作量Ｕ_ｎが出力される。制御対象１には操作
部３が出力した操作量Ｕ_ｎが入力され、その操作量Ｕ_ｎ
が変化することで制御される。As shown in the figure, the control amount _n measured at the controlled object 1 at the time nΔt is input to the adjustment unit 2 through the transmission unit 21, but the transmission unit 21 has a transmission dead time and control is performed. control amount chi _n measured by the subject 1 is input to the modulating portion 2 delayed transmission dead time. The control amount input to the adjusting unit 2 at time nΔt is represented by X ′ _n . The adjustment unit 2 calculates the adjustment signal C _n based on the control amount X ′ _n and the target control amount X ^d . In the operation unit 3, the adjustment signal C _n calculated by the adjustment unit 2
The manipulated variable U _n is output based on The controlled object 1 is input operating amount U _n of the operation unit 3 and output, the operation amount U _n
Is controlled by changing.

【００７５】学習部４は制御対象１に操作量が入力され
てからその操作量による影響が制御量に現れるまでの応
答時間ｐΔｔと、制御対象１で計測された制御量Ｘ_ｎが
伝送部２１を通して調節部２に入力されるまでの伝送む
だ時間ｒΔｔと、観測データ記憶部５に記憶された過去
ｌ単位時間に計測された制御量（Χ_n-l，Ｘ_n-l+1，
…，Ｘ_n-1）と制御対象１に入力された操作量
（Ｕ_n-l，Ｕ_n-l+1，…，Ｕ_n-1）からなる観測データ
に基づいて、調節部２のパラメータＷを学習によって調
節し、調節部２へ出力する。上記伝送むだ時間は、操作
遅れ時間ｑΔｔや応答時間ｐΔｔと同様に、制御対象１
で計測された制御量Ｘ_ｎが伝送部２１を通して調節部２
に入力されるまでの時間を予め計測あるいは算出して設
定しておいてもよいし、後述するように伝送部２１の変
動に応じて動的に設定するようにしてもよい。The learning unit 4 receives the response time pΔt from the input of the manipulated variable to the controlled object 1 until the influence of the manipulated variable appears in the controlled variable, and the controlled variable X _n measured by the controlled object 1 as the transmission unit 21. Transmission delay time rΔt until it is input to the adjustment unit 2 through the control data and the control amount (Χ _nl , X _{n-l + 1} ,
, X _n-1 ) and the manipulated variables (U _nl , U _{n-l + 1} , ..., U _n-1 ) input to the controlled object 1, based on the observation data, the parameter W of the adjusting unit 2 is set. It is adjusted by learning and output to the adjusting unit 2. The transmission dead time is the same as the operation delay time qΔt and the response time pΔt.
The control amount X _n measured by the controller 2 is transmitted through the transmitter 21.
The time until it is input may be measured or calculated and set in advance, or may be dynamically set according to the variation of the transmission unit 21 as described later.

【００７６】図１２は図１１に示した調節部２の構成の
一例を示す図である。FIG. 12 is a diagram showing an example of the configuration of the adjusting section 2 shown in FIG.

【００７７】図１２に示す調節部２が図７に示した調節
部２と異なるのは調節部２に入力される制御量Ｘ´_ｎが
制御対象１で計測された制御量Ｘ_ｎに対して伝送むだ時
間ｒΔｔ遅れていることである。仮想操作量算出部６は
伝送部２１を通して入力された制御量Ｘ′_ｎ及び目標制
御量Ｘ^ｄに基づき、仮想操作量Ｕ（＾）_ｎを算出する。
制御量予測部７は仮想操作量Ｕ（＾）_ｎと制御量Χ´_ｎ
に基づき、仮想操作量Ｕ（＾）_ｎが調節信号として出力
された場合のｐΔｔ時間後の予測制御量Ｘ（〜）_n+pを
算出する。ｐΔｔは制御対象１の応答時間である。操作
量修正部８は、制御量予測部７で予測されたｐΔｔ時間
後の制御量Ｘ_n+p及び目標制御量Ｘ^ｄに基づき、予測制
御量Ｘ（〜）_n+pが目標制御量Ｘ^ｄに近付くように仮想
操作量Ｕ（＾）_ｎを修正する。第１スイッチ９は制御対
象１で制御量が計測されると下側に閉じ、仮想操作量算
出部６で算出された仮想操作量Ｕ（＾）_ｎを制御量予測
部７に入力し、その後、第１スイッチ９は次の計測時ま
で上側に閉じて、操作量修正部８で修正された仮想操作
量Ｕ（＾）_ｎを制御量予測部７の入力にし、所定の回
数、あるいは予測制御量Χ（〜）_n+pと目標制御量Χ^ｄ
の差が所定の値以下になるまで、制御量予測部７による
制御量の予測と操作量修正部８による操作量の修正が繰
り返される。制御量予測部７による制御量の予測と操作
量修正部８による操作量の修正が所定の回数まで、ある
いは予測制御量Χ（〜）_n+pと目標制御量Ｘ^ｄの差が所
定の値以下になるまで繰り返されると第２スイッチ１０
が閉じ、仮想操作量Ｕ（＾）_ｎが調節信号Ｃ_ｎとして操
作部３へ出力される。[0077] adjusting section 2 shown in FIG. 12 relative to adjustment portion 2 differs from the control amount control amount _X'n inputted to modulating section 2 is measured by the control target 1 is X _n shown in FIG. 7 That is, the transmission dead time rΔt is delayed. The virtual operation amount calculation unit 6 calculates the virtual operation amount U (^) _n based on the control amount _X'n and the target control amount ^Xd input through the transmission unit 21.
The control amount predicting unit 7 calculates the virtual operation amount U (^) _n and the control amount Χ ' _n.
Based on, the predicted control amount X (-) _{n + p} after pΔt time when the virtual operation amount U (^) _n is output as the adjustment signal is calculated. pΔt is the response time of the controlled object 1. Manipulated variable correcting section 8, based on the control amount X _{n + p} and the target control amount X ^d after predicted pΔt time by the control amount prediction unit 7, the predicted controlled variable X (~) n + _p is the target control amount X ^The virtual manipulated variable U (^) _n is modified so as to approach ^d . The first switch 9 closes downward when the controlled variable is measured by the controlled object 1, inputs the virtual manipulated variable U (^) _n calculated by the virtual manipulated variable calculating unit 6 into the controlled variable predicting unit 7, and thereafter. , The first switch 9 is closed to the upper side until the next measurement, and the virtual operation amount U (^) _n corrected by the operation amount correction unit 8 is input to the control amount prediction unit 7, and the predetermined number of times or the prediction control is performed. Quantity Χ (~) _{n + p} and target control quantity Χ ^d
The control amount prediction by the control amount prediction unit 7 and the correction of the operation amount by the operation amount correction unit 8 are repeated until the difference is less than or equal to a predetermined value. The control amount prediction unit 7 predicts the control amount and the operation amount correcting unit 8 corrects the operation amount up to a predetermined number of times, or the difference between the predicted control amount Χ (~) _{n + p} and the target control amount X ^d is a predetermined value. When repeated until the second switch 10
Is closed, and the virtual operation amount U (^) _n is output to the operation unit 3 as the adjustment signal C _n .

【００７８】図１３は図１２に示した制御量予測部７の
構成の一例を示す図である。ここでも、制御量予測部７
は神経回路モデルから構成されている。図１３では時刻
ｎΔｔの仮想操作量Ｕ（＾）_ｎ＝（Ｕ（＾）１_ｎ，Ｕ
（＾）２_ｎ，…，Ｕ（＾）ｇ_ｎ）^Ｔ（ｇは操作量の次元
数を表す）と制御量Χ´_ｎ（Ｘ１′_ｎ，Ｘ２′_ｎ，…，
Χｈ′_ｎ）^Ｔ（ｈは制御量の次元数を表す）が入力され
ると、応答時間ｐΔｔ後の予測制御量Ｘ（〜）_n+p＝
（Ｘ１_n+p，Ｘ２_n+p，…，Ｘｈ_n+p）^Ｔが出力される
神経回路モデル１１の制御時の予測計算の信号の流れを
示している。この入力信号から出力信号を求めるのは先
に示した神経回路モデル１１の前向き計算によって行わ
れる。FIG. 13 is a diagram showing an example of the configuration of the control amount predicting section 7 shown in FIG. Again, the control amount predicting unit 7
Is composed of a neural circuit model. In FIG. 13, the virtual manipulated variable U (^) _n = (U (^) 1 _n , U at time nΔt
_{(^) 2 n, ...,} U (^) g n) T (g represents the number of dimensions of the operation amount) and the controlled variable _{_{Χ'n (X1 'n, X2}} ' n, ...,
When Χh ′ _n ) ^T (h represents the number of dimensions of the controlled variable) is input, the predicted controlled variable X (∼) _{n + p} = after the response time pΔt
(X1 _{n + p} , X2 _{n + p} , ..., Xh _{n + p} ) ^T shows the signal flow of the prediction calculation during control of the neural circuit model 11. The output signal is obtained from the input signal by the forward calculation of the neural circuit model 11 described above.

【００７９】制御量予測部７が神経回路モデル１１から
構成される場合には、操作量修正部８は先に示した神経
回路モデル１１の誤差逆伝播計算によって仮想操作量Ｕ
（＾）_ｎの修正量を求め、仮想操作量を修正する。When the control amount predicting unit 7 is composed of the neural circuit model 11, the operation amount correcting unit 8 calculates the virtual operation amount U by the error back propagation calculation of the neural circuit model 11 described above.
(^) _Obtain the correction amount of _n and correct the virtual operation amount.

【００８０】ここで、制御量予測部７が神経回路モデル
１１から構成される場合には、学習部４は以下のように
構成される。Here, when the control amount predicting unit 7 is composed of the neural circuit model 11, the learning unit 4 is structured as follows.

【００８１】すなわち、学習部４は、制御対象１の応答
時間ｐΔｔと、伝送むだ時間ｒΔｔと、観測データ記憶
部５に記憶された過去の制御量と操作量からなる観測デ
ータに基づいて学習データを作成し、制御量予測部７を
構成する神経回路モデル１１の学習を行う。伝送部２１
に伝送むだ時間がある場合に、入力信号（Ｕ_ｓ，
Ｘ_ｓ）、教師信号（Ｘ_s+p）（ｓ＝ｎ−ｌ，ｎ−ｌ＋
１，…，ｎ−ｐ−１）を学習データとして学習を行う
と、神経回路モデル１１は時刻ｓΔｔに計測された制御
量Ｘ_ｓと時刻ｓΔｔに制御対象１に入力された操作量Ｕ
_ｓから応答時間ｐΔｔ後の制御量Χ_s+pを出力するよう
になる。この神経回路モデル１１を制御量予測部７とし
て制御を行うと、制御量予測部７では伝送むだ時間ｒΔ
ｔ前に計測された制御量を現在の制御量としてｐΔｔ時
間後の予測制御量Ｘ（〜）_n+pを出力するため、正確な
制御ができなくなる。That is, the learning unit 4 learns data based on the response time pΔt of the controlled object 1, the transmission dead time rΔt, and the observation data including the past control amount and operation amount stored in the observation data storage unit 5. Is created, and the neural circuit model 11 that constitutes the control amount prediction unit 7 is learned. Transmission unit 21
If there is a dead time in transmission, the input signal (U _s ,
X _s), a teacher signal _{(X s + p) (s} = n-l, n-l +
1, ..., Np-1) is used as learning data, the neural circuit model 11 controls the controlled variable X _s measured at time sΔt and the manipulated variable U input to the controlled object 1 at time sΔt.
_{From s,} the control amount Χ _{s + p} after the response time pΔt is output. When this neural circuit model 11 is controlled by the control amount predicting unit 7, the control amount predicting unit 7 transmits the transmission dead time rΔ.
Since the control amount measured before t is used as the current control amount and the predicted control amount X (~) _{n + p} after _{p [} Delta] t time is output, accurate control cannot be performed.

【００８２】そこで、本発明では、時刻ｎΔｔに調節部
２に入力される制御量Ｘ′_ｎと調節信号Ｃ_ｎを神経回路
モデル１１に入力すると、ｐΔｔ時間後の予測制御量Χ
（〜）_n+pを出力するように、神経回路モデル１１の学
習を行うために、図１４に示すように、伝送むだ時間前
に制御対象１で計測された制御量Ｘ_s-r＝（Ｘ１_s-r，
Ｘ２_s-r，…，Ｘｈ_s-r）と操作量Ｕ_ｓ＝（Ｕ１_ｓ，Ｕ
２_ｓ，…，Ｕｇ_ｓ）を入力信号、応答時間ｐΔｔ後の制
御量Χ_s+p＝（Χ１_s+p，Χ_2+p，…，Ｘｈ_s+ _p）を教
師信号にする（ｓ＝ｎ−ｌ＋ｒ，ｎ−ｌ＋ｒ＋１，…，
ｎ−ｐ−１）。この学習データで学習ができれば、制御
時に調節部２で求められた調節信号Ｃ_ｎは、伝送むだ時
間ｒΔｔ前に制御対象１で計測された制御量Ｘ´_ｎと調
節信号Ｃ_ｎから予測される応答時間ｐΔｔ後の制御量Χ
_n+pが目標制御量Χ^ｄに近付く調節信号なので、正確な
制御ができる。[0082] In the present invention, when the control amount X _'n and the adjustment signal C _n inputted to modulating section 2 at time nΔt input to the neural network model 11, the prediction control amount after pΔt time Χ
In order to perform learning of the neural circuit model 11 so that (~) _{n + p} is output, as shown in FIG. 14, the controlled _variable X _sr = (X1 _sr measured by the controlled object 1 before the dead time of the transmission. ，
X2 _sr , ..., Xh _sr ) and the manipulated variable U _s = (U1 _s , U)
2 _s , ..., Ug _s ) as an input signal, and a control amount Χ _{s + p} = (Χ 1 _{s + p} , Χ _{2 + p} , ..., Xh _{s +} _p ) after the response time pΔt is used as a teacher signal (s = n -L + r, n-l + r + 1, ...,
n-p-1). If learning can be performed with this learning data, the adjustment signal C _n obtained by the adjustment unit 2 at the time of control is predicted from the control amount X ′ _n and the adjustment signal C _n measured by the controlled object 1 before the transmission dead time rΔt. Control amount Χ after response time pΔt
Since _{n + p} is adjustment signal approaches the target control amount chi ^d, it can be accurately controlled.

【００８３】次に、本発明の他の実施の形態を図１５に
基づき説明する。Next, another embodiment of the present invention will be described with reference to FIG.

【００８４】図１５に示す制御装置では、図１１に示し
た制御装置に伝送むだ時間計測部２２が付加されてい
る。伝送むだ時間計測部２２は、過去ｍ単位時間に制御
対象１で計測された制御量の時系列Ｘ＝（Ｘ_n-m，Ｘ
_n-m+1，…，Χ_n-1）と、調節部２に入力された制御量
の時系列Ｘ′＝（Ｘ_n-m，Χ´_n-m+1，…，Χ´_n-1）
に基づき、制御対象１で計測された制御量が調節部２に
入力されるまでの伝送むだ時間ｒΔｔを計算し、学習部
４に出力する。ｍは正の整数である。伝送むだ時間計測
部２２が付加されることにより、伝送部２１の特性変化
によって伝送むだ時間が変化する場合でも、伝送むだ時
間計測部２２によって変化後の伝送むだ時間ｒΔｔが計
算され、学習部４に出力される。学習部４では伝送むだ
時間計測部２２から入力された伝送むだ時間ｒΔｔに従
い、学習データを作成し、学習を行うので、伝送むだ時
間が変化する場合でも正しく制御し続けることができ
る。In the control device shown in FIG. 15, a transmission dead time measuring section 22 is added to the control device shown in FIG. The transmission dead time measuring unit 22 determines the time series X = (X _nm , X of the controlled variable measured by the controlled object 1 in the past m unit time.
_{n-m + 1} , ..., Χ _n-1 ) and the time series of control quantities input to the adjusting unit 2 X '= (X _nm , Χ' _{n-m + 1} , ..., Χ ' _n-1 ).
Based on, the transmission dead time rΔt until the controlled variable measured by the controlled object 1 is input to the adjusting unit 2 is calculated and output to the learning unit 4. m is a positive integer. By adding the transmission dead time measuring unit 22, even if the transmission dead time changes due to the characteristic change of the transmission unit 21, the transmission dead time measuring unit 22 calculates the changed transmission dead time rΔt, and the learning unit 4 Is output to. The learning unit 4 creates learning data according to the transmission dead time rΔt input from the transmission dead time measuring unit 22 and performs learning, so that correct control can be continued even when the transmission dead time changes.

【００８５】次に、本発明の他の実施の形態を図１６に
基づき説明する。Next, another embodiment of the present invention will be described with reference to FIG.

【００８６】図１６に示す制御装置は、操作部３に調節
部２で算出された調節信号が操作量として制御対象１に
入力されるまでにむだ時間を持つこと、伝送部２１に制
御対象１で計測された制御量を調節部２に伝送するのに
むだ時間を持つことを考慮したものである。同図に示す
ように、時刻ｎΔｔに制御対象１で計測された制御量Ｘ
_ｎは、伝送部２１を通して調節部２に入力されるが、伝
送部２１は伝送むだ時間ｒΔｔを持ち、制御対象１で計
測された制御量Ｘ_ｎは伝送むだ時間ｒΔｔ遅れて調節部
２に入力される。時刻ｎΔｔに調節部２に入力される制
御量をＸ′_ｎと表す。調節部２は制御量Ｘ′_ｎと目標制
御量Ｘ^ｄに基づき、調節信号Ｃ_ｎを算出する。操作部３
では調節部２で算出された調節信号Ｃ_ｎに基づき操作量
Ｕ_ｎが出力されるが、操作部３は時間遅れを持ち、入力
された調節信号Ｃ_ｎは操作遅れ時間ｑΔｔ遅れて操作量
として出力される。制御対象１には操作部３が出力した
操作量Ｕ_ｎが入力され、その操作量Ｕ_ｎが変化すること
で制御される。In the control device shown in FIG. 16, the operation unit 3 has a dead time until the adjustment signal calculated by the adjustment unit 2 is input to the control target 1 as an operation amount, and the transmission unit 21 controls the control target 1 This is because it takes a dead time to transmit the control amount measured in step 2 to the adjustment part 2. As shown in the figure, the controlled variable X measured by the controlled object 1 at time nΔt
_n is input to the adjusting unit 2 through the transmitting unit 21, and the transmitting unit 21 has a transmission dead time rΔt, and the control amount X _n measured by the controlled object 1 is input to the adjusting unit 2 with a delay of the transmission dead time rΔt. To be done. The control amount input to the adjusting unit 2 at time nΔt is represented by X ′ _n . The adjustment unit 2 calculates the adjustment signal C _n based on the control amount X ′ _n and the target control amount X ^d . Operation part 3
Then, the operation amount U _n is output based on the adjustment signal C _n calculated by the adjustment unit 2, but the operation unit 3 has a time delay, and the input adjustment signal C _n is delayed as the operation delay time qΔt to be an operation amount. Is output. The operation amount U _n output by the operation unit 3 is input to the controlled object 1 and is controlled by changing the operation amount U _n .

【００８７】学習部４には制御対象１に操作量が入力さ
れてからその操作量による影響が制御量に現れるまでの
応答時間ｐΔｔと、調節部２で算出された調節信号が操
作部３を通して操作量として実現されるまでの操作遅れ
時間ｑΔｔと、制御対象１で計測された制御量が伝送部
２１を通して調節部２に入力されるまでの伝送むだ時間
ｒΔｔと、観測データ記憶部５に記憶された過去ｌ単位
時間に計測された制御量（Ｘ_n-l，Ｘ_n-l+1，…，Ｘ
_n-1）と制御対象１に入力された操作量（Ｕ_n-l，Ｕ
_n-l+1，…，Ｕ_n-1）からなる観測データに基づいて、
調節部２のパラメータＷを学習によって調節し、調節部
２へ出力する。In the learning unit 4, the response time pΔt from the input of the operation amount to the controlled object 1 until the influence of the operation amount appears in the control amount and the adjustment signal calculated by the adjusting unit 2 are transmitted through the operating unit 3. The operation delay time qΔt until it is realized as an operation amount, the transmission dead time rΔt until the control amount measured by the controlled object 1 is input to the adjustment unit 2 through the transmission unit 21, and stored in the observation data storage unit 5. Control amount (X _nl , X _{n-l + 1} , ..., X
_n-1 ) and the manipulated _variables (U _nl , U input to the controlled object ₁ )
_{n-l + 1} , ..., U _n-1 )
The parameter W of the adjusting unit 2 is adjusted by learning and output to the adjusting unit 2.

【００８８】図１７は図１６に示した調節部２の構成の
一例を示す図である。FIG. 17 is a diagram showing an example of the configuration of the adjusting section 2 shown in FIG.

【００８９】図１７に示すように、仮想操作量算出部６
は伝送部２１を通して入力された制御量Ｘ′_ｎ及び目標
制御量Ｘ^ｄに基づき、仮想操作量Ｕ（＾）_ｎを算出す
る。制御量予測部７は仮想操作量Ｕ（＾）_ｎと制御量
Ｘ′_ｎに基づき、仮想操作量Ｕ（＾）_ｎが調節信号とし
て出力された場合のｐΔｔ時間後の予測制御量Ｘ（〜）
_n+pを算出する。ｐΔｔは制御対象１の応答時間であ
る。操作量修正部８は、制御量予測部７で予測されたｐ
Δｔ時間後の制御量Ｘ_n+p及び目標制御量Ｘ^ｄに基づ
き、予測制御量Ｘ（〜）_n+pが目標制御量Ｘ^ｄに近付く
ように仮想操作量Ｕ（＾）_ｎを修正する。第１スイッチ
９は制御対象１で制御量が計測されると下側に閉じ、仮
想操作量算出部６で算出された仮想操作量Ｕ（＾）_ｎを
制御量予測部７に入力し、その後、第１スイッチ９は次
の計測時まで上側に閉じて、操作量修正部８で修正され
た仮想操作量Ｕ（＾）_ｎを制御量予測部７の入力にし、
所定の回数、あるいは予測制御量Ｘ（〜）_n+pと目標制
御量Ｘ^ｄの差が所定の値以下になるまで、制御量予測部
７による制御量の予測と操作量修正部８による操作量の
修正が繰り返される。制御量予測部７による制御量の予
測と操作量修正部８による操作量の修正が所定の回数ま
で、あるいは予測制御量Ｘ（〜）_n+pと目標制御量Χ^ｄ
の差が所定の値以下になるまで繰り返されると、第２ス
イッチ１０が閉じ、仮想操作量Ｕ（＾）_ｎが調節信号Ｃ
_ｎとして操作部３へ出力される。As shown in FIG. 17, the virtual operation amount calculator 6
Calculates a virtual manipulated variable U (^) _n based on the controlled variable _X'n and the target controlled variable ^Xd input through the transmission unit 21. Control amount prediction unit 7 based on the virtual operation amount U (^) _n and the controlled variable X _'n, the virtual operating amount U (^) predicted controlled amount after pΔt time _{when n} is outputted as the adjustment signal X (~ )
Calculate _{n + p} . pΔt is the response time of the controlled object 1. The manipulated variable correcting unit 8 calculates p calculated by the control amount predicting unit 7.
Based on the Δt time control amount after X _{n + p} and the target control amount X ^d, modify the virtual operation amount U (^) _n, as predicted controlled variable X (~) n + _p approaches the target control amount X ^d . The first switch 9 closes downward when the controlled variable is measured by the controlled object 1, inputs the virtual manipulated variable U (^) _n calculated by the virtual manipulated variable calculating unit 6 into the controlled variable predicting unit 7, and thereafter. , The first switch 9 is closed upward until the next measurement, and the virtual manipulated variable U (^) _n corrected by the manipulated variable correcting unit 8 is input to the control amount predicting unit 7,
Prediction of the control amount by the control amount predicting unit 7 and operation by the operation amount correcting unit 8 until a predetermined number of times or a difference between the predicted control amount X (-) _{n + p} and the target control amount ^Xd becomes a predetermined value or less The amount modification is repeated. The control amount prediction unit 7 predicts the control amount and the operation amount correction unit 8 corrects the operation amount up to a predetermined number of times, or the predicted control amount X (-) _{n + p} and the target control amount Χ ^d
Is repeated until the difference becomes less than or equal to a predetermined value, the second switch 10 is closed, and the virtual manipulated variable U (^) _n becomes the adjustment signal C.
It is output to the operation unit 3 as _n .

【００９０】図１８は図１７に示した制御量予測部７の
構成の一例を示す図である。ここでも、制御量予測部７
は神経回路モデルから構成されている。図１８では時刻
ｎΔｔの仮想操作量Ｕ（＾）_ｎ＝（Ｕ（＾）１_ｎ，Ｕ
（＾）２_ｎ，…，Ｕ（＾）ｇ_ｎ）^Ｔ（ｇは操作量の次元
数を表す）と制御量Ｘ´_ｎ＝（Ｘ１′_ｎ，Χ２´_ｎ，
…，Ｘｈ´_ｎ）^Ｔ（ｈは制御量の次元数を表す）を入力
すると、ｐΔｔ時間後の予測制御量Ｘ（〜）_n+p＝（Ｘ
（〜）１_n+p，Ｘ（〜）２_n+p，…，Ｘ（〜）ｈ_n+ _p）
^Ｔを出力する神経回路モデル１１の制御時の予測計算の
信号の流れを示している。この入力信号から出力信号を
求めるのは前述した神経回路モデル１１の前向き計算に
よって行われる。FIG. 18 is a diagram showing an example of the configuration of the control amount predicting section 7 shown in FIG. Again, the control amount predicting unit 7
Is composed of a neural circuit model. In FIG. 18, the virtual operation amount U (^) _n = (U (^) 1 _n , U at time nΔt
_{(^) 2 n, ...,} U (^) g n) T (g represents the number of dimensions of the operation amount) and the controlled variable _{_{X'n = (X1 'n,}} Χ2' n,
, Xh ' _n ) ^T (where h represents the number of dimensions of the controlled variable), the predicted controlled variable X (~) _{n + p} = (X
(~) 1 _{n + p} , X (~) 2 _{n + p} , ..., X (~) h _{n +} _p )
The signal flow of the prediction calculation at the time of control of the neural circuit model 11 which outputs ^T is shown. The output signal is obtained from this input signal by the forward calculation of the neural circuit model 11 described above.

【００９１】制御量予測部７が神経回路モデル１１から
構成される場合には、操作量修正部８は前述した神経回
路モデル１１の誤差逆伝播計算によって仮想操作量Ｕ
（＾）_ｎの修正量を求め、仮想操作量を修正する。When the control amount predicting unit 7 is composed of the neural circuit model 11, the operation amount correcting unit 8 calculates the virtual operation amount U by the error backpropagation calculation of the neural circuit model 11 described above.
(^) _Obtain the correction amount of _n and correct the virtual operation amount.

【００９２】ここで、制御量予測部７が神経回路モデル
１１から構成される場合には、学習部４は以下のように
構成される。Here, when the control amount predicting section 7 is composed of the neural circuit model 11, the learning section 4 is structured as follows.

【００９３】学習部４は制御対象１の応答時間ｐΔｔ
と、操作遅れ時間ｑΔｔと、伝送むだ時間ｒΔｔと、観
測データ記憶部５に記憶された過去の制御量と操作量か
らなる観測データに基づいて学習データを作成し、制御
量予測部７を構成する神経回路モデル１１の学習を行
う。操作部３に操作遅れ時間があり、伝送部２１に伝送
むだ時間がある場合に、入力信号（Ｕ_ｓ，Χ_ｓ）、出力
信号（Ｘ_s+p）（ｓ＝ｎ−ｌ，ｎ−ｌ＋１，…，ｎ−ｐ
−１）を学習データとして学習を行うと、神経回路モデ
ル１１は時刻ｓΔｔに計測された制御量Ｘ_ｓと時刻ｓΔ
ｔに制御対象１に入力された操作量Ｕ_ｓからｐΔｔ時間
後の制御量Χ_s+pを出力するようになる。この神経回路
モデル１１を制御量予測部７に用いて制御を行うと、調
節部２では伝送むだ時間ｒΔｔ前に観測された制御量Ｘ
_n-rを現在の制御量Ｘ_ｎとみなし、また予測制御量Χ
（〜）_n+pが目標制御量Χ^ｄに近付くための操作量Ｕ_ｎ
の値が求められてしまい、実際にその値が制御対象１に
入力されるのは操作遅れ時間ｑΔｔ後になるので、正確
な制御ができなくなる。The learning section 4 determines the response time pΔt of the controlled object 1.
The learning amount is created based on the operation delay time qΔt, the transmission dead time rΔt, and the observation data including the past control amount and the operation amount stored in the observation data storage unit 5, and the control amount prediction unit 7 is configured. Learning of the neural circuit model 11 is performed. When the operation unit 3 has an operation delay time and the transmission unit 21 has a transmission dead time, an input signal (U _s , Χ _s ) and an output signal (X _{s + p} ) (s = n−1, n−1 + 1) , ..., np
When learning is performed using -1) as learning data, the neural network model 11 causes the controlled variable X _s measured at the time sΔt and the time sΔ.
The control amount Χ _{s + p} after pΔt time is output from the operation amount U _s input to the controlled object 1 at t. When control is performed using this neural circuit model 11 in the control amount predicting unit 7, the control unit 2 controls the control amount X observed before the transmission dead time rΔt.
_nr is regarded as the current controlled _variable X _n, and the predicted controlled variable Χ
(-) _{N + p} is the manipulated variable U _n for approaching the target controlled variable Χ ^d
Since the value of is calculated and the value is actually input to the controlled object 1 after the operation delay time qΔt, accurate control cannot be performed.

【００９４】そこで、本発明では、神経回路モデル１１
が、時刻ｎΔｔに調節部２に入力される制御量Ｘ´
_ｎと、時刻ｎΔｔに出力する調節信号Ｃ_ｎを入力する
と、ｐΔｔ時間後の予測制御量Χ（〜）_n+pを出力する
ように神経回路モデル１１の学習を行うため、図１９に
示すように、伝送むだ時間ｒΔｔ前に制御対象１で計測
された制御量Χ_s-r＝（Χ１_s-r，Ｘ２_s-r，…，Ｘｈ
_s-r）と操作遅れ時間ｑΔｔ後に制御対象１に入力され
る操作量Ｕ_s+q＝（Ｕ１_s+q，Ｕ２_s+q，…，Ｕ
ｇ_s+ _q）を入力信号、ｐΔｔ時間後の制御量Χ_s+p＝
（Χ１_s+p，Χ２_s+p，…，Χｈ_s+p）を教師信号にし
た学習データを作成する（ｓ＝ｎ−ｌ＋ｒ，ｎ−ｌ＋ｒ
＋１，…，ｎ−ｐ−１）。この学習データで学習ができ
れぱ、制御時に調節部２で求められた調節信号Ｃ_ｎは、
操作遅れ時間ｑΔｔ後に制御対象１に入力された時にｐ
Δｔ時間後の制御量Χ_n+pが目標制御量Ｘ^ｄに近付く操
作量なので、正確な制御ができる。Therefore, in the present invention, the neural circuit model 11
Is the control amount X ′ input to the adjustment unit 2 at time nΔt
_{When n} and the control signal C _n output at time nΔt are input, the neural circuit model 11 is trained so as to output the predicted control amount Χ (∼) _{n + p} after pΔt time. , And the controlled variable Χ _sr = (Χ1 _sr , X2 _sr , ..., Xh measured by the controlled object 1 before the transmission dead time rΔt.
_sr ) and the manipulated variable U _{s + q} = (U1 _{s + q} , U2 _{s + q} , ..., U input to the controlled object 1 after the operation delay time qΔt.
g _{s +} _q ) is the input signal, and the controlled variable after pΔt time Χ _{s + p} =
Create learning data using (Χ1 _{s + p} , Χ2 _{s + p} , ..., Χ _{s + p} ) as a teacher signal (s = n−1 + r, n−1 + r).
+1, ..., Np-1). If learning can be performed with this learning data, the adjustment signal C _n obtained by the adjustment unit 2 during control is
When the input is made to the controlled object 1 after the operation delay time qΔt, p
Since the control amount _{An + p} after Δt time is the operation amount approaching the target control amount ^Xd , accurate control can be performed.

【００９５】次に、本発明の他の実施の形態を図２０に
基づき説明する。Next, another embodiment of the present invention will be described with reference to FIG.

【００９６】図２０に示す制御装置では、図１６に示し
た制御装置に操作遅れ時間計測部２０、伝送むだ時間計
測部２２が付加されている。操作遅れ時間計測部２０は
過去ｍ単位時間に調節部２で算出された調節信号の時系
列Ｃ＝（Ｃ_n-m，Ｃ_n-m+1，…，Ｃ_n-1）と、操作部３
から出力された操作量の時系列Ｕ＝（Ｕ_n-m，
Ｕ_n-m+ ₁，…，Ｕ_n-1）に基づき、調節信号が操作部３
に入力されて操作量として出力されるまでの操作遅れ時
間ｑΔｔを計算し、学習部４に出力する。また、伝送む
だ時間計測部２２は過去ｍ単位時間に制御対象１で計測
された制御量の時系列Ｘ＝（Χ_n-m，Ｘ_n-m+1，…，Χ
_n-1）と、調節部２に入力された制御量の時系列Χ´＝
（Χ´_n-m，Ｘ´_n-m+1，…，Χ´_n-1）に基づき、制
御対象１で計測された制御量が調節部２に入力されるま
での伝送むだ時間ｒΔｔを計算し、学習部４に出力す
る。ｍは正の整数である。操作遅れ時間計測部２０が付
加されることにより、操作部３の特性変化によって操作
遅れ時間が変化する場合でも、操作遅れ時間計測部２０
によって変化後の操作遅れ時間ｑΔｔが計算され、学習
部４に出力される。学習部４では操作遅れ時間計測部２
０から入力された操作遅れ時間ｑΔｔに従い、学習デー
タを作成し、学習を行うので、操作遅れ時間が変化する
場合でも正しく制御し続けることができる。また、伝送
むだ時間計測部２２が付加されることにより、伝送部２
１の特性変化によって伝送むだ時間が変化する場合で
も、伝送むだ時間計測部２２によって変化後の伝送むだ
時間ｒΔｔが計算され、学習部４に出力される。学習部
４では伝送むだ時間計測部２２から入力された伝送むだ
時間ｒΔｔに従い、学習データを作成し、学習を行うの
で、伝送むだ時間が変化する場合でも正しく制御し続け
ることができる。In the control device shown in FIG. 20, an operation delay time measuring unit 20 and a transmission dead time measuring unit 22 are added to the control device shown in FIG. The operation delay time measuring unit 20 uses the time series C = (C _nm , C _{n-m + 1} , ..., C _n-1 ) of the adjustment signal calculated by the adjusting unit 2 in the past m unit time, and the operation unit 3
Of the manipulated variable output from U = (U _nm ,
U _{n-m +} ₁ , ..., U _n-1 ) based on the adjustment signal
The operation delay time qΔt from the input to the output to the output as the operation amount is calculated and output to the learning unit 4. Further, the transmission dead time measurement unit 22 has a time series X = (Χ _nm , X _{n-m + 1} , ..., Χ) of the controlled variable measured by the controlled object 1 in the past m unit time.
_n-1 ) and the time series of the control amount input to the adjusting unit 2 Χ '=
Based on (Χ ' _nm , X'n _{-m + 1} , ..., Χ'n _-1 ), the transmission dead time rΔt until the controlled variable measured by the controlled object 1 is input to the adjusting unit 2 is calculated. , To the learning unit 4. m is a positive integer. By adding the operation delay time measuring unit 20, even when the operation delay time changes due to the characteristic change of the operation unit 3, the operation delay time measuring unit 20
The operation delay time qΔt after change is calculated by and is output to the learning unit 4. In the learning unit 4, the operation delay time measuring unit 2
Since learning data is created and learning is performed according to the operation delay time qΔt input from 0, correct control can be continued even when the operation delay time changes. Further, by adding the transmission dead time measurement unit 22, the transmission unit 2
Even if the transmission dead time changes due to the characteristic change of 1, the transmission dead time measuring unit 22 calculates the changed transmission dead time rΔt and outputs it to the learning unit 4. The learning unit 4 creates learning data according to the transmission dead time rΔt input from the transmission dead time measuring unit 22 and performs learning, so that correct control can be continued even when the transmission dead time changes.

【００９７】[0097]

【実施例】本発明を下水処理プロセスの溶存酸素濃度を
制御する下水処理プロセスにおける溶存酸素濃度の制御
装置に適用した実施例を説明する。EXAMPLE An example in which the present invention is applied to a device for controlling the dissolved oxygen concentration in a sewage treatment process for controlling the dissolved oxygen concentration in the sewage treatment process will be described.

【００９８】図２１は本発明の一実施例の制御装置の構
成を示すブロック図である。FIG. 21 is a block diagram showing the configuration of the control device according to an embodiment of the present invention.

【００９９】同図に示すように、下水処理プロセス２３
では単位時間Δｔ毎に制御量である溶存酸素濃度と下水
流入量が計測され、操作量である曝気風量が入力され
る。時刻ｎΔｔに計測された溶存酸素濃度ＤＯ_ｎ、下水
流入量Ｑｉｎｆ_ｎと単位時間前に下水処理プロセス２３
に入力された曝気風量Ｑａｅｒ_n-1が伝送部２４を通し
て調節部２５に入力されるが、伝送部２４は伝送むだ時
間を持つため、伝送むだ時間遅れて調節部２５に入力さ
れる。時刻ｎΔｔに調節部２５に入力される曝気風量を
Ｑａｅｒ´_ｎ、溶存酸素濃度をＤＯ´_ｎ、下水流入量を
Ｑｉｎｆ´_n-1と表す。調節部２５は目標溶存酸素濃度
ＤＯ^ｄ、曝気風量Ｑａｅｒ´_n-1、溶存酸素濃度ＤＯ´
_ｎ、下水流入量Ｑｉｎｆ´_ｎに基づき、調節信号Ｃ_ｎを
算出する。ブロア２６では調節部２５で算出された調節
信号Ｃ_ｎが曝気風量Ｑａｅｒ_ｎに変換され、下水処理プ
ロセス２３に出力される。下水処理プロセス２３にはブ
ロア２６が出力した曝気風量Ｑａｅｒ_ｎが入力される。As shown in the figure, the sewage treatment process 23
Then, the dissolved oxygen concentration which is the control amount and the sewage inflow amount are measured for each unit time Δt, and the aeration air amount which is the operation amount is input. Dissolved oxygen concentration DO _n , sewage inflow amount Qinf _n measured at time nΔt and sewage treatment process 23 before unit time
The aeration air amount Qaer _n-1 input to the control unit 25 is input to the adjustment unit 25 through the transmission unit 24. However, since the transmission unit 24 has a transmission dead time, it is input to the adjustment unit 25 with a delay in the transmission dead time. The aeration air volume input to the control unit 25 at time nΔt is represented by Qaer ′ _n , the dissolved oxygen concentration is represented by DO ′ _n , and the sewage inflow volume is represented by Qinf ′ _n−1 . The control unit 25 controls the target dissolved oxygen concentration DO ^d , the aeration air amount Qaer ′ _n−1 , and the dissolved oxygen concentration DO ′.
_The control signal C _n is calculated based on _n and the sewage inflow amount Qinf ′ _n . In the blower 26, the control signal C _n calculated by the control unit 25 is converted into the aeration air amount Qaer _n and output to the sewage treatment process 23. Aeration amount QAER _n the blower 26 has output is input to the wastewater treatment process 23.

【０１００】操作遅れ時間計測部２７は、過去ｍ単位時
間に調節部２５で算出された調節信号の時系列Ｃ＝（Ｃ
_n-m，Ｃ_n-m+1，…，Ｃ_n-1）と、ブロア２６から出力
された曝気風量の時系列Ｑａｅｒ＝（Ｑａｅｒ_n-m，Ｑ
ａｅｒ_n-m-1，…，Ｑａｅｒ_n-1）に基づき、調節信号
が曝気風量として実現されるまでの操作遅れ時間ｑΔｔ
を算出する。ｍは正の整数である。The operation delay time measuring unit 27 uses the time series C = (C of the adjustment signal calculated by the adjusting unit 25 in the past m unit time.
_nm , C _{n-m + 1} , ..., C _n-1 ) and the time series Qaer = (Qaer _nm , Q) of the aeration air volume output from the blower 26.
based on aer _nm-1 , ..., Qaer _n-1 ), the operation delay time qΔt until the control signal is realized as the aeration air volume.
Is calculated. m is a positive integer.

【０１０１】伝送むだ時間計測部２８は過去ｍ単位時間
に下水処理プロセス２３で観測された溶存酸素濃度の時
系列ＤＯ＝（ＤΟ_１，ＤΟ_２，…，ＤＯ_n-1）と、調節
部２５に入力された溶存酸素濃度の時系列ＤＯ´＝（Ｄ
Ｏ´_１，ＤＯ´_２，…，ＤＯ´_n-1）に基づき、下水処
理プロセス２３で溶存酸素濃度が観測されてから調節部
２５に入力されるまでの伝送むだ時間ｒΔｔを算出す
る。The transmission dead time measuring unit 28 uses the time series DO = (D Ο ₁ , D Ο ₂ , ..., DO _n-1 ) of the dissolved oxygen concentration observed in the sewage treatment process 23 in the past m unit time, and the adjusting unit 25. Time series of dissolved oxygen concentration DO '= (D
Based on O ′ ₁ , DO ′ ₂ , ..., DO ′ _n−1 ), the transmission dead time rΔt from when the dissolved oxygen concentration is observed in the sewage treatment process 23 to when it is input to the adjusting unit 25 is calculated.

【０１０２】学習部２９は下水処理プロセス２３に曝気
風量が入力されてからその曝気風量による影響が溶存酸
素濃度に現れるまでの応答時間ｐΔｔと、操作遅れ時間
計測部２７によって算出された操作遅れ時間ｑΔｔと、
伝送むだ時間計測部２８によって算出された伝送むだ時
間ｒΔｔと、観測データ記憶部３０に記憶された過去の
曝気風量（Ｑａｅｒ_n-l，Ｑａｅｒ_n-l+1，…，Ｑａｅ
ｒ_n-1）と溶存酸素濃度（ＤＯ_n-l，ＤＯ_n-l+1，…，
ＤＯ_n-1）と流入量（Ｑｉｎｆ_n-l，Ｑｉｎｆ_n-l+1，
…，Ｑｉｎｆ_n-1）からなる観測データに基づいて、調
節部２５のパラメータＷを学習によって調節し、調節部
２５へ出力する。The learning unit 29 receives the response time pΔt from the input of the aeration air volume to the sewage treatment process 23 until the influence of the aeration air volume appears in the dissolved oxygen concentration, and the operation delay time calculated by the operation delay time measurement unit 27. qΔt,
The transmission dead time rΔt calculated by the transmission dead time measuring unit 28 and the past aeration air amount (Qaer _nl , Qaer _{n-l + 1} , ..., Qae) stored in the observation data storage unit 30.
r _n-1 ) and the dissolved oxygen concentration (DO _nl , DO _{n-l + 1} , ...,
DO _n-1 ) and the inflow amount (Qinf _nl , Qinf _{n-l + 1} ,
, Qinf _n-1 ) based on the observed data, the parameter W of the adjusting unit 25 is adjusted by learning and output to the adjusting unit 25.

【０１０３】次に、調節部２５が仮想曝気風量算出部、
溶存酸素濃度予測部、曝気風量修正部から構成される場
合について図２２を基に説明する。仮想曝気風量算出部
３１は単位時間前の曝気風量Ｑａｅｒ´_n-1、下水処理
プロセス２３で計測された溶存酸素濃度ＤＯ´_ｎ及び目
標溶存酸素濃度ＤＯ^ｄに基づき、仮想曝気風量Ｑａｅｒ
（＾）_ｎを算出する。この算出の具体的な方法は、例え
ばＫを正の定数として、Next, the adjusting unit 25 is a virtual aeration air volume calculating unit,
The case of being composed of the dissolved oxygen concentration prediction unit and the aeration air volume correction unit will be described with reference to FIG. The virtual aeration air flow rate calculation unit 31 calculates the virtual aeration air flow rate Qaer based on the aeration air flow rate Qaer ′ _n−1 before the unit time, the dissolved oxygen concentration DO ′ _n and the target dissolved oxygen concentration DO ^d measured in the sewage treatment process 23.
(^) Calculate _n . A specific method of this calculation is, for example, when K is a positive constant,

【数１７】のように計算してもよいし、または、[Equation 17] May be calculated as

【数１８】のように単位時間前の曝気風量をそのまま仮想曝気風量
としても良い。(Equation 18) As described above, the amount of aeration air before a unit time may be used as it is as the virtual amount of aeration air.

【０１０４】溶存酸素濃度予測部３２は仮想曝気風量Ｑ
ａｅｒ（＾）_ｎと単位時間前の曝気風量Ｑａｅｒ
´_n-1、下水処理プロセス２３で計測された溶存酸素濃
度ＤＯ´_ｎ、下水流入流量Ｑｉｎｆ´_ｎに基づき、仮想
曝気風量Ｑａｅｒ（＾）_ｎが調節信号Ｃ_ｎとして出力さ
れた場合のｐΔｔ時間後の予測溶存酸素濃度ＤＯ（−）
_n+pを算出する。ｐは下水処理プロセス２３の応答時間
である。The dissolved oxygen concentration predicting unit 32 uses the virtual aeration air quantity Q.
aer (^) _n and aeration amount Qaer before unit time
_'N-1, the dissolved oxygen concentration DO' _n measured in sewage treatment process 23, based on the sewage inlet flow Qinf' _n, pΔt time when the virtual aeration amount Qaer _{(^) n} is outputted as the adjustment signal _{C n} Predicted dissolved oxygen concentration after DO (-)
Calculate _{n + p} . p is the response time of the sewage treatment process 23.

【０１０５】曝気風量修正部３３は、溶存酸素濃度予測
部３２で予測された溶存酸素濃度ＤＯ（−）_n+p及び目
標溶存酸素濃度ＤＯ^ｄに基づき、予測溶存酸素濃度ＤＯ
（−）_n+pが目標溶存酸素濃度ＤＯ^ｄに近付くように仮
想曝気風量Ｑａｅｒ（＾）_ｎを修正する。The aeration air flow rate correction unit 33 calculates the predicted dissolved oxygen concentration DO based on the dissolved oxygen concentration DO (−) _{n + p} and the target dissolved oxygen concentration DO ^d predicted by the dissolved oxygen concentration prediction unit 32.
(-) _{n + p} to modify the virtual aeration air quantity Qaer _{(^) n} so as to approach the target dissolved oxygen concentration DO ^d.

【０１０６】第１スイッチ３４は下水処理プロセス２３
で溶存酸素濃度が計測されると下側に閉じ、仮想曝気風
量算出部３１で算出された仮想曝気風量Ｑａｅｒ（＾）
_ｎを溶存酸素濃度予測部３２に入力し、その後、第１ス
イッチ３４は次の計測時まで上側に閉じて、曝気風量修
正部３３で修正された仮想曝気風量Ｑａｅｒ（＾）_ｎを
溶存酸素濃度予測部３２の入力にし、所定の回数、ある
いは予測溶存酸素濃度ＤＯ（−）_n+pと目標溶存酸素濃
度ＤＯ^ｄの差が所定の値以下になるまで、溶存酸素濃度
予測部３２による溶存酸素濃度の予測と曝気風量修正部
３３による曝気風量の修正が繰り返される。溶存酸素濃
度予測部３２による溶存酸素濃度の予測と曝気風量修正
部３３による曝気風量の修正が所定の回数まで、あるい
は予測溶存酸素濃度ＤＯ（−）_n+pと目標溶存酸素濃度
ＤＯ^ｄの差が所定の値以下になるまで繰り返されると、
第２スイッチ３５が閉じ、仮想曝気風量Ｑａｅｒ（＾）
_ｎが調節信号Ｃ_ｎとしてブロア２６へ出力される。The first switch 34 is for the sewage treatment process 23.
When the dissolved oxygen concentration is measured at, it closes to the lower side, and the virtual aeration air volume Qaer (^) calculated by the virtual aeration air volume calculation unit 31.
_n is input to the dissolved oxygen concentration predicting unit 32, and then the first switch 34 is closed upward until the next measurement, and the virtual aeration air amount Qaer (^) _n corrected by the aeration air amount correction unit 33 is used as the dissolved oxygen concentration. Dissolved oxygen by the dissolved oxygen concentration predicting unit 32 is input to the predicting unit 32 until a predetermined number of times or until the difference between the predicted dissolved oxygen concentration DO (−) _{n + p} and the target dissolved oxygen concentration DO ^d becomes a predetermined value or less. The prediction of the concentration and the correction of the aeration air volume by the aeration air volume correction unit 33 are repeated. The dissolved oxygen concentration prediction unit 32 predicts the dissolved oxygen concentration and the aeration air amount correction unit 33 corrects the aeration air amount up to a predetermined number of times, or the difference between the predicted dissolved oxygen concentration DO (−) _{n + p} and the target dissolved oxygen concentration DO ^d . Is repeated until the value becomes less than or equal to a predetermined value,
The second switch 35 is closed, and the virtual aeration air amount Qaer (^)
_n is output to the blower 26 as a control signal C _n .

【０１０７】次に、溶存酸素濃度予測部３２が多層神経
回路モデルから構成される場合について説明する。図２
３は溶存酸素濃度予測部３２の一例の説明図である。溶
存酸素濃度予測部３２には仮想曝気風量Ｑａｅｒ（＾）
_ｎ、溶存酸素濃度ＤＯ´_ｎ、下水流入量Ｑｉｎｆ´_ｎ、
単位時間前の曝気風量Ｑａｅｒ´_n-1が入力される。多
段に組まれた遅れ時間要素３７によって時間遅れが生
じ、時刻（ｎ−ｍ_aer＋１）Δｔから時刻（ｎ−１）Δ
ｔまでの曝気風量（Ｑａｅｒ´_n-ｍaer+1，Ｑａｅｒ´
_n-ｍaer+2，…，Ｑａｅｒ´_n-1）、時刻（ｎ−ｍ_do＋
１）Δｔから時刻ｎΔｔまでの溶存酸素濃度（ＤＯ´
_n-ｍdo+1，ＤＯ´_n-ｍdo+2，…，ＤＯ´_ｎ）、時刻（ｎ
−ｍ_inf＋１）Δｔから時刻ｎΔｔまでの下水流入量
（Ｑｉｎｆ´_n- _ｍinf+1，Ｑｉｎｆ´_n-ｍinf+2，…，
Ｑｉｎｆ´_ｎ）、仮想曝気風量Ｑａｅｒ（＾）_ｎが神経
回路モデル３６に入力される。ｍ_aer，ｍ_do、ｍ_infは
正の整数である。神経回路モデル３６はこれらの入力信
号から前向き計算によって時刻（ｎ＋ｐ）Δｔの予測溶
存酸素濃度ＤＯ（−）_n+pを算出する。Next, the case where the dissolved oxygen concentration predicting unit 32 is composed of a multi-layer neural circuit model will be described. FIG.
3 is an explanatory diagram of an example of the dissolved oxygen concentration predicting unit 32. The dissolved oxygen concentration predicting unit 32 has a virtual aeration air amount Qaer (^).
_n , dissolved oxygen concentration DO ′ _n , sewage inflow amount Qinf ′ _n ,
The aeration air amount Qaer ' _n-1 before the unit time is input. A time delay occurs due to the delay time elements 37 arranged in multiple _stages , and the time (n− _maer + 1) Δt to the time (n−1) Δ.
Aeration amount up to t (Qaer ' _{n-maer + 1} , Qaer'
_{n-maer + 2} , ..., Qaer ' _n-1 ), time (n-m _do +
1) Dissolved oxygen concentration (DO ′) from Δt to time nΔt
_{n-mdo + 1} , DO ' _{n-mdo + 2} , ..., DO' _n ), time (n
−m _inf +1) Δt to time nΔt sewage inflow (Qinf ′ _n- _{minf + 1} , Qinf ′ _{n-minf + 2} , ...,
Qinf ′ _n ) and the virtual aeration amount Qaer (^) _n are input to the neural circuit model 36. m _aer , m _do , and m _inf are positive integers. The neural circuit model 36 calculates the predicted dissolved oxygen concentration DO (-) _{n + p} at time (n + p) [Delta] t from these input signals by forward calculation.

【０１０８】溶存酸素濃度予測部３２の構造は他にも考
えられる。例えば、図２４のように、時刻（ｎ＋ｐ）Δ
ｔの予測溶存酸素濃度ＤＯ（−）_n+pの時刻ｎΔｔの溶
存酸素濃度ＤＯ´_ｎからの変化量ΔＤＯ（−）_n+pを神
経回路モデルで算出し、その算出された変化量と時刻ｎ
Δｔの溶存酸素濃度ＤΟ´_ｎが加算され、時刻（ｎ＋
ｐ）Δｔの予測溶存酸素濃度ＤＯ（−）_n+pとして出力
されるものでもよい。Other structures of the dissolved oxygen concentration predicting section 32 are possible. For example, as shown in FIG. 24, time (n + p) Δ
t prediction dissolved oxygen concentration DO (-) n + _p of the time variation of the dissolved oxygen concentration DO' _n of n.DELTA.t Delta] DO (-) of n + _p is calculated by the neural network model, the calculated amount of change and time n
Dissolved oxygen concentration Diomikuron' _n of Δt is added, the time (n +
p) The predicted dissolved oxygen concentration DO (−) _{n + p} of Δt may be output.

【０１０９】曝気風量修正部３３は、溶存酸素濃度予測
部３２で算出された予測溶存酸素濃度ＤＯ（−）_n+pと
目標溶存酸素濃度ＤＯ^ｄに基づき、予測溶存酸素濃度Ｄ
Ｏ（−）_n+pが目標溶存酸素濃度ＤＯ^ｄに近付くように
仮想曝気風量Ｑａｅｒ（＾）_ｎを修正する。溶存酸素濃
度予測部３２が神経回路モデル３６から構成される場合
には、曝気風量修正部３３は溶存酸素濃度予測部３２の
神経回路モデル３６と同一の構成を持ち、神経回路モデ
ル３６の誤差逆伝播計算によって仮想曝気風量Ｑａｅｒ
（＾）_ｎの修正量を求め、修正を行う。The aeration air flow rate correction unit 33 calculates the predicted dissolved oxygen concentration D based on the predicted dissolved oxygen concentration DO (-) _{n + p} calculated by the dissolved oxygen concentration prediction unit 32 and the target dissolved oxygen concentration DO ^d.
O (-) _{n + p} modifies the virtual aeration amount Qaer _{(^) n} to approach the target dissolved oxygen concentration DO ^d. When the dissolved oxygen concentration prediction unit 32 is composed of the neural circuit model 36, the aeration air volume correction unit 33 has the same configuration as the neural circuit model 36 of the dissolved oxygen concentration prediction unit 32, and the error inverse of the neural circuit model 36 is reversed. Virtual aeration air volume Qaer by propagation calculation
(^) _Obtain the correction amount of _n and make corrections.

【０１１０】次に、学習部２９の説明をする。溶存酸素
濃度予測部３２が神経回路モデル３６から構成される場
合に、学習部２９は下水処理プロセス２３の応答時間ｐ
Δｔと、操作遅れ時間計測部２７によって計測された操
作遅れ時間ｑΔｔと、伝送むだ時間計測部２８によって
計測された伝送むだ時間ｒΔｔと、観測データ記憶部３
０に記憶された過去の溶存酸素濃度、下水流入量、曝気
風量からなる観測データに基づいて学習データを作成
し、溶存酸素濃度予測部３２を構成する神経回路モデル
３６の学習を行う。Next, the learning section 29 will be described. When the dissolved oxygen concentration predicting unit 32 is composed of the neural circuit model 36, the learning unit 29 causes the response time p of the sewage treatment process 23.
Δt, the operation delay time qΔt measured by the operation delay time measuring unit 27, the transmission dead time rΔt measured by the transmission dead time measuring unit 28, and the observation data storage unit 3
Learning data is created on the basis of the observation data composed of the past dissolved oxygen concentration, the sewage inflow amount, and the aeration air amount stored in 0, and the neural network model 36 constituting the dissolved oxygen concentration prediction unit 32 is learned.

【０１１１】観測データ記憶部３０に記憶された観測デ
ータは過去に下水処理プロセス２３に入力された曝気風
量と下水処理プロセス２３で計測された溶存酸素濃度、
下水流入量から構成される。神経回路モデル３６の学習
に観測データを使い、入力信号（Ｑａｅｒ_ｓ，Ｑａｅｒ
_s-1，…，Ｑａｅｒ_s-ｍaer+1，ＤＯ_ｓ，ＤＯ_s-1，
…，ＤＯ_s-ｍdo+1，Ｑｉｎｆ_ｓ，Ｑｉｎｆ_s-1，…，Ｑ
ｉｎｆ_s-ｍinf+1）、出力信号ＤＯ_s+p（Ｓ＝ｎ−ｌ＋
ｍ_max，ｎ−ｌ＋ｍ_max＋１，…，ｎ−ｐ−１）（ｍ
_maxはｍ_aer，ｍ_do，ｍ_infの最大値）を学習データと
して学習を行うと、制御時に、溶存酸素濃度予測部３２
では伝送むだ時間ｒΔｔ前に観測された溶存酸素濃度、
下水流入量、曝気風量を現在観測された値として、応答
時間ｐΔｔ後の溶存酸素濃度を予測してしまい、さら
に、この予測溶存酸素濃度ＤＯ_n+pが目標溶存酸素濃度
ＤＯ^ｄに近付くように求められた曝気風量Ｑａｅｒ_ｎは
時刻ｎΔｔに下水処理プロセス２３に入力される曝気風
量であるが、実際に下水処理プロセス２３に入力される
のは操作遅れ時間ｑΔｔ後になるので、正確な制御がで
きない。The observation data stored in the observation data storage unit 30 are the aeration air volume input to the sewage treatment process 23 in the past and the dissolved oxygen concentration measured in the sewage treatment process 23.
It is composed of sewage inflow. Use the observation data to the learning of the neural circuit model 36, the input signal _(Qaer s, Qaer
_s-1 , ..., Qaer _{s-maer + 1} , DO _s , DO _s-1 ,
…, DO _{s-mdo + 1} , Qinf _s , Qinf _s-1 ,…, Q
inf _s-m inf _{+ 1} ), output signal DO _{s + p} (S = n-1 +
m _max , n-l + m _max +1, ..., n-p-1) (m
_{If max} is the maximum value of m _aer , m _do , and m _inf ) and the learning data is used as the learning data, the dissolved oxygen concentration predicting unit 32 is controlled.
Then, the dissolved oxygen concentration observed before the transmission dead time rΔt,
The dissolved oxygen concentration after the response time pΔt is predicted by using the sewage inflow and the aeration air flow as the currently observed values, and further, the predicted dissolved oxygen concentration DO _{n + p} approaches the target dissolved oxygen concentration DO ^d. Although aeration amount QAER _n obtained is aeration amount input at time nΔt sewage treatment process 23, since the inputted actual sewage treatment process 23 is after the operation delay time Qderutati, it can not be accurately controlled .

【０１１２】本発明では、時刻ｎΔｔに調節部２５に入
力される溶存酸素濃度ＤＯ´_ｎ、下水流入量Ｑｉｎｆ´
_ｎと曝気風量Ｑａｅｒ´_n-1と、調節部２５から出力さ
れる調節信号Ｃ_ｎを溶存酸素濃度予測部３２に入力する
と、下水処理プロセス２３の応答時間ｐΔｔ後の予測溶
存酸素濃度ＤＯ_n+pを出力するように神経回路モデル３
６を学習するため、操作遅れ時間計測部２７によって操
作遅れ時間ｑΔｔを算出し、伝送むだ時間計測部２８に
よって伝送むだ時間ｒΔｔを算出し、操作遅れ時間ｑΔ
ｔ後に下水処理プロセス２３に入力される曝気風量Ｑａ
ｅｒ_s+qと（ｒ＋１）Δｔ前から（ｒ＋ｍ_aer−１）Δ
ｔ前まで下水処理プロセス２３に入力された曝気風量
（Ｑａｅｒ_s-r-1，Ｑａｅｒ_s-r-2，…，Ｑａｅｒ
_s-r-ｍaer＋１）とｒΔｔ前から（ｒ＋ｍ_do−１）Δｔ
前まで下水処理プロセス２３で計測された溶存酸素濃度
（ＤＯ_s-r，ＤＯ_s-r-1，…，ＤΟ_s-r-ｍdo+1）と下水
流入量（Ｑｉｎｆ_s-r，Ｑｉｎｆ_s-r-1，…，Ｑｉｎｆ
_s-r-ｍinf+1）を入力信号、応答時間ｐΔｔ後の溶存酸
素濃度ＤＯ_s+pを出力信号にした学習データを作成する
（ｓ＝ｎ−ｌ＋ｒ＋ｍ_aer，…ｎ−ｐ−１）。この学習
データで学習ができれば、制御時に調節部２５で求めら
れた調節信号Ｃ_ｎは、ブロア２６による操作遅れ時間後
に下水処理プロセス２３に入力された時に溶存酸素濃度
が目標溶存酸素濃度に近付く曝気風量なので、正確な制
御ができる。In the present invention, the dissolved oxygen concentration DO ′ _n and the sewage inflow amount Qinf ′ input to the control unit 25 at time nΔt.
_{When n} , the aeration air amount Qaer ′ _n−1, and the control signal C _n output from the control unit 25 are input to the dissolved oxygen concentration prediction unit 32, the predicted dissolved oxygen concentration DO _{n +} after the response time pΔt of the sewage treatment process 23 is input. Neural circuit model 3 to output _p
6, the operation delay time measuring unit 27 calculates the operation delay time qΔt, the transmission dead time measuring unit 28 calculates the transmission dead time rΔt, and the operation delay time qΔ is calculated.
Aeration air volume Qa input to the sewage treatment process 23 after t
er _{s + q} and (r + 1) Δt before (r + _maer −1) Δ
The amount of aeration air (Qaer _sr-1 , Qaer _sr-2 , ..., Qaer input to the sewage treatment process 23 until t before
_sr-maer +1) and rΔt before (r + m _do -1) Δt
Dissolved oxygen concentration (DO _sr , DO _sr-1 , ..., D _{sr-mdo + 1} ) and sewage inflow (Qinf _sr , Qinf _sr-1 , ..., Qinf) measured in the sewage treatment process 23 until before.
_{sr-minf + 1} ) is used as an input signal, and learning data using the dissolved oxygen concentration DO _{s + p} after the response time pΔt as an output signal is created (s = n−1 + r + _maer , ... _Np −1). If learning can be performed with this learning data, the control signal C _n obtained by the control unit 25 at the time of control is such that the dissolved oxygen concentration approaches the target dissolved oxygen concentration when input to the sewage treatment process 23 after the operation delay time by the blower 26. Because of the air volume, accurate control is possible.

【０１１３】次に、制御装置全体の計算手順を図２５に
示す流れ図を参照して説明する。図２５の流れ図では、
時刻０から時刻ΝΔｔまで制御するように書いている
が、この計算手順は単位時間Δｔ毎に同じ手続きが繰り
返されるので、ここでは時刻ｎΔｔにおける曝気風量を
求める手続きを示す。Next, the calculation procedure of the entire control device will be described with reference to the flowchart shown in FIG. In the flow chart of FIG. 25,
Although written so as to control from time 0 to time ΔΔt, this calculation procedure repeats the same procedure every unit time Δt, and therefore the procedure for obtaining the aeration air amount at time nΔt is shown here.

【０１１４】（ａ）前もって設定された目標溶存酸素濃
度ＤＯ^ｄと観測された溶存酸素濃度ＤＯ´_ｎ，、単位時
間前の曝気風量Ｑａｅｒ´_n-1が仮想曝気風量算出部３
１に入力され、仮想曝気風量Ｑａｅｒ（＾）_ｎが出力さ
れる。(A) The target dissolved oxygen concentration DO ^d set in advance, the observed dissolved oxygen concentration DO ' _n , and the aeration air amount Qaer' _n-1 before the unit time are the virtual aeration air amount calculation unit 3
1 and the virtual aeration air amount Qaer (^) _n is output.

【０１１５】（ｂ）第１スイッチ３４が下側に閉じ、
（ａ）で求められた仮想曝気風量Ｑａｅｒ（＾）_ｎと、
単位時間前の曝気風量Ｑａｅｒ´_n-1、溶存酸素濃度Ｄ
Ｏ´_ｎ、下水流入量Ｑｉｎｆ´_ｎが溶存酸素濃度予測部
３２に入力され、神経回路モデル３６の前向き計算によ
って、応答時間ｐΔｔ後の予測溶存酸素濃度ＤＯ（−）
_n+ _pが出力される。(B) The first switch 34 is closed downward,
The virtual aeration air amount Qaer (^) _n obtained in (a),
Aeration amount Qaer ' _n-1 before unit time, dissolved oxygen concentration D
O ′ _n and the sewage inflow amount Qinf ′ _n are input to the dissolved oxygen concentration prediction unit 32, and the predicted dissolved oxygen concentration DO (−) after the response time pΔt is calculated by the forward calculation of the neural circuit model 36.
_{n +} _p is output.

【０１１６】（ｃ）（ｂ）で求められた予測溶存酸素濃
度ＤＯ（−）_n+pと目標溶存酸素濃度ＤＯ^ｄが曝気風量
修正部３３に入力され、神経回路モデル３６の誤差逆伝
播計算によって仮想曝気風量の修正量が求められ、修正
された仮想曝気風量Ｑａｅｒ（＾）_ｎが出力される。The predicted dissolved oxygen concentration DO (−) _{n + p} and the target dissolved oxygen concentration DO ^d obtained in (c) and (b) are input to the aeration air volume correction unit 33, and the error back propagation calculation of the neural circuit model 36 is performed. The correction amount of the virtual aeration air amount is obtained by the above, and the corrected virtual aeration air amount Qaer (^) _n is output.

【０１１７】（ｄ）所定の回数、または目標溶存酸素濃
度ＤＯ^ｄと予測溶存酸素濃度ＤＯ（−）_n+pとの差が所
定の範囲内になるまで、第１スイッチ３４が上側に閉
じ、修正された仮想曝気風量Ｑａｅｒ_ｎが再び溶存酸素
濃度予測部に入力され、（ｂ）（ｃ）の手続きが繰り返
される。(D) The first switch 34 is closed to the upper side for a predetermined number of times or until the difference between the target dissolved oxygen concentration DO ^d and the predicted dissolved oxygen concentration DO (-) _{n + p} is within a predetermined range, Fixed virtual aeration amount QAER _n is input to the dissolved oxygen concentration predicting section again are repeated procedures (b) (c).

【０１１８】（ｅ）（ｄ）の条件が満たされると第２ス
イッチ３５が閉じ、仮想曝気風量Ｑａｅｒ（＾）_ｎが調
節信号Ｃ_ｎとしてブロア２６に入力され、ブロア２６か
ら下水処理プロセス２３に曝気風量Ｑａｅｒ_ｎが入力さ
れる。When the conditions (e) and (d) are satisfied, the second switch 35 is closed, and the virtual aeration air amount Qaer (^) _n is input to the blower 26 as the control signal C _n , and the blower 26 informs the sewage treatment process 23. aeration air quantity Qaer _n is input.

【０１１９】この（ａ）から（ｅ）までの手続きが単位
時間Δｔ毎に繰り返される。The procedure from (a) to (e) is repeated every unit time Δt.

【０１２０】次に神経回路モデル３６の前向き計算、誤
差逆伝播計算、学習計算について説明する。溶存酸素濃
度予測部３２が多層神経回路モデルから構成される場
合、溶存酸素濃度の予測値は多層神経回路モデルの前向
き計算で求められ、曝気風量の修正量は多層神経回路モ
デルの誤差逆伝播計算で求められる。Next, the forward calculation, the error backpropagation calculation, and the learning calculation of the neural network model 36 will be described. When the dissolved oxygen concentration predictor 32 is composed of a multilayer neural network model, the predicted value of the dissolved oxygen concentration is obtained by the forward calculation of the multilayer neural network model, and the correction amount of the aeration air volume is the error back propagation calculation of the multilayer neural network model. Required by.

【０１２１】では、まず前向き計算について説明する。
ここでは図２３に基づいて前向き計算を説明する。First, the forward calculation will be described.
Here, the forward calculation will be described with reference to FIG.

【０１２２】入力層のユニットには仮想曝気風量Ｑａｅ
ｒ（＾）_ｎ、時刻（ｎ−１）Δｔから時刻（ｎ−ｍ_aer
＋１）Δｔまでの曝気風量（Ｑａｅｒ´_n-1，Ｑａｅｒ
´_n- ₂，…，Ｑａｅｒ´_n-ｍaer+1）、時刻ｎΔｔから
時刻（ｎ−ｍ_do＋１）Δｔまでの溶存酸素濃度（ＤＯ´
_ｎ，ＤＯ´_n-1，…，ＤＯ´_n-ｍdo+1）、時刻ｎΔｔか
ら時刻（ｎ−ｍ_inf＋１）Δｔまでの下水流入量（Ｑｉ
ｎｆ´_ｎ，Ｑｉｎｆ´_n-1，…，Ｑｉｎｆ´
_{n −minf ＋1}）が１ユニットに１信号ずつ入力され、
時刻（ｎ＋ｐ）Δｔの溶存酸素濃度の予測値ＤＯ_n+pが
出力される。ｍ_do，ｍ_inf，ｍ_aerは各々正の整数を表
す。この入力信号から出力信号を計算する前向き計算を
式に示す。The unit of the input layer has a virtual aeration air quantity Qae.
r (^) _n , from time (n-1) [Delta] t to time (n- _maer)
Aeration volume up to +1) Δt (Qaer ' _n-1 , Qaer
_{_{_{'N- 2, ..., Qaer' n}}} -maer + 1), dissolved oxygen concentration from time nΔt time (n-m _do +1) to Δt (DO'
_n , DO ' _n-1 , ..., DO' _{n-mdo + 1} ), from the time nΔt to the time (n-m _inf +1) Δt, the sewage inflow (Qi
nf ' _n , Qinf' _n-1 , ..., Qinf '
_{n −minf +1} ) is input to one unit one signal at a time,
The predicted value DO _{n + p} of the dissolved oxygen concentration at time (n + p) Δt is output. m _do , m _inf and m _aer each represent a positive integer. The formula shows the forward calculation for calculating the output signal from this input signal.

【０１２３】ｘｉ_ｉを入力層の第ｉ番目のユニットの入
力値とすると、If xi _i is an input value of the i-th unit of the input layer,

【数１９】入力層ユニットの入出力関数は恒等関数とする。入力層
ユニットの出力ｙｉ_ｉ（ｉ＝１，…，Νｉ）は次のよう
に表される。[Equation 19] The input / output function of the input layer unit is the identity function. The output yi _i (i = 1, ..., N i) of the input layer unit is expressed as follows.

【０１２４】[0124]

【数２０】 Νｉは入力層ユニット数である（Ｎｉ＝ｍ_do＋ｍ_aer＋
ｍ_inf）。次に、中間層の第ｊ番ユニットの入力値ｘｈ
_ｊは入力層ユニットの出力値を入力層一中間層間の結合
荷重値で重み付けした合計から中間層ユニットの閾値ｔ
ｈｈ_ｊを引いた値になる。(Equation 20) Νi is the number of input layer units (Ni = m _do + m _aer +
m _inf ). Next, the input value xh of the j-th unit in the middle layer
_j is the threshold value t of the intermediate layer unit from the sum of the output value of the input layer unit weighted by the coupling weight value between the input layer and the intermediate layer
It is a value obtained by subtracting hh _j .

【０１２５】[0125]

【数２１】ｗ１_j,iは入力層第ｉ番ユニット中間層第ｊ番ユニット
間の結合荷重値、ｔｈｈ_ｊは中間層第ｊ番ユニットの閾
値、Νｈは中間層のユニット数を表す。(Equation 21) w1 _{j, i} is the coupling load value between the input layer i-th unit and the intermediate layer j-th unit, thh _j is the threshold value of the intermediate layer j-th unit, and Νh is the number of units of the intermediate layer.

【０１２６】中間層ユニットの入出力関数をシグモイド
型関数The input / output function of the intermediate layer unit is a sigmoid type function.

【数２２】とすると、中間層ユニットの出力力値ｙｈ_ｊは入力値ｘ
ｈ_ｊを使い、(Equation 22) Then, the output force value yh _j of the intermediate layer unit is the input value x
use h _j ,

【数２３】と表せる。ｙｈ_ｊは中間層第ｊ番ユニットの出力値であ
る。出力層の第ｋ番ユニットの入力値ｘｏ_ｋ（ｋ＝１）
は中間層ユニットの出力値を中間層一出力層間の結合荷
重値で重み付けした合計から出力層ユニットの閾値を引
いた値になる。(Equation 23) Can be expressed as yh _j is the output value of the j-th unit in the middle layer. Input value xo _k (k = 1) of the kth unit in the output layer
Is the value obtained by subtracting the threshold value of the output layer unit from the sum of the output value of the intermediate layer unit weighted by the coupling weight value between the intermediate layer and the output layer.

【０１２７】[0127]

【数２４】ｗ２_k,jは中間層第ｊ番ユニット−出力層第ｋ番ユニッ
ト間の結合荷重値、ｔｈｏ_ｋは出力層第ｋ番ユニットの
閾値である。出力層ユニットの入出力関数は恒等関数と
する。(Equation 24) w2 k, _j is the intermediate layer j-th unit - bond load value between the output layer the k-th unit, tho _k is the threshold of the k-th unit output layer. The input / output function of the output layer unit is an identity function.

【０１２８】[0128]

【数２５】この出力層ユニットの出力値が予測溶存酸素濃度ＤＯ
（−）_n+pである。(Equation 25) The output value of this output layer unit is the predicted dissolved oxygen concentration DO
(−) _{N + p} .

【０１２９】[0129]

【数２６】以上が３層型神経回路モデルの前向き計算の計算式であ
る。(Equation 26) The above is the calculation formula for the forward calculation of the three-layer neural circuit model.

【０１３０】次に、誤差逆伝播計算について説明する。
溶存酸素濃度予測部３２によって予測された溶存酸素濃
度ＤＯ（−）_n+pと、目標溶存酸素濃度ＤＯ^ｄが曝気風
量修正部３３に入力されると、曝気風量修正部３３で
は、仮想曝気風量の修正量を計算し、修正する。溶存酸
素濃度予測部３２が神経回路モデル３６から構成される
場合には、仮想曝気風量の修正量は神経回路モデル３６
の誤差逆伝播計算によって計算できる。この曝気風量修
正部３３の計算式を示す。Next, the error back propagation calculation will be described.
When the dissolved oxygen concentration DO (−) _{n + p} predicted by the dissolved oxygen concentration prediction unit 32 and the target dissolved oxygen concentration DO ^d are input to the aeration air amount correction unit 33, the aeration air amount correction unit 33 causes the virtual aeration air amount. Calculate the correction amount of and correct it. When the dissolved oxygen concentration predicting unit 32 is composed of the neural circuit model 36, the correction amount of the virtual aeration volume is the neural circuit model 36.
It can be calculated by the error back-propagation calculation. The calculation formula of the aeration air volume correction unit 33 is shown below.

【０１３１】まず、溶存酸素濃度予測部３２によって予
測された溶存酸素濃度ＤＯ（−）_n+ _pと、目標溶存酸素
濃度ＤＯ^ｄとの誤差から誤差関数Ｅを次のように定義す
る。[0131] First, the dissolved oxygen concentration predicting section 32 by the predicted dissolved oxygen concentration DO (-) and n ₊ _p, the error function E from the error between the target dissolved oxygen concentration DO ^d is defined as follows.

【０１３２】[0132]

【数２７】次に、この誤差関数を減少させるユニット出力値の修正
量を、出力層から入力層へと層毎に順に求めていき、最
終的には入力信号の修正量を求める。まず、出力層ユニ
ットの出力値の修正量は、正の定数ηを使って[Equation 27] Next, the correction amount of the unit output value that reduces this error function is sequentially obtained for each layer from the output layer to the input layer, and finally the correction amount of the input signal is obtained. First, the correction amount of the output value of the output layer unit is calculated by using the positive constant η.

【数２８】と計算される。次に中間層ユニットの出力値の修正量
は、出力層ユニットの修正量を使って次のように表され
る。[Equation 28] Is calculated. Next, the correction amount of the output value of the intermediate layer unit is expressed as follows using the correction amount of the output layer unit.

【０１３３】[0133]

【数２９】同様に入力層ユニットの出力値の修正量は、中間層ユニ
ットの修正量を使って、(Equation 29) Similarly, the correction amount of the output value of the input layer unit, using the correction amount of the middle layer unit,

【数３０】と表される。ｆ´（ｘ）はシグモイド関数ｆ（ｘ）の導
関数を表していて、具体的には、[Equation 30] It is expressed as f ′ (x) represents the derivative of the sigmoid function f (x), and specifically,

【数３１】と表せる。入力層ユニットの入出力関数は恒等関数であ
るから、出力値の修正量はそのまま入力値の修正量にな
り、仮想曝気風量Ｑａｅｒ（＾）_ｎの修正量ΔＱａｅ
ｒ，_ｎは、(Equation 31) Can be expressed as Since the input / output function of the input layer unit is the identity function, the correction amount of the output value becomes the correction amount of the input value as it is, and the correction amount ΔQae of the virtual aeration air amount Qaer (^) _n.
r and _n are

【数３２】と表現される。この修正量が仮想曝気風量Ｑａｅｒ
（＾）_ｎに加算される。以上が誤差逆伝播計算と、その
計算によって実現される仮想曝気風量の修止である。(Equation 32) Is expressed as This correction amount is the virtual aeration air amount Qaer
(^) It is added to _n . The above is the error backpropagation calculation and the virtual aeration air volume stop achieved by the calculation.

【０１３４】次に、多層神経回路モデルの学習について
図２６をもとに説明する。学習は下水処理プロセスの緩
やかな特性変化に対し、溶存酸素濃度予測部３２が適応
するために行われる。Next, learning of the multilayer neural network model will be described with reference to FIG. The learning is performed because the dissolved oxygen concentration prediction unit 32 adapts to the gradual change in characteristics of the sewage treatment process.

【０１３５】図２６において、３０は観測データ記憶
部、３８は減算器、３９は誤差逆伝播学習器、４０は神
経回路モデル３６の入力層ユニットの出力値が誤差逆伝
播学習器３９に入力されることを示す矢印、４１は神経
回路モデル３６の中間層ユニットの出力値が誤差逆伝播
学習器３９に入力されることを示す矢印、４２は神経回
路モデル３６の中間層一出力層間の結合荷重値が誤差逆
伝播学習器３９に入力されることを示す矢印、４３は誤
差逆伝播学習器３９で計算された入力層一中間層間の結
合荷重値と中間層ユニットの閾値の修正量を各結合荷
重、中間層ユニットに出力し、各結合荷重値と閾値の修
正を行うことを示す矢印、４４は誤差逆伝播学習器３９
で計算された中間層一出力層間の結合荷重値と出力層ユ
ニットの閾値の修正量を各結合荷重、出力層ユニットに
出力し、各結合荷重値と閾値の修正を行うことを示す矢
印を表す。In FIG. 26, 30 is an observation data storage unit, 38 is a subtractor, 39 is an error backpropagation learning device, and 40 is an output value of the input layer unit of the neural network model 36, which is input to the error backpropagation learning device 39. Arrow 41 indicating that the output value of the intermediate layer unit of the neural network model 36 is input to the error backpropagation learning device 39, and 42 is the coupling load between the intermediate layer and the output layer of the neural circuit model 36. An arrow indicates that the value is input to the error backpropagation learning device 39, and 43 is a connection weight value between the input layer and the intermediate layer calculated by the error backpropagation learning device 39 and a correction amount of the threshold value of the intermediate layer unit, which are combined. The weights are output to the intermediate layer unit, and the arrows indicate that the weight values and threshold values are corrected, and 44 is the error backpropagation learning device 39.
Outputs the combined load value between the middle layer and the output layer and the correction amount of the threshold value of the output layer unit, which are calculated in, to each combined load and output layer unit, and represents an arrow indicating that each combined load value and the threshold value are corrected. .

【０１３６】過去の時刻ｓΔｔ（ｓ＝ｎ−ｌ＋ｍ_aex，
…，ｎ−ｐ−１）について、時刻（ｓ＋ｑ）Δｔの曝気
風量Ｑａｅｒ_s+qと時刻（ｓ−ｒ−ｍ_aer＋１）Δｔか
ら時刻（ｓ−ｒ−１）Δｔまでの曝気風量（Ｑａｅｒ
_s-r-ｍaer+1，Ｑａｅｒ_s-r- _ｍaer+2，…，Ｑａｅｒ
_s-r-1）、時刻（ｓ−ｒ−ｍ_do＋１）Δｔから時刻（ｓ
−ｒ）Δｔまでの溶存酸素濃度（ＤΟ_s-r-ｍdo+1，ＤＯ
_s-r-ｍdo+2，…，ＤＯ_s- _r）時刻（ｓ−ｒ−ｍ_inf＋
１）Δｔから時刻（ｓ−ｒ）Δｔまでの下水流入量（Ｑ
ｉｎｆ_{s-r-ｍinf+1 ，}Ｑｉｎｆ_s-r-ｍinf+2，…，Ｑｉ
ｎｆ_s-r）が入力信号として、観測データ記憶部３０か
ら神経回路モデル３６に出力される。神経回路モデル３
６では上記入力信号に対し、前向き計算を行い、出力信
号ＤＯ（−）_s+pを出力する。減算器３８では観測デー
タ記憶部３０から教師信号として出力された応答時間後
の溶存酸素濃度ＤＯ_s+pと神経回路モデル３６の出力信
号ＤＯ（−）_s+pとの差分ＤＯ_s+p−ＤΟ（−）_s+pが
計算され、誤差逆伝播学習器３９に入力される。誤差逆
伝播学習器３９には差分の他に、神経回路モデル３６の
入力層ユニットの各出力値、中間層ユニットの各出力
値、中間層一出力層間の各結合荷重値が入力され、結合
荷重値、閾値の修正量が計算される。Past time sΔt (s = n-1 + _maex ,
..., for the n-p-1), time (s + q) aeration amount of Δt Qaer _{s + q} and time _{(s-r-m aer +1} ) time from Δt (s-r-1) aeration air flow rate of up to Δt (Qaer
_{_{sr-maer + 1, Qaer sr-}} maer + 2, ..., Qaer
_sr-1 ), time (s-r-m _do +1) Δt to time (s
-R) Dissolved oxygen concentration up to Δt (D _{sr-mdo + 1} , DO
_{sr-mdo + 2} , ..., DO _s- _r ) Time (s-r-m _inf +
1) Sewage inflow (Q) from Δt to time (s−r) Δt
inf _{sr-minf + 1,} Qinf _{sr-minf + 2} , ..., Qi
nf _sr ) is output from the observation data storage unit 30 to the neural circuit model 36 as an input signal. Neural circuit model 3
At 6, the forward calculation is performed on the input signal, and the output signal DO (-) _{s + p} is output. In the subtractor 38, the difference DO _{s + p} − between the dissolved oxygen concentration DO _{s + p} after the response time output from the observation data storage unit 30 as a teacher signal and the output signal DO (−) _{s + p} of the neural circuit model 36. DΟ (−) _{s + p} is calculated and input to the error backpropagation learning device 39. In addition to the difference, the output value of the input layer unit of the neural network model 36, each output value of the intermediate layer unit, and each coupling weight value between the intermediate layer and one output layer are input to the error backpropagation learning unit 39, and the coupling weight is input. The value and the correction amount of the threshold value are calculated.

【０１３７】減算器３８では教師信号ＤＯ_s+pと神経回
路モデル３６の出力信号ＤＯ（−）_s+pとの差分から誤
差関数Ｅが次のように定義される。In the subtractor 38, the error function E is defined as follows from the difference between the teacher signal DO _{s + p} and the output signal DO (−) _{s + p} of the neural circuit model 36.

【０１３８】[0138]

【数３３】誤差逆伝播学習法に従って、神経回路モデル３６の結合
荷重値、閾値の修正量は次のように計算される。[Equation 33] According to the error backpropagation learning method, the connection weight value of the neural network model 36 and the correction amount of the threshold value are calculated as follows.

【０１３９】[0139]

【数３４】 Δｗ２_k,jは中間層第ｊ番ユニット−出力層第ｋ番ユニ
ット間の結合荷重値ｗ２_k,jの修正量、Δｔｈｏ_ｋは出
力層第ｋ番ユニットの閾値ｔｈｏ_ｋの修正量、Δｗ１
_j,iは入力層第ｉ番ユニット−中間層第ｊ番ユニット間
の結合荷重値ｗ１_j,iの修正量、Δｔｈｈ_ｊは中間層第
ｊ番ユニットの閾値ｔｈｈ_ｊの修正量、εは学習定数で
ある。ｙｏ^ｓ _ｋは時刻ｓΔｔのデータが入力された時の
出力層第ｋ番ユニットの出力値、ｘｏ^ｓ _ｋ時刻ｓΔｔの
データが入力された時の出力層第ｋ番ユニットの入力
値、ｙｈ^ｓ _ｊは時刻ｓΔｔのデータが入力された時の中
間層第ｊ番ユニットの出力値、ｘｈ^ｓ _ｋは時刻ｓΔｔの
データが入力された時の中間層第ｊ番ユニットの入力
値、ｙｉ^ｓ _ｉは時刻ｓΔｔのデータが入力された時の中
間層第ｉ番ユニットの出力値である。これらの修正量を
基に、結合荷重値、閾値が修正される。(Equation 34) Derutadaburyu2 k, _j is the intermediate layer j-th unit - correction amount of the coupling weight value w2 _{k, j} between the output layer the k-th unit, Derutatho _k is the correction amount of the threshold tho _k of the k-th unit output layer, .DELTA.W1
_{j, i} is the correction amount of the coupling load value w1 _{j, i} between the input layer i-th unit and the intermediate layer j-th unit, Δthh _j is the correction amount of the threshold thh _j of the intermediate layer j-th unit, and ε is the learning value. It is a constant. yo ^s _k is the output value of the output layer the k th unit when the data in time sΔt is input, the input value of the output layer the k th unit when data xo ^s _k time sΔt is input, yh ^s _j output values of the intermediate layer j-th unit of time in which data of time sΔt is input, the input value of the intermediate layer j-th unit when xh ^s _k is the data of the time sΔt is input, yi ^s _i at time It is the output value of the i-th unit in the middle layer when the data of sΔt is input. The coupling load value and the threshold value are corrected based on these correction amounts.

【０１４０】なお、本発明は前記実施例に限定されるこ
となく、本発明の要旨を逸脱しない範囲で種々変形可能
であることは勿論である。例えば、前記実施例では、予
測モデルとして３層型神経回路モデルを例に挙げたが、
多層の神経回路モデルで同様に適用可能であり、更に
は、学習機能を持つ関数近似モデルであれば、同様に適
用できる。また、溶存酸素濃度予測部の入出力信号につ
いても、実施例では、過去と現在に計測された溶存酸素
濃度、下水流入量、曝気風量からｐΔｔ時間後の溶存酸
素濃度を予測しているが、予測が可能な範囲であれぱ、
入力信号を減らすことや別の観測データを入力すること
も考えられる。また出力信号についても、ｐΔｔ時間後
の溶存酸素濃度を予測するだけでなく、複数時刻の溶存
酸素濃度を予測し、それらの予測溶存酸素濃度と目標溶
存酸素濃度との誤差から仮想曝気風量の修正量を求める
こともできる。The present invention is not limited to the above-mentioned embodiments, and it goes without saying that various modifications can be made without departing from the gist of the present invention. For example, in the above-mentioned embodiment, the three-layer type neural circuit model is given as an example of the prediction model.
It can be similarly applied to a multilayered neural circuit model, and can be similarly applied to a function approximation model having a learning function. Regarding the input / output signal of the dissolved oxygen concentration predicting unit, in the embodiment, the dissolved oxygen concentration after pΔt time is predicted from the dissolved oxygen concentration, the sewage inflow amount, and the aeration air amount measured in the past and the present, Within the predictable range,
It is also possible to reduce the input signal or input another observation data. As for the output signal, not only the dissolved oxygen concentration after pΔt time is predicted, but also the dissolved oxygen concentration at multiple times are predicted, and the virtual aeration volume is corrected from the error between the predicted dissolved oxygen concentration and the target dissolved oxygen concentration. You can also find the quantity.

【０１４１】[0141]

【発明の効果】以上説明したように、本発明によれば、
フィードバック制御のループの中に存在する時間的な要
因を考慮しているので、より正確な制御を行うことがで
きる。つまり、学習部は制御対象に操作量が入力されて
からその操作量による影響が制御量に現れるまでの応答
時間を考慮して学習を行っているので、より正確な制御
を行うことができる。As described above, according to the present invention,
Since the time factor existing in the feedback control loop is taken into consideration, more accurate control can be performed. That is, the learning unit performs learning in consideration of the response time from the input of the operation amount to the control target until the influence of the operation amount appears in the control amount, so that more accurate control can be performed.

【０１４２】また、操作部に操作遅れ時間があり、観測
データには調節信号が含まれず、制御対象に入力される
操作量、制御対象で計測される制御量が含まれる場合、
操作部の操作遅れ時間を計測し、この操作遅れ時間後に
制御対象に入力される操作量を学習データの操作量の入
力信号に利用することにより、制御量予測部は入力され
た操作量が操作遅れ時間後に制御対象に入力された場合
の制御量を予測するようになるので、制御時に操作部に
よる遅れ時間の影響を受けずに正確に制御することがで
きる。When the operation section has an operation delay time, the observation data does not include the adjustment signal, and the operation amount input to the control target and the control amount measured by the control target are included,
By measuring the operation delay time of the operation unit and using the operation amount input to the control target after this operation delay time as the input signal of the operation amount of the learning data, the control amount prediction unit operates the input operation amount. Since the control amount when it is input to the controlled object after the delay time is predicted, it is possible to perform accurate control without being affected by the delay time by the operation unit during control.

【０１４３】さらに、制御対象で計測された計測値が調
節部に伝送されるのに時間遅れがある場合には、この伝
送に伴う伝送むだ時間を計測し、この伝送むだ時間前に
計測された制御量を学習データの制御量の入力信号に利
用することにより、制御量予測部は、伝送むだ時間前に
計測され、伝送に伴う遅れ時間後に制御量予測部に入力
された制御量、操作量から制御量を予測するようになる
ので、制御時に伝送に伴う伝送時間遅れの影響を受けず
に正確に制御することができる。Further, when there is a time delay in transmitting the measured value measured by the controlled object to the adjusting section, the transmission dead time associated with this transmission is measured and measured before this transmission dead time. By using the control amount as the input signal of the control amount of the learning data, the control amount prediction unit measures the control amount and the operation amount measured before the transmission dead time and input to the control amount prediction unit after the delay time accompanying the transmission. Since the control amount is predicted from this, accurate control can be performed without being affected by the transmission time delay associated with transmission during control.

[Brief description of the drawings]

【図１】本発明の原理を説明するためのブロック図であ
る。FIG. 1 is a block diagram for explaining the principle of the present invention.

【図２】図１における調節部の一構成例を詳細に表した
図である。FIG. 2 is a diagram showing in detail one configuration example of an adjusting unit in FIG.

【図３】図２における制御量予測部が神経回路モデルか
ら構成される場合の一構成例を表した図である。FIG. 3 is a diagram showing a configuration example in which a controlled variable predicting unit in FIG. 2 is composed of a neural circuit model.

【図４】神経回路モデルの学習を説明するための図であ
る。FIG. 4 is a diagram for explaining learning of a neural circuit model.

【図５】制御対象の応答時間を算出する応答時間算出部
を装備した場合のブロック図である。FIG. 5 is a block diagram of a case where a response time calculation unit that calculates a response time of a controlled object is provided.

【図６】操作部の操作遅れを考慮した場合の本発明の原
理を説明するためのブロック図である。FIG. 6 is a block diagram for explaining the principle of the present invention when the operation delay of the operation unit is considered.

【図７】図６における調節部の一構成例を詳細に表した
図である。FIG. 7 is a diagram showing in detail one configuration example of an adjusting unit in FIG.

【図８】図７における制御量予測部が神経回路モデルか
ら構成される場合の一構成例を表した図である。FIG. 8 is a diagram showing a configuration example in the case where the control amount prediction unit in FIG. 7 is composed of a neural circuit model.

【図９】図８の神経回路モデルの学習を説明するための
図である。9 is a diagram for explaining learning of the neural circuit model of FIG.

【図１０】操作部の操作遅れ時間を計測する操作遅れ時
間計測部を装備している場合の本発明の原理を説明する
ためのブロック図である。FIG. 10 is a block diagram for explaining the principle of the present invention when the operation delay time measuring unit for measuring the operation delay time of the operation unit is equipped.

【図１１】伝送部の伝送時間遅れを考慮した場合の本発
明の原理を説明するためのブロック図である。FIG. 11 is a block diagram for explaining the principle of the present invention in consideration of a transmission time delay of a transmission unit.

【図１２】図１１における調節部の一構成例を詳細に表
した図である。FIG. 12 is a diagram showing in detail one configuration example of an adjusting unit in FIG.

【図１３】図１２における制御量予測部が神経回路モデ
ルから構成される場合の一構成例を表した図である。FIG. 13 is a diagram showing a configuration example in the case where the control amount prediction unit in FIG. 12 is composed of a neural circuit model.

【図１４】図１３の神経回路モデルの学習を説明するた
めの図である。FIG. 14 is a diagram for explaining learning of the neural circuit model of FIG.

【図１５】伝送部の伝送むだ時間を計測する伝送むだ時
間計測部を装備している場合の本発明の原理を説明する
ためのブロック図である。FIG. 15 is a block diagram for explaining the principle of the present invention when the transmission dead time measuring unit for measuring the transmission dead time of the transmission unit is installed.

【図１６】操作部の操作遅れ時間と伝送部の伝送むだ時
間を考慮した場合の本発明の原理を説明するためのブロ
ック図である。FIG. 16 is a block diagram for explaining the principle of the present invention in consideration of the operation delay time of the operation unit and the transmission dead time of the transmission unit.

【図１７】図１６における調節部の一構成例を詳細に表
した図である。FIG. 17 is a diagram showing in detail one configuration example of an adjusting unit in FIG.

【図１８】図１７における制御量予測部が神経回路モデ
ルから構成される場合の一構成例を表した図である。FIG. 18 is a diagram showing a configuration example in the case where the control amount prediction unit in FIG. 17 is composed of a neural circuit model.

【図１９】図１８の神経回路モデルの学習を説明するた
めの図である。19 is a diagram for explaining learning of the neural circuit model of FIG.

【図２０】操作部の操作遅れ時間を計測する操作遅れ時
間計測部と、伝送部の伝送むだ時間を計測する伝送むだ
時間計測部を装備している場合の本発明の原理を説明す
るためのブロック図である。FIG. 20 is a view for explaining the principle of the present invention when the operation delay time measuring unit for measuring the operation delay time of the operation unit and the transmission dead time measuring unit for measuring the transmission dead time of the transmission unit are provided. It is a block diagram.

【図２１】本発明を下水処理プロセスの溶存酸素濃度制
御に適用した一実施例のブロック図である。FIG. 21 is a block diagram of an example in which the present invention is applied to control of dissolved oxygen concentration in a sewage treatment process.

【図２２】図２１における調節部の一構成例を詳細に表
した図である。22 is a diagram showing in detail an example of the configuration of the adjusting section in FIG. 21. FIG.

【図２３】図２２における制御量予測部が神経回路モデ
ルから構成される場合の一構成例を表した図である。23 is a diagram showing a configuration example in the case where the control amount prediction unit in FIG. 22 is composed of a neural circuit model.

【図２４】図２２における制御量予測部が神経回路モデ
ルから構成される場合の一構成例を表した図である。FIG. 24 is a diagram showing a configuration example in the case where the control amount prediction unit in FIG. 22 is composed of a neural circuit model.

【図２５】図２５は制御装置の計算手順を説明するため
の流れ図である。FIG. 25 is a flow chart for explaining a calculation procedure of the control device.

【図２６】図２６は図２３の神経回路モデルの学習を説
明するための図である。FIG. 26 is a diagram for explaining learning of the neural circuit model of FIG. 23.

[Explanation of symbols]

１……制御対象２……調節部３……操作部４……学習部５……観測データ記憶部６……仮想操作量算出部７……制御量予測部８……操作量修正部９……第１スイッチ１０……第２スイッチ１２……減算器１３……誤差逆伝播学習器１９……応答時間算出部２０……操作遅れ時間計測部２１……伝送部２２……伝送むだ時間計測部 1 ... Control target 2 ... Adjusting unit 3 ... Operating unit 4 ... Learning unit 5 ... Observation data storage unit 6 ... Virtual operation amount calculation unit 7 ... Control amount prediction unit 8 ... Operation amount correction unit 9 ...... First switch 10 ...... Second switch 12 ...... Subtractor 13 ...... Error back propagation learning device 19 ...... Response time calculation unit 20 ...... Operation delay time measurement unit 21 ...... Transmission unit 22 ...... Transmission dead time Measuring unit

Claims

[Claims]

1. A control device for controlling a controlled variable of a controlled object by changing an operation amount input to the controlled object, wherein an adjustment signal is calculated based on a controlled variable and a target controlled variable measured by the controlled object. And an operation unit that outputs an operation amount to the controlled object based on an adjustment signal calculated by the adjustment unit, and an observation data storage that stores observation data having a control amount and an operation amount observed in the past. Section, based on the observation time stored in the observation data storage unit and the response time of the control object from the input of the operation amount from the operation unit to the control object until the influence of the operation amount appears in the control amount And a learning unit that learns the adjusting unit.

2. A control device for controlling a controlled variable of a controlled object by changing an operation amount input to the controlled object, wherein an adjustment signal is calculated based on the controlled variable and the target controlled variable measured by the controlled object. And an operation unit that outputs an operation amount to the controlled object based on an adjustment signal calculated by the adjustment unit, and an observation data storage that stores observation data having a control amount and an operation amount observed in the past. Section, based on the operation amount output from the operation unit and the control amount measured by the control target, after the operation amount is input from the operation unit to the control target, the influence of the operation amount appears in the control amount. Up to the response time calculation unit for calculating the response time of the controlled object, based on the response time calculated by the response time calculation unit and the observation data stored in the observation data storage unit,
A control unit comprising: a learning unit that learns the adjustment unit.

3. The control device according to claim 1, wherein the adjustment unit calculates a virtual operation amount based on the control amount and the target control amount measured by the control target, and a virtual operation amount calculation unit. A control amount prediction unit that predicts a control amount after the response time based on a control amount and a virtual operation amount measured by the control target; and a predicted control amount after a response time predicted by the control amount prediction unit and An operation amount correction unit that corrects the virtual operation amount based on the target control amount, a control amount prediction performed by the control amount prediction unit, and a virtual operation amount correction performed by the operation amount correction unit at least once. A control device, comprising means for inputting an operation amount as an adjustment signal to the operation unit.

4. The control device according to claim 3, wherein the control amount predicting unit is composed of a neural circuit model, and the calculation of the operation amount correcting unit is realized by an error backpropagation calculation of the neural circuit model, the learning unit. Is a learning device that creates learning data based on the response time and the observation data stored in the observation data storage unit, and learns a neural circuit model that constitutes the control amount prediction unit. .

5. The control device according to claim 4, wherein, when the learning unit creates learning data based on the observation data stored in the observation data storage unit, the control amount and the operation amount at the same time. The control device is characterized in that learning is performed by using observation data having the following as an input signal and using a control amount after the response time as a teacher signal.

6. A control device for controlling a control amount of a control target by changing an operation amount input to the control target, wherein an adjustment signal is calculated based on a control amount and a target control amount measured by the control target. And an operation unit that outputs an operation amount to the controlled object based on an adjustment signal calculated by the adjustment unit, and an observation data storage that stores observation data having a control amount and an operation amount observed in the past. Section, the response time of the control target from the input of the operation amount from the operation unit to the control target until the influence of the operation amount appears in the control amount, and the adjustment signal calculated by the adjustment unit through the operation unit. A control device comprising: a learning unit that learns the adjustment unit based on an operation delay time until it is realized as an operation amount and the observation data stored in the observation data storage unit.

7. A control device for controlling a control amount of a control target by changing an operation amount input to the control target, wherein an adjustment signal is calculated based on the control amount and the target control amount measured by the control target. And an operation unit that outputs an operation amount to the controlled object based on an adjustment signal calculated by the adjustment unit, and an observation data storage that stores observation data having a control amount and an operation amount observed in the past. Unit, the operation delay time until the adjustment signal calculated by the adjusting unit is realized as the operation amount through the operating unit, based on the adjusting signal calculated by the adjusting unit and the operation amount output by the operating unit. The operation delay time measuring unit for measuring the operation delay time, the operation sending time measured by the operation delay time measuring unit, and the operation amount input to the control target from the operation unit, and then the influence of the operation amount is the control amount. Based on the stored observation data in the observation data memory unit and the response time of the control object to appear, the control apparatus characterized by comprising a learning section for learning the adjustment portion.

8. The control device according to claim 6, wherein the adjustment unit calculates a virtual operation amount based on a control amount and a target control amount measured by the control target, and a virtual operation amount calculation unit. A control amount prediction unit that predicts a control amount after the response time based on a control amount and a virtual operation amount measured by the control target; and a predicted control amount after a response time predicted by the control amount prediction unit and An operation amount correction unit that corrects the virtual operation amount based on the target control amount, a control amount prediction performed by the control amount prediction unit, and a virtual operation amount correction performed by the operation amount correction unit at least once. A control device, comprising means for inputting an operation amount as an adjustment signal to the operation unit.

9. The control device according to claim 8, wherein the control amount prediction unit is composed of a neural circuit model, and the calculation of the operation amount correction unit is realized by an error backpropagation calculation of the neural circuit model, the learning unit. Based on the response time, the operation delay time, and the observation data stored in the observation data storage unit to create learning data and perform learning of the neural circuit model that constitutes the control amount prediction unit. Characteristic control device.

10. The control device according to claim 9, wherein, when the learning unit creates learning data based on the observation data stored in the observation data storage unit, the control amount at the time and the time A control device for performing learning by using as input signals observation data having an operation amount after an operation delay time and using a control amount after the response time as a teacher signal.

11. A control device for controlling a control amount of a control target by changing an operation amount input to the control target, wherein an adjustment signal is calculated based on the control amount and the target control amount measured by the control target. And an operation unit that outputs an operation amount to the controlled object based on an adjustment signal calculated by the adjustment unit, and an observation data storage that stores observation data having a control amount and an operation amount observed in the past. Section, the response time of the control target from the input of the operation amount from the operation unit to the control target until the influence of the operation amount appears in the control amount and the control amount measured by the control target are transmitted through the transmission unit. A control device comprising: a learning unit that learns the adjustment unit based on a transmission dead time until input to the adjustment unit and the observation data stored in the observation data storage unit.

12. A control device for controlling a controlled variable of a controlled object by changing an operation amount input to the controlled object, wherein an adjustment signal is calculated based on the controlled variable and the target controlled variable measured by the controlled object. And an operation unit that outputs an operation amount to the controlled object based on an adjustment signal calculated by the adjustment unit, and an observation data storage that stores observation data having a control amount and an operation amount observed in the past. Unit, a transmission dead time measuring unit that measures a transmission dead time by the transmission unit based on a control amount measured by the control target and a control amount input to the adjustment unit through the transmission unit, and the transmission dead time The transmission dead time measured by the measuring unit, the response time of the controlled object from the input of the manipulated variable from the operation unit to the controlled object until the influence of the manipulated variable appears on the controlled variable, and the observation data Based on the stored observation data in the data storage unit, the control apparatus characterized by comprising a learning section for learning the adjustment portion.

13. The control device according to claim 11 or 12, wherein the adjusting unit calculates a virtual operation amount based on the control amount and the target control amount measured by the control target. A control amount prediction unit that predicts a control amount after the response time based on a control amount and a virtual operation amount measured by the control target; and a predicted control amount after a response time predicted by the control amount prediction unit and An operation amount correction unit that corrects the virtual operation amount based on the target control amount, a control amount prediction performed by the control amount prediction unit, and a virtual operation amount correction performed by the operation amount correction unit at least once. A control device, comprising means for inputting an operation amount as an adjustment signal to the operation unit.

14. The control device according to claim 13, wherein the control amount predicting unit is composed of a neural circuit model, the calculation of the operation amount correcting unit is realized by an error backpropagation calculation of the neural circuit model, and the learning unit. Based on the response time, the transmission time delay, and the observation data stored in the observation data storage unit to create learning data and perform learning of the neural circuit model that constitutes the control amount prediction unit. Characteristic control device.

15. The control device according to claim 14, wherein when the learning unit creates learning data based on the observation data stored in the observation data storage unit, the control amount before the transmission dead time is set. A control device which performs learning by using as input signals observation data having an operation amount at the time and a control amount after the response time as a teacher signal.

16. A control device for controlling a controlled variable of a controlled object by changing an operation amount input to the controlled object, wherein an adjustment signal is calculated based on the controlled variable and the target controlled variable measured by the controlled object. And an operation unit that outputs an operation amount to the controlled object based on an adjustment signal calculated by the adjustment unit, and an observation data storage that stores observation data having a control amount and an operation amount observed in the past. Section, the response time of the control target from the input of the operation amount from the operation unit to the control target until the influence of the operation amount appears in the control amount and the control amount measured by the control target are transmitted through the transmission unit. It is stored in the observation data storage unit and the operation delay time until the control signal calculated by the control unit is realized as a manipulated variable through the operation unit. Observation data and on the basis the control apparatus characterized by comprising a learning section for learning the adjustment portion.

17. A control device for controlling a control amount of a control target by changing an operation amount input to the control target, wherein an adjustment signal is calculated based on the control amount and the target control amount measured by the control target. And an operation unit that outputs an operation amount to the controlled object based on an adjustment signal calculated by the adjustment unit, and an observation data storage that stores observation data having a control amount and an operation amount observed in the past. Unit, the operation delay time until the adjustment signal calculated by the adjusting unit is realized as the operation amount through the operating unit, based on the adjusting signal calculated by the adjusting unit and the operation amount output by the operating unit. And an operation delay time measuring unit for measuring the transmission delay time by the transmission unit based on the control amount measured by the controlled object and the control amount input to the adjustment unit through the transmission unit. A dead time measuring unit, an operation delay time measured by the operation delay time measuring unit, a transmission dead time measured by the transmission dead time measuring unit, and an operation amount input from the operation unit to the control target A learning unit for learning the adjusting unit based on the response time of the controlled object until the influence of the operation amount appears in the control amount and the observation data stored in the observation data storage unit. Control device.

18. The control device according to claim 16 or 17, wherein the adjusting unit calculates a virtual operation amount based on the control amount and the target control amount measured by the control target, and A control amount prediction unit that predicts a control amount after the response time based on a control amount and a virtual operation amount measured by the control target; and a predicted control amount after a response time predicted by the control amount prediction unit and An operation amount correction unit that corrects the virtual operation amount based on the target control amount, a control amount prediction performed by the control amount prediction unit, and a virtual operation amount correction performed by the operation amount correction unit at least once. A control device, comprising means for inputting an operation amount as an adjustment signal to the operation unit.

19. The control device according to claim 18, wherein the control amount predicting unit is composed of a neural circuit model, the calculation of the operation amount correcting unit is realized by an error backpropagation calculation of the neural circuit model, and the learning unit. Is based on the response time, the operation delay time, the transmission dead time, and the observation data stored in the observation data storage unit, creates learning data, and the neural network model that constitutes the control amount prediction unit. A control device characterized by performing learning.

20. The control device according to claim 19, wherein when the learning unit creates learning data based on the observation data stored in the observation data storage unit, the control amount before the transmission dead time is set. And a control amount after the operation delay time as an input signal, and a control amount after the response time as a teacher signal to perform learning.