JP2024067631A

JP2024067631A - Apparatus, method and program

Info

Publication number: JP2024067631A
Application number: JP2022177854A
Authority: JP
Inventors: 琢劉; 英二石井; 豪 ▲高▼見
Original assignee: Yokogawa Electric Corp
Current assignee: Yokogawa Electric Corp
Priority date: 2022-11-07
Filing date: 2022-11-07
Publication date: 2024-05-17

Abstract

【課題】制御モデルから出力される推奨制御パラメータを出力する。【解決手段】制御対象に関する状態の測定値および目標値の偏差を取得する偏差取得部と、前記制御対象に対して供給された制御パラメータをシフトさせたシフト済み制御パラメータを取得する制御パラメータ取得部と、偏差および制御パラメータが入力されることに応じて、前記制御対象に供給することを推奨する推奨制御パラメータを出力する制御モデルに対し、前記偏差取得部により取得された偏差と、前記制御パラメータ取得部により取得された前記シフト済み制御パラメータとを供給する第１供給部と、前記第１供給部から前記制御モデルに対する供給が行われたことに応じて当該制御モデルから出力される前記推奨制御パラメータを出力する出力部と、を備える装置が提供される。【選択図】図１[Problem] A device is provided that outputs recommended control parameters output from a control model. [Solution] An apparatus is provided that includes a deviation acquisition unit that acquires a deviation between a measured value and a target value of a state related to a controlled object, a control parameter acquisition unit that acquires shifted control parameters obtained by shifting control parameters supplied to the controlled object, a first supply unit that supplies the deviation acquired by the deviation acquisition unit and the shifted control parameters acquired by the control parameter acquisition unit to a control model that outputs recommended control parameters to be supplied to the controlled object in response to input of the deviation and control parameters, and an output unit that outputs the recommended control parameters output from the control model in response to supply to the control model from the first supply unit. [Selected Figure] Figure 1

Description

本発明は、装置、方法およびプログラムに関する。 The present invention relates to an apparatus, a method and a program.

特許文献１～４には、「目標値ＳＶに基づいて操作量マップを選択して、選択した操作量マップを用いて操作量ＭＶを算出する」（特許文献１の段落００３１）などと記載されている。
［先行技術文献］
［特許文献］
［特許文献１］特開２０２２－１５６７９７号公報
［特許文献２］特開２０２０－９５３５２号公報
［特許文献３］特開２０２１－１１７６９９号公報
［特許文献４］特開２０２２－０１４０９９号公報 Patent Documents 1 to 4 state, for example, that "an operation amount map is selected based on the target value SV, and the operation amount MV is calculated using the selected operation amount map" (paragraph 0031 of Patent Document 1).
[Prior Art Literature]
[Patent Documents]
[Patent Document 1] JP 2022-156797 A [Patent Document 2] JP 2020-95352 A [Patent Document 3] JP 2021-117699 A [Patent Document 4] JP 2022-014099 A

本発明の第１の態様においては、制御対象に関する状態の測定値および目標値の偏差を取得する偏差取得部と、前記制御対象に対して供給された制御パラメータをシフトさせたシフト済み制御パラメータを取得する制御パラメータ取得部と、偏差および制御パラメータが入力されることに応じて、前記制御対象に供給することを推奨する推奨制御パラメータを出力する制御モデルに対し、前記偏差取得部により取得された偏差と、前記制御パラメータ取得部により取得された前記シフト済み制御パラメータとを供給する第１供給部と、前記第１供給部から前記制御モデルに対する供給が行われたことに応じて当該制御モデルから出力される前記推奨制御パラメータを出力する出力部と、を備える装置が提供される。 In a first aspect of the present invention, there is provided an apparatus including: a deviation acquisition unit that acquires the deviation between a measured value and a target value of a state related to a control object; a control parameter acquisition unit that acquires shifted control parameters obtained by shifting control parameters supplied to the control object; a first supply unit that supplies the deviation acquired by the deviation acquisition unit and the shifted control parameters acquired by the control parameter acquisition unit to a control model that outputs recommended control parameters to be supplied to the control object in response to input of the deviation and the control parameters; and an output unit that outputs the recommended control parameters output from the control model in response to supply from the first supply unit to the control model.

上記の装置においては、前記状態の目標値を取得する目標値取得部をさらに備え、前記制御パラメータ取得部は、前記目標値が基準目標値から変更されたことに応じて、前記制御対象に対して供給された制御パラメータをシフトさせて、前記シフト済み制御パラメータを取得してよい。 The above device may further include a target value acquisition unit that acquires a target value for the state, and the control parameter acquisition unit may shift the control parameters supplied to the control object in response to the target value being changed from a reference target value, and acquire the shifted control parameters.

上記の装置においては、前記制御パラメータ取得部は、前記目標値が基準目標値から変更されたことに応じて、前記制御対象に対して供給された制御パラメータを、前記目標値に応じたシフト量だけシフトさせて、前記シフト済み制御パラメータを取得してよい。 In the above device, the control parameter acquisition unit may, in response to the target value being changed from the reference target value, shift the control parameter supplied to the control object by a shift amount corresponding to the target value, and acquire the shifted control parameter.

前記制御パラメータを前記目標値に応じたシフト量だけシフトさせる上記の装置においては、前記制御パラメータ取得部は、前記目標値および前記基準目標値の差分と、予め設定された係数とを乗算して前記シフト量を決定してよい。 In the above device that shifts the control parameter by a shift amount corresponding to the target value, the control parameter acquisition unit may determine the shift amount by multiplying the difference between the target value and the reference target value by a preset coefficient.

前記制御パラメータを前記目標値に応じたシフト量だけシフトさせる上記の装置においては、前記制御パラメータ取得部は、前記目標値に設定された値と、当該値に前記測定値が安定するときの制御パラメータの値との関係を示す、予め設定された関係式を用いて前記シフト量を決定してよい。 In the above device in which the control parameter is shifted by a shift amount corresponding to the target value, the control parameter acquisition unit may determine the shift amount using a pre-set relational expression that indicates the relationship between a value set as the target value and the value of the control parameter when the measured value stabilizes at that value.

上記何れかの装置においては、前記推奨制御パラメータにより前記制御対象が制御された後に前記偏差取得部により取得される偏差が基準範囲内に安定しないことを検出する第１検出部をさらに備え、前記制御パラメータ取得部は、前記偏差取得部により取得される偏差が前記基準範囲内に安定しないことが前記第１検出部により検出されたことに応じて、前記制御対象に対して供給された前記制御パラメータをシフトさせて、前記シフト済み制御パラメータを取得してよい。 Any of the above devices may further include a first detection unit that detects that the deviation acquired by the deviation acquisition unit is not stable within a reference range after the control object is controlled by the recommended control parameters, and the control parameter acquisition unit may shift the control parameters supplied to the control object and acquire the shifted control parameters in response to the first detection unit detecting that the deviation acquired by the deviation acquisition unit is not stable within the reference range.

上記の装置においては、偏差および当該偏差の変化速度が入力されることに応じて、前記制御パラメータ取得部により制御パラメータについてシフトすることを推奨する推奨シフト量を出力するシフト量出力モデルに対し、前記偏差取得部により取得された偏差と、当該偏差の変化速度とを供給する第２供給部をさらに備え、前記制御パラメータ取得部は、前記偏差取得部により取得される偏差が前記基準範囲内に安定しないことが前記第１検出部により検出され、かつ、前記第２供給部から前記シフト量出力モデルに対する供給が行われたことに応じて当該シフト量出力モデルから出力される前記推奨シフト量だけ、前記制御対象に対して供給された制御パラメータをシフトさせて、前記シフト済み制御パラメータを取得してよい。 The above device further includes a second supply unit that supplies the deviation acquired by the deviation acquisition unit and the rate of change of the deviation to a shift amount output model that outputs a recommended shift amount that recommends a shift of a control parameter by the control parameter acquisition unit in response to input of the deviation and the rate of change of the deviation, and the control parameter acquisition unit may shift the control parameter supplied to the controlled object by the recommended shift amount output from the shift amount output model in response to detection by the first detection unit that the deviation acquired by the deviation acquisition unit is not stable within the reference range and supply from the second supply unit to the shift amount output model to acquire the shifted control parameter.

前記偏差取得部により取得された偏差と、当該偏差の変化速度とをシフト量出力モデルに供給する上記の装置においては、前記第２供給部は、基準インターバルごとに前記シフト量出力モデルに対する供給を行ってよい。 In the above device that supplies the deviation acquired by the deviation acquisition unit and the rate of change of the deviation to a shift amount output model, the second supply unit may supply the deviation to the shift amount output model at each reference interval.

前記偏差取得部により取得された偏差と、当該偏差の変化速度とをシフト量出力モデルに供給する上記の装置においては、前記偏差取得部により取得される偏差と、当該偏差の変化速度と、前記制御パラメータ取得部による前記制御パラメータのシフト量とを含む学習データを用い、偏差および当該偏差の変化速度の入力に応じ、予め設定された報酬関数により定まる報酬値を高めるために推奨される前記推奨シフト量を出力するよう前記シフト量出力モデルの学習処理を行う第１学習処理部をさらに備えてよい。 The above device, which supplies the deviation acquired by the deviation acquisition unit and the rate of change of the deviation to a shift amount output model, may further include a first learning processing unit that performs learning processing of the shift amount output model using learning data including the deviation acquired by the deviation acquisition unit, the rate of change of the deviation, and the shift amount of the control parameter acquired by the control parameter acquisition unit, and outputs the recommended shift amount recommended to increase a reward value determined by a preset reward function in response to input of the deviation and the rate of change of the deviation.

上記何れかの装置においては、前記制御対象が変更されたことを検出する第２検出部をさらに備え、前記制御パラメータ取得部は、前記制御対象が変更されたことが前記第２検出部により検出されたことに応じて、前記制御対象に対して供給された制御パラメータをシフトさせて、前記シフト済み制御パラメータを取得してよい。 Any of the above devices may further include a second detection unit that detects that the control target has been changed, and the control parameter acquisition unit may shift the control parameters supplied to the control target in response to the second detection unit detecting that the control target has been changed, and acquire the shifted control parameters.

上記何れかの装置においては、前記制御モデルは、偏差および制御パラメータが入力されることに応じて、当該制御パラメータについて変更することを推奨する推奨変更量を出力する変更量出力モデルと、前記制御対象に供給された前記制御パラメータと、前記変更量出力モデルから出力される前記推奨変更量とを加算して前記推奨制御パラメータを算出する加算部と、を有してよい。 In any of the above devices, the control model may include a change amount output model that outputs a recommended change amount that recommends a change to the control parameter in response to input of the deviation and the control parameter, and an adder that calculates the recommended control parameter by adding the control parameter supplied to the control target and the recommended change amount output from the change amount output model.

上記の装置においては、前記偏差取得部により取得される偏差と、前記制御パラメータ取得部により取得される制御パラメータと、を含む学習データを用い、偏差および制御パラメータの入力に応じ、予め設定された報酬関数により定まる報酬値を高めるために推奨される前記推奨変更量を出力するよう前記変更量出力モデルの学習処理を行う第２学習処理部をさらに備えてよい。 The above device may further include a second learning processing unit that performs learning processing of the change amount output model using learning data including the deviation acquired by the deviation acquisition unit and the control parameters acquired by the control parameter acquisition unit, and outputs the recommended change amount recommended to increase a reward value determined by a preset reward function in response to input of the deviation and the control parameters.

本発明の第２の態様においては、制御対象に関する状態の測定値および目標値の偏差を取得する偏差取得段階と、前記制御対象に対して供給された制御パラメータをシフトさせたシフト済み制御パラメータを取得する制御パラメータ取得段階と、偏差および制御パラメータが入力されることに応じて、前記制御対象に供給することを推奨する推奨制御パラメータを出力する制御モデルに対し、前記偏差取得段階により取得された偏差と、前記制御パラメータ取得段階により取得された前記シフト済み制御パラメータとを供給する第１供給段階と、前記第１供給段階により前記制御モデルに対する供給が行われたことに応じて当該制御モデルから出力される前記推奨制御パラメータを出力する出力段階と、を備える方法が提供される。 In a second aspect of the present invention, a method is provided that includes a deviation acquisition step of acquiring a deviation between a measured value and a target value of a state related to a control object, a control parameter acquisition step of acquiring a shifted control parameter obtained by shifting a control parameter supplied to the control object, a first supply step of supplying the deviation acquired by the deviation acquisition step and the shifted control parameter acquired by the control parameter acquisition step to a control model that outputs a recommended control parameter recommended to be supplied to the control object in response to input of the deviation and the control parameter, and an output step of outputting the recommended control parameter output from the control model in response to supply to the control model by the first supply step.

本発明の第３の態様においては、コンピュータを、制御対象に関する状態の測定値および目標値の偏差を取得する偏差取得部と、前記制御対象に対して供給された制御パラメータをシフトさせたシフト済み制御パラメータを取得する制御パラメータ取得部と、偏差および制御パラメータが入力されることに応じて、前記制御対象に供給することを推奨する推奨制御パラメータを出力する制御モデルに対し、前記偏差取得部により取得された偏差と、前記制御パラメータ取得部により取得された前記シフト済み制御パラメータとを供給する第１供給部と、前記第１供給部から前記制御モデルに対する供給が行われたことに応じて当該制御モデルから出力される前記推奨制御パラメータを出力する出力部として機能させるプログラムが提供される。 In a third aspect of the present invention, a program is provided that causes a computer to function as a deviation acquisition unit that acquires the deviation between a measured value and a target value of a state related to a control object, a control parameter acquisition unit that acquires shifted control parameters obtained by shifting control parameters supplied to the control object, a first supply unit that supplies the deviation acquired by the deviation acquisition unit and the shifted control parameters acquired by the control parameter acquisition unit to a control model that outputs recommended control parameters to be supplied to the control object in response to input of the deviation and control parameters, and an output unit that outputs the recommended control parameters output from the control model in response to supply from the first supply unit to the control model.

なお、上記の発明の概要は、本発明の必要な特徴の全てを列挙したものではない。また、これらの特徴群のサブコンビネーションもまた、発明となりうる。 Note that the above summary of the invention does not list all of the necessary features of the present invention. Also, subcombinations of these features may also be inventions.

第１実施形態に係るシステム１を示す。1 shows a system 1 according to a first embodiment. 変更量出力モデル２０６１の一例を示す。2 shows an example of a change amount output model 2061. 変更量出力モデル２０６１の他の例を示す。2 shows another example of the change amount output model 2061. 制御モデル２０６に入力される制御パラメータをシフトすることによる効果を示す。The effect of shifting the control parameters input to the control model 206 is shown. サンプルデータの近似曲線を示す。The fitted curve of the sample data is shown. 装置２００の動作を示す。2 illustrates the operation of the device 200. 目標値ＳＰが変更される場合の測定値ＰＶと、制御パラメータとの推移を示す。13 shows the transition of the measured value PV and the control parameter when the target value SP is changed. 第２実施形態に係るシステム１Ａを示す。1 shows a system 1A according to a second embodiment. シフト量出力モデル２１３Ａの一例を示す。2 shows an example of a shift amount output model 213A. シフト量出力モデル２１３Ａの他の例を示す。13 shows another example of the shift amount output model 213A. 装置２００Ａの動作を示す。The operation of the device 200A is shown. 外乱が生じる場合の測定値ＰＶと、制御パラメータとの推移を示す。13 shows the transition of the measured value PV and the control parameter when a disturbance occurs. 第３実施形態に係るシステム１Ｂを示す。13 shows a system 1B according to a third embodiment. 本発明の複数の態様が全体的または部分的に具現化されてよいコンピュータ２２００の例を示す。22 illustrates an example computer 2200 in which aspects of the present invention may be embodied, in whole or in part.

以下、発明の実施の形態を通じて本発明を説明するが、以下の実施形態は特許請求の範囲にかかる発明を限定するものではない。また、実施形態の中で説明されている特徴の組み合わせの全てが発明の解決手段に必須であるとは限らない。 The present invention will be described below through embodiments of the invention, but the following embodiments do not limit the invention according to the claims. Furthermore, not all of the combinations of features described in the embodiments are necessarily essential to the solution of the invention.

＜１．第１実施形態＞
＜１．１．システム１＞
図１は、第１実施形態に係るシステム１を示す。システム１は設備１００と、装置２００とを備える。 <1. First embodiment>
<1.1. System 1>
1 shows a system 1 according to the first embodiment. The system 1 includes a facility 100 and an apparatus 200.

＜１．１．１．設備１００＞
設備１００は、制御対象１０１が備え付けられた施設や装置等である。例えば、設備１００は、プラントであってもよいし、複数の機器を複合させた複合装置であってもよい。プラントとしては、化学やバイオ等の工業プラントの他、ガス田や油田等の井戸元やその周辺を管理制御するプラント、水力・火力・原子力等の発電を管理制御するプラント、太陽光や風力等の環境発電を管理制御するプラント、上下水やダム等を管理制御するプラント等が挙げられる。 <1.1.1. Equipment 100>
The equipment 100 is a facility or device in which a control target 101 is installed. For example, the equipment 100 may be a plant or a composite device in which multiple devices are combined. In addition to industrial plants such as chemical and biotechnology plants, plants that manage and control wellheads and surrounding areas of gas and oil fields, plants that manage and control hydroelectric, thermal and nuclear power plants, and environmental power plants such as solar and wind power plants. Examples of such plants include those that manage and control water supply, sewage, and dams.

設備１００には、１または複数の制御対象１０１が設けられている。制御対象１０１は、制御の対象となる器具、機械または装置等であり、いわゆるフィールド機器であってよい。例えば、制御対象１０１は、圧力計、流量計、温度センサ等のセンサ機器、流量制御弁や開閉弁等のバルブ機器、またはファンやモータ等のアクチュエータ機器であってよい。制御対象１０１は、外部から有線または無線で制御されるが、手動で制御されてもよい。制御対象１０１は、装置２００における制御部２０７によって制御されてよい。本実施形態では一例として、制御対象１０１は、操作量ＭＶ（ＭａｎｉｐｕｌａｔｅｄＶａｒｉａｂｌｅ）についての指示値ＩＶ（Ｉｎｓｔｒｕｃｔｅｄｖａｌｕｅ）を制御部２０７から供給されることで制御されてよい。 The facility 100 is provided with one or more control objects 101. The control object 101 may be a tool, machine, or device to be controlled, and may be a so-called field device. For example, the control object 101 may be a sensor device such as a pressure gauge, flow meter, or temperature sensor, a valve device such as a flow control valve or an on-off valve, or an actuator device such as a fan or a motor. The control object 101 is controlled from the outside by wire or wirelessly, but may also be controlled manually. The control object 101 may be controlled by the control unit 207 in the device 200. In this embodiment, as an example, the control object 101 may be controlled by supplying an instruction value IV (Instructed Value) for a manipulated variable MV (Manipulated Variable) from the control unit 207.

また、設備１００には、１または複数のセンサ１０２が設けられていてよい。各センサ１０２は、設備１００の内外の状態の測定値、つまり、内外の状態を示す物理量の測定値を測定してよい。少なくとも１つのセンサ１０２は、制御対象１０１の状態の測定値ＰＶ（ＰｒｏｃｅｓｓＶａｒｉａｂｌｅ）を測定してよい。測定値ＰＶは、制御対象１０１を制御した結果の運転状態を示す運転データであってよく、制御の対象となる制御量を示してよい。一例として、測定値ＰＶは、制御対象１０１の出力そのものを示してもよいし、制御対象１０１の出力によって変化する様々な値を示してもよい。一例として、測定値ＰＶは、圧力、温度、ｐＨ、速度、流量などを示してよい。各センサ１０２は、測定した測定値ＰＶを装置２００に供給してよい。 The facility 100 may be provided with one or more sensors 102. Each sensor 102 may measure a measurement value of the inside and outside state of the facility 100, that is, a measurement value of a physical quantity indicating the inside and outside state. At least one sensor 102 may measure a measurement value PV (Process Variable) of the state of the control object 101. The measurement value PV may be operation data indicating the operation state as a result of controlling the control object 101, and may indicate the control amount to be controlled. As an example, the measurement value PV may indicate the output of the control object 101 itself, or may indicate various values that change depending on the output of the control object 101. As an example, the measurement value PV may indicate pressure, temperature, pH, speed, flow rate, etc. Each sensor 102 may supply the measured measurement value PV to the device 200.

＜１．１．２．装置２００＞
装置２００は、制御対象１０１を制御するものであり、例えば制御対象１０１のコントローラであってよい。装置２００は、制御対象１０１の操作量ＭＶについての指示値ＩＶを出力して温度の調節、液面の水位調整または流量の調整などのプロセス制御を実行してよい。 <1.1.2. Apparatus 200>
The device 200 controls the controlled object 101, and may be, for example, a controller for the controlled object 101. The device 200 outputs an instruction value IV for a manipulated variable MV of the controlled object 101 to adjust the temperature, Process controls such as adjusting the surface level or adjusting the flow rate may be performed.

装置２００は、ＰＣ（パーソナルコンピュータ）、タブレット型コンピュータ、スマートフォン、ワークステーション、サーバコンピュータ、または汎用コンピュータ等のコンピュータであってよく、複数のコンピュータが接続されたコンピュータシステムであってもよい。このようなコンピュータシステムもまた広義のコンピュータである。また、装置２００は、コンピュータ内で１または複数実行可能な仮想コンピュータ環境によって実装されてもよい。これに代えて、装置２００は、ＡＩ制御用に設計された専用コンピュータであってもよく、専用回路によって実現された専用ハードウェアであってもよい。また、装置２００がインターネットに接続可能な場合、装置２００は、クラウドコンピューティングにより実現されてもよい。 The device 200 may be a computer such as a PC (personal computer), a tablet computer, a smartphone, a workstation, a server computer, or a general-purpose computer, or may be a computer system to which multiple computers are connected. Such a computer system is also a computer in the broad sense. The device 200 may be implemented by a virtual computer environment in which one or more programs can be executed within a computer. Alternatively, the device 200 may be a dedicated computer designed for AI control, or may be dedicated hardware realized by a dedicated circuit. Furthermore, if the device 200 can be connected to the Internet, the device 200 may be realized by cloud computing.

装置２００は、測定値取得部２０１と、目標値取得部２０２と、偏差取得部２０３と、制御パラメータ取得部２０４と、第１供給部２０５と、制御モデル２０６と、制御部２０７と、学習処理部２０８とを有してよい。なお、これらブロックは、それぞれ機能的に分離された機能ブロックであって、実際のデバイス構成とは必ずしも一致していなくてもよい。即ち、本図において、１つのブロックとして示されている場合であっても、それが１つのデバイスにより構成されるものに限定されない。また、本図において、別々のブロックとして示されている場合であっても、それらが別々のデバイスにより構成されるものに限定されない。 The device 200 may have a measurement value acquisition unit 201, a target value acquisition unit 202, a deviation acquisition unit 203, a control parameter acquisition unit 204, a first supply unit 205, a control model 206, a control unit 207, and a learning processing unit 208. Note that these blocks are functionally separated functional blocks, and may not necessarily match the actual device configuration. That is, even if shown as one block in this diagram, it is not limited to being configured by one device. Also, even if shown as separate blocks in this diagram, it is not limited to being configured by separate devices.

＜１．１．２―１．測定値取得部２０１＞
測定値取得部２０１は、制御対象１０１に関する状態の測定値ＰＶを取得する。本実施形態では一例として、測定値取得部２０１は、一の物理量についての測定値ＰＶを一のセンサ１０２から取得することとして説明するが、複数の物理量のそれぞれについての測定値ＰＶを複数のセンサ１０２から取得してもよい。測定値取得部２０１は、取得した測定値ＰＶを偏差取得部２０３に供給してよい。 <1.1.2-1. Measurement value acquisition unit 201>
The measurement value acquiring unit 201 acquires a measurement value PV of a state related to the control target 101. In the present embodiment, as an example, the measurement value acquiring unit 201 acquires a measurement value PV for one physical quantity from one sensor 102, but the measurement value acquiring unit 201 may acquire measurement values PV for each of a plurality of physical quantities from a plurality of sensors 102. The measurement value acquiring unit 201 may supply the acquired measurement value PV to the deviation acquiring unit 203.

＜１．１．２－２．目標値取得部２０２＞
目標値取得部２０２は、制御対象１０１に関する状態の目標値ＳＰ（ＳｅｔＰｏｉｎｔ）を取得する。目標値取得部２０２は、測定値取得部２０１により取得される測定値ＰＶの目標値ＳＰを取得してよい。目標値取得部２０２は、図示しない入力部を介してオペレータから目標値ＳＰを取得してよい。目標値取得部２０２は、オペレータにより目標値ＳＰが設定されない場合には、予め設定された基準目標値を目標値ＳＰとして取得してよい。目標値取得部２０２は、取得した目標値ＳＰを偏差取得部２０３に供給してよい。目標値取得部２０２は、基準目標値とは異なる値に目標値ＳＰが設定されることに応じて、設定された目標値ＳＰを示す信号（目標値変更信号とも称する）を制御パラメータ取得部２０４に供給してよい。 <1.1.2-2. Target value acquisition unit 202>
The target value acquisition unit 202 acquires a target value SP (Set Point) of a state related to the control target 101. The target value acquisition unit 202 may acquire the target value SP of the measurement value PV acquired by the measurement value acquisition unit 201. The target value acquisition unit 202 may acquire the target value SP from an operator via an input unit (not shown). When the target value SP is not set by the operator, the target value acquisition unit 202 may acquire a preset reference target value as the target value SP. The target value acquisition unit 202 may supply the acquired target value SP to the deviation acquisition unit 203. In response to the target value SP being set to a value different from the reference target value, the target value acquisition unit 202 may supply a signal indicating the set target value SP (also referred to as a target value change signal) to the control parameter acquisition unit 204.

＜１．１．２－３．偏差取得部２０３＞
偏差取得部２０３は、制御対象１０１に関する状態の測定値ＰＶおよび目標値ＳＰの偏差を取得する。偏差取得部２０３は、測定値取得部２０１から測定値ＰＶを、目標値取得部２０２から目標値ＳＰを取得し、目標値ＳＰから測定値ＰＶを減算して偏差を算出してよい。これに代えて、偏差取得部２０３は、測定値ＰＶから目標値ＳＰを減算して偏差を算出してもよい。偏差取得部２０３は、取得した偏差を第１供給部２０５に供給してよい。偏差取得部２０３は、取得した偏差を、図示しない記憶部に記憶させてよい。 <1.1.2-3. Deviation acquisition unit 203>
The deviation acquisition unit 203 acquires the deviation between the measurement value PV and the target value SP of the state related to the control target 101. The deviation acquisition unit 203 may acquire the measurement value PV from the measurement value acquisition unit 201 and the target value SP from the target value acquisition unit 202, and calculate the deviation by subtracting the measurement value PV from the target value SP. Alternatively, the deviation acquisition unit 203 may calculate the deviation by subtracting the target value SP from the measurement value PV. The deviation acquisition unit 203 may supply the acquired deviation to the first supply unit 205. The deviation acquisition unit 203 may store the acquired deviation in a storage unit not shown.

＜１．１．２－４．制御パラメータ取得部２０４＞
制御パラメータ取得部２０４は、制御対象１０１に対して供給された制御パラメータＰをシフトさせたシフト済み制御パラメータＰ＋Δｐを取得する。制御パラメータ取得部２０４は、後述の制御部２０７から制御パラメータＰを取得してよく、本実施形態では一例として、制御部２０７が制御対象１０１に制御パラメータＰを供給するごとに当該制御パラメータＰを取得してよい。制御パラメータＰは、制御対象１０１の操作量ＭＶについての指示値ＩＶを示してよい。制御対象１０１がバルブである場合には、制御パラメータＰは一例としてバルブ開度を示してよい。制御パラメータ取得部２０４は、制御部２０７から取得した制御パラメータＰをシフトさせてシフト済み制御パラメータＰ＋Δｐを生成してよい。 <1.1.2-4. Control parameter acquisition unit 204>
The control parameter acquisition unit 204 acquires a shifted control parameter P+Δp by shifting the control parameter P supplied to the control object 101. The control parameter acquisition unit 204 may acquire the control parameter P from the control unit 207 described later, and in the present embodiment, as an example, may acquire the control parameter P every time the control unit 207 supplies the control parameter P to the control object 101. The control parameter P may indicate an instruction value IV for an operation amount MV of the control object 101. When the control object 101 is a valve, the control parameter P may indicate a valve opening degree, as an example. The control parameter acquisition unit 204 may shift the control parameter P acquired from the control unit 207 to generate the shifted control parameter P+Δp.

制御パラメータ取得部２０４は、目標値ＳＰが上述の基準目標値から変更されたことに応じて、制御対象１０１に対して供給された制御パラメータＰをシフトさせてシフト済み制御パラメータＰ＋Δｐを取得してよい。制御パラメータ取得部２０４は、目標値取得部２０２から目標値変更信号を受信したことに応じて、制御部２０７から取得した直近の制御パラメータＰをシフトさせ、シフト済み制御パラメータＰ＋Δｐを生成してよい。 The control parameter acquisition unit 204 may shift the control parameter P supplied to the control object 101 in response to a change in the target value SP from the above-mentioned reference target value, and acquire a shifted control parameter P+Δp. In response to receiving a target value change signal from the target value acquisition unit 202, the control parameter acquisition unit 204 may shift the most recent control parameter P acquired from the control unit 207, and generate a shifted control parameter P+Δp.

制御パラメータ取得部２０４は、目標値ＳＰが上述の基準目標値から変更されたことに応じて、制御対象１０１に対して供給された制御パラメータＰを、目標値ＳＰに応じたシフト量Δｐだけシフトさせてシフト済み制御パラメータＰ＋Δｐを取得してよい。制御パラメータ取得部２０４は、目標値変更信号で示される目標値ＳＰに基づいてシフト量Δｐを決定してよい。なお、シフト量Δｐについては詳細を後述する。 In response to the target value SP being changed from the above-mentioned reference target value, the control parameter acquisition unit 204 may shift the control parameter P supplied to the control target 101 by a shift amount Δp corresponding to the target value SP to acquire a shifted control parameter P+Δp. The control parameter acquisition unit 204 may determine the shift amount Δp based on the target value SP indicated by the target value change signal. The shift amount Δp will be described in detail later.

制御パラメータ取得部２０４は、取得したシフト済み制御パラメータＰ＋Δｐを第１供給部２０５に供給してよい。なお、本実施形態では一例として、制御パラメータ取得部２０４は、目標値ＳＰが基準目標値である場合には、制御部２０７から取得した制御パラメータＰをシフトさせずに第１供給部２０５に供給することとして説明するが、当該制御パラメータＰを０だけシフトさせたシフト済み制御パラメータＰ＋Δｐとして第１供給部２０５に供給してもよい。 The control parameter acquisition unit 204 may supply the acquired shifted control parameter P+Δp to the first supply unit 205. Note that, as an example in this embodiment, when the target value SP is a reference target value, the control parameter acquisition unit 204 supplies the control parameter P acquired from the control unit 207 to the first supply unit 205 without shifting it, but the control parameter P may be shifted by 0 and supplied to the first supply unit 205 as a shifted control parameter P+Δp.

＜１．１．２－５．第１供給部２０５＞
第１供給部２０５は、制御モデル２０６に対し、偏差取得部２０３により取得された偏差と、制御パラメータ取得部２０４により取得されたシフト済み制御パラメータＰ＋Δｐとを供給する。第１供給部２０５は、制御部２０７から制御対象１０１に供給された制御パラメータＰをシフトさせたシフト済み制御パラメータＰ＋Δｐと、当該制御パラメータＰにより制御対象１０１を制御した結果の運転状態を示す偏差とを制御モデル２０６に供給してよい。第１供給部２０５は、シフトされていない制御パラメータＰが制御パラメータ取得部２０４から供給される場合には、当該制御パラメータＰと偏差とを制御モデル２０６に供給してよい。 <1.1.2-5. First supply unit 205>
The first supply unit 205 supplies the deviation acquired by the deviation acquisition unit 203 and the shifted control parameter P+Δp acquired by the control parameter acquisition unit 204 to the control model 206. A shifted control parameter P+Δp obtained by shifting the control parameter P supplied from the control unit 207 to the controlled object 101 and a deviation indicating an operating state as a result of controlling the controlled object 101 using the control parameter P are supplied to the control model 206. When the unshifted control parameter P is supplied from the control parameter acquisition unit 204, the first supplying unit 205 may supply the control parameter P and the deviation to the control model 206.

＜１．１．２－６．制御モデル２０６＞
制御モデル２０６は、偏差および制御パラメータＰが入力されることに応じて、制御対象１０１に供給することを推奨する推奨制御パラメータＰｒを出力する。制御モデル２０６は、偏差と、一の制御対象１０１に供給された制御パラメータＰとが入力されることに応じて、当該一の制御対象１０１に供給することを推奨する推奨制御パラメータＰｒを出力してよい。推奨制御パラメータＰｒは、制御対象１０１の操作量ＭＶについての、推奨される指示値ＩＶを示してよい。 <1.1.2-6. Control model 206>
In response to input of the deviation and the control parameter P, the control model 206 outputs a recommended control parameter Pr recommended to be supplied to the controlled object 101. In response to input of the deviation and the control parameter P supplied to one controlled object 101, the control model 206 may output a recommended control parameter Pr recommended to be supplied to the one controlled object 101. The recommended control parameter Pr may indicate a recommended indicated value IV for the manipulated variable MV of the controlled object 101.

制御モデル２０６は、制御パラメータＰについて基準値Ｖ（一例として０）を基準として生成されていてよく、基準値Ｖに対する制御パラメータＰの相対値が偏差と共に入力されることに応じて、当該基準値Ｖを基準とした推奨制御パラメータＰｒの値を出力してよい。基準値Ｖを基準とした制御モデル２０６に対して偏差と制御パラメータＰとが入力されることに代えて、偏差とシフト済み制御パラメータＰ＋Δｐとが入力されることは、基準値Ｖ－Δｐを基準とした他の制御モデルに対して偏差と制御パラメータＰとが入力されることに相当してよい。このような他の制御モデルは、特許文献１で示されるような従来技術において、制御モデル２０６とは異なる環境で使用するべく生成され得る。 The control model 206 may be generated based on a reference value V (0, for example) for the control parameter P, and may output the value of the recommended control parameter Pr based on the reference value V in response to inputting the relative value of the control parameter P with respect to the reference value V together with the deviation. Inputting the deviation and the shifted control parameter P+Δp instead of inputting the deviation and the control parameter P to the control model 206 based on the reference value V may be equivalent to inputting the deviation and the control parameter P to another control model based on the reference value V-Δp. Such another control model may be generated for use in an environment different from that of the control model 206 in the conventional technology as shown in Patent Document 1.

制御モデル２０６は、偏差および制御パラメータＰが入力されることに応じて、目標値ＳＰが基準目標値である状態に応じた推奨制御パラメータＰｒを出力してよい。別言すれば、制御モデル２０６は、目標値が基準目標値である状態において推奨される推奨制御パラメータＰｒを出力してよい。このような制御モデル２０６に対して偏差と制御パラメータＰとが入力されることに代えて、偏差とシフト済み制御パラメータＰ＋Δｐとが入力されることは、目標値ＳＰが基準目標値とは異なる一の値である場合に適合した推奨制御パラメータＰｒを出力する他の制御モデルに対し、偏差と制御パラメータＰとが入力されることに相当してよい。 In response to the input of the deviation and the control parameter P, the control model 206 may output a recommended control parameter Pr corresponding to a state in which the target value SP is a reference target value. In other words, the control model 206 may output a recommended control parameter Pr that is recommended in a state in which the target value is a reference target value. Inputting the deviation and the shifted control parameter P+Δp instead of inputting the deviation and the control parameter P to such a control model 206 may be equivalent to inputting the deviation and the control parameter P to another control model that outputs a recommended control parameter Pr that is suitable when the target value SP is a value different from the reference target value.

制御モデル２０６は、第１供給部２０５から偏差と、シフト済み制御パラメータＰ＋Δｐまたは制御パラメータＰとを入力されることに応じて、制御部２０７に推奨制御パラメータＰｒを出力してよい。本実施形態では一例として、制御モデル２０６には一の物理量についての測定値ＰＶおよび目標値ＳＰの偏差が入力されることとして説明するが、複数の物理量についての測定値ＰＶおよび目標値ＳＰの偏差がそれぞれ入力されることとしてもよい。制御モデル２０６は、変更量出力モデル２０６１と、加算部２０６２とを有してよい。 The control model 206 may output a recommended control parameter Pr to the control unit 207 in response to input of the deviation and the shifted control parameter P+Δp or the control parameter P from the first supply unit 205. In this embodiment, as an example, the deviation of the measurement value PV and the target value SP for one physical quantity is described as being input to the control model 206, but the deviations of the measurement value PV and the target value SP for multiple physical quantities may also be input. The control model 206 may have a change amount output model 2061 and an adder 2062.

＜１．１．２－６（１）．変更量出力モデル２０６１＞
変更量出力モデル２０６１は、偏差および制御パラメータＰが入力されることに応じて、当該制御パラメータＰについて変更することを推奨する推奨変更量を出力する。変更量出力モデル２０６１は、推奨変更量を加算部２０６２に供給してよい。推奨変更量は、制御対象１０１に対して供給された直近の制御パラメータＰから変更することを推奨する変更量を示してよい。本実施形態では一例として推奨変更量は、操作量ＭＶについての直近の指示値ＩＶについての推奨される変更量を示してよい。変更量出力モデル２０６１は、学習処理部２０８による学習処理によって生成されてよく、図示しない記憶部に記憶されていてよい。 <1.1.2-6(1). Change amount output model 2061>
In response to input of the deviation and the control parameter P, the change amount output model 2061 outputs a recommended change amount that recommends a change to the control parameter P. The change amount output model 2061 may supply the recommended change amount to the adder 2062. The recommended change amount may indicate a change amount that is recommended to be changed from the most recent control parameter P supplied to the controlled object 101. In the present embodiment, as an example, the recommended change amount may indicate a recommended change amount for the most recent indicated value IV for the manipulated variable MV. The change amount output model 2061 may be generated by a learning process by the learning processor 208, and may be stored in a storage unit (not shown).

＜１．１．２－６（２）．加算部２０６２＞
加算部２０６２は、制御対象１０１に供給された制御パラメータＰと、変更量出力モデル２０６１から出力される推奨変更量とを加算して推奨制御パラメータＰｒを算出する。加算部２０６２は、制御部２０７から供給された直近の制御パラメータＰと、変更量出力モデル２０６１から供給された推奨変更量とを加算して推奨制御パラメータＰｒを算出してよい。 <1.1.2-6(2). Addition unit 2062>
The adder 2062 calculates a recommended control parameter Pr by adding the control parameter P supplied to the control target 101 and the recommended change amount output from the change amount output model 2061. The adder 2062 may calculate the recommended control parameter Pr by adding the most recent control parameter P supplied from the control unit 207 and the recommended change amount supplied from the change amount output model 2061.

加算部２０６２は、次の（１）式に示すように、時点ｔ－１での制御パラメータＰ_{（ｔ－１）}と、時点ｔでの推奨変更量Δｕ_（ｔ）とを加算して、時点ｔでの推奨制御パラメータＰｒ_（ｔ）を算出してよい。
Ｐｒ_（ｔ）＝Ｐ_{（ｔ－１）}＋Δｕ_（ｔ）（１） The adder 2062 may calculate the recommended control parameter Pr(t) at time t by adding the control parameter P _(t-1) at time t-1 and the recommended change amount Δu _(t) at time t, as shown in the following equation ₍₁₎ .
Pr _(t) = P _(t-1) + Δu _(t) (1)

加算部２０６２は、制御部２０７から供給される制御パラメータＰを記憶して、推奨制御パラメータＰｒの算出に用いてよい。加算部２０６２は、算出された推奨制御パラメータＰｒを制御部２０７に供給してよい。 The adder 2062 may store the control parameter P supplied from the control unit 207 and use it to calculate the recommended control parameter Pr. The adder 2062 may supply the calculated recommended control parameter Pr to the control unit 207.

＜１．１．２－７．制御部２０７＞
制御部２０７は、出力部の一例であり、第１供給部２０５から制御モデル２０６に対する供給が行われたことに応じて当該制御モデル２０６から出力される推奨制御パラメータＰｒを出力する。本実施形態では一例として、制御部２０７は、推奨制御パラメータＰｒを制御パラメータＰとして制御対象１０１に出力して、制御対象１０１を制御してよい。制御部２０７は、オペレータから入力される制御パラメータＰを制御対象１０１に出力して制御対象１０１を制御してもよい。制御部２０７は、制御対象１０１の制御周期に合わせて制御パラメータＰを制御対象１０１に出力してよい。 <1.1.2-7. Control unit 207>
The control unit 207 is an example of an output unit, and outputs a recommended control parameter Pr output from the control model 206 in response to supply from the first supply unit 205 to the control model 206. As an example in the present embodiment, the control unit 207 may output the recommended control parameter Pr to the control object 101 as a control parameter P to control the control object 101. The control unit 207 may output the control parameter P input by an operator to the control object 101 to control the control object 101. The control unit 207 may output the control parameter P to the control object 101 in accordance with the control period of the control object 101.

制御部２０７は、制御対象１０１に供給される制御パラメータＰを、図示しない記憶部に記憶させてよい。制御部２０７は、制御対象１０１に供給される制御パラメータＰを、偏差取得部２０３により取得される偏差と対応付けて記憶部に記憶させてよい。制御部２０７は、制御対象１０１に供給される制御パラメータＰを、当該制御パラメータＰにより制御対象１０１を制御した結果の運転状態を示す偏差と対応付けて記憶部に記憶させてよい。 The control unit 207 may store the control parameter P supplied to the control object 101 in a storage unit (not shown). The control unit 207 may store the control parameter P supplied to the control object 101 in the storage unit in association with the deviation acquired by the deviation acquisition unit 203. The control unit 207 may store the control parameter P supplied to the control object 101 in the storage unit in association with the deviation indicating the operating state resulting from controlling the control object 101 with the control parameter P.

＜１．１．２－８．学習処理部２０８＞
学習処理部２０８は、第２学習処理部の一例であり、偏差取得部２０３により取得される偏差と、制御パラメータ取得部２０４により取得される制御パラメータＰと、を含む学習データを用いて変更量出力モデル２０６１の学習処理を行う。学習データに含まれる偏差および制御パラメータＰは、目標値ＳＰが上述の基準目標値である場合に取得される偏差および制御パラメータＰであってよく、図示しない記憶部において対応付けて記憶されてよい。なお、学習データは、実際のシステム１から取得される代わりに、システム１のシミュレータ（図示せず）から取得されてもよい。シミュレータは、任意のシステム同定技術により設備１００の実測データなどを用いて作成されてよい。 <1.1.2-8. Learning processing unit 208>
The learning processing unit 208 is an example of a second learning processing unit, and performs learning processing of the change amount output model 2061 using learning data including the deviation acquired by the deviation acquisition unit 203 and the control parameter P acquired by the control parameter acquisition unit 204. The deviation and control parameter P included in the learning data may be the deviation and control parameter P acquired when the target value SP is the above-mentioned reference target value, and may be stored in association with each other in a storage unit (not shown). Note that the learning data may be acquired from a simulator (not shown) of the system 1 instead of being acquired from the actual system 1. The simulator may be created using actual measurement data of the facility 100 by any system identification technology.

学習処理部２０８は、偏差および制御パラメータＰの入力に応じ、報酬値を高めるために推奨される推奨変更量を出力するよう変更量出力モデル２０６１の学習を行ってよい。推奨変更量は、所定の時点（一例として偏差および制御パラメータＰの取得時点）での制御対象１０１の状態に対応する報酬値（一例としてその時点の測定値ＰＶに応じた値を報酬関数に入力して得られる報酬値）を基準報酬値とした場合に、当該基準報酬値よりも報酬値を高くするために推奨される変更量であってよい。一例として、学習処理部２０８は、カーネルダイナミックポリシープログラミング法（ＫｅｒｎｅｌＤｙｎａｍｉｃＰｏｌｉｃｙＰｒｏｇｒａｍｍｉｎｇ、ＫＤＰＰ）のアルゴリズムにより学習を行ってよい。報酬値は、予め設定された報酬関数により定まる値であってよい。報酬関数は、偏差に基づく関数であってよく、一例として、偏差が小さいほど報酬値が大きくなる関数であってよい。なお、偏差取得部２０３により複数の物理量のそれぞれについて偏差が取得される場合には、報酬関数は複数の偏差の総和に基づく関数であってもよいし、複数の偏差を重み付け加算した結果に基づく関数であってもよい。 The learning processing unit 208 may learn the change amount output model 2061 so as to output a recommended change amount recommended for increasing the reward value in response to the input of the deviation and the control parameter P. The recommended change amount may be a change amount recommended for increasing the reward value above a reference reward value (for example, a reward value obtained by inputting a value corresponding to the measurement value PV at that time into the reward function) corresponding to the state of the control object 101 at a predetermined time point (for example, the time point at which the deviation and the control parameter P are acquired) when the reference reward value is set as the reward value. As an example, the learning processing unit 208 may learn using an algorithm of the Kernel Dynamic Policy Programming (KDPP). The reward value may be a value determined by a preset reward function. The reward function may be a function based on the deviation, and as an example, a function in which the smaller the deviation, the larger the reward value. In addition, when the deviation acquisition unit 203 acquires deviations for each of multiple physical quantities, the reward function may be a function based on the sum of the multiple deviations, or may be a function based on the result of weighting and adding the multiple deviations.

以上の装置２００によれば、測定値ＰＶおよび目標値ＳＰの偏差と制御パラメータＰとが入力されることに応じて推奨制御パラメータＰｒを出力する制御モデル２０６に対し、制御パラメータＰをシフトさせたシフト済み制御パラメータＰ＋Δｐが供給される。従って、制御パラメータＰの基準値を制御モデル２０６よりもシフト量－Δｐだけシフトさせた他の制御モデルに制御パラメータＰを入力して出力される推奨制御パラメータＰｒを、制御モデル２０６から取得することができる。よって、環境の変化（本実施形態では一例として目標値の変化）に合わせて制御パラメータＰをシフトさせて制御モデル２０６に入力することで、変化した環境に適合した推奨制御パラメータＰｒを取得することができる。また、制御パラメータＰをシフトすることにより、基準値の異なる他の制御モデルから出力される推奨制御パラメータＰｒを制御モデル２０６から取得することができるため、複数の制御モデルを装置２００に内蔵する場合と比較して、装置２００を小型化することができる。 According to the above-described device 200, the control model 206 outputs the recommended control parameter Pr in response to the input of the deviation between the measured value PV and the target value SP and the control parameter P, and the shifted control parameter P+Δp is supplied to the control model 206. Therefore, the recommended control parameter Pr output by inputting the control parameter P to another control model in which the reference value of the control parameter P is shifted by the shift amount -Δp from the control model 206 can be obtained from the control model 206. Therefore, by shifting the control parameter P in accordance with the change in the environment (the change in the target value as an example in this embodiment) and inputting it to the control model 206, the recommended control parameter Pr adapted to the changed environment can be obtained. In addition, by shifting the control parameter P, the recommended control parameter Pr output from another control model with a different reference value can be obtained from the control model 206, so that the device 200 can be made smaller than when multiple control models are built into the device 200.

また、制御モデル２０６では、偏差と、制御対象１０１に供給済みの制御パラメータＰとに応じて当該制御パラメータＰの推奨変更量が変更量出力モデル２０６１から出力され、当該供給済みの制御パラメータＰと、当該推奨変更量とが加算されて推奨制御パラメータＰｒが算出される。従って、制御パラメータＰをシフト済み制御パラメータＰ＋ΔＰとして制御モデル２０６に入力することで制御モデル２０６を他の制御モデルとして用いる場合であっても、供給済みの制御パラメータＰをベースとした推奨制御パラメータＰｒを取得することができる。よって、制御対象１０１に適合した適切な推奨制御パラメータＰｒを取得することができる。 In addition, in the control model 206, a recommended change amount for the control parameter P is output from the change amount output model 2061 according to the deviation and the control parameter P that has already been supplied to the control object 101, and the recommended change amount is added to the supplied control parameter P to calculate the recommended control parameter Pr. Therefore, even when the control model 206 is used as another control model by inputting the control parameter P to the control model 206 as the shifted control parameter P+ΔP, a recommended control parameter Pr based on the supplied control parameter P can be obtained. Therefore, an appropriate recommended control parameter Pr that is suited to the control object 101 can be obtained.

また、偏差取得部２０３により取得される偏差と、制御パラメータ取得部２０４により取得される制御パラメータＰとを含む学習データを用い、偏差および制御パラメータＰの入力に応じ、報酬関数により定まる報酬値を高めるために推奨される推奨変更量を出力するよう変更量出力モデル２０６１の学習処理が行われる。従って、適切な推奨制御パラメータＰｒを制御モデル２０６から確実に取得することができる。 In addition, using learning data including the deviation acquired by the deviation acquisition unit 203 and the control parameter P acquired by the control parameter acquisition unit 204, a learning process is performed on the change amount output model 2061 so as to output a recommended change amount recommended for increasing the reward value determined by the reward function in response to the input of the deviation and the control parameter P. Therefore, an appropriate recommended control parameter Pr can be reliably acquired from the control model 206.

また、偏差および制御パラメータＰが制御モデル２０６に入力されることに応じて、目標値ＳＰが基準目標値である状態に応じた推奨制御パラメータＰｒが制御モデル２０６から出力され、目標値ＳＰが基準目標値から変更されたことに応じて、制御パラメータＰをシフトさせたシフト済み制御パラメータＰ＋Δｐが制御モデル２０６に供給される。従って、単一の制御モデル２０６を用い、目標値ＳＰの変化に応じた推奨制御パラメータＰｒを取得することができる。 In addition, in response to the deviation and control parameter P being input to the control model 206, a recommended control parameter Pr corresponding to the state in which the target value SP is the reference target value is output from the control model 206, and in response to the target value SP being changed from the reference target value, a shifted control parameter P+Δp obtained by shifting the control parameter P is supplied to the control model 206. Therefore, using a single control model 206, it is possible to obtain a recommended control parameter Pr corresponding to a change in the target value SP.

また、目標値ＳＰが上述の基準目標値から変更されたことに応じて、制御対象１０１に対して供給された制御パラメータＰを目標値ＳＰに応じたシフト量Δｐだけシフトさせてシフト済み制御パラメータＰ＋Δｐが取得される。従って、単一の制御モデル２０６を用い、目標値ＳＰの変化に応じた高精度の推奨制御パラメータＰｒを取得することができる。 In addition, in response to the target value SP being changed from the above-mentioned reference target value, the control parameter P supplied to the control target 101 is shifted by a shift amount Δp corresponding to the target value SP to obtain a shifted control parameter P+Δp. Therefore, using a single control model 206, it is possible to obtain a highly accurate recommended control parameter Pr corresponding to the change in the target value SP.

＜１．２．変更量出力モデル２０６１＞
図２は、変更量出力モデル２０６１の一例を示す。なお、図２や後述の図４等において縦軸は制御パラメータＰ（一例としてバルブの開度の指示値ＩＶ）を示し、横軸は偏差を示す。 <1.2. Change amount output model 2061>
Fig. 2 shows an example of the change amount output model 2061. In Fig. 2 and Fig. 4 described later, the vertical axis indicates the control parameter P (as an example, the command value IV of the valve opening) and the horizontal axis indicates the deviation.

変更量出力モデル２０６１は、偏差および制御パラメータＰの組み合わせと、推奨変更量との対応関係を示してよい。本例の変更量出力モデル２０６１は、偏差および制御パラメータＰの組み合わせと、推奨変更量との対応関係をマッピングした操作量マップであってよい。操作量マップは、制御パラメータＰと偏差との組み合わせに応じて、それぞれ別々の推奨変更量に対応付けられた複数の領域に分けられてよく、入力される制御パラメータＰおよび偏差の組み合わせの座標位置に対応付けられた推奨変更量を出力してよい。このような変更量出力モデル２０６１を用いると、偏差が０で、かつ、変更量が０の座標点（平衡点Ｏとも称する。図２では一例として偏差＝０かつ制御パラメータＰ＝約５０の点）でプロセスが安定状態となる。 The change amount output model 2061 may show the correspondence between the combination of the deviation and the control parameter P and the recommended change amount. The change amount output model 2061 in this example may be an operation amount map that maps the correspondence between the combination of the deviation and the control parameter P and the recommended change amount. The operation amount map may be divided into multiple regions that correspond to different recommended change amounts according to the combination of the control parameter P and the deviation, and may output recommended change amounts that correspond to the coordinate position of the input combination of the control parameter P and the deviation. When such a change amount output model 2061 is used, the process becomes stable at a coordinate point where the deviation is 0 and the change amount is 0 (also called the equilibrium point O. In FIG. 2, as an example, the point where the deviation is 0 and the control parameter P is about 50).

なお、変更量出力モデル２０６１は、操作量マップの全域に関する情報を含んでよい。これに代えて、変更量出力モデル２０６１は、各領域の境界を示す情報（一例として境界を示す座標や関数式）と、各領域に対応する推奨変更量とのみを含んでもよい。この場合には、変更量出力モデル２０６１を記憶するための記憶領域を小さくすることができる。 The change amount output model 2061 may include information about the entire area of the operation amount map. Alternatively, the change amount output model 2061 may include only information indicating the boundaries of each area (for example, coordinates or function formulas indicating the boundaries) and the recommended change amount corresponding to each area. In this case, the storage area for storing the change amount output model 2061 can be reduced.

図３は、変更量出力モデル２０６１の他の例を示す。この図に示すように、変更量出力モデル２０６１は、偏差および制御パラメータＰの組み合わせと、推奨変更量とを対応付けたテーブルであってもよい。 Figure 3 shows another example of the change amount output model 2061. As shown in this figure, the change amount output model 2061 may be a table that associates combinations of deviations and control parameters P with recommended change amounts.

＜１．３．制御パラメータＰのシフト＞
図４は、制御モデル２０６に入力される制御パラメータＰをシフトすることによる効果を示す。 1.3. Shift of control parameter P
FIG. 4 illustrates the effect of shifting the control parameter P input to the control model 206.

本実施形態に係る変更量出力モデル２０６１の操作量マップは偏差についての座標軸を有しており、目標値ＳＰが基準目標値から変更される等によりシステム１の環境が変化する場合にも、偏差＝０の線上でプロセスが安定することには変わりがない。従って、環境が変化することによる平衡点Ｏのずれは、操作量マップを制御パラメータＰの座標軸方向（本図では一例として縦軸方向）にシフトさせることで解消され得る。 The manipulated variable map of the change amount output model 2061 according to this embodiment has a coordinate axis for the deviation, and even if the environment of the system 1 changes, for example because the target value SP is changed from the reference target value, the process remains stable on the line of deviation = 0. Therefore, the deviation of the equilibrium point O caused by a change in the environment can be eliminated by shifting the manipulated variable map in the direction of the coordinate axis of the control parameter P (the vertical axis direction in this figure is used as an example).

本実施形態に係る装置２００では、環境が変化することにより元の平衡点Ｏとは異なる点Ｏｓが実際の平衡点となる場合に、制御モデル２０６の変更量出力モデル２０６１に対し、制御パラメータＰをシフト量Δｐだけシフトさせて入力する。これにより、図中上側に示す変更量出力モデル２０６１の操作量マップを制御パラメータＰの座標軸方向に―Δｐだけシフトさせて、図中下側に示す操作量マップとし、偏差と制御パラメータＰとを入力した場合と同様の推奨変更量が出力される。よって、制御パラメータＰをシフトさせて制御モデル２０６に入力することにより、操作量マップをシフトさせた他の制御モデルからの出力を取得することができる。 In the device 200 according to this embodiment, when a point Os different from the original equilibrium point O becomes the actual equilibrium point due to a change in the environment, the control parameter P is shifted by a shift amount Δp and input to the change amount output model 2061 of the control model 206. As a result, the operation amount map of the change amount output model 2061 shown in the upper part of the figure is shifted by -Δp in the coordinate axis direction of the control parameter P to become the operation amount map shown in the lower part of the figure, and a recommended change amount similar to that when the deviation and the control parameter P are input is output. Therefore, by shifting the control parameter P and inputting it to the control model 206, it is possible to obtain an output from another control model with a shifted operation amount map.

＜１．４．シフト量＞
制御パラメータ取得部２０４は、制御対象１０１に供給された制御パラメータＰを、設定された目標値ＳＰに応じたシフト量Δｐだけシフトさせてシフト済み制御パラメータＰ＋Δｐを生成してよい。例えば、制御パラメータ取得部２０４は、目標値ＳＰおよび基準目標値の差分と、予め設定された係数とを乗算してシフト量Δｐを決定してよい。これに代えて、または、これに加えて、制御パラメータ取得部２０４は、目標値ＳＰに設定された値と、当該値に測定値ＰＶが安定するときの制御パラメータＰの値（平衡点Ｏでの制御パラメータＰの値とも称する）との関係を示す、予め設定された関係式を用いてシフト量Δｐを決定してよい。 1.4. Shift amount
The control parameter acquisition unit 204 may shift the control parameter P supplied to the controlled object 101 by a shift amount Δp according to the set target value SP to generate a shifted control parameter P+Δp. For example, the control parameter acquisition unit 204 may determine the shift amount Δp by multiplying the difference between the target value SP and the reference target value by a preset coefficient. Alternatively or in addition to this, the control parameter acquisition unit 204 may determine the shift amount Δp using a preset relational expression that indicates the relationship between the value set in the target value SP and the value of the control parameter P when the measurement value PV stabilizes at that value (also referred to as the value of the control parameter P at the equilibrium point O).

測定値ＰＶが目標値ＳＰに安定するとは、制御部２０７から制御対象１０１に制御パラメータＰが出力されて第１の基準時間が経過した後に、第２の基準時間内に亘って測定値ＰＶが基準範囲内に収まることであってよい。基準範囲は、目標値ＳＰを中央に含む範囲であってよい。第１の基準時間および第２の基準時間は制御対象１０１の時定数に応じてそれぞれ予め設定されてよい。第１の基準時間および第２の基準時間は互いに異なる時間であってよい。第２の基準時間は第１の基準時間よりも長くてよく、制御部２０７による制御対象１０１の制御周囲より長くてよい。 The measurement value PV being stabilized at the target value SP may mean that after the control parameter P is output from the control unit 207 to the control object 101 and the first reference time has elapsed, the measurement value PV falls within a reference range over a second reference time. The reference range may be a range that includes the target value SP at the center. The first reference time and the second reference time may be set in advance according to the time constant of the control object 101. The first reference time and the second reference time may be different times from each other. The second reference time may be longer than the first reference time and may be longer than the control perimeter of the control object 101 by the control unit 207.

シフト量の決定に用いられる上述の係数や関係式は、平衡点Ｏでの測定値ＰＶと当該平衡点Ｏでの制御パラメータＰとを含む複数のサンプルデータに対する近似曲線から設定されてよい。平衡点Ｏでの測定値ＰＶを測定値ＰＶの目標値ＳＰとして捉えると、近似曲線は目標値ＳＰと、平衡点Ｏでの制御パラメータＰの値との関係を示してよい。当該基準時曲線の関数は、シフト量の決定に用いられる上述の関係式として予め設定されてよい。近似曲線が一次関数である場合には、目標値ＳＰの変化量に対する制御パラメータＰの変化量の比、つまり近似曲線の傾きを示す値は、シフト量の決定に用いられる上述の係数として予め設定されてよい。 The above-mentioned coefficients and relational expressions used to determine the shift amount may be set from an approximation curve for multiple sample data including the measurement value PV at the equilibrium point O and the control parameter P at the equilibrium point O. If the measurement value PV at the equilibrium point O is taken as the target value SP of the measurement value PV, the approximation curve may indicate the relationship between the target value SP and the value of the control parameter P at the equilibrium point O. The function of the reference time curve may be set in advance as the above-mentioned relational expression used to determine the shift amount. If the approximation curve is a linear function, the ratio of the change in the control parameter P to the change in the target value SP, i.e., a value indicating the slope of the approximation curve, may be set in advance as the above-mentioned coefficient used to determine the shift amount.

各サンプルデータは、制御対象１０１に供給する制御パラメータＰを一定値に維持して測定値ＰＶのステップ応答が一の値に安定したときの制御パラメータＰの値と、測定値ＰＶの値とを含んでよい。これに代えて、各サンプルデータは、目標値ＳＰを設定して制御対象１０１をＰＩＤ制御によって制御し、測定値ＰＶが目標値ＳＰに安定したときの制御パラメータＰの値と、目標値ＳＰとを含んでよい。各サンプルデータは制御対象１０１を実際に動作させて取得されてもよいし、シミュレータによって取得されてもよい。サンプルデータの近似曲線は、最小二乗法などの線形回帰により算出されてもよいし、多項式近似やガウス過程回帰などの非線形回帰により算出されてもよい。 Each sample data may include the value of the control parameter P when the step response of the measurement value PV stabilizes to a single value while maintaining the control parameter P supplied to the control object 101 at a constant value, and the value of the measurement value PV. Alternatively, each sample data may include the value of the control parameter P when the measurement value PV stabilizes to the target value SP by setting a target value SP and controlling the control object 101 by PID control, and the target value SP. Each sample data may be obtained by actually operating the control object 101, or may be obtained by a simulator. An approximation curve of the sample data may be calculated by linear regression such as the least squares method, or may be calculated by nonlinear regression such as polynomial approximation or Gaussian process regression.

図５は、サンプルデータの近似曲線を示す。図中の横軸（ｘ軸）は目標値ＳＰ（または平衡点Ｏでの測定値ＰＶ）を示し、縦軸（ｙ軸）は平衡点での制御パラメータＰの値を示す。本図の例では、近似曲線はｙ＝０．８１６５ｘ－０．０００２であってよく、目標値ＳＰは基準目標値としての５０から８０に変更されてよい。この場合に、制御パラメータ取得部２０４は、目標値ＳＰおよび基準目標値の差分である３０と、予め設定された係数としての０．８１６５とを乗算して、シフト量Δｐを２４．４９５と決定してよい。これに代えて、制御パラメータ取得部２０４は、予め設定された関数式ｙ＝０．８１６５ｘ－０．０００２を用いて、目標値ＳＰが基準目標値の５０である場合の制御パラメータＰの値を４０．８２４８（＝０．８１６５×５０）、目標値ＳＰが８０である場合の制御パラメータＰの値を６５．３１９８（＝０．８１６５×８０）としてそれぞれ算出し、両者の差分からシフト量Δｐを２４．４９５と決定してもよい。なお、目標値ＳＰが基準目標値である場合の制御パラメータＰの値（本例では４０．８２４８）は予め制御パラメータ取得部２０４に記憶されてもよい。 Figure 5 shows an approximation curve of sample data. The horizontal axis (x-axis) in the figure shows the target value SP (or the measurement value PV at the equilibrium point O), and the vertical axis (y-axis) shows the value of the control parameter P at the equilibrium point. In the example of this figure, the approximation curve may be y = 0.8165x - 0.0002, and the target value SP may be changed from 50 as the reference target value to 80. In this case, the control parameter acquisition unit 204 may multiply 30, which is the difference between the target value SP and the reference target value, by 0.8165, which is a preset coefficient, to determine the shift amount Δp as 24.495. Alternatively, the control parameter acquisition unit 204 may use a preset function formula y = 0.8165x - 0.0002 to calculate the value of the control parameter P as 40.8248 (= 0.8165 x 50) when the target value SP is the reference target value of 50, and the value of the control parameter P as 65.3198 (= 0.8165 x 80) when the target value SP is 80, and determine the shift amount Δp to be 24.495 from the difference between the two. Note that the value of the control parameter P when the target value SP is the reference target value (40.8248 in this example) may be stored in advance in the control parameter acquisition unit 204.

以上のような制御パラメータ取得部２０４を有する装置２００によれば、目標値ＳＰおよび基準目標値の差分と、予め設定された係数とを乗算してシフト量が決定されるので、変更後の目標値ＳＰに合わせた高精度の推奨制御パラメータＰｒを取得することができる。 With the device 200 having the control parameter acquisition unit 204 described above, the shift amount is determined by multiplying the difference between the target value SP and the reference target value by a preset coefficient, so that a highly accurate recommended control parameter Pr can be obtained that matches the changed target value SP.

また、目標値ＳＰに設定された値と、当該値に測定値ＰＶが安定するときの制御パラメータＰの値との関係を示す既定の関係式を用いてシフト量が決定されるので、関数式に目標値ＳＰを入力することでシフト量を決定することができる。従って、変更後の目標値ＳＰに合わせた高精度の推奨制御パラメータＰｒを容易に取得することができる。 In addition, since the shift amount is determined using a predefined relational equation that indicates the relationship between the value set for the target value SP and the value of the control parameter P when the measurement value PV stabilizes at that value, the shift amount can be determined by inputting the target value SP into the function equation. Therefore, it is possible to easily obtain a highly accurate recommended control parameter Pr that matches the changed target value SP.

＜１．５．動作＞
図６は、装置２００の動作を示す。装置２００は、ステップＳ１１～Ｓ２９の処理を行うことにより、制御対象１０１を制御してよい。なお、この動作は装置２００が起動されることに応じて開始してよい。また、動作の開始時点においては変更量出力モデル２０６１の学習処理が完了していてよい。 <1.5. Operation>
6 shows the operation of the device 200. The device 200 may control the control target 101 by performing the processes of steps S11 to S29. Note that this operation is performed in response to the device 200 being started up. In addition, the learning process of the change amount output model 2061 may be completed at the start of the operation.

ステップＳ１１において目標値取得部２０２は、制御対象１０１に関する状態の目標値ＳＰを取得する。目標値取得部２０２は、予め設定された基準目標値を目標値ＳＰとして取得してもよいし、基準目標値とは異なる値を目標値ＳＰとして取得してもよい。ステップＳ１１の処理が最初に実行される場合には、目標値取得部２０２は、基準目標値を目標値ＳＰとして取得してよい。目標値取得部２０２は、基準目標値とは異なる値を目標値ＳＰとして取得したことに応じて、設定された目標値ＳＰを示す目標値変更信号を出力してよい。 In step S11, the target value acquisition unit 202 acquires a target value SP for the state related to the control object 101. The target value acquisition unit 202 may acquire a preset reference target value as the target value SP, or may acquire a value different from the reference target value as the target value SP. When the process of step S11 is executed for the first time, the target value acquisition unit 202 may acquire the reference target value as the target value SP. In response to acquiring a value different from the reference target value as the target value SP, the target value acquisition unit 202 may output a target value change signal indicating the set target value SP.

ステップＳ１３において測定値取得部２０１は、制御対象１０１に関する状態の測定値ＰＶを取得する。目標値取得部２０２は、設備１００のセンサ１０２から測定値ＰＶを取得してよい。 In step S13, the measurement value acquisition unit 201 acquires the measurement value PV of the state of the control object 101. The target value acquisition unit 202 may acquire the measurement value PV from the sensor 102 of the equipment 100.

ステップＳ１５において偏差取得部２０３は、ステップＳ１１で取得された目標値ＳＰと、ステップＳ１３で取得された測定値ＰＶとの偏差を取得する。 In step S15, the deviation acquisition unit 203 acquires the deviation between the target value SP acquired in step S11 and the measurement value PV acquired in step S13.

ステップＳ１７において制御パラメータ取得部２０４は、制御対象１０１に対して供給された制御パラメータＰを取得する。制御パラメータ取得部２０４は、直近の制御周期において制御対象１０１に供給された制御パラメータＰを制御部２０７から取得してよい。一例として、制御パラメータ取得部２０４は、後述のステップＳ２７の処理で制御部２０７から制御対象１０１に出力される制御パラメータＰを取得して一時保存しておき、ステップＳ１７において当該制御パラメータＰを読み出してよい。ステップＳ１７が最初に実行される場合、つまりステップＳ２７の処理が実行されていない場合には、制御パラメータ取得部２０４は、予め設定された制御パラメータＰの初期値を取得してよい。 In step S17, the control parameter acquisition unit 204 acquires the control parameter P supplied to the control object 101. The control parameter acquisition unit 204 may acquire the control parameter P supplied to the control object 101 in the most recent control cycle from the control unit 207. As an example, the control parameter acquisition unit 204 may acquire and temporarily store the control parameter P output from the control unit 207 to the control object 101 in the processing of step S27 described below, and read out the control parameter P in step S17. When step S17 is executed for the first time, that is, when the processing of step S27 has not been executed, the control parameter acquisition unit 204 may acquire the initial value of the control parameter P that has been set in advance.

ステップＳ１９において制御パラメータ取得部２０４は、ステップＳ１１で取得された目標値ＳＰが基準目標値であるか否かを判定する。制御パラメータ取得部２０４は、目標値取得部２０２から目標値変更信号を取得したか否かに基づいて判定を行ってよい。ステップＳ１９において目標値ＳＰが基準目標値であると判定された場合（ステップＳ１９；Ｙｅｓ）にはステップＳ２５に処理が移行してよい。ステップＳ１９において目標値ＳＰが基準目標値でないと判定された場合（ステップＳ１９；Ｎｏ）にはステップＳ２１に処理が移行してよい。 In step S19, the control parameter acquisition unit 204 determines whether the target value SP acquired in step S11 is a reference target value. The control parameter acquisition unit 204 may make this determination based on whether a target value change signal has been acquired from the target value acquisition unit 202. If it is determined in step S19 that the target value SP is the reference target value (step S19; Yes), the process may proceed to step S25. If it is determined in step S19 that the target value SP is not the reference target value (step S19; No), the process may proceed to step S21.

ステップＳ２１において制御パラメータ取得部２０４は、制御パラメータＰのシフト量Δｐを決定する。制御パラメータ取得部２０４は、ステップＳ１１で取得された目標値ＳＰに応じてシフト量Δｐを決定してよい。 In step S21, the control parameter acquisition unit 204 determines a shift amount Δp of the control parameter P. The control parameter acquisition unit 204 may determine the shift amount Δp according to the target value SP acquired in step S11.

ステップＳ２３において、制御パラメータ取得部２０４は、ステップＳ１７で取得された制御パラメータＰを、決定されたシフト量Δｐだけシフトさせてシフト済み制御パラメータＰ＋Δｐを取得する。 In step S23, the control parameter acquisition unit 204 shifts the control parameter P acquired in step S17 by the determined shift amount Δp to acquire the shifted control parameter P+Δp.

ステップＳ２５において第１供給部２０５は、制御パラメータ取得部２０４から供給された制御パラメータＰと、偏差取得部２０３から供給された偏差とを制御モデル２０６に供給する。第１供給部２０５は、ステップＳ１９において目標値ＳＰが基準目標値であると判定されている場合には、ステップＳ１７で制御パラメータ取得部２０４により取得された制御パラメータＰをそのまま制御モデル２０６に供給してよい。第１供給部２０５は、ステップＳ１９において目標値ＳＰが基準目標値でないと判定されている場合には、ステップＳ２３でシフトされた制御パラメータＰ、つまりシフト済み制御パラメータＰ＋Δｐを制御モデル２０６に供給してよい。 In step S25, the first supply unit 205 supplies the control parameter P supplied from the control parameter acquisition unit 204 and the deviation supplied from the deviation acquisition unit 203 to the control model 206. If the target value SP is determined to be the reference target value in step S19, the first supply unit 205 may supply the control parameter P acquired by the control parameter acquisition unit 204 in step S17 to the control model 206 as is. If the target value SP is determined to not be the reference target value in step S19, the first supply unit 205 may supply the control parameter P shifted in step S23, i.e., the shifted control parameter P+Δp, to the control model 206.

これにより、入力された制御パラメータＰおよび偏差に応じた推奨制御パラメータＰｒが制御モデル２０６から出力される。本実施形態では一例として、入力された制御パラメータＰおよび偏差に応じた推奨変更量が変更量出力モデル２０６１から出力され、推奨変更量と、ステップＳ１７で取得された制御パラメータＰとが加算部２０６２により加算されて推奨制御パラメータＰｒが生成されてよい。 As a result, a recommended control parameter Pr corresponding to the input control parameter P and deviation is output from the control model 206. As an example in this embodiment, a recommended change amount corresponding to the input control parameter P and deviation is output from the change amount output model 2061, and the recommended change amount and the control parameter P acquired in step S17 are added by the adder 2062 to generate the recommended control parameter Pr.

ステップＳ２７において制御部２０７は、制御モデル２０６からの推奨制御パラメータＰｒを出力する。制御部２０７は、推奨制御パラメータＰｒを制御パラメータＰとして制御対象１０１に供給して、制御対象１０１を制御してよい。 In step S27, the control unit 207 outputs the recommended control parameters Pr from the control model 206. The control unit 207 may supply the recommended control parameters Pr to the control object 101 as control parameters P to control the control object 101.

ステップＳ２９において目標値取得部２０２は、オペレータにより目標値ＳＰが変更されるか否かを判定する。目標値ＳＰが変更されないと判定された場合（ステップＳ２９；Ｎｏ）にはステップＳ１３に処理が移行してよい。目標値ＳＰが変更されたと判定された場合（ステップＳ２９；Ｙｅｓ）にはステップＳ１１に処理が移行してよい。 In step S29, the target value acquisition unit 202 determines whether the target value SP is changed by the operator. If it is determined that the target value SP is not changed (step S29; No), the process may proceed to step S13. If it is determined that the target value SP is changed (step S29; Yes), the process may proceed to step S11.

＜１．６．動作例＞
図７は、目標値ＳＰが変更される場合の測定値ＰＶと、制御パラメータＰとの推移を示す。図中の横軸は時間（秒）を示し、縦軸は測定値ＰＶおよび制御パラメータＰの値を示す。なお、本図では一例として制御パラメータＰは、バルブの開度の指示値ＩＶを示しており、測定値ＰＶおよび制御パラメータＰは０～１００の範囲内に正規化されている。この図に示されるように、本実施形態に係る装置２００では、目標値ＳＰが変更される場合であっても、変更後の目標値ＳＰに測定値ＰＶが一致するよう制御対象１０１が制御される。 <1.6. Operation example>
7 shows the transition of the measurement value PV and the control parameter P when the target value SP is changed. The horizontal axis in the figure indicates time (seconds), and the vertical axis indicates the values of the measurement value PV and the control parameter P. Note that in this figure, as an example, the control parameter P indicates the indication value IV of the valve opening, and the measurement value PV and the control parameter P are normalized within the range of 0 to 100. As shown in this figure, in the device 200 according to this embodiment, even when the target value SP is changed, the controlled object 101 is controlled so that the measurement value PV coincides with the changed target value SP.

＜２．第２実施形態＞
＜２．１．システム１Ａ＞
図８は、第２実施形態に係るシステム１Ａを示す。なお、図１に示されたシステム１と略同一のものには同一の符号を付け、説明を省略する。システム１Ａは装置２００Ａを備える。装置２００Ａは、偏差取得部２０３Ａと、不安定状態検出部２１１Ａと、第２供給部２１２Ａと、シフト量出力モデル２１３Ａと、制御パラメータ取得部２０４Ａと、学習処理部２１４Ａとを有してよい。 <2. Second embodiment>
<2.1. System 1A>
Fig. 8 shows a system 1A according to the second embodiment. Components that are substantially the same as those in the system 1 shown in Fig. 1 are given the same reference numerals, and descriptions thereof will be omitted. The system 1A includes an apparatus 200A. The apparatus 200A may include a deviation acquisition unit 203A, an unstable state detection unit 211A, a second supply unit 212A, a shift amount output model 213A, a control parameter acquisition unit 204A, and a learning processing unit 214A.

＜２．１．１．偏差取得部２０３Ａ＞
偏差取得部２０３Ａは、上述の第１実施形態における偏差取得部２０３と同様にして、制御対象１０１に関する状態の測定値ＰＶおよび目標値ＳＰの偏差を取得する。偏差取得部２０３Ａは、取得した偏差の変化速度をさらに取得してよい。例えば、偏差取得部２０３Ａは、偏差を取得するごとに、直近の２つの偏差の変化量を取得タイミングのインターバルで除算することで、偏差の変化速度を算出してよい。偏差取得部２０３Ａは、取得した偏差を第１供給部２０５および不安定状態検出部２１１Ａに供給してよい。偏差取得部２０３Ａは、取得した偏差と、その変化速度とを第２供給部２１２Ａに供給してよい。偏差取得部２０３Ａは、取得した偏差と、その変化速度とを、図示しない記憶部に記憶させてよい。 <2.1.1. Deviation Acquisition Unit 203A>
The deviation acquisition unit 203A acquires the deviation between the measured value PV and the target value SP of the state of the control target 101 in the same manner as the deviation acquisition unit 203 in the first embodiment described above. For example, each time the deviation is acquired, the deviation acquisition unit 203A calculates the rate of change of the deviation by dividing the amount of change between the most recent two deviations by the interval between the acquisition timings. The deviation acquisition unit 203A may supply the acquired deviation to the first supply unit 205 and the unstable state detection unit 211A. The deviation acquisition section 203A may supply the deviation and the rate of change therein to the supply section 212A. The deviation acquisition section 203A may store the acquired deviation and the rate of change therein in a storage section (not shown).

＜２．１．２．不安定状態検出部２１１Ａ＞
不安定状態検出部２１１Ａは、第１検出部の一例であり、推奨制御パラメータＰｒにより制御対象１０１が制御された後に偏差取得部２０３Ａにより取得される偏差が基準範囲内に安定しないことを検出する。基準範囲は０を中央に含む任意の大きさの範囲であってよい。偏差が基準範囲に安定しないとは、制御部２０７から制御対象１０１に制御パラメータＰが出力されて第１の基準時間が経過した後に、第２の基準時間内の少なくとも一時点で偏差が基準範囲外となることであってよい。偏差が基準範囲内に安定しない状況は、設備１００の外部環境の変化（外乱とも称する）によって生じてよい。不安定状態検出部２１１Ａは、偏差が基準範囲内に安定しないことを検出したことに応じて、その旨を示す信号（不安定状態検出信号とも称する）を制御パラメータ取得部２０４Ａに供給してよい。 <2.1.2. Unstable state detection unit 211A>
The unstable state detection unit 211A is an example of a first detection unit, and detects that the deviation acquired by the deviation acquisition unit 203A after the control target 101 is controlled by the recommended control parameter Pr is not stable within the reference range. The reference range may be a range of any size including 0 at the center. The deviation not being stable within the reference range may mean that the deviation is outside the reference range at least at one point within the second reference time after the control parameter P is output from the control unit 207 to the control target 101 and the first reference time has elapsed. The situation in which the deviation is not stable within the reference range may be caused by a change in the external environment of the equipment 100 (also referred to as a disturbance). In response to detecting that the deviation is not stable within the reference range, the unstable state detection unit 211A may supply a signal indicating that fact (also referred to as an unstable state detection signal) to the control parameter acquisition unit 204A.

＜２．１．３．第２供給部２１２Ａ＞
第２供給部２１２Ａは、シフト量出力モデル２１３Ａに対し、偏差取得部２０３Ａにより取得された偏差と、当該偏差の変化速度とを供給する。第２供給部２１２Ａは、基準インターバルごとにシフト量出力モデル２１３Ａに対する供給を行ってよい。基準インターバルは、制御部２０７による制御対象１０１の制御周期以上の長さであってよい。基準インターバルは制御対象１０１の時定数に応じた長さ（一例として時定数に整数を乗算した長さ）を有してよい。 <2.1.3. Second supply unit 212A>
The second supply unit 212A supplies the deviation acquired by the deviation acquisition unit 203A and the rate of change of the deviation to the shift amount output model 213A. The reference interval may be equal to or longer than the control period of the control target 101 by the control unit 207. The reference interval may be set to a length according to the time constant of the control target 101 (for example, the time The length of the byte may be a constant multiplied by an integer.

＜２．１．４．シフト量出力モデル２１３Ａ＞
シフト量出力モデル２１３Ａは、偏差および当該偏差の変化速度が入力されることに応じて、制御パラメータ取得部２０４により制御パラメータＰについてシフトすることを推奨する推奨シフト量Δｐｒを出力する。シフト量出力モデル２１３Ａは、第２供給部２１２Ａから偏差および変化速度が入力されることに応じて、制御パラメータ取得部２０４Ａに推奨シフト量Δｐｒを出力してよい。 <2.1.4. Shift amount output model 213A>
In response to input of the deviation and the rate of change of the deviation, the shift amount output model 213A outputs a recommended shift amount Δpr that recommends a shift of the control parameter P by the control parameter acquisition unit 204. In response to input of the deviation and the rate of change from the second supply unit 212A, the shift amount output model 213A may output the recommended shift amount Δpr to the control parameter acquisition unit 204A.

シフト量出力モデル２１３Ａは、偏差および偏差の変化速度の組み合わせと、推奨シフト量Δｐｒとの対応関係を示してよい。一例としてシフト量出力モデル２１３Ａは、偏差および変化速度の組み合わせと、推奨シフト量Δｐｒとの対応関係をマッピングしたシフト量マップであってよい。これに代えて、シフト量出力モデル２１３Ａは、偏差および変化速度の組み合わせと、推奨シフト量Δｐｒとを対応付けたテーブルであってもよい。シフト量出力モデル２１３Ａは、学習処理部２１４Ａによる学習処理によって生成されてよく、図示しない記憶部に記憶されていてよい。 The shift amount output model 213A may indicate the correspondence between the combination of the deviation and the rate of change of the deviation, and the recommended shift amount Δpr. As an example, the shift amount output model 213A may be a shift amount map that maps the correspondence between the combination of the deviation and the rate of change, and the recommended shift amount Δpr. Alternatively, the shift amount output model 213A may be a table that associates the combination of the deviation and the rate of change with the recommended shift amount Δpr. The shift amount output model 213A may be generated by a learning process by the learning processing unit 214A, and may be stored in a storage unit (not shown).

＜２．１．５．制御パラメータ取得部２０４Ａ＞
制御パラメータ取得部２０４Ａは、上記第１実施形態における制御パラメータ取得部２０４と同様にして、シフト済み制御パラメータＰ＋Δｐを取得する。 <2.1.5. Control parameter acquisition unit 204A>
The control parameter acquisition unit 204A acquires the shifted control parameter P+Δp in the same manner as the control parameter acquisition unit 204 in the first embodiment.

これに加えて、制御パラメータ取得部２０４Ａは、偏差取得部２０３Ａにより取得される偏差が基準範囲内に安定しないことが不安定状態検出部２１１Ａにより検出されたことに応じて、制御対象１０１に対して供給された制御パラメータＰ（本実施形態では一例として制御部２０７から供給される制御パラメータＰ）をシフトさせて、シフト済み制御パラメータＰ＋Δｐを取得してよい。 In addition, in response to detection by the unstable state detection unit 211A that the deviation acquired by the deviation acquisition unit 203A is not stable within the reference range, the control parameter acquisition unit 204A may shift the control parameter P supplied to the control target 101 (the control parameter P supplied from the control unit 207 in this embodiment as an example) to acquire the shifted control parameter P+Δp.

制御パラメータ取得部２０４Ａは、偏差取得部２０３Ａにより取得される偏差が基準範囲内に安定しないことが不安定状態検出部２１１Ａにより検出され、かつ、第２供給部２１２Ａからシフト量出力モデル２１３Ａに対する供給が行われたことに応じて当該シフト量出力モデル２１３Ａから出力される推奨シフト量Δｐｒだけ、制御対象１０１に対して供給された制御パラメータＰをシフトさせてよい。 When the unstable state detection unit 211A detects that the deviation acquired by the deviation acquisition unit 203A is not stable within the reference range, and when a supply is made from the second supply unit 212A to the shift amount output model 213A, the control parameter acquisition unit 204A may shift the control parameter P supplied to the control object 101 by the recommended shift amount Δpr output from the shift amount output model 213A.

制御パラメータ取得部２０４Ａは、取得したシフト済み制御パラメータＰ＋Δｐを第１供給部２０５に供給してよい。なお、本実施形態では一例として、制御パラメータ取得部２０４Ａは、目標値ＳＰが基準目標値であり、かつ、偏差が目標値ＳＰに安定している場合には、制御部２０７から取得した制御パラメータＰをシフトさせずに第１供給部２０５に供給することとして説明するが、当該制御パラメータＰを０だけシフトさせたシフト済み制御パラメータＰ＋Δｐとして第１供給部２０５に供給してもよい。 The control parameter acquisition unit 204A may supply the acquired shifted control parameter P+Δp to the first supply unit 205. Note that, as an example in this embodiment, when the target value SP is a reference target value and the deviation is stable at the target value SP, the control parameter acquisition unit 204A supplies the control parameter P acquired from the control unit 207 to the first supply unit 205 without shifting it, but the control parameter P may be shifted by 0 to supply it to the first supply unit 205 as a shifted control parameter P+Δp.

＜２．１．６．学習処理部２１４Ａ＞
学習処理部２１４Ａは、第１学習処理部の一例であり、偏差取得部２０３Ａにより取得される偏差と、当該偏差の変化速度と、制御パラメータ取得部２０４Ａによる制御パラメータＰのシフト量Δｐとを含む学習データを用いてシフト量出力モデル２１３Ａの学習処理を行う。学習データに含まれる偏差および変化速度は、目標値ＳＰが基準目標値である場合に取得される偏差および変化速度であってよい。 <2.1.6. Learning processing unit 214A>
The learning processing unit 214A is an example of a first learning processing unit, and performs learning processing of the shift amount output model 213A using learning data including the deviation acquired by the deviation acquiring unit 203A, the rate of change of the deviation, and the shift amount Δp of the control parameter P acquired by the control parameter acquiring unit 204A. The deviation and the rate of change included in the learning data may be the deviation and the rate of change acquired when the target value SP is a reference target value.

学習データは、システム１のシミュレータ（図示せず）によって生成されてよく、設備１００に対して人為的に任意の外乱を与えたシミュレーションにおいて偏差取得部２０３Ａにより取得される偏差と、偏差の変化速度と、制御パラメータ取得部２０４Ａによりシフトされる制御パラメータＰのシフト量Δｐとの組み合わせを含んでよい。シミュレータは、任意のシステム同定技術により設備１００の実測データなどを用いて作成されてよい。シフト量出力モデル２１３Ａの生成後には、学習データは、実際に設備１００を運転して測定値ＰＶが目標値ＳＰに安定しない場合の偏差と、当該偏差の変化速度と、制御パラメータ取得部２０４Ａによる制御パラメータＰのシフト量Δｐとを含んでよい。 The learning data may be generated by a simulator (not shown) of the system 1, and may include a combination of the deviation acquired by the deviation acquisition unit 203A in a simulation in which an arbitrary disturbance is artificially applied to the equipment 100, the rate of change of the deviation, and the shift amount Δp of the control parameter P shifted by the control parameter acquisition unit 204A. The simulator may be created using actual measurement data of the equipment 100 by any system identification technique. After the shift amount output model 213A is generated, the learning data may include the deviation when the measurement value PV does not stabilize at the target value SP when the equipment 100 is actually operated, the rate of change of the deviation, and the shift amount Δp of the control parameter P by the control parameter acquisition unit 204A.

学習処理部２１４Ａは、偏差および当該偏差の変化速度の入力に応じ、報酬値を高めるために推奨される推奨シフト量Δｐｒを出力するようシフト量出力モデル２１３Ａの学習を行ってよい。推奨シフト量Δｐｒは、所定の時点（一例として偏差および変化速度の取得時点）での制御対象１０１の状態に対応する報酬値（一例としてその時点の測定値ＰＶに応じた値を報酬関数に入力して得られる報酬値）を基準報酬値とした場合に、当該基準報酬値よりも報酬値を高くするために推奨されるシフト量であってよい。一例として、学習処理部２１４Ａは、カーネルダイナミックポリシープログラミング法のアルゴリズムにより学習を行ってよい。報酬値は、予め設定された報酬関数により定まる値であってよい。報酬関数は、偏差に基づく関数であってよく、一例として、偏差が小さいほど報酬値が大きくなる関数であってよい。なお、偏差取得部２０３Ａにより複数の物理量のそれぞれについて偏差が取得される場合には、報酬関数は複数の偏差の総和に基づく関数であってもよいし、複数の偏差を重み付け加算した結果に基づく関数であってもよい。 The learning processing unit 214A may learn the shift amount output model 213A so as to output a recommended shift amount Δpr recommended for increasing the reward value in response to the input of the deviation and the change rate of the deviation. The recommended shift amount Δpr may be a shift amount recommended for increasing the reward value higher than the reference reward value when the reward value (for example, the reward value obtained by inputting a value corresponding to the measurement value PV at that time into the reward function) corresponding to the state of the control object 101 at a predetermined time point (for example, the time point at which the deviation and the change rate are acquired) is set as the reference reward value. As an example, the learning processing unit 214A may learn using an algorithm of the kernel dynamic policy programming method. The reward value may be a value determined by a preset reward function. The reward function may be a function based on the deviation, and as an example, a function in which the smaller the deviation, the larger the reward value. Note that, when the deviation acquisition unit 203A acquires deviations for each of the multiple physical quantities, the reward function may be a function based on the sum of the multiple deviations, or may be a function based on the result of weighted addition of the multiple deviations.

以上の装置２００Ａによれば、取得される偏差が目標値ＳＰに安定しないことが検出されたことに応じて、制御対象１０１に対して供給された制御パラメータＰがシフトされてシフト済み制御パラメータＰ＋Δｐが取得される。従って、単一の制御モデル２０６を用い、外乱の発生に応じた推奨制御パラメータＰｒを取得することができる。また、外乱の発生時にも推奨制御パラメータＰｒを取得することができるため、装置２００Ａのロバスト性を向上させることができる。 According to the above-described device 200A, in response to detection that the acquired deviation is not stable at the target value SP, the control parameter P supplied to the control object 101 is shifted to acquire the shifted control parameter P+Δp. Therefore, a single control model 206 can be used to acquire the recommended control parameter Pr in response to the occurrence of a disturbance. In addition, since the recommended control parameter Pr can be acquired even when a disturbance occurs, the robustness of the device 200A can be improved.

また、偏差と、当該偏差の変化速度とに応じてシフト量出力モデル２１３Ａから出力される推奨シフト量Δｐｒだけ制御パラメータＰがシフトされる。従って、予めシフト量出力モデル２１３Ａを生成しておくことにより、外乱が発生した場合の推奨制御パラメータＰｒを容易に取得することができる。 The control parameter P is shifted by the recommended shift amount Δpr output from the shift amount output model 213A according to the deviation and the rate of change of the deviation. Therefore, by generating the shift amount output model 213A in advance, the recommended control parameter Pr when a disturbance occurs can be easily obtained.

また、基準インターバルごとに偏差と、当該偏差の変化速度とがシフト量出力モデル２１３Ａに供給されるので、外乱の状況に合わせてシフト量Δｐを変更し、適切な推奨制御パラメータＰｒを取得することができる。 In addition, the deviation and the rate of change of the deviation are supplied to the shift amount output model 213A for each reference interval, so that the shift amount Δp can be changed according to the disturbance situation and an appropriate recommended control parameter Pr can be obtained.

また、偏差と、当該偏差の変化速度と、制御パラメータＰのシフト量Δｐとを含む学習データを用い、偏差および当該偏差の変化速度の入力に応じ既定の報酬関数により定まる報酬値を高めるために推奨される推奨シフト量Δｐｒを出力するようシフト量出力モデル２１３Ａの学習処理が行われる。従って、適切な推奨シフト量Δｐｒをシフト量出力モデル２１３Ａから確実に取得することができる。 In addition, using learning data including the deviation, the rate of change of the deviation, and the shift amount Δp of the control parameter P, the shift amount output model 213A performs a learning process to output a recommended shift amount Δpr that is recommended to increase the reward value determined by a default reward function in response to the input of the deviation and the rate of change of the deviation. Therefore, an appropriate recommended shift amount Δpr can be reliably obtained from the shift amount output model 213A.

＜２．２．シフト量出力モデル２１３Ａ＞
図９は、シフト量出力モデル２１３Ａの一例を示す。なお、図９において縦軸は偏差の変化速度を示し、横軸は偏差を示す。 2.2. Shift amount output model 213A
9 shows an example of the shift amount output model 213 A. In FIG. 9, the vertical axis indicates the rate of change of the deviation, and the horizontal axis indicates the deviation.

シフト量出力モデル２１３Ａは、偏差および変化速度の組み合わせと、推奨シフト量との対応関係を示してよい。本例のシフト量出力モデル２１３Ａは、偏差および制御パラメータＰの組み合わせと、推奨シフト量Δｐｒとの対応関係をマッピングしたシフト量マップであってよい。シフト量出力モデル２１３Ａは、偏差と変化速度との組み合わせに応じて、それぞれ別々の推奨シフト量Δｐｒに対応付けられた複数の領域に分けられてよく、入力される偏差および変化速度の組み合わせの座標位置に対応付けられた推奨シフト量Δｐｒを出力してよい。 The shift amount output model 213A may indicate the correspondence between the combination of deviation and change rate and the recommended shift amount. In this example, the shift amount output model 213A may be a shift amount map that maps the correspondence between the combination of deviation and control parameter P and the recommended shift amount Δpr. The shift amount output model 213A may be divided into multiple regions that correspond to different recommended shift amounts Δpr according to the combination of deviation and change rate, and may output the recommended shift amount Δpr that corresponds to the coordinate position of the input combination of deviation and change rate.

なお、シフト量出力モデル２１３Ａは、シフト量マップの全域に関する情報を含んでよい。これに代えて、シフト量出力モデル２１３Ａは、各領域の境界を示す情報（一例として境界を示す座標や関数式）と、当該領域に対応する推奨シフト量Δｐｒとのみを含んでもよい。この場合には、シフト量出力モデル２１３Ａを記憶するための記憶領域を小さくすることができる。 The shift amount output model 213A may include information about the entire area of the shift amount map. Alternatively, the shift amount output model 213A may include only information indicating the boundaries of each area (for example, coordinates or a function indicating the boundaries) and the recommended shift amount Δpr corresponding to the area. In this case, the storage area for storing the shift amount output model 213A can be reduced.

図１０は、シフト量出力モデル２１３Ａの他の例を示す。この図に示すように、シフト量出力モデル２１３Ａは、偏差および変化速度の組み合わせと、推奨シフト量Δｐｒとを対応付けたテーブルであってもよい。 Figure 10 shows another example of the shift amount output model 213A. As shown in this figure, the shift amount output model 213A may be a table that associates combinations of deviation and rate of change with the recommended shift amount Δpr.

＜２．３．動作＞
図１１は、装置２００Ａの動作を示す。装置２００Ａは、ステップＳ１１～Ｓ４５の処理を行うことにより、制御対象１０１を制御してよい。なお、この動作は装置２００Ａが起動されることに応じて開始してよい。また、動作の開始時点においては変更量出力モデル２０６１およびシフト量出力モデル２１３Ａの学習処理が完了していてよい。第２実施形態に係る装置２００Ａの動作は、第１実施形態に係る装置２００の動作と比較してステップＳ２７，Ｓ２９の間にステップＳ３１～Ｓ４５の処理を行う点で異なっている。 <2.3. Operation>
11 shows the operation of the device 200A. The device 200A may control the control target 101 by performing the processes of steps S11 to S45. Note that this operation is performed in response to the device 200A being started. At the start of the operation, the learning process of the change amount output model 2061 and the shift amount output model 213A may be completed. The operation of the device 200A according to the second embodiment may be performed in the same manner as in the first embodiment. 10. This differs from the operation of the apparatus 200 in that steps S31 to S45 are performed between steps S27 and S29.

ステップＳ３１において測定値取得部２０１は、制御対象１０１に関する状態の測定値ＰＶを取得する。目標値取得部２０２は、ステップＳ１３と同様にして測定値ＰＶを取得してよい。 In step S31, the measurement value acquisition unit 201 acquires the measurement value PV of the state of the control object 101. The target value acquisition unit 202 may acquire the measurement value PV in the same manner as in step S13.

ステップＳ３３において偏差取得部２０３Ａは、ステップＳ１１で取得された目標値ＳＰと、ステップＳ３１で取得された測定値ＰＶとの偏差を取得する。また、偏差取得部２０３Ａは、取得した偏差の変化速度をさらに取得する。 In step S33, the deviation acquisition unit 203A acquires the deviation between the target value SP acquired in step S11 and the measurement value PV acquired in step S31. The deviation acquisition unit 203A further acquires the rate of change of the acquired deviation.

ステップＳ３５において不安定状態検出部２１１Ａは、偏差が基準範囲内に安定するか否かを判定する。不安定状態検出部２１１Ａは、ステップＳ２７の処理により制御部２０７から制御対象１０１に制御パラメータＰが出力されて第１の基準時間が経過した後に、第２の基準時間内の少なくとも一時点で偏差が基準範囲外となったことに応じて、偏差が基準範囲内に安定しないことを検出し、その旨の判定を行ってよい。 In step S35, the unstable state detection unit 211A determines whether the deviation stabilizes within the reference range. After the control parameter P is output from the control unit 207 to the control object 101 by the processing of step S27 and the first reference time has elapsed, the unstable state detection unit 211A may detect that the deviation does not stabilize within the reference range in response to the deviation being outside the reference range at least at one point within the second reference time, and make a determination to that effect.

ステップＳ３５において偏差が基準範囲内に安定すると判定された場合（ステップＳ３５；Ｙｅｓ）にはステップＳ２９に処理が移行してよい。ステップＳ３５において偏差が基準範囲内に安定しないと判定された場合（ステップＳ３５；Ｎｏ）には、ステップＳ３７に処理が移行してよい。 If it is determined in step S35 that the deviation is stable within the reference range (step S35; Yes), the process may proceed to step S29. If it is determined in step S35 that the deviation is not stable within the reference range (step S35; No), the process may proceed to step S37.

ステップＳ３７において制御パラメータ取得部２０４Ａは、制御パラメータＰのシフト量Δｐを決定する。制御パラメータ取得部２０４Ａは、ステップＳ３３で取得された偏差および変化速度がシフト量出力モデル２１３Ａに供給されることに応じて当該シフト量出力モデル２１３Ａから出力される推奨シフト量Δｐｒをシフト量Δｐとして決定してよい。 In step S37, the control parameter acquisition unit 204A determines a shift amount Δp of the control parameter P. In response to the deviation and rate of change acquired in step S33 being supplied to the shift amount output model 213A, the control parameter acquisition unit 204A may determine the recommended shift amount Δpr output from the shift amount output model 213A as the shift amount Δp.

ステップＳ３９において制御パラメータ取得部２０４Ａは、制御対象１０１に対して供給された制御パラメータＰを取得する。制御パラメータ取得部２０４Ａは、直近の制御周期において制御対象１０１に供給された制御パラメータＰを制御部２０７Ａから取得してよい。一例として、制御パラメータ取得部２０４Ａは、ステップＳ２７および後述のステップＳ４５のうち、直近に実行された処理で制御部２０７から制御対象１０１に出力される制御パラメータＰを取得して一時保存しておき、ステップＳ３９において読み出してよい。 In step S39, the control parameter acquisition unit 204A acquires the control parameters P supplied to the control object 101. The control parameter acquisition unit 204A may acquire the control parameters P supplied to the control object 101 in the most recent control cycle from the control unit 207A. As an example, the control parameter acquisition unit 204A may acquire and temporarily store the control parameters P output from the control unit 207 to the control object 101 in the most recently executed process of step S27 or step S45 described below, and read them out in step S39.

ステップＳ４１において制御パラメータ取得部２０４Ａは、ステップＳ３９で取得された制御パラメータＰを、ステップＳ３７で決定されたシフト量Δｐだけシフトさせてシフト済み制御パラメータＰ＋Δｐを取得する。 In step S41, the control parameter acquisition unit 204A shifts the control parameter P acquired in step S39 by the shift amount Δp determined in step S37 to acquire the shifted control parameter P+Δp.

ステップＳ４３において第１供給部２０５は、ステップＳ４１で取得されたシフト済み制御パラメータＰ＋Δｐと、ステップＳ３３で取得された偏差とを制御モデル２０６に供給する。 In step S43, the first supply unit 205 supplies the shifted control parameter P+Δp obtained in step S41 and the deviation obtained in step S33 to the control model 206.

ステップＳ４５において制御部２０７は、制御モデル２０６からの推奨制御パラメータＰｒを出力する。制御部２０７は、推奨制御パラメータＰｒを制御パラメータＰとして制御対象１０１に供給して、制御対象１０１を制御してよい。ステップＳ４５の処理が完了したら、ステップＳ３１に処理が移行してよい。 In step S45, the control unit 207 outputs the recommended control parameters Pr from the control model 206. The control unit 207 may supply the recommended control parameters Pr to the control object 101 as control parameters P to control the control object 101. When the processing of step S45 is completed, the processing may proceed to step S31.

＜２．４．動作例＞
図１２は、外乱が生じる場合の測定値ＰＶと、制御パラメータＰとの推移を示す。図中の横軸は時間（秒）を示し、縦軸は測定値ＰＶおよび制御パラメータＰの値を示す。なお、本図では一例として制御パラメータＰは、バルブの開度の指示値ＩＶを示しており、測定値ＰＶおよび制御パラメータＰは０～１００の範囲内に正規化されている。この図に示されるように、本実施形態に係る装置２００Ａでは、外乱が生じる場合であっても、目標値ＳＰに測定値ＰＶが一致するよう制御対象１０１が制御される。 <2.4. Operation example>
12 shows the transition of the measurement value PV and the control parameter P when a disturbance occurs. The horizontal axis in the figure indicates time (seconds), and the vertical axis indicates the values of the measurement value PV and the control parameter P. Note that in this figure, as an example, the control parameter P indicates the indication value IV of the valve opening, and the measurement value PV and the control parameter P are normalized within the range of 0 to 100. As shown in this figure, in the device 200A according to this embodiment, even when a disturbance occurs, the controlled object 101 is controlled so that the measurement value PV coincides with the target value SP.

＜３．第３実施形態＞
＜３．１．システム１Ｂ＞
図１３は、第３実施形態に係るシステム１Ｂを示す。なお、図１，図２に示されたシステム１，１Ａと略同一のものには同一の符号を付け、説明を省略する。システム１Ｂは装置２００Ｂを備える。本実施形態に係る装置２００Ｂは、制御対象１０１を変更可能となっている。変更前後の制御対象１０１は、互いにプロセス動特性が近似してよく、それぞれ１次遅れ系であってもよいし、それぞれ２次遅れ系であってもよい。別言すれば、変更前後の制御対象１０１は、入出力の関係を示す伝達関数の分母の次数が等しくてよい。装置２００Ｂは、対象情報取得部２２１Ｂと、制御部２０７Ｂと、制御パラメータ取得部２０４Ｂとを有してよい。 <3. Third embodiment>
<3.1. System 1B>
FIG. 13 shows a system 1B according to the third embodiment. The same reference numerals are used for the parts that are substantially the same as those in the systems 1 and 1A shown in FIG. 1 and FIG. 2, and the description thereof is omitted. The system 1B includes an apparatus 200B. The apparatus 200B according to this embodiment is capable of changing the controlled object 101. The controlled object 101 before and after the change may have similar process dynamic characteristics, and may be a first-order lag system or a second-order lag system. In other words, the controlled object 101 before and after the change may have the same order of the denominator of the transfer function indicating the relationship between the input and output. The apparatus 200B may include an object information acquisition unit 221B, a control unit 207B, and a control parameter acquisition unit 204B.

＜３．１．１．対象情報取得部２２１Ｂ＞
対象情報取得部２２１Ｂは、第２検出部の一例であり、制御対象１０１が変更されたことを検出する。対象情報取得部２２１Ｂは、制御対象１０１が変更されるごとに、変更後の制御対象１０１の識別情報をオペレータから取得することにより、制御対象１０１が変更されたことを検出してよい。対象情報取得部２２１Ｂは、制御対象１０１が変更されたことに応じて、変更後の制御対象１０１の識別情報を示す信号（制御対象変更信号とも称する）を制御部２０７Ｂおよび制御パラメータ取得部２０４Ｂに供給してよい。 <3.1.1. Target information acquisition unit 221B>
The target information acquisition unit 221B is an example of a second detection unit, and detects that the control target 101 has been changed. The target information acquisition unit 221B may detect that the control target 101 has been changed by acquiring identification information of the changed control target 101 from the operator every time the control target 101 is changed. In response to the change in the control target 101, the target information acquisition unit 221B may supply a signal indicating the identification information of the changed control target 101 (also referred to as a control target change signal) to the control unit 207B and the control parameter acquisition unit 204B.

＜３．１．２．制御部２０７Ｂ＞
制御部２０７Ｂは、上記第１実施形態における制御部２０７Ｂと同様にして、第１供給部２０５から制御モデル２０６に対する供給が行われたことに応じて当該制御モデル２０６から出力される推奨制御パラメータＰｒを制御パラメータＰとして出力する。制御部２０７Ｂは、対象情報取得部２２１Ｂから制御対象変更信号を受信したことに応じて、変更後の制御対象１０１に制御パラメータＰを出力して、変更後の制御対象１０１を制御してよい。 <3.1.2. Control unit 207B>
Similar to the control unit 207B in the first embodiment, in response to a supply from the first supply unit 205 to the control model 206, the control unit 207B outputs the recommended control parameters Pr output from the control model 206 as control parameters P. In response to receiving a control target change signal from the target information acquisition unit 221B, the control unit 207B may output the control parameters P to the changed control target 101 to control the changed control target 101.

＜３．１．３．制御パラメータ取得部２０４Ｂ＞
制御パラメータ取得部２０４Ｂは、上記第２実施形態における制御パラメータ取得部２０４Ａと同様にして、シフト済み制御パラメータＰ＋Δｐを取得する。 <3.1.3. Control parameter acquisition unit 204B>
The control parameter acquisition unit 204B acquires the shifted control parameter P+Δp in the same manner as the control parameter acquisition unit 204A in the second embodiment.

これに加えて、制御パラメータ取得部２０４Ｂは、制御対象１０１が変更されたことが対象情報取得部２２１Ｂにより検出されたことに応じて、制御対象１０１に対して供給された制御パラメータＰ（本実施形態では一例として制御部２０７から供給される制御パラメータＰ）をシフトさせてシフト済み制御パラメータＰ＋Δｐを取得してよい。制御対象１０１に対して供給された制御パラメータＰは、変更前の制御対象１０１に対して供給された制御パラメータＰであってもよいし、変更後の制御対象１０１に対して供給された制御パラメータＰであってもよい。 In addition, in response to detection by the target information acquisition unit 221B that the control target 101 has been changed, the control parameter acquisition unit 204B may shift the control parameter P supplied to the control target 101 (the control parameter P supplied from the control unit 207 as an example in this embodiment) to acquire a shifted control parameter P+Δp. The control parameter P supplied to the control target 101 may be the control parameter P supplied to the control target 101 before the change, or may be the control parameter P supplied to the control target 101 after the change.

制御対象１０１が変更された場合に、制御パラメータ取得部２０４Ｂは、変更前後の各制御対象１０１の平衡点での制御パラメータＰの値の差分に基づいてシフト量Δｐを決定してよい。 When the control object 101 is changed, the control parameter acquisition unit 204B may determine the shift amount Δp based on the difference between the values of the control parameter P at the equilibrium point of each control object 101 before and after the change.

例えば、制御パラメータ取得部２０４Ｂは、予め目標値ＳＰを基準目標値に設定して変更前後の制御対象１０１をそれぞれ用いた場合に取得された平衡点Ｏでの制御パラメータＰの値の差分を、シフト量Δｐとして決定してよい。この場合には、制御パラメータ取得部２０４Ｂは、制御対象１０１ごとに、目標値ＳＰを基準目標値に設定した場合の平衡点Ｏでの制御パラメータＰの値を予め記憶してよい。 For example, the control parameter acquisition unit 204B may determine, as the shift amount Δp, the difference between the values of the control parameter P at the equilibrium point O acquired when the target value SP is set to a reference target value in advance and the control object 101 before and after the change are used. In this case, the control parameter acquisition unit 204B may store in advance, for each control object 101, the value of the control parameter P at the equilibrium point O when the target value SP is set to a reference target value.

これに代えて、制御パラメータ取得部２０４Ｂは、予め目標値ＳＰを現在の設定値と同じ値に設定して変更前後の制御対象１０１をそれぞれ用いた場合に取得された平衡点Ｏでの制御パラメータＰの値の差分を、シフト量Δｐとして決定してよい。この場合には、制御パラメータ取得部２０４Ｂは、制御対象１０１ごとに、目標値ＳＰと、平衡点Ｏでの制御パラメータＰとの関係を示す関係式を予め記憶してよく、現在の目標値ＳＰに対応する平衡点での制御パラメータＰの値を当該関数式から算出してよい。 Alternatively, the control parameter acquisition unit 204B may determine, as the shift amount Δp, the difference between the values of the control parameter P at the equilibrium point O acquired when the target value SP is set in advance to the same value as the current setting value and the control object 101 before and after the change are used. In this case, the control parameter acquisition unit 204B may store in advance, for each control object 101, a relational expression showing the relationship between the target value SP and the control parameter P at the equilibrium point O, and may calculate the value of the control parameter P at the equilibrium point corresponding to the current target value SP from the function expression.

制御パラメータ取得部２０４Ｂは、上述のステップＳ１９において目標値ＳＰが基準目標値であり、かつ、制御対象１０１が変更されたことに応じて、変更前後の各制御対象１０１の平衡点での制御パラメータＰの値の差分に基づいてシフト量Δｐを決定し、シフト済み制御パラメータＰ＋Δｐを第１供給部２０５に供給してよい。制御対象１０１が変更された後、目標値ＳＰが基準目標値から変更される場合には、制御パラメータ取得部２０４Ｂは、変更後の制御対象１０１について目標値ＳＰと平衡点Ｏでの制御パラメータＰの値との関係を示す上述の関係式を用いてシフト量Δｐを決定してよい。 In the above-mentioned step S19, when the target value SP is the reference target value and the control object 101 has been changed, the control parameter acquisition unit 204B may determine a shift amount Δp based on the difference between the values of the control parameter P at the equilibrium point of each control object 101 before and after the change, and supply the shifted control parameter P+Δp to the first supply unit 205. When the target value SP is changed from the reference target value after the control object 101 is changed, the control parameter acquisition unit 204B may determine the shift amount Δp using the above-mentioned relational expression showing the relationship between the target value SP and the value of the control parameter P at the equilibrium point O for the changed control object 101.

なお、本実施形態では一例として、制御パラメータ取得部２０４Ｂは、目標値ＳＰが基準目標値であり、偏差が目標値ＳＰに安定しており、かつ、制御対象１０１が変更されない場合には、制御部２０７から取得した制御パラメータＰをシフトさせずに第１供給部２０５に供給してよい。これに代えて、制御パラメータ取得部２０４は、当該制御パラメータＰを０だけシフトさせたシフト済み制御パラメータＰ＋Δｐとして第１供給部２０５に供給してもよい。 In this embodiment, as an example, when the target value SP is a reference target value, the deviation is stable at the target value SP, and the control target 101 is not changed, the control parameter acquisition unit 204B may supply the control parameter P acquired from the control unit 207 to the first supply unit 205 without shifting it. Alternatively, the control parameter acquisition unit 204 may supply the control parameter P to the first supply unit 205 as a shifted control parameter P+Δp, which is obtained by shifting the control parameter P by 0.

以上の装置２００Ｂによれば、制御対象１０１が変更されたことに応じて、制御対象１０１に対して供給された制御パラメータＰがシフトされてシフト済み制御パラメータＰ＋Δｐが取得される。従って、単一の制御モデル２０６を用い、制御対象１０１の変更に応じた推奨制御パラメータＰｒを取得することができるため、制御対象１０１ごとに別々の制御モデルを用いる場合と比較して、装置２００Ｂの汎用性を高め、装置２００Ｂを小型化することができる。 According to the above-described device 200B, in response to a change in the control object 101, the control parameter P supplied to the control object 101 is shifted to obtain the shifted control parameter P+Δp. Therefore, since a single control model 206 can be used to obtain the recommended control parameter Pr in response to a change in the control object 101, the versatility of the device 200B can be improved and the device 200B can be made smaller than when a separate control model is used for each control object 101.

また、制御モデル２０６では、偏差と、制御対象１０１に供給済みの制御パラメータＰとに応じて当該制御パラメータＰの推奨変更量Δｐｒが変更量出力モデル２０６１から出力され、当該供給済みの制御パラメータＰと、当該推奨変更量Δｐｒとが加算されて推奨制御パラメータＰｒが算出される。従って、制御パラメータＰをシフト済み制御パラメータＰ＋ΔＰとして制御モデル２０６に入力することで制御モデル２０６を他の制御対象１０１に流用する場合であっても、供給済みの制御パラメータＰをベースとした推奨制御パラメータＰｒを取得することができる。よって、制御対象１０１ごとのプロセスゲインの違いによらず、変更後の制御対象１０１に適合した適切な推奨制御パラメータＰｒを取得することができる。 In the control model 206, the recommended change amount Δpr of the control parameter P is output from the change amount output model 2061 according to the deviation and the control parameter P already supplied to the control object 101, and the recommended change amount Δpr is added to the already supplied control parameter P to calculate the recommended control parameter Pr. Therefore, even if the control model 206 is used for another control object 101 by inputting the control parameter P to the control model 206 as the shifted control parameter P+ΔP, it is possible to obtain a recommended control parameter Pr based on the already supplied control parameter P. Therefore, regardless of the difference in process gain for each control object 101, it is possible to obtain an appropriate recommended control parameter Pr that is suitable for the changed control object 101.

＜３．変形例＞
なお、上記の第１～第３実施形態においては、制御モデル２０６が変更量出力モデル２０６１と加算部２０６２とを有することとして説明したが、偏差および制御パラメータＰが入力されることに応じて推奨制御パラメータＰｒを出力する限りにおいて、これらを有しなくてもよい。この場合には、制御モデル２０６は、カーネルダイナミックポリシープログラミング法や深層強化学習、サポートベクトルマシン、ロジスティック回帰、決定木、ニューラルネットワークなどのアルゴリズムにより生成された学習モデルであってよい。学習処理部２０８は、偏差取得部２０３により取得された偏差と、制御パラメータ取得部２０４により取得された制御パラメータＰと、を含む学習データを用いて制御モデル２０６の学習処理を行ってよい。 3. Modifications
In the above first to third embodiments, the control model 206 has been described as having the change amount output model 2061 and the adder 2062, but as long as the recommended control parameter Pr is output in response to the input of the deviation and the control parameter P, the control model 206 may not have these. In this case, the control model 206 may be a learning model generated by an algorithm such as a kernel dynamic policy programming method, deep reinforcement learning, a support vector machine, a logistic regression, a decision tree, or a neural network. The learning processing unit 208 may perform learning processing of the control model 206 using learning data including the deviation acquired by the deviation acquisition unit 203 and the control parameter P acquired by the control parameter acquisition unit 204.

また、変更量出力モデル２０６１およびシフト量出力モデル２１３Ａをカーネルダイナミックポリシープログラミング法の学習アルゴリズムにより生成されたマップやテーブルとして説明したが、深層強化学習やサポートベクトルマシン、ロジスティック回帰、決定木、ニューラルネットワークなどの他のアルゴリズムにより生成されてもよいし、マップやテーブルとは異なる他の形態のモデルであってもよい。 In addition, the change amount output model 2061 and the shift amount output model 213A have been described as maps and tables generated by the learning algorithm of the kernel dynamic policy programming method, but they may be generated by other algorithms such as deep reinforcement learning, support vector machines, logistic regression, decision trees, and neural networks, or may be models of other forms different from maps and tables.

また、変更量出力モデル２０６１には偏差および制御パラメータＰが入力されることとして説明したが、他の値がさらに入力されてよい。同様に、シフト量出力モデル２１３Ａには偏差および変化速度が入力されることとして説明したが、他の値がさらに入力されてよい。他の値は、例えばセンサ１０２による測定値の微分値や積分値であってよい。 Furthermore, although it has been described that the deviation and the control parameter P are input to the change amount output model 2061, other values may also be input. Similarly, although it has been described that the deviation and the rate of change are input to the shift amount output model 213A, other values may also be input. The other values may be, for example, the differential value or integral value of the measurement value by the sensor 102.

また、装置２００，２００Ａ，２００Ｂが測定値取得部２０１、目標値取得部２０２、学習処理部２０８を有することとして説明したが、これらの何れかを有しなくてもよい。装置２００，２００Ａ，２００Ｂが測定値取得部２０１および目標値取得部２０２を有しない場合には、偏差取得部２０３，２０３Ａは外部機器で算出された偏差を取得してよい。装置２００，２００Ａ，２００Ｂが学習処理部２０８を有しない場合には、予め外部機器で学習された変更量出力モデル２０６１を有してよい。 In addition, although the devices 200, 200A, and 200B have been described as having a measurement value acquisition unit 201, a target value acquisition unit 202, and a learning processing unit 208, they may not have any of these. If the devices 200, 200A, and 200B do not have the measurement value acquisition unit 201 and the target value acquisition unit 202, the deviation acquisition units 203 and 203A may acquire a deviation calculated by an external device. If the devices 200, 200A, and 200B do not have the learning processing unit 208, they may have a change amount output model 2061 that has been learned in advance by an external device.

また、制御パラメータ取得部２０４，２０４Ａ，２０４Ｂはシフト済み制御パラメータＰ＋Δｐを算出して取得することとして説明したが、装置２００，２００Ａ，２００Ｂの外部で算出されたシフト済み制御パラメータＰ＋Δｐを取得してもよい。 In addition, although the control parameter acquisition units 204, 204A, and 204B have been described as calculating and acquiring the shifted control parameter P+Δp, the shifted control parameter P+Δp calculated outside the devices 200, 200A, and 200B may be acquired.

また、上記の第２，第３実施形態においては装置２００Ａ，２００Ｂが学習処理部２１４Ａを有することとして説明したが、学習処理部２１４Ａを有しないこととしてもよい。この場合には、装置２００Ａ，２００Ｂは予め外部機器で学習されたシフト量出力モデル２１３Ａを有してよい。 In addition, in the second and third embodiments, the devices 200A and 200B are described as having the learning processing unit 214A, but they may not have the learning processing unit 214A. In this case, the devices 200A and 200B may have a shift amount output model 213A that has been trained in advance by an external device.

また、上記の第２実施形態においては、制御パラメータ取得部２０４は目標値ＳＰが基準目標値から変更される場合と、偏差が基準範囲内に安定しない場合とに制御パラメータＰをシフトさせることとして説明したが、目標値ＳＰが基準目標値から変更される場合に制御パラメータＰをシフトさせなくてもよい。 In the second embodiment described above, the control parameter acquisition unit 204 shifts the control parameter P when the target value SP is changed from the reference target value and when the deviation is not stabilized within the reference range. However, it is not necessary to shift the control parameter P when the target value SP is changed from the reference target value.

同様に、上記の第３実施形態においては、制御パラメータ取得部２０４は目標値ＳＰが基準目標値から変更される場合と、偏差が基準範囲内に安定しない場合と、制御対象１０１が変更された場合とに制御パラメータＰをシフトさせることとして説明したが、目標値ＳＰが基準目標値から変更される場合と、偏差が基準範囲内に安定しない場合との少なくとも一方で制御パラメータＰをシフトさせなくてもよい。 Similarly, in the above third embodiment, the control parameter acquisition unit 204 has been described as shifting the control parameter P when the target value SP is changed from the reference target value, when the deviation is not stabilized within the reference range, and when the control target 101 is changed, but it is not necessary to shift the control parameter P when at least one of the cases is when the target value SP is changed from the reference target value and when the deviation is not stabilized within the reference range.

また、本発明の様々な実施形態は、フローチャートおよびブロック図を参照して記載されてよく、ここにおいてブロックは、（１）操作が実行されるプロセスの段階または（２）操作を実行する役割を持つ装置のセクションを表わしてよい。特定の段階およびセクションが、専用回路、コンピュータ可読媒体上に格納されるコンピュータ可読命令と共に供給されるプログラマブル回路、および／またはコンピュータ可読媒体上に格納されるコンピュータ可読命令と共に供給されるプロセッサによって実装されてよい。専用回路は、デジタルおよび／またはアナログハードウェア回路を含んでよく、集積回路（ＩＣ）および／またはディスクリート回路を含んでよい。プログラマブル回路は、論理ＡＮＤ、論理ＯＲ、論理ＸＯＲ、論理ＮＡＮＤ、論理ＮＯＲ、および他の論理操作、フリップフロップ、レジスタ、フィールドプログラマブルゲートアレイ（ＦＰＧＡ）、プログラマブルロジックアレイ（ＰＬＡ）等のようなメモリ要素等を含む、再構成可能なハードウェア回路を含んでよい。 Various embodiments of the present invention may also be described with reference to flow charts and block diagrams, where the blocks may represent (1) stages of a process in which operations are performed or (2) sections of an apparatus responsible for performing the operations. Particular stages and sections may be implemented by dedicated circuitry, programmable circuitry provided with computer readable instructions stored on a computer readable medium, and/or a processor provided with computer readable instructions stored on a computer readable medium. Dedicated circuitry may include digital and/or analog hardware circuitry and may include integrated circuits (ICs) and/or discrete circuits. Programmable circuitry may include reconfigurable hardware circuitry including logical AND, logical OR, logical XOR, logical NAND, logical NOR, and other logical operations, memory elements such as flip-flops, registers, field programmable gate arrays (FPGAs), programmable logic arrays (PLAs), and the like.

コンピュータ可読媒体は、適切なデバイスによって実行される命令を格納可能な任意の有形なデバイスを含んでよく、その結果、そこに格納される命令を有するコンピュータ可読媒体は、フローチャートまたはブロック図で指定された操作を実行するための手段を作成すべく実行され得る命令を含む、製品を備えることになる。コンピュータ可読媒体の例としては、電子記憶媒体、磁気記憶媒体、光記憶媒体、電磁記憶媒体、半導体記憶媒体等が含まれてよい。コンピュータ可読媒体のより具体的な例としては、フロッピー（登録商標）ディスク、ディスケット、ハードディスク、ランダムアクセスメモリ（ＲＡＭ）、リードオンリメモリ（ＲＯＭ）、消去可能プログラマブルリードオンリメモリ（ＥＰＲＯＭまたはフラッシュメモリ）、電気的消去可能プログラマブルリードオンリメモリ（ＥＥＰＲＯＭ）、静的ランダムアクセスメモリ（ＳＲＡＭ）、コンパクトディスクリードオンリメモリ（ＣＤ-ＲＯＭ）、デジタル多用途ディスク（ＤＶＤ）、ブルーレイ（ＲＴＭ）ディスク、メモリスティック、集積回路カード等が含まれてよい。 A computer-readable medium may include any tangible device capable of storing instructions that are executed by a suitable device, such that the computer-readable medium having instructions stored thereon comprises an article of manufacture that includes instructions that can be executed to create means for performing the operations specified in the flowchart or block diagram. Examples of computer-readable media may include electronic storage media, magnetic storage media, optical storage media, electromagnetic storage media, semiconductor storage media, and the like. More specific examples of computer-readable media may include floppy disks, diskettes, hard disks, random access memories (RAMs), read-only memories (ROMs), erasable programmable read-only memories (EPROMs or flash memories), electrically erasable programmable read-only memories (EEPROMs), static random access memories (SRAMs), compact disk read-only memories (CD-ROMs), digital versatile disks (DVDs), Blu-ray (RTM) disks, memory sticks, integrated circuit cards, and the like.

コンピュータ可読命令は、アセンブラ命令、命令セットアーキテクチャ（ＩＳＡ）命令、マシン命令、マシン依存命令、マイクロコード、ファームウェア命令、状態設定データ、またはＳｍａｌｌｔａｌｋ（登録商標）、ＪＡＶＡ（登録商標）、Ｃ＋＋等のようなオブジェクト指向プログラミング言語、および「Ｃ」プログラミング言語または同様のプログラミング言語のような従来の手続型プログラミング言語を含む、１または複数のプログラミング言語の任意の組み合わせで記述されたソースコードまたはオブジェクトコードのいずれかを含んでよい。 The computer readable instructions may include either assembler instructions, instruction set architecture (ISA) instructions, machine instructions, machine-dependent instructions, microcode, firmware instructions, state setting data, or source or object code written in any combination of one or more programming languages, including object-oriented programming languages such as Smalltalk®, JAVA®, C++, etc., and conventional procedural programming languages such as the "C" programming language or similar programming languages.

コンピュータ可読命令は、汎用コンピュータ、特殊目的のコンピュータ、若しくは他のプログラム可能なデータ処理装置のプロセッサまたはプログラマブル回路に対し、ローカルにまたはローカルエリアネットワーク（ＬＡＮ）、インターネット等のようなワイドエリアネットワーク（ＷＡＮ）を介して提供され、フローチャートまたはブロック図で指定された操作を実行するための手段を作成すべく、コンピュータ可読命令を実行してよい。プロセッサの例としては、コンピュータプロセッサ、処理ユニット、マイクロプロセッサ、デジタル信号プロセッサ、コントローラ、マイクロコントローラ等を含む。 The computer-readable instructions may be provided to a processor or programmable circuit of a general-purpose computer, special-purpose computer, or other programmable data processing apparatus, either locally or over a wide area network (WAN) such as a local area network (LAN), the Internet, etc., to execute the computer-readable instructions to create means for performing the operations specified in the flowcharts or block diagrams. Examples of processors include computer processors, processing units, microprocessors, digital signal processors, controllers, microcontrollers, etc.

図１４は、本発明の複数の態様が全体的または部分的に具現化されてよいコンピュータ２２００の例を示す。コンピュータ２２００にインストールされたプログラムは、コンピュータ２２００に、本発明の実施形態に係る装置に関連付けられる操作または当該装置の１または複数のセクションとして機能させることができ、または当該操作または当該１または複数のセクションを実行させることができ、および／またはコンピュータ２２００に、本発明の実施形態に係るプロセスまたは当該プロセスの段階を実行させることができる。そのようなプログラムは、コンピュータ２２００に、本明細書に記載のフローチャートおよびブロック図のブロックのうちのいくつかまたはすべてに関連付けられた特定の操作を実行させるべく、ＣＰＵ２２１２によって実行されてよい。 14 shows an example of a computer 2200 in which aspects of the present invention may be embodied in whole or in part. A program installed on the computer 2200 may cause the computer 2200 to function as or perform operations associated with an apparatus or one or more sections of the apparatus according to an embodiment of the present invention, and/or to perform a process or steps of the process according to an embodiment of the present invention. Such a program may be executed by the CPU 2212 to cause the computer 2200 to perform certain operations associated with some or all of the blocks of the flowcharts and block diagrams described herein.

本実施形態によるコンピュータ２２００は、ＣＰＵ２２１２、ＲＡＭ２２１４、グラフィックコントローラ２２１６、およびディスプレイデバイス２２１８を含み、それらはホストコントローラ２２１０によって相互に接続されている。コンピュータ２２００はまた、通信インタフェース２２２２、ハードディスクドライブ２２２４、ＤＶＤ－ＲＯＭドライブ２２２６、およびＩＣカードドライブのような入／出力ユニットを含み、それらは入／出力コントローラ２２２０を介してホストコントローラ２２１０に接続されている。コンピュータはまた、ＲＯＭ２２３０およびキーボード２２４２のようなレガシの入／出力ユニットを含み、それらは入／出力チップ２２４０を介して入／出力コントローラ２２２０に接続されている。 The computer 2200 according to this embodiment includes a CPU 2212, a RAM 2214, a graphics controller 2216, and a display device 2218, which are interconnected by a host controller 2210. The computer 2200 also includes input/output units such as a communication interface 2222, a hard disk drive 2224, a DVD-ROM drive 2226, and an IC card drive, which are connected to the host controller 2210 via an input/output controller 2220. The computer also includes legacy input/output units such as a ROM 2230 and a keyboard 2242, which are connected to the input/output controller 2220 via an input/output chip 2240.

ＣＰＵ２２１２は、ＲＯＭ２２３０およびＲＡＭ２２１４内に格納されたプログラムに従い動作し、それにより各ユニットを制御する。グラフィックコントローラ２２１６は、ＲＡＭ２２１４内に提供されるフレームバッファ等またはそれ自体の中にＣＰＵ２２１２によって生成されたイメージデータを取得し、イメージデータがディスプレイデバイス２２１８上に表示されるようにする。 The CPU 2212 operates according to the programs stored in the ROM 2230 and the RAM 2214, thereby controlling each unit. The graphics controller 2216 retrieves image data generated by the CPU 2212 into a frame buffer or the like provided in the RAM 2214 or into itself, and causes the image data to be displayed on the display device 2218.

通信インタフェース２２２２は、ネットワークを介して他の電子デバイスと通信する。ハードディスクドライブ２２２４は、コンピュータ２２００内のＣＰＵ２２１２によって使用されるプログラムおよびデータを格納する。ＤＶＤ－ＲＯＭドライブ２２２６は、プログラムまたはデータをＤＶＤ－ＲＯＭ２２０１から読み取り、ハードディスクドライブ２２２４にＲＡＭ２２１４を介してプログラムまたはデータを提供する。ＩＣカードドライブは、プログラムおよびデータをＩＣカードから読み取り、および／またはプログラムおよびデータをＩＣカードに書き込む。 The communication interface 2222 communicates with other electronic devices via a network. The hard disk drive 2224 stores programs and data used by the CPU 2212 in the computer 2200. The DVD-ROM drive 2226 reads programs or data from the DVD-ROM 2201 and provides the programs or data to the hard disk drive 2224 via the RAM 2214. The IC card drive reads programs and data from an IC card and/or writes programs and data to an IC card.

ＲＯＭ２２３０はその中に、アクティブ化時にコンピュータ２２００によって実行されるブートプログラム等、および／またはコンピュータ２２００のハードウェアに依存するプログラムを格納する。入／出力チップ２２４０はまた、様々な入／出力ユニットをパラレルポート、シリアルポート、キーボードポート、マウスポート等を介して、入／出力コントローラ２２２０に接続してよい。 ROM 2230 stores therein a boot program, etc., executed by computer 2200 upon activation, and/or a program that depends on the hardware of computer 2200. I/O chip 2240 may also connect various I/O units to I/O controller 2220 via a parallel port, a serial port, a keyboard port, a mouse port, etc.

プログラムが、ＤＶＤ－ＲＯＭ２２０１またはＩＣカードのようなコンピュータ可読媒体によって提供される。プログラムは、コンピュータ可読媒体から読み取られ、コンピュータ可読媒体の例でもあるハードディスクドライブ２２２４、ＲＡＭ２２１４、またはＲＯＭ２２３０にインストールされ、ＣＰＵ２２１２によって実行される。これらのプログラム内に記述される情報処理は、コンピュータ２２００に読み取られ、プログラムと、上記様々なタイプのハードウェアリソースとの間の連携をもたらす。装置または方法が、コンピュータ２２００の使用に従い情報の操作または処理を実現することによって構成されてよい。 The programs are provided by a computer-readable medium such as a DVD-ROM 2201 or an IC card. The programs are read from the computer-readable medium, installed in the hard disk drive 2224, RAM 2214, or ROM 2230, which are also examples of computer-readable media, and executed by the CPU 2212. The information processing described in these programs is read by the computer 2200, and brings about cooperation between the programs and the various types of hardware resources described above. An apparatus or method may be constructed by realizing the manipulation or processing of information according to the use of the computer 2200.

例えば、通信がコンピュータ２２００および外部デバイス間で実行される場合、ＣＰＵ２２１２は、ＲＡＭ２２１４にロードされた通信プログラムを実行し、通信プログラムに記述された処理に基づいて、通信インタフェース２２２２に対し、通信処理を命令してよい。通信インタフェース２２２２は、ＣＰＵ２２１２の制御下、ＲＡＭ２２１４、ハードディスクドライブ２２２４、ＤＶＤ－ＲＯＭ２２０１、またはＩＣカードのような記録媒体内に提供される送信バッファ処理領域に格納された送信データを読み取り、読み取られた送信データをネットワークに送信し、またはネットワークから受信された受信データを記録媒体上に提供される受信バッファ処理領域等に書き込む。 For example, when communication is performed between computer 2200 and an external device, CPU 2212 may execute a communication program loaded into RAM 2214 and instruct communication interface 2222 to perform communication processing based on the processing described in the communication program. Under the control of CPU 2212, communication interface 2222 reads transmission data stored in a transmission buffer processing area provided in RAM 2214, hard disk drive 2224, DVD-ROM 2201, or a recording medium such as an IC card, and transmits the read transmission data to the network, or writes reception data received from the network to a reception buffer processing area or the like provided on the recording medium.

また、ＣＰＵ２２１２は、ハードディスクドライブ２２２４、ＤＶＤ－ＲＯＭドライブ２２２６（ＤＶＤ－ＲＯＭ２２０１）、ＩＣカード等のような外部記録媒体に格納されたファイルまたはデータベースの全部または必要な部分がＲＡＭ２２１４に読み取られるようにし、ＲＡＭ２２１４上のデータに対し様々なタイプの処理を実行してよい。ＣＰＵ２２１２は次に、処理されたデータを外部記録媒体にライトバックする。 The CPU 2212 may also cause all or a necessary portion of a file or database stored on an external recording medium such as the hard disk drive 2224, the DVD-ROM drive 2226 (DVD-ROM 2201), an IC card, etc. to be read into the RAM 2214, and perform various types of processing on the data on the RAM 2214. The CPU 2212 then writes back the processed data to the external recording medium.

様々なタイプのプログラム、データ、テーブル、およびデータベースのような様々なタイプの情報が記録媒体に格納され、情報処理を受けてよい。ＣＰＵ２２１２は、ＲＡＭ２２１４から読み取られたデータに対し、本開示の随所に記載され、プログラムの命令シーケンスによって指定される様々なタイプの操作、情報処理、条件判断、条件分岐、無条件分岐、情報の検索／置換等を含む、様々なタイプの処理を実行してよく、結果をＲＡＭ２２１４に対しライトバックする。また、ＣＰＵ２２１２は、記録媒体内のファイル、データベース等における情報を検索してよい。例えば、各々が第２の属性の属性値に関連付けられた第１の属性の属性値を有する複数のエントリが記録媒体内に格納される場合、ＣＰＵ２２１２は、第１の属性の属性値が指定される、条件に一致するエントリを当該複数のエントリの中から検索し、当該エントリ内に格納された第２の属性の属性値を読み取り、それにより予め定められた条件を満たす第１の属性に関連付けられた第２の属性の属性値を取得してよい。 Various types of information, such as various types of programs, data, tables, and databases, may be stored in the recording medium and may undergo information processing. CPU 2212 may perform various types of processing on data read from RAM 2214, including various types of operations, information processing, conditional judgment, conditional branching, unconditional branching, information search/replacement, etc., as described throughout this disclosure and specified by the instruction sequence of the program, and write back the results to RAM 2214. CPU 2212 may also search for information in a file, database, etc. in the recording medium. For example, if multiple entries each having an attribute value of a first attribute associated with an attribute value of a second attribute are stored in the recording medium, CPU 2212 may search for an entry that matches a condition in which an attribute value of the first attribute is specified from among the multiple entries, read the attribute value of the second attribute stored in the entry, and thereby obtain the attribute value of the second attribute associated with the first attribute that satisfies a predetermined condition.

上で説明したプログラムまたはソフトウェアモジュールは、コンピュータ２２００上またはコンピュータ２２００近傍のコンピュータ可読媒体に格納されてよい。また、専用通信ネットワークまたはインターネットに接続されたサーバーシステム内に提供されるハードディスクまたはＲＡＭのような記録媒体が、コンピュータ可読媒体として使用可能であり、それによりプログラムを、ネットワークを介してコンピュータ２２００に提供する。 The above-described program or software module may be stored on a computer-readable medium on the computer 2200 or in the vicinity of the computer 2200. Also, a recording medium such as a hard disk or RAM provided in a server system connected to a dedicated communication network or the Internet can be used as a computer-readable medium, thereby providing the program to the computer 2200 via the network.

以上、本発明を実施の形態を用いて説明したが、本発明の技術的範囲は上記実施の形態に記載の範囲には限定されない。上記実施の形態に、多様な変更または改良を加えることが可能であることが当業者に明らかである。その様な変更または改良を加えた形態も本発明の技術的範囲に含まれ得ることが、特許請求の範囲の記載から明らかである。 The present invention has been described above using an embodiment, but the technical scope of the present invention is not limited to the scope described in the above embodiment. It is clear to those skilled in the art that various modifications and improvements can be made to the above embodiment. It is clear from the claims that forms with such modifications or improvements can also be included in the technical scope of the present invention.

特許請求の範囲、明細書、および図面中において示した装置、システム、プログラム、および方法における動作、手順、ステップ、および段階等の各処理の実行順序は、特段「より前に」、「先立って」等と明示しておらず、また、前の処理の出力を後の処理で用いるのでない限り、任意の順序で実現しうることに留意すべきである。特許請求の範囲、明細書、および図面中の動作フローに関して、便宜上「まず、」、「次に、」等を用いて説明したとしても、この順で実施することが必須であることを意味するものではない。 The order of execution of each process, such as operations, procedures, steps, and stages, in the devices, systems, programs, and methods shown in the claims, specifications, and drawings is not specifically stated as "before" or "prior to," and it should be noted that the processes may be performed in any order, unless the output of a previous process is used in a later process. Even if the operational flow in the claims, specifications, and drawings is explained using "first," "next," etc. for convenience, it does not mean that it is necessary to perform the processes in this order.

１システム
１００設備
１０１制御対象
１０２センサ
２００装置
２０１測定値取得部
２０２目標値取得部
２０３偏差取得部
２０４制御パラメータ取得部
２０５第１供給部
２０６制御モデル
２０７制御部
２０８学習処理部
２１１不安定状態検出部
２１２第２供給部
２１３シフト量出力モデル
２１４学習処理部
２２１対象情報取得部
２０６１変更量出力モデル
２０６２加算部
２２００コンピュータ
２２０１ＤＶＤ－ＲＯＭ
２２１０ホストコントローラ
２２１２ＣＰＵ
２２１４ＲＡＭ
２２１６グラフィックコントローラ
２２１８ディスプレイデバイス
２２２０入／出力コントローラ
２２２２通信インタフェース
２２２４ハードディスクドライブ
２２２６ＤＶＤ－ＲＯＭドライブ
２２３０ＲＯＭ
２２４０入／出力チップ
２２４２キーボード REFERENCE SIGNS LIST 1 System 100 Equipment 101 Control target 102 Sensor 200 Device 201 Measurement value acquisition unit 202 Target value acquisition unit 203 Deviation acquisition unit 204 Control parameter acquisition unit 205 First supply unit 206 Control model 207 Control unit 208 Learning processing unit 211 Unstable state detection unit 212 Second supply unit 213 Shift amount output model 214 Learning processing unit 221 Target information acquisition unit 2061 Change amount output model 2062 Addition unit 2200 Computer 2201 DVD-ROM
2210 host controller 2212 CPU
2214 RAM
2216 Graphic controller 2218 Display device 2220 Input/output controller 2222 Communication interface 2224 Hard disk drive 2226 DVD-ROM drive 2230 ROM
2240 Input/Output Chip 2242 Keyboard

Claims

a deviation acquisition unit for acquiring a deviation between a measured value of a state of a control object and a target value;
a control parameter acquisition unit that acquires shifted control parameters by shifting the control parameters supplied to the control target;
a first supply unit that supplies the deviation acquired by the deviation acquisition unit and the shifted control parameter acquired by the control parameter acquisition unit to a control model that outputs a recommended control parameter to be recommended to be supplied to the controlled object in response to an input of the deviation and the control parameter;
an output unit that outputs the recommended control parameters output from the control model in response to the supply from the first supply unit to the control model;
An apparatus comprising:

A target value acquisition unit that acquires a target value of the state,
The device according to claim 1 , wherein the control parameter acquisition unit shifts the control parameters supplied to the controlled object in response to the target value being changed from a reference target value, and acquires the shifted control parameters.

The device according to claim 2, wherein the control parameter acquisition unit, in response to the target value being changed from the reference target value, shifts the control parameters supplied to the control target by a shift amount corresponding to the target value, and acquires the shifted control parameters.

The device according to claim 3, wherein the control parameter acquisition unit determines the shift amount by multiplying the difference between the target value and the reference target value by a preset coefficient.

The device according to claim 3, wherein the control parameter acquisition unit determines the shift amount using a pre-set relational expression that indicates the relationship between a value set for the target value and the value of the control parameter when the measured value stabilizes at that value.

a first detection unit that detects that the deviation acquired by the deviation acquisition unit is not stable within a reference range after the control target is controlled by the recommended control parameters,
The device according to any one of claims 1 to 3, wherein the control parameter acquisition unit shifts the control parameter supplied to the controlled object in response to detection by the first detection unit that the deviation acquired by the deviation acquisition unit is not stable within the reference range, and acquires the shifted control parameter.

a second supply unit that supplies the deviation acquired by the deviation acquisition unit and the change rate of the deviation to a shift amount output model that outputs a recommended shift amount that recommends a shift of the control parameter by the control parameter acquisition unit in response to input of the deviation and the change rate of the deviation,
7. The device according to claim 6, wherein the control parameter acquisition unit shifts the control parameters supplied to the controlled object by the recommended shift amount output from the shift amount output model in response to the first detection unit detecting that the deviation acquired by the deviation acquisition unit is not stable within the reference range and the second supply unit supplying the shift amount to the shift amount output model, thereby acquiring the shifted control parameters.

The device according to claim 7, wherein the second supply unit supplies the shift amount output model for each reference interval.

The device according to claim 7, further comprising a first learning processing unit that performs learning processing of the shift amount output model using learning data including the deviation acquired by the deviation acquisition unit, the rate of change of the deviation, and the shift amount of the control parameter acquired by the control parameter acquisition unit, so as to output the recommended shift amount recommended to increase a reward value determined by a preset reward function in response to input of the deviation and the rate of change of the deviation.

A second detection unit that detects that the control target has been changed,
The device according to claim 1 , wherein the control parameter acquisition unit shifts the control parameters supplied to the control object in response to detection by the second detection unit that the control object has been changed, and acquires the shifted control parameters.

The control model is
a change amount output model that outputs a recommended change amount that recommends a change to the control parameter in response to input of the deviation and the control parameter;
an adder that calculates the recommended control parameter by adding the control parameter supplied to the controlled object and the recommended change amount output from the change amount output model;
The apparatus of claim 1 , further comprising:

The device according to claim 11, further comprising a second learning processing unit that performs learning processing of the change amount output model using learning data including the deviation acquired by the deviation acquisition unit and the control parameters acquired by the control parameter acquisition unit, so as to output the recommended change amount recommended for increasing a reward value determined by a preset reward function in response to input of the deviation and the control parameters.

A deviation acquisition step of acquiring a deviation between a measured value of a state of a control object and a target value;
a control parameter acquisition step of acquiring shifted control parameters by shifting the control parameters supplied to the control object;
a first supply step of supplying the deviation acquired in the deviation acquisition step and the shifted control parameter acquired in the control parameter acquisition step to a control model that outputs a recommended control parameter to be recommended to be supplied to the controlled object in response to an input of the deviation and the control parameter;
an output step of outputting the recommended control parameters output from the control model in response to the supply to the control model by the first supply step;
A method for providing the above.

Computer,
a deviation acquisition unit for acquiring a deviation between a measured value of a state of a control object and a target value;
a control parameter acquisition unit that acquires shifted control parameters by shifting the control parameters supplied to the control target;
a first supply unit that supplies the deviation acquired by the deviation acquisition unit and the shifted control parameter acquired by the control parameter acquisition unit to a control model that outputs a recommended control parameter to be recommended to be supplied to the controlled object in response to an input of the deviation and the control parameter;
an output unit that outputs the recommended control parameters output from the control model in response to the supply from the first supply unit to the control model.