JP2021111265A

JP2021111265A - Control device

Info

Publication number: JP2021111265A
Application number: JP2020004428A
Authority: JP
Inventors: 洋紀成田; Hironori Narita
Original assignee: Hino Motors Ltd
Current assignee: Hino Motors Ltd
Priority date: 2020-01-15
Filing date: 2020-01-15
Publication date: 2021-08-02
Anticipated expiration: 2040-01-15
Also published as: JP7451182B2

Abstract

To optimize an output to be controlled more suitably.SOLUTION: A control device 1 comprises: a first control unit 10 which derives and outputs a control target value based on an input value related to derivation of the control target value; and a second control unit 20 which has a prediction model 21 for deriving a corrected target value which is a correction value of the control target value based on the control target value output by the first control unit 10 and the input value and which outputs the corrected target value.SELECTED DRAWING: Figure 1

Description

本発明は、制御装置に関する。 The present invention relates to a control device.

制御対象の出力を最適化するための手法として、モデル予測制御が知られている（例えば特許文献１参照）。モデル予測制御では、コントローラの内部の予測モデル（制御対象モデル）に基づいて、制御対象の出力が目標値（最適値）に追従するように、制御対象に与える入力値が決定される。具体的には、モデル予測制御では、予測モデルが、制御対象の出力の目標値と現在の制御対象の出力値とに基づき、現在から未来の予測区間において目標値との追従誤差の面積が最小化するように、各時間における制御対象に与える入力値（入力値の時系列）を探索し、時系列の一番目の要素を、実際に制御対象に与える入力値として制御対象に適用する。このような処理が繰り返し実行されて、各時刻において未来の応答を予測しながら制御対象に与える入力値が逐次最適化されることにより、制御対象の出力を適切に目標値に追従させることができる。 Model predictive control is known as a method for optimizing the output of a controlled object (see, for example, Patent Document 1). In model prediction control, the input value given to the control target is determined based on the prediction model (control target model) inside the controller so that the output of the control target follows the target value (optimum value). Specifically, in model prediction control, the prediction model has the smallest area of tracking error with the target value in the current to future prediction interval based on the target value of the output of the controlled object and the output value of the current controlled object. The input value (time series of input values) given to the control target at each time is searched, and the first element of the time series is applied to the control target as the input value actually given to the control target. By repeatedly executing such processing and sequentially optimizing the input value given to the control target while predicting the future response at each time, the output of the control target can be appropriately followed by the target value. ..

特表２０１９−５２２２９０号公報Special Table 2019-522290

ここで、上述したモデル予測制御では、制御対象の出力を目標値に追従させるために、現在から未来の予測区間の入力値の時系列を探索する処理を繰り返し行う必要があるため、制御対象の出力を最適化するために時間を要してしまう。実際には、限られた時間で制御対象の出力を最適化する必要があるため、環境によっては、モデル予測制御のみでは制御対象の出力の最適化を行うことができないおそれがある。 Here, in the model prediction control described above, in order to make the output of the control target follow the target value, it is necessary to repeatedly search the time series of the input values of the prediction interval from the present to the future. It takes time to optimize the output. Actually, since it is necessary to optimize the output of the controlled object in a limited time, there is a possibility that the output of the controlled object cannot be optimized only by the model prediction control depending on the environment.

本発明は上記実情に鑑みてなされたものであり、より好適に制御対象の出力を最適化することを目的とする。 The present invention has been made in view of the above circumstances, and an object of the present invention is to more preferably optimize the output of the controlled object.

本発明の一態様に係る制御装置は、制御目標値の導出に係る入力値に基づき、制御目標値を導出し出力する第１制御部と、第１制御部によって出力された制御目標値と入力値とに基づき制御目標値の修正値である修正目標値を導出する予測モデルを有し、修正目標値を出力する第２制御部と、を備える。 The control device according to one aspect of the present invention has a first control unit that derives and outputs a control target value based on an input value related to the derivation of the control target value, and a control target value and an input output by the first control unit. It has a prediction model for deriving a correction target value which is a correction value of the control target value based on the value, and includes a second control unit for outputting the correction target value.

本発明の一態様に係る制御装置では、第１制御部によって導出された制御目標値が、第２制御部の予測モデルによって修正され、修正目標値が出力される。例えば、予測モデルのみによって制御に係る目標値を導出しようとした場合（制御対象の出力を最適化する場合）においては、繰り返し処理に時間を要してしまい、限られた時間内に制御対象の出力を最適化することができないおそれがある。一方で、例えば比較的短時間で制御対象の出力を最適化するモデル（学習モデル等）のみによって制御に係る目標値を導出しようとした場合においては、高精度なモデルを開発するために多大な労力を要してしまうと考えられる。この点、本発明の一態様に係る制御装置では、初期の目標値が第１制御部によってある程度大まかに設定された場合であっても、予測モデルによる修正によって、目標値設定を効率的に行うことができる。このことで、限られた時間で制御対象の出力を最適化することができ、制御対象の出力を好適に最適化することができる。 In the control device according to one aspect of the present invention, the control target value derived by the first control unit is modified by the prediction model of the second control unit, and the modified target value is output. For example, when trying to derive the target value related to control only by the prediction model (when optimizing the output of the controlled object), it takes time for the iterative process, and the controlled object is controlled within a limited time. The output may not be optimized. On the other hand, for example, when trying to derive the target value related to control only by a model (learning model, etc.) that optimizes the output of the controlled object in a relatively short time, it is a great deal to develop a highly accurate model. It will take a lot of effort. In this respect, in the control device according to one aspect of the present invention, even when the initial target value is roughly set to some extent by the first control unit, the target value is efficiently set by the correction by the prediction model. be able to. As a result, the output of the controlled object can be optimized in a limited time, and the output of the controlled object can be suitably optimized.

第１制御部は、第２制御部によって出力された修正目標値を学習する学習モデルを有し、修正目標値の学習結果を考慮して、入力値に基づき制御目標値を導出し出力してもよい。このように、修正目標値が学習されることにより、次回に同様のシチュエーションが現れた場合における、第１制御部による制御目標値の導出精度を向上させることができる。すなわち、予測モデルによる修正及び学習モデルによる修正した情報の学習が実行されることによって、その後の目標値設定を効率的に行うことができる。このことで、限られた時間で制御対象の出力を最適化することができ、制御対象の出力を好適に最適化することができる。 The first control unit has a learning model for learning the correction target value output by the second control unit, and derives and outputs the control target value based on the input value in consideration of the learning result of the correction target value. May be good. By learning the correction target value in this way, it is possible to improve the accuracy of deriving the control target value by the first control unit when a similar situation appears next time. That is, by executing the correction by the prediction model and the learning of the corrected information by the learning model, the subsequent target value setting can be efficiently performed. As a result, the output of the controlled object can be optimized in a limited time, and the output of the controlled object can be suitably optimized.

学習モデルは、修正目標値と該修正目標値に対応する入力値とを併せて学習してもよい。これにより、次回に同様のシチュエーションが現れた場合における、第１制御部による制御目標値の導出精度をより好適に向上させることができる。 The learning model may learn the correction target value and the input value corresponding to the correction target value together. As a result, it is possible to more preferably improve the accuracy of deriving the control target value by the first control unit when a similar situation appears next time.

本発明によれば、より好適に制御対象の出力を最適化することができる。 According to the present invention, the output to be controlled can be optimized more preferably.

本実施形態に係る制御装置の機能ブロックを示す図である。It is a figure which shows the functional block of the control device which concerns on this embodiment. 修正目標値の学習を説明する図である。It is a figure explaining the learning of the correction target value. 修正目標値の学習を説明する図である。It is a figure explaining the learning of the correction target value. 制御装置が行う処理を示すフローチャートである。It is a flowchart which shows the process performed by a control device.

以下、添付図面を参照しながら本発明の実施形態を詳細に説明する。図面の説明において、同一又は同等の要素には同一符号を用い、重複する説明を省略する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings. In the description of the drawings, the same reference numerals are used for the same or equivalent elements, and duplicate description is omitted.

図１は、本実施形態に係る制御装置１の機能ブロックを示す図である。図１に示される制御装置１は、例えば車両（不図示）における各種制御対象１００を制御する装置である。制御装置１は、例えば制御対象１００の現時刻からある有限区間に渡る未来の振る舞いを予測し、予測した結果を利用して、制御対象１００の挙動が最適化するように、制御対象１００に与える制御入力（後述する修正目標値）を決定する。 FIG. 1 is a diagram showing a functional block of the control device 1 according to the present embodiment. The control device 1 shown in FIG. 1 is, for example, a device that controls various control targets 100 in a vehicle (not shown). For example, the control device 1 predicts the future behavior of the control target 100 over a certain finite interval from the current time, and uses the predicted result to give the control target 100 so that the behavior of the control target 100 is optimized. Determine the control input (correction target value described later).

制御装置１は、例えばエンジン制御に適用される。制御対象１００は、例えばＥＧＲ（ExhaustGas Recirculation）バルブ及びＶＮＴ（Variable Nozzle Turbo）ノズルである。この場合、制御装置１は、例えばエンジン排出ＮＯｘ、煙と加速性（過給圧の上昇率）を最適化するように、制御対象１００であるＥＧＲバルブの開度及びＶＮＴノズルの位置を調整する制御入力（後述する修正目標値）を制御対象１００に入力する。制御装置１は、第１制御部１０と、第２制御部２０とを備えている。 The control device 1 is applied to, for example, engine control. The control target 100 is, for example, an EGR (Exhaust Gas Recirculation) valve and a VNT (Variable Nozzle Turbo) nozzle. In this case, the control device 1 adjusts the opening degree of the EGR valve and the position of the VNT nozzle, which are the control target 100, so as to optimize, for example, engine exhaust NOx, smoke and acceleration (increasing rate of boost pressure). The control input (correction target value described later) is input to the control target 100. The control device 1 includes a first control unit 10 and a second control unit 20.

第１制御部１０は、制御目標値の導出に係る入力値に基づき、制御目標値を導出し出力する。例えばエンジン制御を行う場合、上記入力値は、エンジン回転数、ドライバ要求トルク（又はアクセル開度、燃料噴射量）、インテークマニフォールド圧力、及び吸入空気量等である。また、上記制御目標値は、ＥＧＲバルブの開度及びＶＮＴノズルの位置である。 The first control unit 10 derives and outputs the control target value based on the input value related to the derivation of the control target value. For example, when engine control is performed, the input values are engine speed, driver required torque (or accelerator opening degree, fuel injection amount), intake manifold pressure, intake air amount, and the like. The control target values are the opening degree of the EGR valve and the position of the VNT nozzle.

第１制御部１０は、自身が出力した制御目標値と上記入力値とに基づき第２制御部２０によって出力された修正目標値を学習する学習モデル１１を有している。学習モデル１１は、修正目標値と該修正目標値に対応する入力値とを併せて学習する。学習モデル１１は、例えばＡＩ技術を利用して学習を行う。第１制御部１０は、学習モデル１１における修正目標値の学習結果を考慮して、入力値に基づき制御目標値を導出し出力する。このように、第２制御部２０によって出力された修正目標値を学習する学習モデル１１を利用して制御目標値が出力されることにより、次回に同様のシチュエーションが現れた場合（同様の入力値が制御装置１に入力された場合）における、第１制御部１０による制御目標値の導出精度を向上させることができる。学習モデル１１の学習が進むことにより、制御目標値の導出精度が向上する。 The first control unit 10 has a learning model 11 that learns the correction target value output by the second control unit 20 based on the control target value output by itself and the input value. The learning model 11 learns the correction target value and the input value corresponding to the correction target value together. The learning model 11 performs learning using, for example, AI technology. The first control unit 10 derives and outputs a control target value based on the input value in consideration of the learning result of the modified target value in the learning model 11. In this way, when the control target value is output using the learning model 11 that learns the modified target value output by the second control unit 20, the same situation appears next time (similar input value). Is input to the control device 1), the accuracy of deriving the control target value by the first control unit 10 can be improved. As the learning of the learning model 11 progresses, the accuracy of deriving the control target value is improved.

学習モデル１１を利用することによる効果について、図２及び図３を参照して説明する。図２及び図３は、修正目標値の学習を説明する図である。図２は、十分に学習が行われていない場合の制御目標値（破線）及び修正目標値（一点鎖線）を示している。図３は、ある程度学習が行われた場合の制御目標値（破線）及び修正目標値（一点鎖線）を示している。図２に示されるように、十分に学習が行われていない状態においては、特に初期の段階（時間が経過していない段階）において、制御目標値と修正目標値との乖離が大きい。一方で、ある程度学習が行われた後において、同様のシチュエーションが現れた場合には、図３に示されるように、制御目標値が前回学習された修正目標値に近づくように導出されるため、制御目標値と修正目標値との乖離が小さくなっている。このように、学習モデル１１によって修正目標値が学習されることにより、学習済みのシチュエーションと同様のシチュエーションが現れた場合に、より早く、最適な制御目標値を導出することができる。 The effect of using the learning model 11 will be described with reference to FIGS. 2 and 3. 2 and 3 are diagrams for explaining the learning of the correction target value. FIG. 2 shows a control target value (broken line) and a modified target value (dashed line) when learning is not sufficiently performed. FIG. 3 shows a control target value (broken line) and a modified target value (dashed line) when learning is performed to some extent. As shown in FIG. 2, in a state where sufficient learning is not performed, the discrepancy between the control target value and the correction target value is large, especially in the initial stage (the stage where time has not passed). On the other hand, if a similar situation appears after some learning has been performed, the control target value is derived so as to approach the previously learned correction target value, as shown in FIG. The discrepancy between the control target value and the correction target value is small. By learning the modified target value by the learning model 11 in this way, when a situation similar to the learned situation appears, the optimum control target value can be derived more quickly.

第２制御部２０は、第１制御部１０によって出力された制御目標値と上記入力値とに基づき、制御目標値の修正値である修正目標値を導出する予測モデル２１を有している。第２制御部２０は、予測モデル２１を用いて導出した修正目標値を出力する。予測モデル２１は、現時点での入力値を用いて将来の制御対象１００の挙動のシミュレーションを行い、シミュレーション結果を評価して、該評価が良くなるように、制御目標値を修正しながらシミュレーションを有限回数繰り返し、最終的な修正目標値を導出するモデルである。制御装置１では、タイムリーな制御を行う必要があるため、シミュレーションは有限回数しか行えない。有限回数のシミュレーションによって精度の高い修正目標値を導出するためには、第１制御部１０から第２制御部２０に入力される制御目標値の精度をある程度高くすることが重要となる。この点、上述したように、第１制御部１０が、学習モデル１１によって修正目標値を都度学習することにより、第１制御部１０から第２制御部２０に入力される制御目標値の精度を高めることができ、ひいては、第２制御部２０から制御対象１００に入力される修正目標値の精度を高めることができる。 The second control unit 20 has a prediction model 21 that derives a correction target value, which is a correction value of the control target value, based on the control target value output by the first control unit 10 and the input value. The second control unit 20 outputs the correction target value derived using the prediction model 21. The prediction model 21 simulates the behavior of the future controlled object 100 using the input value at the present time, evaluates the simulation result, and finitely modifies the control target value so that the evaluation is improved. It is a model that repeats the number of times and derives the final correction target value. Since the control device 1 needs to perform timely control, the simulation can be performed only a finite number of times. In order to derive a highly accurate correction target value by a finite number of simulations, it is important to increase the accuracy of the control target value input from the first control unit 10 to the second control unit 20 to some extent. In this regard, as described above, the first control unit 10 learns the correction target value each time by the learning model 11 to obtain the accuracy of the control target value input from the first control unit 10 to the second control unit 20. By extension, the accuracy of the correction target value input from the second control unit 20 to the control target 100 can be improved.

なお、制御装置１は、エンジン制御以外にも適用され、例えば自動運転制御に適用される。自動運転制御に適用される場合、制御対象１００は例えばステアリング、アクセル、ブレーキである。第１制御部１０及び第２制御部２０に入力される入力値は、例えば運行ルート情報（ナビゲーション等から取得される情報）、現在値、道路の基本情報（制限速度、道路幅、車線数等）、障害物の情報（同一車線の前後を走る車両の位置及び相対速度、対向車線の車両の位置及び相対速度、自転車の位置及び相対速度、歩行者の位置及び相対速度等）である。そして、制御目標値及び修正目標値は、車速及び進行方向等の決定に係る、ステアリング、アクセル、ブレーキ等の制御値である。 The control device 1 is applied to other than engine control, and is applied to, for example, automatic driving control. When applied to automatic driving control, the controlled object 100 is, for example, steering, accelerator, and brake. The input values input to the first control unit 10 and the second control unit 20 are, for example, operation route information (information acquired from navigation, etc.), current value, basic road information (speed limit, road width, number of lanes, etc.). ), Obstacle information (position and relative speed of vehicles running in front of and behind the same lane, position and relative speed of vehicles in the oncoming lane, position and relative speed of bicycles, position and relative speed of pedestrians, etc.). The control target value and the correction target value are control values of steering, accelerator, brake, etc. related to determination of vehicle speed, traveling direction, and the like.

次に、図４を参照して、制御装置１が行う処理について説明する。図４は、制御装置１が行う処理を示すフローチャートである。ここでは、制御装置１をエンジン制御に適用する例（制御対象１００がＥＧＲ（Exhaust Gas Recirculation）バルブ及びＶＮＴ（Variable NozzleTurbo）ノズルである例）を説明する。 Next, the process performed by the control device 1 will be described with reference to FIG. FIG. 4 is a flowchart showing the processing performed by the control device 1. Here, an example in which the control device 1 is applied to engine control (an example in which the control target 100 is an EGR (Exhaust Gas Recirculation) valve and a VNT (Variable Nozzle Turbo) nozzle) will be described.

図４に示されるように、制御装置１は、最初に入力値を取得する（ステップＳ１）。具体的には、制御装置１は、入力値として、例えば、エンジン回転数、ドライバ要求トルク（又はアクセル開度、燃料噴射量）、インテークマニフォールド圧力、及び吸入空気量等を取得する。 As shown in FIG. 4, the control device 1 first acquires the input value (step S1). Specifically, the control device 1 acquires, for example, engine speed, driver required torque (or accelerator opening degree, fuel injection amount), intake manifold pressure, intake air amount, and the like as input values.

つづいて、制御装置１の第１制御部１０は、入力値に基づき制御目標値を導出し出力する（ステップＳ２）。制御装置１は、制御目標値として、例えば、ＥＧＲバルブの開度及びＶＮＴノズルの位置を出力する。 Subsequently, the first control unit 10 of the control device 1 derives and outputs a control target value based on the input value (step S2). The control device 1 outputs, for example, the opening degree of the EGR valve and the position of the VNT nozzle as control target values.

つづいて、制御装置１の第２制御部２０は、予測モデル２１を用いて、制御目標値と入力値とに基づき制御目標値の修正値である修正目標値を導出し出力する（ステップＳ３）。修正目標値は、制御対象１００であるＥＧＲバルブ及びＶＮＴノズルに入力されると共に、第１制御部１０の学習モデル１１に入力される。 Subsequently, the second control unit 20 of the control device 1 derives and outputs a correction target value which is a correction value of the control target value based on the control target value and the input value by using the prediction model 21 (step S3). .. The correction target value is input to the EGR valve and the VNT nozzle which are the control target 100, and is also input to the learning model 11 of the first control unit 10.

つづいて、第１制御部１０の学習モデル１１は、入力された修正目標値と該修正目標値に対応する入力値とを併せて学習する（ステップＳ４）。これにより、同様のシチュエーションが現れた場合（同様の入力値が入力された場合）において、より適切に制御目標値を導出することができる。最後に、制御を終了するか否かが判定され（ステップＳ５）、制御を終了しない場合にはステップＳ１の処理から再度実行され、制御を終了する場合には処理が終了する。 Subsequently, the learning model 11 of the first control unit 10 learns the input correction target value and the input value corresponding to the correction target value together (step S4). As a result, when a similar situation appears (when a similar input value is input), the control target value can be derived more appropriately. Finally, it is determined whether or not to end the control (step S5). If the control is not terminated, the process is re-executed from the process of step S1, and if the control is terminated, the process is terminated.

次に、本実施形態に係る制御装置１の作用効果について説明する。 Next, the operation and effect of the control device 1 according to the present embodiment will be described.

本実施形態に係る制御装置１は、制御目標値の導出に係る入力値に基づき、制御目標値を導出し出力する第１制御部１０と、第１制御部１０によって出力された制御目標値と入力値とに基づき制御目標値の修正値である修正目標値を導出する予測モデル２１を有し、修正目標値を出力する第２制御部２０と、を備える。 The control device 1 according to the present embodiment has a first control unit 10 that derives and outputs a control target value based on an input value related to the derivation of the control target value, and a control target value output by the first control unit 10. It has a prediction model 21 for deriving a correction target value which is a correction value of the control target value based on an input value, and includes a second control unit 20 for outputting the correction target value.

このような制御装置１では、第１制御部１０によって導出された制御目標値が、第２制御部２０の予測モデル２１によって修正され、修正目標値が出力される。例えば、予測モデル２１のみによって制御に係る目標値を導出しようとした場合（制御対象の出力を最適化する場合）においては、繰り返し処理に時間を要してしまい、限られた時間内に制御対象１００の出力を最適化することができないおそれがある。一方で、例えば比較的短時間で制御対象１００の出力を最適化するモデル（学習モデル１１等）のみによって制御に係る目標値を導出しようとした場合においては、高精度なモデルを開発するために多大な労力を要してしまうと考えられる。この点、本実施形態に係る制御装置１では、初期の目標値が第１制御部１０によってある程度大まかに設定された場合であっても、予測モデル２１による修正によって、目標値設定を効率的に行うことができる。このことで、限られた時間で制御対象１００の出力を最適化することができ、制御対象１００の出力を好適に最適化することができる。以上のように、本実施形態に係る制御装置１によれば、モデル予測制御とＡＩを利用した制御の両方の欠点を補い、効率的に最適な制御を行うことができる。 In such a control device 1, the control target value derived by the first control unit 10 is corrected by the prediction model 21 of the second control unit 20, and the corrected target value is output. For example, when trying to derive the target value related to control only by the prediction model 21 (when optimizing the output of the controlled object), it takes time for the iterative process, and the controlled object is controlled within a limited time. There is a risk that the output of 100 cannot be optimized. On the other hand, for example, when trying to derive a target value related to control only by a model (learning model 11 or the like) that optimizes the output of the controlled object 100 in a relatively short time, in order to develop a highly accurate model. It is thought that a great deal of effort will be required. In this respect, in the control device 1 according to the present embodiment, even when the initial target value is roughly set by the first control unit 10, the target value can be set efficiently by the modification by the prediction model 21. It can be carried out. As a result, the output of the control target 100 can be optimized in a limited time, and the output of the control target 100 can be preferably optimized. As described above, according to the control device 1 according to the present embodiment, it is possible to compensate for the drawbacks of both the model prediction control and the control using AI, and efficiently perform the optimum control.

第１制御部１０は、第２制御部２０によって出力された修正目標値を学習する学習モデル１１を有し、修正目標値の学習結果を考慮して、入力値に基づき制御目標値を導出し出力してもよい。このように、修正目標値が学習されることにより、次回に同様のシチュエーションが現れた場合における、第１制御部１０による制御目標値の導出精度を向上させることができる。すなわち、予測モデル２１による修正及び学習モデル１１による修正した情報の学習が実行されることによって、その後の目標値設定を効率的に行うことができる。このことで、限られた時間で制御対象１００の出力を最適化することができ、制御対象１００の出力を好適に最適化することができる。 The first control unit 10 has a learning model 11 for learning the correction target value output by the second control unit 20, and derives a control target value based on the input value in consideration of the learning result of the correction target value. You may output it. By learning the correction target value in this way, it is possible to improve the accuracy of deriving the control target value by the first control unit 10 when a similar situation appears next time. That is, by executing the correction by the prediction model 21 and the learning of the corrected information by the learning model 11, the subsequent target value setting can be efficiently performed. As a result, the output of the control target 100 can be optimized in a limited time, and the output of the control target 100 can be suitably optimized.

学習モデル１１は、修正目標値と該修正目標値に対応する入力値とを併せて学習してもよい。これにより、次回に同様のシチュエーションが現れた場合における、第１制御部１０による制御目標値の導出精度をより好適に向上させることができる。 The learning model 11 may learn the correction target value and the input value corresponding to the correction target value together. As a result, the accuracy of deriving the control target value by the first control unit 10 when a similar situation appears next time can be more preferably improved.

１…制御装置、１０…第１制御部、１１…学習モデル、２０…第２制御部、２１…予測モデル、１００…制御対象。 1 ... Control device, 10 ... First control unit, 11 ... Learning model, 20 ... Second control unit, 21 ... Prediction model, 100 ... Control target.

Claims

A first control unit that derives and outputs the control target value based on the input value related to the derivation of the control target value, and
A second unit having a prediction model for deriving a correction target value which is a correction value of the control target value based on the control target value output by the first control unit and the input value, and outputting the correction target value. A control device including a control unit.

The first control unit has a learning model for learning the correction target value output by the second control unit, and in consideration of the learning result of the correction target value, the control target value is based on the input value. The control device according to claim 1, wherein the control device is derived and output.

The control device according to claim 2, wherein the learning model learns the correction target value and the input value corresponding to the correction target value together.