JP2019125021A

JP2019125021A - Information processing device, information processing method, computer program, control device of internal combustion engine

Info

Publication number: JP2019125021A
Application number: JP2018003459A
Authority: JP
Inventors: 竜大森安; Tatsuhiro Moriyasu; 松栄上田; Matsue Ueda; 真永岡; Makoto Nagaoka; 池田　太郎; Taro Ikeda; 太郎池田; 神保　智彦; Tomohiko Jinbo; 智彦神保; 俊洋中村; Toshihiro Nakamura; 松永　彰生; Akio Matsunaga; 彰生松永
Original assignee: Toyota Motor Corp; Toyota Central R&D Labs Inc
Current assignee: Toyota Motor Corp; Toyota Central R&D Labs Inc
Priority date: 2018-01-12
Filing date: 2018-01-12
Publication date: 2019-07-25
Anticipated expiration: 2038-01-12
Also published as: JP6933585B2

Abstract

To reduce a processing load, shorten processing time, and improve efficiency of initial condition generation in control of an internal combustion engine using model prediction control.SOLUTION: An information processing device generates time-series signals of a prediction value of a state of a control target part of an internal combustion engine by applying preliminarily prepared time-series signals of an operation amount of an actuator to a model expression modeling a change of the state of the control target part depending on a change of the operation amount of the actuator; stores the operation amount of the actuator and the generated state of the control target part as an initial condition; obtains an optimum operation amount of the actuator in the initial condition by repeating evaluation while changing an operation amount which is input for an objective function using the initial condition and the estimated state using the model expression; stores the obtained optimum operation amount in association with the initial condition; and obtains an approximate expression which shows a relation between the initial condition and the optimum operation amount.SELECTED DRAWING: Figure 3

Description

本発明は、情報処理装置に関する。 The present invention relates to an information processing apparatus.

各分野において、モデル予測制御（ＭＰＣ：Model Predictive Control）を利用した制御手法が利用されている。モデル予測制御とは、なんらかの制御パラメータについて、各時刻において未来の応答を予測しながら最適化を行う（最適解を見つける）制御手法である。例えば、特許文献１には、モデル予測制御を利用した内燃機関の制御に関し、内燃機関の制御要素（例えば、ターボチャージャ、排気再循環）のモデルを用いた反復計算によって、有限区間内で制御要素の動作を最適化することが記載されている。例えば、特許文献２には、モデル予測制御を利用して、車両制御を行うためのテーブルや、車両の制御変数を算出する近似式を作成することが記載されている。 In each field, a control method using model predictive control (MPC) is used. Model predictive control is a control method of optimizing (finding an optimal solution) while predicting future response at each time with respect to some control parameter. For example, Patent Document 1 relates to control of an internal combustion engine using model predictive control, and it is a control element within a finite interval by iterative calculation using a model of a control element of the internal combustion engine (eg, turbocharger, exhaust gas recirculation). It has been described to optimize the operation of. For example, Patent Document 2 describes that a table for performing vehicle control and an approximate expression for calculating a control variable of the vehicle are created using model predictive control.

特開２０１１−２５３５３６号公報JP, 2011-253536, A 特許第５８７０６３２号公報Patent No. 5870632

特許文献１では、各時刻においてリアルタイムに、最適解を見つけるための反復計算を行っている。この反復計算には膨大な量の演算が必要であるため、処理負荷が高く、処理に時間を要し、リアルタイム処理が不可能な場合も生じるという課題があった。特許文献２では、予め反復計算によって得られたテーブルあるいは近似式を作成しているため、リアルタイムな反復計算を必要としない。しかし、例えば内燃機関の制御のように、制御パラメータの最適解を導出するために、多くの要素（例えば、数十個）が関連する初期条件を持つ複雑なシステムを対象とする場合、極めて多次元のテーブルあるいは近似式が必要となる。このように多くの要素を含む初期条件について、網羅的な試行によって多次元のテーブルあるいは近似式を作成することは、組み合わせ数の爆発的増加のため困難であるが、特許文献２では、このような課題について何ら記載されていない。 In Patent Document 1, an iterative calculation for finding an optimal solution is performed in real time at each time. Since a large amount of computation is required for this iterative calculation, the processing load is high, processing takes time, and there is a problem that real-time processing may not be possible. In Patent Document 2, since a table or an approximate expression obtained in advance by iterative calculation is created, real-time iterative calculation is not required. However, for complex systems in which many elements (for example, dozens of elements) have associated initial conditions in order to derive an optimal solution of control parameters, such as control of an internal combustion engine, for example, A table or approximation of dimensions is required. It is difficult to create multi-dimensional tables or approximate expressions by exhaustive trials for initial conditions that include such many elements, but it is difficult in Patent Document 2 to No mention is made of

本発明は、上述した課題を解決するためになされたものであり、モデル予測制御を利用した内燃機関の制御において、処理負荷の低減と処理時間の短縮とを図るとともに、初期条件生成の効率化を図ることを目的とする。 The present invention has been made to solve the above-described problems, and in the control of an internal combustion engine using model predictive control, the processing load is reduced and the processing time is shortened, and the efficiency of initial condition generation is improved. The aim is to

本発明は、上述の課題の少なくとも一部を解決するためになされたものであり、以下の形態として実現することが可能である。 The present invention has been made to solve at least a part of the above-mentioned problems, and can be realized as the following modes.

（１）本発明の一形態によれば、情報処理装置が提供される。この情報処理装置は、モデル式記憶部と、初期条件記憶部と、最適操作量記憶部と、初期条件生成部と、予測処理部と、学習処理部とを備える。モデル式記憶部は、内燃機関のアクチュエータの操作量の変化に応じた、内燃機関の制御対象部の状態の変化をモデル化したモデル式を予め記憶する。初期条件記憶部は、前記アクチュエータの操作量の時系列信号と、前記制御対象部の状態の時系列信号と、を少なくとも含む初期条件を記憶する。最適操作量記憶部は、前記初期条件記憶部内の各時刻における前記初期条件に対して、当該初期条件における前記アクチュエータの最適な操作量である最適操作量を対応付けて記憶する。初期条件生成部は、予め作成された前記アクチュエータの操作量の時系列信号を、前記モデル式記憶部内の前記モデル式に適用することで、前記制御対象部の状態の予測値の時系列信号を生成し、前記初期条件として前記初期条件記憶部に記憶させる。予測処理部は、前記初期条件記憶部内の前記初期条件と、前記モデル式記憶部内の前記モデル式を用いて推定された前記状態と、を用いた目的関数について、入力する前記操作量を変化させつつ評価を繰り返すことによって前記最適操作量を求め、前記最適操作量記憶部に記憶させる。学習処理部は、前記最適操作量記憶部内の前記初期条件と前記最適操作量との関係を表す近似式を求める。 (1) According to one aspect of the present invention, an information processing apparatus is provided. The information processing apparatus includes a model expression storage unit, an initial condition storage unit, an optimal operation amount storage unit, an initial condition generation unit, a prediction processing unit, and a learning processing unit. The model formula storage unit stores in advance a model formula that models a change in the state of the control target portion of the internal combustion engine according to the change in the operation amount of the actuator of the internal combustion engine. The initial condition storage unit stores an initial condition including at least a time-series signal of an operation amount of the actuator and a time-series signal of a state of the control target unit. The optimal operation amount storage unit stores the initial operation condition at each time in the initial condition storage unit in association with an optimal operation amount which is an optimal operation amount of the actuator under the initial condition. The initial condition generation unit applies a time series signal of the operation amount of the actuator created in advance to the model equation in the model equation storage unit to obtain a time series signal of the predicted value of the state of the control target unit. It is generated and stored in the initial condition storage unit as the initial condition. The prediction processing unit changes the operation amount to be input for an objective function using the initial condition in the initial condition storage unit and the state estimated using the model expression in the model expression storage unit. While repeating the evaluation, the optimal operation amount is determined and stored in the optimal operation amount storage unit. The learning processing unit obtains an approximate expression representing the relationship between the initial condition and the optimal operation amount in the optimal operation amount storage unit.

この構成によれば、予測制御部は、モデル予測制御を利用して予め、各初期条件に応じた内燃機関のアクチュエータの最適操作量を求めて、最適操作量記憶部に記憶させておく。そして、学習処理部は予め、最適操作量記憶部内の初期条件と最適操作量との関係を表す近似式を求めておくことができる。このため、本構成によれば、実際の内燃機関の制御では、リアルタイムな反復計算を必要とせず、学習処理部によって求められた近似式を利用することで、初期条件内の各要素に対応する各実際値（または推定値）に応じた内燃機関のアクチュエータの最適操作量を素早く求めることができ、処理負荷の低減と処理時間の短縮とを図ることができる。また、初期条件生成部は、初期条件を生成する際に、内燃機関のアクチュエータの操作量の変化に応じた、内燃機関の制御対象部の状態の変化をモデル化したモデル式を使用する。このため、初期条件を生成する際に、各要素を網羅的に組み合わせた試行を行う場合と比較して、演算量を減らすことができ、初期条件生成の効率化を図ることができる。 According to this configuration, the prediction control unit obtains in advance the optimal operation amount of the actuator of the internal combustion engine according to each initial condition using model predictive control, and stores it in the optimal operation amount storage unit. Then, the learning processing unit can obtain in advance an approximate expression that represents the relationship between the initial condition in the optimum operation amount storage unit and the optimum operation amount. For this reason, according to the present configuration, control of an actual internal combustion engine does not require real-time repetitive calculation, and by using the approximate expression obtained by the learning processing unit, each element in the initial condition is accommodated. The optimum operation amount of the actuator of the internal combustion engine according to each actual value (or estimated value) can be obtained quickly, and the processing load can be reduced and the processing time can be shortened. Further, when generating the initial condition, the initial condition generation unit uses a model expression that models a change in the state of the control target unit of the internal combustion engine according to the change in the operation amount of the actuator of the internal combustion engine. For this reason, when generating the initial condition, the amount of calculation can be reduced as compared with the case where a trial in which each element is comprehensively combined is performed, and efficient generation of the initial condition can be achieved.

（２）上記形態の情報処理装置において、前記学習処理部は、前記初期条件と前記最適操作量とを教師データとしたニューラルネットワークの教師あり学習によって前記近似式を求めてもよい。ニューラルネットワークは複雑な関数近似を行う事ができるため、数多くの要素を初期条件として含み得る内燃機関の制御に適している。この構成によれば、学習処理部は、このようなニューラルネットワークを用いて近似式を求めるため、近似式の精度を向上させることができる。 (2) In the information processing apparatus of the above aspect, the learning processing unit may obtain the approximate expression by supervised learning of a neural network in which the initial condition and the optimal operation amount are teacher data. Since neural networks can perform complex function approximation, they are suitable for control of an internal combustion engine that can include many elements as initial conditions. According to this configuration, since the learning processing unit obtains the approximate expression using such a neural network, the accuracy of the approximate expression can be improved.

（３）上記形態の情報処理装置では、さらに、実験計画法を用いて、前記アクチュエータの操作量の前記時系列信号を生成する実験計画処理部を備えてもよい。この構成によれば、実験計画処理部は、実験計画法を用いてアクチュエータの操作量の時系列信号を生成するため、組み合わせとして物理的に無理のない時系列信号を生成できる。 (3) The information processing apparatus according to the above aspect may further include an experimental plan processing unit that generates the time-series signal of the operation amount of the actuator using an experimental design method. According to this configuration, since the experimental design processing unit generates a time series signal of the operation amount of the actuator using the experimental design method, it is possible to generate a physically reasonable time series signal as a combination.

（４）上記形態の情報処理装置において、前記初期条件には、さらに、前記内燃機関に対する外乱と、前記制御対象部の出力と、前記制御対象部の出力の目標値と、のうちの少なくとも一部が含まれ、前記初期条件に前記外乱が含まれる場合、前記モデル式では、前記操作量及び前記外乱の変化に応じた前記状態の変化がモデル化され、前記初期条件に前記出力が含まれる場合、前記モデル式では、前記操作量の変化に応じた前記状態及び前記出力の変化がモデル化されていてもよい。この構成によれば、初期条件として、内燃機関に対する外乱、制御対象部の出力、制御対象部の出力の目標値等の様々な要素を考慮することができる。 (4) In the information processing apparatus of the above aspect, the initial condition further includes at least one of a disturbance to the internal combustion engine, an output of the control target unit, and a target value of an output of the control target unit. If the initial condition includes the disturbance, the model equation models changes in the state according to changes in the manipulated variable and the disturbance, and the output includes the initial condition. In this case, in the model equation, changes in the state and the output according to changes in the manipulated variable may be modeled. According to this configuration, various factors such as disturbance to the internal combustion engine, the output of the control target unit, and the target value of the output of the control target unit can be considered as the initial condition.

（５）上記形態の情報処理装置において、前記モデル式として、線形状態方程式及び非線形状態方程式を使用してもよい。この構成によれば、モデル式として、線形状態方程式及び非線形状態方程式を利用できる。 (5) In the information processing apparatus of the above aspect, a linear state equation and a non-linear state equation may be used as the model equation. According to this configuration, linear equation of state and nonlinear equation of state can be used as model equations.

（６）上記形態の情報処理装置において、前記モデル式として、ＮＡＲＸモデルを用いて構成された非線形方程式を使用してもよい。この構成によれば、モデル式として、ＮＡＲＸモデルを用いて構成された非線形方程式を利用するため、精度の高い予測結果を得ることができる。 (6) In the information processing apparatus of the above aspect, a non-linear equation configured using an NARX model may be used as the model equation. According to this configuration, since a non-linear equation configured using the NARX model is used as a model equation, a highly accurate prediction result can be obtained.

（７）上記形態の情報処理装置において、前記実験計画法として、ステップ関数やランプ関数の組み合わせで表現される信号を生成する第１の方法と、周波数が時間に依存して変化するチャープ信号で表現される信号を生成する第２の方法と、のいずれかを使用してもよい。この構成によれば、実験計画法として、ステップ関数やランプ関数の組み合わせで表現される信号を生成する第１の方法（例えば、ＡＰＲＢＳ法）と、周波数が時間に依存して変化するチャープ信号で表現される信号を生成する第２の方法（例えば、ＳｉｎｕｓｏｉｄａｌＥｘｃｉｔａｔｉｏｎ法）と、のいずれかを利用できる。 (7) In the information processing apparatus of the above embodiment, as the experimental design method, a first method of generating a signal represented by a combination of a step function and a ramp function, and a chirp signal whose frequency changes depending on time. Any of the second methods of generating the signal to be represented may be used. According to this configuration, the first method (for example, the APRBS method) of generating a signal represented by a combination of a step function and a ramp function as an experimental design method, and a chirp signal whose frequency changes depending on time Either of the second method (e.g., Sinusoidal Excitation method) of generating a signal to be represented can be used.

（８）本発明の一形態によれば、内燃機関のアクチュエータの操作量の変化に応じた、内燃機関の制御対象部の状態の変化をモデル化したモデル式を利用した情報処理方法が提供される。この情報処理方法では、予め作成された前記アクチュエータの操作量の時系列信号を、前記モデル式に適用することで、前記制御対象部の状態の予測値の時系列信号を生成する工程と、前記アクチュエータの操作量の時系列信号と、生成された前記制御対象部の状態の時系列信号と、を初期条件として記憶させる工程と、前記初期条件と、前記モデル式を用いて推定された前記状態と、を用いた目的関数について、入力する前記操作量を変化させつつ評価を繰り返すことによって、当該初期条件における前記アクチュエータの最適な操作量である最適操作量を求める工程と、前記初期条件に対して、求めた前記最適操作量を対応付けて記憶させる工程と、前記初期条件と前記最適操作量との関係を表す近似式を求める工程と、を備える。この方法によれば、モデル予測制御を利用した内燃機関の制御において、処理負荷の低減と処理時間の短縮とを図るとともに、初期条件生成の効率化を図ることができる。 (8) According to one aspect of the present invention, there is provided an information processing method using a model expression that models a change in a state of a control target portion of an internal combustion engine according to a change in an operation amount of an actuator of the internal combustion engine Ru. In this information processing method, a time-series signal of a predicted value of the state of the control target unit is generated by applying a time-series signal of the operation amount of the actuator created in advance to the model equation, and Storing the time-series signal of the operation amount of the actuator and the generated time-series signal of the state of the control target unit as an initial condition, the initial condition, and the state estimated using the model equation And the step of obtaining an optimal operation amount, which is an optimum operation amount of the actuator under the initial condition, by repeating the evaluation while changing the input operation amount with respect to the objective function using and. And the step of correlating and storing the determined optimal operation amount, and the step of determining an approximate expression representing the relationship between the initial condition and the optimal operation amount. According to this method, in the control of the internal combustion engine using model predictive control, it is possible to reduce the processing load and shorten the processing time, and to improve the efficiency of the initial condition generation.

（９）本発明の一形態によれば、内燃機関のアクチュエータの操作量の変化に応じた、内燃機関の制御対象部の状態の変化をモデル化したモデル式を利用したコンピュータプログラムが提供される。このコンピュータプログラムでは、予め作成された前記アクチュエータの操作量の時系列信号を、前記モデル式に適用することで、前記制御対象部の状態の予測値の時系列信号を生成する機能と、前記アクチュエータの操作量の時系列信号と、生成された前記制御対象部の状態の時系列信号と、を初期条件として記憶させる機能と、前記初期条件と、前記モデル式を用いて推定された前記状態と、を用いた目的関数について、入力する前記操作量を変化させつつ評価を繰り返すことによって、当該初期条件における前記アクチュエータの最適な操作量である最適操作量を求める機能と、前記初期条件に対して、求めた前記最適操作量を対応付けて記憶させる機能と、前記初期条件と前記最適操作量との関係を表す近似式を求める機能と、を備える。このコンピュータプログラムによれば、モデル予測制御を利用した内燃機関の制御において、処理負荷の低減と処理時間の短縮とを図るとともに、初期条件生成の効率化を図ることができる。 (9) According to an aspect of the present invention, there is provided a computer program using a model expression that models a change in the state of a control target portion of an internal combustion engine according to a change in the operation amount of an actuator of the internal combustion engine. . In this computer program, a time-series signal of the predicted value of the state of the control target portion is generated by applying a time-series signal of the operation amount of the actuator created in advance to the model equation, and the actuator A function of storing, as an initial condition, a time-series signal of an operation amount of the control amount and a generated time-series signal of the state of the control target unit, the initial condition, and the state estimated using the model equation For the objective function using, the function of determining the optimum manipulated value, which is the optimum manipulated value of the actuator under the initial condition, by repeating the evaluation while changing the inputted manipulated variable, and the initial condition And a function of correlating and storing the determined optimal operation amount, and a function of determining an approximate expression representing a relationship between the initial condition and the optimal operation amount. That. According to this computer program, in control of an internal combustion engine using model predictive control, processing load can be reduced and processing time can be shortened, and efficient generation of initial conditions can be achieved.

（１０）本発明の一形態によれば、アクチュエータと、制御対象部とを備える内燃機関の制御装置が提供される。この内燃機関の制御装置では、前記アクチュエータの操作量の時系列信号と、前記制御対象部の状態の時系列信号と、を少なくとも含む初期条件について、前記初期条件と各時刻における前記初期条件に対して求められた前記アクチュエータの最適な操作量である最適操作量との関係を表す近似式を記憶する記憶部と、実際の前記アクチュエータの操作量と、前記制御対象部の状態とを取得する情報取得部と、取得された前記操作量及び前記状態と、前記記憶部内の前記近似式とを用いて、実際の前記操作量と前記状態とに対応した前記最適操作量を求め、前記最適操作量に従って前記アクチュエータを動作させる制御部と、を備える。この構成によれば、制御部は、情報取得部によって取得された操作量及び状態（各実際値または推定値）と、記憶部内の近似式とを用いて、各実際値に応じた内燃機関のアクチュエータの最適操作量を素早く求めることができ、処理負荷の低減と処理時間の短縮とを図ることができる。 (10) According to one aspect of the present invention, there is provided a control device of an internal combustion engine including an actuator and a control target portion. In this control device for an internal combustion engine, with respect to initial conditions including at least a time-series signal of an operation amount of the actuator and a time-series signal of a state of the control target portion, the initial condition and the initial condition at each time are described. Storage unit for storing an approximate expression representing a relationship with the optimum operation amount which is the optimum operation amount of the actuator, information for acquiring the actual operation amount of the actuator, and the state of the control target portion The optimum operation amount corresponding to the actual operation amount and the state is determined using the acquisition unit, the acquired operation amount and the state, and the approximate expression in the storage unit, and the optimum operation amount is obtained. And a controller configured to operate the actuator according to. According to this configuration, the control unit uses the operation amount and the state (each actual value or estimated value) acquired by the information acquisition unit, and the approximate expression in the storage unit to calculate the internal combustion engine according to each actual value. The optimum operation amount of the actuator can be obtained quickly, and the processing load can be reduced and the processing time can be shortened.

なお、本発明は、種々の態様で実現することが可能であり、例えば、モデル予測制御を利用した内燃機関の制御のための近似式を求める情報処理装置、情報処理方法、情報処理システム、コンピュータプログラム、モデル予測制御を利用した内燃機関の制御装置、制御方法、制御システム、コンピュータプログラム、内燃機関の制御装置の作成装置、作成方法、作成システム、コンピュータプログラム、これら各コンピュータプログラムを配布するためのサーバ装置、そのコンピュータプログラムを記憶した一時的でない記憶媒体等の形態で実現することができる。 The present invention can be realized in various aspects. For example, an information processing apparatus, an information processing method, an information processing system, and a computer for obtaining an approximate expression for control of an internal combustion engine using model predictive control Program, control device for internal combustion engine using model predictive control, control method, control system, computer program, production device for control device for internal combustion engine, production method, production system, computer program, for distributing these respective computer programs The present invention can be realized in the form of a server apparatus, a non-temporary storage medium storing the computer program, or the like.

本発明の一実施形態としての情報処理装置のブロック図である。It is a block diagram of an information processor as one embodiment of the present invention. 予測学習処理について説明する図である。It is a figure explaining a prediction learning process. 予測学習処理における処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the process in a prediction learning process. 予測学習処理における処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the process in a prediction learning process. 予測学習処理の各ステップについて説明する図である。It is a figure explaining each step of a prediction learning process. 本発明の一実施形態としての内燃機関の制御装置のブロック図である。It is a block diagram of a control device of an internal combustion engine as one embodiment of the present invention. 内燃機関制御における処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the process in internal combustion engine control. 内燃機関制御による動作の一例を示す。An example of operation by internal-combustion engine control is shown. 内燃機関制御に要した演算時間の一例を示す。An example of the calculation time required for internal combustion engine control is shown.

＜情報処理装置＞
図１は、本発明の一実施形態としての情報処理装置１のブロック図である。情報処理装置１は、モデル予測制御（ＭＰＣ：Model Predictive Control）を利用して、内燃機関の制御に使用するための近似式を求める装置である。本実施形態では、内燃機関の制御に使用する要素として、以下のａ１〜ａ５の５つを例示する。要素ａ１〜ａ５はモデル予測制御の初期条件として使用されるため、要素ａ１〜ａ５を総称して「初期条件」とも呼ぶ。本実施形態の情報処理装置１は、要素ａ１〜ａ５からなる初期条件と、その初期条件に対応する内燃機関のアクチュエータの最適な操作量（最適操作量）と、の関係を表す近似式を求める装置である。情報処理装置１によって求められた近似式は、内燃機関の制御装置に搭載されて内燃機関の制御に使用される。詳細は後述する。
（ａ１）内燃機関のアクチュエータの操作量ｕ
（ａ２）内燃機関に対する外乱ｗ
（ａ３）内燃機関の制御対象部の状態ｘ
（ａ４）内燃機関の制御対象部の出力ｙ
（ａ５）内燃機関の制御対象部の出力の目標値ｒ <Information processing device>
FIG. 1 is a block diagram of an information processing apparatus 1 according to an embodiment of the present invention. The information processing apparatus 1 is an apparatus for obtaining an approximate expression to be used for control of an internal combustion engine by using model predictive control (MPC: Model Predictive Control). In the present embodiment, the following five elements a1 to a5 are exemplified as elements used for control of an internal combustion engine. Since the elements a1 to a5 are used as an initial condition of model predictive control, the elements a1 to a5 are collectively referred to as "initial conditions". The information processing apparatus 1 of the present embodiment finds an approximate expression representing a relationship between an initial condition consisting of the elements a1 to a5 and an optimum operation amount (optimum operation amount) of the actuator of the internal combustion engine corresponding to the initial condition. It is an apparatus. The approximate expression obtained by the information processing device 1 is mounted on a control device for an internal combustion engine and used for control of the internal combustion engine. Details will be described later.
(A1) Operating amount u of actuator of internal combustion engine
(A2) Disturbance w for internal combustion engine
(A3) State x of the control target portion of the internal combustion engine
(A4) Output y of control target of internal combustion engine
(A5) Target value r of output of control target portion of internal combustion engine

（ａ１）操作量ｕは、内燃機関において操作することが可能な１つまたは複数のアクチュエータの動作状況を表す物理量である。例えば、スロットル開度、排気再循環（ＥＧＲ：Exhaust Gas Recirculation）システムにおけるＥＧＲバルブ開度等が操作量ｕに相当する。（ａ２）外乱ｗは、内燃機関の出力に影響を及ぼす１つまたは複数の物理量であり、操作量ｕは除く。例えば、エンジン回転数、外気温度、外気圧力等が外乱ｗに相当する。（ａ３）状態ｘは、内燃機関に含まれる１つまたは複数の制御対象部の状態を表す物理量である。例えば、ＥＧＲシステムにおける排気温度や排気流量等が状態ｘに相当する。 (A1) The manipulated variable u is a physical quantity that represents the operating condition of one or more actuators that can be operated in the internal combustion engine. For example, the throttle opening degree, the EGR valve opening degree in an exhaust gas recirculation (EGR: Exhaust Gas Recirculation) system, or the like corresponds to the operation amount u. (A2) The disturbance w is one or more physical quantities that affect the output of the internal combustion engine, excluding the manipulated variable u. For example, the engine speed, the outside air temperature, the outside air pressure, etc. correspond to the disturbance w. (A3) The state x is a physical quantity that represents the state of one or more control target parts included in the internal combustion engine. For example, the exhaust temperature, the exhaust flow rate, and the like in the EGR system correspond to the state x.

（ａ４）出力ｙは、内燃機関に含まれる１つまたは複数の制御対象部の出力を表す物理量である。例えば、ＥＧＲシステムにおけるＥＧＲ率、過給機における過給圧等が出力ｙに相当する。（ａ５）目標値ｒは、内燃機関に含まれる１つまたは複数の制御対象部の出力（すなわち要素ａ４）の目標値である。なお、上述した５つの各要素ａ１〜ａ５は、それぞれ、複数の項目を含み得る。例えば、外乱ｗとしてエンジン回転数と外気温度と外気圧力との３項目を含んでもよい。また、上述した５つの各要素ａ１〜ａ５において挙げた項目はあくまで例示であり、種々の項目を採用できる。 (A4) The output y is a physical quantity that represents the output of one or more control target units included in the internal combustion engine. For example, the EGR rate in the EGR system, the supercharging pressure in the turbocharger, etc. correspond to the output y. (A5) The target value r is a target value of the output (that is, the element a4) of one or more control target parts included in the internal combustion engine. Each of the five elements a1 to a5 described above may include a plurality of items. For example, three items of the engine speed, the outside air temperature, and the outside air pressure may be included as the disturbance w. In addition, the items listed in the five elements a1 to a5 described above are merely examples, and various items can be adopted.

情報処理装置１は、記憶部１００と、情報処理部２００と、図示しないＲＯＭ、ＲＡＭ及び通信部を備え、各部は図示しないバスにより相互に接続されている。記憶部１００は、ハードディスク、フラッシュメモリ、メモリカードなどで構成される。記憶部１００には、モデル式記憶部１１０と、初期条件記憶部１２０と、最適操作量記憶部１３０と、近似式記憶部１４０とが含まれている。 The information processing apparatus 1 includes a storage unit 100, an information processing unit 200, a ROM, a RAM, and a communication unit (not shown), and the units are mutually connected by a bus (not shown). The storage unit 100 includes a hard disk, a flash memory, a memory card, and the like. The storage unit 100 includes a model expression storage unit 110, an initial condition storage unit 120, an optimal operation amount storage unit 130, and an approximate expression storage unit 140.

モデル式記憶部１１０には、内燃機関のアクチュエータの操作量ｕ、及び、内燃機関に対する外乱ｗの変化に応じた、内燃機関の制御対象部の状態ｘ、及び、出力ｙの変化をモデル化したモデル式が予め記憶されている。モデル式は、予めの実験により求められ、操作量ｕ、外乱ｗ、状態ｘ、出力ｙの現在及び過去の情報が含まれる。どの程度過去の情報が含まれるかはモデル式中の時間サンプル数に依存する。モデル式としては、例えば、線形状態方程式、非線形状態方程式、ＮＡＲＸ（Nonlinear Auto-Regressive eXogenous）モデルを用いて構成された非線形方程式を使用できる。 The model storage unit 110 models changes in the state x of the control target portion of the internal combustion engine and the output y according to the change in the operation amount u of the actuator of the internal combustion engine and the disturbance w to the internal combustion engine. Model expressions are stored in advance. The model equation is obtained in advance by experiments, and includes current and past information of the manipulated variable u, the disturbance w, the state x, and the output y. How much past information is included depends on the number of time samples in the model equation. As a model equation, for example, a nonlinear equation constructed using a linear equation of state, a nonlinear equation of state, or a nonlinear auto-regressive eXogenous (NARX) model can be used.

線形状態方程式を使用した連続時間システムのモデル式は、操作量ｕ、外乱ｗ、状態ｘ、出力ｙの各変数に対して、例えば次のように表せる。なお、ｔは時間、Ａ，Ｂ，Ｃ，Ｄ，Ｅ，Ｆは適当な大きさの定数行列である。
・状態方程式：ｄｘ／ｄｔ＝Ａｘ＋Ｂｕ＋Ｅｗ・・・（１）
・出力方程式：ｙ＝Ｃｘ＋Ｄｕ＋Ｆｗ・・・（２） A model equation of a continuous time system using a linear equation of state can be expressed, for example, as follows for each variable of the manipulated variable u, the disturbance w, the state x, and the output y. Here, t is time, and A, B, C, D, E and F are constant matrices of appropriate sizes.
-Equation of state: dx / dt = Ax + Bu + Ew (1)
Output equation: y = Cx + Du + Fw (2)

線形状態方程式を使用した離散時間システムのモデル式は、上述の各変数と、離散時間ｋとを用いて、例えば次のように表せる。
・状態方程式：ｘ［ｋ＋１］＝Ａｘ［ｋ］＋Ｂｕ［ｋ］＋Ｅｗ［ｋ］・・・（３）
・出力方程式：ｙ［ｋ］＝Ｃｘ［ｋ］＋Ｄｕ［ｋ］＋Ｆｗ［ｋ］・・・（４） A model equation of a discrete time system using a linear equation of state can be expressed, for example, as follows, using each of the above-mentioned variables and a discrete time k.
-Equation of state: x [k + 1] = Ax [k] + Bu [k] + Ew [k] (3)
Output equation: y [k] = Cx [k] + Du [k] + Fw [k] (4)

非線形状態方程式を使用した連続時間システムのモデル式は、上述の各変数を用いて、例えば次のように表せる。なお、ｆ，ｇはそれぞれｘ，ｙと同次元の出力を与える非線形ベクトル関数である。このため、非線形状態方程式では、線形状態方程式を包含していると言える。
・状態方程式：ｄｘ／ｄｔ＝ｆ（ｘ，ｕ，ｗ）・・・（５）
・出力方程式：ｙ＝ｇ（ｘ，ｕ，ｗ）・・・（６） A model equation of a continuous time system using a nonlinear equation of state can be expressed, for example, as follows, using each of the variables described above. Here, f and g are non-linear vector functions giving an output of the same dimension as x and y, respectively. For this reason, it can be said that the nonlinear equation of state includes a linear equation of state.
-Equation of state: dx / dt = f (x, u, w) (5)
Output equation: y = g (x, u, w) (6)

非線形状態方程式を使用した離散時間システムのモデル式は、上述の各変数と、離散時間ｋとを用いて、例えば次のように表せる。
・状態方程式：ｘ［ｋ＋１］＝ｆ（ｘ［ｋ］，ｕ［ｋ］，ｗ［ｋ］）・・・（７）
・出力方程式：ｙ［ｋ］＝ｇ（ｘ［ｋ］，ｕ［ｋ］，ｗ［ｋ］）・・・（８） A model equation of a discrete time system using a nonlinear equation of state can be expressed, for example, as follows, using each of the above-described variables and a discrete time k.
State equation: x [k + 1] = f (x [k], u [k], w [k]) (7)
Output equation: y [k] = g (x [k], u [k], w [k]) (8)

ＮＡＲＸモデルを用いて構成された非線形方程式のモデル式は、上述の各変数と、離散時間ｋとを用いて、例えば次のように表せる。なお、ｇはｙと同次元の出力を与える非線形ベクトル関数であり、明示的に過去の時間を複数サンプルする点が上述の式８と異なる。
・ｙ［ｋ＋１］＝ｇ（ｙ［ｋ］，ｙ［ｋ−１］，・・・，ｙ［ｋ＋１−ｎ_y］，ｕ［ｋ］，ｕ［ｋ−１］，・・・，ｕ［ｋ＋１−ｎ_u］）・・・（９） The model equation of the nonlinear equation constructed using the NARX model can be expressed, for example, as follows, using the above-mentioned variables and the discrete time k. Here, g is a non-linear vector function giving an output of the same dimension as y, and is different from the above-mentioned equation 8 in that plural past times are explicitly sampled.
Y [k + 1] = g (y [k], y [k-1], ..., y [k + 1- _ny ], u [k], u [k-1], ..., u [ k + 1−n _u ]) (9)

このようにすれば、モデル式として、線形状態方程式、非線形状態方程式、及びＮＡＲＸモデルを利用できる。 In this way, linear equation of state, nonlinear equation of state, and NARX model can be used as model equations.

図２は、予測学習処理について説明する図である。図２（Ａ）は、初期条件記憶部１２０に記憶されている初期条件の一例を示す。図２（Ｂ）は、予測学習処理の各反復サイクルにおける、各要素の変化の一例を示す。図１の初期条件記憶部１２０には、操作量記憶部１２１と、外乱記憶部１２２と、状態記憶部１２３と、出力記憶部１２４と、目標値記憶部１２５とが含まれている。 FIG. 2 is a diagram for explaining prediction learning processing. FIG. 2A shows an example of the initial conditions stored in the initial condition storage unit 120. FIG. 2 (B) shows an example of change of each element in each iterative cycle of the predictive learning process. The initial condition storage unit 120 of FIG. 1 includes an operation amount storage unit 121, a disturbance storage unit 122, a state storage unit 123, an output storage unit 124, and a target value storage unit 125.

操作量記憶部１２１には、後述する予測学習処理のステップＳ１１０によって、内燃機関のアクチュエータの操作量ｕの複数時刻分の物理量の変化（換言すれば、継時的な物理量の変化）を表す時系列信号が記憶される。図２（Ａ）では、操作量ｕ１及び操作量ｕ２として、操作量ｕに属する２つの項目の時系列信号を例示している。同様に、外乱記憶部１２２には、後述する予測学習処理のステップＳ１１０によって、内燃機関に対する外乱ｗの複数時刻分の物理量の変化を表す時系列信号が記憶される。図２（Ａ）では、外乱ｗ１として、外乱ｗに属する１つの項目の時系列信号を例示している。 When the operation amount storage unit 121 represents a change in physical quantity (in other words, a change in physical quantity over time) for a plurality of times of the operation quantity u of the actuator of the internal combustion engine by step S110 of prediction learning processing described later. Sequence signals are stored. In FIG. 2A, time series signals of two items belonging to the operation amount u are illustrated as the operation amount u1 and the operation amount u2. Similarly, in the disturbance storage unit 122, a time-series signal representing a change in physical quantity for a plurality of times of the disturbance w with respect to the internal combustion engine is stored in step S110 of prediction learning processing described later. In FIG. 2A, a time-series signal of one item belonging to the disturbance w is illustrated as the disturbance w1.

状態記憶部１２３には、後述する予測学習処理のステップＳ１３０によって、内燃機関の制御対象部の状態ｘの複数時刻分の物理量の変化を表す時系列信号が記憶される。図２（Ａ）では、状態ｘ１として、状態ｘに属する１つの項目の時系列信号を例示している。同様に、出力記憶部１２４には、後述する予測学習処理のステップＳ１３０によって、内燃機関の制御対象部の出力ｙの複数時刻分の物理量の変化を表す時系列信号が記憶される。図２（Ａ）では、出力ｙ１として、出力ｙに属する１つの項目の時系列信号を例示している。 The state storage unit 123 stores time-series signals representing changes in physical quantities for a plurality of times of the state x of the control target portion of the internal combustion engine in step S130 of the prediction learning process described later. In FIG. 2A, a time-series signal of one item belonging to the state x is illustrated as the state x1. Similarly, in the output storage unit 124, a time-series signal representing a change in physical quantity for a plurality of times of the output y of the control target portion of the internal combustion engine is stored in step S130 of prediction learning processing described later. In FIG. 2A, a time-series signal of one item belonging to the output y is illustrated as the output y1.

目標値記憶部１２５には、後述する予測学習処理のステップＳ１１０によって、内燃機関の制御対象部の出力の目標値ｒの複数時刻分の物理量の変化を表す時系列信号が記憶される。なお、図２（Ａ）では、目標値ｒの時系列信号の例示は省略している。 The target value storage unit 125 stores time-series signals representing changes in physical quantities for a plurality of times of the target value r of the output of the control target portion of the internal combustion engine in step S110 of prediction learning processing described later. In FIG. 2A, the illustration of the time-series signal of the target value r is omitted.

最適操作量記憶部１３０には、後述する予測学習処理のステップＳ２００によって、初期条件記憶部１２０内の各時刻における初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）に対して、当該初期条件における内燃機関のアクチュエータの最適な操作量（最適操作量）が対応付けて記憶される。近似式記憶部１４０には、後述する予測学習処理のステップＳ３００によって、最適操作量記憶部１３０内の初期条件と最適操作量とから得られた近似式が記憶される。 In the optimum operation amount storage unit 130, the initial conditions (operation amount u, disturbance w, state x, output y, target value r) at each time in the initial condition storage unit 120 are obtained in step S200 of prediction learning processing described later. On the other hand, the optimum operation amount (optimum operation amount) of the actuator of the internal combustion engine under the initial condition is stored in association with it. The approximate expression storage unit 140 stores the approximate expression obtained from the initial condition and the optimal operation amount in the optimal operation amount storage unit 130 in step S300 of the prediction learning process described later.

情報処理部２００は、ＲＯＭに格納されているコンピュータプログラムをＲＡＭに展開して実行することにより、情報処理装置１の各部を制御する。そのほか情報処理部２００は、実験計画処理部２１０、初期条件生成部２２０、予測処理部２３０、学習処理部２４０として機能し、協働して後述する予測学習処理を実行する。実験計画処理部２１０は、実験計画法を用いて、初期条件のうち、操作量ｕ、外乱ｗ、目標値ｒの時系列信号を生成し、初期条件記憶部１２０に記憶させる。初期条件生成部２２０は、初期条件のうち、状態ｘ、出力ｙの時系列信号を生成し、初期条件記憶部１２０に記憶させる。 The information processing unit 200 controls each unit of the information processing apparatus 1 by expanding a computer program stored in the ROM in the RAM and executing it. In addition, the information processing unit 200 functions as an experimental plan processing unit 210, an initial condition generation unit 220, a prediction processing unit 230, and a learning processing unit 240, and cooperates to execute prediction learning processing described later. The experimental design processing unit 210 generates a time series signal of the operation amount u, the disturbance w, and the target value r among the initial conditions using the experimental design method, and stores the time series signal in the initial condition storage unit 120. The initial condition generation unit 220 generates time-series signals of the state x and the output y among the initial conditions and causes the initial condition storage unit 120 to store the time-series signals.

予測処理部２３０は、初期条件記憶部１２０内の初期条件を用いてモデル予測制御によって、各時刻における初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）に対応する最適操作量を求め、最適操作量記憶部１３０に記憶させる。学習処理部２４０は、最適操作量記憶部１３０内の初期条件と最適操作量とを教師データとしたニューラルネットワーク（ＮＮ：Neural Network）の教師あり学習によって近似式を求め、近似式記憶部１４０に記憶させる。 The prediction processing unit 230 uses the initial conditions in the initial condition storage unit 120 to perform model predictive control to optimize the initial conditions (operation amount u, disturbance w, state x, output y, target value r) at each time. The operation amount is obtained and stored in the optimum operation amount storage unit 130. The learning processing unit 240 obtains an approximate expression by supervised learning of a neural network (NN: Neural Network) in which the initial conditions and the optimum operation amount in the optimum operation amount storage unit 130 are supervised data, and Remember.

図３及び図４は、予測学習処理における処理の手順を示すフローチャートである。予測学習処理は、初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）を生成すると共に、生成された初期条件と内燃機関のアクチュエータの最適な操作量（最適操作量）との関係を表す近似式を求める処理である。予測学習処理は、情報処理装置１において任意のタイミングで実行される。 FIG.3 and FIG.4 is a flowchart which shows the procedure of the process in a prediction learning process. The predictive learning process generates initial conditions (operation amount u, disturbance w, state x, output y, target value r), and the generated initial conditions and optimum operation amount of the actuator of the internal combustion engine (optimum operation amount) It is the process which calculates | requires the approximate expression showing the relationship with and. The prediction learning process is performed in the information processing device 1 at an arbitrary timing.

図５は、予測学習処理の各ステップについて説明する図である。図５では、初期条件記憶部１２０に時系列信号として記憶されている初期条件の各要素について模式的に表している。縦軸には初期条件の各要素の名称を表し、横軸には時系列信号における各時刻を表している。通常、表の中には該当時刻における該当要素の物理量が表示されるが、図５では説明の便宜上、物理量の表示を省略して、説明のための文言を記載している。さらに、図３〜図５では説明の便宜上、初期条件の各要素が含み得る複数の項目について区別しない。例えば、操作量ｕが２つの項目、操作量ｕ１及び操作量ｕ２を含む場合、操作量ｕ１及び操作量ｕ２に対する処理は、以降説明する「操作量ｕ」に対する処理と同じ処理を適用すればよい。 FIG. 5 is a diagram for explaining each step of the predictive learning process. In FIG. 5, each element of the initial conditions stored as a time-series signal in the initial condition storage unit 120 is schematically represented. The vertical axis represents the name of each element of the initial condition, and the horizontal axis represents each time in the time-series signal. Usually, the physical quantities of the corresponding element at the corresponding time are displayed in the table, but in FIG. 5 the wording for the explanation is described by omitting the display of the physical quantities for the convenience of explanation. Furthermore, for convenience of explanation, FIGS. 3 to 5 do not distinguish among a plurality of items that each element of the initial condition may contain. For example, when the operation amount u includes two items, the operation amount u1 and the operation amount u2, the processing for the operation amount u1 and the operation amount u2 may be the same processing as the processing for “operation amount u” described below .

ステップＳ１００では、多様な初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）の生成を実行する。具体的には、ステップＳ１１０において実験計画処理部２１０は、実験計画法を用いて、操作量ｕ、外乱ｗ、目標値ｒの時系列信号を生成し、操作量記憶部１２１、外乱記憶部１２２、目標値記憶部１２５にそれぞれ記憶させる。実験計画処理部２１０は、実験計画法として例えば、以下の方法ｂ１、ｂ２のいずれかを用いることができる。 In step S100, generation of various initial conditions (operation amount u, disturbance w, state x, output y, target value r) is executed. Specifically, in step S110, the experimental design processing unit 210 generates a time series signal of the operation amount u, the disturbance w, and the target value r using the experimental design method, and the operation amount storage unit 121 and the disturbance storage unit 122. , And the target value storage unit 125 respectively. The experimental design processing unit 210 can use, for example, any of the following methods b1 and b2 as the experimental design method.

（ｂ１）ステップ関数やランプ関数の組み合わせで表現される信号を生成する第１の方法：第１の方法としては、例えば、ＡＰＲＢＳ（Amplitude modulated Pseudo Random Binary Sequences）法を利用できる。ＡＰＲＢＳ法では、信号のレベルを連続的に扱い、多様な組合せを効率的に生成するラテン超方格計画やＤ最適計画などを用いて、疑似ランダム的に連続信号の組み合わせを生成する。ＡＰＲＢＳ法では、信号の１区間の長さや、区間の移り変わり時の信号変化速度なども計画の対象に含めることができるため、信号の値の組合せに加えて、信号変化速度の組合せについても多様性を確保できる。すなわち、ＡＰＲＢＳ法では、過渡変化を含めた実験計画が可能である。 (B1) A first method of generating a signal represented by a combination of a step function and a ramp function: As a first method, for example, APRBS (Amplitude modulated Pseudo Random Binary Sequences) method can be used. In the APRBS method, a continuous signal combination is generated pseudorandomly using a Latin hypersquare design or a D-optimal design which handles the signal levels continuously and efficiently generates various combinations. In the APRBS method, the length of one section of the signal and the signal change rate at the transition of the section can also be included in the plan, so in addition to the combination of the signal values, the combination of the signal change rates is also diverse. Can be secured. That is, in the APRBS method, an experimental design including transient change is possible.

（ｂ２）周波数が時間に依存して変化するチャープ信号で表現される信号を生成する第２の方法：第２の方法としては、例えば、ＳｉｎｕｓｏｉｄａｌＥｘｃｉｔａｔｉｏｎ法を利用できる。ＳｉｎｕｓｏｉｄａｌＥｘｃｉｔａｔｉｏｎ法では、時間に依存して周波数が変化する正弦波信号を用いて連続信号の組み合わせを生成する。ＡＰＲＢＳ法により生成された信号よりも、信号レベルの時間変化率が多様である。 (B2) A second method of generating a signal represented by a chirp signal whose frequency changes depending on time: As a second method, for example, the Sinusoidal Excitation method can be used. In the Sinusoidal Excitation method, a combination of continuous signals is generated using sinusoidal signals whose frequency changes with time. The time change rate of the signal level is more varied than the signal generated by the APRBS method.

ステップＳ１２０において、初期条件生成部２２０の予測計算部２２１は、モデル式記憶部１１０に記憶されているモデル式と、操作量記憶部１２１に記憶されている操作量ｕの時系列信号と、外乱記憶部１２２に記憶されている外乱ｗの時系列信号と、をそれぞれ読み出す。ステップＳ１３０において、初期条件生成部２２０の予測計算部２２１は、読み出した操作量ｕと外乱ｗの時系列信号をモデル式に適用（印加）することで、状態ｘと出力ｙとの予測値の時系列信号を生成し、状態記憶部１２３と出力記憶部１２４とにそれぞれ記憶させる。 In step S120, the prediction calculation unit 221 of the initial condition generation unit 220 calculates the model expression stored in the model expression storage unit 110, the time series signal of the operation amount u stored in the operation amount storage unit 121, and the disturbance. The time-series signals of the disturbance w stored in the storage unit 122 are read out. In step S130, the prediction calculation unit 221 of the initial condition generation unit 220 applies (applies) the read time series signals of the operation amount u and the disturbance w to the model equation to obtain predicted values of the state x and the output y. A time-series signal is generated and stored in the state storage unit 123 and the output storage unit 124, respectively.

このように、予測学習処理のステップＳ１００では、モデル式を用いることで、物理的に発生する可能性の少ない初期条件の生成を回避すると共に、多様な初期条件を効率よく生成することができる。上述の通り、初期条件は、操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒの５つの要素から構成されるが、それら５要素の組み合わせを生成する単純な方法は、５要素のすべてに上下限を設定し、上下限の範囲内で５要素を網羅的に組み合わせることである。しかし、網羅的な組み合わせにより生成された初期条件には、物理的に発生する可能性が極めて低い初期条件が含まれる。これは、状態ｘや出力ｙの挙動は、操作量ｕや外乱ｗに依存する物理的な因果関係に支配されていることに起因する。この点、予測学習処理のステップＳ１００では、この因果関係を再現したモデル式を、ステップＳ２００以降のモデル予測制御だけでなく、初期条件の生成の段階から使用することによって、組み合わせとして物理的に無理のない状態ｘ及び出力ｙの時系列信号を生成できる。これにより、実際に起こり得る初期条件のみを効率的に生成することが可能となり、例えば内燃機関の制御のように、制御パラメータの最適解を導出するために、多くの要素（例えば、数十個）が関連する初期条件を持つ複雑なシステムにおいても、組み合わせ数の爆発的増加を招くことなく、初期条件を生成することが可能となる。 As described above, in step S100 of the predictive learning process, generation of initial conditions that are unlikely to occur physically can be avoided and various initial conditions can be efficiently generated by using a model expression. As described above, the initial condition consists of five elements of the manipulated variable u, the disturbance w, the state x, the output y, and the target value r, but a simple method of generating a combination of those five elements is five elements. The upper and lower limits are set for all, and five elements are comprehensively combined within the upper and lower limits. However, the initial conditions generated by the exhaustive combination include initial conditions that are extremely unlikely to occur physically. This is because the behavior of the state x and the output y is governed by a physical causality that depends on the operation amount u and the disturbance w. In this regard, in step S100 of the prediction learning process, it is physically impossible as a combination by using the model expression reproducing this causal relationship not only from model predictive control in step S200 onwards, but also from the stage of generation of initial conditions. It is possible to generate time-series signals of state x and output y without. As a result, it is possible to efficiently generate only the initial conditions that may actually occur. For example, as in the case of control of an internal combustion engine, many elements (eg, dozens of elements are required to derive an optimal solution of control parameters). Even in a complex system with an initial condition associated with), the initial condition can be generated without causing an explosive increase in the number of combinations.

ステップＳ２００では、ステップＳ１００で生成された多様な初期条件に対応する最適操作量を求める。具体的には、ステップＳ２１０において、予測処理部２３０の予測計算部２３１は、モデル式記憶部１１０に記憶されているモデル式を読み出す。 In step S200, the optimum operation amount corresponding to the various initial conditions generated in step S100 is obtained. Specifically, in step S210, the prediction calculation unit 231 of the prediction processing unit 230 reads out the model expression stored in the model expression storage unit 110.

ステップＳ２２０において、予測処理部２３０の予測計算部２３１は、初期条件記憶部１２０に記憶されている初期条件の時系列信号中の１時刻を起点とし、モデル予測制御による将来予測のために必要な時刻分の情報を読み出す。例えば、図５（Ａ）において、時刻ｔ０を起点とした場合、予測計算部２３１は、予測のために必要となる現在及び過去の時刻分の情報、具体的には、現在時刻ｔ０の初期条件（外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）と、過去時刻ｔ−１の初期条件（操作量ｕ）とを読み出す。ここで、操作量ｕのみ過去時刻ｔ−１の初期条件を読み出すのは、現在時刻ｔ０の操作量ｕが、後のステップにおける予測対象となるためである。なお、予測計算部２３１は、外乱ｗ等についても、現在の初期条件に加えて、過去の数時刻分における初期条件の読み出しを行ってもよい。どれだけ過去に遡って情報の読み出しを行うかは、予測計算部２３１が使用するモデル式に依存する。 In step S220, the prediction calculation unit 231 of the prediction processing unit 230 uses one time in the time-series signal of the initial condition stored in the initial condition storage unit 120 as a starting point, and is necessary for future prediction by model prediction control. Read information for time. For example, in FIG. 5A, when time t0 is the starting point, the prediction calculation unit 231 determines information on current and past times necessary for prediction, specifically, an initial condition of the current time t0. (Disturbance w, state x, output y, target value r) and an initial condition (operation amount u) of past time t-1 are read out. Here, the reason for reading out the initial condition of the past time t-1 only for the operation amount u is because the operation amount u at the current time t0 is to be predicted in the later step. In addition to the current initial conditions, the prediction calculation unit 231 may also read the initial conditions at several times in the past for the disturbance w and the like. How far back in time the information is read out depends on the model expression used by the prediction calculation unit 231.

ステップＳ２３０では、ステップＳ２２０で読み出した初期条件に対応する最適操作量を求める。具体的には、ステップＳ２３１において、予測処理部２３０の予測計算部２３１は、現在時刻から所定の将来時刻までの有限区間内における操作量ｕ、外乱ｗ、目標値ｒの時系列を決定する。この有限区間は、モデル予測制御における「予測ホライズン」に相当する。例えば、図５（Ｂ）に示すように予測計算部２３１は、現在時刻ｔ０から所定の将来時刻ｔ５までの有限区間内における操作量ｕ、外乱ｗ、目標値ｒを決定する。図５（Ｂ）の例では、予測計算部２３１は、操作量ｕには予め定められたデフォルト値（図５：Ｄ）を設定し、外乱ｗ及び目標値ｒの将来時刻ｔ１〜ｔ５には、ステップＳ２２０で読み出した現在時刻ｔ０の物理量（図５：現在）を設定している。なお、操作量ｕのデフォルト値の決定に際して、予測計算部２３１は、ステップＳ２２０で読み出された過去時刻ｔ−１の操作量ｕを考慮してもよい。 In step S230, the optimum operation amount corresponding to the initial condition read out in step S220 is determined. Specifically, in step S231, the prediction calculation unit 231 of the prediction processing unit 230 determines a time series of the operation amount u, the disturbance w, and the target value r in a finite interval from the current time to a predetermined future time. This finite interval corresponds to "prediction horizon" in model predictive control. For example, as shown in FIG. 5B, the prediction calculation unit 231 determines the operation amount u, the disturbance w, and the target value r in a finite interval from the current time t0 to a predetermined future time t5. In the example of FIG. 5 (B), the prediction calculation unit 231 sets a predetermined default value (FIG. 5: D) as the operation amount u, and sets future values t1 to t5 of the disturbance w and the target value r. The physical quantity (FIG. 5: present) of the current time t0 read in step S220 is set. When determining the default value of the operation amount u, the prediction calculation unit 231 may consider the operation amount u of the past time t-1 read in step S220.

ステップＳ２３２において、予測処理部２３０の予測計算部２３１は、モデル式を用いて、将来の有限区間内での状態ｘ、出力ｙを予測する。具体的には、予測計算部２３１は、現在時刻の条件をステップＳ２２０で読み出した初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）とし、ステップＳ２１０で読み出したモデル式に対して、ステップＳ２３１で決定された有限区間内の操作量ｕ、外乱ｗの時系列信号を適用（印加）することで、有限区間内の状態ｘと出力ｙとの予測値の時系列信号を生成する。例えば、図５（Ｃ）に示すように予測計算部２３１は、所定の将来時刻ｔ１からｔ５までの有限区間内における状態ｘ、出力ｙを予測する（図５：予測）。 In step S232, the prediction calculation unit 231 of the prediction processing unit 230 predicts the state x and the output y in a future finite interval using a model expression. Specifically, the prediction calculation unit 231 sets the condition of the current time as the initial condition (operation amount u, disturbance w, state x, output y, target value r) read in step S220, and the model formula read in step S210 On the other hand, by applying (applying) the time-series signal of the manipulated variable u and the disturbance w within the finite interval determined in step S231, the time-series signal of the predicted value of the state x and the output y within the finite interval Generate For example, as shown in FIG. 5C, the prediction calculation unit 231 predicts the state x and the output y in a finite interval from predetermined future times t1 to t5 (FIG. 5: prediction).

ステップＳ２３３において、予測処理部２３０の評価部２３２は、ステップＳ２３１で決定された有限区間内の操作量ｕ、外乱ｗ、目標値ｒの時系列信号と、ステップＳ２３２で予測された有限区間内の状態ｘ、出力ｙの時系列信号とを目的関数に入力して、目的関数を評価する。この目的関数は、制御性能を定量的に評価するための所定の式であり、記憶部１００内に記憶されている。目的関数としては、例えば、有限区間内における出力（出力ｙ）と目標（目標値ｒ）の差の二乗和を利用できる。 In step S233, the evaluation unit 232 of the prediction processing unit 230 determines the time series signal of the operation amount u, the disturbance w, and the target value r in the finite interval determined in step S231, and the finite interval predicted in step S232. The objective function is evaluated by inputting the time-series signal of the state x and the output y to the objective function. The objective function is a predetermined expression for quantitatively evaluating the control performance, and is stored in the storage unit 100. As the objective function, for example, the square sum of the difference between the output (output y) and the target (target value r) in a finite interval can be used.

ステップＳ２３４において、予測処理部２３０の反復処理部２３３は、目的関数の値が収束したか否かを判定する。収束した場合（ステップＳ２３４：ＹＥＳ）、反復処理部２３３は、処理をステップＳ２３６へ遷移させる。 In step S234, the iterative processing unit 233 of the prediction processing unit 230 determines whether the value of the objective function has converged. When it converges (step S234: YES), the iterative processing unit 233 causes the process to transition to step S236.

一方、収束していない場合（ステップＳ２３４：ＮＯ）、反復処理部２３３は、目的関数値をより良くするように有限区間内における操作量ｕの時系列信号を修正し、処理をステップＳ２３２へ遷移させ、予測と評価を繰り返す。例えば、図５（Ｄ）に示すように反復処理部２３３は、現在時刻ｔ０から所定の将来時刻ｔ５までの有限区間内における操作量ｕを修正し、その後、予測計算部２３１は、図５（Ｅ）に示すように、修正した操作量ｕに基づく状態ｘ、出力ｙを予測し、評価部２３２は、これらを用いた目的関数を評価する。例えば、図２（Ｂ）に示すように、有限区間（予測ホライズンＨＰ）内の操作量ｕ１及び操作量ｕ２が反復サイクルＣ１、Ｃ２、Ｃ３と修正されていくにつれて、対応する有限区間（予測ホライズンＨＰ）内の状態ｘ１及び出力ｙ１についても、サイクルＣ１、Ｃ２、Ｃ３に示すように変化していく。目的関数の値が収束した３回目のサイクルＣ３では、出力ｙ１の時系列信号は、目標値ｒの時系列信号にほぼ一致していることがわかる。なお、反復処理部２３３は、勾配法、シューティング法、Ｃ／ＧＭＲＥＳ法といった既知の手法を利用してもよい。 On the other hand, if convergence has not occurred (step S234: NO), the iterative processing unit 233 corrects the time series signal of the manipulated variable u in the finite interval so as to improve the objective function value, and shifts the processing to step S232. Let predictions and evaluations repeat. For example, as shown in FIG. 5D, the iterative processing unit 233 corrects the operation amount u in the finite interval from the current time t0 to a predetermined future time t5, and then the prediction calculation unit 231 performs the process shown in FIG. As shown in E), the state x based on the corrected operation amount u and the output y are predicted, and the evaluation unit 232 evaluates an objective function using these. For example, as shown in FIG. 2B, as the manipulated variable u1 and the manipulated variable u2 within the finite interval (prediction horizon HP) are corrected to the repetitive cycles C1, C2, and C3, corresponding finite intervals (prediction horizon (prediction horizon) The state x1 and the output y1 in HP) also change as shown in cycles C1, C2, and C3. In the third cycle C3 in which the value of the objective function converges, it can be seen that the time-series signal of the output y1 substantially matches the time-series signal of the target value r. The iterative processing unit 233 may use known methods such as a gradient method, a shooting method, and a C / GMRES method.

ステップＳ２３６において、予測処理部２３０の反復処理部２３３は、有限区間内における操作量ｕの時系列信号から、起点とした１時刻分の操作量ｕを「最適操作量」として取り出す。そして、反復処理部２３３は、この最適操作量と、ステップＳ２２０で読み出した初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）とを対応付けて、最適操作量記憶部１３０に記憶させる。例えば、図５（Ｆ）に示すように、反復処理部２３３は、現在時刻ｔ０の操作量ｕを最適操作量とし、ステップＳ２２０で読み出した初期条件（過去時刻ｔ−１の操作量ｕ、現在時刻ｔ０の外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）とを対応付けて、最適操作量記憶部１３０に記憶させる。 In step S236, the iterative processing unit 233 of the prediction processing unit 230 extracts, from the time series signal of the operation amount u in the finite section, the operation amount u for one time as the starting point as the “optimum operation amount”. Then, the iterative processing unit 233 associates the optimal operation amount with the initial conditions (operation amount u, disturbance w, state x, output y, target value r) read in step S220, and the optimal operation amount storage unit. Make it memorize in 130. For example, as illustrated in FIG. 5F, the iterative processing unit 233 sets the operation amount u at the current time t0 as the optimum operation amount, and the initial condition read in step S220 (the operation amount u at the past time t-1, the current The disturbance w at time t0, the state x, the output y, and the target value r) are associated with each other and stored in the optimal operation amount storage unit 130.

ステップＳ２４０において、予測処理部２３０の反復処理部２３３は、初期条件記憶部１２０に記憶された初期条件の時系列について、全ての時刻分の処理を終了したか否かを判定する。全ての時刻分の処理を終了していない場合（ステップＳ２４０：ＮＯ）、ステップＳ２５０において反復処理部２３３は、起点とする時刻を１時刻進め、ステップＳ２２０以降の処理を繰り返す。全ての時刻分の処理を終了した場合（ステップＳ２４０：ＹＥＳ）、反復処理部２３３は、処理をステップＳ３００へ遷移させる。 In step S240, the iterative processing unit 233 of the prediction processing unit 230 determines whether or not the processing for all the time has been completed for the time series of the initial conditions stored in the initial condition storage unit 120. When the processing for all the times has not been completed (step S240: NO), the repetitive processing unit 233 advances the starting time to one time in step S250, and repeats the processing after step S220. When the processing for all the times has been completed (step S240: YES), the iterative processing unit 233 causes the process to transition to step S300.

このように、予測学習処理のステップＳ２００では、ステップＳ１００で生成された多様な初期条件に対してモデル予測制御を実行し、各時刻の初期条件に対応する最適操作量を求める。上述の通り、予測処理部２３０は、初期条件を読み出す起点となる時刻を１時刻分ずつ移動させつつ、各時刻の初期条件に対して最適操作量を求める、という処理を時系列信号の全時刻分に対して行う。このため、予測処理部２３０は、多様な初期条件に対する最適操作量を求めて、最適操作量記憶部１３０に記憶させておくことができる。 As described above, in step S200 of the predictive learning process, model predictive control is performed on various initial conditions generated in step S100, and an optimal operation amount corresponding to the initial conditions at each time is determined. As described above, the prediction processing unit 230 moves the time serving as the starting point for reading out the initial condition by one time, and obtains the optimum operation amount for the initial condition of each time at all time of the time series signal Do it for a minute. Therefore, the prediction processing unit 230 can obtain the optimal operation amount for various initial conditions, and store the optimal operation amount in the optimal operation amount storage unit 130.

ステップＳ３００では、最適操作量記憶部１３０に記憶されている初期条件と最適操作量との関係を表す近似式を機械学習によって求める。具体的には、ステップＳ３１０において、学習処理部２４０のＮＮ計算部２４１は、最適操作量記憶部１３０に記憶されている初期条件と最適操作量とのセットをすべて読みだす。ステップＳ３２０において、学習処理部２４０のＮＮ計算部２４１は、ニューラルネットワーク（ＮＮ）中のパラメータを初期化する。 In step S300, an approximate expression representing the relationship between the initial condition and the optimal operation amount stored in the optimal operation amount storage unit 130 is obtained by machine learning. Specifically, in step S310, the NN calculation unit 241 of the learning processing unit 240 reads all sets of the initial conditions and the optimal operation amount stored in the optimal operation amount storage unit 130. In step S320, the NN calculating unit 241 of the learning processing unit 240 initializes parameters in the neural network (NN).

ステップＳ３３０において、学習処理部２４０のＮＮ計算部２４１は、初期条件と最適操作量とを教師データとしたＮＮの教師あり学習によって近似式を求める。具体的には、ＮＮ計算部２４１は、ステップＳ３１０で読み出した初期条件をＮＮに与え（印加し）て、ＮＮの出力を求める。ステップＳ３４０において、学習処理部２４０の評価部２４２は、ＮＮの出力と、ステップＳ３１０で読み出した初期条件及び最適操作量とを目的関数に入力して、目的関数を評価することで、ＮＮによる最適操作量の近似精度（誤差）を評価する。この目的関数としては、例えば、ＮＮの出力と、読み出した最適操作量の差の二乗和を利用できる。 In step S330, the NN calculating unit 241 of the learning processing unit 240 obtains an approximate expression by supervised learning of the NN using the initial condition and the optimal operation amount as teacher data. Specifically, the NN calculation unit 241 gives (applies) the initial condition read out in step S310 to the NN, and obtains the output of the NN. In step S340, the evaluation unit 242 of the learning processing unit 240 inputs the output of the NN and the initial conditions and the optimal operation amount read out in step S310 into the objective function, and evaluates the objective function to obtain an optimal NN. Evaluate the approximation accuracy (error) of the manipulated variable. As this objective function, for example, the square sum of the difference between the output of the NN and the read optimum manipulated variable can be used.

ステップＳ３５０において、学習処理部２４０の反復処理部２４３は、目的関数の値が収束したか否かを判定する。収束した場合（ステップＳ３５０：ＹＥＳ）、反復処理部２４３は、処理をステップＳ３７０へ遷移させる。一方、収束していない場合（ステップＳ３５０：ＮＯ）、反復処理部２４３は、目的関数値をより良くするようにＮＮのパラメータを修正し、処理をステップＳ３３０へ遷移させ、ＮＮ出力と評価を繰り返す。なお、反復処理部２４３は、バックプロパゲーションといった既知の手法を利用してもよい。 In step S350, the iterative processing unit 243 of the learning processing unit 240 determines whether the value of the objective function has converged. If the convergence has occurred (step S350: YES), the iterative processing unit 243 causes the process to transition to step S370. On the other hand, if convergence has not occurred (step S350: NO), the iterative processing unit 243 corrects the parameters of the NN so as to improve the objective function value, transitions the process to step S330, and repeats the NN output and evaluation. . The iterative processing unit 243 may use a known method such as back propagation.

ステップＳ３７０において、学習処理部２４０の反復処理部２４３は、最新のＮＮと、ＮＮのパラメータとを、学習済みＮＮとして近似式記憶部１４０に記憶させ、処理を終了する。 In step S370, the iterative processing unit 243 of the learning processing unit 240 stores the latest NN and the parameters of the NN in the approximate expression storage unit 140 as a learned NN, and ends the processing.

このように、予測学習処理のステップＳ３００では、最適操作量記憶部１３０に記憶されている初期条件と最適操作量との関係を表す近似式を機械学習によって求める。ステップＳ２００で生成された初期条件と最適操作量との関係は、初期条件が決まれば最適操作量が決まる、という形の非線形関数として表現できる。ステップＳ３００では、この関係から予め近似式を作成しておく。この近似式を利用すれば、実際の内燃機関の制御において、各時刻においてリアルタイムに、最適解を見つけるための反復計算（モデル予測制御）を行うことなく、最適操作量を高速に求めることが可能となる。ここで、初期条件と最適操作量との関係は一般的に極めて非線形性が高く、さらに、初期条件には数多くの要素が関連することから、多入力多出力の高次元な関係となるため、一般的な線形式やｎ次多項式では良好な近似精度が期待できない。この点、予測学習処理のステップＳ３００では、多入力多出力、かつ、強い非線形性を効率的に近似可能な手法として、ＮＮによる機械学習を使用している。ＮＮを用いて初期条件と最適操作量との関係を学習する場合、バックプロパゲーションによりＮＮ中のパラメータが目的関数値に及ぼす影響を計算でき、その情報を元に勾配法に基づく反復計算によって非線形関係を良好に再現する学習が可能となる。このように、十分に精度よく学習されたＮＮは、リアルタイムなモデル予測制御と同等の制御性能（予測性能）を、低演算負荷で実現することができる。 As described above, in step S300 of the predictive learning process, an approximate expression that represents the relationship between the initial condition and the optimal operation amount stored in the optimal operation amount storage unit 130 is determined by machine learning. The relationship between the initial condition and the optimal manipulated variable generated in step S200 can be expressed as a non-linear function in the form of determining the optimal manipulated variable if the initial condition is determined. In step S300, an approximate expression is created in advance from this relationship. By using this approximate expression, it is possible to obtain the optimum manipulated value at high speed without performing iterative calculation (model predictive control) for finding the optimum solution in real time at each time in control of an actual internal combustion engine. It becomes. Here, since the relationship between the initial condition and the optimal manipulated variable is generally extremely non-linear, and many elements are associated with the initial condition, the multi-input multi-output high-dimensional relationship is obtained. Good approximation accuracy can not be expected in a general linear form or an n-th order polynomial. In this regard, in step S300 of the predictive learning process, machine learning by NN is used as a method capable of efficiently approximating multiple inputs and multiple outputs and strong non-linearity. When learning the relationship between the initial condition and the optimum manipulated value using NN, the back propagation can calculate the influence of the parameter in NN on the objective function value, and based on the information, it is nonlinear by iterative calculation based on the gradient method. It becomes possible to learn to reproduce the relationship well. Thus, the NN learned with sufficient accuracy can realize control performance (predictive performance) equivalent to real-time model predictive control with a low computational load.

以上説明した通り、情報処理装置１によれば、予測処理部２３０は、モデル予測制御を利用して予め、各初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）に応じた内燃機関のアクチュエータの最適操作量を求めて、最適操作量記憶部１３０に記憶させておく（図３：ステップＳ２００）。そして、学習処理部２４０は予め、最適操作量記憶部１３０内の初期条件と最適操作量との関係を表す近似式（学習済みＮＮとパラメータ）を求めておくことができる（図４：ステップＳ３００）。このため、本構成の情報処理装置１によれば、実際の内燃機関の制御では、リアルタイムな反復計算を必要とせず、学習処理部２４０によって求められた近似式（学習済みＮＮとパラメータ）を利用することで、初期条件内の各要素に対応する各実際値（または推定値）に応じた、内燃機関のアクチュエータの最適操作量を素早く求めることができ、処理負荷の低減と処理時間の短縮とを図ることができる。また、初期条件生成部２２０は、初期条件を生成する際に、内燃機関のアクチュエータの操作量ｕ（及び外乱ｗ）の変化に応じた、内燃機関の制御対象部の状態ｘ（及び出力ｙ）の変化をモデル化したモデル式を使用する。このため、初期条件を生成する際に、各要素を網羅的に組み合わせた試行を行う場合と比較して、演算量を減らすことができ、初期条件生成の効率化を図ることができる。 As described above, according to the information processing apparatus 1, the prediction processing unit 230 uses the model prediction control to set each initial condition (operation amount u, disturbance w, state x, output y, target value r) in advance. The optimum operation amount of the actuator of the internal combustion engine according to the request is determined and stored in the optimum operation amount storage unit 130 (FIG. 3: step S200). Then, the learning processing unit 240 can obtain in advance an approximate expression (learned NN and parameter) representing the relationship between the initial condition in the optimum operation amount storage unit 130 and the optimum operation amount (FIG. 4: step S300). ). For this reason, according to the information processing apparatus 1 of this configuration, in the control of an actual internal combustion engine, the approximation formula (learned NN and parameters) obtained by the learning processing unit 240 is used without requiring real-time repetitive calculation. By doing this, it is possible to quickly obtain the optimum operating amount of the actuator of the internal combustion engine in accordance with each actual value (or estimated value) corresponding to each element in the initial conditions, thereby reducing the processing load and shortening the processing time. Can be In addition, when generating the initial condition, the initial condition generation unit 220 determines the state x (and the output y) of the control target unit of the internal combustion engine according to the change in the operation amount u (and the disturbance w) of the actuator of the internal combustion engine. Use a model equation that models the change of. For this reason, when generating the initial condition, the amount of calculation can be reduced as compared with the case where a trial in which each element is comprehensively combined is performed, and efficient generation of the initial condition can be achieved.

＜内燃機関の制御装置＞
図６は、本発明の一実施形態としての内燃機関の制御装置３のブロック図である。制御装置３は、情報処理装置１により作成された近似式を利用して、内燃機関を制御する装置である。内燃機関としては、例えば、ディーゼルエンジン、ガソリンエンジン等が挙げられる。制御装置３は、ドライバインターフェース（ＩＦ）３１０と、ＥＣＵ（Electronic Control Unit）３２０と、ハードウェアシステム３３０とを備え、各部は図示しない車載ネットワークにより相互に接続されている。ドライバインターフェース３１０は、内燃機関の運転者による操作信号を取得するためのインタフェースであり、例えば、アクセル、ブレーキ等である。 <Control device for internal combustion engine>
FIG. 6 is a block diagram of a control device 3 of an internal combustion engine according to an embodiment of the present invention. The control device 3 is a device that controls an internal combustion engine by using the approximate expression created by the information processing device 1. As an internal combustion engine, a diesel engine, a gasoline engine, etc. are mentioned, for example. The control device 3 includes a driver interface (IF) 310, an ECU (Electronic Control Unit) 320, and a hardware system 330, and the respective parts are mutually connected by an in-vehicle network (not shown). The driver interface 310 is an interface for acquiring an operation signal by the driver of the internal combustion engine, and is, for example, an accelerator, a brake, or the like.

ＥＣＵ３２０は、内燃機関の運転を制御するマイクロコントローラ（マイコン）である。そのほかＥＣＵ３２０は、目標値決定部３２１、制御部３２２、情報取得部３２３として機能し、協働して後述する内燃機関制御を実行する。目標値決定部３２１は、運転者による操作信号に応じた内燃機関の制御対象部の出力の目標値ｒを決定する。 The ECU 320 is a microcontroller (microcomputer) that controls the operation of the internal combustion engine. In addition, the ECU 320 functions as a target value determination unit 321, a control unit 322, and an information acquisition unit 323, and cooperates to execute internal combustion engine control described later. The target value determination unit 321 determines a target value r of the output of the control target portion of the internal combustion engine according to the operation signal by the driver.

制御部３２２は、情報処理装置１によって求められた近似式（学習済みＮＮとパラメータ）を利用して、アクチュエータ３３１の最適操作量を求め、アクチュエータ３３１を動作させる。なお、近似式は、例えば、制御装置３を搭載した車両の製造時に予め制御部３２２内の図示しない記憶部に記憶されてもよい。また、制御部３２２は、図示しない通信ネットワークを介して情報処理装置１と通信を行い、定期的に近似式を取得して制御部３２２内の記憶部に記憶させてもよい。 The control unit 322 obtains the optimal operation amount of the actuator 331 using the approximate expression (learned NN and parameters) obtained by the information processing device 1, and operates the actuator 331. The approximate expression may be stored in advance in a storage unit (not shown) in the control unit 322, for example, at the time of manufacture of a vehicle equipped with the control device 3. The control unit 322 may communicate with the information processing apparatus 1 via a communication network (not shown), periodically obtain an approximate expression, and store the approximate expression in the storage unit in the control unit 322.

情報取得部３２３は、センサ３３３によって取得された検出値に基づいて、初期条件に対応した各要素（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ）の各実際値を取得する。なお、情報取得部３２３は、センサ３３３によって取得された実際値から、制御対象のモデルに基づくオブザーバやカルマンフィルタなどを用いて、操作量ｕ、外乱ｗ、状態ｘ、出力ｙの各推定値を求めてもよい。内燃機関の制御では、最適操作量を求めるために多くの要素（例えば、数十個）を必要とする。このため、情報取得部３２３は、一部の要素についてセンサ３３３により取得された実際値を用い、他の要素について推定した推定値を使用してもよい。なお、初期条件に対応した目標値ｒは、目標値決定部３２１によって別途設定される。 The information acquisition unit 323 acquires each actual value of each element (operation amount u, disturbance w, state x, output y) corresponding to the initial condition based on the detection value acquired by the sensor 333. Note that the information acquisition unit 323 obtains estimated values of the operation amount u, the disturbance w, the state x, and the output y from the actual values acquired by the sensor 333 using an observer or a Kalman filter based on the model of the control target. May be In the control of an internal combustion engine, many elements (for example, several tens) are required to obtain an optimal manipulated variable. For this reason, the information acquisition unit 323 may use an actual value acquired by the sensor 333 for a part of the elements and use estimated values estimated for the other elements. The target value r corresponding to the initial condition is separately set by the target value determination unit 321.

ハードウェアシステム３３０は、内燃機関に搭載されているハードウェアである。アクチュエータ３３１は、内燃機関において操作することが可能な１つまたは複数のアクチュエータであり、スロットルや、ＥＧＲシステムにおけるＥＧＲバルブ等である。制御対象部３３２は、内燃機関に含まれる１つまたは複数の制御対象部であり、ＥＧＲシステムや、過給機等である。センサ３３３は、アクチュエータ３３１の動作状況（操作量ｕ）、制御対象部３３２の状態ｘ、制御対象部３３２の出力ｙ、内燃機関に対する外乱ｗの各々を検出するためのセンサである。 Hardware system 330 is hardware installed in an internal combustion engine. The actuator 331 is one or more actuators operable in the internal combustion engine, such as a throttle and an EGR valve in an EGR system. The control target unit 332 is one or more control target units included in the internal combustion engine, and is an EGR system, a supercharger, or the like. The sensor 333 is a sensor for detecting each of the operation state (operation amount u) of the actuator 331, the state x of the control target portion 332, the output y of the control target portion 332, and the disturbance w to the internal combustion engine.

図７は、内燃機関制御における処理の手順を示すフローチャートである。内燃機関制御は、情報処理装置１により作成された近似式を利用して求めた最適操作量を用いて、内燃機関（具体的にはアクチュエータ３３１）を制御する処理である。図７に示す内燃機関制御は、例えば制御装置３を搭載した車両の始動時に開始され、所定の制御周期ごとに繰り返し実行される。 FIG. 7 is a flowchart showing the procedure of processing in internal combustion engine control. The internal combustion engine control is processing for controlling the internal combustion engine (specifically, the actuator 331) using the optimal operation amount obtained using the approximate expression created by the information processing device 1. The internal combustion engine control shown in FIG. 7 is started, for example, when the vehicle equipped with the control device 3 is started, and is repeatedly executed at predetermined control cycles.

ステップＳ４１０において、ドライバインターフェース３１０は、アクセル開度やブレーキ開度から内燃機関の運転者による操作信号を取得し、目標値決定部３２１へと送信する。ステップＳ４２０において、ＥＣＵ３２０の目標値決定部３２１は、取得した操作信号（運転者による操作信号）と、情報取得部３２３から取得した現在の操作量ｕ、外乱ｗ、状態ｘ、出力ｙとを用いて、制御部３２２の出力ｙに対する目標値ｒを決定し、制御部３２２へ送信する。 In step S410, the driver interface 310 acquires an operation signal by the driver of the internal combustion engine from the accelerator opening degree and the brake opening degree, and transmits the operation signal to the target value determination unit 321. In step S420, the target value determination unit 321 of the ECU 320 uses the acquired operation signal (operation signal by the driver) and the current operation amount u acquired from the information acquisition unit 323, the disturbance w, the state x, and the output y. Then, the target value r for the output y of the control unit 322 is determined and transmitted to the control unit 322.

ステップＳ４３０において、ＥＣＵ３２０の制御部３２２は、取得した目標値ｒと、情報取得部３２３から取得した現在の操作量ｕ、外乱ｗ、状態ｘ、出力ｙとを近似式（学習済みＮＮとパラメータ）に適用する。これにより制御部３２２は、目標値決定部３２１より指定された目標値ｒを達成するための、内燃機関のアクチュエータ３３１の最適な操作量（最適操作量）を決定できる。ステップＳ４４０において、ＥＣＵ３２０の制御部３２２は、決定した最適操作量でアクチュエータ３３１を動作させる。 In step S430, the control unit 322 of the ECU 320 approximates the acquired target value r, the current operation amount u acquired from the information acquisition unit 323, the disturbance w, the state x, and the output y (learned NN and parameters) Apply to As a result, the control unit 322 can determine an optimal operation amount (optimal operation amount) of the actuator 331 of the internal combustion engine to achieve the target value r specified by the target value determination unit 321. In step S440, the control unit 322 of the ECU 320 operates the actuator 331 with the determined optimal operation amount.

ステップＳ４５０において、アクチュエータ３３１の動作の結果として、制御対象部３３２の状態ｘ及び出力ｙが変化する。ステップＳ４６０において、センサ３３３は、アクチュエータ３３１の動作状況（操作量ｕ）、制御対象部３３２の状態ｘ、制御対象部３３２の出力ｙ、内燃機関に対する外乱ｗについて、最新の情報を検出し、検出信号を情報取得部３２３へと送信する。ステップＳ４７０において、ＥＣＵ３２０の情報取得部３２３は、取得した最新の情報に基づく操作量ｕ、外乱ｗ、状態ｘ、出力ｙの各実際値または各推定値を求める。 In step S450, as a result of the operation of the actuator 331, the state x and the output y of the control target unit 332 change. In step S460, the sensor 333 detects and detects the latest information regarding the operation state (operation amount u) of the actuator 331, the state x of the control target portion 332, the output y of the control target portion 332, and the disturbance w for the internal combustion engine. The signal is transmitted to the information acquisition unit 323. In step S470, the information acquisition unit 323 of the ECU 320 calculates actual values or estimated values of the operation amount u, the disturbance w, the state x, and the output y based on the acquired latest information.

図８は、内燃機関制御による動作の一例を示す。図８では、操作量ｕとして操作量ｕ１〜ｕ３の３項目、外乱ｗとして外乱ｗ１〜ｗ４の４項目、状態ｘとして状態ｘ１〜ｘ７の７項目、出力ｙとして出力ｙ１〜ｙ２の２項目、目標値ｒとして目標値ｒ１〜ｒ２の２項目を使用して、内燃機関制御を実行した場合の具体例を示す。図８（Ａ）には、操作量ｕ及び外乱ｗの時系列信号を図示し、図（Ｂ）には、出力ｙ及び目標値ｒの時系列信号を図示している。図示の便宜上、状態ｘの時系列信号は省略している。図８（Ｂ）に示すように、最適操作量を求めるために多くの要素を必要とする多入力多出力の複雑なケースであっても、出力ｙの実際値は、設定された目標値ｒに適切に追従していることがわかる。 FIG. 8 shows an example of operation by internal combustion engine control. In FIG. 8, three items of operation amounts u1 to u3 as operation amount u, four items of disturbances w1 to w4 as disturbance w, seven items of states x1 to x7 as state x, and two items of outputs y1 to y2 as output y, The specific example at the time of performing internal combustion engine control is shown using two items of the target value r1-r2 as the target value r. FIG. 8A shows time-series signals of the operation amount u and the disturbance w, and FIG. 8B shows time-series signals of the output y and the target value r. For convenience of illustration, the time-series signal of the state x is omitted. As shown in FIG. 8 (B), the actual value of the output y is the set target value r even in the complex case of multiple inputs and multiple outputs that require many elements to obtain the optimum manipulated variable. Properly follow.

図９は、内燃機関制御に要した演算時間の一例を示す。図９の横軸には時間を、縦軸には制御装置３のＥＣＵ３２０による演算に要した時間（ｍｓ）を示している。図９に示す通り、ＥＣＵ３２０の１回あたりの演算時間は０．０３ｍｓ〜０．０６ｍｓであり、最大でも０．２ｍｓ以内であることがわかる。例えば、モデル予測制御の高速解法として知られているＣ／ＧＭＲＥＳ法を用いて、リアルタイムに内燃機関の制御を行った場合の従来例では、演算時間は約６０ｍｓであった（仲田勇人ほか，“ディーゼルエンジン吸排気システムへのＣ／ＧＭＲＥＳモデル予測制御の応用"）。この従来例と比較すると、本実施形態の内燃機関制御では、演算速度を約１０００倍以上も高速化できる。 FIG. 9 shows an example of calculation time required for internal combustion engine control. The horizontal axis in FIG. 9 represents time, and the vertical axis represents time (ms) required for calculation by the ECU 320 of the control device 3. As shown in FIG. 9, it can be seen that the calculation time per one operation of the ECU 320 is 0.03 ms to 0.06 ms, and is within 0.2 ms at the maximum. For example, in the conventional example in the case of performing control of an internal combustion engine in real time using the C / GMRES method known as a high-speed solution for model predictive control, the computation time was about 60 ms (Nakata et al., " Application of C / GMRES model predictive control to diesel engine intake and exhaust system "). Compared to this conventional example, in the internal combustion engine control of this embodiment, the calculation speed can be increased by about 1000 times or more.

以上説明した通り、内燃機関の制御装置３によれば、ＥＣＵ３２０の制御部３２２は、情報取得部３２３によって取得された操作量ｕ、外乱ｗ、状態ｘ、出力ｙの各実際値または推定値と、制御部３２２の記憶部内の近似式とを用いて、各実際値または推定値に応じた内燃機関のアクチュエータ３３１の最適操作量を素早く求めることができ、処理負荷の低減と処理時間の短縮とを図ることができる。 As described above, according to the control device 3 of the internal combustion engine, the control unit 322 of the ECU 320 causes the actual value or estimated value of the operation amount u, the disturbance w, the state x, and the output y acquired by the information acquiring unit 323 The optimum operation amount of the actuator 331 of the internal combustion engine according to each actual value or estimated value can be quickly obtained using the approximate expression in the storage unit of the control unit 322, and the processing load is reduced and the processing time is shortened. Can be

＜本実施形態の変形例＞
本発明は上記の実施形態に限られるものではなく、その要旨を逸脱しない範囲において種々の態様において実施することが可能であり、例えば次のような変形も可能である。 <Modification of this embodiment>
The present invention is not limited to the above embodiment, and can be implemented in various aspects without departing from the scope of the present invention. For example, the following modifications are possible.

［変形例１］
上記実施形態では、情報処理装置の構成の一例を示した。しかし、情報処理装置の構成は種々の変形が可能である。例えば、情報処理装置は、ネットワーク上に配置された複数の情報処理装置が協働することによって構成されてもよい。この場合、例えば、実験計画処理部、初期条件生成部、予測処理部、学習処理部のうちの少なくとも一部が異なる情報処理装置によって実現されてよい。 [Modification 1]
In the above embodiment, an example of the configuration of the information processing apparatus has been shown. However, the configuration of the information processing apparatus can be variously modified. For example, the information processing apparatus may be configured by cooperation of a plurality of information processing apparatuses arranged on a network. In this case, for example, at least a part of the experiment plan processing unit, the initial condition generation unit, the prediction processing unit, and the learning processing unit may be realized by a different information processing apparatus.

［変形例２］
上記実施形態では、モデル予測制御において初期条件として考慮すべき要素の一例を挙げた。しかし、モデル予測制御において、初期条件として考慮する要素ａ１〜ａ５のうちの少なくとも一部は、省略してもよく、さらなる他の要素を考慮してもよい。具体的には、要素ａ２の外乱ｗ、要素ａ４の出力ｙ、要素ａ５の目標値ｒのうちの少なくとも一部は省略してよい。例えば外乱ｗを省略する場合、モデル式における外乱ｗのパラメータは省略できる。また、予測学習処理（図３）のステップＳ１００における外乱ｗの初期条件生成、ステップＳ２００における初期条件としての外乱ｗの考慮は省略できる。また、内燃機関制御（図７）のステップＳ４２０、Ｓ４３０、Ｓ４７０における外乱ｗの考慮も省略してよい。出力ｙ及び目標値ｒについても同様に、省略する場合は、モデル式と、予測学習処理と、内燃機関制御の各々について、省略された要素に対する考慮は省略してよい。 [Modification 2]
In the said embodiment, an example of the element which should be considered as an initial condition in model predictive control was mentioned. However, in model predictive control, at least a part of the elements a1 to a5 to be considered as initial conditions may be omitted, and further other elements may be considered. Specifically, at least a part of the disturbance w of the element a2, the output y of the element a4, and the target value r of the element a5 may be omitted. For example, when the disturbance w is omitted, the parameter of the disturbance w in the model equation can be omitted. Further, initial condition generation of the disturbance w in step S100 of the prediction learning process (FIG. 3) and consideration of the disturbance w as the initial condition in step S200 can be omitted. Further, consideration of the disturbance w in steps S420, S430, and S470 of the internal combustion engine control (FIG. 7) may be omitted. Similarly for the output y and the target value r, in the case of omission, consideration of the omitted elements may be omitted for each of the model equation, the prediction learning process, and the internal combustion engine control.

［変形例３］
上記実施形態では、予測学習処理の一例を示した（図３、図４）。しかし、予測学習処理は種々の変形が可能である。例えば、ステップＳ１００において、実験計画法を用いずに操作量ｕなどの時系列信号を生成してもよい。例えば、ステップＳ３００において、サポートベクターマシン（ＳＶＭ：Support Vector Machine）等のＮＮ以外の手段を用いることで近似式を求めてもよい。例えば、ステップＳ１００、Ｓ２００、Ｓ３００は一連の処理として実行されず、個別に実行されてよい。 [Modification 3]
In the said embodiment, an example of the prediction learning process was shown (FIG. 3, FIG. 4). However, the prediction learning process can be variously modified. For example, in step S100, a time-series signal such as the operation amount u may be generated without using the experimental design method. For example, in step S300, an approximate expression may be obtained by using a means other than the NN, such as a support vector machine (SVM). For example, steps S100, S200, and S300 may not be performed as a series of processes, but may be performed individually.

例えば、予測学習処理では、上述した一部のステップを省略してもよく、さらなる他のステップを追加で実行してもよい。具体的には、例えば、ステップＳ１００において、初期条件のうちの少なくとも一部の時系列信号を第１の方法（実験計画法）で生成し、残りの時系列信号を第２の方法で生成してもよい。例えば、モデル式記憶部に複数のモデル式を予め記憶させておき、ステップＳ１３０やステップＳ２３２において、初期条件のうちの少なくとも一部の特性や実際値に応じて、適用するモデル式を変更してもよい。例えば、ステップＳ３００の終了後に、生成した近似式を内燃機関の制御装置へと配信してもよい。 For example, in the predictive learning process, some steps described above may be omitted, and other steps may be additionally performed. Specifically, for example, in step S100, a time-series signal of at least a part of the initial conditions is generated by the first method (experimental design method), and the remaining time-series signals are generated by the second method. May be For example, a plurality of model formulas are stored in advance in the model formula storage unit, and in step S130 or step S232, the model formula to be applied is changed according to at least a part of the initial conditions and the actual value. It is also good. For example, after the end of step S300, the generated approximate expression may be distributed to the control device of the internal combustion engine.

［変形例４］
上記実施形態では、内燃機関の制御装置の構成の一例を示した。しかし、内燃機関の制御装置の構成は種々の変形が可能である。例えば、制御装置は、モデル予測制御を利用して、内燃機関の制御に使用するための近似式を求める上述した情報処理装置の機能をさらに備えていてもよい。 [Modification 4]
In the said embodiment, an example of a structure of the control apparatus of an internal combustion engine was shown. However, the configuration of the control device for an internal combustion engine can be variously modified. For example, the control device may further include the function of the above-described information processing device for obtaining an approximate expression to be used for control of an internal combustion engine by using model predictive control.

［変形例５］
上記実施形態では、内燃機関制御の一例を示した（図７）。しかし、予測学習処理は種々の変形が可能である。例えば、上述した一部のステップを省略してもよく、さらなる他のステップを追加で実行してもよい。具体的には、例えば、ステップＳ４７０の終了後に、取得した操作量ｕ、外乱ｗ、状態ｘ、出力ｙの実際値を情報処理装置へと送信して、近似式の精度向上に役立ててもよい。 [Modification 5]
In the said embodiment, an example of internal combustion engine control was shown (FIG. 7). However, the prediction learning process can be variously modified. For example, some steps described above may be omitted, and other steps may be additionally performed. Specifically, for example, after the end of step S470, actual values of the acquired operation amount u, disturbance w, state x, and output y may be transmitted to the information processing apparatus to help improve the accuracy of the approximate expression. .

以上、実施形態、変形例に基づき本態様について説明してきたが、上記した態様の実施の形態は、本態様の理解を容易にするためのものであり、本態様を限定するものではない。本態様は、その趣旨並びに特許請求の範囲を逸脱することなく、変更、改良され得ると共に、本態様にはその等価物が含まれる。また、その技術的特徴が本明細書中に必須なものとして説明されていなければ、適宜、削除することができる。 As mentioned above, although this aspect was demonstrated based on embodiment and a modification, embodiment of the above-mentioned aspect is for making an understanding of this aspect easy, and does not limit this aspect. The present embodiment can be modified and improved without departing from the spirit and the scope of the claims, and the present embodiment includes the equivalents thereof. In addition, if the technical feature is not described as essential in the present specification, it can be deleted as appropriate.

１…情報処理装置
３…内燃機関の制御装置
１００…記憶部
１１０…モデル式記憶部
１２０…初期条件記憶部
１２１…操作量記憶部
１２２…外乱記憶部
１２３…状態記憶部
１２４…出力記憶部
１２５…目標値記憶部
１３０…最適操作量記憶部
１４０…近似式記憶部
２００…情報処理部
２１０…実験計画処理部
２２０…初期条件生成部
２２１…予測計算部
２３０…予測処理部
２３１…予測計算部
２３２…評価部
２３３…反復処理部
２４０…学習処理部
２４１…ＮＮ計算部
２４２…評価部
２４３…反復処理部
３１０…ドライバインターフェース
３２１…目標値決定部
３２２…制御部
３２３…情報取得部
３３０…ハードウェアシステム
３３１…アクチュエータ
３３２…制御対象部
３３３…センサ DESCRIPTION OF SYMBOLS 1 ... Information processing apparatus 3 ... Control apparatus 100 of internal combustion engine 100 ... Storage part 110 ... Model type storage part 120 ... Initial condition storage part 121 ... Operation amount storage part 122 ... Disturbance storage part 123 ... State storage part 124 ... Output storage part 125 ... Target value storage unit 130 ... Optimal operation amount storage unit 140 ... Approximate expression storage unit 200 ... Information processing unit 210 ... Experiment plan processing unit 220 ... Initial condition generation unit 221 ... Prediction calculation unit 230 ... Prediction processing unit 231 ... Prediction calculation unit 232 evaluation unit 233 iterative processing unit 240 learning processing unit 241 NN calculating unit 242 evaluation unit 243 iterative processing unit 310 driver interface 321 target value determination unit 322 control unit 323 information acquisition unit 330 hardware Wear system 331 ... actuator 332 ... control target portion 333 ... sensor

Claims

An information processing apparatus,
A model expression storage unit for storing in advance a model expression that models a change in a state of a control target portion of the internal combustion engine according to a change in an operation amount of an actuator of the internal combustion engine;
An initial condition storage unit storing an initial condition including at least a time series signal of an operation amount of the actuator and a time series signal of a state of the control target unit;
An optimal operation amount storage unit that stores, in association with the initial condition at each time in the initial condition storage unit, an optimal operation amount that is an optimal operation amount of the actuator under the initial condition;
The time series signal of the predicted value of the state of the control target unit is generated by applying the previously generated time series signal of the operation amount of the actuator to the model expression in the model expression storage unit, and the initial condition An initial condition generation unit to be stored in the initial condition storage unit as
The evaluation is repeated while changing the input operation amount for an objective function using the initial condition in the initial condition storage unit and the state estimated using the model expression in the model expression storage unit. A prediction processing unit which obtains the optimal operation amount by the equation and stores the optimal operation amount in the optimal operation amount storage unit;
A learning processing unit for obtaining an approximate expression representing a relationship between the initial condition and the optimal operation amount in the optimal operation amount storage unit;
An information processing apparatus comprising:

The information processing apparatus according to claim 1, wherein
The information processing apparatus, wherein the learning processing unit obtains the approximate expression by supervised learning of a neural network using the initial condition and the optimal operation amount as teacher data.

The information processing apparatus according to claim 1 or 2, further comprising:
An information processing apparatus, comprising: an experiment plan processing unit that generates the time-series signal of the operation amount of the actuator using an experiment design method.

The information processing apparatus according to any one of claims 1 to 3, wherein
In the initial condition, further,
Disturbance to the internal combustion engine,
An output of the control target unit;
A target value of the output of the control target unit;
Contains at least a portion of
When the initial condition includes the disturbance, the model equation models a change in the state according to a change in the operation amount and the disturbance.
The information processing apparatus according to any one of the first to third aspects, wherein when the output includes the output, the change in the state and the output according to a change in the operation amount is modeled in the model expression.

The information processing apparatus according to any one of claims 1 to 4, wherein
An information processing apparatus using linear equation of state and nonlinear equation of state as the model equation.

The information processing apparatus according to any one of claims 1 to 5, wherein
An information processing apparatus using a non-linear equation configured using an NARX model as the model equation.

The information processing apparatus according to any one of claims 4 to 6, which is dependent on claim 3 or claim 3,
As the experimental design method, a first method of generating a signal represented by a combination of a step function and a ramp function, and a second method of generating a signal represented by a chirp signal whose frequency changes depending on time An information processing apparatus that uses either or.

An information processing method using a model expression that models a change in a state of a control target portion of an internal combustion engine according to a change in an operation amount of an actuator of the internal combustion engine,
Applying a time-series signal of the operation amount of the actuator created in advance to the model equation to generate a time-series signal of the predicted value of the state of the control target unit;
Storing, as an initial condition, a time-series signal of an operation amount of the actuator and a generated time-series signal of the state of the control target unit;
For the objective function using the initial condition and the state estimated using the model equation, the evaluation operation is repeated while changing the operation amount to be input, whereby the optimum operation of the actuator under the initial condition is performed. Determining an optimal operation amount which is an amount
Storing the determined optimal operation amount in association with the initial condition;
Obtaining an approximate expression representing a relationship between the initial condition and the optimal operation amount;
An information processing method comprising:

A computer program using a model formula modeling a change in a state of a control target portion of an internal combustion engine according to a change in an operation amount of an actuator of the internal combustion engine,
A function of generating a time-series signal of a predicted value of the state of the control target unit by applying a time-series signal of the operation amount of the actuator created in advance to the model equation;
A function of storing, as an initial condition, a time-series signal of an operation amount of the actuator and a generated time-series signal of the state of the control target unit;
For the objective function using the initial condition and the state estimated using the model equation, the evaluation operation is repeated while changing the operation amount to be input, whereby the optimum operation of the actuator under the initial condition is performed. A function for determining an optimal operation amount which is an amount
A function of associating and storing the determined optimal operation amount with respect to the initial condition;
A function of obtaining an approximate expression representing a relationship between the initial condition and the optimal operation amount;
, A computer program.

A control device for an internal combustion engine comprising an actuator and a control target portion, the control device comprising:
The initial condition including at least the time-series signal of the operation amount of the actuator and the time-series signal of the state of the control target portion, the optimum of the actuator determined for the initial condition and the initial condition at each time A storage unit that stores an approximate expression representing a relationship with an optimal operation amount, which is a simple operation amount;
An information acquisition unit that acquires an actual operation amount of the actuator and a state of the control target unit;
The optimum operation amount corresponding to the actual operation amount and the state is determined using the acquired operation amount and the state, and the approximate expression in the storage unit, and the actuator is determined according to the optimum operation amount. A control unit to operate;
A control device for an internal combustion engine, comprising: