JP6933585B2

JP6933585B2 - Information processing device, information processing method, computer program, control device

Info

Publication number: JP6933585B2
Application number: JP2018003459A
Authority: JP
Inventors: 竜大森安; 松栄上田; 真永岡; 池田　太郎; 太郎池田; 神保　智彦; 智彦神保; 俊洋中村; 松永　彰生; 彰生松永
Original assignee: Toyota Motor Corp; Toyota Central R&D Labs Inc
Current assignee: Toyota Motor Corp; Toyota Central R&D Labs Inc
Priority date: 2018-01-12
Filing date: 2018-01-12
Publication date: 2021-09-08
Anticipated expiration: 2038-01-12
Also published as: JP2019125021A

Description

本発明は、情報処理装置に関する。 The present invention relates to an information processing device.

各分野において、モデル予測制御（ＭＰＣ：Model Predictive Control）を利用した制御手法が利用されている。モデル予測制御とは、なんらかの制御パラメータについて、各時刻において未来の応答を予測しながら最適化を行う（最適解を見つける）制御手法である。例えば、特許文献１には、モデル予測制御を利用した内燃機関の制御に関し、内燃機関の制御要素（例えば、ターボチャージャ、排気再循環）のモデルを用いた反復計算によって、有限区間内で制御要素の動作を最適化することが記載されている。例えば、特許文献２には、モデル予測制御を利用して、車両制御を行うためのテーブルや、車両の制御変数を算出する近似式を作成することが記載されている。 In each field, a control method using model predictive control (MPC) is used. Model prediction control is a control method that optimizes some control parameters while predicting future responses at each time (finding the optimum solution). For example, in Patent Document 1, regarding the control of an internal combustion engine using model predictive control, the control element is performed within a finite section by iterative calculation using a model of the control element of the internal combustion engine (for example, turbocharger, exhaust gas recirculation). It is described to optimize the operation of. For example, Patent Document 2 describes that a table for performing vehicle control and an approximate expression for calculating control variables of a vehicle are created by using model prediction control.

特開２０１１−２５３５３６号公報Japanese Unexamined Patent Publication No. 2011-253536 特許第５８７０６３２号公報Japanese Patent No. 5870632

特許文献１では、各時刻においてリアルタイムに、最適解を見つけるための反復計算を行っている。この反復計算には膨大な量の演算が必要であるため、処理負荷が高く、処理に時間を要し、リアルタイム処理が不可能な場合も生じるという課題があった。特許文献２では、予め反復計算によって得られたテーブルあるいは近似式を作成しているため、リアルタイムな反復計算を必要としない。しかし、例えば内燃機関の制御のように、制御パラメータの最適解を導出するために、多くの要素（例えば、数十個）が関連する初期条件を持つ複雑なシステムを対象とする場合、極めて多次元のテーブルあるいは近似式が必要となる。このように多くの要素を含む初期条件について、網羅的な試行によって多次元のテーブルあるいは近似式を作成することは、組み合わせ数の爆発的増加のため困難であるが、特許文献２では、このような課題について何ら記載されていない。 In Patent Document 1, iterative calculation for finding the optimum solution is performed in real time at each time. Since this iterative calculation requires a huge amount of operations, there is a problem that the processing load is high, the processing takes time, and real-time processing may not be possible. In Patent Document 2, since a table or an approximate expression obtained by iterative calculation is created in advance, real-time iterative calculation is not required. However, when targeting a complex system with initial conditions in which many elements (for example, dozens) are related in order to derive the optimum solution of control parameters, for example, control of an internal combustion engine, it is extremely common. A table of dimensions or an approximate expression is required. It is difficult to create a multidimensional table or an approximate expression by exhaustive trials for the initial conditions including such many elements due to the explosive increase in the number of combinations. There is no mention of any issues.

本発明は、上述した課題を解決するためになされたものであり、モデル予測制御を利用した制御において、処理負荷の低減と処理時間の短縮とを図るとともに、初期条件生成の効率化を図ることを目的とする。 The present invention has been made to solve the above-mentioned problems, and in the control using model prediction control, the processing load is reduced, the processing time is shortened, and the efficiency of initial condition generation is improved. The purpose is.

本発明は、上述の課題の少なくとも一部を解決するためになされたものであり、以下の形態として実現することが可能である。情報処理装置であって、アクチュエータの操作量の変化に応じた、制御対象部の状態の変化をモデル化したモデル式を予め記憶するモデル式記憶部と、前記アクチュエータの操作量の時系列信号と、前記制御対象部の状態の時系列信号と、を少なくとも含む初期条件を記憶する初期条件記憶部と、前記初期条件記憶部内の各時刻における前記初期条件に対して、当該初期条件における前記アクチュエータの最適な操作量である最適操作量を対応付けて記憶する最適操作量記憶部と、予め作成された前記アクチュエータの操作量の時系列信号を、前記モデル式記憶部内の前記モデル式に適用することで、前記制御対象部の状態の予測値の時系列信号を生成し、前記初期条件として前記初期条件記憶部に記憶させる初期条件生成部と、前記初期条件記憶部内の前記初期条件と、前記モデル式記憶部内の前記モデル式を用いて推定された前記状態と、を用いた目的関数について、入力する前記操作量を変化させつつ評価を繰り返すことによって前記最適操作量を求め、前記最適操作量記憶部に記憶させる予測処理部と、前記最適操作量記憶部内の前記初期条件と前記最適操作量との関係を表す近似式を求める学習処理部と、を備える、情報処理装置。そのほか、本発明は、以下の形態としても実現可能である。 The present invention has been made to solve at least a part of the above-mentioned problems, and can be realized as the following forms. An information processing device, a model type storage unit that stores in advance a model formula that models a change in the state of a controlled object unit in response to a change in the operation amount of the actuator, and a time-series signal of the operation amount of the actuator. An initial condition storage unit that stores an initial condition including at least a time-series signal of the state of the control target unit, and an actuator of the actuator in the initial condition with respect to the initial condition at each time in the initial condition storage unit. Applying the optimum operation amount storage unit that stores the optimum operation amount, which is the optimum operation amount, and the time-series signal of the operation amount of the actuator created in advance to the model formula in the model formula storage unit. An initial condition generation unit that generates a time-series signal of a predicted value of the state of the control target unit and stores it in the initial condition storage unit as the initial condition, the initial condition in the initial condition storage unit, and the model. The optimum operation amount is obtained by repeating the evaluation while changing the input operation amount for the state estimated using the model expression in the expression storage unit and the objective function using, and the optimum operation amount storage. An information processing apparatus including a prediction processing unit to be stored in a unit and a learning processing unit for obtaining an approximate expression representing the relationship between the initial conditions in the optimum operation amount storage unit and the optimum operation amount. In addition, the present invention can also be realized in the following forms.

（１）本発明の一形態によれば、情報処理装置が提供される。この情報処理装置は、モデル式記憶部と、初期条件記憶部と、最適操作量記憶部と、初期条件生成部と、予測処理部と、学習処理部とを備える。モデル式記憶部は、内燃機関のアクチュエータの操作量の変化に応じた、内燃機関の制御対象部の状態の変化をモデル化したモデル式を予め記憶する。初期条件記憶部は、前記アクチュエータの操作量の時系列信号と、前記制御対象部の状態の時系列信号と、を少なくとも含む初期条件を記憶する。最適操作量記憶部は、前記初期条件記憶部内の各時刻における前記初期条件に対して、当該初期条件における前記アクチュエータの最適な操作量である最適操作量を対応付けて記憶する。初期条件生成部は、予め作成された前記アクチュエータの操作量の時系列信号を、前記モデル式記憶部内の前記モデル式に適用することで、前記制御対象部の状態の予測値の時系列信号を生成し、前記初期条件として前記初期条件記憶部に記憶させる。予測処理部は、前記初期条件記憶部内の前記初期条件と、前記モデル式記憶部内の前記モデル式を用いて推定された前記状態と、を用いた目的関数について、入力する前記操作量を変化させつつ評価を繰り返すことによって前記最適操作量を求め、前記最適操作量記憶部に記憶させる。学習処理部は、前記最適操作量記憶部内の前記初期条件と前記最適操作量との関係を表す近似式を求める。 (1) According to one embodiment of the present invention, an information processing device is provided. This information processing device includes a model type storage unit, an initial condition storage unit, an optimum manipulated variable storage unit, an initial condition generation unit, a prediction processing unit, and a learning processing unit. The model type storage unit stores in advance a model type that models a change in the state of the controlled object unit of the internal combustion engine according to a change in the operating amount of the actuator of the internal combustion engine. The initial condition storage unit stores at least an initial condition including a time-series signal of the operation amount of the actuator and a time-series signal of the state of the controlled target unit. The optimum operation amount storage unit stores the optimum operation amount, which is the optimum operation amount of the actuator under the initial condition, in association with the initial condition at each time in the initial condition storage unit. The initial condition generation unit applies the time-series signal of the operation amount of the actuator created in advance to the model formula in the model formula storage unit, thereby generating the time-series signal of the predicted value of the state of the control target unit. It is generated and stored in the initial condition storage unit as the initial condition. The prediction processing unit changes the amount of operation to be input with respect to the objective function using the initial condition in the initial condition storage unit and the state estimated using the model formula in the model formula storage unit. While repeating the evaluation, the optimum operation amount is obtained and stored in the optimum operation amount storage unit. The learning processing unit obtains an approximate expression representing the relationship between the initial condition and the optimum manipulated variable in the optimal manipulated variable storage unit.

この構成によれば、予測制御部は、モデル予測制御を利用して予め、各初期条件に応じた内燃機関のアクチュエータの最適操作量を求めて、最適操作量記憶部に記憶させておく。そして、学習処理部は予め、最適操作量記憶部内の初期条件と最適操作量との関係を表す近似式を求めておくことができる。このため、本構成によれば、実際の内燃機関の制御では、リアルタイムな反復計算を必要とせず、学習処理部によって求められた近似式を利用することで、初期条件内の各要素に対応する各実際値（または推定値）に応じた内燃機関のアクチュエータの最適操作量を素早く求めることができ、処理負荷の低減と処理時間の短縮とを図ることができる。また、初期条件生成部は、初期条件を生成する際に、内燃機関のアクチュエータの操作量の変化に応じた、内燃機関の制御対象部の状態の変化をモデル化したモデル式を使用する。このため、初期条件を生成する際に、各要素を網羅的に組み合わせた試行を行う場合と比較して、演算量を減らすことができ、初期条件生成の効率化を図ることができる。 According to this configuration, the prediction control unit obtains the optimum operation amount of the actuator of the internal combustion engine according to each initial condition in advance by using the model prediction control, and stores it in the optimum operation amount storage unit. Then, the learning processing unit can obtain in advance an approximate expression expressing the relationship between the initial condition in the optimum manipulated variable storage unit and the optimum manipulated variable. Therefore, according to this configuration, the actual control of the internal combustion engine does not require real-time iterative calculation, and corresponds to each element in the initial condition by using the approximate expression obtained by the learning processing unit. The optimum operating amount of the actuator of the internal combustion engine can be quickly obtained according to each actual value (or estimated value), and the processing load can be reduced and the processing time can be shortened. Further, the initial condition generation unit uses a model formula that models a change in the state of the controlled object unit of the internal combustion engine according to a change in the operating amount of the actuator of the internal combustion engine when generating the initial condition. Therefore, when generating the initial condition, the amount of calculation can be reduced and the efficiency of initial condition generation can be improved as compared with the case where the trial in which each element is comprehensively combined is performed.

（２）上記形態の情報処理装置において、前記学習処理部は、前記初期条件と前記最適操作量とを教師データとしたニューラルネットワークの教師あり学習によって前記近似式を求めてもよい。ニューラルネットワークは複雑な関数近似を行う事ができるため、数多くの要素を初期条件として含み得る内燃機関の制御に適している。この構成によれば、学習処理部は、このようなニューラルネットワークを用いて近似式を求めるため、近似式の精度を向上させることができる。 (2) In the information processing apparatus of the above embodiment, the learning processing unit may obtain the approximate expression by supervised learning of a neural network using the initial conditions and the optimum operation amount as teacher data. Since neural networks can perform complex function approximations, they are suitable for controlling internal combustion engines that can include many elements as initial conditions. According to this configuration, the learning processing unit obtains an approximate expression using such a neural network, so that the accuracy of the approximate expression can be improved.

（３）上記形態の情報処理装置では、さらに、実験計画法を用いて、前記アクチュエータの操作量の前記時系列信号を生成する実験計画処理部を備えてもよい。この構成によれば、実験計画処理部は、実験計画法を用いてアクチュエータの操作量の時系列信号を生成するため、組み合わせとして物理的に無理のない時系列信号を生成できる。 (3) The information processing apparatus of the above-described embodiment may further include an experiment planning processing unit that generates the time-series signal of the operation amount of the actuator by using the design of experiments method. According to this configuration, the experimental design processing unit generates a time-series signal of the actuator operation amount by using the design of experiments method, so that it is possible to generate a physically reasonable time-series signal as a combination.

（４）上記形態の情報処理装置において、前記初期条件には、さらに、前記内燃機関に対する外乱と、前記制御対象部の出力と、前記制御対象部の出力の目標値と、のうちの少なくとも一部が含まれ、前記初期条件に前記外乱が含まれる場合、前記モデル式では、前記操作量及び前記外乱の変化に応じた前記状態の変化がモデル化され、前記初期条件に前記出力が含まれる場合、前記モデル式では、前記操作量の変化に応じた前記状態及び前記出力の変化がモデル化されていてもよい。この構成によれば、初期条件として、内燃機関に対する外乱、制御対象部の出力、制御対象部の出力の目標値等の様々な要素を考慮することができる。 (4) In the information processing apparatus of the above embodiment, the initial condition further includes at least one of a disturbance to the internal combustion engine, an output of the controlled object unit, and a target value of the output of the controlled object unit. When the unit is included and the initial condition includes the disturbance, the model formula models the change in the state according to the operation amount and the change in the disturbance, and the initial condition includes the output. In the case, in the model formula, the state and the change in the output according to the change in the operation amount may be modeled. According to this configuration, various factors such as disturbance to the internal combustion engine, output of the control target unit, target value of output of the control target unit, and the like can be considered as initial conditions.

（５）上記形態の情報処理装置において、前記モデル式として、線形状態方程式及び非線形状態方程式を使用してもよい。この構成によれば、モデル式として、線形状態方程式及び非線形状態方程式を利用できる。 (5) In the information processing apparatus of the above-described embodiment, a linear equation of state and a non-linear equation of state may be used as the model equation. According to this configuration, linear equations of state and nonlinear equations of state can be used as model equations.

（６）上記形態の情報処理装置において、前記モデル式として、ＮＡＲＸモデルを用いて構成された非線形方程式を使用してもよい。この構成によれば、モデル式として、ＮＡＲＸモデルを用いて構成された非線形方程式を利用するため、精度の高い予測結果を得ることができる。 (6) In the information processing apparatus of the above-described embodiment, a non-linear equation constructed by using the NARX model may be used as the model formula. According to this configuration, since the nonlinear equation constructed by using the NARX model is used as the model formula, a highly accurate prediction result can be obtained.

（７）上記形態の情報処理装置において、前記実験計画法として、ステップ関数やランプ関数の組み合わせで表現される信号を生成する第１の方法と、周波数が時間に依存して変化するチャープ信号で表現される信号を生成する第２の方法と、のいずれかを使用してもよい。この構成によれば、実験計画法として、ステップ関数やランプ関数の組み合わせで表現される信号を生成する第１の方法（例えば、ＡＰＲＢＳ法）と、周波数が時間に依存して変化するチャープ信号で表現される信号を生成する第２の方法（例えば、ＳｉｎｕｓｏｉｄａｌＥｘｃｉｔａｔｉｏｎ法）と、のいずれかを利用できる。 (7) In the information processing apparatus of the above embodiment, as the design of experiments, a first method for generating a signal represented by a combination of a step function and a ramp function and a chirp signal whose frequency changes depending on time are used. Either of the second method of generating the represented signal and the second method may be used. According to this configuration, as an experimental planning method, a first method (for example, the APRBS method) for generating a signal represented by a combination of a step function and a ramp function, and a chirp signal whose frequency changes depending on time are used. Either a second method of generating the represented signal (eg, the Sinusoidal Function method) can be used.

（８）本発明の一形態によれば、内燃機関のアクチュエータの操作量の変化に応じた、内燃機関の制御対象部の状態の変化をモデル化したモデル式を利用した情報処理方法が提供される。この情報処理方法では、予め作成された前記アクチュエータの操作量の時系列信号を、前記モデル式に適用することで、前記制御対象部の状態の予測値の時系列信号を生成する工程と、前記アクチュエータの操作量の時系列信号と、生成された前記制御対象部の状態の時系列信号と、を初期条件として記憶させる工程と、前記初期条件と、前記モデル式を用いて推定された前記状態と、を用いた目的関数について、入力する前記操作量を変化させつつ評価を繰り返すことによって、当該初期条件における前記アクチュエータの最適な操作量である最適操作量を求める工程と、前記初期条件に対して、求めた前記最適操作量を対応付けて記憶させる工程と、前記初期条件と前記最適操作量との関係を表す近似式を求める工程と、を備える。この方法によれば、モデル予測制御を利用した内燃機関の制御において、処理負荷の低減と処理時間の短縮とを図るとともに、初期条件生成の効率化を図ることができる。 (8) According to one embodiment of the present invention, there is provided an information processing method using a model formula that models a change in the state of a controlled object portion of an internal combustion engine in response to a change in the operating amount of an actuator of the internal combustion engine. NS. In this information processing method, a step of generating a time-series signal of a predicted value of a state of the controlled object portion by applying a time-series signal of an operation amount of the actuator created in advance to the model formula, and the above-mentioned A step of storing the time-series signal of the operation amount of the actuator and the generated time-series signal of the state of the controlled object portion as initial conditions, the initial conditions, and the state estimated using the model formula. By repeating the evaluation while changing the input operation amount for the objective function using A step of associating and storing the obtained optimum operation amount and a step of obtaining an approximate expression representing the relationship between the initial condition and the optimum operation amount are provided. According to this method, in the control of the internal combustion engine using the model prediction control, it is possible to reduce the processing load and the processing time, and to improve the efficiency of initial condition generation.

（９）本発明の一形態によれば、内燃機関のアクチュエータの操作量の変化に応じた、内燃機関の制御対象部の状態の変化をモデル化したモデル式を利用したコンピュータプログラムが提供される。このコンピュータプログラムでは、予め作成された前記アクチュエータの操作量の時系列信号を、前記モデル式に適用することで、前記制御対象部の状態の予測値の時系列信号を生成する機能と、前記アクチュエータの操作量の時系列信号と、生成された前記制御対象部の状態の時系列信号と、を初期条件として記憶させる機能と、前記初期条件と、前記モデル式を用いて推定された前記状態と、を用いた目的関数について、入力する前記操作量を変化させつつ評価を繰り返すことによって、当該初期条件における前記アクチュエータの最適な操作量である最適操作量を求める機能と、前記初期条件に対して、求めた前記最適操作量を対応付けて記憶させる機能と、前記初期条件と前記最適操作量との関係を表す近似式を求める機能と、を備える。このコンピュータプログラムによれば、モデル予測制御を利用した内燃機関の制御において、処理負荷の低減と処理時間の短縮とを図るとともに、初期条件生成の効率化を図ることができる。 (9) According to one embodiment of the present invention, a computer program using a model formula that models a change in the state of a controlled object portion of an internal combustion engine according to a change in the operating amount of an actuator of the internal combustion engine is provided. .. In this computer program, a function of generating a time-series signal of a predicted value of the state of the controlled object portion by applying a time-series signal of the operation amount of the actuator created in advance to the model formula, and the actuator. A function of storing the time-series signal of the manipulated variable and the generated time-series signal of the state of the controlled target portion as initial conditions, the initial conditions, and the states estimated using the model formula. With respect to the objective function using, the function of obtaining the optimum manipulated variable, which is the optimum manipulated variable of the actuator under the initial condition, by repeating the evaluation while changing the input manipulated variable, and the initial condition. It also has a function of associating and storing the obtained optimum operation amount, and a function of obtaining an approximate expression expressing the relationship between the initial condition and the optimum operation amount. According to this computer program, in the control of the internal combustion engine using the model prediction control, it is possible to reduce the processing load and the processing time, and to improve the efficiency of initial condition generation.

（１０）本発明の一形態によれば、アクチュエータと、制御対象部とを備える内燃機関の制御装置が提供される。この内燃機関の制御装置では、前記アクチュエータの操作量の時系列信号と、前記制御対象部の状態の時系列信号と、を少なくとも含む初期条件について、前記初期条件と各時刻における前記初期条件に対して求められた前記アクチュエータの最適な操作量である最適操作量との関係を表す近似式を記憶する記憶部と、実際の前記アクチュエータの操作量と、前記制御対象部の状態とを取得する情報取得部と、取得された前記操作量及び前記状態と、前記記憶部内の前記近似式とを用いて、実際の前記操作量と前記状態とに対応した前記最適操作量を求め、前記最適操作量に従って前記アクチュエータを動作させる制御部と、を備える。この構成によれば、制御部は、情報取得部によって取得された操作量及び状態（各実際値または推定値）と、記憶部内の近似式とを用いて、各実際値に応じた内燃機関のアクチュエータの最適操作量を素早く求めることができ、処理負荷の低減と処理時間の短縮とを図ることができる。 (10) According to one embodiment of the present invention, there is provided a control device for an internal combustion engine including an actuator and a controlled object portion. In the control device of the internal combustion engine, with respect to the initial condition including at least the time-series signal of the operation amount of the actuator and the time-series signal of the state of the controlled target portion, with respect to the initial condition and the initial condition at each time. Information for acquiring a storage unit that stores an approximate expression representing a relationship with the optimum operation amount, which is the optimum operation amount of the actuator, an actual operation amount of the actuator, and a state of the control target unit. Using the acquisition unit, the acquired operation amount and the state, and the approximate expression in the storage unit, the optimum operation amount corresponding to the actual operation amount and the state is obtained, and the optimum operation amount is obtained. A control unit for operating the actuator according to the above is provided. According to this configuration, the control unit uses the manipulated variable and the state (each actual value or estimated value) acquired by the information acquisition unit and the approximate expression in the storage unit to control the internal combustion engine according to each actual value. The optimum operating amount of the actuator can be quickly obtained, and the processing load can be reduced and the processing time can be shortened.

なお、本発明は、種々の態様で実現することが可能であり、例えば、モデル予測制御を利用した内燃機関の制御のための近似式を求める情報処理装置、情報処理方法、情報処理システム、コンピュータプログラム、モデル予測制御を利用した内燃機関の制御装置、制御方法、制御システム、コンピュータプログラム、内燃機関の制御装置の作成装置、作成方法、作成システム、コンピュータプログラム、これら各コンピュータプログラムを配布するためのサーバ装置、そのコンピュータプログラムを記憶した一時的でない記憶媒体等の形態で実現することができる。 The present invention can be realized in various aspects. For example, an information processing device, an information processing method, an information processing system, and a computer for obtaining an approximate expression for controlling an internal combustion engine using model prediction control. To distribute programs, internal combustion engine control devices using model predictive control, control methods, control systems, computer programs, internal combustion engine control device creation devices, creation methods, creation systems, computer programs, and each of these computer programs. It can be realized in the form of a server device, a non-temporary storage medium that stores the computer program, and the like.

本発明の一実施形態としての情報処理装置のブロック図である。It is a block diagram of the information processing apparatus as one Embodiment of this invention. 予測学習処理について説明する図である。It is a figure explaining the predictive learning process. 予測学習処理における処理の手順を示すフローチャートである。It is a flowchart which shows the process procedure in the predictive learning process. 予測学習処理における処理の手順を示すフローチャートである。It is a flowchart which shows the process procedure in the predictive learning process. 予測学習処理の各ステップについて説明する図である。It is a figure explaining each step of the predictive learning process. 本発明の一実施形態としての内燃機関の制御装置のブロック図である。It is a block diagram of the control device of the internal combustion engine as one Embodiment of this invention. 内燃機関制御における処理の手順を示すフローチャートである。It is a flowchart which shows the process procedure in the internal combustion engine control. 内燃機関制御による動作の一例を示す。An example of operation by internal combustion engine control is shown. 内燃機関制御に要した演算時間の一例を示す。An example of the calculation time required for internal combustion engine control is shown.

＜情報処理装置＞
図１は、本発明の一実施形態としての情報処理装置１のブロック図である。情報処理装置１は、モデル予測制御（ＭＰＣ：Model Predictive Control）を利用して、内燃機関の制御に使用するための近似式を求める装置である。本実施形態では、内燃機関の制御に使用する要素として、以下のａ１〜ａ５の５つを例示する。要素ａ１〜ａ５はモデル予測制御の初期条件として使用されるため、要素ａ１〜ａ５を総称して「初期条件」とも呼ぶ。本実施形態の情報処理装置１は、要素ａ１〜ａ５からなる初期条件と、その初期条件に対応する内燃機関のアクチュエータの最適な操作量（最適操作量）と、の関係を表す近似式を求める装置である。情報処理装置１によって求められた近似式は、内燃機関の制御装置に搭載されて内燃機関の制御に使用される。詳細は後述する。
（ａ１）内燃機関のアクチュエータの操作量ｕ
（ａ２）内燃機関に対する外乱ｗ
（ａ３）内燃機関の制御対象部の状態ｘ
（ａ４）内燃機関の制御対象部の出力ｙ
（ａ５）内燃機関の制御対象部の出力の目標値ｒ <Information processing device>
FIG. 1 is a block diagram of an information processing device 1 as an embodiment of the present invention. The information processing device 1 is a device that uses Model Predictive Control (MPC) to obtain an approximate expression for use in controlling an internal combustion engine. In this embodiment, the following five elements a1 to a5 are exemplified as elements used for controlling an internal combustion engine. Since the elements a1 to a5 are used as initial conditions for model prediction control, the elements a1 to a5 are also collectively referred to as "initial conditions". The information processing apparatus 1 of the present embodiment obtains an approximate expression expressing the relationship between the initial condition composed of the elements a1 to a5 and the optimum operation amount (optimum operation amount) of the actuator of the internal combustion engine corresponding to the initial condition. It is a device. The approximate expression obtained by the information processing device 1 is mounted on the control device of the internal combustion engine and used for controlling the internal combustion engine. Details will be described later.
(A1) Operation amount of actuator of internal combustion engine u
(A2) Disturbance w with respect to the internal combustion engine
(A3) State x of the controlled object portion of the internal combustion engine
(A4) Output y of the controlled object portion of the internal combustion engine
(A5) Target value r of the output of the controlled object portion of the internal combustion engine

（ａ１）操作量ｕは、内燃機関において操作することが可能な１つまたは複数のアクチュエータの動作状況を表す物理量である。例えば、スロットル開度、排気再循環（ＥＧＲ：Exhaust Gas Recirculation）システムにおけるＥＧＲバルブ開度等が操作量ｕに相当する。（ａ２）外乱ｗは、内燃機関の出力に影響を及ぼす１つまたは複数の物理量であり、操作量ｕは除く。例えば、エンジン回転数、外気温度、外気圧力等が外乱ｗに相当する。（ａ３）状態ｘは、内燃機関に含まれる１つまたは複数の制御対象部の状態を表す物理量である。例えば、ＥＧＲシステムにおける排気温度や排気流量等が状態ｘに相当する。 (A1) The operating quantity u is a physical quantity representing the operating status of one or a plurality of actuators that can be operated in the internal combustion engine. For example, the throttle opening degree, the EGR valve opening degree in the exhaust gas recirculation (EGR) system, and the like correspond to the operation amount u. (A2) The disturbance w is one or more physical quantities that affect the output of the internal combustion engine, and the manipulated quantity u is excluded. For example, the engine speed, the outside air temperature, the outside air pressure, and the like correspond to the disturbance w. (A3) The state x is a physical quantity representing the state of one or more controlled target units included in the internal combustion engine. For example, the exhaust temperature, the exhaust flow rate, and the like in the EGR system correspond to the state x.

（ａ４）出力ｙは、内燃機関に含まれる１つまたは複数の制御対象部の出力を表す物理量である。例えば、ＥＧＲシステムにおけるＥＧＲ率、過給機における過給圧等が出力ｙに相当する。（ａ５）目標値ｒは、内燃機関に含まれる１つまたは複数の制御対象部の出力（すなわち要素ａ４）の目標値である。なお、上述した５つの各要素ａ１〜ａ５は、それぞれ、複数の項目を含み得る。例えば、外乱ｗとしてエンジン回転数と外気温度と外気圧力との３項目を含んでもよい。また、上述した５つの各要素ａ１〜ａ５において挙げた項目はあくまで例示であり、種々の項目を採用できる。 (A4) The output y is a physical quantity representing the output of one or more controlled target units included in the internal combustion engine. For example, the EGR rate in the EGR system, the supercharging pressure in the turbocharger, and the like correspond to the output y. (A5) The target value r is a target value of the output (that is, element a4) of one or more controlled target units included in the internal combustion engine. It should be noted that each of the above-mentioned five elements a1 to a5 may include a plurality of items. For example, the disturbance w may include three items of engine speed, outside air temperature, and outside air pressure. Further, the items listed in the above-mentioned five elements a1 to a5 are merely examples, and various items can be adopted.

情報処理装置１は、記憶部１００と、情報処理部２００と、図示しないＲＯＭ、ＲＡＭ及び通信部を備え、各部は図示しないバスにより相互に接続されている。記憶部１００は、ハードディスク、フラッシュメモリ、メモリカードなどで構成される。記憶部１００には、モデル式記憶部１１０と、初期条件記憶部１２０と、最適操作量記憶部１３０と、近似式記憶部１４０とが含まれている。 The information processing device 1 includes a storage unit 100, an information processing unit 200, a ROM, a RAM, and a communication unit (not shown), and each unit is connected to each other by a bus (not shown). The storage unit 100 is composed of a hard disk, a flash memory, a memory card, and the like. The storage unit 100 includes a model type storage unit 110, an initial condition storage unit 120, an optimum manipulated variable storage unit 130, and an approximate type storage unit 140.

モデル式記憶部１１０には、内燃機関のアクチュエータの操作量ｕ、及び、内燃機関に対する外乱ｗの変化に応じた、内燃機関の制御対象部の状態ｘ、及び、出力ｙの変化をモデル化したモデル式が予め記憶されている。モデル式は、予めの実験により求められ、操作量ｕ、外乱ｗ、状態ｘ、出力ｙの現在及び過去の情報が含まれる。どの程度過去の情報が含まれるかはモデル式中の時間サンプル数に依存する。モデル式としては、例えば、線形状態方程式、非線形状態方程式、ＮＡＲＸ（Nonlinear Auto-Regressive eXogenous）モデルを用いて構成された非線形方程式を使用できる。 The model storage unit 110 models changes in the state x of the controlled object unit of the internal combustion engine and the output y in response to changes in the operating amount u of the actuator of the internal combustion engine and the disturbance w with respect to the internal combustion engine. The model formula is stored in advance. The model formula is obtained by an experiment in advance, and includes current and past information of the manipulated variable u, the disturbance w, the state x, and the output y. How much past information is included depends on the number of time samples in the model formula. As the model equation, for example, a linear state equation, a non-linear state equation, and a non-linear equation constructed by using a NARX (Nonlinear Auto-Regressive eXogenous) model can be used.

線形状態方程式を使用した連続時間システムのモデル式は、操作量ｕ、外乱ｗ、状態ｘ、出力ｙの各変数に対して、例えば次のように表せる。なお、ｔは時間、Ａ，Ｂ，Ｃ，Ｄ，Ｅ，Ｆは適当な大きさの定数行列である。
・状態方程式：ｄｘ／ｄｔ＝Ａｘ＋Ｂｕ＋Ｅｗ・・・（１）
・出力方程式：ｙ＝Ｃｘ＋Ｄｕ＋Ｆｗ・・・（２） The model equation of the continuous time system using the linear equation of state can be expressed as follows for each variable of the manipulated variable u, the disturbance w, the state x, and the output y. Note that t is time, and A, B, C, D, E, and F are constant matrices of appropriate size.
-Equation of state: dx / dt = Ax + Bu + Ew ... (1)
・ Output equation: y = Cx + Du + Fw ・・・ (2)

線形状態方程式を使用した離散時間システムのモデル式は、上述の各変数と、離散時間ｋとを用いて、例えば次のように表せる。
・状態方程式：ｘ［ｋ＋１］＝Ａｘ［ｋ］＋Ｂｕ［ｋ］＋Ｅｗ［ｋ］・・・（３）
・出力方程式：ｙ［ｋ］＝Ｃｘ［ｋ］＋Ｄｕ［ｋ］＋Ｆｗ［ｋ］・・・（４） The model equation of the discrete-time system using the linear equation of state can be expressed as follows, for example, by using each of the above variables and the discrete-time k.
-Equation of state: x [k + 1] = Ax [k] + Bu [k] + Ew [k] ... (3)
-Output equation: y [k] = Cx [k] + Du [k] + Fw [k] ... (4)

非線形状態方程式を使用した連続時間システムのモデル式は、上述の各変数を用いて、例えば次のように表せる。なお、ｆ，ｇはそれぞれｘ，ｙと同次元の出力を与える非線形ベクトル関数である。このため、非線形状態方程式では、線形状態方程式を包含していると言える。
・状態方程式：ｄｘ／ｄｔ＝ｆ（ｘ，ｕ，ｗ）・・・（５）
・出力方程式：ｙ＝ｇ（ｘ，ｕ，ｗ）・・・（６） The model formula of the continuous time system using the nonlinear state equation can be expressed as follows, for example, using each of the above variables. Note that f and g are nonlinear vector functions that give outputs of the same dimensions as x and y, respectively. Therefore, it can be said that the nonlinear equation of state includes the linear equation of state.
-Equation of state: dx / dt = f (x, u, w) ... (5)
・ Output equation: y = g (x, u, w) ・・・ (6)

非線形状態方程式を使用した離散時間システムのモデル式は、上述の各変数と、離散時間ｋとを用いて、例えば次のように表せる。
・状態方程式：ｘ［ｋ＋１］＝ｆ（ｘ［ｋ］，ｕ［ｋ］，ｗ［ｋ］）・・・（７）
・出力方程式：ｙ［ｋ］＝ｇ（ｘ［ｋ］，ｕ［ｋ］，ｗ［ｋ］）・・・（８） The model equation of the discrete-time system using the non-linear state equation can be expressed as follows, for example, by using each of the above variables and the discrete-time k.
-Equation of state: x [k + 1] = f (x [k], u [k], w [k]) ... (7)
-Output equation: y [k] = g (x [k], u [k], w [k]) ... (8)

ＮＡＲＸモデルを用いて構成された非線形方程式のモデル式は、上述の各変数と、離散時間ｋとを用いて、例えば次のように表せる。なお、ｇはｙと同次元の出力を与える非線形ベクトル関数であり、明示的に過去の時間を複数サンプルする点が上述の式８と異なる。
・ｙ［ｋ＋１］＝ｇ（ｙ［ｋ］，ｙ［ｋ−１］，・・・，ｙ［ｋ＋１−ｎ_y］，ｕ［ｋ］，ｕ［ｋ−１］，・・・，ｕ［ｋ＋１−ｎ_u］）・・・（９） The model equation of the nonlinear equation constructed by using the NARX model can be expressed as follows, for example, by using each of the above variables and the discrete time k. Note that g is a nonlinear vector function that gives an output of the same dimension as y, and is different from the above equation 8 in that a plurality of past times are explicitly sampled.
Y [k + 1] = g (y [k], y [k-1], ..., y [k + 1- _ny ], u [k], u [k-1], ..., u [ k + 1-n _u ]) ・・・ (9)

このようにすれば、モデル式として、線形状態方程式、非線形状態方程式、及びＮＡＲＸモデルを利用できる。 In this way, linear state equations, non-linear state equations, and NARX models can be used as model equations.

図２は、予測学習処理について説明する図である。図２（Ａ）は、初期条件記憶部１２０に記憶されている初期条件の一例を示す。図２（Ｂ）は、予測学習処理の各反復サイクルにおける、各要素の変化の一例を示す。図１の初期条件記憶部１２０には、操作量記憶部１２１と、外乱記憶部１２２と、状態記憶部１２３と、出力記憶部１２４と、目標値記憶部１２５とが含まれている。 FIG. 2 is a diagram for explaining the predictive learning process. FIG. 2A shows an example of the initial conditions stored in the initial condition storage unit 120. FIG. 2B shows an example of changes in each element in each iterative cycle of the predictive learning process. The initial condition storage unit 120 of FIG. 1 includes an operation amount storage unit 121, a disturbance storage unit 122, a state storage unit 123, an output storage unit 124, and a target value storage unit 125.

操作量記憶部１２１には、後述する予測学習処理のステップＳ１１０によって、内燃機関のアクチュエータの操作量ｕの複数時刻分の物理量の変化（換言すれば、継時的な物理量の変化）を表す時系列信号が記憶される。図２（Ａ）では、操作量ｕ１及び操作量ｕ２として、操作量ｕに属する２つの項目の時系列信号を例示している。同様に、外乱記憶部１２２には、後述する予測学習処理のステップＳ１１０によって、内燃機関に対する外乱ｗの複数時刻分の物理量の変化を表す時系列信号が記憶される。図２（Ａ）では、外乱ｗ１として、外乱ｗに属する１つの項目の時系列信号を例示している。 When the operation quantity storage unit 121 represents a change in the physical quantity (in other words, a change in the physical quantity over time) of the operation amount u of the actuator of the internal combustion engine for a plurality of times by the step S110 of the prediction learning process described later. The series signal is stored. In FIG. 2A, time-series signals of two items belonging to the manipulated variable u are illustrated as the manipulated variable u1 and the manipulated variable u2. Similarly, the disturbance storage unit 122 stores a time-series signal representing a change in the physical quantity of the disturbance w for a plurality of times with respect to the internal combustion engine by step S110 of the prediction learning process described later. In FIG. 2A, a time-series signal of one item belonging to the disturbance w is illustrated as the disturbance w1.

状態記憶部１２３には、後述する予測学習処理のステップＳ１３０によって、内燃機関の制御対象部の状態ｘの複数時刻分の物理量の変化を表す時系列信号が記憶される。図２（Ａ）では、状態ｘ１として、状態ｘに属する１つの項目の時系列信号を例示している。同様に、出力記憶部１２４には、後述する予測学習処理のステップＳ１３０によって、内燃機関の制御対象部の出力ｙの複数時刻分の物理量の変化を表す時系列信号が記憶される。図２（Ａ）では、出力ｙ１として、出力ｙに属する１つの項目の時系列信号を例示している。 The state storage unit 123 stores a time-series signal representing a change in the physical quantity of the state x of the controlled target unit of the internal combustion engine for a plurality of times by step S130 of the prediction learning process described later. In FIG. 2A, a time-series signal of one item belonging to the state x is illustrated as the state x1. Similarly, the output storage unit 124 stores a time-series signal representing a change in the physical quantity of the output y of the control target unit of the internal combustion engine for a plurality of times by step S130 of the prediction learning process described later. In FIG. 2A, a time-series signal of one item belonging to the output y is illustrated as the output y1.

目標値記憶部１２５には、後述する予測学習処理のステップＳ１１０によって、内燃機関の制御対象部の出力の目標値ｒの複数時刻分の物理量の変化を表す時系列信号が記憶される。なお、図２（Ａ）では、目標値ｒの時系列信号の例示は省略している。 In the target value storage unit 125, a time-series signal representing a change in the physical quantity of the target value r of the output of the control target unit of the internal combustion engine for a plurality of times is stored in step S110 of the prediction learning process described later. In FIG. 2A, the example of the time series signal of the target value r is omitted.

最適操作量記憶部１３０には、後述する予測学習処理のステップＳ２００によって、初期条件記憶部１２０内の各時刻における初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）に対して、当該初期条件における内燃機関のアクチュエータの最適な操作量（最適操作量）が対応付けて記憶される。近似式記憶部１４０には、後述する予測学習処理のステップＳ３００によって、最適操作量記憶部１３０内の初期条件と最適操作量とから得られた近似式が記憶される。 The optimum manipulated variable storage unit 130 is set to the initial conditions (manipulated quantity u, disturbance w, state x, output y, target value r) at each time in the initial condition storage unit 120 by step S200 of the predictive learning process described later. On the other hand, the optimum operating amount (optimal operating amount) of the actuator of the internal combustion engine under the initial conditions is stored in association with each other. The approximate expression storage unit 140 stores the approximate expression obtained from the initial conditions and the optimum operation amount in the optimum operation amount storage unit 130 by step S300 of the prediction learning process described later.

情報処理部２００は、ＲＯＭに格納されているコンピュータプログラムをＲＡＭに展開して実行することにより、情報処理装置１の各部を制御する。そのほか情報処理部２００は、実験計画処理部２１０、初期条件生成部２２０、予測処理部２３０、学習処理部２４０として機能し、協働して後述する予測学習処理を実行する。実験計画処理部２１０は、実験計画法を用いて、初期条件のうち、操作量ｕ、外乱ｗ、目標値ｒの時系列信号を生成し、初期条件記憶部１２０に記憶させる。初期条件生成部２２０は、初期条件のうち、状態ｘ、出力ｙの時系列信号を生成し、初期条件記憶部１２０に記憶させる。 The information processing unit 200 controls each unit of the information processing device 1 by expanding and executing a computer program stored in the ROM in the RAM. In addition, the information processing unit 200 functions as an experiment planning processing unit 210, an initial condition generation unit 220, a prediction processing unit 230, and a learning processing unit 240, and cooperates to execute the prediction learning process described later. The experiment planning processing unit 210 uses the design of experiments method to generate time-series signals of the manipulated variable u, the disturbance w, and the target value r among the initial conditions, and stores them in the initial condition storage unit 120. The initial condition generation unit 220 generates a time-series signal of the state x and the output y among the initial conditions, and stores it in the initial condition storage unit 120.

予測処理部２３０は、初期条件記憶部１２０内の初期条件を用いてモデル予測制御によって、各時刻における初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）に対応する最適操作量を求め、最適操作量記憶部１３０に記憶させる。学習処理部２４０は、最適操作量記憶部１３０内の初期条件と最適操作量とを教師データとしたニューラルネットワーク（ＮＮ：Neural Network）の教師あり学習によって近似式を求め、近似式記憶部１４０に記憶させる。 The prediction processing unit 230 optimally corresponds to the initial conditions (operation amount u, disturbance w, state x, output y, target value r) at each time by model prediction control using the initial conditions in the initial condition storage unit 120. The operation amount is obtained and stored in the optimum operation amount storage unit 130. The learning processing unit 240 obtains an approximate expression by supervised learning of a neural network (NN: Neural Network) using the initial condition and the optimum operation amount in the optimum manipulated variable storage unit 130 as supervised learning, and the approximate formula storage unit 140 obtains an approximate expression. Remember.

図３及び図４は、予測学習処理における処理の手順を示すフローチャートである。予測学習処理は、初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）を生成すると共に、生成された初期条件と内燃機関のアクチュエータの最適な操作量（最適操作量）との関係を表す近似式を求める処理である。予測学習処理は、情報処理装置１において任意のタイミングで実行される。 3 and 4 are flowcharts showing the processing procedure in the predictive learning process. The predictive learning process generates initial conditions (operation amount u, disturbance w, state x, output y, target value r), and at the same time, the generated initial conditions and the optimum operation amount (optimum operation amount) of the actuator of the internal combustion engine. This is a process for finding an approximate expression that expresses the relationship with. The predictive learning process is executed at an arbitrary timing in the information processing device 1.

図５は、予測学習処理の各ステップについて説明する図である。図５では、初期条件記憶部１２０に時系列信号として記憶されている初期条件の各要素について模式的に表している。縦軸には初期条件の各要素の名称を表し、横軸には時系列信号における各時刻を表している。通常、表の中には該当時刻における該当要素の物理量が表示されるが、図５では説明の便宜上、物理量の表示を省略して、説明のための文言を記載している。さらに、図３〜図５では説明の便宜上、初期条件の各要素が含み得る複数の項目について区別しない。例えば、操作量ｕが２つの項目、操作量ｕ１及び操作量ｕ２を含む場合、操作量ｕ１及び操作量ｕ２に対する処理は、以降説明する「操作量ｕ」に対する処理と同じ処理を適用すればよい。 FIG. 5 is a diagram illustrating each step of the predictive learning process. In FIG. 5, each element of the initial condition stored as a time-series signal in the initial condition storage unit 120 is schematically shown. The vertical axis represents the name of each element of the initial condition, and the horizontal axis represents each time in the time series signal. Normally, the physical quantity of the corresponding element at the corresponding time is displayed in the table, but in FIG. 5, for convenience of explanation, the display of the physical quantity is omitted and the wording for explanation is described. Further, in FIGS. 3 to 5, for convenience of explanation, a plurality of items that can be included in each element of the initial condition are not distinguished. For example, when the operation amount u includes two items, the operation amount u1 and the operation amount u2, the processing for the operation amount u1 and the operation amount u2 may be the same as the processing for the “operation amount u” described below. ..

ステップＳ１００では、多様な初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）の生成を実行する。具体的には、ステップＳ１１０において実験計画処理部２１０は、実験計画法を用いて、操作量ｕ、外乱ｗ、目標値ｒの時系列信号を生成し、操作量記憶部１２１、外乱記憶部１２２、目標値記憶部１２５にそれぞれ記憶させる。実験計画処理部２１０は、実験計画法として例えば、以下の方法ｂ１、ｂ２のいずれかを用いることができる。 In step S100, various initial conditions (operation amount u, disturbance w, state x, output y, target value r) are generated. Specifically, in step S110, the experimental design processing unit 210 uses the design of experiments method to generate time-series signals of the manipulated variable u, the disturbance w, and the target value r, and the manipulated variable storage unit 121 and the disturbance storage unit 122. , Each is stored in the target value storage unit 125. The experiment planning processing unit 210 can use any of the following methods b1 and b2 as the experiment planning method, for example.

（ｂ１）ステップ関数やランプ関数の組み合わせで表現される信号を生成する第１の方法：第１の方法としては、例えば、ＡＰＲＢＳ（Amplitude modulated Pseudo Random Binary Sequences）法を利用できる。ＡＰＲＢＳ法では、信号のレベルを連続的に扱い、多様な組合せを効率的に生成するラテン超方格計画やＤ最適計画などを用いて、疑似ランダム的に連続信号の組み合わせを生成する。ＡＰＲＢＳ法では、信号の１区間の長さや、区間の移り変わり時の信号変化速度なども計画の対象に含めることができるため、信号の値の組合せに加えて、信号変化速度の組合せについても多様性を確保できる。すなわち、ＡＰＲＢＳ法では、過渡変化を含めた実験計画が可能である。 (B1) First method for generating a signal represented by a combination of a step function and a ramp function: As the first method, for example, an APRBS (Amplitude modulated Pseudo Random Binary Sequences) method can be used. In the APRBS method, signal levels are continuously handled, and continuous signal combinations are generated in a pseudo-random manner using a Latin super-square design or a D-optimal design that efficiently generates various combinations. In the APRBS method, the length of one section of the signal and the signal change speed at the time of section change can be included in the planning target, so in addition to the combination of signal values, the combination of signal change speeds is also diverse. Can be secured. That is, in the APRBS method, it is possible to plan an experiment including transient changes.

（ｂ２）周波数が時間に依存して変化するチャープ信号で表現される信号を生成する第２の方法：第２の方法としては、例えば、ＳｉｎｕｓｏｉｄａｌＥｘｃｉｔａｔｉｏｎ法を利用できる。ＳｉｎｕｓｏｉｄａｌＥｘｃｉｔａｔｉｏｎ法では、時間に依存して周波数が変化する正弦波信号を用いて連続信号の組み合わせを生成する。ＡＰＲＢＳ法により生成された信号よりも、信号レベルの時間変化率が多様である。 (B2) Second method of generating a signal represented by a chirp signal whose frequency changes with time: As a second method, for example, a Sinemodal Excitation method can be used. In the Sinusoidal Excitation method, a combination of continuous signals is generated using a sinusoidal signal whose frequency changes with time. The rate of change in signal level over time is more diverse than the signal generated by the APRBS method.

ステップＳ１２０において、初期条件生成部２２０の予測計算部２２１は、モデル式記憶部１１０に記憶されているモデル式と、操作量記憶部１２１に記憶されている操作量ｕの時系列信号と、外乱記憶部１２２に記憶されている外乱ｗの時系列信号と、をそれぞれ読み出す。ステップＳ１３０において、初期条件生成部２２０の予測計算部２２１は、読み出した操作量ｕと外乱ｗの時系列信号をモデル式に適用（印加）することで、状態ｘと出力ｙとの予測値の時系列信号を生成し、状態記憶部１２３と出力記憶部１２４とにそれぞれ記憶させる。 In step S120, the prediction calculation unit 221 of the initial condition generation unit 220 includes the model formula stored in the model formula storage unit 110, the time series signal of the operation amount u stored in the operation amount storage unit 121, and the disturbance. The time-series signal of the disturbance w stored in the storage unit 122 is read out. In step S130, the prediction calculation unit 221 of the initial condition generation unit 220 applies (applies) the time-series signals of the read manipulated variable u and the disturbance w to the model formula to obtain the predicted values of the state x and the output y. A time-series signal is generated and stored in the state storage unit 123 and the output storage unit 124, respectively.

このように、予測学習処理のステップＳ１００では、モデル式を用いることで、物理的に発生する可能性の少ない初期条件の生成を回避すると共に、多様な初期条件を効率よく生成することができる。上述の通り、初期条件は、操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒの５つの要素から構成されるが、それら５要素の組み合わせを生成する単純な方法は、５要素のすべてに上下限を設定し、上下限の範囲内で５要素を網羅的に組み合わせることである。しかし、網羅的な組み合わせにより生成された初期条件には、物理的に発生する可能性が極めて低い初期条件が含まれる。これは、状態ｘや出力ｙの挙動は、操作量ｕや外乱ｗに依存する物理的な因果関係に支配されていることに起因する。この点、予測学習処理のステップＳ１００では、この因果関係を再現したモデル式を、ステップＳ２００以降のモデル予測制御だけでなく、初期条件の生成の段階から使用することによって、組み合わせとして物理的に無理のない状態ｘ及び出力ｙの時系列信号を生成できる。これにより、実際に起こり得る初期条件のみを効率的に生成することが可能となり、例えば内燃機関の制御のように、制御パラメータの最適解を導出するために、多くの要素（例えば、数十個）が関連する初期条件を持つ複雑なシステムにおいても、組み合わせ数の爆発的増加を招くことなく、初期条件を生成することが可能となる。 As described above, in step S100 of the predictive learning process, by using the model formula, it is possible to avoid the generation of the initial conditions that are unlikely to occur physically and to efficiently generate various initial conditions. As described above, the initial condition is composed of five elements of manipulated variable u, disturbance w, state x, output y, and target value r, but a simple method of generating a combination of these five elements is five elements. The upper and lower limits are set for all, and the five elements are comprehensively combined within the range of the upper and lower limits. However, the initial conditions generated by the exhaustive combination include initial conditions that are extremely unlikely to occur physically. This is because the behavior of the state x and the output y is governed by the physical causal relationship depending on the manipulated variable u and the disturbance w. In this regard, in step S100 of the prediction learning process, it is physically impossible as a combination by using the model formula that reproduces this causal relationship not only from the model prediction control after step S200 but also from the stage of generating the initial condition. It is possible to generate a time-series signal with no state x and output y. This makes it possible to efficiently generate only the initial conditions that can actually occur, and many elements (for example, dozens of elements) are used to derive the optimum solution of the control parameters, for example, in the control of an internal combustion engine. Even in a complex system with an initial condition related to), it is possible to generate an initial condition without causing an explosive increase in the number of combinations.

ステップＳ２００では、ステップＳ１００で生成された多様な初期条件に対応する最適操作量を求める。具体的には、ステップＳ２１０において、予測処理部２３０の予測計算部２３１は、モデル式記憶部１１０に記憶されているモデル式を読み出す。 In step S200, the optimum operation amount corresponding to various initial conditions generated in step S100 is obtained. Specifically, in step S210, the prediction calculation unit 231 of the prediction processing unit 230 reads out the model formula stored in the model formula storage unit 110.

ステップＳ２２０において、予測処理部２３０の予測計算部２３１は、初期条件記憶部１２０に記憶されている初期条件の時系列信号中の１時刻を起点とし、モデル予測制御による将来予測のために必要な時刻分の情報を読み出す。例えば、図５（Ａ）において、時刻ｔ０を起点とした場合、予測計算部２３１は、予測のために必要となる現在及び過去の時刻分の情報、具体的には、現在時刻ｔ０の初期条件（外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）と、過去時刻ｔ−１の初期条件（操作量ｕ）とを読み出す。ここで、操作量ｕのみ過去時刻ｔ−１の初期条件を読み出すのは、現在時刻ｔ０の操作量ｕが、後のステップにおける予測対象となるためである。なお、予測計算部２３１は、外乱ｗ等についても、現在の初期条件に加えて、過去の数時刻分における初期条件の読み出しを行ってもよい。どれだけ過去に遡って情報の読み出しを行うかは、予測計算部２３１が使用するモデル式に依存する。 In step S220, the prediction calculation unit 231 of the prediction processing unit 230 is required for future prediction by model prediction control, starting from one time in the time series signal of the initial condition stored in the initial condition storage unit 120. Read the time information. For example, in FIG. 5A, when the time t0 is the starting point, the prediction calculation unit 231 provides information on the current and past times required for prediction, specifically, the initial condition of the current time t0. (Disturbance w, state x, output y, target value r) and the initial condition (operation amount u) of the past time t-1 are read out. Here, the reason why the initial condition of the past time t-1 is read only for the manipulated variable u is that the manipulated variable u at the current time t0 is the prediction target in the later step. The prediction calculation unit 231 may also read out the initial conditions in the past several hours in addition to the current initial conditions for the disturbance w and the like. How far back the information is read out depends on the model formula used by the prediction calculation unit 231.

ステップＳ２３０では、ステップＳ２２０で読み出した初期条件に対応する最適操作量を求める。具体的には、ステップＳ２３１において、予測処理部２３０の予測計算部２３１は、現在時刻から所定の将来時刻までの有限区間内における操作量ｕ、外乱ｗ、目標値ｒの時系列を決定する。この有限区間は、モデル予測制御における「予測ホライズン」に相当する。例えば、図５（Ｂ）に示すように予測計算部２３１は、現在時刻ｔ０から所定の将来時刻ｔ５までの有限区間内における操作量ｕ、外乱ｗ、目標値ｒを決定する。図５（Ｂ）の例では、予測計算部２３１は、操作量ｕには予め定められたデフォルト値（図５：Ｄ）を設定し、外乱ｗ及び目標値ｒの将来時刻ｔ１〜ｔ５には、ステップＳ２２０で読み出した現在時刻ｔ０の物理量（図５：現在）を設定している。なお、操作量ｕのデフォルト値の決定に際して、予測計算部２３１は、ステップＳ２２０で読み出された過去時刻ｔ−１の操作量ｕを考慮してもよい。 In step S230, the optimum operation amount corresponding to the initial condition read in step S220 is obtained. Specifically, in step S231, the prediction calculation unit 231 of the prediction processing unit 230 determines a time series of the manipulated variable u, the disturbance w, and the target value r within a finite section from the current time to a predetermined future time. This finite interval corresponds to the "predictive horizon" in model predictive control. For example, as shown in FIG. 5B, the prediction calculation unit 231 determines the manipulated variable u, the disturbance w, and the target value r in a finite interval from the current time t0 to the predetermined future time t5. In the example of FIG. 5B, the prediction calculation unit 231 sets a predetermined default value (FIG. 5: D) for the manipulated variable u, and sets the future time t1 to t5 of the disturbance w and the target value r. , The physical quantity of the current time t0 read in step S220 (FIG. 5: present) is set. In determining the default value of the manipulated variable u, the prediction calculation unit 231 may consider the manipulated variable u of the past time t-1 read in step S220.

ステップＳ２３２において、予測処理部２３０の予測計算部２３１は、モデル式を用いて、将来の有限区間内での状態ｘ、出力ｙを予測する。具体的には、予測計算部２３１は、現在時刻の条件をステップＳ２２０で読み出した初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）とし、ステップＳ２１０で読み出したモデル式に対して、ステップＳ２３１で決定された有限区間内の操作量ｕ、外乱ｗの時系列信号を適用（印加）することで、有限区間内の状態ｘと出力ｙとの予測値の時系列信号を生成する。例えば、図５（Ｃ）に示すように予測計算部２３１は、所定の将来時刻ｔ１からｔ５までの有限区間内における状態ｘ、出力ｙを予測する（図５：予測）。 In step S232, the prediction calculation unit 231 of the prediction processing unit 230 predicts the state x and the output y in the future finite section by using the model formula. Specifically, the prediction calculation unit 231 sets the condition of the current time as the initial condition (operation amount u, disturbance w, state x, output y, target value r) read out in step S220, and sets the model formula read out in step S210. On the other hand, by applying (applying) the time-series signals of the manipulated variable u and the disturbance w in the finite section determined in step S231, the time-series signals of the predicted values of the state x and the output y in the finite section are applied. To generate. For example, as shown in FIG. 5C, the prediction calculation unit 231 predicts the state x and the output y within a finite interval from a predetermined future time t1 to t5 (FIG. 5: prediction).

ステップＳ２３３において、予測処理部２３０の評価部２３２は、ステップＳ２３１で決定された有限区間内の操作量ｕ、外乱ｗ、目標値ｒの時系列信号と、ステップＳ２３２で予測された有限区間内の状態ｘ、出力ｙの時系列信号とを目的関数に入力して、目的関数を評価する。この目的関数は、制御性能を定量的に評価するための所定の式であり、記憶部１００内に記憶されている。目的関数としては、例えば、有限区間内における出力（出力ｙ）と目標（目標値ｒ）の差の二乗和を利用できる。 In step S233, the evaluation unit 232 of the prediction processing unit 230 includes the time-series signals of the manipulated variable u, the disturbance w, and the target value r in the finite section determined in step S231, and the evaluation unit 232 in the finite section predicted in step S232. The time series signal of the state x and the output y is input to the objective function, and the objective function is evaluated. This objective function is a predetermined formula for quantitatively evaluating the control performance, and is stored in the storage unit 100. As the objective function, for example, the sum of squares of the difference between the output (output y) and the target (target value r) within a finite interval can be used.

ステップＳ２３４において、予測処理部２３０の反復処理部２３３は、目的関数の値が収束したか否かを判定する。収束した場合（ステップＳ２３４：ＹＥＳ）、反復処理部２３３は、処理をステップＳ２３６へ遷移させる。 In step S234, the iterative processing unit 233 of the prediction processing unit 230 determines whether or not the values of the objective functions have converged. When it converges (step S234: YES), the iterative processing unit 233 shifts the processing to step S236.

一方、収束していない場合（ステップＳ２３４：ＮＯ）、反復処理部２３３は、目的関数値をより良くするように有限区間内における操作量ｕの時系列信号を修正し、処理をステップＳ２３２へ遷移させ、予測と評価を繰り返す。例えば、図５（Ｄ）に示すように反復処理部２３３は、現在時刻ｔ０から所定の将来時刻ｔ５までの有限区間内における操作量ｕを修正し、その後、予測計算部２３１は、図５（Ｅ）に示すように、修正した操作量ｕに基づく状態ｘ、出力ｙを予測し、評価部２３２は、これらを用いた目的関数を評価する。例えば、図２（Ｂ）に示すように、有限区間（予測ホライズンＨＰ）内の操作量ｕ１及び操作量ｕ２が反復サイクルＣ１、Ｃ２、Ｃ３と修正されていくにつれて、対応する有限区間（予測ホライズンＨＰ）内の状態ｘ１及び出力ｙ１についても、サイクルＣ１、Ｃ２、Ｃ３に示すように変化していく。目的関数の値が収束した３回目のサイクルＣ３では、出力ｙ１の時系列信号は、目標値ｒの時系列信号にほぼ一致していることがわかる。なお、反復処理部２３３は、勾配法、シューティング法、Ｃ／ＧＭＲＥＳ法といった既知の手法を利用してもよい。 On the other hand, when it has not converged (step S234: NO), the iterative processing unit 233 modifies the time-series signal of the manipulated variable u in the finite interval so as to improve the objective function value, and shifts the processing to step S232. Let them repeat the prediction and evaluation. For example, as shown in FIG. 5 (D), the iterative processing unit 233 corrects the manipulated variable u in the finite interval from the current time t0 to the predetermined future time t5, and then the prediction calculation unit 231 corrects the manipulated variable u in FIG. 5 (D). As shown in E), the state x and the output y based on the modified manipulated variable u are predicted, and the evaluation unit 232 evaluates the objective function using these. For example, as shown in FIG. 2B, as the manipulated variable u1 and manipulated variable u2 in the finite interval (predicted horizon HP) are modified to repeat cycles C1, C2, C3, the corresponding finite interval (predicted horizon). The state x1 and the output y1 in HP) also change as shown in cycles C1, C2, and C3. In the third cycle C3 in which the values of the objective functions have converged, it can be seen that the time-series signal of the output y1 substantially matches the time-series signal of the target value r. The iterative processing unit 233 may use known methods such as a gradient method, a shooting method, and a C / GMRES method.

ステップＳ２３６において、予測処理部２３０の反復処理部２３３は、有限区間内における操作量ｕの時系列信号から、起点とした１時刻分の操作量ｕを「最適操作量」として取り出す。そして、反復処理部２３３は、この最適操作量と、ステップＳ２２０で読み出した初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）とを対応付けて、最適操作量記憶部１３０に記憶させる。例えば、図５（Ｆ）に示すように、反復処理部２３３は、現在時刻ｔ０の操作量ｕを最適操作量とし、ステップＳ２２０で読み出した初期条件（過去時刻ｔ−１の操作量ｕ、現在時刻ｔ０の外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）とを対応付けて、最適操作量記憶部１３０に記憶させる。 In step S236, the iterative processing unit 233 of the prediction processing unit 230 extracts the operation amount u for one hour as the starting point as the “optimal operation amount” from the time series signal of the operation amount u in the finite interval. Then, the iterative processing unit 233 associates this optimum operation amount with the initial conditions (operation amount u, disturbance w, state x, output y, target value r) read in step S220, and the optimum operation amount storage unit. Store in 130. For example, as shown in FIG. 5 (F), the iterative processing unit 233 sets the operation amount u at the current time t0 as the optimum operation amount, and sets the initial condition read in step S220 (the operation amount u at the past time t-1, the present). The disturbance w at time t0, the state x, the output y, and the target value r) are associated with each other and stored in the optimum manipulated variable storage unit 130.

ステップＳ２４０において、予測処理部２３０の反復処理部２３３は、初期条件記憶部１２０に記憶された初期条件の時系列について、全ての時刻分の処理を終了したか否かを判定する。全ての時刻分の処理を終了していない場合（ステップＳ２４０：ＮＯ）、ステップＳ２５０において反復処理部２３３は、起点とする時刻を１時刻進め、ステップＳ２２０以降の処理を繰り返す。全ての時刻分の処理を終了した場合（ステップＳ２４０：ＹＥＳ）、反復処理部２３３は、処理をステップＳ３００へ遷移させる。 In step S240, the iterative processing unit 233 of the prediction processing unit 230 determines whether or not the processing for all the times has been completed for the time series of the initial conditions stored in the initial condition storage unit 120. When the processing for all the times is not completed (step S240: NO), the iterative processing unit 233 advances the time starting from the starting point by one hour in step S250, and repeats the processing after step S220. When the processing for all the times is completed (step S240: YES), the iterative processing unit 233 shifts the processing to step S300.

このように、予測学習処理のステップＳ２００では、ステップＳ１００で生成された多様な初期条件に対してモデル予測制御を実行し、各時刻の初期条件に対応する最適操作量を求める。上述の通り、予測処理部２３０は、初期条件を読み出す起点となる時刻を１時刻分ずつ移動させつつ、各時刻の初期条件に対して最適操作量を求める、という処理を時系列信号の全時刻分に対して行う。このため、予測処理部２３０は、多様な初期条件に対する最適操作量を求めて、最適操作量記憶部１３０に記憶させておくことができる。 As described above, in step S200 of the prediction learning process, model prediction control is executed for various initial conditions generated in step S100, and the optimum operation amount corresponding to the initial conditions at each time is obtained. As described above, the prediction processing unit 230 performs the process of obtaining the optimum operation amount for the initial condition of each time while moving the time that is the starting point for reading the initial condition by one hour at a time for all the time of the time series signal. Do for minutes. Therefore, the prediction processing unit 230 can obtain the optimum operation amount for various initial conditions and store it in the optimum operation amount storage unit 130.

ステップＳ３００では、最適操作量記憶部１３０に記憶されている初期条件と最適操作量との関係を表す近似式を機械学習によって求める。具体的には、ステップＳ３１０において、学習処理部２４０のＮＮ計算部２４１は、最適操作量記憶部１３０に記憶されている初期条件と最適操作量とのセットをすべて読みだす。ステップＳ３２０において、学習処理部２４０のＮＮ計算部２４１は、ニューラルネットワーク（ＮＮ）中のパラメータを初期化する。 In step S300, an approximate expression expressing the relationship between the initial condition stored in the optimum manipulated variable storage unit 130 and the optimum manipulated variable is obtained by machine learning. Specifically, in step S310, the NN calculation unit 241 of the learning processing unit 240 reads out all the sets of the initial conditions and the optimum operation amount stored in the optimum operation amount storage unit 130. In step S320, the NN calculation unit 241 of the learning processing unit 240 initializes the parameters in the neural network (NN).

ステップＳ３３０において、学習処理部２４０のＮＮ計算部２４１は、初期条件と最適操作量とを教師データとしたＮＮの教師あり学習によって近似式を求める。具体的には、ＮＮ計算部２４１は、ステップＳ３１０で読み出した初期条件をＮＮに与え（印加し）て、ＮＮの出力を求める。ステップＳ３４０において、学習処理部２４０の評価部２４２は、ＮＮの出力と、ステップＳ３１０で読み出した初期条件及び最適操作量とを目的関数に入力して、目的関数を評価することで、ＮＮによる最適操作量の近似精度（誤差）を評価する。この目的関数としては、例えば、ＮＮの出力と、読み出した最適操作量の差の二乗和を利用できる。 In step S330, the NN calculation unit 241 of the learning processing unit 240 obtains an approximate expression by supervised learning of NN using the initial condition and the optimum operation amount as teacher data. Specifically, the NN calculation unit 241 gives (applies) the initial conditions read in step S310 to the NN, and obtains the output of the NN. In step S340, the evaluation unit 242 of the learning processing unit 240 inputs the output of the NN, the initial condition and the optimum operation amount read in step S310 into the objective function, and evaluates the objective function to optimize the NN. Evaluate the approximation accuracy (error) of the manipulated variable. As this objective function, for example, the sum of squares of the difference between the output of NN and the read optimum manipulated variable can be used.

ステップＳ３５０において、学習処理部２４０の反復処理部２４３は、目的関数の値が収束したか否かを判定する。収束した場合（ステップＳ３５０：ＹＥＳ）、反復処理部２４３は、処理をステップＳ３７０へ遷移させる。一方、収束していない場合（ステップＳ３５０：ＮＯ）、反復処理部２４３は、目的関数値をより良くするようにＮＮのパラメータを修正し、処理をステップＳ３３０へ遷移させ、ＮＮ出力と評価を繰り返す。なお、反復処理部２４３は、バックプロパゲーションといった既知の手法を利用してもよい。 In step S350, the iterative processing unit 243 of the learning processing unit 240 determines whether or not the values of the objective functions have converged. When it converges (step S350: YES), the iterative processing unit 243 shifts the processing to step S370. On the other hand, when it has not converged (step S350: NO), the iterative processing unit 243 modifies the NN parameter so as to improve the objective function value, shifts the processing to step S330, and repeats the NN output and the evaluation. .. The iterative processing unit 243 may use a known method such as backpropagation.

ステップＳ３７０において、学習処理部２４０の反復処理部２４３は、最新のＮＮと、ＮＮのパラメータとを、学習済みＮＮとして近似式記憶部１４０に記憶させ、処理を終了する。 In step S370, the iterative processing unit 243 of the learning processing unit 240 stores the latest NN and the parameters of the NN in the approximate expression storage unit 140 as learned NNs, and ends the processing.

このように、予測学習処理のステップＳ３００では、最適操作量記憶部１３０に記憶されている初期条件と最適操作量との関係を表す近似式を機械学習によって求める。ステップＳ２００で生成された初期条件と最適操作量との関係は、初期条件が決まれば最適操作量が決まる、という形の非線形関数として表現できる。ステップＳ３００では、この関係から予め近似式を作成しておく。この近似式を利用すれば、実際の内燃機関の制御において、各時刻においてリアルタイムに、最適解を見つけるための反復計算（モデル予測制御）を行うことなく、最適操作量を高速に求めることが可能となる。ここで、初期条件と最適操作量との関係は一般的に極めて非線形性が高く、さらに、初期条件には数多くの要素が関連することから、多入力多出力の高次元な関係となるため、一般的な線形式やｎ次多項式では良好な近似精度が期待できない。この点、予測学習処理のステップＳ３００では、多入力多出力、かつ、強い非線形性を効率的に近似可能な手法として、ＮＮによる機械学習を使用している。ＮＮを用いて初期条件と最適操作量との関係を学習する場合、バックプロパゲーションによりＮＮ中のパラメータが目的関数値に及ぼす影響を計算でき、その情報を元に勾配法に基づく反復計算によって非線形関係を良好に再現する学習が可能となる。このように、十分に精度よく学習されたＮＮは、リアルタイムなモデル予測制御と同等の制御性能（予測性能）を、低演算負荷で実現することができる。 As described above, in step S300 of the predictive learning process, an approximate expression expressing the relationship between the initial condition stored in the optimum manipulated variable storage unit 130 and the optimal manipulated variable is obtained by machine learning. The relationship between the initial condition generated in step S200 and the optimum manipulated variable can be expressed as a non-linear function in which the optimum manipulated variable is determined once the initial condition is determined. In step S300, an approximate expression is created in advance from this relationship. By using this approximate expression, in the actual control of the internal combustion engine, it is possible to obtain the optimum operation amount at high speed in real time at each time without performing iterative calculation (model prediction control) for finding the optimum solution. It becomes. Here, the relationship between the initial condition and the optimum manipulated variable is generally extremely non-linear, and since many elements are related to the initial condition, it is a high-dimensional relationship with multiple inputs and multiple outputs. Good approximation accuracy cannot be expected with general line formats and nth-order polynomials. In this regard, in step S300 of the predictive learning process, machine learning by NN is used as a method capable of efficiently approximating strong non-linearity with multiple inputs and multiple outputs. When learning the relationship between the initial condition and the optimum manipulated value using NN, the effect of the parameters in the NN on the objective function value can be calculated by backpropagation, and based on that information, iterative calculation based on the gradient method is non-linear. Learning that reproduces the relationship well becomes possible. In this way, the NN learned with sufficient accuracy can realize the control performance (prediction performance) equivalent to the real-time model prediction control with a low calculation load.

以上説明した通り、情報処理装置１によれば、予測処理部２３０は、モデル予測制御を利用して予め、各初期条件（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ、目標値ｒ）に応じた内燃機関のアクチュエータの最適操作量を求めて、最適操作量記憶部１３０に記憶させておく（図３：ステップＳ２００）。そして、学習処理部２４０は予め、最適操作量記憶部１３０内の初期条件と最適操作量との関係を表す近似式（学習済みＮＮとパラメータ）を求めておくことができる（図４：ステップＳ３００）。このため、本構成の情報処理装置１によれば、実際の内燃機関の制御では、リアルタイムな反復計算を必要とせず、学習処理部２４０によって求められた近似式（学習済みＮＮとパラメータ）を利用することで、初期条件内の各要素に対応する各実際値（または推定値）に応じた、内燃機関のアクチュエータの最適操作量を素早く求めることができ、処理負荷の低減と処理時間の短縮とを図ることができる。また、初期条件生成部２２０は、初期条件を生成する際に、内燃機関のアクチュエータの操作量ｕ（及び外乱ｗ）の変化に応じた、内燃機関の制御対象部の状態ｘ（及び出力ｙ）の変化をモデル化したモデル式を使用する。このため、初期条件を生成する際に、各要素を網羅的に組み合わせた試行を行う場合と比較して、演算量を減らすことができ、初期条件生成の効率化を図ることができる。 As described above, according to the information processing apparatus 1, the prediction processing unit 230 uses the model prediction control to set each initial condition (operation amount u, disturbance w, state x, output y, target value r) in advance. The optimum operating amount of the actuator of the internal combustion engine corresponding to the response is obtained and stored in the optimum operating amount storage unit 130 (FIG. 3: step S200). Then, the learning processing unit 240 can obtain in advance an approximate expression (learned NN and parameters) expressing the relationship between the initial condition and the optimum operation amount in the optimum operation amount storage unit 130 (FIG. 4: Step S300). ). Therefore, according to the information processing device 1 of the present configuration, the actual control of the internal combustion engine does not require real-time iterative calculation, and uses the approximate expression (learned NN and parameters) obtained by the learning processing unit 240. By doing so, the optimum operating amount of the actuator of the internal combustion engine can be quickly obtained according to each actual value (or estimated value) corresponding to each element in the initial condition, and the processing load can be reduced and the processing time can be shortened. Can be planned. Further, when the initial condition generation unit 220 generates the initial condition, the state x (and output y) of the controlled target unit of the internal combustion engine according to the change in the operation amount u (and the disturbance w) of the actuator of the internal combustion engine. Use a model formula that models the change in. Therefore, when generating the initial condition, the amount of calculation can be reduced and the efficiency of initial condition generation can be improved as compared with the case where the trial in which each element is comprehensively combined is performed.

＜内燃機関の制御装置＞
図６は、本発明の一実施形態としての内燃機関の制御装置３のブロック図である。制御装置３は、情報処理装置１により作成された近似式を利用して、内燃機関を制御する装置である。内燃機関としては、例えば、ディーゼルエンジン、ガソリンエンジン等が挙げられる。制御装置３は、ドライバインターフェース（ＩＦ）３１０と、ＥＣＵ（Electronic Control Unit）３２０と、ハードウェアシステム３３０とを備え、各部は図示しない車載ネットワークにより相互に接続されている。ドライバインターフェース３１０は、内燃機関の運転者による操作信号を取得するためのインタフェースであり、例えば、アクセル、ブレーキ等である。 <Control device for internal combustion engine>
FIG. 6 is a block diagram of a control device 3 for an internal combustion engine as an embodiment of the present invention. The control device 3 is a device that controls an internal combustion engine by using an approximate expression created by the information processing device 1. Examples of the internal combustion engine include a diesel engine and a gasoline engine. The control device 3 includes a driver interface (IF) 310, an ECU (Electronic Control Unit) 320, and a hardware system 330, and each part is connected to each other by an in-vehicle network (not shown). The driver interface 310 is an interface for acquiring an operation signal by the driver of the internal combustion engine, and is, for example, an accelerator, a brake, or the like.

ＥＣＵ３２０は、内燃機関の運転を制御するマイクロコントローラ（マイコン）である。そのほかＥＣＵ３２０は、目標値決定部３２１、制御部３２２、情報取得部３２３として機能し、協働して後述する内燃機関制御を実行する。目標値決定部３２１は、運転者による操作信号に応じた内燃機関の制御対象部の出力の目標値ｒを決定する。 The ECU 320 is a microcontroller (microcomputer) that controls the operation of the internal combustion engine. In addition, the ECU 320 functions as a target value determination unit 321, a control unit 322, and an information acquisition unit 323, and cooperates to execute internal combustion engine control described later. The target value determining unit 321 determines the target value r of the output of the controlled target unit of the internal combustion engine according to the operation signal by the driver.

制御部３２２は、情報処理装置１によって求められた近似式（学習済みＮＮとパラメータ）を利用して、アクチュエータ３３１の最適操作量を求め、アクチュエータ３３１を動作させる。なお、近似式は、例えば、制御装置３を搭載した車両の製造時に予め制御部３２２内の図示しない記憶部に記憶されてもよい。また、制御部３２２は、図示しない通信ネットワークを介して情報処理装置１と通信を行い、定期的に近似式を取得して制御部３２２内の記憶部に記憶させてもよい。 The control unit 322 obtains the optimum operation amount of the actuator 331 by using the approximate expression (learned NN and parameters) obtained by the information processing device 1, and operates the actuator 331. The approximate expression may be stored in advance in a storage unit (not shown) in the control unit 322 at the time of manufacturing the vehicle equipped with the control device 3, for example. Further, the control unit 322 may communicate with the information processing device 1 via a communication network (not shown), periodically acquire an approximate expression, and store it in a storage unit in the control unit 322.

情報取得部３２３は、センサ３３３によって取得された検出値に基づいて、初期条件に対応した各要素（操作量ｕ、外乱ｗ、状態ｘ、出力ｙ）の各実際値を取得する。なお、情報取得部３２３は、センサ３３３によって取得された実際値から、制御対象のモデルに基づくオブザーバやカルマンフィルタなどを用いて、操作量ｕ、外乱ｗ、状態ｘ、出力ｙの各推定値を求めてもよい。内燃機関の制御では、最適操作量を求めるために多くの要素（例えば、数十個）を必要とする。このため、情報取得部３２３は、一部の要素についてセンサ３３３により取得された実際値を用い、他の要素について推定した推定値を使用してもよい。なお、初期条件に対応した目標値ｒは、目標値決定部３２１によって別途設定される。 The information acquisition unit 323 acquires each actual value of each element (operation amount u, disturbance w, state x, output y) corresponding to the initial condition based on the detection value acquired by the sensor 333. The information acquisition unit 323 obtains each estimated value of the manipulated variable u, the disturbance w, the state x, and the output y from the actual value acquired by the sensor 333 by using an observer or a Kalman filter based on the model to be controlled. You may. In the control of an internal combustion engine, many elements (for example, several tens) are required to obtain the optimum manipulated variable. Therefore, the information acquisition unit 323 may use the actual value acquired by the sensor 333 for some elements and the estimated value estimated for the other elements. The target value r corresponding to the initial condition is separately set by the target value determining unit 321.

ハードウェアシステム３３０は、内燃機関に搭載されているハードウェアである。アクチュエータ３３１は、内燃機関において操作することが可能な１つまたは複数のアクチュエータであり、スロットルや、ＥＧＲシステムにおけるＥＧＲバルブ等である。制御対象部３３２は、内燃機関に含まれる１つまたは複数の制御対象部であり、ＥＧＲシステムや、過給機等である。センサ３３３は、アクチュエータ３３１の動作状況（操作量ｕ）、制御対象部３３２の状態ｘ、制御対象部３３２の出力ｙ、内燃機関に対する外乱ｗの各々を検出するためのセンサである。 The hardware system 330 is hardware mounted on an internal combustion engine. The actuator 331 is one or more actuators that can be operated in an internal combustion engine, such as a throttle, an EGR valve in an EGR system, and the like. The control target unit 332 is one or a plurality of control target units included in the internal combustion engine, such as an EGR system and a supercharger. The sensor 333 is a sensor for detecting each of the operating status (operation amount u) of the actuator 331, the state x of the controlled object unit 332, the output y of the controlled object unit 332, and the disturbance w with respect to the internal combustion engine.

図７は、内燃機関制御における処理の手順を示すフローチャートである。内燃機関制御は、情報処理装置１により作成された近似式を利用して求めた最適操作量を用いて、内燃機関（具体的にはアクチュエータ３３１）を制御する処理である。図７に示す内燃機関制御は、例えば制御装置３を搭載した車両の始動時に開始され、所定の制御周期ごとに繰り返し実行される。 FIG. 7 is a flowchart showing a processing procedure in internal combustion engine control. The internal combustion engine control is a process of controlling the internal combustion engine (specifically, the actuator 331) by using the optimum operation amount obtained by using the approximate expression created by the information processing apparatus 1. The internal combustion engine control shown in FIG. 7 is started, for example, when the vehicle equipped with the control device 3 is started, and is repeatedly executed at predetermined control cycles.

ステップＳ４１０において、ドライバインターフェース３１０は、アクセル開度やブレーキ開度から内燃機関の運転者による操作信号を取得し、目標値決定部３２１へと送信する。ステップＳ４２０において、ＥＣＵ３２０の目標値決定部３２１は、取得した操作信号（運転者による操作信号）と、情報取得部３２３から取得した現在の操作量ｕ、外乱ｗ、状態ｘ、出力ｙとを用いて、制御部３２２の出力ｙに対する目標値ｒを決定し、制御部３２２へ送信する。 In step S410, the driver interface 310 acquires an operation signal from the driver of the internal combustion engine from the accelerator opening and the brake opening, and transmits the operation signal to the target value determining unit 321. In step S420, the target value determination unit 321 of the ECU 320 uses the acquired operation signal (operation signal by the driver) and the current operation amount u, disturbance w, state x, and output y acquired from the information acquisition unit 323. The target value r with respect to the output y of the control unit 322 is determined and transmitted to the control unit 322.

ステップＳ４３０において、ＥＣＵ３２０の制御部３２２は、取得した目標値ｒと、情報取得部３２３から取得した現在の操作量ｕ、外乱ｗ、状態ｘ、出力ｙとを近似式（学習済みＮＮとパラメータ）に適用する。これにより制御部３２２は、目標値決定部３２１より指定された目標値ｒを達成するための、内燃機関のアクチュエータ３３１の最適な操作量（最適操作量）を決定できる。ステップＳ４４０において、ＥＣＵ３２０の制御部３２２は、決定した最適操作量でアクチュエータ３３１を動作させる。 In step S430, the control unit 322 of the ECU 320 approximates the acquired target value r and the current manipulated variable u, disturbance w, state x, and output y acquired from the information acquisition unit 323 (learned NN and parameter). Apply to. As a result, the control unit 322 can determine the optimum operation amount (optimum operation amount) of the actuator 331 of the internal combustion engine for achieving the target value r specified by the target value determination unit 321. In step S440, the control unit 322 of the ECU 320 operates the actuator 331 with the determined optimum operation amount.

ステップＳ４５０において、アクチュエータ３３１の動作の結果として、制御対象部３３２の状態ｘ及び出力ｙが変化する。ステップＳ４６０において、センサ３３３は、アクチュエータ３３１の動作状況（操作量ｕ）、制御対象部３３２の状態ｘ、制御対象部３３２の出力ｙ、内燃機関に対する外乱ｗについて、最新の情報を検出し、検出信号を情報取得部３２３へと送信する。ステップＳ４７０において、ＥＣＵ３２０の情報取得部３２３は、取得した最新の情報に基づく操作量ｕ、外乱ｗ、状態ｘ、出力ｙの各実際値または各推定値を求める。 In step S450, the state x and the output y of the control target unit 332 change as a result of the operation of the actuator 331. In step S460, the sensor 333 detects and detects the latest information regarding the operating status (operation amount u) of the actuator 331, the state x of the control target unit 332, the output y of the control target unit 332, and the disturbance w with respect to the internal combustion engine. The signal is transmitted to the information acquisition unit 323. In step S470, the information acquisition unit 323 of the ECU 320 obtains each actual value or each estimated value of the operation amount u, the disturbance w, the state x, and the output y based on the acquired latest information.

図８は、内燃機関制御による動作の一例を示す。図８では、操作量ｕとして操作量ｕ１〜ｕ３の３項目、外乱ｗとして外乱ｗ１〜ｗ４の４項目、状態ｘとして状態ｘ１〜ｘ７の７項目、出力ｙとして出力ｙ１〜ｙ２の２項目、目標値ｒとして目標値ｒ１〜ｒ２の２項目を使用して、内燃機関制御を実行した場合の具体例を示す。図８（Ａ）には、操作量ｕ及び外乱ｗの時系列信号を図示し、図（Ｂ）には、出力ｙ及び目標値ｒの時系列信号を図示している。図示の便宜上、状態ｘの時系列信号は省略している。図８（Ｂ）に示すように、最適操作量を求めるために多くの要素を必要とする多入力多出力の複雑なケースであっても、出力ｙの実際値は、設定された目標値ｒに適切に追従していることがわかる。 FIG. 8 shows an example of operation by internal combustion engine control. In FIG. 8, the operation amount u is 3 items of operation amounts u1 to u3, the disturbance w is 4 items of disturbance w1 to w4, the state x is 7 items of states x1 to x7, and the output y is 2 items of outputs y1 to y2. A specific example is shown in the case where the internal combustion engine control is executed by using the two items of the target values r1 to r2 as the target value r. FIG. 8A illustrates the time-series signals of the manipulated variable u and the disturbance w, and FIG. 8B illustrates the time-series signals of the output y and the target value r. For convenience of illustration, the time series signal of the state x is omitted. As shown in FIG. 8B, the actual value of the output y is the set target value r even in a complicated case of multiple inputs and multiple outputs that requires many elements to obtain the optimum manipulated variable. It can be seen that it follows properly.

図９は、内燃機関制御に要した演算時間の一例を示す。図９の横軸には時間を、縦軸には制御装置３のＥＣＵ３２０による演算に要した時間（ｍｓ）を示している。図９に示す通り、ＥＣＵ３２０の１回あたりの演算時間は０．０３ｍｓ〜０．０６ｍｓであり、最大でも０．２ｍｓ以内であることがわかる。例えば、モデル予測制御の高速解法として知られているＣ／ＧＭＲＥＳ法を用いて、リアルタイムに内燃機関の制御を行った場合の従来例では、演算時間は約６０ｍｓであった（仲田勇人ほか，“ディーゼルエンジン吸排気システムへのＣ／ＧＭＲＥＳモデル予測制御の応用"）。この従来例と比較すると、本実施形態の内燃機関制御では、演算速度を約１０００倍以上も高速化できる。 FIG. 9 shows an example of the calculation time required for controlling the internal combustion engine. The horizontal axis of FIG. 9 shows the time, and the vertical axis shows the time (ms) required for the calculation by the ECU 320 of the control device 3. As shown in FIG. 9, it can be seen that the calculation time per operation of the ECU 320 is 0.03 ms to 0.06 ms, and is within 0.2 ms at the maximum. For example, in the conventional example in which the internal combustion engine is controlled in real time by using the C / GMRES method known as a high-speed solution of model predictive control, the calculation time is about 60 ms (Hayato Nakata et al., “ Application of C / GMRES model predictive control to diesel engine intake / exhaust systems "). Compared with this conventional example, in the internal combustion engine control of the present embodiment, the calculation speed can be increased by about 1000 times or more.

以上説明した通り、内燃機関の制御装置３によれば、ＥＣＵ３２０の制御部３２２は、情報取得部３２３によって取得された操作量ｕ、外乱ｗ、状態ｘ、出力ｙの各実際値または推定値と、制御部３２２の記憶部内の近似式とを用いて、各実際値または推定値に応じた内燃機関のアクチュエータ３３１の最適操作量を素早く求めることができ、処理負荷の低減と処理時間の短縮とを図ることができる。 As described above, according to the control device 3 of the internal combustion engine, the control unit 322 of the ECU 320 includes the actual values or estimated values of the operation amount u, the disturbance w, the state x, and the output y acquired by the information acquisition unit 323. , The optimum operation amount of the actuator 331 of the internal combustion engine according to each actual value or estimated value can be quickly obtained by using the approximate expression in the storage unit of the control unit 322, and the processing load can be reduced and the processing time can be shortened. Can be planned.

＜本実施形態の変形例＞
本発明は上記の実施形態に限られるものではなく、その要旨を逸脱しない範囲において種々の態様において実施することが可能であり、例えば次のような変形も可能である。 <Modified example of this embodiment>
The present invention is not limited to the above-described embodiment, and can be implemented in various aspects without departing from the gist thereof. For example, the following modifications are also possible.

［変形例１］
上記実施形態では、情報処理装置の構成の一例を示した。しかし、情報処理装置の構成は種々の変形が可能である。例えば、情報処理装置は、ネットワーク上に配置された複数の情報処理装置が協働することによって構成されてもよい。この場合、例えば、実験計画処理部、初期条件生成部、予測処理部、学習処理部のうちの少なくとも一部が異なる情報処理装置によって実現されてよい。 [Modification 1]
In the above embodiment, an example of the configuration of the information processing device is shown. However, the configuration of the information processing device can be modified in various ways. For example, the information processing device may be configured by the cooperation of a plurality of information processing devices arranged on the network. In this case, for example, at least a part of the experiment planning processing unit, the initial condition generation unit, the prediction processing unit, and the learning processing unit may be realized by different information processing devices.

［変形例２］
上記実施形態では、モデル予測制御において初期条件として考慮すべき要素の一例を挙げた。しかし、モデル予測制御において、初期条件として考慮する要素ａ１〜ａ５のうちの少なくとも一部は、省略してもよく、さらなる他の要素を考慮してもよい。具体的には、要素ａ２の外乱ｗ、要素ａ４の出力ｙ、要素ａ５の目標値ｒのうちの少なくとも一部は省略してよい。例えば外乱ｗを省略する場合、モデル式における外乱ｗのパラメータは省略できる。また、予測学習処理（図３）のステップＳ１００における外乱ｗの初期条件生成、ステップＳ２００における初期条件としての外乱ｗの考慮は省略できる。また、内燃機関制御（図７）のステップＳ４２０、Ｓ４３０、Ｓ４７０における外乱ｗの考慮も省略してよい。出力ｙ及び目標値ｒについても同様に、省略する場合は、モデル式と、予測学習処理と、内燃機関制御の各々について、省略された要素に対する考慮は省略してよい。 [Modification 2]
In the above embodiment, an example of an element to be considered as an initial condition in model prediction control is given. However, in the model prediction control, at least a part of the elements a1 to a5 to be considered as the initial condition may be omitted, or further other elements may be considered. Specifically, at least a part of the disturbance w of the element a2, the output y of the element a4, and the target value r of the element a5 may be omitted. For example, when the disturbance w is omitted, the parameter of the disturbance w in the model formula can be omitted. Further, the generation of the initial condition of the disturbance w in step S100 of the prediction learning process (FIG. 3) and the consideration of the disturbance w as the initial condition in step S200 can be omitted. Further, consideration of the disturbance w in steps S420, S430, and S470 of the internal combustion engine control (FIG. 7) may be omitted. Similarly, when the output y and the target value r are omitted, consideration for the omitted elements may be omitted for each of the model formula, the predictive learning process, and the internal combustion engine control.

［変形例３］
上記実施形態では、予測学習処理の一例を示した（図３、図４）。しかし、予測学習処理は種々の変形が可能である。例えば、ステップＳ１００において、実験計画法を用いずに操作量ｕなどの時系列信号を生成してもよい。例えば、ステップＳ３００において、サポートベクターマシン（ＳＶＭ：Support Vector Machine）等のＮＮ以外の手段を用いることで近似式を求めてもよい。例えば、ステップＳ１００、Ｓ２００、Ｓ３００は一連の処理として実行されず、個別に実行されてよい。 [Modification 3]
In the above embodiment, an example of the predictive learning process is shown (FIGS. 3 and 4). However, the predictive learning process can be modified in various ways. For example, in step S100, a time-series signal such as the manipulated variable u may be generated without using the design of experiments. For example, in step S300, an approximate expression may be obtained by using a means other than NN such as a support vector machine (SVM). For example, steps S100, S200, and S300 are not executed as a series of processes, but may be executed individually.

例えば、予測学習処理では、上述した一部のステップを省略してもよく、さらなる他のステップを追加で実行してもよい。具体的には、例えば、ステップＳ１００において、初期条件のうちの少なくとも一部の時系列信号を第１の方法（実験計画法）で生成し、残りの時系列信号を第２の方法で生成してもよい。例えば、モデル式記憶部に複数のモデル式を予め記憶させておき、ステップＳ１３０やステップＳ２３２において、初期条件のうちの少なくとも一部の特性や実際値に応じて、適用するモデル式を変更してもよい。例えば、ステップＳ３００の終了後に、生成した近似式を内燃機関の制御装置へと配信してもよい。 For example, in the predictive learning process, some of the above-mentioned steps may be omitted, or other steps may be additionally executed. Specifically, for example, in step S100, at least a part of the initial conditions is generated by the first method (design of experiments), and the remaining time series signals are generated by the second method. You may. For example, a plurality of model formulas are stored in advance in the model formula storage unit, and in steps S130 and S232, the model formulas to be applied are changed according to at least some characteristics and actual values of the initial conditions. May be good. For example, after the end of step S300, the generated approximate expression may be delivered to the control device of the internal combustion engine.

［変形例４］
上記実施形態では、内燃機関の制御装置の構成の一例を示した。しかし、内燃機関の制御装置の構成は種々の変形が可能である。例えば、制御装置は、モデル予測制御を利用して、内燃機関の制御に使用するための近似式を求める上述した情報処理装置の機能をさらに備えていてもよい。 [Modification example 4]
In the above embodiment, an example of the configuration of the control device of the internal combustion engine is shown. However, the configuration of the control device of the internal combustion engine can be modified in various ways. For example, the control device may further include the function of the above-mentioned information processing device for obtaining an approximate expression for use in controlling an internal combustion engine by utilizing model prediction control.

［変形例５］
上記実施形態では、内燃機関制御の一例を示した（図７）。しかし、予測学習処理は種々の変形が可能である。例えば、上述した一部のステップを省略してもよく、さらなる他のステップを追加で実行してもよい。具体的には、例えば、ステップＳ４７０の終了後に、取得した操作量ｕ、外乱ｗ、状態ｘ、出力ｙの実際値を情報処理装置へと送信して、近似式の精度向上に役立ててもよい。 [Modification 5]
In the above embodiment, an example of internal combustion engine control is shown (FIG. 7). However, the predictive learning process can be modified in various ways. For example, some of the steps described above may be omitted, or additional steps may be performed. Specifically, for example, after the end of step S470, the acquired actual values of the manipulated variable u, the disturbance w, the state x, and the output y may be transmitted to the information processing apparatus to help improve the accuracy of the approximate expression. ..

以上、実施形態、変形例に基づき本態様について説明してきたが、上記した態様の実施の形態は、本態様の理解を容易にするためのものであり、本態様を限定するものではない。本態様は、その趣旨並びに特許請求の範囲を逸脱することなく、変更、改良され得ると共に、本態様にはその等価物が含まれる。また、その技術的特徴が本明細書中に必須なものとして説明されていなければ、適宜、削除することができる。 Although the present embodiment has been described above based on the embodiments and modifications, the embodiments of the above-described embodiments are for facilitating the understanding of the present embodiment, and do not limit the present embodiment. This aspect may be modified or improved without departing from its spirit and claims, and this aspect includes its equivalents. In addition, if the technical feature is not described as essential in the present specification, it may be deleted as appropriate.

１…情報処理装置
３…内燃機関の制御装置
１００…記憶部
１１０…モデル式記憶部
１２０…初期条件記憶部
１２１…操作量記憶部
１２２…外乱記憶部
１２３…状態記憶部
１２４…出力記憶部
１２５…目標値記憶部
１３０…最適操作量記憶部
１４０…近似式記憶部
２００…情報処理部
２１０…実験計画処理部
２２０…初期条件生成部
２２１…予測計算部
２３０…予測処理部
２３１…予測計算部
２３２…評価部
２３３…反復処理部
２４０…学習処理部
２４１…ＮＮ計算部
２４２…評価部
２４３…反復処理部
３１０…ドライバインターフェース
３２１…目標値決定部
３２２…制御部
３２３…情報取得部
３３０…ハードウェアシステム
３３１…アクチュエータ
３３２…制御対象部
３３３…センサ 1 ... Information processing device 3 ... Internal engine control device 100 ... Storage unit 110 ... Model storage unit 120 ... Initial condition storage unit 121 ... Operation amount storage unit 122 ... Disturbance storage unit 123 ... State storage unit 124 ... Output storage unit 125 ... Target value storage unit 130 ... Optimal operation amount storage unit 140 ... Approximate type storage unit 200 ... Information processing unit 210 ... Experiment plan processing unit 220 ... Initial condition generation unit 221 ... Prediction calculation unit 230 ... Prediction processing unit 231 ... Prediction calculation unit 232 ... Evaluation unit 233 ... Iterative processing unit 240 ... Learning processing unit 241 ... NN calculation unit 242 ... Evaluation unit 243 ... Iterative processing unit 310 ... Driver interface 321 ... Target value determination unit 322 ... Control unit 323 ... Information acquisition unit 330 ... Hard Wear system 331 ... Actuator 332 ... Control target part 333 ... Sensor

Claims

It is an information processing device
According to the change of the actuator manipulated variable, the model formula storage unit for previously storing a model formula that models the change in the state of the controlled portions,
An initial condition storage unit that stores an initial condition including at least a time-series signal of the operation amount of the actuator and a time-series signal of the state of the control target unit.
An optimum operation amount storage unit that stores the optimum operation amount, which is the optimum operation amount of the actuator in the initial condition, in association with the initial condition at each time in the initial condition storage unit.
By applying the time-series signal of the operation amount of the actuator created in advance to the model formula in the model formula storage unit, a time-series signal of the predicted value of the state of the control target unit is generated, and the initial condition. The initial condition generation unit to be stored in the initial condition storage unit and
Repeating the evaluation of the objective function using the initial condition in the initial condition storage unit and the state estimated using the model formula in the model formula storage unit while changing the input operation amount. A prediction processing unit that obtains the optimum operation amount and stores it in the optimum operation amount storage unit,
A learning processing unit that obtains an approximate expression representing the relationship between the initial condition and the optimum operation amount in the optimum operation amount storage unit, and
Information processing device.

The information processing device according to claim 1.
The learning processing unit is an information processing device that obtains the approximate expression by supervised learning of a neural network using the initial conditions and the optimum operation amount as teacher data.

The information processing apparatus according to claim 1 or 2, further comprising.
An information processing device including an experiment planning processing unit that generates the time-series signal of the operation amount of the actuator by using the design of experiments method.

The information processing apparatus according to any one of claims 1 to 3.
In addition to the initial conditions,
Disturbance, which is a physical quantity that affects the output of the controlled object,
The output of the control target unit and
The target value of the output of the control target unit and
Includes at least some of
When the disturbance is included in the initial conditions, the model formula models the change in the state according to the manipulated variable and the change in the disturbance.
An information processing apparatus in which, when the output is included in the initial conditions, the state and the change in the output are modeled in the model formula according to the change in the manipulated variable.

The information processing apparatus according to any one of claims 1 to 4.
An information processing device that uses a linear equation of state and a non-linear equation of state as the model equation.

The information processing apparatus according to any one of claims 1 to 5.
An information processing device that uses a nonlinear equation constructed using a NARX model as the model formula.

The information processing apparatus according to any one of claims 4 to 6, which is dependent on claim 3 or claim 3.
As the design of experiments, a first method of generating a signal represented by a combination of a step function and a ramp function, and a second method of generating a signal represented by a chirp signal whose frequency changes with time. An information processing device that uses either and.

According to the change of the actuator manipulated variable, an information processing method of the change of state of the control object unit utilizing modeled model equation,
A step of generating a time-series signal of a predicted value of the state of the controlled object portion by applying a time-series signal of the operation amount of the actuator created in advance to the model formula.
A step of storing the time-series signal of the operation amount of the actuator and the generated time-series signal of the state of the controlled object portion as initial conditions.
By repeating the evaluation of the objective function using the initial condition and the state estimated by using the model formula while changing the input manipulated variable, the optimum operation of the actuator under the initial condition is performed. The process of finding the optimum amount of operation, which is the amount,
A step of associating the obtained optimum operation amount with the initial condition and storing it.
A step of obtaining an approximate expression expressing the relationship between the initial conditions and the optimum manipulated variable, and
Information processing method.

According to the change of the actuator manipulated variable, a computer program using a model model expression changes in the state of the control object unit,
By applying a time-series signal of the operation amount of the actuator created in advance to the model formula, a function of generating a time-series signal of a predicted value of the state of the control target unit is provided.
A function of storing the time-series signal of the operation amount of the actuator and the generated time-series signal of the state of the controlled object portion as initial conditions.
By repeating the evaluation of the objective function using the initial condition and the state estimated by using the model formula while changing the input manipulated variable, the optimum operation of the actuator under the initial condition is performed. The function to find the optimum operation amount, which is the amount, and
A function of associating the obtained optimum operation amount with the initial condition and storing it.
A function to obtain an approximate expression expressing the relationship between the initial condition and the optimum manipulated variable, and
A computer program that includes.

An actuator, a control apparatus for engine Ru and a controlled portion,
It was generated by applying the time-series signal of the operation amount of the actuator and the time-series signal of the operation amount of the actuator created in advance to the model formula modeling the change of the state of the control target unit. Optimal operation of the actuator obtained by using the model formula for the initial condition and the initial condition at each time with respect to the initial condition including at least the time-series signal of the predicted value of the state of the controlled object portion. A storage unit that stores an approximate expression that expresses the relationship with the optimum manipulated variable, which is a quantity,
An information acquisition unit that acquires the actual operation amount of the actuator and the state of the control target unit,
Using the acquired operation amount and state and the approximate expression in the storage unit, the optimum operation amount corresponding to the actual operation amount and the state is obtained, and the actuator is set according to the optimum operation amount. The control unit to operate and
Comprising a control device.