JP5603827B2

JP5603827B2 - Generation of regression equation for control factor identification

Info

Publication number: JP5603827B2
Application number: JP2011108687A
Authority: JP
Inventors: 和博伊賀; 庸介稲葉; 英樹窪谷; 啓史上島
Original assignee: Toyota Technical Development Corp; Toyota Motor Corp
Current assignee: Toyota Technical Development Corp; Toyota Motor Corp
Priority date: 2010-05-18
Filing date: 2011-05-13
Publication date: 2014-10-08
Anticipated expiration: 2031-05-13
Also published as: JP2012003744A

Description

本発明は、例えば車両等の各種装置の制御において、制御結果を所望の範囲に適合させる制御因子を特定するための回帰式の生成方法に関する。 The present invention relates to a method for generating a regression equation for specifying a control factor that adapts a control result to a desired range in the control of various devices such as a vehicle.

車両をはじめとする各種装置の制御においては、様々な状況に応じて制御因子が適宜調整され、制御結果（評価指標）が所定の許容可能な範囲に収められる。かかる制御を行うに当たっては、関連する制御因子と制御結果（評価指標）との対応関係を表す式を予め求めておくのが一般的である。具体的には、制御因子と制御結果（評価指標）との組み合わせからなる複数のサンプルデータから、これらの制御因子と制御結果（評価指標）との対応関係を表す回帰式を事前に導き出しておき、各種装置の制御において許容可能な範囲にある制御結果（評価指標）を得るのに必要な制御因子を、上記回帰式を使用して算出する。このようにして、各種装置の特性に関する様々な要求が満たされる。 In the control of various devices including a vehicle, the control factor is appropriately adjusted according to various situations, and the control result (evaluation index) falls within a predetermined allowable range. In performing such control, it is common to obtain in advance an expression representing a correspondence relationship between a related control factor and a control result (evaluation index). Specifically, a regression equation representing the correspondence between these control factors and control results (evaluation indicators) is derived in advance from a plurality of sample data consisting of combinations of control factors and control results (evaluation indicators). Then, a control factor necessary for obtaining a control result (evaluation index) within an allowable range in the control of various devices is calculated using the regression equation. In this way, various requirements regarding the characteristics of various devices are met.

因みに、学習データ（制御因子と制御結果（評価指標）との組み合わせからなる複数のサンプルデータ）を用いて予測モデル（回帰式）を作成し、作成した予測モデルを用いて予測値（制御結果）を算出する装置や方法は、例えば、特開２００５−１３５２８７号公報（特許文献３）や特開２００６−３３８１２３号公報（特許文献４）において開示されている。 Incidentally, a prediction model (regression equation) is created using learning data (a plurality of sample data consisting of combinations of control factors and control results (evaluation indicators)), and a prediction value (control result) is created using the created prediction model. An apparatus and a method for calculating the value are disclosed in, for example, Japanese Unexamined Patent Application Publication No. 2005-135287 (Patent Document 3) and Japanese Unexamined Patent Application Publication No. 2006-338123 (Patent Document 4).

一方、サポートベクタマシン（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ：以下「ＳＶＭ」とも称する）又はサポートベクタ回帰法（ＳｕｐｐｏｒｔＶｅｃｔｏｒＲｅｇｒｅｓｓｉｏｎ：以下「ＳＶＲ」とも称する）と呼ばれる手法が、パターン認識や回帰推定の分野で注目されている。これらの手法は、上述のようにサンプルデータから回帰式導き出す手法としても検討されており、このようなＳＶＭ又はＳＶＲを利用して車両の制御目標値を設定する技術が、例えば、特開２０１０−７８０８４号公報（特許文献１）や特開２００８−４５４７５号公報（特許文献２）に開示されている。 On the other hand, a technique called a support vector machine (Support Vector Machine: hereinafter referred to as “SVM”) or a support vector regression method (Support Vector Regression: hereinafter also referred to as “SVR”) has attracted attention in the field of pattern recognition and regression estimation. Yes. These methods are also studied as a method for deriving a regression equation from sample data as described above. A technique for setting a control target value of a vehicle using such SVM or SVR is disclosed in, for example, Japanese Patent Application Laid-Open No. 2010-2010. No. 78084 (Patent Document 1) and Japanese Patent Application Laid-Open No. 2008-45475 (Patent Document 2).

ＳＶＲは、予測性能の良い回帰式を生成できる長所を有する。しかしながら、定式化によっては、回帰式の算出時間を短縮可能な解法（例えば、ＳＭＯ法等のアルゴリズム）を用いずに二次計画問題を解く必要性が生ずる場合もある。このような場合、（ＳＶＲに限らず）一般的に、二次計画問題の求解時間はサンプルデータ数の３乗のオーダーで、二次計画問題を計算機で解く場合に必要とされる記憶領域の容量はサンプルデータ数の２乗のオーダーで、それぞれ増大することが知られている。従って、サンプルデータ数が多い場合、回帰式を生成するのに計算時間が非常に長くなるという短所があった。 SVR has the advantage of being able to generate regression equations with good prediction performance. However, depending on the formulation, it may be necessary to solve the quadratic programming problem without using a solution (for example, an algorithm such as the SMO method) that can reduce the calculation time of the regression equation. In such a case, the solution time of the quadratic programming problem is not limited to the SVR, and is generally the order of the cube of the number of sample data, and the storage area required when the quadratic programming problem is solved by the computer. It is known that the capacity increases in the order of the square of the number of sample data. Accordingly, when the number of sample data is large, the calculation time for generating the regression equation is very long.

尚、ＳＶＲにおける回帰式の生成を高速化する手段の１つとして、「逐次的学習法（ｃｈｕｎｋｉｎｇｍｅｔｈｏｄ）」が知られている。「逐次的学習法」としては種々の方法が知られているが、主要なものとして、サンプルデータを複数のグループに分け、第１のグループについてＳＶＲを行い、その際にサポートベクタとなったサンプルデータのみを残し、これらの残ったサンプルデータを次のグループのサンプルデータと併せたサンプルデータ群について更にＳＶＲを実施するというサイクルを繰り返す手法が挙げられる。 Note that “sequential learning method” is known as one of the means for speeding up the generation of regression equations in SVR. Various methods are known as “sequential learning methods”. However, as a main method, sample data is divided into a plurality of groups, SVR is performed on the first group, and a sample that becomes a support vector at that time There is a method of repeating a cycle in which only data is left, and SVR is further performed on a sample data group in which these remaining sample data are combined with sample data of the next group.

上記「逐次的学習法」は、ＳＶＲにおける回帰式の生成を高速化することができる長所を有する。しかしながら、前のグループのサンプルデータについてのＳＶＲでサポートベクタとなったサンプルデータを次サイクルで実施されるＳＶＲにおいてもサンプルデータとして残すという手法を採ることから、異常値混入の影響を受け易いという短所を有する。また、サポートベクタとなるサンプルデータの数が増大する場合には、結果として１回のＳＶＲにおいて対象となるサンプルデータ数が多くなり、回帰式生成の高速化という利点が薄れてしまう。 The “sequential learning method” has an advantage that the generation of a regression equation in SVR can be speeded up. However, the method of leaving the sample data that became the support vector in the SVR for the sample data of the previous group as the sample data also in the SVR executed in the next cycle is disadvantageous in that it is easily affected by the abnormal value mixing. Have Further, when the number of sample data serving as support vectors increases, the number of sample data to be processed in one SVR increases as a result, and the advantage of speeding up the generation of regression equations is diminished.

特開２０１０−０７８０８４号公報JP 2010-078084 A 特開２００８−０４５４７５号公報JP 2008-045475 A 特開２００５−１３５２８７号公報JP 2005-135287 A 特開２００６−３３８１２３号公報JP 2006-338123 A

本発明は、前述のような問題を解消すべく創作されたものである。即ち、本発明は、サンプルデータ数が膨大である場合でも回帰式を生成するのに必要とされる計算時間が著しく増大することが無く、かつ、サンプルデータにおける異常値混入の影響を受け難い、回帰式の生成方法を提供することを目的とする。より具体的には、本発明は、例えば車両等の各種装置の制御において使用される回帰式の生成に当たって、膨大なサンプルデータを用いる場合であっても、従来の回帰方法と比較して、回帰式生成に要する時間を短縮でき、かつ、異常値の混入があっても精度を維持することができる、制御因子特定用回帰式の生成方法を提供することを目的とする。 The present invention was created to solve the above-described problems. That is, the present invention does not significantly increase the calculation time required to generate the regression equation even when the number of sample data is enormous, and is not easily affected by outliers in the sample data, An object of the present invention is to provide a method for generating a regression equation. More specifically, in the present invention, for example, when generating a regression equation used in the control of various devices such as vehicles, even when a large amount of sample data is used, the regression is compared with the conventional regression method. It is an object of the present invention to provide a method for generating a regression equation for specifying a control factor, which can shorten the time required for generating an equation and can maintain accuracy even when an abnormal value is mixed.

本発明の上記目的は、
制御対象となる装置の制御因子と、制御因子に対応する評価指標との関係を表す回帰式を生成する方法であって、
制御因子及び評価指標に関する情報を含む複数のサンプルデータを複数のグループに分割する、サンプルデータ分割ステップ、
前記サンプルデータ分割ステップにおいて分割された複数のグループのうちの１つのグループに属するサンプルデータに基づいて回帰式を生成する回帰式生成ステップであって、
当該ステップの実行が初回である場合は、当該グループに属するサンプルデータに含まれる制御因子と評価指標との関係を表す回帰式を求め、
当該ステップの実行が初回ではない場合は、当該ステップの前回までの実行によって求められた回帰式に基づいて当該グループに属するサンプルデータに含まれる制御因子から得られる結果と、当該グループに属するサンプルデータに含まれる評価指標との誤差を算出し、当該誤差と当該制御因子との関係を表す回帰式を求める、
回帰式生成ステップ、
前記回帰式生成ステップにおいて求められた回帰式と既存の回帰式とを加法的に重ね合わせて新たな回帰式を生成する回帰式加算ステップであって、
当該ステップの実行が初回である場合は、前記回帰式生成ステップにおいて求められた回帰式を新たな回帰式とし、
当該ステップの実行が初回ではない場合は、前記回帰式生成ステップにおいて求められた回帰式と、当該ステップの前回の実行によって求められた既存の回帰式とを加法的に重ね合わせて新たな回帰式を生成する、
回帰式加算ステップ、
を含む方法によって達成される。 The above object of the present invention is to
A method of generating a regression equation representing a relationship between a control factor of a device to be controlled and an evaluation index corresponding to the control factor,
A sample data dividing step for dividing a plurality of sample data including information on control factors and evaluation indices into a plurality of groups;
A regression equation generating step for generating a regression equation based on sample data belonging to one of a plurality of groups divided in the sample data dividing step,
If the execution of this step is the first time, a regression equation representing the relationship between the control factor and the evaluation index included in the sample data belonging to the group is obtained,
When the execution of the step is not the first time, the result obtained from the control factor included in the sample data belonging to the group based on the regression equation obtained by the previous execution of the step and the sample data belonging to the group To calculate an error with the evaluation index included in and to obtain a regression equation representing the relationship between the error and the control factor,
Regression equation generation step,
A regression equation addition step for generating a new regression equation by additively overlapping the regression equation obtained in the regression equation generation step with an existing regression equation,
When the execution of this step is the first time, the regression equation obtained in the regression equation generation step is a new regression equation,
If the execution of the step is not the first time, the regression equation obtained in the regression equation generation step and the existing regression equation obtained by the previous execution of the step are additively superimposed to obtain a new regression equation Generate
Regression equation addition step,
Achieved by a method comprising:

本発明によれば、サンプルデータ数が膨大である場合でも回帰式を生成するのに必要とされる計算時間が著しく増大することが無く、かつ、サンプルデータにおける異常値混入の影響を受け難い、回帰式の生成方法が提供される。より具体的には、本発明によれば、例えば車両等の各種装置の制御において使用される回帰式の生成に当たって、膨大なサンプルデータを用いる場合であっても、従来の回帰方法と比較して、回帰式生成に要する時間を短縮でき、かつ、異常値の混入があっても精度を維持することができる、制御因子特定用回帰式の生成方法が提供される。 According to the present invention, even when the number of sample data is enormous, the calculation time required to generate the regression equation does not increase remarkably, and it is difficult to be affected by the abnormal value mixing in the sample data. A method for generating a regression equation is provided. More specifically, according to the present invention, even when a large amount of sample data is used for generating regression equations used in the control of various devices such as vehicles, for example, compared with the conventional regression method. There is provided a method for generating a regression equation for specifying a control factor that can reduce the time required for generating a regression equation and can maintain accuracy even when an abnormal value is mixed.

本発明の１つの実施態様に係る回帰式生成方法の概略を表すグラフである。It is a graph showing the outline of the regression type production | generation method which concerns on one embodiment of this invention. 本発明の１つの実施態様に係る回帰式生成方法における処理フローを表すフローチャートである。It is a flowchart showing the processing flow in the regression type production | generation method which concerns on one embodiment of this invention. 本発明の１つの実施態様に係る回帰式生成方法、従来法（逐次的学習法）、及び通常法（一括回帰）についての、異常値や計測ノイズの混入による計測時間への影響の程度を示すグラフである。The extent of the influence on the measurement time by mixing of an abnormal value and measurement noise about the regression equation generation method, the conventional method (sequential learning method), and the normal method (collective regression) according to one embodiment of the present invention is shown. It is a graph. 本発明の１つの実施態様に係る回帰式生成方法、従来法（逐次的学習法）、及び通常法（一括回帰）についての、異常値や計測ノイズの混入による回帰精度（相関係数）への影響の程度を示すグラフである。Regression formula generation method according to one embodiment of the present invention, conventional method (sequential learning method), and normal method (collective regression) to the regression accuracy (correlation coefficient) due to mixing of abnormal values and measurement noise It is a graph which shows the grade of influence. 本発明の１つの実施態様に係る回帰式生成方法において、回帰式の基底関数として用いられるガウス関数の幅Ｒと予測誤差との関係を、一部のサンプルデータについて求めた場合と全てのサンプルデータについて求めた場合とで比較するグラフである。In the regression equation generation method according to one embodiment of the present invention, the relationship between the prediction error and the width R of the Gaussian function used as the basis function of the regression equation and all sample data are obtained. It is a graph compared with the case where it asked about. 本発明の１つの実施態様に係る回帰式生成方法における初回の回帰式のガウス関数の幅Ｒと２回目の回帰式のガウス関数の幅Ｒとの組み合わせによる回帰式の予測精度（汎化性能推定値）の変化を表すグラフである。In the regression equation generation method according to one embodiment of the present invention, the prediction accuracy of the regression equation (generalized performance estimation) by the combination of the Gaussian function width R of the first regression equation and the Gaussian function width R of the second regression equation It is a graph showing a change of (value). 本発明の１つの実施態様に係る回帰式生成方法における初回の回帰式のガウス関数の幅Ｒと２回目の回帰式のガウス関数の幅Ｒとの組み合わせによる回帰式の予測精度（汎化性能推定値）の変化を二次元的に表すコンタ図である。In the regression equation generation method according to one embodiment of the present invention, the prediction accuracy of the regression equation (generalized performance estimation) by the combination of the Gaussian function width R of the first regression equation and the Gaussian function width R of the second regression equation It is a contour diagram that represents a change in (value) two-dimensionally.

前述のように、本発明は、例えば車両等の各種装置の制御において使用される回帰式の生成に当たって、膨大な数のサンプルデータを用いる場合であっても、従来の回帰方法と比較して、回帰式生成に要する時間を短縮でき、かつ、異常値の混入があっても精度を維持することができる、制御因子特定用回帰式の生成方法を提供することを目的とする。 As described above, the present invention, for example, in the generation of regression equations used in the control of various devices such as vehicles, even when using a huge number of sample data, compared with the conventional regression method, An object of the present invention is to provide a method for generating a regression equation for specifying a control factor, which can shorten the time required for generating a regression equation and can maintain accuracy even when an abnormal value is mixed.

即ち、本発明の第１の態様に係る方法は、
制御対象となる装置の制御因子と、制御因子に対応する評価指標との関係を表す回帰式を生成する方法であって、
制御因子及び評価指標に関する情報を含む複数のサンプルデータを複数のグループに分割する、サンプルデータ分割ステップ、
前記サンプルデータ分割ステップにおいて分割された複数のグループのうちの１つのグループに属するサンプルデータに基づいて回帰式を生成する回帰式生成ステップであって、
当該ステップの実行が初回である場合は、当該グループに属するサンプルデータに含まれる制御因子と評価指標との関係を表す回帰式を求め、
当該ステップの実行が初回ではない場合は、当該ステップの前回までの実行によって求められた回帰式に基づいて当該グループに属するサンプルデータに含まれる制御因子から得られる結果と、当該グループに属するサンプルデータに含まれる評価指標との誤差を算出し、当該誤差と当該制御因子との関係を表す回帰式を求める、
回帰式生成ステップ、
前記回帰式生成ステップにおいて求められた回帰式と既存の回帰式とを加法的に重ね合わせて新たな回帰式を生成する回帰式加算ステップであって、
当該ステップの実行が初回である場合は、前記回帰式生成ステップにおいて求められた回帰式を新たな回帰式とし、
当該ステップの実行が初回ではない場合は、前記回帰式生成ステップにおいて求められた回帰式と、当該ステップの前回の実行によって求められた既存の回帰式とを加法的に重ね合わせて新たな回帰式を生成する、
回帰式加算ステップ、
を含む方法である。 That is, the method according to the first aspect of the present invention is:
A method of generating a regression equation representing a relationship between a control factor of a device to be controlled and an evaluation index corresponding to the control factor,
A sample data dividing step for dividing a plurality of sample data including information on control factors and evaluation indices into a plurality of groups;
A regression equation generating step for generating a regression equation based on sample data belonging to one of a plurality of groups divided in the sample data dividing step,
If the execution of this step is the first time, a regression equation representing the relationship between the control factor and the evaluation index included in the sample data belonging to the group is obtained,
When the execution of the step is not the first time, the result obtained from the control factor included in the sample data belonging to the group based on the regression equation obtained by the previous execution of the step and the sample data belonging to the group To calculate an error with the evaluation index included in and to obtain a regression equation representing the relationship between the error and the control factor,
Regression equation generation step,
A regression equation addition step for generating a new regression equation by additively overlapping the regression equation obtained in the regression equation generation step with an existing regression equation,
When the execution of this step is the first time, the regression equation obtained in the regression equation generation step is a new regression equation,
If the execution of the step is not the first time, the regression equation obtained in the regression equation generation step and the existing regression equation obtained by the previous execution of the step are additively superimposed to obtain a new regression equation Generate
Regression equation addition step,
It is a method including.

ここで、上記制御対象となる装置には、例えば、車両や船舶等の移動体が含まれるが、これらの装置は移動体に限定されるものではない。即ち、本発明は、許容可能な範囲にある制御結果を得るのに好適な制御因子の目標値を、回帰式を利用して算出し、斯くして得られた目標値に制御因子を制御することによって、所望の制御結果を得ようとするものであれば、如何なる装置にも適用可能である。 Here, although the apparatus used as the said control object contains moving bodies, such as a vehicle and a ship, for example, these apparatuses are not limited to a moving body. That is, the present invention calculates a target value of a control factor suitable for obtaining a control result within an allowable range using a regression equation, and controls the control factor to the target value thus obtained. Therefore, the present invention can be applied to any device as long as it is intended to obtain a desired control result.

尚、上記装置が車両である場合、ここでいう車両とは、例えば、汽車、電車、自動車等、旅客・貨物を輸送する車等を意味し、車両の動力源の種類としては、例えば、内燃機関、電気モーター等が挙げられるが、これらに限定されない。また、車両における制御とは、車両の各種特性に関する様々な要求を満足することを指す。より具体的には、例えば、内燃機関における機関トルクの変動、燃焼室内で発生するＮＯｘ（窒素酸化物）の量、燃焼室から排出される未燃ＨＣ（未燃炭化水素）の量、ＡＴ（自動変速機）における変速ショック、運転中の車両姿勢維持と操縦安定性との両立等の各種特性を評価指標として、これらの評価指標が所望の範囲に収まるように調整することを車両制御と称するが、これらの具体例に限定されない。 In the case where the device is a vehicle, the vehicle here means, for example, a train, a train, an automobile, or the like, a vehicle that transports passengers / cargo, etc. Examples include, but are not limited to, engines and electric motors. Moreover, the control in a vehicle refers to satisfying various requirements regarding various characteristics of the vehicle. More specifically, for example, fluctuations in engine torque in an internal combustion engine, the amount of NOx (nitrogen oxide) generated in the combustion chamber, the amount of unburned HC (unburned hydrocarbon) discharged from the combustion chamber, AT ( Various characteristics such as a shift shock in an automatic transmission) and compatibility between vehicle attitude maintenance during driving and steering stability are used as evaluation indexes, and adjustment so that these evaluation indexes fall within a desired range is referred to as vehicle control. However, it is not limited to these specific examples.

かかる評価指標を調整するには、制御の対象となる装置の状態（例えば、車両の運転状況等）に応じて、対応する制御因子を適宜調節して、評価指標としての制御結果が所望の範囲に収まるようにする必要がある。制御の対象となる装置が車両である場合、かかる制御因子としては、例えば、スロットル弁の開度、点火プラグによる点火時期、吸気弁又は排気弁の開閉弁特性、燃料噴射弁からの燃料噴射量、変速のタイミング、アブソーバのロータリーバルブの回転角度等の各種パラメータ（制御因子）が挙げられる。 In order to adjust the evaluation index, the corresponding control factor is appropriately adjusted according to the state of the device to be controlled (for example, the driving situation of the vehicle), and the control result as the evaluation index is in a desired range. Need to fit in. When the device to be controlled is a vehicle, the control factors include, for example, the opening degree of the throttle valve, the ignition timing by the ignition plug, the opening / closing valve characteristics of the intake valve or the exhaust valve, the fuel injection amount from the fuel injection valve , timing of transmission, various parameters such as the rotation angle of the rotary valve of Abu source over bar (control factor).

即ち、装置の状態（例えば、車両の運転状況等）を制御しようとする際には、様々な状況に応じて制御因子が適宜調整され、制御結果（評価指標）が所定の許容可能な範囲に収められる。かかる制御を行うに当たっては、関連する制御因子と制御結果（評価指標）との対応関係を表す式を予め求めておくのが一般的である。具体的には、制御因子と制御結果（評価指標）との組み合わせからなる複数のサンプルデータ（実験値）から、これらの制御因子と制御結果（評価指標）との対応関係を表す回帰式を事前に導き出しておき、制御の対象となる装置の制御において許容可能な範囲にある制御結果（評価指標）を得るのに必要な制御因子の目標値を、上記回帰式を使用して算出し、斯くして算出された目標値に制御因子を調整して、当該装置の各種特性に関する様々な要求が満たされる。 That is, when trying to control the state of the device (for example, the driving situation of the vehicle, etc.), the control factor is appropriately adjusted according to various situations, and the control result (evaluation index) is within a predetermined allowable range. Can be stored. In performing such control, it is common to obtain in advance an expression representing a correspondence relationship between a related control factor and a control result (evaluation index). Specifically, from multiple sample data (experimental values) consisting of combinations of control factors and control results (evaluation indicators), a regression equation representing the correspondence between these control factors and control results (evaluation indicators) is pre- The target value of the control factor necessary for obtaining the control result (evaluation index) within the allowable range in the control of the device to be controlled is calculated using the above regression equation. By adjusting the control factor to the target value thus calculated, various requirements regarding various characteristics of the device are satisfied.

以下、図面を参照しながら、本発明の１つの実施態様の概要について説明する。図１は、通常の回帰式生成方法と、本発明の１つの実施態様に係る回帰式生成方法との違いを説明するためのグラフである。 Hereinafter, an outline of one embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a graph for explaining a difference between a normal regression equation generation method and a regression equation generation method according to one embodiment of the present invention.

図１（ａ）は、通常法による回帰式の生成方法を表している。この方法は、全てのサンプルデータについての因子と評価指標との関係を一度に回帰する方法である。前述したように、このように全てのサンプルデータについて一度に回帰式を求めようとすると、サンプルデータ数が多い場合には回帰式を生成するのに計算時間が非常に長くなるという短所がある。 FIG. 1A shows a method for generating a regression equation by a normal method. This method is a method of regressing the relationship between factors and evaluation indices for all sample data at once. As described above, if the regression equation is obtained at once for all the sample data in this way, there is a disadvantage that the calculation time becomes very long to generate the regression equation when the number of sample data is large.

一方、図１（ｂ）乃至（ｄ）は、本発明の１つの実施態様に係る回帰式生成方法におけるデータ処理手順の概略を説明している。 On the other hand, FIGS. 1B to 1D illustrate an outline of a data processing procedure in the regression equation generation method according to one embodiment of the present invention.

先ず、図１（ｂ）は、図１（ａ）の場合と同じサンプルデータを２つのグループに分け（前記「サンプルデータ分割ステップ」に対応）、そのうちの一方のグループ（第１グループ）に属するサンプルデータのみについて、回帰式を求める（前記「回帰式生成ステップ」の初回実行の場合に対応）ステップを示している。次に、図１（ｃ）においては、図１（ｂ）において得られた回帰式に対する他方のグループ（第２グループ）に属するサンプルデータからの誤差を表す回帰式を求めるステップ（前記「回帰式生成ステップ」の初回実行ではない場合に対応）を示している。更に、図１（ｄ）においては、図１（ｂ）において得られた回帰式と、図１（ｃ）において得られた（誤差についての）回帰式とを、加法的に重ね合わせるステップ（前記「回帰式加算ステップ」の初回実行ではない場合に対応）を示している。このステップにより、第１グループ及び第２グループに属する全てのサンプルデータ（即ち、全サンプルデータ）に対応する回帰式を得ることができる。 First, FIG. 1B divides the same sample data as in FIG. 1A into two groups (corresponding to the “sample data division step”), and belongs to one of the groups (first group). The figure shows the step of obtaining a regression equation for sample data only (corresponding to the first execution of the “regression equation generation step”). Next, in FIG. 1C, a step of obtaining a regression equation representing an error from sample data belonging to the other group (second group) with respect to the regression equation obtained in FIG. This corresponds to the case where the “generation step” is not executed for the first time). Further, in FIG. 1 (d), the regression equation obtained in FIG. 1 (b) and the regression equation (with respect to error) obtained in FIG. This corresponds to the case where the “regression formula addition step” is not executed for the first time). By this step, regression equations corresponding to all sample data belonging to the first group and the second group (that is, all sample data) can be obtained.

尚、上記説明においては、サンプルデータを２つのグループに分けた場合について説明したが、サンプルデータを３つ以上のグループに分ける場合は、グループの数に応じて、図１（ｃ）及び（ｄ）のステップを繰り返し実行することにより（前記「回帰式生成ステップ」の初回実行ではない場合に対応）、全てのグループに属するサンプルデータ（即ち、全サンプルデータ）に対応する回帰式が得られる。 In the above description, the case where the sample data is divided into two groups has been described. However, in the case where the sample data is divided into three or more groups, depending on the number of groups, FIGS. ) Is repeatedly executed (corresponding to the case where the “regression equation generation step” is not executed for the first time), the regression equations corresponding to the sample data belonging to all groups (that is, all sample data) are obtained.

ところで、本発明に係る方法において回帰式を生成する方法としては、特に限定されるものではなく、各種回帰方法を適用することができる。好ましい回帰方法の例としては、前述のように、サポートベクタ回帰法（ＳｕｐｐｏｒｔＶｅｃｔｏｒＲｅｇｒｅｓｓｉｏｎ：以下「ＳＶＲ」とも称する）と呼ばれる手法が挙げられる。 By the way, the method for generating the regression equation in the method according to the present invention is not particularly limited, and various regression methods can be applied. As an example of a preferable regression method, as described above, there is a method called a support vector regression method (Support Vector Regression: hereinafter also referred to as “SVR”).

従って、本発明の第２の態様に係る方法は、前記第１の態様に係る方法であって、前記回帰式生成ステップにおいて、サポートベクタ回帰によって回帰式を求めることを特徴とする方法である。 Therefore, the method according to the second aspect of the present invention is the method according to the first aspect, wherein the regression equation is obtained by support vector regression in the regression equation generation step.

次に、図２を参照すると、本発明の１つの実施態様に係る回帰式生成方法における処理フローを表すフローチャート１０が示されている。この例においては、ステップ１１及び１２において、サンプルデータ群が複数のグループに分けられている（前記「サンプルデータ分割ステップ」に対応）。サンプルデータ群を複数のグループに分ける際、制御用の因子（入力）空間上の特定の範囲に存在するサンプルデータが特定のグループに偏って存在するようなことの無いように、サンプルデータを各グループに満遍なく振り分けるのが望ましい。 Next, referring to FIG. 2, there is shown a flowchart 10 representing a processing flow in the regression equation generation method according to one embodiment of the present invention. In this example, the sample data group is divided into a plurality of groups in steps 11 and 12 (corresponding to the “sample data dividing step”). When dividing the sample data group into multiple groups, each sample data must be divided so that sample data existing in a specific range in the control factor (input) space does not exist in a specific group. It is desirable to distribute evenly to groups.

従って、本発明の第３の態様に係る方法は、前記第１又は第２の態様に係る方法であって、前記サンプルデータ分割ステップにおいて、制御因子空間上の特定の範囲に存在するサンプルデータが特定のグループに偏って存在しないように、サンプルデータを各グループに満遍なく振り分けることを特徴とする方法である。 Therefore, the method according to the third aspect of the present invention is the method according to the first or second aspect, wherein in the sample data dividing step, sample data existing in a specific range on the control factor space is stored. In this method, sample data is uniformly distributed to each group so as not to be biased to a specific group.

上記のようにサンプルデータ群を満遍なくグループ分けするための手順の一例につき、以下に詳細に説明する。先ず、サンプルデータ群について、因子（入力）空間上での中心（重心）に最も近いサンプルデータ（実験点）を選び出し、この因子につき「順序確定済み」とし、かつ「確定順＝１」とする（図２におけるステップ１１）。 An example of a procedure for uniformly grouping the sample data groups as described above will be described in detail below. First, sample data (experiment point) closest to the center (center of gravity) in the factor (input) space is selected for the sample data group, and the order is determined to be “confirmed” and “confirmed order = 1”. (Step 11 in FIG. 2).

次に、未だ「順序確定済み」となっていないサンプルデータのそれぞれにつき、既に「順序確定済み」となっている全てのサンプルデータとの因子（入力）空間上での距離を求める。各サンプルデータにつき、斯くして求められた距離の中から最小の距離（最小距離）を特定する。斯くして特定された最小距離が最大であるサンプルデータを特定し、このサンプルデータを「順序確定済み」とし、かつ確定順を１つカウントアップして、このサンプルデータに付与する。全てのサンプルデータが「順序確定済み」となり、確定順が付与されるまで、当該処理を繰り返す。その後、サンプルデータを確定順の昇順にソートし、個々のグループに必要な数だけサンプルデータを選択し、グループ分けする（図２におけるステップ１２）。 Next, for each piece of sample data that has not yet been “order fixed”, the distance in the factor (input) space with all the sample data that has already been “order fixed” is obtained. For each sample data, the minimum distance (minimum distance) is specified from the distances thus determined. The sample data having the maximum minimum distance specified in this way is specified, this sample data is set to “order fixed”, and the fixed order is counted up by one and given to this sample data. This process is repeated until all the sample data is “order fixed” and a fixed order is given. Thereafter, the sample data is sorted in ascending order of the confirmation order, and the sample data as many as necessary for each group are selected and grouped (step 12 in FIG. 2).

換言すれば、因子（入力）空間上での中心（重心）に最も近いサンプルデータを起点として順序を確定し、既に順序が確定された全サンプルデータに対して最も疎な（最小距離が最大である）サンプルデータに対して次々に順序を確定してゆき、斯くして確定された順序に従ってサンプルデータをソートし、各グループに必要な数だけサンプルデータを取り分けてゆく。 In other words, the order is determined starting from the sample data closest to the center (center of gravity) in the factor (input) space, and the least sparse (the minimum distance is the maximum) for all the sample data for which the order has already been determined. The order is determined one after another for the sample data, and the sample data is sorted according to the order thus determined, and as many sample data as necessary for each group are sorted.

尚、上記手順は、サンプルデータ群を複数のグループに分ける手順の一例としての例示に過ぎず、因子（入力）空間上の特定の範囲に存在するサンプルデータが特定のグループに偏って存在するようなことの無いように、サンプルデータを各グループに満遍なく振り分けることが出来る限り、上記手順に何ら限定されるものではない。 The above procedure is merely an example of a procedure for dividing a sample data group into a plurality of groups, and sample data existing in a specific range in the factor (input) space is biased to a specific group. As long as the sample data can be uniformly distributed to each group so that there is nothing, there is no limitation to the above procedure.

上記のようにサンプルデータ群を複数のグループに分けた後、処理フローは回帰回数ループに入る（図２におけるステップ２０〜７０）。先ず、初回の回帰であるかどうかの条件分岐があり（図２におけるステップ３０）、初回の回帰である場合は、因子と評価指標との関係を表す回帰式を求める（図２におけるステップ４１、前記「回帰式生成ステップ」の初回実行の場合に対応）。 After dividing the sample data group into a plurality of groups as described above, the processing flow enters the regression number loop (steps 20 to 70 in FIG. 2). First, there is a conditional branch as to whether or not it is the first regression (step 30 in FIG. 2), and in the case of the first regression, a regression equation representing the relationship between the factor and the evaluation index is obtained (step 41 in FIG. 2). (This corresponds to the case of the first execution of the “regression formula generation step”).

その後、処理フローは、前記「回帰式加算ステップ」の初回実行の場合に対応するステップ６０に進む。即ち、初回実行の場合は、未だ回帰式は１つしか生成されていないので、当該ステップ６０では回帰式の加算は行わず、ステップ７０に進み、ステップ２０に戻って、サンプルデータの次のグループの処理へと進む。 Thereafter, the process flow proceeds to step 60 corresponding to the first execution of the “regression formula addition step”. That is, in the case of the first execution, since only one regression equation has been generated yet, the regression equation is not added in the step 60, the process proceeds to step 70, the process returns to step 20, and the next group of sample data Proceed to the process.

第２回目以降の回帰においては、処理フローは条件分岐（図２におけるステップ３０）によってステップ５１に進む。このステップでは、初回の回帰のように制御因子と評価指標との関係を表す回帰式を求めるのではなく、前回求められた回帰式からの誤差を個々のサンプルデータ（制御因子と評価指標との組）について算出し（図２におけるステップ５１）、算出された誤差と制御因子との関係を表す回帰式を求める（図２におけるステップ５２、前記「回帰式生成ステップ」の初回実行ではない場合に対応）。 In the second and subsequent regressions, the process flow proceeds to step 51 by a conditional branch (step 30 in FIG. 2). In this step, instead of obtaining a regression equation that represents the relationship between the control factor and the evaluation index as in the first regression, the error from the previously obtained regression equation is calculated as individual sample data (control factor and evaluation index 2) (step 51 in FIG. 2), and a regression equation representing the relationship between the calculated error and the control factor is obtained (step 52 in FIG. 2, when the “regression equation generation step” is not the first execution). Correspondence).

その後、処理フローは、前記「回帰式加算ステップ」の初回実行ではない場合に対応するステップ６０に進む。即ち、初回実行ではない場合、即ち第２回目以降の回帰においては、前回の処理によって生成された回帰式が既に生成されているので、この既存の回帰式と、ステップ５２において求められた回帰式とを加法的に重ね合わせて新たな回帰式とする。その後、サンプルデータの全てのグループについての処理が完了するまで、ステップ２０とステップ７０との間の処理フローが繰り返される。 Thereafter, the processing flow proceeds to step 60 corresponding to the case where the “regression formula addition step” is not executed for the first time. That is, when it is not the first execution, that is, in the second and subsequent regressions, the regression equation generated by the previous processing has already been generated. Therefore, the existing regression equation and the regression equation obtained in step 52 are obtained. Are added to form a new regression equation. Thereafter, the processing flow between step 20 and step 70 is repeated until the processing for all groups of sample data is completed.

以上のようにして、サンプルデータの全てのグループについての処理が完了すると、処理フローを終了する。この時点での回帰式は、初回の回帰において求められた回帰式と、それ以降に求められた回帰式の全てとが、加法的に重ね合わせられた、全サンプルデータに対応する回帰式となっている。 As described above, when the processing for all the groups of sample data is completed, the processing flow ends. The regression equation at this point is a regression equation corresponding to all sample data in which the regression equation obtained in the first regression and all of the regression equations obtained thereafter are additively superimposed. ing.

ところで、上述のように、図２に示されているフローチャートにおいては、回帰式を生成する処理ループの１サイクル毎に、回帰式を加算する処理が実行されているが、回帰式加算処理は必ずしも上記タイミングで実行される必要は無い。例えば、前回の処理ループにおいて生成された回帰式を既存の回帰式に加算して新たな回帰式とする回帰式加算ステップを、ステップ５１の直前に実行してもよい（但し、この場合は、サンプルデータの最後のグループの処理が完了した以降に、当該グループに対応する回帰式（前回の回帰式との誤差の回帰式）を既存の回帰式に加算する処理が別途必要である）。 Incidentally, as described above, in the flowchart shown in FIG. 2, the process of adding the regression formula is executed for each cycle of the processing loop for generating the regression formula. There is no need to execute at the above timing. For example, a regression equation addition step that adds a regression equation generated in the previous processing loop to an existing regression equation to form a new regression equation may be executed immediately before step 51 (however, in this case, After the processing of the last group of sample data is completed, a separate process of adding the regression equation corresponding to the group (regression equation of error from the previous regression equation) to the existing regression equation is required).

尚、前述のように、本発明は、特定の装置の制御にのみ適用されるべきものではなく、許容可能な範囲にある制御結果を得るのに好適な制御因子の目標値を、回帰式を利用して算出し、斯くして得られた目標値に制御因子を制御することによって、所望の制御結果を得ようとするものであれば、如何なる装置にも適用可能である。従って、前記第１乃至３の態様のいずれかに係る方法であって、前記制御対象となる装置が車両であることを特徴とする方法もまた、本発明の態様の１つとして、本発明の範囲に含まれる。即ち、本発明の第４の態様に係る方法は、前記第１乃至３の態様のいずれかに係る方法であって、前記制御対象となる装置が車両であることを特徴とする方法である。 As described above, the present invention should not be applied only to the control of a specific device, but the target value of a control factor suitable for obtaining a control result within an allowable range is expressed by a regression equation. The present invention is applicable to any apparatus as long as it is intended to obtain a desired control result by calculating using it and controlling the control factor to the target value thus obtained. Therefore, the method according to any one of the first to third aspects, wherein the device to be controlled is a vehicle, is also an aspect of the present invention. Included in the range. That is, the method according to the fourth aspect of the present invention is the method according to any one of the first to third aspects, wherein the device to be controlled is a vehicle.

ところで、前述のように、サンプルデータを複数のグループに分け、それぞれのグループについて回帰を行うことにより、ＳＭＯ法等、回帰式の算出時間を短縮可能な解法を用いずに二次計画問題を解く際に、サンプルデータ数が多い場合であっても、回帰式生成に要する時間を短縮することができるが、これは、前述のように、二次計画問題の求解時間はサンプルデータ数の３乗のオーダーで、二次計画問題を計算機で解く場合に必要とされる記憶領域の容量はサンプルデータ数の２乗のオーダーで、それぞれ増大することに起因する。 By the way, as described above, by dividing the sample data into a plurality of groups and performing regression on each group, the quadratic programming problem can be solved without using the SMO method or the like that can shorten the calculation time of the regression equation. However, even when the number of sample data is large, the time required for generating the regression equation can be shortened. As described above, the solution time of the quadratic programming problem is the cube of the number of sample data. This is because the capacity of the storage area required when the quadratic programming problem is solved by a computer in the order of 2 is increased by the order of the square of the number of sample data.

より具体的には、Ｎ個のサンプルデータがあり、これらのサンプルデータを一括して回帰した場合の回帰式の計算時間をＴ_０とすると、これらのサンプルデータをｋ個のグループに分けることにより、個々のグループに含まれるサンプルデータ数はＮ／ｋ個となり、個々のグループにおける回帰式の計算時間は、Ｔ_０×（１／ｋ）^３となる。従って、全グループの計算時間は、ｋ×Ｔ_０×（１／ｋ）^３となり、これにサンプルデータのグループ化したり、２個目以降のグループを処理する際に既存の回帰式からの誤差を計算したり、個々のグループについて得られた回帰式を加算したりする処理に要する時間を加えたものが全体としての処理時間となる。しかしながら、これらの付随的な処理に要する時間は、回帰式の計算時間と比べると十分に小さい。従って、例えば、サンプルデータを１０個のグループに分けた場合、一括して処理する場合と比較すると、計算時間はおよそ１００分の１に短縮される。 More specifically, there are N pieces of sample data, and when these sample data are collectively regressed, assuming that the calculation time of the regression equation is T ₀ , these sample data are divided into k groups. The number of sample data included in each group is N / k, and the calculation time of the regression equation in each group is T ₀ × (1 / k) ³ . Therefore, the calculation time of all groups is k × T ₀ × (1 / k) ³ , and when this is grouped with sample data or when the second and subsequent groups are processed, an error from the existing regression equation is added. The total processing time is obtained by adding the time required for the calculation or adding the regression equations obtained for each group. However, the time required for these incidental processes is sufficiently small compared with the calculation time of the regression equation. Therefore, for example, when the sample data is divided into 10 groups, the calculation time is reduced to about 1/100 compared with the case where the sample data is processed in a lump.

上記のように、本発明に係る方法によれば、サンプルデータを複数のグループに分け、それぞれのグループについて回帰式を計算するので、サンプルデータ数が多いと計算時間が非常に長くなるという問題を解消することができる。 As described above, according to the method according to the present invention, the sample data is divided into a plurality of groups, and the regression equation is calculated for each group. Therefore, when the number of sample data is large, the calculation time becomes very long. Can be resolved.

また、前述のように、二次計画問題を計算機で解く場合に必要とされる記憶領域の容量は、サンプルデータ数の２乗のオーダーで増大するので、本発明に係る方法によれば、個々のグループについて回帰式を計算する際のサンプルデータ数を抑えることができることから、回帰式を求めるのに必要とされる計算機の記憶容量を抑えることが出来るので、記憶容量が比較的少ない計算機においても、多くのサンプルデータを用いて回帰式の計算を実行することができる。 Further, as described above, since the capacity of the storage area required when solving the quadratic programming problem with a computer increases in the order of the square of the number of sample data, according to the method according to the present invention, Since it is possible to reduce the number of sample data when calculating the regression equation for the group, it is possible to reduce the memory capacity of the computer required to obtain the regression equation, so even in a computer with a relatively small storage capacity The regression equation can be calculated using a lot of sample data.

ところで、本発明に係る方法に従って、サンプルデータを複数のグループに分け、それぞれのグループについて回帰式を計算することに伴って、上記のように回帰式の計算時間は大幅に短縮されるが、回帰精度については、サンプルデータをより多くのグループに分けるほど（換言すれば、個々のグループに属するサンプルデータ数が少なくなるほど）低下する傾向がある。即ち、計算速度と回帰精度とはトレードオフの関係にある。従って、目的とする評価指標（制御結果）の制御に求められる精度に応じて、サンプルデータのグループの個数（大きさ）を調整することにより、計算速度と回帰精度とのバランスを適宜とることができる。 By the way, according to the method according to the present invention, the sample data is divided into a plurality of groups and the regression equation is calculated for each group. The accuracy tends to decrease as the sample data is divided into more groups (in other words, as the number of sample data belonging to each group decreases). That is, the calculation speed and the regression accuracy are in a trade-off relationship. Therefore, by appropriately adjusting the number (size) of groups of sample data according to the accuracy required for controlling the target evaluation index (control result), the calculation speed and the regression accuracy can be appropriately balanced. it can.

尚、上記のように、本発明に係る方法における回帰精度は、サンプルデータをより多くのグループに分けるほど低下する傾向があるが、この傾向は極めて小さい。これは、本発明に係る方法においては、２回目（２グループ目）以降の回帰においては、直前の段階で求められた回帰式に対する誤差を近似するため、通常、基準となる回帰式に対して正になる誤差と負になる誤差とがほぼ均等に分布することから、これらの誤差についての回帰式を求め、この回帰式を基準となる回帰式に加法的に重ね合わせて新たな回帰式とする際には、各部における誤差が相殺される方向に作用し、誤差が増大され難いことに起因するものと考えられる。 As described above, the regression accuracy in the method according to the present invention tends to decrease as the sample data is divided into more groups, but this tendency is extremely small. This is because, in the method according to the present invention, in the second and subsequent regressions, the error with respect to the regression equation obtained in the immediately preceding stage is approximated. Since the error that becomes positive and the error that becomes negative are distributed almost evenly, a regression equation for these errors is obtained, and this regression equation is added to the reference regression equation to form a new regression equation. In this case, it is considered that the error in each part acts in the direction to be canceled and the error is hardly increased.

また、本発明に係る方法においては、サンプルデータにノイズや異常値が含まれる場合、当該サンプルデータを含むグループについての回帰式には当該ノイズや異常値の影響が生ずるが、他のグループについての回帰式には影響しない。従って、各グループについて求められた回帰式を加法的に重ね合わせて新たな回帰式を得る処理に伴い、ノイズや異常値の影響を受けた回帰式と、かかる影響を受けない回帰式とが相加平均される。その結果、本発明に係る方法においては、ノイズや異常値の影響が軽減される。 Further, in the method according to the present invention, when noise or abnormal value is included in the sample data, the regression equation for the group including the sample data is affected by the noise or abnormal value. It does not affect the regression equation. Therefore, as a result of additively superimposing the regression equations obtained for each group to obtain a new regression equation, the regression equation affected by noise and anomalous values and the regression equation not affected by this are compatible. Averaged. As a result, in the method according to the present invention, the influence of noise and abnormal values is reduced.

以上のように、本発明により、サンプルデータ数が膨大である場合であっても、従来の回帰方法と比較して、回帰式を生成するのに必要とされる計算時間が著しく増大することを回避しつつ、かつ、サンプルデータにおける異常値混入の影響による精度の低下をも回避することができる、制御因子特定用回帰式の生成方法が提供される。 As described above, according to the present invention, even when the number of sample data is enormous, the calculation time required for generating the regression equation is remarkably increased as compared with the conventional regression method. Provided is a method for generating a regression equation for specifying a control factor, which can avoid a decrease in accuracy due to the influence of an abnormal value in sample data.

尚、本発明に係る方法によって生成される制御因子特定用回帰式は、例えば、ＣＰＵ、記憶装置（例えば、揮発性メモリ、不揮発性メモリ、ハードディスク等）、各種センサ等の検出装置からの信号入力部、サーボモータ等のアクチュエータや他のデバイスへの信号出力部等を含んでなる制御ユニット（例えば、内燃機関を搭載する車両におけるＥＣＵ等）における記憶装置に格納され、対象となる装置の制御を行う際に制御用プログラム等によって読み出され、その時々の装置の状況に応じた制約条件下で、所望の制御結果を得るための制御因子の目標値を算出する処理等に使用される。 The regression equation for specifying the control factor generated by the method according to the present invention is, for example, a signal input from a detection device such as a CPU, a storage device (for example, a volatile memory, a non-volatile memory, a hard disk, etc.) or various sensors. Control unit (for example, ECU in a vehicle equipped with an internal combustion engine) including an actuator, an actuator such as a servo motor, a signal output unit to other devices, etc. When it is performed, it is read out by a control program or the like, and is used for processing for calculating a target value of a control factor for obtaining a desired control result under a constraint condition according to the situation of the apparatus at that time.

ところで、本発明に係る回帰式生成方法においては、所謂「ガウス関数」を基底関数として採用する回帰式も想定される。ガウス関数とは、例えば以下の一般式（Ａ）によって表される。 By the way, in the regression equation generation method according to the present invention, a regression equation employing a so-called “Gaussian function” as a basis function is also assumed. The Gaussian function is represented by the following general formula (A), for example.

従って、ガウス関数を基底関数として用いる回帰式は、以下の一般式（Ｂ）によって表される（式中、β_１は係数である）。 Therefore, a regression equation using a Gaussian function as a basis function is represented by the following general formula (B) (wherein β ₁ is a coefficient).

上記のように回帰式の基底関数としてガウス関数を採用する場合、ガウス関数の幅（一般式（Ｂ）におけるＲ）を、適切な方法を用いて、別途決定する必要がある。ガウス関数の幅Ｒを決定するための方法としては、当該技術分野において一般的に行われている種々のいずれの方法を用いてもよい。一例としては、例えば、ガウス関数の幅Ｒの候補を幾つか挙げておき、それぞれの幅Ｒを用いる回帰式の予測誤差を求め、最も小さい予測誤差を与える回帰式に用いられた幅Ｒを採択する方法が挙げられる。 As described above, when a Gaussian function is adopted as the basis function of the regression equation, it is necessary to separately determine the width of the Gaussian function (R in the general formula (B)) using an appropriate method. As a method for determining the width R of the Gaussian function, any of various methods generally performed in the technical field may be used. As an example, for example, several candidates for the width R of the Gaussian function are listed, the prediction error of the regression equation using each width R is obtained, and the width R used in the regression equation that gives the smallest prediction error is adopted. The method of doing is mentioned.

いずれの方法を採用するにせよ、回帰式の基底関数としてガウス関数を採用する場合、ガウス関数の幅Ｒを決定することは不可避の手続である。従って、かかる場合においても、サンプルデータ数が膨大である場合でも回帰式を生成するのに必要とされる計算時間を短縮し、かつ異常値の混入があっても高い精度を維持するという本発明の目的を達成するためには、高精度の回帰式を与えるガウス関数の幅Ｒを、より少ない処理量（作業量、計算量）で決定することができる方法が必要である。 Whichever method is adopted, when a Gaussian function is adopted as the basis function of the regression equation, determining the width R of the Gaussian function is an unavoidable procedure. Therefore, even in such a case, the present invention reduces the calculation time required to generate the regression equation even when the number of sample data is enormous, and maintains high accuracy even if there are abnormal values mixed in. In order to achieve the above objective, a method capable of determining the width R of the Gaussian function that gives a highly accurate regression equation with a smaller amount of processing (amount of work, calculation amount) is required.

そこで、本発明者は、鋭意研究の結果、ガウス関数を基底関数として用いる回帰式を採用する場合、先ず回帰式生成ステップの初回実行時にガウス関数の幅Ｒを上記のように決定しておき、回帰式生成ステップの２回目以降の実行時には、ガウス関数の幅Ｒを、初回実行時の幅Ｒよりも小さい値とすることにより、より少ない処理量（作業量、計算量）で２回目以降の回帰実行時の幅Ｒを決定することができ、斯くして生成される回帰式の精度も高いことを見出した。 Therefore, as a result of earnest research, the present inventor, when adopting a regression equation using a Gaussian function as a basis function, first determines the width R of the Gaussian function as described above at the first execution of the regression equation generation step, At the second and subsequent executions of the regression formula generation step, the width R of the Gaussian function is set to a value smaller than the width R at the first execution, so that the second and subsequent processings can be performed with a smaller amount of processing (amount of work and calculation) It has been found that the width R at the time of regression execution can be determined, and the accuracy of the regression equation thus generated is also high.

即ち、本発明の第５の態様に係る方法は、
前記第１乃至第４の態様のいずれかに係る方法であって、
前記回帰式がガウス関数を基底関数として用いること、及び
前記回帰式生成ステップにおいて、前記回帰式生成ステップの実行が初回である場合における前記ガウス関数の幅と比較して、前記回帰式生成ステップの実行が初回ではない場合における前記ガウス関数の幅がより小さいこと、
を特徴とする方法である。 That is, the method according to the fifth aspect of the present invention is:
A method according to any one of the first to fourth aspects,
The regression equation uses a Gaussian function as a basis function, and in the regression equation generation step, compared to the width of the Gaussian function when the regression equation generation step is executed for the first time, the regression equation generation step The width of the Gaussian function is smaller when the execution is not the first time,
It is the method characterized by this.

上記のように、本実施態様は、ガウス関数を基底関数として用いる回帰式を採用する場合を想定している。そして、本実施態様においては、上述のように、回帰式生成ステップにおいて、回帰式生成ステップの実行が初回である場合におけるガウス関数の幅（Ｒ）と比較して、回帰式生成ステップの実行が初回ではない場合におけるガウス関数の幅（Ｒ）の方がより小さくなるようにする。 As described above, the present embodiment assumes a case where a regression equation using a Gaussian function as a basis function is employed. In the present embodiment, as described above, in the regression formula generation step, the regression formula generation step is executed in comparison with the width (R) of the Gaussian function when the regression formula generation step is executed for the first time. The width (R) of the Gaussian function when it is not the first time is made smaller.

より具体的には、本実施態様においては、回帰式生成ステップの２回目以降の実行時には、初回実行時に求められたガウス関数の幅（Ｒ）よりも小さい値のみを、ガウス関数の幅（Ｒ）の候補として挙げればよい。換言すれば、回帰式生成ステップの２回目以降の実行時には、初回実行時に求められたガウス関数の幅（Ｒ）よりも大きい値については、ガウス関数の幅（Ｒ）の候補から除外することができる。これにより、ガウス関数の幅（Ｒ）を決定するための計算量を削減することができる。また、場合によっては、回帰式生成ステップの２回目以降の実行時には、複数の候補から最適値を選ぶのではなく、初回実行時に求められたガウス関数の幅（Ｒ）よりも十分に小さい値（例えば、１）をガウス関数の幅（Ｒ）として設定してもよい。 More specifically, in the present embodiment, at the second and subsequent executions of the regression equation generation step, only a value smaller than the Gaussian function width (R) obtained at the first execution is used as the Gaussian function width (R). ) As a candidate. In other words, when the regression equation generation step is executed for the second and subsequent times, a value larger than the Gaussian function width (R) obtained at the first execution may be excluded from the Gaussian function width (R) candidates. it can. Thereby, the amount of calculation for determining the width (R) of the Gaussian function can be reduced. In some cases, when the regression equation generation step is executed for the second time or later, an optimal value is not selected from a plurality of candidates, but a value sufficiently smaller than the width (R) of the Gaussian function obtained at the first execution ( For example, 1) may be set as the width (R) of the Gaussian function.

また、本実施態様においては、回帰式生成ステップの３回目以降の実行時にも上記手順を繰り返して、回帰式生成ステップの実行の度に、より小さいガウス関数の幅（Ｒ）を採択するようにしてもよい。あるいは、回帰式生成ステップの３回目以降の実行時には上記手順を省略して、回帰式生成ステップの２回目の実行時に求められたガウス関数の幅（Ｒ）を、回帰式生成ステップの３回目以降の実行時においても、そのまま使用してもよい。 In this embodiment, the above procedure is repeated at the third and subsequent executions of the regression equation generation step, and a smaller Gaussian function width (R) is adopted each time the regression equation generation step is executed. May be. Alternatively, when the regression equation generation step is executed for the third time or later, the above procedure is omitted, and the width (R) of the Gaussian function obtained during the second execution of the regression equation generation step is calculated as the third or subsequent time of the regression equation generation step. You may use it as it is even when executing.

尚、前述のように、本発明に係る回帰式生成方法においては、サンプルデータを複数のグループに分け、それぞれのグループについて回帰を行う。従って、回帰式の基底関数としてガウス関数を用いる場合、回帰式生成ステップの初回実行時には、複数のグループの中の１つのグループに属するサンプルデータのみに基づいてガウス関数の幅（Ｒ）が求められることになる。このように一部のサンプルデータのみに基づいて求められるガウス関数の幅（Ｒ）については、その妥当性が懸念されると考えることもできる。 As described above, in the regression equation generation method according to the present invention, sample data is divided into a plurality of groups, and regression is performed for each group. Accordingly, when a Gaussian function is used as the basis function of the regression equation, the width (R) of the Gaussian function is obtained based on only sample data belonging to one group among a plurality of groups at the first execution of the regression equation generation step. It will be. Thus, it can be considered that the validity (gauss function width) of the Gaussian function obtained based only on a part of sample data is concerned.

しかしながら、実際には、このように一部のサンプルデータのみに基づいて求められるガウス関数の幅（Ｒ）は、より多数のサンプルデータに基づいて求められるガウス関数の幅（Ｒ）と比較して、予測誤差（汎化性能推定値）の大きさについては差異が認められるものの、最も小さい予測誤差（汎化性能推定値）を与える最適なガウス関数の幅（Ｒ）の値については差異は認められない。つまり、本発明に係る回帰式生成方法のように一部のサンプルデータのみに基づいてガウス関数の幅（Ｒ）を求めても、最適なガウス関数の幅（Ｒ）を決定することができる（詳しくは、実施例において説明する）。 However, in practice, the width (R) of the Gaussian function obtained based on only a part of the sample data is compared with the width (R) of the Gaussian function obtained based on a larger number of sample data. Although there is a difference in the magnitude of the prediction error (generalization performance estimate), there is no difference in the optimal Gaussian function width (R) value that gives the smallest prediction error (generalization performance estimate). I can't. That is, even if the Gaussian function width (R) is obtained based on only a part of sample data as in the regression equation generation method according to the present invention, the optimum Gaussian function width (R) can be determined ( Details will be described in Examples).

ところで、前述のように、ガウス関数の幅Ｒの最適値は、例えば、ガウス関数の幅Ｒの候補を幾つか挙げておき、それぞれの幅Ｒを用いる回帰式の予測誤差を求め、最も小さい予測誤差を与える回帰式に用いられた幅Ｒを最適値として採択することができる。かかる手順の好ましい具体例としては、例えば、所謂「交差確認法」（ＣｒｏｓｓＶａｌｉｄａｔｉｏｎ）が挙げられる。 By the way, as described above, the optimum value of the width R of the Gaussian function, for example, lists several candidates for the width R of the Gaussian function, obtains the prediction error of the regression equation using each width R, and obtains the smallest prediction. The width R used in the regression equation giving the error can be adopted as the optimum value. A preferable specific example of such a procedure is, for example, a so-called “cross validation method” (Cross Validation).

即ち、本発明の第６の態様に係る方法は、
前記第５の態様に係る方法であって、
前記回帰式生成ステップにおいて、前記回帰式生成ステップの実行が初回である場合における前記ガウス関数の幅及び前記回帰式生成ステップの実行が初回ではない場合における前記ガウス関数の幅の少なくとも一部が、交差確認法によって決定されること、
を特徴とする方法である。 That is, the method according to the sixth aspect of the present invention is:
A method according to the fifth aspect,
In the regression equation generation step, at least a part of the width of the Gaussian function when the execution of the regression equation generation step is the first time and the width of the Gaussian function when the execution of the regression equation generation step is not the first time are: Determined by the cross-validation method,
It is the method characterized by this.

上記のように、交差確認法は、回帰式生成ステップの初回実行時にのみ適用してもよく、回帰式生成ステップの２回目以降のいずれかの実行時にも適用してもよい。あるいは、かかる交差確認法は、回帰式生成ステップの初回及び２回目以降の実行時のうち複数回において適用してもよく、あるいは、回帰式生成ステップの初回及び２回目以降の全ての実行時において適用してもよい。 As described above, the intersection confirmation method may be applied only at the first execution of the regression equation generation step, or may be applied at any one of the second and subsequent executions of the regression equation generation step. Alternatively, the intersection confirmation method may be applied multiple times during the first and second and subsequent executions of the regression formula generation step, or at the first and second and subsequent executions of the regression formula generation step. You may apply.

また、交差確認法は、例えば、先ず元のデータをｎ個のブロックに分割し、１つ目のブロックをテストデータ、その他のブロックをトレーニングデータとしてモデルの構築と精度の算出を行ない、次に２つ目のブロックをテストデータ、その他のブロックをトレーニングデータとしてモデル構築を行ない、かかる手続きをｎ回繰返し、各回において算出された精度の平均をモデルの推定精度とする方法であり、少ないデータであっても推定誤差が少くなることを特徴とする。 In the intersection confirmation method, for example, the original data is first divided into n blocks, the model is constructed and the accuracy is calculated using the first block as test data and the other blocks as training data. This is a method that builds a model using the second block as test data and the other blocks as training data, repeats this procedure n times, and uses the average accuracy calculated at each time as the estimated accuracy of the model. Even if it exists, it is characterized by a small estimation error.

かかる交差確認法の具体的な実行手順には複数の種類が知られており、例えば、「１つだけ別に取り出す交差確認法」又は「１つ抜き法（ＬＯＯ法）」（Ｌｅａｖｅ−ｏｎｅ−ｏｕｔＣｒｏｓｓＶａｌｉｄａｔｉｏｎ）、「１０分割交差確認法」（１０−ｆｏｌｄＣｒｏｓｓＶａｌｉｄａｔｉｏｎ）、「２分割交差確認法」（２−ｆｏｌｄＣｒｏｓｓＶａｌｉｄａｔｉｏｎ）等を挙げることができる。 A plurality of types of specific execution procedures of such a cross-confirmation method are known. For example, “a cross-confirmation method for taking out only one” or “one-out method (LOO method)” (Leave-one-out) Cross Validation), “10-fold Cross Validation”, “2-fold Cross Validation”, and the like.

以下に記載する実施例によって本発明を更に詳しく説明するが、本発明の技術的範囲は、これらの例に限定されるものではない。 The present invention will be described in more detail with reference to the following examples, but the technical scope of the present invention is not limited to these examples.

１．本発明の一態様に係る方法の計算時間及び回帰精度の検証
（１）人工データ
本発明の一態様に係る方法の有効性を、以下の人工データを使用して検証した。
ａ）入力ｘ_ｉを、｛０，３０００｝からランダムに１３００点サンプリングする。
ｂ）計測ノイズη_ｉを、正規分布Ｎ（０，σ^２）で生成させる（本実施例においてノイズを付加する場合、σ^２＝０．１５とした）。
ｃ）異常値５点に付加する値θ_ｉを、±０．０、±３．０、及び±５．０とする。
ｄ）異常値５点は、いずれも学習用データに含める。
ｅ）出力ｙ_ｉを、以下の式（Ｉ）で定義する。 1. Verification of calculation time and regression accuracy of method according to one aspect of the present invention (1) Artificial data The effectiveness of the method according to one aspect of the present invention was verified using the following artificial data.
a) The input x _i is sampled at 1300 points randomly from {0, 3000}.
b) The measurement noise η _i is generated with a normal distribution N (0, σ ² ) (when noise is added in this embodiment, σ ² = 0.15).
c) The values θ _i to be added to five abnormal values are ± 0.0, ± 3.0, and ± 5.0.
d) All five abnormal values are included in the learning data.
e) The output y _i is defined by the following formula (I).

（２）回帰式の計算時間の評価
上記人工データ１３００点からランダムに抽出した３００点を学習用データとし、ハードマージン（ＨａｒｄＭａｒｇｉｎ）問題を解いて学習を行い、回帰式を求めるのに要した計算時間を、以下に列挙するそれぞれの回帰方法について測定した。
ｉ）本発明の一態様に係る方法
ii）従来法（逐次的学習法）
iii)通常法（逐次的学習法を用いず、全てのテストデータを一括回帰）
尚、上記ｉ）乃至iii)のいずれの回帰方法においても、ガウシアンカーネルを用いるＳＶＲによって回帰式を求めた。また、上記ｉ）及びii）の回帰方法においては、学習用データを６つのグループに分割して回帰式を求め、その計算に要した時間を測定した。得られた結果につき、図３を参照しつつ以下に説明する。 (2) Evaluation of calculation time of regression equation It took 300 points randomly extracted from the above-mentioned artificial data 1300 points as learning data, and it was necessary to learn by solving a hard margin problem and to obtain a regression equation. The calculation time was measured for each regression method listed below.
i) Method according to one aspect of the invention
ii) Conventional method (sequential learning method)
iii) Ordinary method (batch regression of all test data without using sequential learning method)
In any of the above regression methods i) to iii), the regression equation was obtained by SVR using a Gaussian kernel. In the regression methods i) and ii), the learning data was divided into six groups to obtain a regression equation, and the time required for the calculation was measured. The obtained results will be described below with reference to FIG.

前述のように、図３は、本発明の１つの実施態様に係る回帰式生成方法、従来法（逐次的学習法）、及び通常法（一括回帰）における、異常値の混入による回帰式の計算時間への影響の程度を示すグラフである。尚、図３において、（ａ）は計測ノイズを付加しなかった場合、及び（ｂ）は計測ノイズを付加した場合の、異常値の混入による回帰式の計算時間への影響を示している。尚、（ｂ）において「ノイズ０．１５」とあるのは、前述のように、正規分布Ｎ（０，σ^２）で生成させる計測ノイズにおいて、σ^２＝０．１５としたことを示している。 As described above, FIG. 3 illustrates the calculation of the regression equation by mixing out abnormal values in the regression equation generation method, the conventional method (sequential learning method), and the normal method (collective regression) according to one embodiment of the present invention. It is a graph which shows the grade of the influence on time. In FIG. 3, (a) shows the influence on the calculation time of the regression equation due to mixing of abnormal values when measurement noise is not added and (b) when measurement noise is added. Incidentally, “noise 0.15” in (b) indicates that σ ² = 0.15 in the measurement noise generated by the normal distribution N (0, σ ² ) as described above. Yes.

図３に示されているように、本発明の一態様に係る回帰式生成方法は、異常値の混入の有無、計測ノイズの有無にかかわらず、通常法（一括回帰）と比較して、回帰式の計算時間が極めて短く、学習用データへの異常値の混入に伴う回帰式の計算時間の変動も著しく小さい。尚、従来法（逐次的学習法）もまた、本発明の一態様に係る回帰式生成方法と概ね同様の傾向を示したが、本発明の一態様に係る回帰式生成方法と比較すると、学習用データへの異常値の混入や、計測ノイズの付加に伴う回帰式の計算時間の変動が大きい。このように、本発明の一態様に係る回帰式生成方法は、通常法（一括回帰）と比較して、回帰式の計算時間が極めて短く、また従来法（逐次的学習法）及び通常法（一括回帰）と比較して、学習用データへの異常値や計測ノイズの混入に伴う回帰式の計算時間の変動も小さいことが確認された。 As shown in FIG. 3, the regression equation generation method according to one aspect of the present invention performs a regression compared with the normal method (collective regression) regardless of the presence of abnormal values and the presence or absence of measurement noise. The calculation time of the equation is extremely short, and the fluctuation of the calculation time of the regression equation accompanying the mixing of abnormal values into the learning data is also extremely small. In addition, the conventional method (sequential learning method) also showed a tendency similar to that of the regression equation generation method according to one aspect of the present invention. Fluctuations in the calculation time of the regression equation due to the addition of abnormal values to measurement data and the addition of measurement noise are large. As described above, the regression equation generation method according to one aspect of the present invention has an extremely short calculation time for the regression equation as compared with the normal method (collective regression), and the conventional method (sequential learning method) and the normal method ( Compared to (batch regression), it was confirmed that the fluctuation in the calculation time of the regression equation due to the inclusion of abnormal values and measurement noise in the learning data was also small.

（３）回帰式の精度の評価
残りの１０００点のデータをテストデータとして用いて、上記ｉ）乃至iii)のそれぞれの回帰方法（学習方法）について、テストデータに対する真の値と回帰値との相関係数を求めた。 (3) Evaluation of accuracy of regression equation Using the remaining 1000 points of data as test data, for each regression method (learning method) of i) to iii) above, the true value and the regression value of the test data Correlation coefficient was obtained.

尚、上記（２）及び（３）の評価は、それぞれの回帰方法について、異常値への付加値θｉ（±０．０、±３．０、及び±５．０の３種類）とノイズの有無との組み合わせ毎に１５回ずつ行い、相関係数及び計算時間のそれぞれの平均値を求めた。得られた結果につき、図４を参照しつつ以下に説明する。 The evaluations in (2) and (3) above are based on the values of the additional value θi (± 0.0, ± 3.0, and ± 5.0) to the abnormal value and noise for each regression method. The measurement was performed 15 times for each combination with and without, and the average values of the correlation coefficient and the calculation time were obtained. The obtained results will be described below with reference to FIG.

前述のように、図４は、本発明の１つの実施態様に係る回帰式生成方法、従来法（逐次的学習法）及び通常法（一括回帰）における、異常値の混入による回帰精度（相関係数）への影響の程度を示すグラフである。尚、図４において、（ａ）は計測ノイズを付加しなかった場合、及び（ｂ）は計測ノイズを付加した場合の、異常値の混入による相関係数への影響を示している。尚、（ｂ）において「ノイズ０．１５」とあるのは、前述のように、正規分布Ｎ（０，σ^２）で生成させる計測ノイズにおいて、σ^２＝０．１５としたことを示している。 As described above, FIG. 4 illustrates the regression accuracy (correlation) due to mixing of abnormal values in the regression equation generation method, the conventional method (sequential learning method), and the normal method (collective regression) according to one embodiment of the present invention. It is a graph which shows the grade of the influence on number). In FIG. 4, (a) shows the influence on the correlation coefficient due to mixing of abnormal values when no measurement noise is added, and (b) when measurement noise is added. Incidentally, “noise 0.15” in (b) indicates that σ ² = 0.15 in the measurement noise generated by the normal distribution N (0, σ ² ) as described above. Yes.

図４に示されているように、本発明の一態様に係る回帰式生成方法は、従来法（逐次的学習法）及び通常法（一括回帰）と比較して、学習用データへの異常値の混入に伴う相関係数の低下が著しく小さい。換言すれば、本発明の一態様に係る回帰式生成方法は、学習用データへの異常値の混入に対して頑健性が極めて高く、異常値が混入しても高い汎化性（精度）を維持することができる。また、本発明の一態様に係る回帰式生成方法は、従来法（逐次的学習法）及び通常法（一括回帰）と比較して、計測ノイズの有無による相関係数の変動も小さいことが確認された。 As shown in FIG. 4, the regression equation generation method according to one aspect of the present invention has an abnormal value in the learning data as compared with the conventional method (sequential learning method) and the normal method (collective regression). The decrease in correlation coefficient due to the mixing of is extremely small. In other words, the regression equation generation method according to one aspect of the present invention has extremely high robustness against the inclusion of abnormal values in the learning data, and has high generalization (accuracy) even when abnormal values are mixed. Can be maintained. Further, it is confirmed that the regression equation generation method according to one aspect of the present invention has a smaller variation in the correlation coefficient due to the presence or absence of measurement noise compared to the conventional method (sequential learning method) and the normal method (collective regression). It was done.

（４）まとめ
以上のように、本発明に係る回帰式生成方法は、通常法（一括回帰）と比較して、回帰式の計算時間が極めて短く、また従来法（逐次的学習法）及び通常法（一括回帰）と比較して、学習用データへの異常値の混入や計測ノイズの付与に対して、回帰精度の頑健性が極めて高い（相関係数の低下が著しく小さい）ことが確認された。従って、本発明に係る方法によれば、各種装置の制御における制御因子と制御結果（評価指標）との関係を表す回帰式を、従来技術に比べて極めて短い時間で算出することができ、また異常値や計測ノイズが混入しても、汎化性の高い回帰式を生成することができるので、精度の高い制御を可能とすることができる。 (4) Summary As described above, the regression equation generation method according to the present invention has an extremely short calculation time for the regression equation as compared with the ordinary method (collective regression), and the conventional method (sequential learning method) and the ordinary method. Compared to the method (batch regression), it is confirmed that the robustness of the regression accuracy is extremely high (the decrease in the correlation coefficient is extremely small) against the addition of abnormal values to the learning data and the addition of measurement noise. It was. Therefore, according to the method according to the present invention, the regression equation representing the relationship between the control factor and the control result (evaluation index) in the control of various devices can be calculated in a very short time compared to the conventional technique, Even if an abnormal value or measurement noise is mixed, a highly generalized regression equation can be generated, so that highly accurate control can be performed.

２．サンプルデータのグループの大きさと回帰式の計算時間及び精度
本実施例においては、車両の制御因子と対応する制御結果（評価指標）との関係を、実データから３５８４箇所についてサンプリングし、これらの中からランダムに選んだ５８８点のサンプルデータを学習用データとして回帰式を生成した。この際、これら５８８点の学習用データを分割して作成するグループの大きさ（データ点数）を表１に示されているように変化させ、本発明の一態様に係る方法による回帰式の計算時間の変化を調べた。 2. In this embodiment, the relationship between the control factors of the vehicle and the corresponding control results (evaluation indices) is sampled from actual data at 3,584 locations. A regression equation was generated using 588 points of sample data randomly selected from as learning data. At this time, the size (number of data points) of the group created by dividing the learning data of 588 points is changed as shown in Table 1, and the regression equation is calculated by the method according to one aspect of the present invention. The change of time was investigated.

次に、残りの２９９６点のデータを用いて、個々の回帰式の精度（平均誤差）についても調査した。尚、上記回帰方法の詳細については、前述の１（２）と同様とした。また、表１における「平均誤差」とは、回帰式に基づく算出値と実測値との差の絶対値を全データに亘って平均したものである。 Next, the accuracy (average error) of each regression equation was also investigated using the remaining 2996 points. The details of the regression method were the same as in 1 (2) above. The “average error” in Table 1 is an average of the absolute value of the difference between the calculated value based on the regression equation and the actually measured value over all data.

表１から明らかなように、本発明に係る方法に従って、サンプルデータを複数のグループに分け、それぞれのグループについて回帰式を計算することにより、全てのサンプルデータを一括回帰する通常法と比較して、回帰式の計算時間は大幅に短縮された。しかし、サンプルデータをより多くのグループに分けるほど（換言すれば、個々のグループに属するサンプルデータ数が少なくなるほど）回帰精度が低下する（平均誤差が大きくなる）傾向が認められた。 As is clear from Table 1, according to the method according to the present invention, the sample data is divided into a plurality of groups, and a regression equation is calculated for each group, so that all sample data are compared with the usual method for performing a collective regression. The calculation time for the regression equation was greatly reduced. However, as the sample data was divided into a larger number of groups (in other words, the smaller the number of sample data belonging to each group), the regression accuracy tended to decrease (average error increased).

従って、前述のように、サンプルデータのグループ分けに伴う計算速度の短縮の程度と比較すると、回帰精度の低下の程度は極僅かであるが、極めて高い精度が要求される場合は、求められる精度に応じて、サンプルデータのグループの個数（大きさ）を調整することにより、トレードオフの関係にある計算速度と回帰精度とのバランスを適宜とることができる。 Therefore, as described above, the degree of decrease in regression accuracy is negligible compared to the degree of reduction in calculation speed associated with grouping of sample data, but the required accuracy is required when extremely high accuracy is required. Accordingly, by adjusting the number (size) of groups of sample data, it is possible to appropriately balance the calculation speed and the regression accuracy in a trade-off relationship.

尚、本実施例においては車両の制御について検証したが、前述のように、本発明は、許容可能な範囲にある制御結果を得るのに好適な制御因子の目標値を、回帰式を利用して算出し、斯くして得られた目標値に制御因子を制御することによって、所望の制御結果を得ようとするものであれば、如何なる装置にも適用可能である。 In the present embodiment, the control of the vehicle has been verified, but as described above, the present invention uses a regression equation to obtain a target value of a control factor suitable for obtaining a control result within an allowable range. Any device can be applied as long as the desired control result is obtained by controlling the control factor to the target value thus obtained.

１．初回の回帰式のガウス関数の幅Ｒの決定
本発明に係る回帰式生成方法において回帰式の基底関数としてガウス関数を用いる場合も、基本的には図２に示すフローチャートに従って一連の処理が実行される。但し、回帰式の基底関数としてガウス関数を用いるため、ガウス関数の幅Ｒを決定する処理が更に必要となる。具体的には、初回の回帰実行時にはステップ４１に付随してガウス関数の幅Ｒを決定し、２回目以降の回帰実行時にはステップ５１に付随してガウス関数の幅Ｒを決定する。これらのステップにおいてガウス関数の幅Ｒを決定する方法は、前述のような種々の方法から選ぶことができ、一例としては、交差確認法が挙げられる。 1. Determination of the width R of the Gaussian function of the first regression equation In the method of generating a regression equation according to the present invention, when a Gaussian function is used as the basis function of the regression equation, a series of processing is basically executed according to the flowchart shown in FIG. The However, since a Gaussian function is used as the basis function of the regression equation, a process for determining the width R of the Gaussian function is further required. Specifically, the Gaussian function width R is determined at the time of the first regression execution, and the Gaussian function width R is determined at the time of the second and subsequent regression executions. The method for determining the width R of the Gaussian function in these steps can be selected from various methods as described above, and an example is an intersection confirmation method.

ところで、本発明に係る回帰式生成方法において回帰式の基底関数としてガウス関数を用いる場合には、前述のように、回帰式生成ステップの初回実行時には、複数のグループの中の１つのグループに属するサンプルデータのみに基づいてガウス関数の幅（Ｒ）が求められることになる。しかしながら、実際には、このように一部のサンプルデータのみに基づいてガウス関数の幅（Ｒ）を求めても、最適なガウス関数の幅（Ｒ）を決定することができる。この点につき、図５を参照しながら、以下に詳しく説明する。 By the way, when the Gaussian function is used as the basis function of the regression formula in the regression formula generation method according to the present invention, as described above, when the regression formula generation step is executed for the first time, it belongs to one group among a plurality of groups. The width (R) of the Gaussian function is obtained based only on the sample data. However, in practice, the optimum Gaussian function width (R) can be determined by obtaining the Gaussian function width (R) based on only a part of the sample data. This point will be described in detail below with reference to FIG.

前述のように、図５は、本発明の１つの実施態様に係る回帰式生成方法において、回帰式の基底関数として用いられるガウス関数の幅Ｒと予測誤差との関係を、一部のサンプルデータについて求めた場合と全てのサンプルデータについて求めた場合とで比較するグラフである。この例においては、５００点のサンプルデータを用意し、このうち３２９点をトレーニングデータ、１７１点を汎化性能確認用データとした。幅Ｒの最適値を決定する方法としては、前述の「１０分割交差確認法」（１０−ｆｏｌｄＣｒｏｓｓＶａｌｉｄａｔｉｏｎ）を採用した。また、幅Ｒの候補としては、１、２、３、４、及び５の５つの値を用意した。 As described above, FIG. 5 shows the relationship between the prediction error and the width R of the Gaussian function used as the basis function of the regression equation in the regression equation generation method according to one embodiment of the present invention. It is a graph compared with the case where it calculates | requires about and the case where it asks about all the sample data. In this example, 500 points of sample data are prepared, of which 329 points are training data and 171 points are generalization performance confirmation data. As a method for determining the optimum value of the width R, the aforementioned “10-fold cross validation method” (10-fold Cross Validation) was adopted. Further, five values of 1, 2, 3, 4, and 5 were prepared as candidates for the width R.

先ず、本発明の１つの実施態様に係る回帰式生成方法の例として、上記トレーニングデータ３２９点のうち１００点のみを用いて初回の回帰式を求め、汎化性能確認用データ１７１点を用いて、それぞれの幅Ｒの候補値につき、予測誤差（汎化性能推定値）を求めた（グラフ中、黒い四角のプロット）。これに対し、参考例として、全てのトレーニングデータ３２９点を用いて、それぞれの幅Ｒの候補値につき、上記と同様に予測誤差（汎化性能推定値）を求めた（グラフ中、黒い丸のプロット）。 First, as an example of a regression equation generation method according to one embodiment of the present invention, an initial regression equation is obtained using only 100 points of the training data 329 points, and generalization performance confirmation data 171 points are used. For each candidate value of width R, a prediction error (generalized performance estimation value) was obtained (black square plot in the graph). On the other hand, as a reference example, using all the training data 329 points, the prediction error (generalized performance estimation value) was obtained in the same manner as described above for each candidate value of the width R (the black circle in the graph). plot).

図５に示すグラフからも明らかであるように、トレーニングデータを１００点しか用いなかった本発明の１つの実施態様に係る回帰式生成方法の例においては、全てのトレーニングデータ３２９点を用いた参考例と比較して、全ての幅Ｒの候補値において、予測誤差の値そのものは大きい。しかしながら、トレーニングデータを１００点しか用いなかった場合も、全てのトレーニングデータ３２９点を用いた場合も、予測誤差が最小となる最適な幅Ｒの値は同じ（３）である。 As is clear from the graph shown in FIG. 5, in the example of the regression equation generation method according to one embodiment of the present invention in which only 100 points of training data are used, the reference using all 329 points of training data is used. Compared to the example, the prediction error value itself is large in all the candidate values of the width R. However, the optimal value of the width R that minimizes the prediction error is the same (3) whether only 100 points of training data are used or 329 points of all training data are used.

上記のように、本発明の１つの実施態様に係る回帰式生成方法において、一部のサンプルデータのみに基づいてガウス関数の幅（Ｒ）を求めても、より少ない処理量（作業量、計算量）で、最適なガウス関数の幅（Ｒ）を決定することができることが確認された。 As described above, in the regression equation generation method according to one embodiment of the present invention, even if the width (R) of the Gaussian function is obtained based only on a part of sample data, a smaller amount of processing (amount of work, calculation) It has been confirmed that the optimum width (R) of the Gaussian function can be determined.

２．２回目以降の回帰式のガウス関数の幅Ｒの決定（１）
前述のように、本発明の１つの実施態様に係る回帰式生成方法の回帰式生成ステップにおいて、回帰式生成ステップの実行が初回である場合におけるガウス関数の幅と比較して、回帰式生成ステップの実行が初回ではない場合におけるガウス関数の幅がより小さい。即ち、本発明の１つの実施態様に係る回帰式生成方法における２回目以降の回帰式のガウス関数の幅Ｒとしては、初回の回帰式のガウス関数の幅Ｒよりも小さい値が用いられる。これにより、より少ない処理量（作業量、計算量）で、２回目以降の回帰実行時の幅Ｒを決定することができ、斯くして生成される回帰式の精度も向上する。この点につき、図６を参照しながら、以下に詳しく説明する。 2.2 Determining the width R of the Gaussian function of the regression equation after the first time (1)
As described above, in the regression formula generation step of the regression formula generation method according to one embodiment of the present invention, the regression formula generation step is compared with the width of the Gaussian function when the regression formula generation step is executed for the first time. The width of the Gaussian function is smaller when execution of is not the first time. That is, a value smaller than the width R of the Gaussian function of the first regression equation is used as the width R of the Gaussian function of the regression equation for the second and subsequent times in the regression equation generation method according to one embodiment of the present invention. Thereby, the width R at the time of the second or subsequent regression execution can be determined with a smaller amount of processing (work amount, calculation amount), and the accuracy of the regression equation thus generated is also improved. This point will be described in detail below with reference to FIG.

前述のように、図６は、本発明の１つの実施態様に係る回帰式生成方法における初回の回帰式のガウス関数の幅Ｒと２回目の回帰式のガウス関数の幅Ｒとの組み合わせによる回帰式の予測精度（汎化性能推定値）の変化を表すグラフである。より詳しくは、図６のグラフにおいて、（ａ）はサンプル点１（０，１）及びサンプル点２（３，−１）を用いて得た初回の回帰式Ｆ（実線）を表す。ここで、サンプル点の座標は（位置，値）を表す。即ち、（ａ）に示す初回の回帰においては、位置が０及び３であり、値がそれぞれ＋１及び−１である２つのサンプル点を用いた。（ａ）に示すように、初回の回帰式のガウス関数の幅Ｒは３である。ここで、Ｆ１はサンプル点１のみについての回帰式を表し、Ｆ２はサンプル点２のみについての回帰式を表す。即ち、初回の回帰式Ｆは、Ｆ１（点線）とＦ２（破線）との重ね合わせである。 As described above, FIG. 6 shows a regression based on a combination of the Gaussian function width R of the first regression equation and the Gaussian function width R of the second regression equation in the regression equation generation method according to one embodiment of the present invention. It is a graph showing the change of the prediction precision (generalization performance estimated value) of a type | formula. More specifically, in the graph of FIG. 6, (a) represents the first regression equation F (solid line) obtained using sample point 1 (0, 1) and sample point 2 (3, -1). Here, the coordinates of the sample points represent (position, value). That is, in the first regression shown in (a), two sample points with positions 0 and 3 and values +1 and −1 were used. As shown in (a), the width R of the Gaussian function of the first regression equation is 3. Here, F1 represents a regression equation for only sample point 1, and F2 represents a regression equation for only sample point 2. That is, the first regression equation F is an overlay of F1 (dotted line) and F2 (dashed line).

次に、２回目の回帰においては、位置が１及び２であり、値がそれぞれ初回の回帰式Ｆからの残差が＋０．２及び−０．２である２つのサンプル点３及び４を用いた。尚、図６における（ｂ）及び（ｃ）はいずれも２回目の回帰式を表し、初回の回帰式Ｆは細い方の実線で、２回目の回帰式Ｆは太い方の実線で表されている。また、ｒｅｓ１及びｒｅｓ２は、それぞれサンプル点３及びサンプル点４と初回の回帰式Ｆとの残差についての回帰式を表し、２回目の回帰式Ｆは、初回の回帰式Ｆとｒｅｓ１及びｒｅｓ２との重ね合わせである。 Next, in the second regression, two sample points 3 and 4 whose positions are 1 and 2 and whose values are the residuals from the first regression equation F are +0.2 and −0.2, respectively. It was. Note that (b) and (c) in FIG. 6 both represent the second regression equation, the first regression equation F is represented by a thin solid line, and the second regression equation F is represented by a thick solid line. Yes. Also, res1 and res2 represent regression equations for the residuals of the sample points 3 and 4 and the first regression equation F, respectively, and the second regression equation F is the first regression equation F and the res1 and res2 It is a superposition of.

但し、（ｂ）と（ｃ）とでは、ガウス関数の幅Ｒがそれぞれ１及び３と異なる値に設定されている。即ち、（ｂ）においては２回目の回帰式のガウス関数の幅Ｒが初回の回帰式のガウス関数の幅Ｒよりも小さいことから、本発明の１つの実施態様に係る回帰式生成方法に該当し、（ｃ）においては２回目の回帰式のガウス関数の幅Ｒが初回の回帰式のガウス関数の幅Ｒと等しいことから、比較例に係る回帰式生成方法に該当する。 However, in (b) and (c), the width R of the Gaussian function is set to a value different from 1 and 3, respectively. That is, in (b), since the width R of the Gaussian function of the second regression equation is smaller than the width R of the Gaussian function of the first regression equation, it corresponds to the regression equation generation method according to one embodiment of the present invention. In (c), since the width R of the Gaussian function of the second regression equation is equal to the width R of the Gaussian function of the first regression equation, this corresponds to the regression equation generation method according to the comparative example.

図６における（ｂ）と（ｃ）との比較からも明らかであるように、初回の回帰式Ｆ（細い実線）とｒｅｓ１（点線）及びｒｅｓ２（破線）との重ね合わせである２回目の回帰式Ｆ（太い実線）の、２回目のサンプル点３及び４に対する精度は、（ｃ）よりも（ｂ）の方が良好である。これは、ｒｅｓ１及びｒｅｓ２の周囲への影響度の違いに起因するものである。即ち、初回回帰実行時の幅Ｒと同じ値を用いた（ｃ）においては、一方のサンプル点の影響が他方のサンプル点の位置にまで及んでおり、このために回帰式の精度が低下している。これに対し、初回回帰実行時よりも小さい値を幅Ｒとして用いた（ｂ）においては、一方のサンプル点から他方のサンプル点にまで及ぶ影響が小さく、その結果、２回目の回帰式のサンプル点からの乖離が小さい（回帰式の精度が高い）。 As is clear from the comparison between (b) and (c) in FIG. 6, the second regression, which is an overlay of the first regression equation F (thin solid line) and res1 (dotted line) and res2 (dashed line). The accuracy of Formula F (thick solid line) for the second sample points 3 and 4 is better in (b) than in (c). This is due to the difference in the degree of influence on the surroundings of res1 and res2. In other words, in (c) using the same value as the width R at the time of the first regression, the influence of one sample point extends to the position of the other sample point, which reduces the accuracy of the regression equation. ing. On the other hand, in (b) using a smaller value than the first regression execution as the width R, the influence from one sample point to the other sample point is small, and as a result, the sample of the second regression equation The deviation from the point is small (the accuracy of the regression equation is high).

上記のように、本発明の１つの実施態様に係る回帰式生成方法の回帰式生成ステップにおいて、２回目以降の回帰式のガウス関数の幅Ｒとして、初回の回帰式のガウス関数の幅Ｒよりも小さい値を用いることにより、２回目以降の回帰式の精度が向上することが確認された。また、２回目以降の回帰式のガウス関数の幅Ｒとして、初回の回帰式のガウス関数の幅Ｒよりも大きい値は候補として上げる必要性が無くなることから、より少ない処理量（作業量、計算量）で、２回目以降の回帰実行時の幅Ｒを決定することができる。 As described above, in the regression equation generation step of the regression equation generation method according to one embodiment of the present invention, as the width R of the Gauss function of the regression equation for the second and subsequent times, from the width R of the Gauss function of the first regression equation It was confirmed that the accuracy of the regression equation after the second time is improved by using a small value. Further, since the value R larger than the width R of the Gaussian function of the first regression equation as the width R of the Gaussian function of the regression equation after the second time does not need to be raised as a candidate, a smaller amount of processing (work load, calculation) Quantities) can determine the width R for the second and subsequent regression runs.

３．２回目以降の回帰式のガウス関数の幅Ｒの決定（２）
上述のように、２回目以降の回帰式のガウス関数の幅Ｒを初回の回帰式のガウス関数の幅Ｒよりも小さく設定することにより、より少ない処理量（作業量、計算量）で、２回目以降の回帰実行時の幅Ｒを決定することができるばかりでなく、２回目以降の回帰式の精度を向上させることができる。そこで、初回の回帰式のガウス関数の幅Ｒと２回目以降の回帰式のガウス関数の幅Ｒとの様々な組み合わせに対して、結果として得られる回帰式の精度を評価した。かかる評価結果について、図７を参照しながら以下に詳しく説明する。 3. Determination of the width R of the Gaussian function of the regression equation after the second time (2)
As described above, by setting the width R of the Gaussian function of the regression equation for the second and subsequent times to be smaller than the width R of the Gaussian function of the initial regression equation, the processing amount (work amount, calculation amount) can be reduced to 2 Not only can the width R at the time of the regression execution after the first time be determined, but also the accuracy of the regression equations after the second time can be improved. Accordingly, the accuracy of the resulting regression equation was evaluated for various combinations of the width R of the Gaussian function of the first regression equation and the width R of the Gaussian function of the regression equation for the second and subsequent regression equations. The evaluation result will be described in detail below with reference to FIG.

前述のように、図７は、本発明の１つの実施態様に係る回帰式生成方法における初回の回帰式のガウス関数の幅Ｒと２回目以降の回帰式のガウス関数の幅Ｒとの組み合わせによる回帰式の予測精度（汎化性能推定値）の変化を二次元的に表すコンタ図である。図７に示す白い実線は、初回の回帰式のガウス関数の幅Ｒと２回目以降の回帰式のガウス関数の幅Ｒとが１対１となる（即ち、等しい）状態を表している。図７に示すように、この白い実線よりも下側の領域（即ち、初回の幅Ｒよりも２回目以降の幅Ｒの方が小さい領域）においては、結果として得られる回帰式の精度が高い（平均誤差が小さい）。これに対し、上記白い実線よりも上側の領域（即ち、初回の幅Ｒよりも２回目以降の幅Ｒの方が大きい領域）においては、結果として得られる回帰式の精度が低い（平均誤差が大きい）。 As described above, FIG. 7 shows a combination of the width R of the Gaussian function of the first regression equation and the width R of the Gaussian function of the regression equation for the second and subsequent times in the regression equation generation method according to one embodiment of the present invention. It is a contour figure which represents the change of the prediction accuracy (generalization performance estimated value) of a regression equation two-dimensionally. The white solid line shown in FIG. 7 represents a state in which the width R of the Gaussian function of the first regression equation and the width R of the Gaussian function of the second and subsequent regression equations are in a one-to-one relationship (that is, equal). As shown in FIG. 7, in the region below the white solid line (that is, the region where the second and subsequent widths R are smaller than the initial width R), the accuracy of the resulting regression equation is high. (Average error is small). On the other hand, in the region above the white solid line (that is, the region in which the width R after the first time is larger than the first width R), the accuracy of the resulting regression equation is low (average error is smaller). large).

ここで、図７における白い一点鎖線は、初回の回帰式のガウス関数の幅Ｒが４である状態を表している。即ち、この一点鎖線上では、初回の回帰式のガウス関数の幅Ｒが４で固定されており、２回目以降の回帰式のガウス関数の幅Ｒが種々の値となっている状態を表す。この一点鎖線上で２回目以降の幅Ｒを様々な値に変更した際の平均誤差の変動を以下の表２に示す。 Here, the white alternate long and short dash line in FIG. 7 represents a state where the width R of the Gaussian function of the first regression equation is 4. That is, on the one-dot chain line, the width R of the Gaussian function of the first regression equation is fixed at 4, and the width R of the Gaussian function of the regression equation for the second and subsequent times represents various values. Table 2 below shows changes in the average error when the width R after the second time is changed to various values on the one-dot chain line.

表２に示す結果からも明らかであるように、２回目以降の回帰式のガウス関数の幅Ｒを初回の回帰式のガウス関数の幅Ｒよりも小さく設定することにより、結果として得られる回帰式の精度を向上させることができる。また、２回目以降の回帰式のガウス関数の幅Ｒとして、初回の回帰式のガウス関数の幅Ｒよりも大きい値は候補として上げる必要性が無くなることから、より少ない処理量（作業量、計算量）で、２回目以降の回帰実行時の幅Ｒを決定することができるという利点も得られる。 As is clear from the results shown in Table 2, the regression equation obtained as a result is obtained by setting the width R of the Gaussian function of the regression equation for the second and subsequent times smaller than the width R of the Gaussian function of the first regression equation. Accuracy can be improved. Further, since the value R larger than the width R of the Gaussian function of the first regression equation as the width R of the Gaussian function of the regression equation after the second time does not need to be raised as a candidate, a smaller amount of processing (work load, calculation) The amount R) of the width R at the second and subsequent regression executions can be determined.

以上説明してきたように、本発明によれば、サンプルデータ数が膨大である場合でも回帰式を生成するのに必要とされる計算時間が著しく増大することが無く、かつ、サンプルデータにおける異常値混入の影響を受け難い、回帰式の生成方法が提供される。従って、本発明によれば、従来の回帰方法と比較して、膨大なサンプルデータを用いる場合であっても大幅に短い計算時間で、かつ異常値の混入があっても高い精度を維持したままで、回帰式を生成することができる。本発明にかかる方法は、例えば車両等の各種装置の制御における制御因子特定用回帰式の生成において有用である。 As described above, according to the present invention, even when the number of sample data is enormous, the calculation time required to generate the regression equation does not increase remarkably, and an abnormal value in the sample data is obtained. Provided is a method for generating a regression equation that is not easily affected by contamination. Therefore, according to the present invention, compared with the conventional regression method, even when a large amount of sample data is used, the calculation time is significantly shortened and high accuracy is maintained even if abnormal values are mixed. Thus, a regression equation can be generated. The method according to the present invention is useful in generating a regression equation for specifying a control factor in controlling various devices such as a vehicle.

更に、本発明によれば、回帰式の基底関数としてガウス関数を用いる場合においても、回帰式生成ステップにおいて、回帰式生成ステップの実行が初回ではない（即ち、２回目以降である）場合におけるガウス関数の幅（Ｒ）を、回帰式生成ステップの実行が初回である場合におけるガウス関数の幅（Ｒ）よりも小さい値に設定することにより、より少ない処理量（作業量、計算量）で２回目以降の回帰実行時の幅（Ｒ）を決定することができるのみならず、結果として得られる回帰式の精度を向上させることができる。 Furthermore, according to the present invention, even when a Gaussian function is used as a basis function of a regression equation, a Gaussian in a case where the regression equation generation step is not executed for the first time (that is, after the second time) in the regression equation generation step. By setting the function width (R) to a value smaller than the width (R) of the Gaussian function when the regression equation generation step is executed for the first time, the processing amount (work amount, calculation amount) is reduced to 2 Not only can the width (R) for the regression execution after the first time be determined, but also the accuracy of the resulting regression equation can be improved.

１０本発明の１つの実施態様に係る回帰式生成方法の処理フロー
１１因子空間上での中心サンプルデータを選出するステップ
１２サンプルデータをソートし、グループ分けするステップ
２０回帰回数ループ入口
３０初回の回帰であるか否かによる条件分岐
４１因子と評価指標との関係を表す回帰式を求めるステップ
５１既存の回帰式からの誤差を個々のサンプルデータについて算出するステップ
５２算出された誤差と制御因子との関係を表す回帰式を求めるステップ
６０ステップ５２において求められた回帰式を既存の回帰式に加算して新たな回帰式とするステップ
７０回帰回数ループ出口 10 Process Flow of Regression Formula Generation Method According to One Embodiment of the Present Invention 11 Step of Selecting Center Sample Data on Factor Space Step 12 Sorting and Grouping Sample Data Step 20 Regression Loop Inlet 30 Initial Regression Conditional branch 41 depending on whether or not a step 41 A step of obtaining a regression equation representing the relationship between the factor and the evaluation index 51 A step of calculating an error from the existing regression equation for each sample data 52 A step of calculating the error and the control factor Step 60 for obtaining a regression equation representing the relationship Step 70: adding the regression equation obtained in Step 52 to the existing regression equation to obtain a new regression equation Step 70 Regression loop exit

Claims

A method of generating a regression equation representing a relationship between a control factor of a device to be controlled and an evaluation index corresponding to the control factor,
A sample data dividing step for dividing a plurality of sample data including information on control factors and evaluation indices into a plurality of groups;
A regression equation generating step for generating a regression equation based on sample data belonging to one of a plurality of groups divided in the sample data dividing step,
If the execution of this step is the first time, a regression equation representing the relationship between the control factor and the evaluation index included in the sample data belonging to the group is obtained,
When the execution of the step is not the first time, the result obtained from the control factor included in the sample data belonging to the group based on the regression equation obtained by the previous execution of the step and the sample data belonging to the group To calculate an error with the evaluation index included in and to obtain a regression equation representing the relationship between the error and the control factor,
Regression equation generation step ,
The regression equation determined in those said step when executed is a first-time before Symbol regression equation generating step as a new regression equation, if the execution of the step is not the first time obtained in the previous SL regression equation generating step and existing regression equation determined by a previous execution of the regression equation and those said step were additively superimposed to generate a new regression equation,
Regression equation addition step,
Only including,
A method of repeating the regression equation generation step and the regression equation addition step until execution of the regression equation generation step and the regression equation addition step is completed for all groups divided in the sample data division step .

2. The method according to claim 1, wherein a regression equation is obtained by support vector regression in the regression equation generation step.

3. The method according to claim 1, wherein in the sample data dividing step, the sample data is divided into groups so that sample data existing in a specific range on the control factor space does not exist in a specific group. A method characterized by sorting evenly.

4. The method according to claim 1, wherein the device to be controlled is a vehicle.

A method according to any one of claims 1 to 4, wherein
The regression equation uses a Gaussian function as a basis function, and in the regression equation generation step, compared to the width of the Gaussian function when the regression equation generation step is executed for the first time, the regression equation generation step The width of the Gaussian function is smaller when the execution is not the first time,
A method characterized by.

6. A method according to claim 5, wherein
In the regression equation generation step, at least a part of the width of the Gaussian function when the execution of the regression equation generation step is the first time and the width of the Gaussian function when the execution of the regression equation generation step is not the first time are: Determined by the cross-validation method,
A method characterized by.