JPH0973438A - Device and method for statistical learning, and information storage medium

Info

Publication number: JPH0973438A
Application number: JP8099740A
Authority: JP
Inventors: Kenji Fukumizu; 健次福水
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1995-06-30
Filing date: 1996-04-22
Publication date: 1997-03-18

Abstract

PROBLEM TO BE SOLVED: To enable a regression curve estimation part to estimate an unknown system with minimum error by generating data (x) that minimizes the learning error of the regression curve estimation part and setting it in a learning data output part. SOLUTION: To estimate the unknown system 7 as a regression curve 'E[p(y|x)]', the regression curve estimation part 3 has a linear model 'f(x;θ)=Σ_i θ_i f_i(x)' with an M-dimensional parameter θ preset in a model storage part 8, and has a parameter estimation part 9 which estimates the M-dimensional parameter θ of the linear model. A learning data output part 5 outputs the vector (x) input to the unknown system 7 and the vector (y) output in response to the regression curve estimation part 3 as learning data (x, y), and a learning data generation part 6 generates the vector (x) minimizing the learning error of the regression curve estimation part 3 and sets it in the learning data output part 5. The regression curve estimation part 3 can therefore estimate with minimum error the unknown system 7, which outputs a K-dimensional vector (y) in response to an L-dimensional vector (x) by evaluating a function and adding Gaussian noise.

Description

Detailed Description of the Invention

【０００１】[0001]

TECHNICAL FIELD The present invention relates to a statistical learning device, a statistical learning method, and an information storage medium.

【０００２】[0002]

2. Description of the Related Art Statistical learning devices that estimate, from the outside, an unknown system functioning as a black box are currently being studied. Such a statistical learning device has, for example, a regression curve estimation unit and a learning data output unit, and the regression curve estimation unit estimates the unknown system from the learning data output by the learning data output unit.

[0003] More specifically, when the unknown system outputs data y in response to input data x by evaluating a function and adding Gaussian noise, a linear model "f(x;θ)=Σ_i θ_i f_i(x)" with M terms is set in the regression curve estimation unit that estimates this unknown system. The learning data output unit inputs data x to the unknown system, collects the output data y, and outputs each pair (x, y) to the regression curve estimation unit as learning data. The regression curve estimation unit applies maximum likelihood estimation or a similar method, using the linear model, to the input learning data (x, y), and thereby estimates the unknown system as the regression curve "E[p(y|x)]".
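The estimation step in this paragraph can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes scalar outputs, Gaussian noise of constant variance (in which case maximum likelihood estimation coincides with least squares), and a hypothetical basis f_1(x)=1, f_2(x)=x. The names `solve` and `fit_linear_model` are introduced here for illustration only.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))  # partial pivoting
        m[c], m[p] = m[p], m[c]
        for r in range(n):
            if r != c:
                k = m[r][c] / m[c][c]
                m[r] = [u - k * w for u, w in zip(m[r], m[c])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_linear_model(basis, xs, ys):
    """Least-squares estimate of theta for f(x; theta) = sum_i theta_i * f_i(x)."""
    phi = [[f(x) for f in basis] for x in xs]  # design matrix, one row per datum
    m = len(basis)
    gram = [[sum(row[a] * row[b] for row in phi) for b in range(m)] for a in range(m)]
    rhs = [sum(row[a] * y for row, y in zip(phi, ys)) for a in range(m)]
    return solve(gram, rhs)  # normal equations: (Phi^T Phi) theta = Phi^T y


# Hypothetical basis (an assumption, not taken from the patent): f_1(x) = 1, f_2(x) = x.
basis = [lambda x: 1.0, lambda x: x]
# Noise-free learning data generated from y = 1 + 2x.
theta = fit_linear_model(basis, [0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
```

Solving the normal equations directly keeps the sketch dependency-free; with many basis functions a numerically more stable solver would be the safer choice.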

[0004] When data x from the one-dimensional real line is input to the unknown system, it is also possible to set a polynomial with an M-dimensional parameter θ in an average estimation unit and to estimate the unknown system with this polynomial. Such a polynomial is set, for example, as follows.

【０００５】[0005]

【数７】 (Equation 7)

【０００６】[0006]

PROBLEMS TO BE SOLVED BY THE INVENTION In a statistical learning device as described above, the unknown system can be estimated well provided that suitable learning data are input to the estimation unit.

[0007] However, because suitable learning data could not be created, an enormous amount of learning data had to be input in order to estimate the unknown system adequately.

【０００８】[0008]

MEANS FOR SOLVING THE PROBLEMS The statistical learning device of the invention according to claim 1 comprises: a learning data output unit that, when an unknown system outputs data y in response to input data x by evaluating a function and adding Gaussian noise, outputs the data x input to the unknown system and the resulting data y as learning data (x, y); and a regression curve estimation unit that, from the input learning data (x, y), estimates the unknown system as a regression curve "E[p(y|x)]" using a linear model "f(x;θ)=Σ_i θ_i f_i(x)" with an M-dimensional parameter θ. The device further comprises a learning data creation unit that creates data x minimizing the learning error of the regression curve estimation unit and sets it in the learning data output unit.

[0009] In the present invention, data x, data y, and the like denote numerical data such as vectors and scalars.

[0010] In the statistical learning device of the invention according to claim 2, based on the statistical learning device of claim 1, the learning data creation unit has: an input distribution holding unit that holds a probability density function "r(x;v)", with parameter v, for generating the data x of the learning data (x, y); an estimator holding unit that holds an estimator "hat q(x)" of the density function of the distribution of the data x input from outside; and an error minimization unit that uses the density function estimator "hat q(x)" to calculate a parameter v for which the estimated learning error E(v) of the regression curve estimation unit becomes small when the data x of the learning data (x, y) are generated according to the probability density function "r(x;v)". The calculated parameter v is set in the probability density function "r(x;v)" and the data x of the learning data (x, y) are generated.

[0011] In the text of this specification, "hat A" is a character-code rendering of "A" with "^" added above it; in the equations, which were entered as images, it is written in the usual way.

[0012] In the statistical learning device of the invention according to claim 3, based on the statistical learning device of claim 2, a first matrix calculation unit is provided that uses the density function estimator "hat q(x)" to calculate the matrix

【００１３】[0013]

【数８】 (Equation 8)

[0014] and a second matrix calculation unit is provided that uses the probability density function "r(x;v)" to calculate the matrix

【００１５】[0015]

【数９】 [Equation 9]

[0016] and the error minimization unit calculates the estimated learning error E(v)

【００１７】[0017]

【数１０】 (Equation 10)

[0018] as given by the above expression.

[0019] In the statistical learning device of the invention according to claim 4, based on the statistical learning device of claim 3, the error minimization unit calculates the parameter v that minimizes the estimated learning error E(v) by an iterative method that uses the gradient direction.

[0020] In the statistical learning device of the invention according to claim 5, based on the statistical learning device of claim 4, the input distribution holding unit has the probability density function "r(x;v)" set as a discrete distribution on at most "M(M+1)/2" points.

[0021] In the statistical learning device of the invention according to claim 6, based on the statistical learning device of claim 4, the input distribution holding unit has the probability density function "r(x;v)" set as a discrete distribution whose number of points equals the number of functions of the form

【００２２】[0022]

【数１１】 [Equation 11]

[0023] that are linearly independent over the real numbers.

[0024] The statistical learning device of the invention according to claim 7 comprises: a learning data output unit that, when an unknown system outputs data y in response to input data x from the one-dimensional real line by evaluating a function and adding Gaussian noise, outputs the data x input to the unknown system and the resulting data y as learning data (x, y); and an average estimation unit that, from the input learning data (x, y), estimates the unknown system using a polynomial with an M-dimensional parameter θ, namely

【００２５】[0025]

【数１２】 (Equation 12)

[0026] The device further comprises a learning data creation unit that, according to a preset mixture of two distributions, creates data x minimizing the learning error of the average estimation unit and sets it in the learning data output unit.

[0027] In the statistical learning method of the invention according to claim 8, when an unknown system outputs data y in response to input data x by evaluating a function and adding Gaussian noise, a learning data output unit outputs the data x input to the unknown system and the resulting data y as learning data (x, y), and a regression curve estimation unit estimates the unknown system from the input learning data (x, y) as a regression curve "E[p(y|x)]" using a linear model "f(x;θ)=Σ_i θ_i f_i(x)" with an M-dimensional parameter θ. In this method, a learning data creation unit creates data x minimizing the learning error of the regression curve estimation unit and sets it in the learning data output unit.

[0028] In the statistical learning method of the invention according to claim 9, when an unknown system outputs data y in response to input data x from the one-dimensional real line by evaluating a function and adding Gaussian noise, a learning data output unit outputs the data x input to the unknown system and the resulting data y as learning data (x, y), and an average estimation unit estimates the unknown system from the input learning data (x, y) using a polynomial with an M-dimensional parameter θ, namely

【００２９】[0029]

【数１３】 (Equation 13)

[0030] In this method, a learning data creation unit creates data x minimizing the learning error of the average estimation unit according to a preset mixture of two distributions and sets it in the learning data output unit.

[0031] The information storage medium of the invention according to claim 10 is an information storage medium on which software for operating a microcomputer is written, wherein the functions of the statistical learning device according to claim 1, 2, 3, 4, 5, 6, or 7 are written as software.

[0032] In the inventions according to claims 1 and 8, when the unknown system outputs data y in response to input data x by evaluating a function and adding Gaussian noise, the learning data output unit outputs the data x input to the unknown system and the resulting data y as learning data (x, y), and the regression curve estimation unit estimates the unknown system from these learning data as the regression curve "E[p(y|x)]" using the linear model "f(x;θ)=Σ_i θ_i f_i(x)" with an M-dimensional parameter θ. Because the learning data creation unit creates data x minimizing the learning error of the regression curve estimation unit and sets it in the learning data output unit, the regression curve estimation unit estimates the unknown system well from a small number of learning data.

[0033] In the invention according to claim 2, the probability density function "r(x;v)", which has parameter v and generates the data x of the learning data (x, y), is held in the input distribution holding unit, and the estimator "hat q(x)" of the density function of the distribution of the data x input from outside is held in the estimator holding unit. The error minimization unit uses "hat q(x)" to calculate a parameter v for which the estimated learning error E(v) of the regression curve estimation unit becomes small when the data x are generated according to "r(x;v)"; this parameter v is then set in "r(x;v)" and the data x of the learning data (x, y) are generated. Data x minimizing the learning error of the regression curve estimation unit are thus created easily.

[0034] In the invention according to claim 3, the first matrix calculation unit uses the density function estimator "hat q(x)" to calculate the following matrix,

【００３５】[0035]

【数１４】 [Equation 14]

[0036] the second matrix calculation unit uses the probability density function "r(x;v)" to calculate the following matrix,

【００３７】[0037]

【数１５】 (Equation 15)

[0038] and the error minimization unit calculates the estimated learning error E(v) as follows.

【００３９】[0039]

【数１６】 (Equation 16)

[0040] The estimated learning error E(v) of the regression curve estimation unit is therefore calculated easily from the two matrices.

[0041] In the invention according to claim 4, the error minimization unit calculates the parameter v minimizing the estimated learning error E(v) by an iterative method that uses the gradient direction, so this parameter v is calculated easily.

[0042] In the invention according to claim 5, the probability density function "r(x;v)" of the input distribution holding unit is set as a discrete distribution on at most "M(M+1)/2" points, so the number of learning data is reduced.

[0043] In the invention according to claim 6, the probability density function "r(x;v)" of the input distribution holding unit is set as a discrete distribution whose number of points equals the number of the following functions that are linearly independent over the real numbers, so the number of learning data is limited to the necessary minimum.

【００４４】[0044]

【数１７】 [Equation 17]

[0045] In the inventions according to claims 7 and 9, when the unknown system outputs data y in response to input data x from the one-dimensional real line by evaluating a function and adding Gaussian noise, the learning data output unit outputs the data x input to the unknown system and the resulting data y as learning data (x, y), and the average estimation unit estimates the unknown system from these learning data using the following polynomial with an M-dimensional parameter θ.

【００４６】[0046]

【数１８】 (Equation 18)

[0047] Because the learning data creation unit creates data x minimizing the learning error of the average estimation unit according to a preset mixture of two distributions and sets it in the learning data output unit, the regression curve estimation unit estimates the unknown system well from a small number of learning data.

[0048] In the invention according to claim 10, the functions of the statistical learning device according to claim 1, 2, 3, 4, 5, 6, or 7 are written as software, so a microcomputer operated by the software on this information storage medium functions as the statistical learning device according to claim 1, 2, 3, 4, 5, 6, or 7.

【００４９】[0049]

DESCRIPTION OF THE PREFERRED EMBODIMENTS A first embodiment of the present invention is described below with reference to FIG. 1. The statistical learning device 1 of this embodiment has an external data input unit 2, a regression curve estimation unit 3, an external data output unit 4, a learning data output unit 5, and a learning data creation unit 6, and is connected to an unknown system 7.

[0050] When an L-dimensional vector x is input as data x from an external vector space, the unknown system 7 evaluates a predetermined function, adds Gaussian noise, and outputs a K-dimensional vector y as data y. The external data input unit 2 passes a vector x input from outside to the regression curve estimation unit 3, and the external data output unit 4 outputs to the outside the vector y produced by the regression curve estimation unit 3.

[0051] To estimate the unknown system 7 as the regression curve "E[p(y|x)]", the regression curve estimation unit 3 has the linear model "f(x;θ)=Σ_i θ_i f_i(x)" with an M-dimensional parameter θ preset in a model storage unit 8, and has a parameter estimation unit 9 that estimates the M-dimensional parameter θ of the linear model.

[0052] The learning data output unit 5 outputs the vector x input to the unknown system 7 and the resulting vector y to the regression curve estimation unit 3 as learning data (x, y), and the learning data creation unit 6 creates a vector x minimizing the learning error of the regression curve estimation unit 3 and sets it in the learning data output unit 5.

[0053] In this configuration, the unknown system 7 outputs a K-dimensional vector y in response to an L-dimensional input vector x by evaluating a function and adding Gaussian noise, and in the statistical learning device 1 of this embodiment the regression curve estimation unit 3 estimates the unknown function of the unknown system 7.

[0054] More specifically, the learning data output unit 5 outputs the vector x input to the unknown system 7 and the resulting vector y to the regression curve estimation unit 3 as learning data (x, y). The regression curve estimation unit 3 then estimates the function of the unknown system 7 as the regression curve "E[p(y|x)]" by finding the θ that minimizes the following expression, for example by least-squares error estimation, which is the maximum likelihood method for the linear model "f(x;θ)=Σ_i θ_i f_i(x)".

【００５５】[0055]

【数１９】 [Equation 19]
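The image for Equation 19 is not reproduced in this text. Assuming it is the usual sum of squared errors over N learning data (x_n, y_n), which is the quantity that least-squares estimation minimizes, the criterion could be written as below. The index n and the hat on θ are notation introduced here, not taken from the original equation.

```latex
\hat{\theta} \;=\; \arg\min_{\theta} \sum_{n=1}^{N} \bigl\| y_n - f(x_n;\theta) \bigr\|^{2},
\qquad
f(x;\theta) \;=\; \sum_{i} \theta_i f_i(x)
```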

[0056] At this time, the learning data creation unit 6 creates a vector x minimizing the learning error of the regression curve estimation unit 3 and sets it in the learning data output unit 5, and the learning data output unit 5 collects learning data (x, y) at the vector x set by the learning data creation unit 6 and outputs them to the regression curve estimation unit 3. The regression curve estimation unit 3 thus receives exactly the learning data (x, y) that are necessary and sufficient, and can estimate the unknown system 7 well, with minimum error, without an enormous amount of learning.

[0057] Having estimated the unknown system 7 in this way, the regression curve estimation unit 3 can output a vector y for an input vector x just as the unknown system 7 does. The vector y that the unknown system 7 would output can therefore be estimated, without being measured directly, and used, for example, to control the unknown system 7.

[0058] One concrete application of the statistical learning device 1 is the characterization of a color scanner. For example, the conversion function from the L*a*b* values, the physical quantities of the actual colors of an original, to the YMC (Yellow, Magenta, Cyan) values that the color scanner outputs for them can be treated as the unknown system 7, and the statistical learning device 1 can fit the regression curve estimation unit 3 to it.

[0059] In this case, standard originals are prepared whose L*a*b* values vary over the required range in sufficiently fine steps. The original whose L*a*b* value is closest to the input vector x given by the learning data creation unit 6 for observing learning data is read by the color scanner, and the YMC value it outputs is taken as the vector y, giving one learning datum (x, y). Repeating this operation N times yields N learning data created by the method of the present invention, from which the regression curve estimation unit 3 estimates the parameter θ and thereby identifies the characteristics of the color scanner.
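The patch-selection step described above might look like the following sketch. Several things here are assumptions made for illustration: closeness is measured by Euclidean distance in L*a*b* space, the recorded input is the chosen patch's own L*a*b* value rather than the requested one, and `fake_scan` is a hypothetical stand-in for reading a patch with the scanner.

```python
def nearest_patch(target, patches):
    """Return the index of the patch whose L*a*b* value is closest to target."""
    def sq_dist(p):
        return sum((a - b) ** 2 for a, b in zip(p, target))
    return min(range(len(patches)), key=lambda k: sq_dist(patches[k]))


def collect_learning_data(requested_inputs, patches, scan):
    """For each requested input, scan the nearest patch and record one pair (x, y)."""
    data = []
    for x in requested_inputs:
        k = nearest_patch(x, patches)
        data.append((patches[k], scan(patches[k])))
    return data


# Hypothetical standard patches (L*, a*, b*) and a stand-in for the scanner.
patches = [(50.0, 0.0, 0.0), (60.0, 10.0, -5.0), (70.0, -20.0, 30.0)]


def fake_scan(lab):
    """Placeholder for the scanner; a real device would return measured values."""
    return tuple(v / 100.0 for v in lab)


data = collect_learning_data([(58.0, 8.0, -4.0)], patches, fake_scan)
```

Recording the patch's actual value keeps each pair consistent with what was physically scanned; whether the original device records the requested or the actual value is not stated in the text.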

[0060] If the regression curve estimation unit 3 is fitted to the color scanner, which is the unknown system 7, during final adjustment of a color scanner to be shipped as a product, the learning data output unit 5 and the learning data creation unit 6 need not be installed in the product itself.

[0061] The statistical learning device 1 described above can be realized by building each of the units 2 to 6 as dedicated hardware, but it can also be realized by writing the functions of the units 2 to 6 as software on an information storage medium and operating a microcomputer with it. It is likewise possible to build some of the units 2 to 6 in hardware and write the rest as software on an information storage medium.

[0062] More specifically, the following are written on the information storage medium as software: outputting the data x input to the unknown system and the resulting data y as learning data (x, y); estimating the unknown system 7 from the input learning data (x, y) as the regression curve "E[p(y|x)]" using the linear model "f(x;θ)=Σ_i θ_i f_i(x)" with an M-dimensional parameter θ; creating and setting data x that minimize this learning error; and so on. An information storage medium on which the functions of the statistical learning device 1 are written as software in this way can also be handled as a standalone product, and can be supplied, for example, as RAM (Random Access Memory), ROM (Read Only Memory), CD-ROM (Compact Disk-ROM), or FD (Floppy Disk).

[0063] Next, a second embodiment of the present invention is described below with reference to FIGS. 2 and 3. In the statistical learning device 11 of this second embodiment, parts identical to those of the statistical learning device 1 of the first embodiment are given the same names and reference numerals, and their detailed description is omitted.

[0064] The statistical learning device 11 of this embodiment is intended to describe the configuration and operation of the learning data creation unit 12 concretely. As shown in FIG. 2, the learning data creation unit 12 has an estimator holding unit 13, an input distribution holding unit 14, a first matrix calculation unit 15, a second matrix calculation unit 16, and an error minimization unit 17.

[0065] The estimator holding unit 13 holds the estimator "hat q(x)" of the density function of the distribution of the vector x input from outside, and the first matrix calculation unit 15 uses the density function estimator "hat q(x)" to calculate the following matrix. A symbol with a "hat" added in this way denotes an estimate in this specification.

【００６６】[0066]

【数２０】 (Equation 20)

[0067] The input distribution holding unit 14 holds the probability density function "r(x;v)", which has parameter v and generates the vector x of the learning data (x, y), and the second matrix calculation unit 16 uses the probability density function "r(x;v)" to calculate the following matrix.

【００６８】[0068]

【数２１】 (Equation 21)

[0069] The error minimization unit 17 uses the density function estimator "hat q(x)" to calculate a parameter v for which the estimated learning error E(v) of the regression curve estimation unit 3 becomes small when the vector x of the learning data (x, y) is generated according to the probability density function "r(x;v)". To do so, it calculates the estimated learning error E(v) as follows and finds the parameter v minimizing E(v) by an iterative method that uses the gradient direction, such as the steepest descent method.

【００７０】[0070]

【数２２】 (Equation 22)
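The equation images for the two matrices and for E(v) are not reproduced in this text, so the sketch below rests on assumed forms: the first matrix I has entries given by the average of f_a(x) f_b(x) under "hat q(x)", the second matrix J(v) has the same entries averaged under "r(x;v)", and E(v) = Tr[I J(v)^{-1}], the form commonly used in optimal experimental design. It further assumes the basis f(x) = (1, x), a discrete r(x;v) on fixed candidate points with weights softmax(v), a finite-difference gradient, and a step-halving rule. All function names are hypothetical.

```python
import math


def softmax(v):
    """Map unconstrained parameters v to positive weights that sum to one."""
    top = max(v)
    e = [math.exp(t - top) for t in v]
    s = sum(e)
    return [t / s for t in e]


def moment_matrix(points, weights):
    """2x2 matrix sum_k w_k f(x_k) f(x_k)^T for the assumed basis f(x) = (1, x)."""
    m1 = sum(w * x for x, w in zip(points, weights))
    m2 = sum(w * x * x for x, w in zip(points, weights))
    return [[sum(weights), m1], [m1, m2]]


def error_estimate(i_mat, j_mat):
    """Assumed error estimate Tr[I J^{-1}], written out for 2x2 matrices."""
    det = j_mat[0][0] * j_mat[1][1] - j_mat[0][1] * j_mat[1][0]
    inv = [[j_mat[1][1] / det, -j_mat[0][1] / det],
           [-j_mat[1][0] / det, j_mat[0][0] / det]]
    return sum(i_mat[a][b] * inv[b][a] for a in range(2) for b in range(2))


def minimize(i_mat, candidates, steps=200, rate=0.5, eps=1e-6):
    """Gradient descent on v with a central-difference gradient and step halving."""
    def e_of(u):
        return error_estimate(i_mat, moment_matrix(candidates, softmax(u)))

    v = [0.0] * len(candidates)
    history = [e_of(v)]
    for _ in range(steps):
        grad = []
        for k in range(len(v)):
            up, dn = v[:], v[:]
            up[k] += eps
            dn[k] -= eps
            grad.append((e_of(up) - e_of(dn)) / (2 * eps))
        step, improved = rate, False
        while step > 1e-12 and not improved:
            trial = [t - step * g for t, g in zip(v, grad)]
            e_trial = e_of(trial)
            if e_trial < history[-1]:  # accept only steps that lower E(v)
                v, improved = trial, True
                history.append(e_trial)
            step /= 2
        if not improved:
            break
    return v, history


# "hat q(x)" represented by equally weighted sample points (an assumption).
q_points = [-1.0, -0.5, 0.0, 0.5, 1.0]
i_mat = moment_matrix(q_points, [1.0 / len(q_points)] * len(q_points))
candidates = [-1.0, 0.0, 1.0]  # fixed support of r(x; v)
v_opt, history = minimize(i_mat, candidates)
weights = softmax(v_opt)
```

In this toy setting the descent moves weight away from the middle candidate toward the two end points, which is the behaviour expected for a straight-line model. Fixing the support and optimizing only the weights is a simplification; the patent's r(x;v) may also parametrize the point locations.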

[0071] The learning data creation unit 12 then sets the parameter v calculated by the error minimization unit 17 in the probability density function "r(x;v)" held in the input distribution holding unit 14, and generates the vector x of the learning data (x, y).
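Once v has been fixed, generating inputs amounts to drawing from r(x;v). A minimal sketch, assuming r(x;v) is a discrete distribution on fixed points with weights softmax(v) (a parametrization chosen here for illustration, not stated in the source):

```python
import math
import random


def softmax(v):
    """Map unconstrained parameters v to positive weights that sum to one."""
    top = max(v)
    e = [math.exp(t - top) for t in v]
    s = sum(e)
    return [t / s for t in e]


def sample_inputs(points, v, n, rng):
    """Draw n inputs x from the discrete distribution with weights softmax(v)."""
    return rng.choices(points, weights=softmax(v), k=n)


rng = random.Random(0)  # seeded so that runs are repeatable
# Hypothetical parameter v that puts almost all weight on the two end points.
xs = sample_inputs([-1.0, 0.0, 1.0], [2.0, -50.0, 2.0], 20, rng)
```

Each drawn x would then be passed to the learning data output unit, which queries the unknown system to obtain the matching y.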

[0072] In this configuration, as in the statistical learning device 1 described above, when the learning data creation unit 12 creates a vector x and sets it in the learning data output unit 5, the learning data output unit 5 collects learning data (x, y) at the set vector x and outputs them to the regression curve estimation unit 3, which estimates the function of the unknown system 7 from the input learning data (x, y).

[0073] The process by which the learning data creation unit 12 generates the vector x of the learning data (x, y) is described step by step below. The input distribution holding unit 14 holds in advance the probability density function "r(x;v)" that generates the vector x of the learning data (x, y), so once its parameter v is determined, the vector x is determined as well.

[0074] The first matrix calculation unit 15 calculates the following matrix using the density function estimate “hat q(x)” preset in the estimated amount holding unit 13.

[0075]

[Equation 23]

[0076] The second matrix calculation unit 16 calculates the following matrix using the probability density function “r(x; v)”.

[0077]

[Equation 24]

[0078] The error minimization unit 17 calculates the estimated learning error E(v) as shown below and finds the parameter v that minimizes E(v) by an iterative method that uses the gradient direction.

[0079]

[Equation 25]

[0080] The learning data creation unit 12 then sets the parameter v calculated in this way in the probability density function “r(x; v)” and generates the vector x, so learning data (x, y) based on this vector x can minimize the learning error of the regression curve estimation unit 3.

[0081] The effectiveness of the statistical learning device 11 of the present embodiment is verified below. First, assume that the true regression curve f(x) of the unknown system 7 to be estimated is contained in the M-dimensional linear model f(x; θ) set in the regression curve estimation unit 3, so that f(x) = f(x; θ₀). Next, let r(x) be the density function set in the input distribution holding unit 14, and let “hat θ” be the parameter θ that the regression curve estimation unit 3 obtains by least-squares estimation from learning data (x, y) created from r(x). The average learning error in this situation can be calculated as follows.

[0082]

[Equation 26]

[0083] Let E₀ be the expectation of this learning error over the random draw of the learning data. It is approximately expressed as follows.

[0084]

[Equation 27]

[0085] In this case, the following holds by a calculation that is standard in statistical estimation.

[0086]

[Equation 28]

[0087] Accordingly, the expected learning error E₀ is as follows.

[0088]

[Equation 29]

[0089] If this expected learning error E₀ can be minimized, the learning error of the regression curve estimation unit 3 can be minimized, so the learning data creation unit 12 of the statistical learning device 11 of the present embodiment is configured to satisfy this condition.
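The reasoning of paragraphs [0081] to [0089] can be illustrated by the following worked sketch. It is not a reproduction of Equations 26 to 29, whose images are not available in this text. It assumes a scalar output, Gaussian noise of variance σ², N training inputs drawn independently from r(x), and the matrix names I and J, all of which are notation chosen for this sketch.

```latex
% Assumptions for this sketch (not taken from the original equations):
%   y = f(x;\theta_0) + \varepsilon,\ \varepsilon \sim \mathcal{N}(0,\sigma^2),
%   f(x;\theta) = \sum_{a=1}^{M} \theta_a f_a(x),\quad x_1,\dots,x_N \sim r(x).
\begin{align*}
I_{ab} &= \int f_a(x)\, f_b(x)\, q(x)\, dx,
\qquad
J_{ab} = \int f_a(x)\, f_b(x)\, r(x)\, dx, \\[4pt]
\mathbb{E}\!\left[(\hat\theta-\theta_0)(\hat\theta-\theta_0)^{\top}\right]
 &\approx \frac{\sigma^{2}}{N}\, J^{-1}
 \qquad \text{(least-squares estimate, large } N\text{)}, \\[4pt]
\mathbb{E}\!\left[\int \bigl(f(x;\hat\theta)-f(x;\theta_0)\bigr)^{2} q(x)\, dx\right]
 &= \mathbb{E}\!\left[(\hat\theta-\theta_0)^{\top} I\, (\hat\theta-\theta_0)\right]
 \approx \frac{\sigma^{2}}{N}\, \operatorname{Tr}\!\left[I J^{-1}\right].
\end{align*}
```

Under these assumptions, the part of the expected error that depends on the input distribution r(x) is proportional to Tr[I J⁻¹], which is consistent with the later references in the text to a trace computed by the error minimization unit 17.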

[0090] In the statistical learning device 11 of the present embodiment, the error minimization unit 17 calculates the parameter v of the estimated learning error E(v) by an iterative method as described above, so an appropriate parameter v can be obtained even when it cannot be calculated analytically.

[0091] When the error minimization unit 17 calculates the parameter v by an iterative method in this way, the cost of calculating the trace Tr in the error minimization unit 17 can be reduced if the probability density function “r(x; v)” is set in the input distribution holding unit 14 as a discrete distribution with “M(M+1)/2” or fewer points. Furthermore, if “r(x; v)” can be set as a discrete distribution whose number of points equals the number of linearly independent functions among those shown below, the calculation cost of the error minimization unit 17 can be reduced to the necessary minimum.

[0092]

[Equation 30]

[0093] A specific example of the statistical learning method performed by the statistical learning device 11 described above is briefly explained below with reference to FIG. 3. First, as described above, the first matrix calculation unit 15 calculates the first matrix “hat I” using the density function estimate “hat q(x)” preset in the estimated amount holding unit 13 (step S1). Next, the second matrix calculation unit 16 calculates the second matrix “J_ab(v)” using the probability density function “r(x; v)” held in advance in the input distribution holding unit 14 (step S4), and the error minimization unit 17 calculates the estimated learning error E(v) (step S5).

[0094] To find the parameter v that minimizes the estimated learning error E(v) by an iterative method using the gradient direction (step S6), the parameter v is first initialized with random numbers (step S2), and the iteration count k of the steepest descent method, which is the iterative method used, is initialized to “0” (step S3). The parameter v is then calculated from the second matrix “J_ab(v)” and the estimated learning error E(v) until the iteration count reaches k_end (steps S4 to S8). The symbol “α” in the figure is the convergence coefficient.

[0095] Once the parameter v that minimizes the estimated learning error E(v) has been calculated in this way, the learning data creation unit 12 sets the parameter v in the probability density function “r(x; v)” and generates “N−1” vectors x (step S9). At the same time, the “N−1” vectors y actually output by the unknown system 7 are acquired (step S10), and the parameter θ of the linear model f(x; θ) for the unknown system 7 is estimated from these “N−1” items of learning data (x, y) (step S11).
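The flow of steps S1 to S11 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the form E(v) = Tr[hat I · J(v)⁻¹] is assumed because Equations 22 to 25 are not available here, the basis is taken to be polynomial, r(x; v) is taken to be a discrete distribution on fixed support points with softmax weights, the gradient is approximated by finite differences, and every function name is hypothetical.

```python
import numpy as np

def basis(x, M):
    """Illustrative basis f_a(x) = x**a for a = 0..M-1 (an assumption)."""
    return np.vander(np.atleast_1d(np.asarray(x, dtype=float)), M, increasing=True)

def softmax(v):
    """Map the free parameter v to probabilities over the support points."""
    w = np.exp(v - np.max(v))
    return w / w.sum()

def estimated_error(v, I_hat, F_support):
    """Assumed form E(v) = Tr[I_hat J(v)^-1], J(v) = sum_k w_k f(x_k) f(x_k)^T."""
    w = softmax(v)
    J = (F_support * w[:, None]).T @ F_support            # step S4: second matrix
    return np.trace(np.linalg.solve(J, I_hat))            # step S5: estimated error

def minimize_v(I_hat, support, M, alpha=0.2, k_end=300, eps=1e-5, seed=0):
    """Steepest descent on E(v) with convergence coefficient alpha."""
    rng = np.random.default_rng(seed)
    F_support = basis(support, M)
    v = rng.normal(scale=0.1, size=len(support))          # step S2: random init
    history = [estimated_error(v, I_hat, F_support)]
    for k in range(k_end):                                # steps S3 to S8: loop to k_end
        grad = np.zeros_like(v)
        for j in range(len(v)):                           # central finite differences
            d = np.zeros_like(v)
            d[j] = eps
            grad[j] = (estimated_error(v + d, I_hat, F_support)
                       - estimated_error(v - d, I_hat, F_support)) / (2 * eps)
        v = v - alpha * grad                              # step S6: gradient update
        history.append(estimated_error(v, I_hat, F_support))
    return v, history

def run_flow(unknown_system, q_samples, support, M, n_train=200, seed=0):
    """Run steps S1 to S11 once and return (theta, v, error history)."""
    rng = np.random.default_rng(seed)
    Fq = basis(q_samples, M)
    I_hat = Fq.T @ Fq / len(q_samples)                    # step S1: first matrix
    v, history = minimize_v(I_hat, support, M, seed=seed)
    x = rng.choice(support, size=n_train, p=softmax(v))   # step S9: generate inputs
    y = unknown_system(x)                                 # step S10: observe outputs
    theta, *_ = np.linalg.lstsq(basis(x, M), y, rcond=None)  # step S11: estimate theta
    return theta, v, history
```

For example, `run_flow(lambda x: x + 2.0, q_samples, np.linspace(-1, 1, 5), M=2)` with `q_samples` drawn from the operating input distribution returns the fitted coefficients together with the history of E(v), which should decrease over the iterations.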

[0096] Next, a third embodiment of the present invention is described with reference to FIG. 4. In the statistical learning device 21 of this third embodiment, parts identical to those of the statistical learning device 11 of the second embodiment described above are given the same names and reference numerals, and their detailed description is omitted.

[0097] The statistical learning device 21 of the present embodiment gives a specific form to the estimated amount holding unit 23 of the learning data creation unit 22. A parametric model “q(x; u)” with parameter u is preset in the estimated amount holding unit 23, which is connected to the external data input unit 2. The estimated amount holding unit 23 collects the (N−1)-dimensional vector x actually input from outside to the external data input unit 2, obtains the estimate “hat q(x)” of the density function of its distribution by parametric estimation with the parametric model “q(x; u)”, and holds it.

[0098] Any of various models suitable for parametric estimation can be used as the parametric model “q(x; u)”. Here, for example, it is set as a mixture of normal distributions, as follows.

[0099]

[Equation 31]

[0100] In this expression, “n(x; m, s²)” is the probability density function of a normal distribution with mean m and variance s², which is expressed as shown below. The parameter u of the parametric model “q(x; u)” is therefore “u = (m_t, s_t)”.

[0101]

[Equation 32]
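A mixture of normal densities of the kind described in paragraphs [0098] to [0100] can be sketched as below. Because Equations 31 and 32 are not available here, this sketch assumes a one-dimensional input and mixing weights that default to equal values; the function names and the `weights` argument are hypothetical.

```python
import numpy as np

def normal_pdf(x, m, s2):
    """Density n(x; m, s^2) of a one-dimensional normal distribution."""
    return np.exp(-(x - m) ** 2 / (2.0 * s2)) / np.sqrt(2.0 * np.pi * s2)

def mixture_pdf(x, means, variances, weights=None):
    """q(x; u): weighted sum of normal densities; equal weights if none are given."""
    means = np.asarray(means, dtype=float)
    variances = np.asarray(variances, dtype=float)
    if weights is None:
        weights = np.full(len(means), 1.0 / len(means))
    x = np.asarray(x, dtype=float)[..., None]    # broadcast x against the components
    return np.sum(weights * normal_pdf(x, means, variances), axis=-1)
```

Here the pair `(means, variances)` plays the role of the parameter u = (m_t, s_t) that the estimated amount holding unit 23 would estimate from observed inputs.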

[0102] With this configuration, in the statistical learning device 21 of the present embodiment, as in the statistical learning device 11 described above, the learning data creation unit 22 creates the vector x of the learning data (x, y) using the estimate “hat q(x)” of the density function of the distribution of the vector x held in the estimated amount holding unit 23.

[0103] Because the estimated amount holding unit 23 obtains the estimate “hat q(x)” by parametric estimation from vectors x input from outside and holds it, vectors x equivalent to those seen under the operating conditions of the regression curve estimation unit 3 are input to the external data input unit 2 at the time of this parametric estimation. The estimated amount holding unit 23 then collects the vectors x input from outside to the external data input unit 2 and estimates the parameter u of the preset parametric model “q(x; u)” by, for example, the maximum likelihood method, thereby obtaining and holding the estimate “hat q(x)” of the density function of the distribution of the vector x.

[0104] As described above, the statistical learning device 21 of the present embodiment can set the estimate “hat q(x)” of the density function of the distribution of the vector x in the estimated amount holding unit 23 of the learning data creation unit 22 on the basis of vectors x equivalent to those in actual use.

[0105] Next, a fourth embodiment of the present invention is described with reference to FIG. 5. In the statistical learning device 31 of this fourth embodiment, parts identical to those of the statistical learning device 21 of the third embodiment described above are given the same names and reference numerals, and their detailed description is omitted.

[0106] The statistical learning device 31 of the present embodiment likewise gives a specific form to the estimated amount holding unit 33 of the learning data creation unit 32. A nonparametric model “q(x)” is preset in the estimated amount holding unit 33, which collects the vector x actually input from outside to the external data input unit 2, obtains the estimate “hat q(x)” of the density function of its distribution by nonparametric estimation, and holds it.

[0107] Any of various models suitable for nonparametric estimation can be used as the nonparametric model “q(x)”. The vectors x are input as follows.

[0108]

[Equation 33]

[0109] In this case, the nonparametric model “q(x)” is set, for example, using normal distributions as follows.

[0110]

[Equation 34]
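A nonparametric estimate built from normal distributions, as described in paragraph [0109], is commonly realized as a Gaussian kernel density estimate. The sketch below assumes that reading, a one-dimensional input, and a single bandwidth parameter; Equation 34 itself is not available here, and the name `gaussian_kde_1d` and its `bandwidth` argument are hypothetical.

```python
import numpy as np

def gaussian_kde_1d(samples, bandwidth):
    """Return q_hat(x) = (1/N) * sum_i n(x; x_i, bandwidth^2) as a callable."""
    samples = np.asarray(samples, dtype=float)

    def q_hat(x):
        x = np.asarray(x, dtype=float)[..., None]   # broadcast x against the samples
        z = (x - samples) / bandwidth
        kernel = np.exp(-0.5 * z ** 2) / (bandwidth * np.sqrt(2.0 * np.pi))
        return np.mean(kernel, axis=-1)             # average of the N kernels

    return q_hat
```

The returned function can be evaluated on a grid to inspect the estimated input density before it is used to compute the first matrix.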

[0111] With this configuration, in the statistical learning device 31 of the present embodiment, as in the statistical learning device 21 described above, the estimate “hat q(x)” of the density function of the distribution of the vector x is set in the estimated amount holding unit 33 of the learning data creation unit 32 on the basis of vectors x equivalent to those in actual use. The estimated amount holding unit 33 collects the vectors x input from outside to the external data input unit 2, obtains the estimate “hat q(x)” of the density function of their distribution by nonparametric estimation, and holds it.

[0112] Although the statistical learning device 31 of the present embodiment has been illustrated with a nonparametric model “q(x)” based on normal distributions set in the estimated amount holding unit 33, the present invention is not limited to this form. Various kernel functions, such as a rectangular function, can also be used for such a nonparametric model “q(x)”. For example, the following operation, which uses a delta function as the kernel function, can also be used.

[0113]

[Equation 35]

[0114] Next, a fifth embodiment of the present invention is described with reference to FIG. 6. In the statistical learning device 41 of this fifth embodiment, parts identical to those of the statistical learning device 11 of the second embodiment described above are given the same names and reference numerals, and their detailed description is omitted.

[0115] The statistical learning device 41 of the present embodiment gives a specific form to the learning data creation unit 42. The learning data creation unit 42 has no estimated amount holding unit 13; instead, the first matrix calculation unit 43 is connected to the external data input unit 2. The first matrix calculation unit 43 collects the (N′−1)-dimensional vector x actually input from outside to the external data input unit 2 and performs the following calculation.

[0116]

[Equation 36]

[0117] With this configuration, in the statistical learning device 41 of the present embodiment, as in the statistical learning device 11 described above, the learning data creation unit 42 creates the vector x of the learning data (x, y) using the calculation result of the first matrix calculation unit 43. Because the first matrix calculation unit 43 uses the vectors x input from outside and performs its calculation by summation without carrying out any integration, the computational load is reduced and the calculation time is shortened. This technique corresponds to using a delta function as the kernel function.
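The summation described in paragraph [0117] can be sketched as an average of basis-function products over the observed inputs. The exact form of Equation 36 is not available here, so the normalization by the sample count and the function name below are assumptions made for illustration.

```python
import numpy as np

def first_matrix_by_summation(x_samples, basis_functions):
    """I_hat[a, b] = (1/N') * sum_i f_a(x_i) * f_b(x_i); no integral is evaluated."""
    x = np.asarray(x_samples, dtype=float)
    F = np.stack([f(x) for f in basis_functions], axis=1)   # shape (N', M)
    return F.T @ F / len(x)

# Usage: a two-term linear model with f_1(x) = 1 and f_2(x) = x.
I_hat = first_matrix_by_summation([-1.0, 0.0, 1.0], [np.ones_like, lambda x: x])
```

The cost is one pass over the samples and an M×M accumulation, which is the saving the paragraph attributes to avoiding integration.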

[0118] Next, a sixth embodiment of the present invention is described with reference to FIGS. 7 to 9. In the statistical learning device 51 of this sixth embodiment, parts identical to those of the statistical learning device 1 of the first embodiment described above are given the same names and reference numerals, and their detailed description is omitted.

[0119] As shown in FIG. 7, the statistical learning device 51 of the present embodiment is intended for an unknown system 52 that outputs data y in response to an input x taken from the one-dimensional real line. In the model storage unit 54 of the average estimation unit 53 that estimates this system, a polynomial with an M-dimensional parameter θ is set as follows.

[0120]

[Equation 37]

[0121] The learning data creation unit 55, which creates the vector x of the learning data (x, y) for the average estimation unit 53, creates the vector x that minimizes the learning error E(v) of the average estimation unit 53 according to a preset mixture “r(x; v)” of two distributions and sets it in the learning data output unit 5.

[0122] Such a mixture “r(x; v)” may be formed from two distributions of various kinds. Here it is preset as “a n(x; 0, τ₁²) + (1−a) n(x; 0, τ₂²)”, a mixture of two normal distributions weighted by a real number a with “0 < a < 1”. With the linear combination coefficients of the two normal distributions being “a” and “1−a” and their variances being “τ₁²” and “τ₂²”, a small value of a satisfying “0 < a < 0.1” and a value of τ₁ satisfying “τ₁² > 10” are chosen so that “aτ₁² > 10” is satisfied.
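Sampling from the two-component mixture of paragraph [0122] and checking its stated conditions can be sketched as follows. The function names and the choice of a fixed random seed are assumptions made for this illustration.

```python
import numpy as np

def check_mixture_params(a, tau1):
    """Check the stated conditions: 0 < a < 0.1, tau1^2 > 10 and a * tau1^2 > 10."""
    return 0.0 < a < 0.1 and tau1 ** 2 > 10.0 and a * tau1 ** 2 > 10.0

def sample_mixture(n, a, tau1, tau2, seed=0):
    """Draw n inputs from a * n(x; 0, tau1^2) + (1 - a) * n(x; 0, tau2^2)."""
    rng = np.random.default_rng(seed)
    wide = rng.random(n) < a                     # True where the wide component is used
    scale = np.where(wide, tau1, tau2)           # per-sample standard deviation
    return rng.normal(0.0, 1.0, size=n) * scale
```

The variance of the mixture is aτ₁² + (1−a)τ₂², so the condition aτ₁² > 10 guarantees that the rare wide component dominates the spread of the generated inputs.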

[0123] With this configuration, in the statistical learning method of the statistical learning device 51 of the present embodiment, as shown in FIG. 8, the learning data creation unit 55 creates the vector x that minimizes the learning error E(v) according to the preset mixture “r(x; v)” of two distributions (step T1), so the average estimation unit 53 can estimate the unknown system 52 well from the learning data (x, y) using the polynomial f(x; θ) (steps T2 to T4).

[0124] The effectiveness of the statistical learning device 51 of the present embodiment is verified below. First, let the density function of a normal distribution with mean m and variance τ² be

[0125]

[Equation 38]

[0126] and let the mixture “r(x; v)” of two normal distributions set in the average estimation unit 53 be “a n(x; 0, τ₁²) + (1−a) n(x; 0, τ₂²)”. If τ₂ is kept bounded and the limit “a → 0”, “τ₁ → ∞” is taken in such a way that “aτ₁² → ∞” is maintained, the trace value of the learning error E(v) becomes as follows.

[0127]

[Equation 39]

[0128] When the limiting operation is carried out as described above, the trace value of the learning error E(v), which is larger than “1”, approaches its minimum value “1” arbitrarily closely, so the optimum input distribution can be obtained. In practice, however, the limit cannot be taken indefinitely, so specific numerical values must be set.

[0129] To confirm this, the statistical learning device 51 of the present embodiment was simulated as follows, and a sufficient effect was confirmed. First, with U a uniform distribution on [−1, 1] and Z an independent random variable following a normal distribution with mean 0 and variance 1, the input distribution q(x) of the real system was set to “U + 0.3Z”. With the true function of the unknown system 52 set to “f(x) = x + 2”, a sufficient effect was confirmed with “a = 0.1, τ₁ = 30, τ₂ = 1”.

[0130] These values were set in the statistical learning device 51, the order of the estimation model was varied from first to fifth, 1000 items of learning data (x, y) were collected with each of the two density functions “r(x; v)” and “q(x)” described above, the parameter estimate “hat θ” was calculated, and the estimation error after learning was evaluated as the mean squared error given by the following expression.

[0131]

[Equation 40]

[0132] When this was run 500 times for each of the two density functions “r(x; v)” and “q(x)”, as shown in FIG. 9, the error for the density function “q(x)” increased in proportion to the order of the model, whereas the error for the density function “r(x; v)” did not increase as the order of the model increased. Moreover, the error for “r(x; v)” was comparable to the smallest error for “q(x)” (the case where the order is 0), which confirms that the learning error E(v) is minimized when the vector x is created using the mixture “r(x; v)” of two distributions as described above.
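The experiment of paragraphs [0129] to [0132] can be approximated by the sketch below. It is not the original simulation: the noise standard deviation is assumed to be 1 because the text does not state it, the mean squared error is approximated on a fixed sample drawn from q(x) because Equation 40 is not available here, fewer trials are used than the 500 reported, column scaling is added only for numerical conditioning, and all function names are hypothetical.

```python
import numpy as np

def sample_q(n, rng):
    """Operating input distribution q(x): U + 0.3 Z with U uniform on [-1, 1]."""
    return rng.uniform(-1.0, 1.0, n) + 0.3 * rng.normal(size=n)

def sample_r(n, rng, a=0.1, tau1=30.0, tau2=1.0):
    """Training input distribution r(x; v): mixture of two zero-mean normals."""
    wide = rng.random(n) < a
    return rng.normal(size=n) * np.where(wide, tau1, tau2)

def fit_polynomial(x, y, order):
    """Least-squares polynomial fit; columns are scaled to improve conditioning."""
    F = np.vander(x, order + 1, increasing=True)
    scale = np.linalg.norm(F, axis=0)
    coef, *_ = np.linalg.lstsq(F / scale, y, rcond=None)
    return coef / scale

def mean_squared_error(theta, x_eval):
    """Average squared gap to the true function f(x) = x + 2 over x_eval."""
    pred = np.vander(x_eval, len(theta), increasing=True) @ theta
    return np.mean((pred - (x_eval + 2.0)) ** 2)

def simulate(order, sampler, n_train=1000, n_trials=50, noise_std=1.0, seed=0):
    """Mean estimation error over n_trials independent training sets."""
    rng = np.random.default_rng(seed)
    x_eval = sample_q(5000, rng)                 # evaluation points drawn from q(x)
    errors = []
    for _ in range(n_trials):
        x = sampler(n_train, rng)
        y = x + 2.0 + noise_std * rng.normal(size=n_train)
        errors.append(mean_squared_error(fit_polynomial(x, y, order), x_eval))
    return float(np.mean(errors))
```

Calling `simulate(order, sample_q)` and `simulate(order, sample_r)` for orders 1 to 5 gives two error curves that can be compared with the trend described for FIG. 9.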

[0133] As described above, the statistical learning device 51 of the present embodiment handles the case where the input space of the vector x is the one-dimensional real line, but in some cases this input space can be restricted to a bounded interval [−A, B]. In such a case, it is preferable to collect learning data (x, y) at each of the two endpoints in proportion “a/2” and to generate the remaining proportion “1−a” from a distribution with variance smaller than “(A+B)²/4”. Doing so allows sampling that approximates the optimal distribution on the real line, so better learning data can be created.
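The bounded-interval rule of paragraph [0133] can be sketched as follows. The text only requires the interior distribution to have variance below (A+B)²/4; this sketch assumes a uniform distribution on [−A, B], whose variance is (A+B)²/12, and the function name is hypothetical.

```python
import numpy as np

def sample_bounded(n, a, A, B, seed=0):
    """Inputs on [-A, B]: mass a/2 at each endpoint, the rest uniform on the interval."""
    rng = np.random.default_rng(seed)
    u = rng.random(n)
    x = rng.uniform(-A, B, size=n)               # interior part, variance (A+B)^2 / 12
    x[u < a / 2.0] = -A                          # left endpoint with probability a/2
    x[(u >= a / 2.0) & (u < a)] = B              # right endpoint with probability a/2
    return x
```

Any other interior distribution that meets the variance bound could be substituted for the uniform draw without changing the endpoint logic.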

[0134]

[Effects of the Invention] The inventions of claims 1 and 8 relate to a statistical learning device and a statistical learning method in which, when an unknown system outputs data y in response to an input of data x by applying a function and adding Gaussian noise, the data x input to the unknown system and the data y output from it are output from a learning data output unit as learning data (x, y), and a regression curve estimation unit receiving this learning data (x, y) estimates the unknown system as the regression curve “E[p(y|x)]” using a linear model “f(x; θ) = Σ_i θ_i f_i(x)” with an M-dimensional parameter θ. Because a learning data creation unit creates the data x that minimizes the learning error of the regression curve estimation unit and sets it in the learning data output unit, the learning data output unit outputs learning data (x, y) to the regression curve estimation unit on the basis of the data x output by the learning data creation unit, and the regression curve estimation unit can therefore estimate the unknown system with minimum error.

[0135] In the invention of claim 2, the learning data creation unit has an input distribution holding unit that holds a probability density function “r(x; v)”, with parameter v, for generating the data x of the learning data (x, y); an estimated amount holding unit that holds the estimate “hat q(x)” of the density function of the distribution of the data x input from outside; and an error minimization unit that, using the estimate “hat q(x)”, calculates a parameter v that reduces the estimated learning error E(v) of the regression curve estimation unit when the data x of the learning data (x, y) is generated according to “r(x; v)”. By setting the calculated parameter v in the probability density function “r(x; v)” and generating the data x of the learning data (x, y), the data x of learning data (x, y) that minimizes the learning error of the regression curve estimation unit can be created easily.

[0136] In the invention of claim 3, a first matrix calculation unit is provided that uses the density function estimate “hat q(x)” to calculate

[0137]

[Equation 41]

[0138] a second matrix calculation unit is provided that uses the probability density function “r(x; v)” to calculate

[0139]

[Equation 42]

[0140] and the error minimization unit calculates the estimated learning error E(v) as

[0141]

[Equation 43]

[0142] so the estimated learning error E(v) needed to create the optimum data x can be calculated easily.

[0143] In the invention of claim 4, the error minimization unit calculates the parameter v that minimizes the estimated learning error E(v) by an iterative method using the gradient direction, so the parameter v that minimizes the learning error E(v) can be calculated easily.

[0144] In the invention of claim 5, the probability density function “r(x; v)” is set in the input distribution holding unit as a discrete distribution with “M(M+1)/2” or fewer points, so the number of items of learning data can be reduced.

[0145] In the invention of claim 6, the probability density function “r(x; v)” is set in the input distribution holding unit, with respect to the functions

[0146]

[Equation 44]

[0147] as a discrete distribution whose number of points equals the number of linearly independent functions among them, so the number of items of learning data can be limited to the necessary minimum.

[0148] The inventions of claims 7 and 9 relate to a statistical learning device and a statistical learning method in which, when an unknown system outputs data y in response to an input of data x from the one-dimensional real line by applying a function and adding Gaussian noise, the data x input to the unknown system and the data y output from it are output from a learning data output unit as learning data (x, y), and an average estimation unit receiving this learning data (x, y) estimates the unknown system using a polynomial with an M-dimensional parameter θ, namely

[0149]

[Equation 45]

[0150] Because a learning data creation unit creates the data x that minimizes the learning error of the average estimation unit and sets it in the learning data output unit, the learning data output unit outputs learning data (x, y) to the average estimation unit on the basis of the data x output by the learning data creation unit, and the average estimation unit can therefore estimate the unknown system with minimum error.

[0151] The information storage medium of the invention of claim 10 is an information storage medium on which software for operating a microcomputer is written, and the functions of the statistical learning device of claim 1, 2, 3, 4, 5, 6 or 7 are written on it as software. Operating a microcomputer with this software therefore makes it easy to realize the statistical learning device of claim 1, 2, 3, 4, 5, 6 or 7.

[Brief description of drawings]

FIG. 1 is a block diagram showing a statistical learning device according to a first embodiment of the present invention.

FIG. 2 is a block diagram showing a statistical learning device according to a second embodiment of the present invention.

FIG. 3 is a flowchart showing its statistical learning method.

FIG. 4 is a block diagram showing a statistical learning device according to a third embodiment of the present invention.

FIG. 5 is a block diagram showing a statistical learning device according to a fourth embodiment of the present invention.

FIG. 6 is a block diagram showing a statistical learning device according to a fifth embodiment of the present invention.

FIG. 7 is a block diagram showing a statistical learning device according to a sixth embodiment of the present invention.

FIG. 8 is a flowchart showing its statistical learning method.

FIG. 9 is a characteristic diagram showing test results for its estimation error.

[Explanation of symbols]

1, 11, 21, 31, 41, 51 Statistical learning device
3 Regression curve estimation unit
5, 22 Learning data output unit
6, 12, 32, 42 Learning data creation unit
7, 52 Unknown system
13, 23, 33 Estimated quantity holding unit
14 Input distribution holding unit
15 First matrix calculation unit
16 Second matrix calculation unit
17 Error minimization unit
53 Average estimation unit
55 Learning data creation unit

─────────────────────────────────────────────────────

[Procedure amendment]

[Submission date] June 13, 1996

[Procedure amendment 1]

[Name of document to be amended] Specification

[Item to be amended] 0084

[Method of amendment] Change

[Content of amendment]

【００８４】[0084]

【数２７】 [Equation 27]

Claims

[Claims]

1. A statistical learning device for the case where an unknown system outputs data y in response to an input of data x by an operation of a function and the addition of Gaussian noise, the device comprising a learning data output unit that outputs the data x input to the unknown system and the data y output from it as learning data (x, y), and a regression curve estimation unit that, in response to the input of the learning data (x, y), estimates the unknown system as a regression curve “E[p(y|x)]” using a linear model “f(x; θ) = Σ_i θ_i f_i(x)” having an M-dimensional parameter θ, characterized in that a learning data creation unit is provided that creates data x that minimizes the learning error of the regression curve estimation unit and sets the data x in the learning data output unit.

2. The statistical learning device according to claim 1, wherein the learning data creation unit comprises: an input distribution holding unit that holds a probability density function “r(x; v)”, having a parameter v, for generating the data x of the learning data (x, y); an estimated quantity holding unit that holds an estimated quantity “hat q(x)” of the density function of the distribution of data x input from the outside; and an error minimization unit that uses the estimated quantity “hat q(x)” of the density function to calculate a parameter v that reduces the estimated value E(v) of the learning error of the regression curve estimation unit when the data x of the learning data (x, y) is generated according to the probability density function “r(x; v)”, and wherein the data x of the learning data (x, y) is generated by setting the calculated parameter v in the probability density function “r(x; v)”.

3. The statistical learning device according to claim 2, further comprising a first matrix calculation unit that uses the estimated quantity “hat q(x)” of the density function to calculate [equation not reproduced], and a second matrix calculation unit that uses the probability density function “r(x; v)” to calculate [equation not reproduced], wherein the error minimization unit calculates the estimated value E(v) of the learning error as [equation not reproduced].

4. The statistical learning device according to claim 3, wherein the error minimization unit calculates the parameter v that minimizes the estimated value E(v) of the learning error by a sequential method using the gradient direction.

5. The statistical learning device according to claim 4, wherein the probability distribution function “r(x; v)” is set as a discrete distribution over a number of points equal to or less than “M(M+1)/2”.

6. The statistical learning device according to claim 4, wherein the probability distribution function “r(x; v)” is set as a discrete distribution whose number of points equals the number of [equation not reproduced] that are linearly independent over the real numbers.

7. A statistical learning device for the case where an unknown system outputs data y, by an operation of a function and the addition of Gaussian noise, in response to an input of data x from the one-dimensional real line, the device comprising a learning data output unit that outputs the data x input to the unknown system and the data y output from it as learning data (x, y), and an average estimation unit that, in response to the input of the learning data (x, y), estimates the unknown system by a polynomial [equation not reproduced] having an M-dimensional parameter θ, characterized in that a learning data creation unit is provided that creates, in accordance with a preset mixture distribution of two distributions, the data x that minimizes the learning error of the average estimation unit and sets the data x in the learning data output unit.

8. A statistical learning method in which, when an unknown system outputs data y in response to an input of data x by an operation of a function and the addition of Gaussian noise, a learning data output unit outputs the data x input to the unknown system and the data y output from it as learning data (x, y), and a regression curve estimation unit, in response to the input of this learning data (x, y), estimates the unknown system as a regression curve “E[p(y|x)]” using a linear model “f(x; θ) = Σ_i θ_i f_i(x)” having an M-dimensional parameter θ, characterized in that a learning data creation unit creates data x that minimizes the learning error of the regression curve estimation unit and sets the data x in the learning data output unit.

9. A statistical learning method in which, when an unknown system outputs data y, by an operation of a function and the addition of Gaussian noise, in response to an input of data x from the one-dimensional real line, a learning data output unit outputs the data x input to the unknown system and the data y output from it as learning data (x, y), and an average estimation unit, in response to the input of this learning data (x, y), estimates the unknown system by a polynomial [equation not reproduced] having an M-dimensional parameter θ, characterized in that a learning data creation unit creates, in accordance with a preset mixture distribution of two distributions, the data x that minimizes the learning error of the average estimation unit and sets the data x in the learning data output unit.

10. An information storage medium in which software for operating a microcomputer is written, characterized in that the various functions of the statistical learning device according to claim 1, 2, 3, 4, 5, 6 or 7 are written in it as software.
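The flow of claims 2 to 4 (hold an input distribution “r(x; v)”, estimate the learning error E(v), and reduce it by a sequential method using the gradient direction) can be sketched in Python as follows. The equations of claim 3 are not reproduced in this text, so the criterion used here, E(v) = tr(Q R(v)^-1) with Q = Σ_k q_k f(x_k) f(x_k)^T and R(v) = Σ_k r_k(v) f(x_k) f(x_k)^T, is an assumption borrowed from standard optimal experimental design and should not be read as the patent's formula. The basis functions, the candidate points, the softmax parameterization of r, and the step-halving rule are likewise hypothetical.

```python
import numpy as np

def features(x):
    # Hypothetical basis functions f_i(x): 1, x, x^2 (so M = 3).
    return np.stack([np.ones_like(x), x, x**2], axis=1)

def softmax(v):
    e = np.exp(v - v.max())
    return e / e.sum()

def learning_error(v, F, q):
    # Assumed criterion E(v) = tr(Q R(v)^-1); not taken from the patent.
    r = softmax(v)                       # discrete r(x; v) on the candidates
    Q = F.T @ (q[:, None] * F)           # matrix built from hat q(x)
    R = F.T @ (r[:, None] * F)           # matrix built from r(x; v)
    return np.trace(Q @ np.linalg.inv(R))

def gradient(v, F, q):
    r = softmax(v)
    Q = F.T @ (q[:, None] * F)
    R_inv = np.linalg.inv(F.T @ (r[:, None] * F))
    A = R_inv @ Q @ R_inv
    # dE/dr_k = -f_k^T R^-1 Q R^-1 f_k
    g_r = -np.einsum("ki,ij,kj->k", F, A, F)
    # Chain rule through the softmax: dE/dv_j = r_j (g_j - sum_k r_k g_k)
    return r * (g_r - r @ g_r)

def minimize(F, q, steps=200, lr=0.5):
    # Sequential descent along the gradient; a step is kept only if E(v) drops.
    v = np.zeros(F.shape[0])
    err = learning_error(v, F, q)
    for _ in range(steps):
        candidate = v - lr * gradient(v, F, q)
        cand_err = learning_error(candidate, F, q)
        if cand_err < err:
            v, err = candidate, cand_err
        else:
            lr *= 0.5                    # shrink the step and try again
    return v

x_candidates = np.linspace(-1.0, 1.0, 11)        # hypothetical support points
F = features(x_candidates)
q = np.full(len(x_candidates), 1.0 / len(x_candidates))  # stand-in for hat q(x)
v_opt = minimize(F, q)
r_opt = softmax(v_opt)                           # distribution used to draw x
```

In this sketch, new inputs x would then be drawn from the candidate points with probabilities `r_opt` and passed to the unknown system to produce learning data (x, y).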