JP2004295638A

JP2004295638A - Parameter estimating method, data predicting method, parameter estimating device, data predicting device, computer program, and recording medium

Info

Publication number: JP2004295638A
Application number: JP2003088726A
Authority: JP
Inventors: Kazuhiro Takeyasu; 数博竹安
Original assignee: Japan Research Institute Ltd
Current assignee: Japan Research Institute Ltd
Priority date: 2003-03-27
Filing date: 2003-03-27
Publication date: 2004-10-21

Abstract

<P>PROBLEM TO BE SOLVED: To provide a method for estimating parameters in time sequence that is more positive than a conventional one, a method for predicting data from the estimated parameters, a parameter estimating device, a data predicting device, a computer program, and a recording medium. <P>SOLUTION: A time series äx<SB>n</SB>} comprising a plurality of pieces of data is read (S1), white noise is generated (S3), dispersion σ<SB>e</SB><SP>2</SP>of the white noise is calculated (S4), an autocorrelation function of the white noise with zero lag is regarded as σ<SB>e</SB><SP>2</SP>, an autocorrelation function of the white noise with lag other than zero is regarded as 0, a crosscorrelation function between the time series and white noise that is in the future than the time series is regarded as 0, and by expanding an inverse matrix included in an arithmetic expression of parameters and using an approximate expression excluding high order terms, parameters (a) and b are calculated (S7), convergence of the calculated parameters is determined (S8), and if they are not converging, new white noise äe<SB>n</SB>} is generated by using an expression of an ARMA model from the calculated parameters (S9), and calculation is repeated until the parameters converge. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、データの時系列からパラメータを推定する方法、推定されたパラメータを用いて新たなデータを予測する方法、パラメータ推定装置、データ予測方法、コンピュータをパラメータ推定装置またはデータ予測装置として実現するためのコンピュータプログラム、及び記録媒体に関する。
【０００２】
【従来の技術】
蓄積されたデータの時系列を解析し、新たなデータの値を予想する方法は、従来より様々な方法が提案されている。株価または販売量などのデータから構成される一般の経済的な時系列は、白色ノイズによって励起された系の出力とみることができる。データの時系列｛ｘ_ｎ：ｎ＝１，２，…，Ｎ，…｝を、定常エルゴード的正規過程ｘ（ｔ）の標本時系列であるとしたときには、（ｐ，ｑ）次のＡＲＭＡモデルでは、｛ｘ_ｎ｝の要素間の関係は、以下の式で表される。
【０００３】
【数５】

【０００４】
ここで、｛ｅ_ｎ｝は、平均値０及び分散σ_ｅ ^２の正規性雑音であり、｛ａ_ｉ：ｉ＝１，２，…，ｐ｝及び｛ｂ_ｊ：ｊ＝１，２，…，ｑ｝は、
Ａ（Ｚ^−１）＝１＋ａ_１Ｚ^−１＋…＋ａ_ｐＺ^−ｐ
Ｂ（Ｚ^−１）＝１＋ｂ_１Ｚ^−１＋…＋ｂ_ｑＺ^−ｑ
とおいたときに、Ａ（Ｚ^−１）及びＢ（Ｚ^−１）が規約であり、更に定常条件、可逆条件および強正実条件を満たすものとする。
【０００５】
標本の時系列｛ｘ_ｎ｝から（１）式のパラメータ｛ａ_ｉ｝及び｛ｂ_ｊ｝を推定するための方法として、非特許文献１に開示された疑似線形回帰法がある。現代制御理論では、ｐ＝ｑとしても一般性を失わないため、以下、ｐ＝ｑとして議論を進める。ベクトルＺ_ｎ及びベクトルθを次の様に定義する。
Ｚ_ｎ＝［−ｘ_ｎ−１，…，−ｘ_ｎ−ｐ，ｅ_ｎ−１，…，ｅ_ｎ−ｐ］^Ｔ
θ＝［ａ_１，ａ_２，…，ａ_ｐ，ｂ_１，ｂ_２，…，ｂ_ｐ］^Ｔ
（１）式をｘ_ｎについての式に書き直したときには、（１）式は、
ｘ_ｎ＝θ^ＴＺ_ｎ＋ｅ_ｎ …（２）
と表すことができる。Ｎ個のデータから構成される時系列｛ｘ_ｎ：ｎ＝１，２，…，Ｎ｝が得られたとき、ベクトルθを時系列から最小二乗法により求めるためには、以下の式
【０００６】
【数６】

【０００７】
の右辺を最小にするθを求めればよい。従って、（３）式の右辺をθで偏微分して０に等しいとすることにより、時系列から推定したベクトルθは、次の式で与えられる。
【０００８】
【数７】

【０００９】
ラグがｋである｛ｘ_ｎ｝の自己相関関数をＲ_ｋとすると、
Ｒ_ｋ＝Ｅ［ｘ_ｎｘ_ｎ＋ｋ］
である。ここで、Ｅ［ｙ_ｎ］は、データ列｛ｙ_ｎ：ｎ＝１，２，…｝の期待値である。また、
Ｒ_−ｋ＝Ｒ_ｋ
である。ラグがｋである｛ｅ_ｎ｝の自己相関関数をＳ_ｋとすると、同様に、
Ｓ_ｋ＝Ｅ［ｅ_ｎｅ_ｎ＋ｋ］，Ｓ_−ｋ＝Ｓ_ｋ
である。また、｛ｘ_ｎ｝と｛ｅ_ｎ｝との相互相関関数は、
Ｔ_ｋ＝Ｅ［ｅ_ｎｘ_ｎ＋ｋ］
Ｔ_−ｋ＝Ｅ［ｅ_ｎｘ_ｎ−ｋ］
とおくと、
Ｅ［ｘ_ｎｅ_ｎ＋ｋ］＝Ｅ［ｅ_ｎｘ_ｎ−ｋ］＝Ｔ_−ｋ
Ｅ［ｘ_ｎｅ_ｎ−ｋ］＝Ｅ［ｅ_ｎｘ_ｎ＋ｋ］＝Ｔ_ｋ
となる。
【００１０】
（４）式の右辺は、｛ｘ_ｎ｝の自己相関関数、｛ｅ_ｎ｝の自己相関関数、及び｛ｘ_ｎ｝と｛ｅ_ｎ｝との相互相関関数で表すことができ、（４）式は、
【００１１】
【数８】

【００１２】
となる。ベクトルａ及びｂを、
ａ＝［ａ_１，ａ_２，…，ａ_ｐ］^Ｔ
ｂ＝［ｂ_１，ｂ_２，…，ｂ_ｐ］^Ｔ
と定義し、（５）式に含まれる行列を、
【００１３】
【数９】

【００１４】
と定義したときには、（５）式は、
【００１５】
【数１０】

【００１６】
と表すことができる。ベクトルａ，ｂ，ｒ，ｔはｐ行１列の行列であり、行列Ｒ，Ｔ，Ｓはｐ行ｐ列の行列であるため、（７）式はベクトルａ及びｂについての連立方程式の形に書き直すことができ、
Ｒ・ａ−τ・ｂ＝−ｒ …（８）
−τ^Ｔ・ａ＋Ｓ・ｂ＝ｔ …（９）
となる。（９）式より、
ｂ＝Ｓ^−１（ｔ＋τ^Ｔ・ａ） …（１０）
であり、（１０）式を（８）式へ代入することにより、
ａ＝（Ｒ−τ・Ｓ^−１・τ^Ｔ）^−１（τ・Ｓ^−１・ｔ−ｒ） …（１１）
が求められる。（１１）式よりベクトルａが得られ、得られたベクトルａを（１０）式に代入することにより、ベクトルｂを計算することができる。
【００１７】
疑似線形回帰法では、白色雑音｛ｅ_ｎ｝を乱数などを用いて生成し、生成した｛ｅ_ｎ｝と時系列｛ｘ_ｎ｝とから、（１１）式および（１０）式を用いて、（１）式のパラメータ｛ａ_ｉ｝及び｛ｂ_ｊ｝を計算し、計算した｛ａ_ｉ｝及び｛ｂ_ｊ｝から、（１）式を用いて白色雑音｛ｅ_ｎ｝を計算し、パラメータ｛ａ_ｉ｝及び｛ｂ_ｊ｝が収束するまで計算を繰り返し、パラメータ｛ａ_ｉ｝及び｛ｂ_ｊ｝を求める。
【００１８】
【非特許文献１】
片山徹「システム同定入門」朝倉書店１９９４年
【００１９】
【発明が解決しようとする課題】
疑似線形回帰法は、問題の性質によっては、パラメータ｛ａ_ｉ｝及び｛ｂ_ｊ｝が収束せず、パラメータの推定ができない場合があるという問題がある。また、ｐ行ｐ列の行列計算を行うため、計算時間が長いという問題があった。
【００２０】
本発明は、以上の問題を解決するためになされたものであって、その目的とするところは、計算に用いる数値のうち理論的に０となるはずの数値を０とおき、更に逆行列を展開した計算を行うことにより、パラメータの値の収束を容易にし、同時に計算時間の短縮を図ることができるパラメータ推定方法、パラメータ推定装置、コンピュータを該パラメータ推定装置として実現するためのコンピュータプログラム、及び記録媒体を提供することにある。
【００２１】
また、本発明の他の目的とするところは、前記パラメータ推定方法により推定したパラメータを用いてデータの推定を行うことにより、従来の手法ではデータの予測ができなかった時系列についてもデータの予測が可能となるデータ予測方法、データ予測装置、コンピュータを該データ予測装置として実現するためのコンピュータプログラム、及び記録媒体を提供することにある。
【００２２】
【課題を解決するための手段】
第１発明に係るパラメータ推定方法は、複数のデータから構成される時系列、及び平均値が０で分散がσ_ｅ ^２である白色雑音から、前記時系列の自己相関関数、前記白色雑音の自己相関関数、及び前記時系列と前記白色雑音との相互相関関数を用いて、前記時系列を（ｐ，ｐ）次のＡＲＭＡモデルに当てはめたときのパラメータを推定する方法において、ラグが０の前記白色雑音の自己相関関数の値をσ_ｅ ^２とし、ラグが０以外の前記白色雑音の自己相関関数の値を０とし、前記時系列と前記白色雑音との相互相関関数のうち、前記時系列に含まれるデータと該データよりも未来の時点の白色雑音の値との相互相関関数の値を０とし、前記時系列の自己相関関数及び前記相互相関関数からなる２ｐ×２ｐの行列の逆行列を前記自己相関関数からなるｐ×ｐの行列及び前記相互相関関数からなるｐ×ｐの行列を用いて展開して高次項を省略した近似式に基づいて、前記パラメータを計算することを特徴とする。
【００２３】
第２発明に係るパラメータ推定方法は、記憶部及び演算部を備えたコンピュータを用いて、Ｎ個のデータから構成される時系列を（ｐ，ｐ）次のＡＲＭＡモデルに当てはめたときのパラメータを推定する方法において、Ｎ個のデータから構成される時系列｛ｘ_ｎ：ｎ＝１，２，…，Ｎ｝を記憶部に記憶し、Ｒ_ｋ＝Ｅ［ｘ_ｎｘ_ｎ＋ｋ］で定義される｛ｘ_ｎ｝の自己相関関数｛Ｒ_ｋ：ｋ＝０，１，…，ｐ｝を演算部にて計算し、平均値が０であるＮ個の値が時系列で並んだ白色雑音｛ｅ_ｎ：ｎ＝１，２，…，Ｎ｝を記憶部に記憶し、前記白色雑音の分散σ_ｅ ^２を演算部にて計算し、Ｔ_ｋ＝Ｅ［ｅ_ｎｘ_ｎ＋ｋ］で定義される｛ｅ_ｎ｝と｛ｘ_ｎ｝との相互相関関数｛Ｔ_ｋ：ｋ＝０，１，…，ｐ｝を演算部にて計算し、前記ＡＲＭＡモデルにおけるＡＲ過程のｐ個のパラメータを行列要素とする行列ａ＝［ａ_１，ａ_２，…，ａ_ｐ］^Ｔ、及び前記ＡＲＭＡモデルにおけるＭＡ過程のｐ個のパラメータを行列要素とする行列ｂ＝［ｂ_１，ｂ_２，…，ｂ_ｐ］^Ｔを、以下の式
【００２４】
【数１１】

【００２５】
に基づいて演算部にて計算することを特徴とする。
【００２６】
第３発明に係るパラメータ推定方法は、計算した行列ａ及び行列ｂに含まれる前記ＡＲＭＡモデルのパラメータ、並びに前記時系列の値を前記ＡＲＭＡモデルの式に代入して、新たな白色雑音を演算部にて計算し、計算した新たな白色雑音を用いて新たな行列ａ及び行列ｂを演算部にて計算し、行列ａ及び行列ｂが収束するまで行列ａ及び行列ｂの計算を演算部にて繰り返すことを特徴とする。
【００２７】
第４発明に係るデータ予測方法は、記憶部及び演算部を備えたコンピュータを用いて、複数のデータから構成される時系列から該時系列に加わる新たなデータを予測する方法において、時系列を記憶部に記憶し、第２又は第３発明に係るパラメータ推定方法を用い、前記時系列をＡＲＭＡモデルに当てはめたときのパラメータを演算部にて計算し、計算したパラメータ、及び記憶している前記時系列の値を前記ＡＲＭＡモデルの式に代入して、前記時系列に加わる新たなデータを演算部にて計算することを特徴とする。
【００２８】
第５発明に係るパラメータ推定装置は、Ｎ個のデータから構成される時系列を（ｐ，ｐ）次のＡＲＭＡモデルに当てはめたときのパラメータを推定する装置において、Ｎ個のデータから構成される時系列｛ｘ_ｎ：ｎ＝１，２，…，Ｎ｝を記憶する手段と、Ｒ_ｋ＝Ｅ［ｘ_ｎｘ_ｎ＋ｋ］で定義される｛ｘ_ｎ｝の自己相関関数｛Ｒ_ｋ：ｋ＝０，１，…，ｐ｝を計算する手段と、平均値が０であるＮ個の値が時系列で並んだ白色雑音｛ｅ_ｎ：ｎ＝１，２，…，Ｎ｝を記憶する手段と、前記白色雑音の分散σ_ｅ ^２を計算する手段と、Ｔ_ｋ＝Ｅ［ｅ_ｎｘ_ｎ＋ｋ］で定義される｛ｅ_ｎ｝と｛ｘ_ｎ｝との相互相関関数｛Ｔ_ｋ：ｋ＝０，１，…，ｐ｝を計算する手段と、前記ＡＲＭＡモデルにおけるＡＲ過程のｐ個のパラメータを行列要素とする行列ａ＝［ａ_１，ａ_２，…，ａ_ｐ］^Ｔ、及び前記ＡＲＭＡモデルにおけるＭＡ過程のｐ個のパラメータを行列要素とする行列ｂ＝［ｂ_１，ｂ_２，…，ｂ_ｐ］^Ｔを、以下の式
【００２９】
【数１２】

【００３０】
に基づいて計算する手段とを備えることを特徴とする。
【００３１】
第６発明に係るパラメータ推定装置は、計算した行列ａ及び行列ｂに含まれる前記ＡＲＭＡモデルのパラメータ、並びに前記時系列の値を前記ＡＲＭＡモデルの式に代入して、新たな白色雑音を計算する手段と、計算した新たな白色雑音を用いて新たな行列ａ及び行列ｂを計算する手段と、行列ａ及び行列ｂが収束するまで行列ａ及び行列ｂの計算を繰り返す手段とを更に備えることを特徴とする。
【００３２】
第７発明に係るデータ予測装置は、複数のデータから構成される時系列から該時系列に加わる新たなデータを予測する装置において、時系列を記憶する手段と、第５又は第６発明に係るパラメータ推定装置と、該パラメータ推定装置を用いて、前記時系列から、前記時系列をＡＲＭＡモデルに当てはめたときのパラメータを計算する手段と、計算したパラメータ、及び記憶している前記時系列の値を前記ＡＲＭＡモデルの式に代入して、前記時系列に加わる新たなデータを計算する手段とを備えることを特徴とする。
【００３３】
第８発明に係るコンピュータプログラムは、コンピュータに、Ｎ個のデータから構成される時系列を（ｐ，ｐ）次のＡＲＭＡモデルに当てはめたときのパラメータを推定させるコンピュータプログラムにおいて、コンピュータに、Ｎ個のデータから構成される時系列｛ｘ_ｎ：ｎ＝１，２，…，Ｎ｝から、Ｒ_ｋ＝Ｅ［ｘ_ｎｘ_ｎ＋ｋ］で定義される｛ｘ_ｎ｝の自己相関関数｛Ｒ_ｋ：ｋ＝０，１，…，ｐ｝を計算させる手順と、コンピュータに、平均値が０であるＮ個の値が時系列で並んだ白色雑音｛ｅ_ｎ：ｎ＝１，２，…，Ｎ｝の分散σ_ｅ ^２を計算させる手順と、コンピュータに、Ｔ_ｋ＝Ｅ［ｅ_ｎｘ_ｎ＋ｋ］で定義される｛ｅ_ｎ｝と｛ｘ_ｎ｝との相互相関関数｛Ｔ_ｋ：ｋ＝０，１，…，ｐ｝を計算させる手順と、コンピュータに、前記ＡＲＭＡモデルにおけるＡＲ過程のｐ個のパラメータを行列要素とする行列ａ＝［ａ_１，ａ_２，…，ａ_ｐ］^Ｔ、及び前記ＡＲＭＡモデルにおけるＭＡ過程のｐ個のパラメータを行列要素とする行列ｂ＝［ｂ_１，ｂ_２，…，ｂ_ｐ］^Ｔを、以下の式
【００３４】
【数１３】

【００３５】
に基づいて計算させる手順とを含むことを特徴とする。
【００３６】
第９発明に係るコンピュータプログラムは、コンピュータに、計算した行列ａ及び行列ｂに含まれる前記ＡＲＭＡモデルのパラメータ、並びに前記時系列の値を前記ＡＲＭＡモデルの式に代入して、新たな白色雑音を計算させる手順と、コンピュータに、計算した新たな白色雑音を用いて新たな行列ａ及び行列ｂを計算させる手順と、コンピュータに、行列ａ及び行列ｂが収束するまで行列ａ及び行列ｂの計算を繰り返させる手順とを更に含むことを特徴とする。
【００３７】
第１０発明に係るコンピュータプログラムは、コンピュータに、複数のデータから構成される時系列から該時系列に加わる新たなデータを予測させるコンピュータプログラムにおいて、コンピュータに、第８又は第９発明に係るコンピュータプログラムを用いて、記憶している時系列から、該時系列をＡＲＭＡモデルに当てはめたときのパラメータを計算させる手順と、コンピュータに、計算したパラメータ、及び記憶している前記時系列の値を前記ＡＲＭＡモデルの式に代入して、前記時系列に加わる新たなデータを計算させる手順とを含むことを特徴とする。
【００３８】
第１１発明に係るコンピュータでの読み取りが可能な記録媒体は、コンピュータに、Ｎ個のデータから構成される時系列を（ｐ，ｐ）次のＡＲＭＡモデルに当てはめたときのパラメータを推定させるコンピュータプログラムを記録してあるコンピュータでの読み取りが可能な記録媒体において、コンピュータに、Ｎ個のデータから構成される時系列｛ｘ_ｎ：ｎ＝１，２，…，Ｎ｝から、Ｒ_ｋ＝Ｅ［ｘ_ｎｘ_ｎ＋ｋ］で定義される｛ｘ_ｎ｝の自己相関関数｛Ｒ_ｋ：ｋ＝０，１，…，ｐ｝を計算させる手順と、コンピュータに、平均値が０であるＮ個の値が時系列で並んだ白色雑音｛ｅ_ｎ：ｎ＝１，２，…，Ｎ｝の分散σ_ｅ ^２を計算させる手順と、コンピュータに、Ｔ_ｋ＝Ｅ［ｅ_ｎｘ_ｎ＋ｋ］で定義される｛ｅ_ｎ｝と｛ｘ_ｎ｝との相互相関関数｛Ｔ_ｋ：ｋ＝０，１，…，ｐ｝を計算させる手順と、コンピュータに、前記ＡＲＭＡモデルにおけるＡＲ過程のｐ個のパラメータを行列要素とする行列ａ＝［ａ_１，ａ_２，…，ａ_ｐ］^Ｔ、及び前記ＡＲＭＡモデルにおけるＭＡ過程のｐ個のパラメータを行列要素とする行列ｂ＝［ｂ_１，ｂ_２，…，ｂ_ｐ］^Ｔを、以下の式
【００３９】
【数１４】

【００４０】
に基づいて計算させる手順とを含むコンピュータプログラムを記録してあることを特徴とする。
【００４１】
本発明者は、ＡＲＭＡモデルの式の構成から導かれるａｐｒｉｏｒｉｋｎｏｗｌｅｄｇｅを導入することにより、従来の疑似線形回帰法の改良を試みた。（１）式より、ＡＲＭＡモデルでは、現在のデータの値は、過去のデータの値および過去の白色雑音の値に関連づけられている。しかし、現在のデータの値は、未来の白色雑音の値とは関連性がないため、データと該データよりも未来の白色雑音との相互相関関数は、理論的に、
Ｅ［ｘ_ｎｅ_ｎ＋ｋ］＝０
となる。同様に、
Ｅ［ｅ_ｎｘ_ｎ−ｋ］＝０
である。これにより、（５）式に含まれるＴ_−ｋは理論的に０とすることができる。また、白色雑音｛ｅ_ｎ｝の自己相関関数は、理論的に、
【００４２】
【数１５】

【００４３】
となる。実際の場合は、Ｎの値が有限であるため、相互相関関数および自己相関関数を計算した値は、理論的な値とは違った値となる。理論的な値が得られる部分は、理論的な値と置き換えることにより、（５）式は、
【００４４】
【数１６】

【００４５】
と表すことができる。ｐ×ｐの単位行列Ｉを用いて、（１２）式は、更に、
【００４６】
【数１７】

【００４７】
と表すことができる。ここで、行列Ｋ，Ｌ，Ｍ，Ｎと、
Ｚ＝Ｎ−ＭＫ^−１Ｌ
Ｘ＝Ｋ^−１Ｌ
Ｙ＝ＭＫ^−１
である行列Ｘ，Ｙ，Ｚに関する以下の公式
【００４８】
【数１８】

【００４９】
を（１３）式に含まれる逆行列に対して適用し、更に、行列Ａ，Ｂ，Ｃ及びＢの転置行列Ｂ^Ｔに関する以下の公式
［Ａ＋ＢＣＢ^Ｔ］^−１＝Ａ^−１−Ａ^−１Ｂ［Ｃ^−１＋Ｂ^ＴＡ^−１Ｂ］^−１Ｂ^ＴＡ^−１
を繰り返し適用すると、前記逆行列は、以下の如く表すことができる。
【００５０】
【数１９】

【００５１】
（１３）式に含まれる逆行列は、正定値対称行列であるため、（１３）式は一意の解を持つ。（１５）式においてｎ＝０としたものを１次近似とし、ｎ＝１としたものを２次近似とする。前記逆行列を１次近似で表すと、（１３）式より、求めるべきパラメータであるベクトルａ，ｂを計算する式は、
【００５２】
【数２０】

【００５３】
となる。また、前記逆行列を２次近似で表すと、求めるべきパラメータであるベクトルａ，ｂを計算する式は、
【００５４】
【数２１】

【００５５】
となる。１次近似の（１６）式または２次近似の（１７）式は、従来の（１０）式および（１１）式に比べて、式を構成する各行列が単純になっており、更に、必要な逆行列の計算が少なくなっている。従って、（１６）式または（１７）式を用いることで、（１０）式および（１１）式を用いる従来の方法に比べて簡易的にパラメータの推定値を計算することができる。
【００５６】
第１発明においては、時系列をＡＲＭＡモデルに当てはめたときの前記ＡＲＭＡモデルのパラメータを推定する際に、白色雑音の自己相関関数、及び時系列と白色雑音との相互相関関数に理論的な値を用い、更にパラメータの推定値を計算する式に含まれる逆行列を展開して高次の項を省略した近似式に基づいて、簡易的にパラメータの推定値を計算する。
【００５７】
第２、第５、第８及び第１１発明においては、白色雑音｛ｅ_ｎ｝を記憶し、記憶した白色雑音と時系列とから、ＡＲＭＡモデルのパラメータであるベクトルａ及びｂの推定値を、理論的な値を含む１次近似の（１６）式または２次近似の（１７）式に基づいて計算する。
【００５８】
第３、第６及び第９発明においては、計算したベクトルａ及びｂから、（１）式を用いて新たな白色雑音｛ｅ_ｎ｝を計算し、推定すべきパラメータであるベクトルａ及びｂが収束するまで計算を繰り返す。従来に比べて単純化された行列を含む式を用いるため、ベクトルａ及びｂがより収束しやすい。
【００５９】
第４、第７、第１０発明においては、従来より確実にパラメータを推定できる方法により推定したパラメータを用いてデータを予測することにより、従来ではデータの予測ができなかった時系列についてもデータを予測できる。
【００６０】
【発明の実施の形態】
以下本発明をその実施の形態を示す図面に基づき具体的に説明する。
図１は、本発明のデータ予測装置の構成を示すブロック図である。データ予測装置１は、本発明のパラメータ推定装置の機能を兼ね備えており、コンピュータを用いて構成されている。データ予測装置１は、演算を行うＣＰＵ（演算部）１１と、演算に伴って発生する一時的な情報を記憶するＲＡＭ（記憶部）１２と、ＣＤ−ＲＯＭドライブ等の外部記憶装置１３と、ハードディスク等の内部記憶装置１４とを備えており、ＣＤ−ＲＯＭ等の本発明に係る記録媒体２から本発明に係るコンピュータプログラム２０を外部記憶装置１３にて読み取り、読み取ったコンピュータプログラム２０を内部記憶装置１４に記憶し、ＲＡＭ１２にコンピュータプログラム２０をロードし、ＣＰＵ１１は、ロードしたコンピュータプログラム２０に基づいてデータ予測装置１に必要な処理を実行する。データ予測装置１は、キーボード又はマウス等の入力装置１５と、液晶ディスプレイ又はＣＲＴディスプレイ等の出力装置１６とを備えており、データの入力を初めとするオペレータからの操作を受け付ける構成となっている。
【００６１】
なお、データ予測装置１は、図示しない通信インタフェースを備え、通信インタフェースに接続されている図示しない外部のサーバ装置から本発明に係るコンピュータプログラム２０をダウンロードし、ＣＰＵ１１にて処理を実行する形態であってもよい。
【００６２】
図２は、本発明のデータ予測装置が行うパラメータ推定の処理およびデータ予測の処理を示すフローチャートである。データ予測装置１は、オペレータの操作により入力装置１５に入力されたデータを読み込む、又は内部記憶装置１４に記憶してあるデータを読み込む等して、ＡＲＭＡモデルにおけるＡＲ過程のパラメータ（ａ_１，ａ_２，…，ａ_ｐ）及びＭＡ過程のパラメータ（ｂ_１，ｂ_２，…，ｂ_ｐ）の数ｐと、Ｎ個のデータから構成されるデータの時系列｛ｘ_ｎ：ｎ＝１，２，…，Ｎ｝とをＲＡＭ１２に読み込む（Ｓ１）。ＣＰＵ１１は、コンピュータプログラム２０をＲＡＭ１２へロードし、ロードしたコンピュータプログラム２０に従って、ＲＡＭ１２に読み込んだ｛ｘ_ｎ｝を用いて、｛ｘ_ｎ｝の自己相関関数Ｒ_ｋを、ｋ＝０，１，…，ｐとした次の式
【００６３】
【数２２】

【００６４】
に基づいて計算して、（６）式で定義されるｐ×ｐの行列Ｒの逆行列Ｒ^−１、及びｐ×１の行列ｒを生成する（Ｓ２）。ＣＰＵ１１は、次に、ＲＡＭ１２にロードしたコンピュータプログラム２０に従って、乱数生成のルーチンを用いて時刻の値を種値とする乱数を発生させる等して、Ｎ個の値から構成される平均値０の白色雑音｛ｅ_ｎ：ｎ＝１，２，…，Ｎ｝を生成し（Ｓ３）、生成した白色雑音｛ｅ_ｎ｝の分散σ_ｅ ^２を計算する（Ｓ４）。ＣＰＵ１１は、次に、ＲＡＭ１２にロードしたコンピュータプログラム２０に従って、データの時系列｛ｘ_ｎ｝及び白色雑音｛ｅ_ｎ｝を用いて、｛ｘ_ｎ｝と｛ｅ_ｎ｝との相互相関関数Ｔ_ｋを、ｋ＝０，１，…，ｐとした次の式
【００６５】
【数２３】

【００６６】
に基づいて計算して、（１４）式で定義されるｐ×ｐの行列Ｔ、及びｐ×１の行列ｔを生成し（Ｓ５）、生成したＲ^−１及びＴを用いて、行列Ｒ^−１Ｔ，Ｔ^ＴＲ^−１を計算する（Ｓ６）。ここで、２次近似の計算式を用いてパラメータの推定値を計算する場合は、ステップＳ６において、行列Ｒ^−１Ｔ，Ｔ^ＴＲ^−１に加えて、更に行列Ｔ^ＴＲ^−１Ｔを計算する。ＣＰＵ１１は、次に、ＲＡＭ１２にロードしたコンピュータプログラム２０に従って、計算したＲ^−１、Ｒ^−１Ｔ、Ｔ^ＴＲ^−１、又はＴ^ＴＲ^−１Ｔを（１６）式または（１７）式へ代入して、求めるべきパラメータであるベクトルａ＝［ａ_１，ａ_２，…，ａ_ｐ］^Ｔ及びｂ＝［ｂ_１，ｂ_２，…，ｂ_ｐ］^Ｔを計算する（Ｓ７）。
【００６７】
ＣＰＵ１１は、次に、ＲＡＭ１２にロードしたコンピュータプログラム２０に従って、計算したベクトルａ及びｂが収束したか否かを判定し（Ｓ８）、計算したベクトルａ及びｂが収束していない場合は（Ｓ８：ＮＯ）、ベクトルａ及びｂとデータの時系列｛ｘ_ｎ｝とを（１）式に代入して、新たな白色雑音｛ｅ_ｎ｝を計算し（Ｓ９）、処理をステップＳ４へ戻し、ベクトルａ及びｂが収束するまでステップＳ４〜Ｓ９の処理を繰り返す。ステップＳ８では、ＣＰＵ１１は、例えば、最新のベクトルａ及びｂと前回のループで計算したベクトルａ及びｂであるベクトルａ’及びｂ’とから、
Δ＝｜ａ−ａ’｜＋｜ｂ−ｂ’｜
を計算し、Δの値が所定値よりも小さい場合に、ベクトルａ及びｂが収束したと判定する。ステップＳ８においてベクトルａ及びｂが収束している場合には（Ｓ８：ＹＥＳ）、ＣＰＵ１１は、ＲＡＭ１２にロードしたコンピュータプログラム２０に従って、計算したベクトルａ及びｂを、求めるべきＡＲＭＡモデルのパラメータの推定値であると決定し（Ｓ１０）、（１）式のＡＲＭＡモデルにより時系列｛ｘ_ｎ｝の予測データを計算し（Ｓ１１）、処理を終了する。
【００６８】
以上の処理により、ＡＲＭＡモデルのパラメータθが推定され、時系列の新たなデータが予測できる。従来のパラメータ推定方法では、（１０）式および（１１）式を用いて計算を行っており、ｐ×ｐのフルマトリックスＲ、τ、及びＳを用いて計算を行っている。これに対して、本発明では、ａｐｒｉｏｒｉｋｎｏｗｌｅｄｇｅを導入することにより、ベクトルａ及びｂは、ｐ×ｐの一つのフルマトリックスであるＲ、三角行列であるＴ、及び単位行列Ｉを用いて計算され、パラメータの推定値であるベクトルａ及びｂがより収束し易くなる。また、更に２ｐ×２ｐの逆行列を展開して高次項を省略する近似を行った式を用いて計算を行っているため、繰り返し計算でベクトルａ及びｂを計算する都度（１１）式を用いて逆行列の計算を行う必要がある従来の方法に比べて、本発明では、逆行列の計算はＲ^−１を計算する一度のみであり、逆行列の計算が大幅に少なくなって、より簡単にパラメータの推定値を計算することができ、計算時間を短縮することができる。従って、従来ではデータの予測ができなかった時系列について、データの予測が可能となり、株価または売上高などの経済的な時系列について本発明を適用して、従来よりも確実に株価または売上高などのデータの予測が可能となる。
【００６９】
なお、本実施の形態においては、本発明のデータ予測装置１は、本発明のパラメータ推定装置の機能を兼ね備えている形態を示したが、これに限るものではなく、パラメータ推定装置と、ＡＲＭＡモデルによりデータを予測するデータ予測装置とを夫々別のコンピュータにて構成し、互いに情報を入出力してパラメータの推定とデータの予測とを行う形態としても良い。また、本実施の形態においては、パラメータ推定を行った後にデータ予測を行う形態を示しているが、これに限るものではなく、パラメータの推定値を記憶しておき、必要時に記憶しているパラメータの推定値を用いて予測データを計算する処理を行う形態であってもよい。
【００７０】
【発明の効果】
第１発明においては、時系列をＡＲＭＡモデルに当てはめたときの前記ＡＲＭＡモデルのパラメータを推定する際に、白色雑音の自己相関関数、及び時系列と白色雑音との相互相関関数に理論的な値を用い、更にパラメータの推定値を計算する式に含まれる逆行列を展開して高次の項を省略した近似式に基づいて、簡易的にパラメータの推定値を計算することにより、確実にパラメータの値を推定することができる。
【００７１】
第２、第５、第８及び第１１発明においては、白色雑音｛ｅ_ｎ｝を記憶し、記憶した白色雑音と時系列とから、ＡＲＭＡモデルのパラメータであるベクトルａ及びｂの推定値を、理論的な値を用いて行列を単純化し、更に計算式に含まれる逆行列を展開して高次項を省略する近似式を用いて計算するため、逆行列の計算が少なくなって、従来より簡単にパラメータの値を計算することができる。
【００７２】
第３、第６及び第９発明においては、パラメータの推定値が収束するまで計算を繰り返す際に、理論的な値を用いて単純化した計算式を用いてパラメータの推定値を計算するため、従来に比べてよりパラメータの推定値が収束しやすくなって確実にパラメータの値を推定することが可能となり、更に計算時間が短縮される。
【００７３】
第４、第７、第１０発明においては、従来より確実にパラメータを推定できる方法により推定したパラメータを用いてデータを予測することにより、従来ではデータの予測ができなかった時系列についてもデータを予測することができる。また、株価または売上高などの経済的な時系列について本発明を適用して、従来よりも確実に株価または売上高などのデータの予測が可能となる等、本発明は優れた効果を奏する。
【図面の簡単な説明】
【図１】本発明のデータ予測装置の構成を示すブロック図である。
【図２】本発明のデータ予測装置が行うパラメータ推定の処理およびデータ予測の処理を示すフローチャートである。
【符号の説明】
１データ予測装置（パラメータ推定装置）
１１ＣＰＵ（演算部）
１２ＲＡＭ（記憶部）
２記録媒体
２０コンピュータプログラム[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention implements a method for estimating parameters from a time series of data, a method for estimating new data using the estimated parameters, a parameter estimating device, a data estimating method, and a computer as a parameter estimating device or a data estimating device. And a recording medium for the same.
[0002]
[Prior art]
Conventionally, various methods have been proposed as a method of analyzing a time series of accumulated data and predicting a value of new data. A general economic time series composed of data such as stock prices or sales volume can be regarded as the output of a system excited by white noise. When the time series of data {x _n : n = 1, 2,..., N,...} Is a sample time series of the stationary ergodic normal process x (t), the (p, q) -th ARMA model Then, the relationship between the elements of {x _n } is represented by the following equation.
[0003]
(Equation 5)

[0004]
Here, _{{e n}} is a normalized noise of zero mean and variance _{^{_{σ e 2, {a i:}}} i = 1,2, ..., p} and _{{b j: j = 1,2,} ... , Q} is
^{_{^{A (Z -1) = 1 +}}} a 1 Z -1 + ... + a p Z -p
^{_{^{B (Z -1) = 1 +}}} b 1 Z -1 + ... + b q Z -q
In this case, A (Z ^-1 ) and B (Z ^-1 ) are rules, and further satisfy a steady condition, a reversible condition and a strong real condition.
[0005]
As a method for estimating the parameters {a _i } and {b _j } of equation (1) from the sample time series {x _n }, there is a pseudo-linear regression method disclosed in Non-Patent Document 1. In modern control theory, the generality is not lost even if p = q. To define the vector Z _n and the vector θ in the following manner.
_{_{_{Z n = [- x n-}}} 1, ..., -x n-p, e n-1, ..., e n-p] T
_{_{_{θ = [a 1, a 2}}} , ..., a p, b 1, b 2, ..., b p] T
When the equation (1) is rewritten into an equation for _xn , the equation (1) becomes
_{^{_{x n = θ T Z n +}}} e n ... (2)
It can be expressed as. When a time series {x _n : n = 1, 2,..., N} composed of N data is obtained, to obtain the vector θ from the time series by the least square method, the following equation is used. ]
(Equation 6)

[0007]
May be obtained to minimize the right side of. Therefore, the vector θ estimated from the time series is given by the following equation by partially differentiating the right side of the equation (3) with θ and making it equal to 0.
[0008]
(Equation 7)

[0009]
If the autocorrelation function of {x _n } with lag k is R _k ,
_{_{_{R k = E [x n x}}} n + k]
It is. Here, E [y _n ] is an expected value of the data sequence {y _n : n = 1, 2,...}. Also,
R- _k = _Rk
It is. The autocorrelation function of the lag is k _{{e n}} When _{S k,} similarly,
_{_{_{S k = E [e n e}}} n + k], S -k = S k
It is. Further, the cross-correlation function between _{{x n}} and _{{e n}} is
_{_{_{T k = E [e n x}}} n + k]
_{_{_{T -k = E [e n x}}} n-k]
After all,
_{_{_{E [x n e n + k}}} ] = E [e n x n-k] = T -k
_{_{_{E [x n e n-k}}} ] = E [e n x n + k] = T k
It becomes.
[0010]
(4) the right side of the equation can be represented by a cross-correlation function between the autocorrelation function of _{{x n},} the autocorrelation function of _{{e n},} and a _{_{{x n} {e n}}} , (4) ceremony,
[0011]
(Equation 8)

[0012]
It becomes. Vectors a and b are
a = [a ₁ , a ₂ ,..., a _p ] ^T
_{_{b = [b 1, b 2}} , ..., b p] T
And the matrix included in equation (5) is
[0013]
(Equation 9)

[0014]
Equation (5) gives:
[0015]
(Equation 10)

[0016]
It can be expressed as. Since vectors a, b, r, and t are matrices of p rows and 1 column, and matrices R, T, and S are matrices of p rows and p columns, equation (7) is a form of a simultaneous equation for vectors a and b. Can be rewritten to
R · a−τ · b = −r (8)
−τ ^T · a + S · b = t (9)
It becomes. From equation (9),
^{b = S -1 (t + τ} T · a) ... (10)
By substituting equation (10) into equation (8),
a = (R−τ · S ⁻¹ · τ ^T ) ⁻¹ (τ · S ⁻¹ · tr) (11)
Is required. The vector a is obtained from the equation (11), and the vector b can be calculated by substituting the obtained vector a into the equation (10).
[0017]
The pseudo-linear regression, the white noise _{{e n}} generated using a random number, from the generated and _{{e n}} time series _{{x n},} with (11) and (10), (1) formula to calculate the parameters _{{a i}} and _{{b j},} the calculated _{{a i}} and _{{b j},} calculate the white noise _{{e n}} using the equation (1), the parameters The calculation is repeated until {a _i } and {b _j } converge, and parameters {a _i } and {b _j } are obtained.
[0018]
[Non-patent document 1]
Toru Katayama "Introduction to System Identification" Asakura Shoten 1994
[Problems to be solved by the invention]
The quasi-linear regression method has a problem that parameters {a _i } and {b _j } do not converge and parameters cannot be estimated depending on the nature of the problem. Further, there is a problem that the calculation time is long because the matrix calculation of p rows and p columns is performed.
[0020]
The present invention has been made in order to solve the above-described problem, and the purpose is to set a numerical value that should theoretically be 0 among numerical values used for calculation to be 0, and furthermore, to form an inverse matrix. A parameter estimation method, a parameter estimation device, and a computer program for realizing a computer as the parameter estimation device, which can facilitate convergence of parameter values by performing expanded calculations, and at the same time, reduce the calculation time. It is to provide a recording medium.
[0021]
Another object of the present invention is to estimate data using parameters estimated by the parameter estimation method, and thereby to estimate data even for a time series for which data cannot be estimated by the conventional method. It is an object of the present invention to provide a data prediction method, a data prediction device, a computer program for realizing a computer as the data prediction device, and a recording medium.
[0022]
[Means for Solving the Problems]
The parameter estimation method according to a first aspect of the present invention provides a time series composed of a plurality of data and an autocorrelation function of the time series and an autocorrelation function of the white noise from a white noise having an average value of 0 and a variance of σ _e ^2. A method of estimating parameters when the time series is applied to an (p, p) -order ARMA model using a correlation function and a cross-correlation function between the time series and the white noise, wherein the lag of 0 is used. The value of the autocorrelation function of the white noise is σ _e ² , the value of the autocorrelation function of the white noise other than 0 is 0, and the time series of the cross correlation function between the time series and the white noise is The value of the cross-correlation function between the data contained in the data and the value of the white noise at a point in time later than the data is 0, and the inverse matrix of a 2p × 2p matrix composed of the time-series autocorrelation function and the cross-correlation function Is the autocorrelation function The parameter is calculated based on an approximate expression in which expansion is performed using a p × p matrix composed of the following and a p × p matrix composed of the cross-correlation function and high-order terms are omitted.
[0023]
A parameter estimation method according to a second aspect of the present invention uses a computer including a storage unit and an operation unit to calculate a parameter when a time series composed of N data is applied to an (p, p) -th ARMA model. In the estimation method, a time series {x _n : n = 1, 2,..., N} composed of N data is stored in the storage unit, and is defined by R _k = E [x _n x _{n + k} ]. An autocorrelation function {R _k : k = 0, 1,..., P} of {x _n } is calculated by an arithmetic unit, and white noise {e} in which N values having an average value of 0 are arranged in time series _{n: n} = 1,2, ..., store n} in the storage unit, the variance sigma _e ² of the white noise calculated by the calculating unit _is defined by _{_{T k = E [e n x}} n + k] { e _n} and _{{x n}} and the cross-correlation function _{T k of: k = 0, 1, ..., a p} calculated by the calculating unit, the ARMA Matrix _a = a p number of parameters matrix elements of the AR process in del _{[a 1, a 2, ...} , a p] T, and matrix b to p number of parameters matrix elements of MA process in the ARMA model = [B ₁ , b ₂ ,..., B _p ] ^T is represented by the following equation:
[Equation 11]

[0025]
The calculation is performed by the calculation unit based on
[0026]
The parameter estimation method according to a third aspect of the present invention is a method for calculating a new white noise by substituting the parameters of the ARMA model and the time series values included in the calculated matrix a and matrix b into the equation of the ARMA model. And a new matrix a and a matrix b are calculated by the calculation unit using the calculated new white noise, and the calculation of the matrices a and b is performed by the calculation unit until the matrices a and b converge. It is characterized by repeating.
[0027]
A data prediction method according to a fourth invention is a method for predicting new data to be added to a time series from a time series composed of a plurality of data by using a computer having a storage unit and an arithmetic unit. The parameter when the time series is applied to the ARMA model is calculated by the calculation unit, and the calculated parameter is stored in the storage unit and the parameter estimation method according to the second or third invention is used. A time series value is substituted into the ARMA model formula, and new data added to the time series is calculated by an arithmetic unit.
[0028]
A parameter estimating apparatus according to a fifth aspect of the present invention is an apparatus for estimating a parameter when a time series composed of N data is applied to an (p, p) -th ARMA model. time series _{{x n: n = 1,2,} ..., n} means for _{_{_{storing, R k = E [x n}}} x n + k] autocorrelation function of _{{x n}} defined by _{R k: k = 0,1, ..., means for calculating a p}, the average value of 0 n number of values the white noise arranged in time series _{{e n: n = 1,2,} ..., means for storing the n} When the means for calculating the variance sigma _e ² of white _{_{_{noise, T k = E [e n}}} x n + k] cross-correlation function is defined as _{{e n}} and _{{x n}} in _{T k: k = 0, 1,..., P}, and p parameters of the AR process in the ARMA model are defined as matrix elements. That matrix _{_{a = [a 1, a 2}} , ..., a p] T, and the matrix _b = a p number of parameters matrix elements of MA process in the ARMA model _{[b 1, b 2, ...} , b p] ^{T is represented} by the following formula:
(Equation 12)

[0030]
Means for calculating based on
[0031]
The parameter estimating apparatus according to the sixth aspect of the present invention calculates new white noise by substituting the parameters of the ARMA model and the values of the time series included in the calculated matrices a and b into the equations of the ARMA model. Means, means for calculating new matrices a and b using the calculated new white noise, and means for repeating calculations of matrices a and b until the matrices a and b converge. Features.
[0032]
A data prediction device according to a seventh aspect of the present invention is a device for predicting new data to be added to the time series from a time series composed of a plurality of data. A parameter estimating apparatus, means for calculating a parameter when the time series is applied to an ARMA model from the time series using the parameter estimating apparatus, a calculated parameter, and a stored value of the time series Is substituted into the ARMA model equation to calculate new data added to the time series.
[0033]
The computer program according to the eighth invention is a computer program for causing a computer to estimate parameters when a time series composed of N data is applied to a (p, p) next ARMA model. time series from the data made _{{x n: n = 1,2,} ..., n} _from, R k _{= E} autocorrelation function of the _{[x n} x _{n + k]} is defined by _{_{x n}} _{R k: k = 0, 1, ..., a procedure for computing the p}, a computer, an average value of 0 n number of values the white noise arranged in time series _{{e n: n = 1,2,} ..., n a procedure for calculating the variance sigma _e ² of}, to a _{_{_{computer, T k = E [e n}}} x n + k] cross-correlation function is defined as _{{e n}} and _{{x n}} in _{T k: k = 0 , 1, ..., p｝ and computer Matrix _{_{a = [a 1, a 2}} , ..., a p] and p number of parameters matrix elements of the AR process in the ARMA model and ^T, and p number of parameters matrix elements of MA process in the ARMA model The matrix b = [b ₁ , b ₂ ,..., B _p ] ^T is expressed by the following equation:
(Equation 13)

[0035]
And a procedure for performing calculation based on
[0036]
A computer program according to a ninth invention is a computer program, which substitutes the calculated parameters a and b of the ARMA model included in the matrix a and the matrix b and the value of the time series into a formula of the ARMA model to generate new white noise. A procedure for causing the computer to calculate new matrices a and b using the calculated new white noise; and a step for causing the computer to calculate the matrices a and b until the matrices a and b converge. And a step of repeating.
[0037]
The computer program according to the tenth invention is a computer program for causing a computer to predict new data to be added to the time series from a time series composed of a plurality of data. By using the stored time series to calculate a parameter when the time series is applied to the ARMA model; and causing a computer to calculate the calculated parameter and the stored value of the time series in the ARMA model. Substituting into a model equation to calculate new data added to the time series.
[0038]
A computer-readable recording medium according to an eleventh invention is a computer program for causing a computer to estimate parameters when a time series composed of N data is applied to a (p, p) next ARMA model. Is recorded on a computer-readable recording medium on which is stored a time series {x _n : n = 1, 2,..., N} composed of N data, and R _k = E [ x _{_n} x _n _{+ k]} autocorrelation function of _{{x n}} defined by _{{R k: k = 0,1,} ..., p} and procedure for calculating the a computer, an average value of 0 n number of values is defined _{by: {n = 1,2, ...,} n e n} and procedure for calculating the variance sigma _e ² of the _{_{_{computer, T k = E [e n}}} x n + k] but when white noise aligned with sequence _{{e n}} and the _{{x n}} A procedure for calculating the cross-correlation function {T _k : k = 0, 1,..., P}, and a computer in which a matrix a = [a ₁ , a using the p parameters of the AR process in the ARMA model as matrix elements ₂ ,..., A _p ] ^T , and a matrix b = [b ₁ , b ₂ ,..., Bp] ^T having _p parameters of the MA process in the ARMA model as matrix elements, are represented by the following equations.
[Equation 14]

[0040]
A computer program that includes a procedure for calculating based on the computer program.
[0041]
The present inventor tried to improve the conventional pseudo-linear regression method by introducing a priori knowledge derived from the construction of the ARMA model equation. From equation (1), in the ARMA model, the value of the current data is associated with the value of the past data and the value of the past white noise. However, since the value of the current data is not related to the value of the future white noise, the cross-correlation function between the data and the future white noise than the data is theoretically:
_{_{E [x n e n + k}} ] = 0
It becomes. Similarly,
_{_{E [e n x n-k}} ] = 0
It is. Thereby, T− _k included in the equation (5) can be theoretically set to 0. Furthermore, the autocorrelation function of white noise {e _n} is theoretically
[0042]
(Equation 15)

[0043]
It becomes. In an actual case, since the value of N is finite, the value obtained by calculating the cross-correlation function and the auto-correlation function is different from the theoretical value. By replacing the part where the theoretical value is obtained with the theoretical value, the equation (5) becomes
[0044]
(Equation 16)

[0045]
It can be expressed as. Using the p × p unit matrix I, equation (12) further
[0046]
[Equation 17]

[0047]
It can be expressed as. Where the matrices K, L, M, N
Z = N-MK ^-1 L
X = K ^-1 L
Y = MK ^-1
The following formula for the matrices X, Y, Z that is
(Equation 18)

[0049]
(13) was applied to the inverse matrix included in the formula, further, the matrix A, B, C and the following formula ^[A ⁺ BCB ^T] relates transposed matrix ^{B T} of B ^{^-1} = A ^-1 ^-A ^-1 B ^{^{^{^{[C -1 + B T A -1}}}} B] -1 B T A -1
Is repeatedly applied, the inverse matrix can be expressed as follows.
[0050]
[Equation 19]

[0051]
Since the inverse matrix included in equation (13) is a positive definite symmetric matrix, equation (13) has a unique solution. In equation (15), the one where n = 0 is the first-order approximation, and the one where n = 1 is the second-order approximation. When the inverse matrix is represented by a first-order approximation, the expression for calculating the vectors a and b, which are the parameters to be obtained, from Expression (13) is as follows.
[0052]
(Equation 20)

[0053]
It becomes. When the inverse matrix is expressed by a quadratic approximation, an expression for calculating the vectors a and b, which are parameters to be obtained, is as follows.
[0054]
(Equation 21)

[0055]
It becomes. In the first-order approximation (16) or the second-order approximation (17), each matrix constituting the expression is simpler than the conventional expressions (10) and (11). Inverse matrix calculations are reduced. Therefore, by using the expression (16) or the expression (17), it is possible to calculate the estimated value of the parameter more easily than the conventional method using the expressions (10) and (11).
[0056]
In the first invention, when estimating parameters of the ARMA model when the time series is applied to the ARMA model, theoretical values are used for an autocorrelation function of white noise and a cross-correlation function of the time series and white noise. Is used, and an inverse matrix included in the expression for calculating the estimated value of the parameter is expanded, and the estimated value of the parameter is simply calculated based on an approximate expression in which higher-order terms are omitted.
[0057]
Second, fifth, in the eighth and eleventh invention, stores white noise {e _n}, from a storage white noise and time-series, an estimate of the vectors a and b are parameters of the ARMA model, The calculation is performed based on the first-order approximation (16) or the second-order approximation (17) including a theoretical value.
[0058]
Third, in the sixth and ninth invention, the calculated vectors a and b, (1) formula to calculate the new white noise {e _n}, a vector a and b are parameters to be estimated Repeat calculation until convergence. Vectors a and b are more likely to converge because an equation including a simplified matrix is used as compared with the related art.
[0059]
According to the fourth, seventh, and tenth aspects of the present invention, data is predicted using parameters estimated by a method capable of more reliably estimating parameters than in the past, so that data can be obtained even in a time series in which data could not be predicted conventionally. Can be predicted.
[0060]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, the present invention will be specifically described with reference to the drawings showing the embodiments.
FIG. 1 is a block diagram showing the configuration of the data prediction device of the present invention. The data prediction device 1 has the function of the parameter estimation device of the present invention, and is configured using a computer. The data prediction device 1 includes a CPU (arithmetic unit) 11 that performs an arithmetic operation, a RAM (a storage unit) 12 that stores temporary information generated by the arithmetic operation, an external storage device 13 such as a CD-ROM drive, An internal storage device 14 such as a hard disk; reads a computer program 20 according to the present invention from a recording medium 2 according to the present invention such as a CD-ROM in an external storage device 13; The data is stored in the device 14 and the computer program 20 is loaded into the RAM 12, and the CPU 11 executes processing necessary for the data prediction device 1 based on the loaded computer program 20. The data prediction device 1 includes an input device 15 such as a keyboard or a mouse, and an output device 16 such as a liquid crystal display or a CRT display, and is configured to receive an operation from an operator including data input. .
[0061]
The data prediction device 1 includes a communication interface (not shown), downloads the computer program 20 according to the present invention from an external server device (not shown) connected to the communication interface, and executes the processing by the CPU 11. You may.
[0062]
FIG. 2 is a flowchart illustrating a parameter estimation process and a data prediction process performed by the data prediction device of the present invention. The data prediction device 1 reads the data input to the input device 15 by the operation of the operator, or reads the data stored in the internal storage device 14 to read the parameters (a ₁ , a) of the AR process in the ARMA model. _2, ..., _{a p)} of and MA process parameters _{_{(b 1, b 2, ...}} , the number p of _{b p),} the time-series data consisting of n data _{x n: n = 1,2 ,..., N} are read into the RAM 12 (S1). The CPU 11 loads the computer program 20 into the RAM 12 and, according to the loaded computer program 20, uses {x _n } read into the RAM 12 to calculate the autocorrelation function R _k of {x _n }, k = 0, 1,. , P, the following equation:
(Equation 22)

[0064]
To generate an inverse matrix R ^{−1 of} a p × p matrix R defined by the equation (6) and a p × 1 matrix r (S2). Next, the CPU 11 generates a random number using the time value as a seed value by using a random number generation routine according to the computer program 20 loaded into the RAM 12, and the like. white noise _{{e n: n = 1,2,} ..., n} to generate the (S3), calculating the variance sigma _e ² of the generated white noise _{{e n}} (S4). CPU11 then, in accordance with a computer program 20 loaded into the RAM 12, using the time series _{{x n}} and the white noise _{{e n}} of data, the cross-correlation function between _{{x n}} and _{{e n}} _{T k} [Mathematical formula-see original document] where k = 0, 1,..., P
[Equation 23]

[0066]
, A p × p matrix T defined by the equation (14) and a p × 1 matrix t are generated (S5), and using the generated R ⁻¹ and T, a matrix R ^{− 1} ^T, calculate the ^T T ^{R -1} (S6). Here, when calculating the estimates of the parameters using the equation of second-order approximation, in step S6, the matrix ^R -1 ^T, in addition to T ^{T R -1,} the further matrix ^T T ^{R -1} T calculate. CPU11 then, in accordance with a computer program 20 loaded into the RAM 12, the calculated ^{^{^{R -1, R -1 T, T}}} T R -1, or ^{T T} ^{R -1} ^T (16) or (17) to the equation by substituting the vector _{_{a = [a 1, a 2}} , ..., a p] is a parameter to be determined ^T and _{_{b = [b 1, b 2}} , ..., b p] to compute the ^T (S7).
[0067]
Next, the CPU 11 determines whether or not the calculated vectors a and b have converged in accordance with the computer program 20 loaded in the RAM 12 (S8). If the calculated vectors a and b have not converged (S8: NO), by substituting a time series of vectors a and b and the data _{{x n}} (1) equation, to calculate a new white noise _{{e n}} (S9), the process returns to step S4, vector Steps S4 to S9 are repeated until a and b converge. In step S8, the CPU 11 calculates, for example, the latest vectors a and b and the vectors a ′ and b ′ that are the vectors a and b calculated in the previous loop,
Δ = | aa ′ | + | bb ′ |
Is calculated, and when the value of Δ is smaller than a predetermined value, it is determined that the vectors a and b have converged. If the vectors a and b have converged in step S8 (S8: YES), the CPU 11 calculates the calculated vectors a and b according to the computer program 20 loaded into the RAM 12 by using the estimated values of the parameters of the ARMA model to be obtained. Is determined (S10), the prediction data of the time series {x _n } is calculated by the ARMA model of the equation (1) (S11), and the process ends.
[0068]
Through the above processing, the parameter θ of the ARMA model is estimated, and new data in a time series can be predicted. In the conventional parameter estimation method, the calculation is performed using the equations (10) and (11), and the calculation is performed using the p × p full matrix R, τ, and S. In contrast, in the present invention, by introducing a priori knowledge, the vectors a and b are calculated using R, which is one full matrix of p × p, T, which is a triangular matrix, and unit matrix I. Thus, the vectors a and b, which are the estimated values of the parameters, are more likely to converge. Further, since the calculation is performed using the approximated expression in which the inverse matrix of 2p × 2p is expanded and the higher-order terms are omitted, the expression (11) is used each time the vectors a and b are calculated by the repeated calculation. In the present invention, the calculation of the inverse matrix is performed only once to calculate R ^−1, and the calculation of the inverse matrix is greatly reduced, which is simpler than the conventional method that needs to perform the calculation of the inverse matrix. In this case, the estimated value of the parameter can be calculated, and the calculation time can be reduced. Therefore, it is possible to predict data for a time series in which data could not be predicted in the past, and to apply the present invention to economical time series such as stock prices or sales, so that stock prices or sales can be more reliably calculated than before. And other data can be predicted.
[0069]
In the present embodiment, the data prediction device 1 according to the present invention has been described as having the function of the parameter estimating device according to the present invention. However, the present invention is not limited to this. And a data prediction device that predicts data by using different computers, and inputs and outputs information to each other to estimate parameters and predict data. Further, in the present embodiment, a mode in which data prediction is performed after parameter estimation is performed is shown. However, the present invention is not limited to this, and parameter estimation values are stored, and stored when necessary. The processing for calculating the prediction data by using the estimated value of may be performed.
[0070]
【The invention's effect】
In the first invention, when estimating parameters of the ARMA model when the time series is applied to the ARMA model, theoretical values are used for an autocorrelation function of white noise and a cross-correlation function of the time series and white noise. By further calculating the parameter estimation value based on the approximate expression that expands the inverse matrix included in the expression for calculating the parameter estimation value and omits higher-order terms, Can be estimated.
[0071]
Second, fifth, in the eighth and eleventh invention, stores white noise {e _n}, from a storage white noise and time-series, an estimate of the vectors a and b are parameters of the ARMA model, The matrix is simplified using theoretical values, the inverse matrix included in the calculation formula is expanded, and calculations are performed using an approximation formula that omits higher-order terms. The value of the parameter can be calculated.
[0072]
According to the third, sixth, and ninth aspects, when the calculation is repeated until the estimated value of the parameter converges, the estimated value of the parameter is calculated using a simplified formula using a theoretical value. Compared to the conventional case, the parameter estimation value is more likely to converge, and the parameter value can be reliably estimated, and the calculation time is further reduced.
[0073]
According to the fourth, seventh, and tenth aspects of the present invention, data is predicted using parameters estimated by a method capable of more reliably estimating parameters than in the past, so that data can be obtained even in a time series in which data could not be predicted conventionally. Can be predicted. In addition, the present invention has an excellent effect, for example, by applying the present invention to economical time series such as stock prices or sales, thereby enabling more reliable prediction of data such as stock prices or sales than before.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating a configuration of a data prediction device according to the present invention.
FIG. 2 is a flowchart showing a parameter estimation process and a data prediction process performed by the data prediction device of the present invention.
[Explanation of symbols]
1 Data prediction device (parameter estimation device)
11 CPU (arithmetic unit)
12 RAM (storage unit)
2 Recording medium 20 Computer program

Claims

From a time series composed of a plurality of data, and a white noise having an average value of 0 and a variance of σ _e ² , an autocorrelation function of the time series, an autocorrelation function of the white noise, and the time series and the white A method for estimating parameters when the time series is applied to an (p, p) -order ARMA model using a cross-correlation function with noise,
Let σ _e ^{2 be} the value of the autocorrelation function of the white noise having a lag of 0,
The value of the autocorrelation function of the white noise having a lag other than 0 is set to 0,
Of the cross-correlation function between the time series and the white noise, the value of the cross-correlation function between the data included in the time series and the value of the white noise at a point in time future than the data is set to 0,
An inverse matrix of a 2p × 2p matrix composed of the time-series autocorrelation function and the cross-correlation function is developed using a p × p matrix composed of the autocorrelation function and a p × p matrix composed of the cross-correlation function. And estimating the parameters based on an approximate expression in which higher-order terms are omitted.

In a method for estimating parameters when a time series composed of N data is applied to a (p, p) next ARMA model using a computer including a storage unit and an arithmetic unit,
A time series {x _n : n = 1, 2,..., N} composed of N data is stored in the storage unit,
_{_{_{R k = E [x n x}}} n + k] defined in the _{{x n}} autocorrelation function of _{{R k: k = 0,1,} ..., p} to calculate the at computing unit,
Zero mean and is N values are time series lined white noise _{{e n: n = 1,2,} ..., N} is stored in the storage unit,
The variance σ _e ² of the white noise is calculated by an arithmetic unit,
_{_{_{T k = E [e n x}}} n + k] cross-correlation function defined as _{{e n}} and _{{x n}} in _{{T k: k = 0,1,} ..., p} to calculate the at computing unit,
Matrix a = [a ₁ , a ₂ ,..., A _p ] ^T using _p parameters of the AR process in the ARMA model as matrix elements, and p parameters of the MA process in the ARMA model as matrix elements matrix _{_{b = [b 1, b 2}} , ..., b p] to ^T, the following equation

A parameter estimating method, wherein the calculation is performed by an arithmetic unit based on

The parameters of the ARMA model included in the calculated matrices a and b, and the values of the time series are substituted into the equations of the ARMA model, and a new white noise is calculated by the calculation unit.
Using the calculated new white noise, a new matrix a and a new matrix b are calculated by the calculation unit,
3. The parameter estimation method according to claim 2, wherein the calculation of the matrices a and b is repeated by the calculation unit until the matrices a and b converge.

Using a computer having a storage unit and a calculation unit, in a method of predicting new data to be added to the time series from a time series composed of a plurality of data,
Store the time series in the storage unit,
A parameter when the time series is applied to an ARMA model is calculated by an arithmetic unit using the parameter estimation method according to claim 2 or 3,
A data prediction method, wherein the calculated parameters and the stored values of the time series are substituted into the ARMA model formula, and new data added to the time series is calculated by an arithmetic unit.

In a device for estimating parameters when a time series composed of N data is applied to an (p, p) -order ARMA model,
Means for storing a time series {x _n : n = 1, 2,..., N} composed of N data;
_{_{_{R k = E [x n x}}} n + k] autocorrelation function of _{{x n}} defined by _{{R k: k = 0,1,} ..., p} means for calculating a,
White noise N values average is 0 is arranged at time series _{{e n: n = 1,2,} ..., N} means for storing,
Means for calculating the variance σ _e ² of the white noise;
T _k = _E cross-correlation function [e _{n x n + k]} is the a _{{e n}} defined as _{_{{x n} {T k:}} k = 0,1, ..., p} means for calculating a,
Matrix a = [a ₁ , a ₂ ,..., A _p ] ^T using _p parameters of the AR process in the ARMA model as matrix elements, and p parameters of the MA process in the ARMA model as matrix elements matrix _{_{b = [b 1, b 2}} , ..., b p] to ^T, the following equation

Means for calculating based on the parameter.

Means for calculating a new white noise by substituting the parameters of the ARMA model included in the calculated matrices a and b, and the values of the time series into the expression of the ARMA model,
Means for calculating new matrices a and b using the calculated new white noise;
The parameter estimating apparatus according to claim 5, further comprising: means for repeating calculation of the matrices a and b until the matrices a and b converge.

In a device for predicting new data to be added to the time series from a time series composed of a plurality of data,
Means for storing a time series;
A parameter estimation device according to claim 5 or 6,
Means for calculating a parameter when the time series is applied to an ARMA model from the time series using the parameter estimation device;
Means for substituting the calculated parameters and the stored values of the time series into the ARMA model equation to calculate new data added to the time series.

In a computer program for causing a computer to estimate parameters when a time series composed of N data is applied to an (p, p) -order ARMA model,
From a time series {x _n : n = 1, 2,..., N} composed of N pieces of data, an autocorrelation of {x _n } defined by R _k = E [x _n x _{n + k} ] Function ｛R _k
: A procedure for calculating k = 0, 1,..., P｝;
The computer, the mean value 0 and a N values are arranged in time series white noise _{{e n: n = 1,2,} ..., N} and procedure for calculating the variance sigma _e ² of
A _{_{_{computer, T k = E [e n}}} x n + k] cross-correlation function defined as _{{e n}} and _{{x n}} in _{{T k: k = 0,1,} ..., p} and procedure for calculating the ,
A computer calculates a matrix a = [a ₁ , a ₂ ,..., A _p ] ^T using _p parameters of the AR process in the ARMA model as matrix elements, and a matrix of p parameters of the MA process in the ARMA model. matrix _{_{b = [b 1, b 2}} , ..., b p] as an element of ^T, the following equation

A computer program based on the computer program.

A step of causing a computer to calculate a new white noise by substituting the parameters of the ARMA model included in the calculated matrices a and b, and the value of the time series into the equation of the ARMA model,
Causing the computer to calculate new matrices a and b using the calculated new white noise;
9. The computer program according to claim 8, further comprising: causing the computer to repeat the calculation of the matrices a and b until the matrices a and b converge.

A computer program for causing a computer to predict new data to be added to the time series from a time series composed of a plurality of data,
A procedure for causing a computer to calculate a parameter when the time series is applied to an ARMA model from the stored time series using the computer program according to claim 8 or 9;
Causing the computer to substitute the calculated parameters and the stored values of the time series into the formula of the ARMA model to calculate new data added to the time series. .

In a computer-readable recording medium that stores a computer program for causing a computer to estimate parameters when a time series composed of N data is applied to a (p, p) -order ARMA model,
From a time series {x _n : n = 1, 2,..., N} composed of N pieces of data, an autocorrelation of {x _n } defined by R _k = E [x _n x _{n + k} ] Function ｛R _k
: A procedure for calculating k = 0, 1,..., P｝;
The computer, the mean value 0 and a N values are arranged in time series white noise _{{e n: n = 1,2,} ..., N} and procedure for calculating the variance sigma _e ² of
A _{_{_{computer, T k = E [e n}}} x n + k] cross-correlation function defined as _{{e n}} and _{{x n}} in _{{T k: k = 0,1,} ..., p} and procedure for calculating the ,
A computer calculates a matrix a = [a ₁ , a ₂ ,..., A _p ] ^T using _p parameters of the AR process in the ARMA model as matrix elements, and a matrix of p parameters of the MA process in the ARMA model. matrix _{_{b = [b 1, b 2}} , ..., b p] as an element of ^T, the following equation

A computer-readable recording medium having recorded thereon a computer program including a procedure for performing calculation based on a computer program.