JP2011107782A

JP2011107782A - Volatility prediction system and volatility prediction method

Info

Publication number: JP2011107782A
Application number: JP2009259343A
Authority: JP
Inventors: Kenichi Yoshihara; 健一吉原
Original assignee: Individual
Current assignee: Individual
Priority date: 2009-11-12
Filing date: 2009-11-12
Publication date: 2011-06-02
Anticipated expiration: 2029-11-12
Also published as: JP5543177B2

Abstract

<P>PROBLEM TO BE SOLVED: To determine a confidence interval of the same type of estimated numeric values by using numeric values obtained so far, for a numerical string obtained by continuously observing a natural phenomenon or a social phenomenon. <P>SOLUTION: The volatility prediction system is configured to process numeric values obtained by a prediction start time for a numerical string obtained by continuously observing the numeric values of a target phenomenon, and to apply the bootstrap method of the latest statistical theory, and to create a region (called, confidence band) configured of connecting the upper limit and the lower limit of the confidence interval of values at each point of time of the regression curve of predetermined confidence coefficients, and to input a predicted request time to determine the confidence interval of the prediction value on the point of time. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、目的とする現象の変動数値の将来における予測範囲を、予測前に継続的に観測して得られる数値列から予測する変動数値予測に関する。 The present invention relates to a fluctuation numerical prediction for predicting a future prediction range of a fluctuation numerical value of a target phenomenon from a numerical string obtained by continuously observing before prediction.

自然現象または社会現象を継続的に観測して得られる数値列に対し、現時点までに得られている数値を利用して、同種の数値の指定された時点における予測値を得るために作られている従来の方法は、「得られている数値列はすべて独立に同じ正規分布に従う」という大前提に立った従来の統計理論に基づいている。また、特許文献１には、関連性が推測される現象において他の現象に基づいて予測する方法が開示されている。 Created to obtain a predicted value at a specified point in time for the same kind of numerical value using a numerical value obtained so far for a numerical sequence obtained by continuously observing natural or social phenomena The conventional method is based on the conventional statistical theory based on the major premise that “the obtained numerical sequences independently follow the same normal distribution”. Further, Patent Document 1 discloses a method of predicting a phenomenon whose relevance is estimated based on another phenomenon.

特開２００４−２０５２３３号公報JP 2004-205233 A

しかしながら、現実の問題では、「得られている数値列はすべて独立に同じ正規分布に従う」という大前提が正しいということを確かめる術がない。例えば、金融商品価格が正規分布に従っているか否かを検証することはできず、２００８年のサブプライムローンをきっかけに起きたような経済現象に対応することはできなかった。その上、f(t)の推定量の分布は一般には求められない。 However, in the real problem, there is no way to confirm that the major premise that “the obtained numerical sequences independently follow the same normal distribution” is correct. For example, it was not possible to verify whether the price of a financial product follows a normal distribution, and it was not possible to cope with an economic phenomenon caused by the 2008 subprime loan. In addition, the distribution of estimators of f (t) is generally not determined.

本発明は、自然現象または社会現象を継続的に観測して得られる数値列に対し、現在までに得られている数値を利用して、同種の数値の予測値の信頼区間を求める変動数値予測システムおよびその方法を提供する。 The present invention relates to a numerical sequence obtained by continuously observing a natural phenomenon or a social phenomenon, using a numerical value obtained so far, and obtaining a confidence interval of a predicted value of the same type of numerical value. A system and method are provided.

かかる課題を解決するために、本発明の変動数値予測システムは、目的とする現象の変動数値の将来における予測範囲を、予測前に継続的に観測して得られる数値列から予測する変動数値予測システムであって、目的とする現象の数値を継続的に観測して得られる数値列を、…、x_-m-r、x_-m-r+1、…、x_-1、x₀、x₁、x₂、…とするとき、予測開始時点の値をx₀、その前の値をx_-1、…、そのm+r-1前の値をx_-m-r+1とおいて、

を計算し、元データとしてx_-1、…、x_-mを作成する元データ作成手段と、前記元データ作成手段により作成された元データに統計理論のブートストラップ法を適用して、前記元データの中から無作為復元抽出法でｎ（≦m）個のデータをリサンプルし、このリサンプルの操作をＢ回（Ｂは正の整数）行って得られたＢ組のリサンプルを用いて、予め与えられた信頼係数の回帰関数の各点における信頼区間を算出し、該信頼区間の上限及び下限をそれぞれ結んで回帰関数の信頼帯を求め、得られた回帰関数に予測日時を入力して、得られた信頼帯を使って予測値の信頼区間を算出する信頼区間算出手段とを備え、得られた予測値の信頼区間を変動数値の未来における予測範囲として出力することを特徴とする。 In order to solve such a problem, the fluctuation numerical value prediction system of the present invention predicts the future prediction range of the fluctuation value of a target phenomenon from a numerical sequence obtained by continuously observing before prediction, A numerical sequence obtained by continuously observing a numerical value of a target phenomenon, which is a system, ..., x _-mr , x _{-m-r + 1} , ..., x _-1 , x ₀ , x ₁ , When x ₂ ,..., the prediction start time value is x ₀ , the previous value is x ₋₁ ,..., and the previous m + r−1 value is x _{−m−r + 1} .

And x- ₁ ,..., X _-m as original data, and applying the statistical theory bootstrap method to the original data created by the original data creating means, Resample n (≦ m) data from the data by random restoration extraction method, and use B sets of resamples obtained by performing this resample operation B times (B is a positive integer). Calculate the confidence interval at each point of the regression function with a given confidence coefficient, connect the upper and lower limits of the confidence interval to obtain the confidence band of the regression function, and enter the prediction date and time in the obtained regression function And a confidence interval calculation means for calculating a confidence interval of the predicted value using the obtained confidence band, and outputting the confidence interval of the obtained predicted value as a prediction range in the future of the fluctuation value, To do.

又、目的とする現象の変動数値が継続的に観測され、現時点までに得られた数値列から、未知の定数を含むxのｐ次多項式、f(x) = a₀ + a₁x + a₂x² + … + a_px^pを用いて、将来の前記現象の数値範囲を予測する変動数値予測システムであって、目的とする現象の数値を継続的に観測して得られる数値列を、…、x_-m-r、x_-m-r+1、…、y_-1、y₀、y₁、y₂、…とするとき、予測開始時点の値をy₀、その前の値をy_-1、…、そのm+r-1前の値をy_-m-r+1とおいて、

を計算し、元データとして２次元データ(1,y₁)、(2,y₂)、…、(m,y_m)を生成する手段と、前記元データを用いて未知の定数a₀、a₁、…、a_pの推定値、

を計算する手段と、前記元データの中から無作為復元抽出法でm個のデータ、

をリサンプルし、このリサンプルしたm個のデータを使って未知の定数a₁、a₂、…、a_pの推定値、

を計算する操作をｎ回（ｎは正の整数）繰り返して、ｐ行ｎ列の値、

を作成する手段と、
各行の平均値、

と標準偏差、

とを計算し、これら平均値と標準偏差とを使って、

を求める操作をＢ回（Ｂは正の整数）行って、ｐ行Ｂ列の値、

を作成する手段と、
各行の標準偏差、

を計算して、信頼係数（1-α）に対して、正の数uの整数部分を表す記号［u］を用いて、Z_1,1、 Z_1,2、…、Z_1,Bを大きさの順に並べて、小さい方からαB/2番目のZ_1,αB/2の値をZ_1,α、(1-α/2)B番目のZ_1,(1-α/2)Bの値をZ_1,1-α、標準正規分布の上裾の面積が（1-α/2）になる値をz_1-α/2とおくとき、実数c,dの内小さくない方を表す記号max(c,d)を用いて、

ならば、a₁の信頼区間を、

また上の条件が成り立たない場合は、a₁の信頼区間を、

とし、a₂、a₃、…、a_pの信頼区間、

も同様に計算して、x = (1 + 2+ … + m)/mとおくときa₀の信頼区間を、

および

と決定する手段と、予測に使う関数の下限、上限を、

として、この関数からt日後における予測値の下限及び上限として、

を計算して、その結果を記憶する手段と、上記処理をｑ回（ｑは正の整数）繰り返してその都度結果を記憶し、それぞれ下限及び上限の平均値を計算してt日後における予測値の下限、上限として計算して、t日後における予測値の信頼区間を生成する手段とを有し、得られた予測値の信頼区間を変動数値の未来における予測範囲として出力することを特徴とする。 Moreover, the fluctuation value of the target phenomenon is continuously observed, and the p-order polynomial of x including unknown constants, f (x) = a ₀ + a ₁ x + a, from the numerical sequence obtained up to the present time. ₂ x ² +… + a _p x ^p is a fluctuating numerical prediction system that predicts the numerical range of the phenomenon in the future, and a numerical sequence obtained by continuously observing the numerical value of the target phenomenon , ..., x _-mr , x _{-m-r + 1} , ..., y _-1 , y ₀ , y ₁ , y ₂ , ..., the value at the prediction start time is y ₀ , and the previous value is y _-1 , ..., the value before m + r-1 is y- _{m-r + 1} ,

, A unit for generating two-dimensional data (1, y ₁ ), (2, y ₂ ),..., (M, y _m ) as original data, and an unknown constant a ₀ using the original data, a ₁ , ..., an estimate of a _p ,

And m data by a random restoration extraction method from the original data,

, And using the resampled m pieces of data, the estimated values of unknown constants a ₁ , a ₂ , ..., a _p ,

The operation of calculating n is repeated n times (n is a positive integer), the value of p rows and n columns,

And a means to create
The average value for each row,

And standard deviation,

And using these mean and standard deviation,

Is performed B times (B is a positive integer), the value of p rows and B columns,

And a means to create
Standard deviation for each row,

The calculated relative confidence factor (1-alpha), with the symbol [u] represents an integer portion of a positive number _{_{u, Z 1,1, Z 1,2,}} ..., the Z _{1, B} Arrange in order of size _{, the αB / 2th} Z _{1, αB / 2} values from the smallest to Z _{1, α} , (1-α / 2) Bth Z _{1, (1-α / 2) B} When the value is Z _1,1-α , and the value where the area of the upper tail of the standard normal distribution is (1-α / 2) is z _{1-α / 2} , it represents the lesser of the real numbers c and d Using the symbol max (c, d),

Then the confidence interval for a ₁

If the above condition does not hold, the confidence interval of a ₁ is

And a confidence interval of a ₂ , a ₃ , ..., a _p ,

Similarly, if x = (1 + 2+… + m) / m, the confidence interval of a ₀

and

And the lower and upper limits of the function used for prediction,

As the lower limit and upper limit of the predicted value after t days from this function,

And a means for storing the result, and the above process is repeated q times (q is a positive integer), the result is stored each time, and the average value of the lower limit and the upper limit is calculated, respectively, and the predicted value after t days And a means for generating a confidence interval for the predicted value after t days, and outputting the confidence interval of the obtained predicted value as a prediction range in the future of the fluctuation value .

更に、上記変動数値予測システムをコンピュータ・システムに実現させるプログラムや該プログラムを記憶する記憶媒体、変動数値予測方法も提供する。 Furthermore, a program for causing a computer system to realize the fluctuation numerical value prediction system, a storage medium for storing the program, and a variable numerical value prediction method are also provided.

本発明により、自然現象または社会現象を継続的に観測して得られる数値列に対し、現在までに得られている数値を利用して、同種の数値の予測値の信頼区間を求める変動数値予測システムおよびその方法を提供できる。すなわち、本発明では、f(t)の推定量を効果的に推定できるので、結果として金融商品価格等の予測値の分布を仮定しないで、よい予測値が得られる。 According to the present invention, a numerical sequence obtained by continuously observing a natural phenomenon or a social phenomenon is used to obtain a confidence interval for a predicted value of the same kind of numerical value using a numerical value obtained so far. A system and method thereof can be provided. That is, in the present invention, the estimated amount of f (t) can be estimated effectively, and as a result, a good predicted value can be obtained without assuming a distribution of predicted values such as financial product prices.

本実施例の変動数値予測システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the fluctuation | variation numerical value prediction system of a present Example. 図１のメモリ２４の記憶構成例を示す図である。It is a figure which shows the memory | storage structure example of the memory 24 of FIG. 図１の補助記憶装置２６の記憶構成例を示す図である。It is a figure which shows the memory | storage structure example of the auxiliary storage device 26 of FIG. 本実施例の変動数値予測システムの動作手順例を示すフローチャートである。It is a flowchart which shows the example of an operation | movement procedure of the fluctuation | variation numerical prediction system of a present Example. 本実施例の変動数値予測システムの動作手順例を示すフローチャートである。It is a flowchart which shows the example of an operation | movement procedure of the fluctuation | variation numerical prediction system of a present Example. 本実施例の変動数値予測システムの動作例を説明する模式図である。It is a schematic diagram explaining the operation example of the fluctuation | variation numerical prediction system of a present Example. 本実施例の信頼帯の概念を示す図である。It is a figure which shows the concept of the trust zone of a present Example. 本実施例の変動数値予測の具体的な予測結果を示す図であり、（ａ）は日経平均株価の予測結果、（ｂ）は原油価格の予測結果を示す図である。It is a figure which shows the specific prediction result of the fluctuation | variation numerical value prediction of a present Example, (a) is a prediction result of a Nikkei average stock price, (b) is a figure which shows the prediction result of a crude oil price.

以下、添付図面を参照しながら、本発明の一実施例及び具体的な適用例を詳細に説明する。なお、実施例及び具体的では、好適な例として日経平均株価を例に説明するが、これに限定されない。 Hereinafter, an embodiment and a specific application example of the present invention will be described in detail with reference to the accompanying drawings. In the examples and specific examples, the Nikkei average stock price will be described as a preferred example, but the present invention is not limited to this.

＜本実施例の変動数値予測システムの構成例＞
本実施例による変動数値予測は、通常のコンピュータ・システムを用いて実行している。図１は、本実施例の変動数値予測システムのハードウエアの構成例を示すブロック図である。 <Configuration example of fluctuation numerical value prediction system of this embodiment>
The fluctuation numerical value prediction according to the present embodiment is executed using a normal computer system. FIG. 1 is a block diagram illustrating a hardware configuration example of a fluctuation numerical value prediction system according to the present embodiment.

図１において、処理装置本体２０中にＣＰＵ２２、メモリ２４、各インターフェース２８およびインターフェース２８の１つと接続されているディスク等の補助記憶装置２６を備えている。また、インターフェース２８を介して、入力装置（キーボード等）３０、出力装置（ディスプレイ等）４０と接続されている。本実施例のシステムは、補助記憶装置２６に本実施例の機能を実現するプログラムとして格納され、入力装置３０からの指示により起動されると、メモリ２４上にロードされ、ＣＰＵ２２で実行される。実行結果は、ディスプレイやプリンタ等の出力装置４０に出力される。 In FIG. 1, a processing apparatus main body 20 includes a CPU 22, a memory 24, each interface 28, and an auxiliary storage device 26 such as a disk connected to one of the interfaces 28. Further, an input device (such as a keyboard) 30 and an output device (such as a display) 40 are connected via an interface 28. The system of the present embodiment is stored in the auxiliary storage device 26 as a program that implements the functions of the present embodiment. When activated by an instruction from the input device 30, the system is loaded on the memory 24 and executed by the CPU 22. The execution result is output to an output device 40 such as a display or a printer.

（メモリ２４の記憶構成例）
図２は、本実施例の変動数値予測を実現するために必要とするデータを記憶するメモリ２４の記憶構成例を示している。なお、図２には本実施例に特有のデータのみを示し汎用のデータや変数パラメータ（例えば、フローチャートのｉやｊなど）は省かれている。 (Storage configuration example of the memory 24)
FIG. 2 shows a storage configuration example of the memory 24 that stores data necessary for realizing the fluctuation numerical value prediction of this embodiment. FIG. 2 shows only data peculiar to this embodiment, and general-purpose data and variable parameters (for example, i and j in the flowchart) are omitted.

２４ａは、入力装置３０などからオペレータにより指示された設定パラメータが記憶される領域である。かかる設定パラメータはデータの計算やフローチャートでの繰り返し回数などとして使用される。本実施例の設定パラメータには、回帰曲線の次数ｐ（ｐ＝１の場合は、以下の具体例で示す回帰直線を表す）、元データの数ｍ、元データ算出に使用するデータ数ｒ、本実施例の復元抽出法のリサンプル回数ｎ、復元抽出法の抽出回数Ｂ、信頼係数（１ーα）、予測までの日数ｔ、予測値の算出回数ｑが含まれる。 Reference numeral 24a denotes an area for storing setting parameters instructed by an operator from the input device 30 or the like. Such setting parameters are used as data calculation or the number of repetitions in the flowchart. The setting parameters of this example include the order p of the regression curve (when p = 1, the regression line shown in the following specific example represents the regression line), the number m of original data, the number r of data used to calculate the original data, The number of resampling times n of the restoration extraction method of the present embodiment, the number of times of extraction B of the restoration extraction method, the reliability coefficient (1−α), the number of days t until the prediction, and the number of times q of the prediction value calculation are included.

２４ｂは、元データｙ_i（ｉ＝１〜ｍ）が記憶される領域である。２４ｃは、元データｙ_i（ｉ＝１〜ｍ）から算出される回帰係数ｂの推定量ｂ^を記憶する領域である。２４ｄは、リサンプルの平均値ｘ、ｙ、回帰係数ｂ_j（ｊ＝１〜ｎ）を記憶する領域である。２４ｅは、回帰係数ｂ_j（ｊ＝１〜ｎ）の平均値ｂ_k、標準偏差Ｓ_k、Ｚ_k（ｋ＝１〜Ｂ）＝（ｎ（ｂ_kーｂ^）／Ｓ_k）^1/2を記憶する領域である。２４ｆは、平均値ｂ_k（ｋ＝１〜Ｂ）の標準偏差Ｖを記憶する領域である。２４ｇは、回帰係数ｂの上限値ｂ_max、下限値ｂ_minを記憶する領域である。２４ｈは、回帰係数ａの上限値ａ_max、下限値ａ_minを記憶する領域である。なお、２４ｇ及び２４ｈは後述の具体例であるｐ＝１の回帰直線ｙ＝ａ＋ｂｘの場合を示しており、ｐ次回帰曲線に一般化すると、２４ｇにはａ₁、ａ₂、…、ａ_pの上下限値が記憶され、２４ｈにはａ₀の上下限値が記憶される。２４ｉは、上限・下限の回帰曲線（直線）から日数ｔに基づいて算出された上限値Ｆ_max,h、下限値_min,h（ｈ＝１〜ｑ）を記憶する領域である。２４ｊは、予測結果として、ｈ個の上限値を平均した上限予測値、ｈ個の下限値を平均した下限予測値を記憶する領域である。 Reference numeral 24b denotes an area in which original data y _i (i = 1 to m) is stored. An area 24c stores an estimated amount b ^ of the regression coefficient b calculated from the original data y _i (i = 1 to m). Reference numeral 24d denotes an area for storing the average value x, y of the resample and the regression coefficient b _j (j = 1 to n). 24e is an average value b _{k of} regression coefficients b _j (j = 1 to n), standard deviation S _k , Z _k (k = 1 to B) = (n (b _k −b ^) / S _k ) ^{1 / This} is an area for storing ² . Reference numeral 24f denotes an area for storing the standard deviation V of the average value b _k (k = 1 to B). 24g is an area for storing an upper limit value b _max and a lower limit value b _min of the regression coefficient b. 24h is an area for storing an upper limit value a _max and a lower limit value a _min of the regression coefficient a. Incidentally, 24 g and 24h shows the case of the regression line y = a + bx of p = 1 is a specific example described later, is generalized to the p-th order regression curve, the _{_{24g a 1, a 2, ...}} , a p And the upper and lower limit values of a ₀ are stored in 24h. 24i is an area for storing the upper limit value F _{max, h} and the lower limit value _{min, h} (h = 1 to q) calculated based on the number of days t from the upper and lower limit regression curves (straight lines). 24j is an area for storing an upper limit predicted value obtained by averaging h upper limit values and a lower limit predicted value obtained by averaging h lower limit values as prediction results.

（補助記憶装置２６の記憶構成例）
図３は、本実施例の変動数値予測を実現するために必要とするデータを記憶する補助記憶装置２６の記憶構成例を示している。なお、図３には本実施例に特有のデータ及びプログラムのみを示し汎用のデータやプログラム（例えば、ＯＳやＢＩＯＳなど）は省かれている。 (Storage configuration example of the auxiliary storage device 26)
FIG. 3 shows a storage configuration example of the auxiliary storage device 26 that stores data necessary for realizing the fluctuation numerical value prediction of this embodiment. FIG. 3 shows only data and programs peculiar to this embodiment, and general-purpose data and programs (for example, OS and BIOS) are omitted.

２６ｂは、予測に使用される過去の変動数値（例えば、日経平均株価や為替、原油価格など）を蓄積するデータベースの領域である。２６ｃは、本実施例の変動数値予測を実現する変動数値予測プログラム（図４Ａ及び図４Ｂに従って後述）を記憶する領域である。２６ｄ〜２６ｈには、変動数値予測プログラムで使用する計算ルーチンが記憶される。２６ｄは、元データ作成ルーチンを記憶する領域である。２６ｅは、元データｙ_i（ｉ＝１〜ｍ）から回帰係数ｂの推定量ｂ^を算出するルーチンを記憶する領域である。２６ｆは、回帰係数ｂ_j（ｊ＝１〜ｎ）を作成する作成ルーチンを記憶する領域である。２６ｇは、回帰係数ｂ_j（ｊ＝１〜ｎ）の平均値ｂ_k、標準偏差Ｓ_kから、Ｚ_k（ｋ＝１〜Ｂ）＝（ｎ（ｂ_k−ｂ^）／Ｓ_k）^1/2を算出するルーチンを記憶する領域である。２６ｇは、平均値ｂ_k（ｋ＝１〜Ｂ）の標準偏差Ｖを記憶する領域である。２４ｇは、回帰係数ａ，ｂの信頼区間（上限値、下限値）を算出するルーチンを記憶する領域である。なお、２６ｇは後述の具体例であるｐ＝１の回帰直線ｙ＝ａ＋ｂｘの場合を示しており、ｐ次回帰曲線に一般化すると、２６ｇはａ₀、ａ₁、ａ₂、…、ａ_pの上下限値を算出する。２６ｈは、上限・下限の回帰曲線（直線）から日数ｔに基づいて算出された上限値Ｆ_max,h、下限値_min,h（ｈ＝１〜ｑ）、更に、予測結果として、ｈ個の上限値を平均した上限予測値、ｈ個の下限値を平均した下限予測値を算出するルーチンを記憶する領域である。 Reference numeral 26b denotes an area of a database that accumulates past fluctuation numerical values (for example, Nikkei average stock price, exchange rate, crude oil price, etc.) used for prediction. 26c is an area for storing a fluctuation numerical value prediction program (which will be described later with reference to FIGS. 4A and 4B) for realizing the fluctuation numerical value prediction of this embodiment. In 26d to 26h, a calculation routine used in the fluctuation numerical value prediction program is stored. An area 26d stores an original data creation routine. An area 26e stores a routine for calculating an estimated amount b ^ of the regression coefficient b from the original data y _i (i = 1 to m). An area 26f stores a creation routine for creating a regression coefficient b _j (j = 1 to n). 26 g represents Z _k (k = 1 to B) = (n (b _k −b ^) / S _k ) ¹ from the average value b _k and standard deviation S _k of the regression coefficients b _j (j = 1 to n). ^This area stores a routine for calculating ^{/ 2} . 26 g is an area for storing the standard deviation V of the average value b _k (k = 1 to B). An area 24g stores a routine for calculating confidence intervals (upper limit value and lower limit value) of the regression coefficients a and b. In addition, 26g shows the case of the regression line y = a + bx of p = 1 which is a specific example described later. When generalized to a p-order regression curve, 26g is a ₀ , a ₁ , a ₂ _,. The upper and lower limit values are calculated. 26h is an upper limit value F _{max, h} calculated from the upper limit / lower limit regression curve (straight line) based on the number of days t, a lower limit value _{min, h} (h = 1 to q), and h This is an area for storing a routine for calculating an upper limit predicted value obtained by averaging the upper limit values and a lower limit predicted value obtained by averaging h lower limit values.

＜本実施例の変動数値予測システムの動作例＞
図４Ａ及び図４Ｂは、本実施例の変動数値予測システムの動作手順例を示すフローチャートである。以下、図４Ａ及び図４Ｂに従って、具体的な動作手順と一般化された動作手順を関連付けて説明する。 <Operation Example of Fluctuation Numerical Prediction System of This Example>
4A and 4B are flowcharts illustrating an example of an operation procedure of the fluctuation numerical value prediction system according to this embodiment. Hereinafter, according to FIGS. 4A and 4B, a specific operation procedure and a generalized operation procedure will be described in association with each other.

（ステップＳ１０）オペレータや外部装置からの指示により、ＣＰＵ２２は本変動数値予測システムの初期化とパラメータの設定を行う。設定パラメータは、図２で上述したように、回帰曲線の次数ｐ、元データの数ｍ、元データ算出に使用するデータ数ｒ、本実施例の復元抽出法のリサンプル回数ｎ、復元抽出法の抽出回数Ｂ、信頼係数（１ーα）、予測までの日数ｔ、予測値の算出回数ｑを含み、メモリ２４に記憶する。 (Step S10) In response to an instruction from an operator or an external device, the CPU 22 initializes the variable numerical value prediction system and sets parameters. As described above with reference to FIG. 2, the setting parameters include the order p of the regression curve, the number m of the original data, the number r of data used for calculating the original data, the number n of resamples of the restoration extraction method of the present embodiment, and the restoration extraction method. The number of times of extraction B, the reliability coefficient (1−α), the number of days t until prediction, and the number of times q of calculation of the predicted value are stored in the memory 24.

本例では、目的とする現象の変動数値が継続的に観測され、現時点までに得られた数値列から、未知の定数を含むxのｐ次多項式、f(x) = a₀ + a₁x + a₂x² + … + a_px^pを用いて、将来の前記現象の数値範囲を予測する。以下、図４Ａ及び図４Ｂでは、具体例で示すようなｐ＝１の場合、すなわち回帰直線の場合を例に示し、その一般形、すなわちｐ≧２の場合（以下では回帰曲線という）を各ステップにおいて対応して示す。 In this example, the fluctuation value of the target phenomenon is continuously observed, and the p-degree polynomial of x including unknown constants, f (x) = a ₀ + a ₁ x + a ₂ x ² +… + a _p x ^p is used to predict the numerical range of the phenomenon in the future. Hereinafter, in FIG. 4A and FIG. 4B, the case of p = 1 as shown in a specific example, that is, the case of a regression line is shown as an example, and the general form thereof, that is, the case of p ≧ 2 (hereinafter referred to as a regression curve) is shown. The corresponding steps are shown.

（ステップＳ２０）ＣＰＵ２２は、目的とする現象の数値を継続的に観測して得られる数値列を、…、x_-m-r、x_-m-r+1、…、y_-1、y₀、y₁、y₂、…とするとき、予測開始時点の値をy₀、その前の値をy_-1、…、そのm+r-1前の値をy_-m-r+1とおいて、

を計算し、元データとして２次元データ(1,y₁)、(2,y₂)、…、(m,y_m)を生成して、メモリ２４に記憶する。。 (Step S20) CPU 22 is a numeric string obtained by continuously monitoring the value of the phenomenon of _{_{interest, ..., x -mr, x -m}} -r + 1, ..., y -1, y 0, y _{When 1} , y ₂ ,..., The prediction start time value is y ₀ , the previous value is y ₋₁ , ..., and the previous m + r−1 value is y _{−m−r + 1} ,

, Two-dimensional data (1, y ₁ ), (2, y ₂ ),..., (M, y _m ) are generated as original data and stored in the memory 24. .

（ステップＳ３０）ＣＰＵ２２は、元データを用いて、回帰直線ｙ＝ａ＋ｂｘの回帰係数ｂの推定量ｂ^を算出して、メモリ２４に記憶する。 (Step S30) The CPU 22 calculates an estimated amount b ^ of the regression coefficient b of the regression line y = a + bx using the original data, and stores it in the memory 24.

一般の回帰曲線の場合、ＣＰＵ２２は、前記元データを用いて未知の定数a₀、a₁、…、a_pの推定値、

を計算して、メモリ２４に記憶する。 For general regression curve, CPU 22, the unknown constants a _0, using the original data a _1, ..., estimated value of a _p,

Is stored in the memory 24.

（ステップＳ４０−Ｓ５０）ＣＰＵ２２は、復元抽出法による各リサンプルの平均値ｘ_j、ｙ_j、回帰直線の回帰係数ｂの推定量ｂ^_jを算出して、メモリ２４に記憶する。かかるリサンプルをステップＳ４０に戻ってｎ回繰り返す（Ｊ＝１〜ｎ）。 (Steps S40 to S50) The CPU 22 calculates the average values x _j and y _j of each resample by the restoration extraction method and the estimated amount b ^ _j of the regression coefficient b of the regression line, and stores them in the memory 24. Such re-sampling returns to step S40 and is repeated n times (J = 1 to n).

一般の回帰曲線の場合、ＣＰＵ２２は、前記元データの中から無作為復元抽出法でm個のデータ、

を作成する。 In the case of a general regression curve, the CPU 22 selects m pieces of data from the original data by a random restoration extraction method,

Create

（ステップＳ６０−Ｓ７０）ＣＰＵ２２は、ｂ^_j（Ｊ＝１〜ｎ）の平均値をｂk、標準偏差をＳkとして、Ｚ_k＝（ｎ（ｂ_k−ｂ^）／Ｓ_k）^1/2を算出して、メモリ２４に記憶する。かかる復元抽出法による計算をステップＳ４０に戻ってＢ回繰り返す（ｋ＝１〜Ｂ）。 (Steps S60-S70) The CPU 22 sets Z _k = (n (b _k −b ^) / S _k ) ^1/2 _where b _{k is} the average value of b ^ _j (J = 1 to n) and Sk is the standard deviation. Is calculated and stored in the memory 24. The calculation by the restoration extraction method returns to step S40 and is repeated B times (k = 1 to B).

一般の回帰曲線の場合、ＣＰＵ２２は、ｐ行ｎ列の値から各行の平均値、

と標準偏差、

とを計算し、これら平均値と標準偏差とを使って、

を作成する。 In the case of a general regression curve, the CPU 22 calculates the average value of each row from the value of p rows and n columns,

And standard deviation,

And using these mean and standard deviation,

Create

（ステップＳ８０）ＣＰＵ２２は、ｂ_k（ｋ＝１〜Ｂ）の標準偏差Ｖに基づいて、回帰係数ｂの目標とする信頼係数（１ーα）の信頼区間（ｂ_max,h、ｂ_min,h）を求める。 (Step S80) Based on the standard deviation V of b _k (k = 1 to B), the CPU 22 sets the confidence interval (b _{max, h} , b _{min, h} ).

一般の回帰曲線の場合、ＣＰＵ２２は、ｐ行Ｂ列の値から各行の標準偏差、

ならば、a₁の信頼区間を、

また上の条件が成り立たない場合は、a₁の信頼区間を、

とし、a₂、a₃、…、a_pの信頼区間、

も同様に計算する。 In the case of a general regression curve, the CPU 22 calculates the standard deviation of each row from the value of p row B column,

Then the confidence interval for a ₁

If the above condition does not hold, the confidence interval of a ₁ is

And a confidence interval of a ₂ , a ₃ , ..., a _p ,

Calculate in the same way.

（ステップＳ９０）ＣＰＵ２２は、ステップＳ８０で求めた信頼区間（ｂ_max,h、ｂ_min,h）から、回帰係数ａの信頼係数（１ーα）の信頼区間（ａ_max,h、ａ_min,h）を求める。 (Step S90) CPU 22, the confidence interval calculated in step _{S80 (b max, h, b} min, h) from the confidence interval (a _max of confidence factors of the regression coefficients a (1 over _alpha), h, a _{min, h} ).

一般の回帰曲線の場合、ＣＰＵ２２は、ステップＳ８０で求めたa₁、a₂、a₃、…、a_pの信頼区間から、x = (1 + 2+ … + m)/mとおくときa₀の信頼区間を、

および

と決定する。 For general regression curve, CPU 22 is, a ₁ obtained in step _{_{S80, a 2, a 3,}} ..., the confidence interval of _{a p, x = (1 +} 2+ ... + m) when placing a / m a A confidence interval of ₀

and

And decide.

（ステップＳ１００）ＣＰＵ２２は、上限・下限の回帰直線（ｙ＝ａ＋ｂｘ）から、ｔ日後の上限予測値Ｆ_max,h、Ｆ_min,hを算出して、メモリ２４に記憶する。 (Step S <b> 100) The CPU 22 calculates the upper limit predicted values F _{max, h} and F _{min, h} after t days from the upper / lower regression line (y = a + bx) and stores them in the memory 24.

一般の回帰曲線の場合、ＣＰＵ２２は、予測に使う関数の下限、上限を、

を計算して、その結果をメモリ２４に記憶する。 In the case of a general regression curve, the CPU 22 sets the lower limit and upper limit of the function used for prediction,

And the result is stored in the memory 24.

（ステップＳ１１０）ＣＰＵ２２は、上記ステップＳ４０からＳ１００の処理をステップＳ４０に戻ってｑ回繰り返す（ｈ＝１〜ｑ）。一般の回帰曲線の場合も、ＣＰＵ２２は、上記ステップＳ４０からＳ１００の処理をｑ回（ｑは正の整数）繰り返してその都度結果をメモリ２４に記憶する。 (Step S110) The CPU 22 returns to Step S40 and repeats the processes from Step S40 to S100 q times (h = 1 to q). Also in the case of a general regression curve, the CPU 22 repeats the processing of steps S40 to S100 q times (q is a positive integer) and stores the result in the memory 24 each time.

（ステップＳ１２０）ＣＰＵ２２は、上限値Ｆ_max,h（ｈ＝１〜ｑ）の平均値を予測上限値とし、下限値Ｆ_min,h（ｈ＝１〜ｑ）の平均値を予測下限値として、出力装置４０に出力（表示あるいは印刷）する。一般の回帰曲線の場合も、ＣＰＵ２２は、それぞれ下限及び上限の平均値を計算してt日後における予測値の下限、上限として計算して、t日後における予測値の信頼区間を生成し、得られた予測値の信頼区間を変動数値の未来における予測範囲として出力装置４０に出力する。 (Step S120) The CPU 22 sets the average value of the upper limit value F _{max, h} (h = 1 to q) as the predicted upper limit value, and sets the average value of the lower limit value F _{min, h} (h = 1 to q) as the predicted lower limit value. And output (display or print) to the output device 40. Also in the case of a general regression curve, the CPU 22 calculates the average value of the lower limit and the upper limit, respectively, calculates the lower limit and the upper limit of the predicted value after t days, and generates a confidence interval for the predicted value after t days. The confidence interval of the predicted value is output to the output device 40 as the future prediction range of the fluctuation value.

なお、上記フローチャートの各ステップは説明の便宜上に分けられており、それらのステップは統合されても、更に分割されてもよい。 Note that each step of the flowchart is divided for convenience of description, and these steps may be integrated or further divided.

＜日経平均株価の予測の具体例＞
図５を参照しながら、平成21年9月25日までの日経平均株価を使って、４週間後の10月23日の日経平均株価を予測する方法を考える。この場合の、設定パラメータは、p=1, m=16, r=3, n=7, B=120, α= 0.05, t=20, q=10が設定される。 <Examples of Nikkei Stock Average Forecast>
With reference to FIG. 5, a method of predicting the Nikkei Stock Average on October 23, four weeks later, using the Nikkei Stock Average until September 25, 2009 will be considered. In this case, the setting parameters are set to p = 1, m = 16, r = 3, n = 7, B = 120, α = 0.05, t = 20, and q = 10.

(1) 最初に、9月25日から営業日18日(=m+r-1=16+3-1)遡った8月28日以降9月25日までの日経平均株価を日順に、

のように並べ（図５の５１参照）、これを使って連続する３日間(r=3)の移動平均y_iを順次計算し、それぞれに番号xをつける。これを表示すると次のようになる。 (1) First, the Nikkei Stock Average from August 28 to September 25, which is 18 days after September 25 (= m + r-1 = 16 + 3-1)

(See 51 in FIG. 5), and using this, the moving average y _i for three consecutive days (r = 3) is sequentially calculated, and a number x is assigned to each. This is displayed as follows.

この組｛(x_i,y_i)、i=1, 2,…, 16｝（図５の５２参照）を元データと考える。ここで、xの平均値 = (1 + 2 + … + 16)/16 = 8.5 である、Yの平均値 = （10518.91 + 10434.35 + … + 10230.15)/16 = 10353.70 となる。 This set {(x _i , y _i ), i = 1, 2,..., 16} (see 52 in FIG. 5) is considered as original data. Here, the average value of x = (1 + 2 +++ 16) /16=8.5, and the average value of Y = (10518.91 + 10434.35 + ... + 10230.15) /16=10353.70.

この２次元データから得られる回帰直線（y=a+bx）の回帰係数bの推定量b^を計算すると、

となる。 When calculating the estimated amount b ^ of the regression coefficient b of the regression line (y = a + bx) obtained from this two-dimensional data,

It becomes.

（第一段階）
(2) 次に、元データから、復元抽出法により、無作為に元データの個数と同数のリサンプルをとる（図５の５３ー１参照）。その結果は、

となる。 (the first stage)
(2) Next, the same number of resamples as the number of original data are taken from the original data by the restoration extraction method (see 53-1 in FIG. 5). The result is

It becomes.

これを使って、(1)の場合と同様に、このリサンプルに対する各平均値x、 Y 及び回帰係数b₁ ^*を計算すると、
平均値x₁ = 9.4375 , 平均値Y₁ = 10342.61, b₁ ^* = -6.98052となる。 Using this, as in the case of (1), calculating each average value x, Y and regression coefficient b ₁ ^* for this resample,
The average value x ₁ = 9.4375, the average value Y ₁ = 10342.61, b ₁ ^* = -6.98052.

(3) （2）を７回(n=7)繰り返すと、各リサンプルに対してそれぞれ１個の回帰係数が得られる（図５の５３ー１〜５３ー７参照）。それらをb₁ ^*、…、b₇ ^*とおくと
b₁ ^* = -6.98052, b₂ ^* = -3.04553, b₃ ^* = -1.91733, b₄ ^* = 0.095334,
b₅ ^* = -3.334066, b₆ ^* = -4.17861, b₇ ^* = -1.17185 となる。 (3) If (2) is repeated seven times (n = 7), one regression coefficient is obtained for each resample (see 53-1 to 53-7 in FIG. 5). If we put them as b ₁ ^* , ..., b ₇ ^*
b ₁ ^* = -6.98052, b ₂ ^* = -3.04553, b ₃ ^* = -1.91733, b ₄ ^* = 0.095334,
b ₅ ^* = -3.334066, b ₆ ^* = -4.17861, b ₇ ^* = -1.17185.

(4) （3）のb₁ ^*, … b₇ ^*の平均をb^* ₁、標準偏差をS^* ₁とし、Z₁=（7（b^* ₁-b^)/S^* ₁）^1/2を計算する（図５の５４ー１参照）と、
b^* ₁ = -1.6458, S^* ₁ = 3.480352, Z₁ = -0.51755 となる。 (4) The average of b ₁ ^* ,… b ₇ ^* in (3) is b ^* ₁ and the standard deviation is S ^* _1, and Z ₁ = (7 (b ^* ₁ -b ^) / S ^* ₁ ) ^{1 / 2} is calculated (see 54-1 in FIG. 5),
b ^* ₁ = -1.6458, S ^* ₁ = 3.480352, Z ₁ = -0.51755.

(5) （3）及び（4）を120回(B=120)繰り返し（図５の５４ー１〜５４ー１２０参照）、その都度結果を記録する。結果のうち、｛b^* ₁, b^* ₂,…,b^* ₁₂₀｝、｛Z₁, Z₂,…,Z₁₂₀｝を使う。 (5) Repeat (3) and (4) 120 times (B = 120) (see 54-1 to 54-120 in FIG. 5), and record the results each time. Of ^{_{^{consequence, {b * 1, b *}}} 2, ..., b * 120}, {Z 1, Z 2, ..., Z 120} use.

(5-1) ｛b^* ₁, b^* ₂,…,b^* ₁₂₀｝の標準偏差を求める（図５の５５参照）と、

となる。 (5-1) Obtain the standard deviation of {b ^* ₁ , b ^* ₂ ,..., B ^* ₁₂₀ } (see 55 in FIG. 5).

It becomes.

(5-2) 回帰係数bの信頼係数0.95(1-α=1-0.05)の信頼区間を求めるので、上、下裾とも0.025の点で切り捨てるため、｛Z₁, Z₂,…,Z₁₂₀｝を大きさの順に並べたとき、小さい方から３番目(=0.025B=0.025×120=3)のものZ₍₃₎と、１１７番目（0.975B=0.975×120=117)のZ₍₁₁₇₎のものを選ぶ。今の場合は、
Z₍₃₎ = -3.24476 , Z₍₁₁₇₎ = 3.13832 となる。 (5-2) Since a confidence interval of 0.95 (1-α = 1-0.05) confidence coefficient for regression coefficient b is calculated, {Z ₁ , Z ₂ , ..., Z ₁₂₀ } are arranged in order of size, the third (= 0.025B = 0.025 × 120 = 3) Z ₍₃₎ and the 117th (0.975B = 0.975 × 120 = 117) Z ₍ Select ₁₁₇₎ . In this case,
Z ₍₃₎ = -3.24476 and Z ₍₁₁₇₎ = 3.13832.

(6) (6-1) |Z(3)|≦1.96 および |Z(117)|≦1.96 が成り立てば、回帰係数bの信頼係数0.95の信頼区間は、
［b^-Z₍₁₁₇₎・V、b^-Z(3)・V］と考える。 (6) (6-1) If | Z (3) | ≦ 1.96 and | Z (117) | ≦ 1.96, then the confidence interval of 0.95 for the regression coefficient b is
Consider [b ^ -Z ₍₁₁₇₎ · V, b ^ -Z (3) · V].

(6-2) (6-1)の条件が成り立たない場合は、回帰係数bの信頼係数0.95の信頼区間は、
［b^-1.96V、b^-1.96V］と考える。 (6-2) If the condition of (6-1) does not hold, the confidence interval of 0.95 confidence coefficient for regression coefficient b is
Consider [b ^ -1.96V, b ^ -1.96V].

本例の場合は(6-1)の条件は成り立たないので、(6-2)を使って回帰係数bの信頼係数0.95の信頼区間は、
［b^-1.96V、b^-1.96V］
=［-0.96501 - 1.96×2.111998 , -0.96501 + 1.96 × 2.111998 ]
=［-5.00452, 3.07451］となる。 In the case of this example, the condition of (6-1) does not hold, so using (6-2) the confidence interval of 0.95 confidence coefficient for regression coefficient b is
[B ^ -1.96V, b ^ -1.96V]
= [-0.96501-1.96 × 2.111998, -0.96501 + 1.96 × 2.111998]
= [-5.00452, 3.07451].

(7) （6）より回帰係数aの信頼係数0.95の信頼区間は、
［y - 3.07451x, Y - (-5.00452)x］
=［10353.70 - 3.07451 × 8.5, 10353.70 + 5.00452 × 8.5]
=［10327.56667, 10396.23842］となる。 (7) From (6), the confidence interval of 0.95 confidence coefficient of regression coefficient a is
[Y-3.07451x, Y-(-5.00452) x]
= [10353.70-3.07451 x 8.5, 10353.70 + 5.00452 x 8.5]
= [10327.56667, 10396.23842].

(8) （6）及び（7）より得られる回帰直線は、
下限：y = f(x) = 10327.56667 - 5.00452x
上限：y = f(x) = 10396.23842 + 3.07451x
となる。 (8) The regression line obtained from (6) and (7) is
Lower limit: y = f (x) = 10327.56667-5.00452x
Upper limit: y = f (x) = 10396.23842 + 3.07451x
It becomes.

かかる上限と下限の回帰曲線に囲まれた領域を信頼帯と呼び、図６のその概念図を示した。図６で６１が上限の回帰曲線、６２が下限の回帰曲線である。この回帰曲線の間が信頼帯であり、横軸の予想対象日６３が決まると、信頼区間（予測上限値、予測下限値）が決定する。 The area surrounded by the upper and lower regression curves is called a confidence band, and the conceptual diagram of FIG. 6 is shown. In FIG. 6, 61 is an upper limit regression curve, and 62 is a lower limit regression curve. Between the regression curves is a confidence band, and when the prediction target date 63 on the horizontal axis is determined, the confidence interval (prediction upper limit value, prediction lower limit value) is determined.

(9) （8）を使って、営業日t日後の日経平均株価の下限の予測値F(t)と上限の予測値F(t)は、それぞれ、
下限：F(t) = f(16+t) = 10327.56667 - 5.00452(16+t)
上限：F(t) = f(16+t) = 10396.23842 + 3.07451(16+t)
と考える。 (9) Using (8), the lower limit forecast value F (t) and the upper limit forecast value F (t) of the Nikkei Stock Average after business day t
Lower limit: F (t) = f (16 + t) = 10327.56667-5.00452 (16 + t)
Upper limit: F (t) = f (16 + t) = 10396.23842 + 3.07451 (16 + t)
I think.

(10) （9）を使い、営業日２０日後(t=20)の予測値を考える場合は、
下限：F(20) = 10327.56667 - 5.00452(16+20) = 10147.40
上限：F(20) = 10396.23842 + 3.07451(16+20) = 10506.92
である。 (10) When using (9) and considering the forecast value after 20 business days (t = 20),
Lower limit: F (20) = 10327.56667-5.00452 (16 + 20) = 10147.40
Upper limit: F (20) = 10396.23842 + 3.07451 (16 + 20) = 10506.92
It is.

(11) 以上（2）〜（10）の結果をストックする（図５の５６ー１参照）。 (11) Stock the results of (2) to (10) above (see 56-1 in FIG. 5).

（第二段階）
(12) 第一段階の（2）〜（10）を計１０回(q=10)繰り返し、得られた結果をストックする（図５の５６ー１〜５６ー１０参照）。 (Second stage)
(12) Repeat (2) to (10) of the first step 10 times (q = 10) in total, and stock the obtained results (see 56-1 to 56-10 in FIG. 5).

(13) （12）で得られた結果の下限だけの平均、上限だけの平均をそれぞれ求めて、それらを目的の予測値の下限、上限とする（図５の５７参照）。 (13) Obtain the average of only the lower limit and the average of the upper limit of the results obtained in (12), respectively, and use them as the lower and upper limits of the target predicted value (see 57 in FIG. 5).

なお、本発明の最初の部分のｒ個の値の平均は推定値のバラツキを抑えるため、これをＢ回行うのはリサンプルを通して元データから得られる回帰係数の分布状態、従って「真」の回帰係数を推測するため、最後にｑ回の平均をとるのは、バラツキを抑えるためもあるが、予測値のチェックのためもある。また、「営業日t日後」のtは1、2、… 20、… などなんでもよいが、（2）〜（13）を実行中は変えることはできない。また、回帰係数bの推定を信頼係数0.95で行った場合、最終の予測値の信頼区間の信頼係数は大体0.90ほどとなる。 Note that the average of the r values in the first part of the present invention suppresses the variation of the estimated value, so that this is performed B times because the distribution state of the regression coefficient obtained from the original data through resampling, and therefore “true”. In order to estimate the regression coefficient, the average of q times is used for the purpose of checking the prediction value as well as suppressing variation. In addition, “t” after “business day t” may be anything such as 1, 2,..., 20,..., But cannot be changed while (2) to (13) are being executed. Further, when the regression coefficient b is estimated with a confidence coefficient of 0.95, the confidence coefficient of the confidence interval of the final predicted value is about 0.90.

＜本実施例による変動数値予測の例＞
図７に、本実施例による変動数値予測を「日経平均株価」（図７の（ａ）参照）、「原油価格」（図７の（ｂ）参照）に適用した例の予測結果と実際の数値を比較した表を示す。なお、共に、上記具体例で示した２０日（３週間）後の予測値であり、「日経平均株価」は２００９年の３月から８月の１月置きの予測、「原油価格」は２００８年の３月から１０月の１月半置きの予測である。 <Example of fluctuation numerical prediction according to this embodiment>
FIG. 7 shows an example of prediction results and actual values obtained by applying the numerical fluctuation prediction according to the present embodiment to “Nikkei average stock price” (see FIG. 7A) and “crude oil price” (see FIG. 7B). A table comparing the numerical values is shown. In addition, both are forecast values after 20 days (3 weeks) shown in the above specific example, “Nikkei Stock Average” is forecast every March to August of 2009, and “Crude Oil Price” is 2008 This is a forecast for every half of January from March to October of the year.

これらの結果から、本発明の変動数値予測は、上述のように、回帰係数bの推定を信頼係数0.95で行った場合、最終の予測値の信頼区間の信頼係数は大体0.90ほどとなるとの予想も当っており、自然現象または社会現象を継続的に観測して得られる数値列に対し、現在までに得られている数値を利用して、同種の数値の予測値の信頼区間を求めることができる。すなわち、本発明では、f(t)の推定量を効果的に推定できるので、結果として金融商品価格等の予測値の分布を仮定しないで、よい予測値が得られる。 From these results, the fluctuation value prediction of the present invention predicts that the reliability coefficient of the confidence interval of the final predicted value is about 0.90 when the regression coefficient b is estimated with the reliability coefficient 0.95 as described above. It is possible to obtain confidence intervals for predicted values of the same kind of numerical values using numerical values obtained so far for numerical sequences obtained by continuously observing natural or social phenomena. it can. That is, in the present invention, the estimated amount of f (t) can be estimated effectively, and as a result, a good predicted value can be obtained without assuming a distribution of predicted values such as financial product prices.

Claims

A fluctuation numerical prediction system that predicts a future prediction range of a fluctuation value of a target phenomenon from a numerical sequence obtained by continuously observing before prediction,
The numerical sequence obtained by continuously observing the numerical value of the target phenomenon is as follows: x _-mr , x _{-m-r + 1} , ..., x _-1 , x ₀ , x ₁ , x ₂ , ... When the prediction start point value is x ₀ , the previous value is x ₋₁ ,..., The m + r−1 previous value is x _{−m−r + 1} ,

And original data creating means for creating x ₋₁ ,..., X _-m as original data,
A statistical theory bootstrap method is applied to the original data created by the original data creating means, and n (≦ m) data are resampled from the original data by a random restoration extraction method. Using the B sets of resamples obtained by performing the sample operation B times (B is a positive integer), a confidence interval at each point of the regression function of a given confidence coefficient is calculated. A confidence interval calculation means for calculating a confidence band of a regression function by connecting an upper limit and a lower limit, inputting a prediction date and time to the obtained regression function, and calculating a confidence interval of the predicted value using the obtained confidence band; Prepared,
A fluctuation numerical prediction system, wherein the confidence interval of the obtained prediction value is output as a prediction range in the future of the fluctuation numerical value.

The numerical value of fluctuation of the target phenomenon is continuously observed, and from the numerical sequence obtained so far,
A p-th order polynomial of x containing unknown constants,
f (x) = a ₀ + a ₁ x + a ₂ x ² +… + a _p x ^p , a fluctuating numerical prediction system that predicts the numerical range of the phenomenon in the future,
The numerical sequence obtained numerical phenomena of interest continuously observed _{_{by, ..., x -mr, x -m}} -r + 1, ..., y -1, y 0, y 1, y 2, ... a When the prediction start time value is y ₀ , the previous value is y ₋₁ ,..., And the previous m + r−1 value is y _{−m−r + 1} .

To generate two-dimensional data (1, y ₁ ), (2, y ₂ ), ..., (m, y _m ) as original data;
Estimated values of unknown constants a ₀ , a ₁ ,..., A _p using the original data,

A means of calculating
M data from the original data by random restoration extraction method,

And a means to create
The average value for each row,

And standard deviation,

And using these mean and standard deviation,

, B times (B is a positive integer)

And a means to create
Standard deviation for each row,

Then the confidence interval for a ₁

If the above condition does not hold, the confidence interval of a ₁ is

And a confidence interval of a ₂ , a ₃ , ..., a _p ,

Is calculated in the same way,
a confidence interval of a ₀ when you put the x = (1 + 2+ ... + m) / m,

and

Means to determine,
The lower and upper limits of the function used for prediction

Means for calculating and storing the result;
Repeat the above process q times (q is a positive integer), store the results each time, calculate the lower limit and upper limit average values respectively, calculate the lower limit and upper limit of the predicted value after t days, and predict after t days Means for generating confidence intervals for the values,
A fluctuation numerical prediction system, wherein the confidence interval of the obtained prediction value is output as a prediction range in the future of the fluctuation numerical value.

A storage medium storing a program capable of causing a computer system to realize the fluctuating numerical value prediction system according to claim 1.

The program which can make a computer system implement | achieve the fluctuation | variation numerical value prediction system of Claim 1 or 2.

The numerical value of fluctuation of the target phenomenon is continuously observed, and from the numerical sequence obtained so far,
A p-th order polynomial of x containing unknown constants,
f (x) = a ₀ + a ₁ x + a ₂ x ² +… + a _p x ^p , a fluctuating numerical prediction method for predicting the numerical range of the phenomenon in the future,
The numerical sequence obtained by continuously observing the numerical value of the target phenomenon is, ..., x _-mr , x _{-m-r + 1} , ..., y _-1 , y ₀ , y ₁ , y ₂ , ... When the prediction start time value is y ₀ , the previous value is y ₋₁ ,..., And the previous m + r−1 value is y _{−m−r + 1} .

Calculating two-dimensional data (1, y ₁ ), (2, y ₂ ),..., (M, y _m ) as original data;
Estimated values of unknown constants a ₀ , a ₁ ,..., A _p using the original data,

A step of calculating
M data from the original data by random restoration extraction method,

The steps of creating
The average value for each row,

And standard deviation,

And using these mean and standard deviation,

The steps of creating
Standard deviation for each row,

Then, the confidence interval of a1 is

If the above condition does not hold, the confidence interval of a ₁ is

And a confidence interval of a ₂ , a ₃ , ..., a _p ,

and

A step of determining
The lower and upper limits of the function used for prediction

Calculating and storing the result;
Repeat the above process q times (q is a positive integer), store the results each time, calculate the average value of the lower limit and upper limit respectively, calculate the lower limit and upper limit of the predicted value after t days, and predict after t days Generating a confidence interval for the value;
A variation numerical prediction method, wherein the confidence interval of the obtained prediction value is output as a prediction range in the future of the variation numerical value.