JP2001175735A

JP2001175735A - Device, system and method for predicting time series and storage medium

Info

Publication number: JP2001175735A
Application number: JP35980999A
Authority: JP
Inventors: Hiroto Yoshii; 裕人吉井
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1999-12-17
Filing date: 1999-12-17
Publication date: 2001-06-29

Abstract

PROBLEM TO BE SOLVED: To predict future data by fetching reproducibility and future predictability from time series having strong nonlinearity. SOLUTION: A characteristic extraction processing 12 acquires the characteristic of a time series concerned from past time series data 11. Regression tree preparation processing 13 prepares a regression tree 14 on the basis of acquired characteristic quantity. Meanwhile, characteristic extraction processing 16 acquires characteristic quantity from the current time series data 15 by using the same algorithm as that of the processing 12. Future prediction processing 17 calculates a future predicted value 18 by tracing the regression tree 14 by using the characteristic quantity acquired by the processing 16.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は例えば株式市場の値
動きといった、非線形性の強い時系列を予測する時系列
予測装置及びシステム及び方法及び記憶媒体に関するも
のである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a time series prediction apparatus, system, method, and storage medium for predicting a time series having a strong nonlinearity such as a stock market price movement.

【０００２】[0002]

【従来の技術】一般に時系列といえば、物理現象から得
られる変量を時刻順に並べて得られる時系列を意味して
いた。そして、その解析の中心的な方法は線形回帰分析
と呼ばれる方法であった。この方法は過去のデータから
未来のデータを予測する際に、その関係が線形関数で近
似できるという理論に基づいている。さらに、最近は経
済現象も時系列の一種だと解釈して、色々な経済指数を
予測するアルゴリズムも登場してきた。2. Description of the Related Art In general, a time series means a time series obtained by arranging variables obtained from physical phenomena in chronological order. And the central method of the analysis was a method called linear regression analysis. This method is based on the theory that when predicting future data from past data, the relationship can be approximated by a linear function. In addition, recently, algorithms that interpret economic phenomena as a kind of time series and predict various economic indices have appeared.

【０００３】現在、例えば「適応的モデルによる経済時
系列分析」（杉原敏夫著、工学図書株式会社ISBN4-76
92-0369-1 C3058）にあるように、経済時系列の予測を
カルマンフィルターやニューラルネットといったアルゴ
リズムを用いて行う方法が提案されている。これらの方
法は時系列のある程度の非線形性にも対応でき、例えば
“日本国内における電力需要の時系列”といったデータ
に対しては非常に良い予測性能を発揮していた。[0003] At present, for example, "Economic time series analysis by adaptive model" (Toshio Sugihara, Engineering Book Co., Ltd. ISBN4-76)
92-0369-1 C3058), there has been proposed a method of predicting an economic time series using an algorithm such as a Kalman filter or a neural network. These methods can cope with a certain degree of nonlinearity in the time series, and have demonstrated very good prediction performance for data such as "time series of power demand in Japan".

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、従来の
技術によれば、例えば株式市場の個々の銘柄の値動きと
いったような非線形性の非常に強い時系列には、予測性
能が著しく落ちてしまうという問題が存在した。However, according to the conventional technique, the prediction performance is significantly reduced in a time series having a very strong nonlinearity such as a price movement of each stock in a stock market. There was.

【０００５】経済現象の中に現れてくる時系列は、経済
システムそのものが複雑なこともあり、単純な構造を持
ったものは稀である。現在、その多くは全くランダムな
時系列過程だという扱いで確率的解析が行われている。
ところが、一見ランダムに見れる時系列に何らかの再現
性があって、それを抽出することが出来得るならば、そ
の方法は非常に大きな価値があるわけである。[0005] The time series that appears in economic phenomena is that the economic system itself is complicated and rarely has a simple structure. At present, probabilistic analysis is being performed, treating most of them as completely random time series processes.
However, if the seemingly random time series has some reproducibility and it can be extracted, the method is of great value.

【０００６】本発明は上記の問題に鑑みてなされたもの
であり、その目的とするところは、非線形性の強い時系
列から再現性・未来予測性を取り出し、未来のデータを
予測可能とすることにある。SUMMARY OF THE INVENTION The present invention has been made in view of the above problems, and an object thereof is to extract reproducibility and future predictability from a time series having a strong nonlinearity so that future data can be predicted. It is in.

【０００７】[0007]

【課題を解決するための手段】上記の目的を達成するた
めの本発明の一態様による時系列予測装置は例えば以下
の構成を備える。すなわち、過去の時系列データから当
該時系列の特徴を取得する特徴取得手段と、前記特徴取
得手段で取得された特徴量を元に回帰木を作成する作成
手段と、前記特徴取得手段と同じアルゴリズムを用いて
現在の時系列データから特徴量を取得する現時系列特徴
取得手段と前記現時系列特徴取得手段で取得された特徴
量と、前記作成手段で作成された回帰木とを用いて将来
の予測値を求める予測手段とを備える。Means for Solving the Problems A time series prediction apparatus according to an aspect of the present invention for achieving the above object has, for example, the following configuration. That is, a feature acquiring unit that acquires a feature of the time series from past time-series data, a creating unit that creates a regression tree based on the feature amount acquired by the feature acquiring unit, and an algorithm that is the same as the feature acquiring unit. The present time series feature acquisition means for acquiring a feature quantity from the current time series data using the feature quantity acquired by the current time series feature acquisition means, and the future prediction using the regression tree created by the creation means Prediction means for obtaining a value.

【０００８】また、上記の目的を達成するための本発明
の態様による時系列予測システムは、上記時系列予測装
置と、前記時系列予測装置によって取得された予測値を
他の情報処理装置へ配信する配信手段とを備える。A time series prediction system according to an aspect of the present invention for achieving the above object distributes the above time series prediction apparatus and a prediction value obtained by the time series prediction apparatus to another information processing apparatus. And distribution means for performing the distribution.

【０００９】[0009]

【発明の実施の形態】以下、添付の図面に基づいて本発
明の好適な一実施形態を説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A preferred embodiment of the present invention will be described below with reference to the accompanying drawings.

【００１０】図１は実施形態による時系列予測システム
を最もよく表す図である。本システムは学習フェーズと
予測フェーズの２段階のフェーズからなっている。学習
フェーズにおいて、１１は過去の時系列データ、１２は
過去の時系列データ１１から特徴量を抽出する特徴抽出
処理、１３は特徴抽出処理１２より得られた特徴量を用
いて回帰木を作成する回帰木作成処理、１４は回帰木作
成処理１３で得られた「回帰木」である。学習フェーズ
におけるインプットは過去の時系列データ１１で、アウ
トプットは回帰木１４となる。FIG. 1 is a diagram best representing a time-series prediction system according to an embodiment. This system has two phases, a learning phase and a prediction phase. In the learning phase, reference numeral 11 denotes past time-series data, 12 denotes a feature extraction process for extracting a feature amount from the past time-series data 11, and 13 denotes a regression tree using the feature amount obtained from the feature extraction process 12. The regression tree creation process 14 is the “regression tree” obtained in the regression tree creation process 13. The input in the learning phase is the past time-series data 11, and the output is the regression tree 14.

【００１１】予測フェーズにおいて、１５は現在の時系
列データ、１６は現在の時系列データ１５より特徴量を
算出する現時系列特徴抽出処理、１７は現特徴量抽出処
理１６スと回帰木１４とを用いて将来の時系列値を予測
する将来予測処理１７、１８は将来予測処理１７より得
られた予測値である。予測フェーズにおけるインプット
は現在の時系列データ１５で、アウトプットは予測値１
８となる。In the prediction phase, 15 is current time series data, 16 is current time series feature extraction processing for calculating feature values from the current time series data 15, and 17 is current feature value extraction processing 16 and regression tree 14. The future prediction processes 17 and 18 for predicting a future time-series value by using these are prediction values obtained from the future prediction process 17. The input in the prediction phase is the current time-series data 15, and the output is the predicted value 1
It becomes 8.

【００１２】図１において重要なことは、一般に週去の
時系列データは非常に膨大になることである。よって、
特徴抽出処理１２より得られた特徴量が回帰木作成処理
１３に入力される際、非常に大きな回数のループが回る
ことになる。What is important in FIG. 1 is that the time series data of the past week is generally very large. Therefore,
When the feature amount obtained from the feature extraction process 12 is input to the regression tree creation process 13, a very large number of loops are performed.

【００１３】また、特徴抽出処理１２と現時系列特徴抽
出処理１６は全く同じアルゴリズムを使用している。図
２を用いてこのアルゴリズムを詳しく説明する。The feature extraction process 12 and the current time series feature extraction process 16 use exactly the same algorithm. This algorithm will be described in detail with reference to FIG.

【００１４】図２の上に描かれているグラフはいわゆる
株価の“ローソク足”グラフである。図では２０日分の
株価変動の履歴がグラフとして描かれている。グラフに
は、始値、終値、高値、安値の情報が盛り込まれてい
る。本実施形態が試みようとしている時系列予測とは、
例えば図２の上の“ローソク足”グラフが与えられた
時、明日の株価がどのようになるかを予測することにな
る。以下、株価変動を予測するシステムを実施形態とし
て本発明を説明していく。なお、もちろん時系列データ
としては日足データだけではなく、週足、年足と時間間
隔の大きな時系列データにも本発明は適用できる。The graph depicted at the top of FIG. 2 is a so-called "candlestick" graph of stock prices. In the figure, the history of the stock price fluctuation for 20 days is drawn as a graph. The graph includes information on the opening price, closing price, high price, and low price. The time series prediction that this embodiment is trying to perform is:
For example, given the "candlestick" graph at the top of FIG. 2, it will predict what the stock price will be tomorrow. Hereinafter, the present invention will be described using a system for predicting stock price fluctuations as an embodiment. Of course, the present invention can be applied not only to daily data but also to time-series data having a large time interval between weekly and annual data.

【００１５】本実施形態では、例えばある年度のある銘
柄の第１日目から第２０日目までの２０日間の始値、終
値、高値、安値を第１の時系列データセット、次の２０
日間（第２日目から第２１日目まで）の始値、終値、高
値、安値を第２の時系列データセットというように、時
系列データセットを決定する。そして、各時系列データ
セットを規格化することにより複数銘柄の全ての時系列
データセットを同じ時系列モデルとして使用する。In the present embodiment, for example, the opening price, closing price, high price, and low price of a certain stock in a certain year from the first day to the twentieth day are described in a first time-series data set,
The time series data set is determined such that the opening price, the closing price, the high price, and the low price of the day (from the second day to the 21st day) are referred to as a second time series data set. Then, all the time series data sets of a plurality of brands are used as the same time series model by normalizing each time series data set.

【００１６】まず、規格化について説明する。株式市場
には多くの銘柄が存在し、その値はまちまちである。し
かし、株価の銘柄それぞれに対して本実施形態の回帰木
を作成していたのでは、学習データが慢性的に不足して
しまう。よって、全ての銘柄を規格化して、同じ時系列
モデルであるという前提で学習を進めていく。First, the normalization will be described. There are many stocks in the stock market, and their values vary. However, if the regression tree according to the present embodiment is created for each stock price, learning data is chronically insufficient. Therefore, all brands are standardized, and learning proceeds on the premise that they are the same time-series model.

【００１７】規格化としては、規格化する時系列データ
セット（ターゲットとしているグラフ）の最終日の終値
を基準（１．０）として、当該時系列データセットの各
値段を相対値段に変換する。すなわち、各時系列データ
セット毎に、当該時系列データセットの最終日の終値で
過去２０日分の始値、終値、高値、安値の各々を割って
相対値段に変換する。すると、時系列パターンの特徴量
としては、２０×４＝８０個の実数値が得られることに
なる。For normalization, each price of the time-series data set is converted into a relative price with the closing price of the last day of the time-series data set to be standardized (the target graph) as a reference (1.0). That is, for each time-series data set, the opening price, the closing price, the high price, and the low price for the past 20 days are divided by the closing price on the last day of the time-series data set, and converted into relative prices. Then, as the feature amount of the time-series pattern, 20 × 4 = 80 real numerical values are obtained.

【００１８】本実施形態では、学習フェーズにおいて、
上記の規格化された時系列データセットを用いて、後述
する手順で回帰木を作成する。そして、予測フェーズで
は、所望の銘柄の今日を最終日とする過去２０日間の規
格化された時系列データセット（現在の時系列データ１
５）を用いてその回帰木をたどり、当該所望の銘柄につ
いて、明日の株価の今日の終値に対する相対値段を予想
する。In the present embodiment, in the learning phase,
Using the standardized time-series data set, a regression tree is created in a procedure described later. Then, in the prediction phase, a standardized time-series data set (the current time-series data 1
Tracing the regression tree using 5), and predicting the relative price of tomorrow's stock price to today's closing price for the desired stock.

【００１９】このように、規格化によって、全ての銘柄
の全ての期間が学習データとして使用できることにな
り、十分大量の学習データが確保できるようになる。上
記の例を、統計の言葉を用いて言えば、「８０次元の特
徴空間で記述される説明変数／１次元の被説明変数の組
が大量に学習データとして与えられた時の回帰問題を解
く課題」という問題の図式化ができることになる。As described above, by standardization, all periods of all brands can be used as learning data, and a sufficiently large amount of learning data can be secured. The above example is described in terms of statistics, "Solving a regression problem when a large set of explanatory variables / one-dimensional explained variables described in an 80-dimensional feature space is given as learning data. The problem of "the problem" can be schematized.

【００２０】上記の例では始値、終値、高値、安値の４
つの値を用いたが、出来高や季節（日付）なども特徴量
を算出するための値として使用することもできる。ま
た、上記の例では、今日（＝ターゲットとしているグラ
フの最終日）の終値を基準値にしたが、最終日から遡っ
て任意の日の任意の値を基準値としてもよい。また、日
経平均株価や住宅着工件数といった統計的な経済データ
や、景気一致指数といった主観的な経済データを用いて
特徴量を算出してもよい。In the above example, the open price, the close price, the high price, and the low price
Although the two values are used, the trading volume, season (date), and the like can also be used as values for calculating the feature amount. Further, in the above example, the closing price of today (= the last day of the targeted graph) is used as the reference value, but any value on any day retroactively from the last day may be used as the reference value. Further, the feature value may be calculated using statistical economic data such as the Nikkei Stock Average or the number of housing starts, or subjective economic data such as the economic coincidence index.

【００２１】また、上記の例では、もともとの値段を基
準値となる値段で割ることによって相対値段を導いてい
たが、それの自然対数を取るなどの非線形処理を施し
て、特徴量を算出してもよい。また、例えば株価チャー
ト解析でよく用いられる“２５日移動平均”などのよう
な、複数の日付にまたがるデータを変換することによっ
て導きだされる新しい特徴量を用いてもよい。Further, in the above example, the relative price was derived by dividing the original price by the price serving as the reference value, but a non-linear process such as taking a natural logarithm thereof is performed to calculate the feature amount. You may. Further, a new feature amount derived by converting data over a plurality of dates, such as “25 day moving average” often used in stock price chart analysis, may be used.

【００２２】次に図１の回帰木１４について、図３を用
いて説明する。図３は本実施形態による回帰木の例を示
す図である。回帰木とは木構造を持った予測システムの
ことである。図３にある通り、回帰木は“ノード”と呼
ばれる“節”とそれらを結ぶ“枝”で構成されている。
例えば、図３の回帰木には１５個のノードと１４個の枝
があることになる。特に回帰木の大元のノード（図３の
０番のノード）を“ルートノード”と呼ぶ。図３におい
て、枝を持つノードは必ず２つの枝を出しているが、こ
れはいわゆる“２分木”と呼ばれる回帰木であり、一般
に枝の本数は何本でもよい。Next, the regression tree 14 in FIG. 1 will be described with reference to FIG. FIG. 3 is a diagram illustrating an example of a regression tree according to the present embodiment. A regression tree is a prediction system with a tree structure. As shown in FIG. 3, the regression tree is composed of "nodes" called "nodes" and "branches" connecting them.
For example, the regression tree of FIG. 3 has 15 nodes and 14 branches. In particular, the origin node (node 0 in FIG. 3) of the regression tree is called a “root node”. In FIG. 3, although a node having a branch always outputs two branches, this is a regression tree called a so-called "binary tree", and in general, the number of branches may be any.

【００２３】枝でつながれたノードには親子関係があ
り、例えば、「ルートノードの子ノードは１番と６番の
ノードであり、また、１番と６番のノードの親がルート
ノードである」という。子ノードを持たないノードは、
リーフノードと呼ぶ。なお、リーフノードの特別な例と
してヌルノードがある。なお、ヌルノードの定義は後で
詳しく述べる。また、２分木の回帰木にはヌルノードは
存在し得ない。Nodes connected by branches have a parent-child relationship. For example, "child nodes of the root node are the first and sixth nodes, and the parent of the first and sixth nodes is the root node. " Nodes without child nodes are
Call it a leaf node. A special example of a leaf node is a null node. The definition of the null node will be described later in detail. In addition, a null node cannot exist in a regression tree of a binary tree.

【００２４】リーフノード以外のノードを“インターナ
ルノード’と呼ぶ。インターナルノードには、枝分かれ
をするための条件式がある。例えば図２の株式データ予
測を例にすると、ルートノードでは「２日前の始値が今
日の終値に対して０．８５以上ならば１番のノード、そ
れ以外なら６番のノード」というような条件になる。こ
の子ノードに枝分かれするための条件を“ノード分岐条
件”と呼ぶ。週去の時系列データを元に、ノード分岐条
件を作成していくのが学習フェーズであり、ノード分岐
条件に従って現在の時系列データを分類していくのが予
測フェーズとなる。Nodes other than leaf nodes are called "internal nodes." The internal nodes have conditional expressions for branching. For example, in the case of stock data prediction in FIG. If the opening price of the previous day is 0.85 or more of the closing price of the current day, the first node, otherwise, the sixth node ". The condition for branching to the child node is called a “node branch condition”. The learning phase is to create node branch conditions based on the time series data of the last week, and the prediction phase is to classify the current time series data according to the node branch conditions.

【００２５】学習フェーズにおいては、ノード分岐条件
を作っていくと同時に、過去の時系列データそのものも
分類されていく。すなわち、規格化された時系列データ
セットを単位として、インターナルノードへの分割が行
われていく。ルートノードでは全ての過去のデータが残
っているが、インターナルノードを作成していくに従っ
て、どんどん条件が厳しくなり、残る時系列データセッ
トは少なくなっていく。そして、新しく出来た子ノード
に全く過去の時系列データセットがなくなると、そのノ
ードをヌルノードとする。リーフノードとは、ある条件
を満たすノードで、その条件が満たされるとそれ以上子
ノードは作らない。この条件のことを“リーフノード成
立条件”と呼ぶ。リーフノード成立条件としては、「残
っている過去のデータがある数より小さくなる」とか
「残っている過去のデータの予測値（本実施形態では、
各時系列データセットの最終日の翌日における当該銘柄
の実際の株価）の分散がある値より小さくなる」などの
条件が適用される。In the learning phase, past time-series data itself is classified at the same time as node branch conditions are created. That is, division into internal nodes is performed in units of a normalized time-series data set. Although all past data remains in the root node, as the internal node is created, the conditions become more and more strict and the remaining time-series data set decreases. Then, when there is no past time-series data set in the newly created child node, the node is set as a null node. A leaf node is a node that satisfies a certain condition. When the condition is satisfied, no further child nodes are created. This condition is called "leaf node establishment condition". The conditions for establishing a leaf node include “remaining past data becomes smaller than a certain number” or “predicted value of remaining past data (in the present embodiment,
The variance of the actual stock price of the stock on the day after the last day of each time-series data set becomes smaller than a certain value. "

【００２６】学習フェーズにおいて、ヌルノード以外の
任意のノードには過去のデータが残っている。あるノー
ドに残っている過去のデータの予測値の相加平均・相乗
平均・中央値などの代表値をそのノードの予測値とす
る。特にルートノードに付随する予測値は、過去の時系
列データ全ての予測値の代表値（平均）となる。ヌルノ
ードに付随する予測値は、親ノードに付随する予測値と
することによって、全てのノードに予測値が割り振られ
ることになる。例えば図３の回帰木では、ノードの右側
に書かれている数字がこのノードに付随する予測値を示
す。In the learning phase, past data remains at any node other than the null node. A representative value such as an arithmetic mean, a geometric mean, or a median of the predicted values of the past data remaining in a certain node is set as the predicted value of the node. In particular, the predicted value associated with the root node is a representative value (average) of the predicted values of all past time-series data. By setting the predicted value associated with the null node as the predicted value associated with the parent node, the predicted value is allocated to all nodes. For example, in the regression tree of FIG. 3, a number written on the right side of a node indicates a predicted value associated with this node.

【００２７】予測フェーズにおいては、それぞれのノー
ド分岐条件に基づいてルートノードからノードをたどっ
ていくことになる。この時、たどっていったノードに付
随する予測値が、現在の時系列データの予測値となる。
例えば、現在の時系列データを図３の回帰木を使ってた
どった時、０番１番３番４番という結果になれば、予測
値は１．２／１．５／１．３／１．１ということにな
る。結局、現在の時系列データの未来値として、たどっ
た階層分の予測値がえられることになる。In the prediction phase, nodes are traced from the root node based on each node branch condition. At this time, the predicted value associated with the traced node becomes the predicted value of the current time-series data.
For example, when the current time-series data is traced using the regression tree of FIG. 3, if the result is 0, 1, 3, 4, the predicted value is 1.2 / 1.5 / 1.3 / 1. .1. As a result, as the future value of the current time-series data, the predicted value of the traced hierarchy is obtained.

【００２８】一般に回帰木の階層が深くなれば深くなる
ほど、この予測値は精度が高くなる。従って、上記の例
では、一番深い階層における予測値である１．１を当該
時系列データに対する予測値とすればよい。しかしなが
ら、階層がある深さを過ぎると、かえって予測値が不正
確になることがある。この現象は、過去の時系列データ
の“過学習”と呼ばれる現象である。このことを防止す
るために、「最後からｎ番目のノードの予測値を現在の
時系列データの予測値とする」というようなことをする
こともある。この処理はいわゆる“回帰木の技刈り”と
呼ばれる処理であるが、ターゲットとしている時系列パ
ターンが本質的にランダムな現象を含む場合、特に有効
となる。In general, the deeper the hierarchy of the regression tree, the higher the accuracy of this predicted value. Therefore, in the above example, 1.1, which is the predicted value in the deepest hierarchy, may be used as the predicted value for the time-series data. However, beyond a certain depth in the hierarchy, the predicted values may be inaccurate. This phenomenon is a phenomenon called “over-learning” of past time-series data. In order to prevent this, there may be a case where “the predicted value of the nth node from the end is used as the predicted value of the current time-series data”. This processing is a so-called “regression tree cutting”, and is particularly effective when the target time-series pattern includes an essentially random phenomenon.

【００２９】次に図１の回帰木作成処理１３について、
図４のフローチャートを用いて説明する。回帰木作成処
理１３においては、回帰木のノードを再帰的プログラム
という方法で作成する。例えば図３の回帰木でいえば、
ノードに割り振られた番号の順でノードが作成されてい
く。回帰木作成処理１３には“当該ノード”(curentnod
e）という概念があり、そのノードを中心にフローチャ
ートは回る。Next, the regression tree creation processing 13 in FIG.
This will be described with reference to the flowchart of FIG. In the regression tree creation processing 13, nodes of the regression tree are created by a method called a recursive program. For example, in the regression tree of FIG.
Nodes are created in the order of the numbers assigned to the nodes. The “relevant node” (curentnod
There is the concept of e), and the flowchart turns around that node.

【００３０】まず、ルートノードを当該ノードとする
（ステップＳ４０１）。次に、当該ノードの分岐条件の
決定を行う（ステップＳ４０２）。なお、分岐条件の決
定方法については後に詳しく説明する。分岐条件が決定
されたら、子ノード作成のループ（ステップＳ４０３）
によって、その分岐条件に従って当該ノードに属する過
去の時系列パターンを分割していく。例えば、回帰木が
２分木だったら、当該ノードで残っている過去の時系列
パターンが２つに分割されることになる。First, the root node is set as the node concerned (step S401). Next, the branch condition of the node is determined (step S402). The method for determining the branch condition will be described later in detail. When the branch condition is determined, a loop for creating a child node (step S403)
Divides the past time-series pattern belonging to the node according to the branch condition. For example, if the regression tree is a binary tree, the past time-series pattern remaining at the node is divided into two.

【００３１】ステップＳ４０４以下はループ処理とな
る。まず、ステップＳ４０４で当該ノードの次の子ノー
ドを当該ノードとする。但し、ループの最初では、一番
目の子ノードを当該ノードとする。ステップＳ４０４の
処理は、上記の過去の時系列データの分割の処理も含
む。その結果、過去の時系列データが全く残らなけれ
ば、すなわち、当該ノードがヌルノードであれば（ステ
ップＳ４０５）ヌルノードの確定（ステップＳ４０６）
を行い、ループの終了判定（ステップＳ４０９）に進
む。The process from step S404 is a loop process. First, in step S404, the next child node of the node is set as the node. However, at the beginning of the loop, the first child node is the node. The process of step S404 includes the above-described process of dividing the past time-series data. As a result, if no past time-series data remains, that is, if the node is a null node (step S405), a null node is determined (step S406).
And the flow proceeds to loop end determination (step S409).

【００３２】当該ノードがヌルノードでない場合で、リ
ーフノード成立条件を満たす場合は（ステップＳ４０
７）、リーフノードの確定（ステップＳ４０８）を行
い、ループの終了判定（ステップＳ４０９）に進む。当
該ノードがヌルノードでもリーフノードでもない場合、
当該ノードを親ノードとして、更に深く子ノードを作っ
ていく。つまり、当該ノードのノード分岐条件の決定
（ステップＳ４０２）に戻る。If the node is not a null node and the condition for establishing a leaf node is satisfied (step S40)
7), a leaf node is determined (step S408), and the process proceeds to loop end determination (step S409). If the node is neither a null node nor a leaf node,
With this node as a parent node, child nodes are created more deeply. That is, the process returns to the determination of the node branch condition of the node (step S402).

【００３３】ステップＳ４０９のループの終了判定で
は、当該ノードの親ノードに接続されている全ての子ノ
ードについてステップＳ４０４からＳ４０８の処理を実
行した場合にループの終了とする。例えば、本実施形態
のように２分木であれば、当該ループは２回実行される
こととなる。以上のようにして子ノードを作成するルー
プ（ステップＳ４０４〜Ｓ４０９）が終了すると、親ノ
ードが存在するかどうかチェックする（ステップＳ４１
０）。もし、親ノードがあれば、そのノードを当該ノー
ドとして（ステップＳ４１１）からステップＳ４０４へ
戻り、次の子ノードを当該ノードとする。ここで注意す
べきなのは、子ノード作成のループは終了しているの
で、当該ノードは「子ノード作成ループ」の中のノード
からみれば、親ノードに移っていることである。つま
り、ステップＳ４１０では「子ノード作成ループ」の中
のノードから見れば、親ノードの親ノードがあるかどう
かの判定を行うことになる。よって、ノード作成が進ん
でいき、ルートノードの全ての子ノードが作成される
と、ステップＳ４１０の親ノード存在判定でルートノー
ドの親ノードは存在しないので、ステップＳ４１２の回
帰木作成終了になり、プログラムは終了する。In the loop end determination of step S409, the loop ends when the processing of steps S404 to S408 has been executed for all child nodes connected to the parent node of the node. For example, in the case of a binary tree as in the present embodiment, the loop is executed twice. When the loop for creating the child node (steps S404 to S409) ends as described above, it is checked whether or not the parent node exists (step S41).
0). If there is a parent node, the node is set as the node (step S411) and the process returns to step S404 to set the next child node as the node. It should be noted here that since the loop for creating the child node has been completed, the node has moved to the parent node when viewed from the nodes in the “child node creation loop”. That is, in step S410, from the viewpoint of the nodes in the “child node creation loop”, it is determined whether or not there is a parent node of the parent node. Therefore, the node creation proceeds, and when all the child nodes of the root node are created, the parent node of the root node does not exist in the parent node existence determination in step S410, and the regression tree creation in step S412 is completed. The program ends.

【００３４】次に図４のノード分岐条件の決定（ステッ
プＳ４０２）について、図５を使って説明する。前述し
たように、ノード分岐条件は、例えば「最後から２日前
の始値が最後の日の終値に対して０．８５以上ならば左
のノード、それ以外なら右のノード」というふうにな
る。回帰木が２分木の場合、一般的には、このノード分
岐条件は当該ノードに残っている過去の時系列データの
集合を２つに分けるものならば、なんでもよい（ｋ分木
の場合は、ｋ分割する）。過去の時系列データがｎ次元
の特徴量で記述されるならば、ｎ次元の空間を分割する
ものなら、なんでもよいことになる。ノード分岐条件を
ニューラルネットを使って作成しようとすれば、この分
割を実現する面はｎ次元中の超曲面となる。Next, the determination of the node branch condition (step S402) in FIG. 4 will be described with reference to FIG. As described above, the node branch condition is, for example, "the left node if the opening price two days before the last is 0.85 or more of the closing price of the last day, otherwise the right node". When the regression tree is a binary tree, in general, this node branch condition may be anything as long as the set of past time-series data remaining at the node is divided into two (in the case of a k-ary tree, , Divided by k). If the past time-series data is described by n-dimensional feature amounts, anything may be used as long as it divides the n-dimensional space. If a node branch condition is to be created using a neural network, the surface that realizes this division is a hypersurface in n dimensions.

【００３５】しかしながら、個々のノードでニューラル
ネットを使って学習させるのは非常にコストがかかる。
よって、通常は超平面を使用して分割を行う。また、超
平面で当該ノードに残っている過去の時糸列データを分
割するといっても、特徴量の次元が大きくなってくる
と、超平面の法線ベクトルの自由度は急激に上がってき
て、学習コストが大きくなってくる。However, it is very costly to train each node using a neural network.
Therefore, division is usually performed using a hyperplane. Even if the past thread string data remaining at the node is divided on the hyperplane, the degree of freedom of the normal vector of the hyperplane sharply increases as the dimension of the feature increases. , The learning cost will increase.

【００３６】そこで、過去の時系列データが非常に多
く、特徴量の次元が大きい場合は、特徴量の軸に垂直な
超平面に限って、ノード分岐条件（超平面）を探索する
ことがよく行われる。その様子を図に示したのが図５で
ある。Therefore, when the past time-series data is very large and the dimension of the feature is large, it is often the case that the node branch condition (hyperplane) is searched for only the hyperplane perpendicular to the axis of the feature. Done. FIG. 5 shows this state.

【００３７】５１は当該ノードを示し、５２と５３はそ
れぞれの子ノードを示す。５４のグラフは、ある特徴量
を横軸、当該ノードに残っている過去の時系列データの
分布密度を縦軸に取ったグラフである。図５にあるよう
に、間仕切りを特徴量の小さい方から大きい方へ移動し
ていった時の分割効率の良さを計算していって、もっと
も分割効率の高い位置を分割面とする。ここで述べた分
割効率とは親ノードの予測値の分散をＰ、子ノードの予
測値の分散をそれぞれｐ１，ｐ２、子ノードで残る過去
の時系列データの個数をそれぞれｎ１，ｎ２とすると、Ｐ−｛ｎ１・ｐ１＋ｎ２・ｐ２｝／｛ｎ１＋ｎ２｝で表される数値である。上記の例ではノードに残ってい
るデータの予測値の分散を用いて分割効率を計算した
が、通常の統計学でいう分散、１／ｎ Σ（ｙ−mean）² に代えて、次の式で示される絶対分散１／ｎ Σ｜ｙ−median｜を用いてもよい。Reference numeral 51 denotes the node, and reference numerals 52 and 53 denote respective child nodes. The graph 54 is a graph in which a horizontal axis indicates a certain feature amount and a vertical axis indicates the distribution density of past time-series data remaining at the node. As shown in FIG. 5, good partitioning efficiency when the partition is moved from a smaller feature amount to a larger feature amount is calculated, and a position having the highest partitioning efficiency is defined as a divided surface. The division efficiency described here means that the variance of the predicted value of the parent node is P, the variance of the predicted value of the child node is p1 and p2, and the number of past time-series data remaining in the child node is n1 and n2, respectively. P− {n1 · p1 + n2 · p2} / {n1 + n2}. In the above example, the division efficiency was calculated using the variance of the predicted value of the data remaining in the node. Instead of the variance in ordinary statistics, 1 / nｎ (y-mean) ² , The absolute variance 1 / nｎ | y-median |

【００３８】以上の処理の結果、特徴量毎に最適な分割
面と、その分割面における分割効率が求められる。最後
に全ての特徴量の結果を集計し、一番、分割効率の高い
特徴量の分割を当該ノードの分岐条件とする。As a result of the above processing, the optimum division plane for each feature amount and the division efficiency in the division plane are obtained. Finally, the results of all the feature amounts are totaled, and the division of the feature amount with the highest division efficiency is set as the branch condition of the node.

【００３９】さらにノードの分岐条件を高速、かつ安定
に探索する方法を図５、図６、図７を元に説明する。図
５の５４において、分割する間仕切りが極端に小さい方
や大きい方になると、子ノードで残る時系列データの個
数は、極端に子ノードのどちらかに偏ってしまうことに
なる。特徴量の軸のどこで間仕切りしてもあまり分割効
率に差がない場合、子ノードの個数の偏りは予測性能の
低下につながってしまう。また、一般に過去の時系列デ
ータにランダムな要素が多く含まれる場合、上記の間仕
切りの位置で分割効率に大差がない状況が発生しやすく
なる。この状況を避けるために、図６の６１にあるよう
に当該ノードで残っている過去の時系列データの分布の
中央付近で間仕切りするように、間仕切りの仕方を制限
すると予測性能の高い回帰木ができるようになる。この
中央付近は平均値や中央値、最大値と最小値の中間値な
どの値をもとに求めることができる。さらに、この方法
を取ると間仕切りの探索範囲がずっと狭められ、ノード
分岐条件の探索時間の節約にもなる。A method for quickly and stably searching for a branch condition of a node will be described with reference to FIGS. 5, 6, and 7. FIG. In 54 of FIG. 5, when the partition to be divided becomes extremely small or large, the number of time series data remaining in the child node is extremely biased toward one of the child nodes. If there is not much difference in the division efficiency even if the partitioning is performed on the axis of the feature amount, the bias in the number of the child nodes leads to a decrease in the prediction performance. In general, when many random elements are included in past time-series data, a situation in which there is not much difference in the division efficiency at the position of the partition tends to occur. In order to avoid this situation, if the partitioning method is restricted so as to partition around the center of the distribution of the past time-series data remaining at the node as shown at 61 in FIG. become able to. The vicinity of the center can be obtained based on values such as an average value, a median value, and an intermediate value between the maximum value and the minimum value. Furthermore, this method further narrows the search range of the partition and saves the search time for the node branch condition.

【００４０】以上、説明してきたノード分岐条件の探索
は、図５にある５４のような当該ノードに残った過去の
時系列データの分布密度関数が元になっている。つま
り、この分布関数はノードに固有な関数となるわけであ
る。よって、図６で示したような分布の中央も、厳密に
はノードが異なると異なってくる。しかし、ルートノー
ドでの分布を元に、各特徴量の間仕切りを予め固定して
も、予測性能にあまり影響のない場合が多い。図７にあ
るように、ルートノードに存在する（＝全部の）過去の
時系列データの特徴量に対する分布密度関数を元に予め
“階層的な固定の間仕切り”をすると、特徴量が連続値
であったのが量子化された値になることと同値になる。
このことによって、メモリーの節約ばかりでなく、ノー
ド分岐条件の探索時間の大幅な削減が実現できる。The search for the node branch condition described above is based on the distribution density function of past time-series data remaining at the node, such as 54 in FIG. That is, this distribution function is a function unique to the node. Therefore, the center of the distribution as shown in FIG. 6 also differs strictly if the nodes are different. However, even if the partition of each feature is fixed in advance based on the distribution at the root node, the prediction performance is often not significantly affected. As shown in FIG. 7, if “hierarchical fixed partitioning” is performed in advance based on the distribution density function for the feature amount of the past time series data (= all) existing at the root node, the feature amount becomes a continuous value. What is present is the same value as the quantized value.
This not only saves memory, but also significantly reduces the search time for node branch conditions.

【００４１】また、以上説明してきた２分木の回帰木を
使った時系列予測システムは、全く同様に、４分木また
は一般のｎ分木の回帰木を使用する時系列予測システム
に適用できる。The time series prediction system using a binary tree regression tree described above can be applied to a time series prediction system using a quadtree tree or a general n-ary tree regression tree. .

【００４２】次に、本実施形態の時系列予測システムを
実現するためのハードウエア構成について説明する。Next, a hardware configuration for realizing the time-series prediction system of the present embodiment will be described.

【００４３】図８は本実施形態における時系列予測シス
テムのハードウエア構成の一例を示すブロック図であ
る。８１は回線であり、電話回線や衝星回線、または専
用線などの回線を示す。回線８１から現在の時系列デー
タが予測システムに入力されることになる。８２はＣＰ
Ｕであり、実施形態で説明した、具体的な予測システム
のプログラムを実行する。８３はメモリであり、現在の
時系列データをストアしたり、プログラムの使用するテ
ンポラリーの作業領域を確保したりする働きがある。８
４のはハードディスク（ＨＤＤ）であり、制御プログラ
ムや、時系列予測システムで使用する回帰木などが保存
される。最後に、８５はディスプレイであり、予測結果
等の表示を含む各種表示が実行される。FIG. 8 is a block diagram showing an example of a hardware configuration of the time-series prediction system according to this embodiment. Reference numeral 81 denotes a line, such as a telephone line, a star line, or a dedicated line. The current time-series data is input from the line 81 to the prediction system. 82 is CP
U, which executes the specific prediction system program described in the embodiment. A memory 83 stores current time-series data and secures a temporary work area used by the program. 8
A hard disk (HDD) 4 stores a control program, a regression tree used in the time-series prediction system, and the like. Finally, reference numeral 85 denotes a display on which various displays including a display of a prediction result and the like are executed.

【００４４】図８で示した時系列予測システムの構成
は、本実施形態の時系列予測処理を、いわゆる“スタン
ドアロン”のアプリケーションプログラムによって実現
するものである。これに対して、予測結果をサーバーか
ら受け取り、クライアントにその結果を表示する時系列
予測システムの構成を図９に示す。図９における個々の
クライアント９２、９３、９４の中身は図８と同様であ
るが、決定的に異なるのは、図９のクライアントマシン
９２、９３、９４は回帰木の情報を含んでおらず、サー
バ９１が回帰木を含んでいることである。The configuration of the time-series prediction system shown in FIG. 8 realizes the time-series prediction processing of this embodiment by a so-called “stand-alone” application program. In contrast, FIG. 9 shows a configuration of a time-series prediction system that receives prediction results from a server and displays the results on a client. The contents of the individual clients 92, 93, 94 in FIG. 9 are the same as in FIG. 8, except that the client machines 92, 93, 94 in FIG. 9 do not include the information of the regression tree. That is, the server 91 includes a regression tree.

【００４５】図８のディスプレイ、および、図９のクラ
イアントマシンのディスプレイに表示される画面の例を
図１０に示す。予測しようとしている時系列が株価の変
動であれば、実施形態で説明した時系列予測処理によ
り、例えば、明日の値上がり期待銘柄のＴＯＰ３と明日
の値下がり期待銘柄ＴＯＰ３を表示したりすることがで
きる。FIG. 10 shows an example of a screen displayed on the display of FIG. 8 and the display of the client machine of FIG. If the time series to be predicted is a change in the stock price, the time series prediction processing described in the embodiment can display, for example, TOP3 of the expected stock price increase tomorrow and TOP3 of the expected stock price decrease tomorrow.

【００４６】また、図９のクライアントサーバシステム
上で本実施形態の時系列予測システムを実現した場合、
Ｗｅｂ等の公共の回線網を使用することが可能である。
この場合、サーバ９１からクライアント９２〜９４への
データが非常に価値の高いものとなる。これに対してク
ライアントを操作しているユーザが代金を支払うシステ
ムが存在する。例えば図１１に見られるように、ユーザ
ーのＩＤとパスワードを入力させることによって、ユー
ザーが予め指定した口座からお金を引き落とすようにす
るのである。また、公共の回線網を使用するため、ＳＳ
Ｌなどの暗証化技術を使用して、ユーザＩＤとパスワー
ドを安全に転送することが一般的に行われている。この
ようにすることによって、Ｗｅｂ等の公共の回線網を使
用したクライアントサーバー環境に本実施形態の時系列
予測システムをインプリメントしても、安全確実にサー
ビスに対しての課金ができるようになる。When the time-series prediction system according to the present embodiment is realized on the client-server system shown in FIG.
It is possible to use a public network such as the Web.
In this case, the data from the server 91 to the clients 92 to 94 becomes very valuable. On the other hand, there is a system in which a user operating a client pays the price. For example, as shown in FIG. 11, by inputting a user ID and a password, money is debited from an account designated in advance by the user. In addition, since the public network is used, SS
It is common practice to securely transfer a user ID and a password by using an encryption technique such as L. By doing so, even if the time-series prediction system of the present embodiment is implemented in a client-server environment using a public line network such as the Web, it is possible to securely charge for services.

【００４７】以上説明したように、上記実施形態によれ
ば、株価等のランダム性の高い時系列データについて、
再現性を取りだし、将来の時系列データの変動を予測で
きるようになる。As described above, according to the above embodiment, time series data with high randomness such as stock prices
You will be able to predict reproducibility and predict future changes in time series data.

【００４８】また、図６及び図７を参照して説明した方
法によって回帰木の分岐条件の導出に制約を加えること
により、メモリーを節約し、かつ、高速に回帰木を作成
できるという効果もある。Also, by adding a constraint to the derivation of the branch condition of the regression tree by the method described with reference to FIGS. 6 and 7, there is also an effect that the memory can be saved and the regression tree can be created at high speed. .

【００４９】なお、本発明の目的は、前述した実施形態
の機能を実現するソフトウェアのプログラムコードを記
録した記憶媒体（または記録媒体）を、システムあるい
は装置に供給し、そのシステムあるいは装置のコンピュ
ータ（またはCPUやMPU）が記憶媒体に格納されたプログ
ラムコードを読み出し実行することによっても、達成さ
れることは言うまでもない。この場合、記憶媒体から読
み出されたプログラムコード自体が前述した実施形態の
機能を実現することになり、そのプログラムコードを記
憶した記憶媒体は本発明を構成することになる。また、
コンピュータが読み出したプログラムコードを実行する
ことにより、前述した実施形態の機能が実現されるだけ
でなく、そのプログラムコードの指示に基づき、コンピ
ュータ上で稼働しているオペレーティングシステム(OS)
などが実際の処理の一部または全部を行い、その処理に
よって前述した実施形態の機能が実現される場合も含ま
れることは言うまでもない。An object of the present invention is to supply a storage medium (or a recording medium) on which a program code of software for realizing the functions of the above-described embodiments is recorded to a system or an apparatus, and to provide a computer (a computer) of the system or the apparatus. It is needless to say that the present invention can also be achieved by a CPU or an MPU) reading and executing the program code stored in the storage medium. In this case, the program code itself read from the storage medium implements the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention. Also,
When the computer executes the readout program code, not only the functions of the above-described embodiments are realized, but also the operating system (OS) running on the computer based on the instructions of the program code.
It goes without saying that a case where the functions of the above-described embodiments are implemented by performing some or all of the actual processing, and the processing performs the functions of the above-described embodiments.

【００５０】さらに、記憶媒体から読み出されたプログ
ラムコードが、コンピュータに挿入された機能拡張カー
ドやコンピュータに接続された機能拡張ユニットに備わ
るメモリに書込まれた後、そのプログラムコードの指示
に基づき、その機能拡張カードや機能拡張ユニットに備
わるCPUなどが実際の処理の一部または全部を行い、そ
の処理によって前述した実施形態の機能が実現される場
合も含まれることは言うまでもない。Further, after the program code read from the storage medium is written into a memory provided in a function expansion card inserted into the computer or a function expansion unit connected to the computer, the program code is read based on the instruction of the program code. Needless to say, the CPU included in the function expansion card or the function expansion unit performs part or all of the actual processing, and the processing realizes the functions of the above-described embodiments.

【００５１】[0051]

【発明の効果】以上説明したように、本発明によれば、
非線形性の強い時系列から再現性・未来予測性を取り出
し、未来のデータを予測することが可能となる。As described above, according to the present invention,
It is possible to extract reproducibility and future predictability from a time series with strong nonlinearity and predict future data.

[Brief description of the drawings]

【図１】本実施形態の時系列予測処理を説明する図であ
る。FIG. 1 is a diagram illustrating a time-series prediction process according to an embodiment.

【図２】本実施形態の時系列予測システムが適用される
時系列、及び時系列からの特徴抽出パターンの例を示す
図である。FIG. 2 is a diagram showing an example of a time series to which the time series prediction system of the present embodiment is applied, and a feature extraction pattern from the time series.

【図３】本実施形態の時系列予測システムで作成される
回帰木の例を示す図である。FIG. 3 is a diagram illustrating an example of a regression tree created by the time-series prediction system according to the embodiment.

【図４】本発明の時系列予測システムの回帰木作成処理
を示すフローチャートである。FIG. 4 is a flowchart showing a regression tree creation process of the time series prediction system of the present invention.

【図５】回帰木作成処理におけるノード分岐条件探索の
方法を示した図である。FIG. 5 is a diagram illustrating a method of searching for a node branch condition in a regression tree creation process.

【図６】回帰木作成処理におけるノード分岐条件探索に
おいて処理の高速化のために加えられる制限を説明する
図である。FIG. 6 is a diagram illustrating restrictions imposed for speeding up processing in a node branch condition search in regression tree creation processing.

【図７】回帰木作成処理におけるノード分岐条件探索に
おける特徴量の量子化を説明する図である。FIG. 7 is a diagram illustrating quantization of a feature amount in a node branch condition search in a regression tree creation process.

【図８】単独のアプリケーションプログラムとして、本
実施形態の時系列予測処理を実現するための装置構成を
示すブロック図である。FIG. 8 is a block diagram showing a device configuration for realizing the time-series prediction processing of the present embodiment as a single application program.

【図９】クライアントサーバーシステムにおいて本実施
形態の時系列予測処理を実現する構成を示すブロック図
である。FIG. 9 is a block diagram showing a configuration for realizing the time-series prediction processing of the embodiment in a client-server system.

【図１０】本実施形態の時系列予測システムによる結果
表示画面の例を示す図である。FIG. 10 is a diagram showing an example of a result display screen by the time-series prediction system of the embodiment.

【図１１】クライアントサーバーシステムに、本時系列
予測システムをインプリメントした時に、課金に使用す
る画面例を示す図である。FIG. 11 is a diagram showing an example of a screen used for billing when the present time series prediction system is implemented in a client server system.

[Explanation of symbols]

１１過去の時系列データ１２特徴抽出処理１３回帰木作成処理１４回帰木１５現在の時系列データ１６現時系列特徴抽出処理１７将来予測処理１８予測値 11 Past time series data 12 Feature extraction processing 13 Regression tree creation processing 14 Regression tree 15 Current time series data 16 Current time series feature extraction processing 17 Future prediction processing 18 Predicted value

Claims

[Claims]

A feature acquisition unit configured to acquire a feature of the time series from past time series data; a creation unit configured to create a regression tree based on the feature amount acquired by the feature acquisition unit; A current time series feature acquisition unit that acquires a feature amount from current time series data using the same algorithm as above, a feature amount acquired by the current time series feature acquisition unit, and a regression tree created by the creation unit. A time-series prediction device, comprising: a prediction unit that obtains a future predicted value.

2. The method according to claim 1, wherein the creating unit searches for a division position of a feature amount for determining branching of the regression tree from a plurality of division positions based on each division efficiency. Time series prediction device.

3. The method according to claim 1, wherein the generating unit sets a search range of a feature value division position for determining a branch of the regression tree to a vicinity of an average value or a median value of the feature value distribution, or an intermediate value between a maximum value and a minimum value. 3. The time-series prediction device according to claim 2, wherein the time-series prediction device is configured to restrict the time-series to near.

4. The time-series prediction device according to claim 2, wherein the creation unit searches for the division position from a value obtained by quantizing the feature amount.

5. When tracing a regression tree created by the creation means using the feature quantity acquired by the current time series feature acquisition means, the prediction means stops the prediction value in the middle of the regression tree. The time-series prediction device according to claim 1, wherein the time-series prediction device obtains the time-series prediction data.

6. A time-series prediction device according to claim 1, further comprising: a distribution unit that distributes a prediction value obtained by the time-series prediction device to another information processing device. Time series forecasting system.

7. The distribution means according to claim 5, wherein the distribution means
The time series prediction system according to claim 6, wherein the time series prediction system is formed using an eb system.

8. The method according to claim 6, wherein the distribution unit is formed using a client server system in which the time-series prediction device is a server and the other information processing device is a client. Series prediction system.

9. The time-series prediction system according to claim 6, further comprising a charging unit configured to charge when said distribution value is distributed by said distribution unit based on a contract with a user. .

10. A feature acquiring step of acquiring a feature of the time series from past time series data; a creating step of creating a regression tree based on the feature amount acquired in the feature acquiring step; Using the current time series feature acquisition step of acquiring the feature quantity from the current time series data using the same algorithm as the feature quantity acquired in the current time series feature acquisition step, and the regression tree created in the creation step A prediction step of obtaining a predicted value of the future.

11. The method according to claim 10, wherein, in the creating step, a division position of a feature amount for determining a branch of the regression tree is searched from a plurality of division positions based on each division efficiency. Time series prediction method.

12. The method according to claim 1, wherein a search range of a feature amount division position for determining branching of a regression tree is set near an average value or a median value of the feature value distribution, or an intermediate value between a maximum value and a minimum value. 12. The time-series prediction method according to claim 11, wherein the time-series prediction method is limited to the vicinity of.

13. The time-series prediction method according to claim 11, wherein in the creating step, the division position is searched from a value obtained by quantizing the feature amount.

14. The predicting step, when tracing a regression tree created in the creation step using the feature amount acquired in the current time series feature acquisition step, stops the regression tree in the middle of the regression tree to calculate a prediction value. The time-series prediction method according to claim 10, wherein the time-series prediction method is acquired.

15. A time-series prediction step for executing the time-series prediction method according to claim 10, and a distribution for distributing a prediction value obtained in the time-series prediction step to another information processing apparatus. And a control method for the time-series prediction system.

16. The control method for a time-series prediction system according to claim 15, wherein the distribution step distributes the predicted value using a Web system on the Internet.

17. The method according to claim 15, wherein, in the distribution step, the predicted value is distributed using a client server system in which the time-series prediction method is a server and the other information processing apparatus is a client. The control method of the time-series prediction system described in the above.

18. The time-series prediction system according to claim 15, further comprising a charging step of charging when said predicted value is distributed by said distribution step based on a contract with a user. .

19. A storage medium storing a program for realizing the time-series prediction method according to claim 10. Description:

20. A storage medium for storing a control program for realizing a time series prediction process by a computer, the control program comprising: a feature acquisition step of acquiring a feature of the time series from past time series data. A code of a creation step of creating a regression tree based on the feature amount acquired in the feature acquisition step, and a current time series of acquiring a feature amount from current time series data using the same algorithm as the feature acquisition step A code for a feature acquisition step, a code for a prediction step for determining a future prediction value using the feature amount acquired in the current time series feature acquisition step, and the regression tree created in the creation step. Storage medium.