JP2013089112A

JP2013089112A - Analysis device and analysis program of time series data

Info

Publication number: JP2013089112A
Application number: JP2011230633A
Authority: JP
Inventors: Hiroshi Wakimori; 浩志脇森
Original assignee: Nihon Unisys Ltd
Current assignee: Nihon Unisys Ltd
Priority date: 2011-10-20
Filing date: 2011-10-20
Publication date: 2013-05-13
Anticipated expiration: 2031-10-20
Also published as: JP5773838B2

Abstract

PROBLEM TO BE SOLVED: To analyze not only singular points but also the increase and decrease tendency in time series data. SOLUTION: A time series data analysis device comprises: a regression line generation unit 2 which, for a predetermined period specified by a user within time series data, generates a plurality of patterns of regression lines from the time series data of the predetermined period; an index calculation unit 3 which calculates predetermined indexes for the respective patterns; a separation point extraction unit 4 which extracts separation points of a regression line from the pattern having the best index among the plurality of patterns; and a tendency line generation unit 5 which generates a tendency line of the time series data by calculating regression lines of the time series data using the plurality of separation points extracted from the time series data as boundaries. By specifying the tendency of the time series data with a line, it is possible to analyze the increase and decrease tendency of the time series data from the inclination of the line, and to analyze a point at which the inclination of the line changes significantly as a singular point.

Description

The present invention relates to an analysis device and an analysis program for time-series data, and is particularly suited to analysis devices and analysis programs that perform data mining on time-series data.

Techniques are generally known for extracting knowledge from large volumes of data by analyzing the data on the basis of statistics, pattern recognition, and the like. This family of analysis techniques is called data mining. For example, a method has been proposed that detects statistical outliers and change points in time-series data by fitting a curve based on an AR (auto-regressive) model to the data (see, for example, Patent Document 1).

Patent Document 1: JP 2004-54370 A

However, because the conventional technique of Patent Document 1 analyzes time-series data by fitting a curve to it, it can detect singular points such as outliers and change points but cannot capture the increasing or decreasing trend of the time-series data.

The present invention has been made to solve this problem, and its object is to make it possible to analyze not only the singular points of time-series data but also its increasing or decreasing trend.

To solve the above problem, the present invention analyzes time-series data by fitting regression lines to it. Specifically, for a predetermined period set within the time-series data, the invention generates multiple patterns of regression lines from the in-period time-series data (the time-series data that falls within that period), and extracts a division point of the regression lines from the pattern whose predetermined index value is best. The same processing is repeated while the predetermined period is moved step by step from the beginning of the time-series data to the end, and a trend line of the time-series data is generated by computing regression lines with the resulting division points as boundaries.

With the invention configured in this way, the trend of the time-series data is represented by straight lines, so the increasing or decreasing trend can be analyzed from the slope of those lines. Points where the slope changes sharply can also be analyzed as singular points. The invention therefore makes it possible to analyze the increasing or decreasing trend of time-series data in addition to the singular points that appear in it.

FIG. 1 is a block diagram showing an example of the functional configuration of the time-series data analysis device according to this embodiment.
FIG. 2 is a diagram for explaining the processing of the regression line generation unit according to this embodiment, showing a predetermined period placed within the time-series data.
FIG. 3 is a diagram for explaining the processing of the regression line generation unit according to this embodiment, showing multiple patterns of regression lines generated from the in-period time-series data.
FIG. 4 is a diagram for explaining the processing of the trend line generation unit according to this embodiment.
FIG. 5 is a flowchart showing an example of the operation of the time-series data analysis device according to this embodiment.

An embodiment of the present invention is described below with reference to the drawings. FIG. 1 is a block diagram showing an example of the functional configuration of the time-series data analysis device according to this embodiment. As shown in FIG. 1, the device comprises, as its functional configuration, a period designation receiving unit 1, a regression line generation unit 2, an index calculation unit 3, a division point extraction unit 4, a trend line generation unit 5, a second index calculation unit 6, an optimum trend line specifying unit 7, an increase/decrease trend specifying unit 8, a singular point specifying unit 9, a similar data search unit 10, a time-series data storage unit 20, a division point storage unit 21, a trend line storage unit 22, and a search target data storage unit 23.

The functional components listed above can be implemented in hardware, by a DSP, or in software. When implemented in software, for example, the analysis device is in practice built around a computer's CPU or MPU, RAM, ROM, and so on, and is realized by running an analysis program stored in the RAM or ROM.

The functional configuration can therefore be realized by recording the analysis program on a recording medium such as a CD-ROM and having a computer read it. Besides CD-ROMs, usable recording media include flexible disks, hard disks, magnetic tape, optical disks, magneto-optical disks, DVDs, and nonvolatile memory cards. The analysis program may also be downloaded to the computer over a network such as the Internet.

The period designation receiving unit 1 receives, as a parameter w, a designation of the size of the predetermined period, made by the user through an operation unit (not shown) such as a keyboard or mouse. The parameter w indicates only the size of the predetermined period, not its position. The size of the period expresses, for example, how many of the data points in the time-series data are analyzed at a time. One or more values of w can be designated.

The regression line generation unit 2 generates regression lines for the time-series data stored in the time-series data storage unit 20 by regression analysis. For a predetermined period determined from the parameter w designated via the period designation receiving unit 1, the unit generates multiple patterns of regression lines from the in-period time-series data. Specifically, it generates patterns consisting of two regression lines, one before and one after a boundary placed at a time point within the period, as well as a pattern consisting of a single regression line with no boundary inside the period.

FIGS. 2 and 3 illustrate the processing performed by the regression line generation unit 2. FIG. 2 shows a predetermined period of the size given by the parameter w placed within the time-series data. FIG. 3 shows multiple patterns of regression lines being generated from the in-period time-series data of that period. FIG. 2(a) shows the period placed at the beginning of the time-series data, and FIG. 3 shows the patterns generated for the period of FIG. 2(a).

In the example of FIGS. 2 and 3, the parameter w is set to 5. When w = 5, the regression line generation unit 2 generates three patterns of regression lines, as shown in FIGS. 3(a) to 3(c). FIG. 3(a) shows two regression lines generated on either side of a boundary at the second time point t2, within the predetermined period (t1 to t5) placed at the beginning of time-series data consisting of 17 data points at times t1 to t17. That is, one regression line is generated over t1 to t2 and another over t3 to t5.

FIG. 3(b) shows two regression lines generated on either side of a boundary at the third time point t3: one over t1 to t3 and another over t4 to t5. FIG. 3(c) shows a single regression line generated over t1 to t5, with no boundary set inside the period (t1 to t5) placed at the beginning of the data.
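The pattern enumeration for a window of w points can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `candidate_patterns` is hypothetical, and the rule that each regression line must cover at least two points is an assumption inferred from the w = 5 example, which yields exactly three patterns.

```python
def candidate_patterns(w):
    """Return the candidate segmentations of a window of w points.

    Each pattern is a list of (start, end) pairs, 1-based and inclusive,
    so (1, 2) stands for the points t1..t2 of the window.
    """
    patterns = []
    # Two-line patterns: the boundary t is the last point of the first line.
    # Assumption: both lines need at least two points, so t runs 2..w-2.
    for t in range(2, w - 1):
        patterns.append([(1, t), (t + 1, w)])
    # One-line pattern: no boundary inside the window.
    patterns.append([(1, w)])
    return patterns


for pattern in candidate_patterns(5):
    print(pattern)
```

For w = 5 this lists the boundary-at-t2 pattern, the boundary-at-t3 pattern, and the single-line pattern, in the same order as FIGS. 3(a) to 3(c).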

The index calculation unit 3 calculates, for each of the patterns shown in FIGS. 3(a) to 3(c), an index that evaluates both the size of the error between the regression lines generated by the regression line generation unit 2 and the actual values of the in-period time-series data, and the complexity of the linear model. A known information criterion, which expresses the goodness of a statistical model, can be used as this index. This embodiment uses the Akaike information criteria AIC_sum and AIC_term given by the following (Equation 1) and (Equation 2).

(Equation 1) is the index used when there are two regression lines (FIGS. 3(a) and 3(b)), and (Equation 2) is the index used when there is only one (FIG. 3(c)). In (Equation 1), t is the position of the boundary point counted from the start of the predetermined period: t = 2 for FIG. 3(a) and t = 3 for FIG. 3(b).

In the first term on the right-hand side of (Equation 1), S_e1 is the residual sum of squares of the errors between the regression line before the boundary point and the actual values of the in-period time-series data; in the second term, S_e2 is the residual sum of squares for the regression line after the boundary point. S_e on the right-hand side of (Equation 2) is the residual sum of squares of the errors between the single regression line and the actual values of the in-period time-series data.
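The bodies of (Equation 1) and (Equation 2) are not reproduced in this text. For orientation only, the first line below is the textbook AIC of a least-squares fit with Gaussian errors, where n is the number of points, S_e the residual sum of squares, and p the number of fitted coefficients (the extra 1 counts the error variance). The second and third lines apply that form to the symbols defined above, taking p = 2 (slope and intercept), t points before the boundary and w - t after it. These two lines are an assumption and may differ from the patent's actual equations in constants or parameter counts.

```latex
\mathrm{AIC} = n\left(\ln\frac{2\pi S_e}{n} + 1\right) + 2\,(p + 1)

\mathrm{AIC}_{\mathrm{sum}} = t\left(\ln\frac{2\pi S_{e1}}{t} + 1\right)
  + (w - t)\left(\ln\frac{2\pi S_{e2}}{w - t} + 1\right) + 2 \cdot 2\,(p + 1)

\mathrm{AIC}_{\mathrm{term}} = w\left(\ln\frac{2\pi S_e}{w} + 1\right) + 2\,(p + 1)
```

Under this reading, a two-line pattern is preferred only when the reduction in residual error outweighs the penalty for the second line's extra parameters.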

The division point extraction unit 4 identifies, among the patterns of regression lines generated by the regression line generation unit 2, the pattern whose index calculated by the index calculation unit 3 is best, and extracts the division point of the regression lines from that pattern. The best index is the smallest value among the Akaike information criteria AIC_sum and AIC_term calculated by (Equation 1) and (Equation 2).

For a pattern containing two regression lines, as in FIG. 3(a) or 3(b), the division point extracted by the division point extraction unit 4 is the boundary point. That is, if the pattern of FIG. 3(a) has the best index, the division point is t2; if the pattern of FIG. 3(b) has the best index, the division point is t3. If the single-line pattern of FIG. 3(c) has the best index, no division point is extracted.
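The selection of a division point within one window might look like the sketch below. It assumes a textbook Gaussian AIC per fitted line (three parameters: slope, intercept, error variance), which may not match the patent's equations exactly. The names `residual_sum_of_squares`, `aic`, `best_division_point`, and the `rss_floor` clamp are all hypothetical; the clamp exists only because a line through two points fits exactly and would otherwise make the logarithm undefined.

```python
import math


def residual_sum_of_squares(ys):
    """RSS of the least-squares line through (0, ys[0]), (1, ys[1]), ..."""
    n = len(ys)
    mean_x = (n - 1) / 2
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return sum((y - (intercept + slope * x)) ** 2 for x, y in enumerate(ys))


def aic(ys, rss_floor=1e-12):
    """Assumed Gaussian AIC of one fitted line (slope, intercept, variance)."""
    n = len(ys)
    # Clamp the RSS so that an exact fit does not produce log(0).
    rss = max(residual_sum_of_squares(ys), rss_floor)
    return n * (math.log(2 * math.pi * rss / n) + 1) + 2 * 3


def best_division_point(window):
    """Return the 1-based boundary position t with the lowest index, or None.

    The window must contain at least two points.
    """
    best_t, best_score = None, aic(window)       # single-line pattern
    for t in range(2, len(window) - 1):          # two-line patterns
        score = aic(window[:t]) + aic(window[t:])
        if score < best_score:
            best_t, best_score = t, score
    return best_t


print(best_division_point([1, 2, 3, 10, 11]))  # level shift after the third point
print(best_division_point([1, 2, 3, 4, 5]))    # one line fits every point
```

Because short segments fit almost perfectly, the value chosen for `rss_floor` strongly influences which pattern wins on noisy data. The source does not say how this case is handled, so the clamp should be read as a placeholder.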

When it extracts a division point, the division point extraction unit 4 stores it in the division point storage unit 21. It also notifies the regression line generation unit 2 of the result: the division point if one was extracted, or the fact that none was extracted. On receiving the notification, the regression line generation unit 2 moves the predetermined period by an amount that depends on the result and performs the above processing on the moved period.

When notified of an extracted division point, the regression line generation unit 2 moves the start of the predetermined period to that division point and repeats the processing. For example, if the pattern of FIG. 3(b) has the best index and division point t3 is extracted, the start of the period is moved to t3, as shown in FIG. 2(b). When notified that no division point was extracted, the unit moves the start of the period to the time point following the start of the previous period and repeats the processing.

Once the regression line generation unit 2 has generated regression lines for the moved period, the index calculation unit 3 and the division point extraction unit 4 likewise process the moved period. This is repeated: the processing of units 2, 3, and 4 is performed multiple times while the predetermined period is moved step by step from the beginning of the time-series data to the end. Near the end of the data, where a full period of length w is no longer available, the period extends to the last data point. As a result of these repetitions, multiple division points are stored in the division point storage unit 21.
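The window movement described above can be sketched as a loop that takes the per-window decision as a callable. The names `extract_division_points` and `toy_find_split` are hypothetical. The toy rule is a stand-in used only to keep the example short and is not the AIC criterion. Stopping once fewer than two points remain is also an assumption.

```python
def extract_division_points(series, w, find_split):
    """Slide a window of up to w points over series and collect division points.

    find_split(window) must return the 1-based boundary position inside the
    window, or None when the single-line pattern is best.
    Returned division points are 1-based time indices (3 means t3).
    """
    division_points = []
    start = 0  # 0-based index of the first point of the window
    while len(series) - start >= 2:
        window = series[start:start + w]  # shorter near the end of the data
        t = find_split(window)
        if t is None:
            start += 1                    # move to the next time point
        else:
            split_time = start + t        # 1-based time index of the boundary
            division_points.append(split_time)
            start = split_time - 1        # next window starts at the boundary
    return division_points


def toy_find_split(window, jump=5):
    """Stand-in rule: split where the next value jumps by more than `jump`."""
    for t in range(2, len(window) - 1):
        if abs(window[t] - window[t - 1]) > jump:
            return t
    return None


series = [1, 2, 3, 10, 11, 12, 13, 14, 15, 30, 31, 32]
print(extract_division_points(series, 5, toy_find_split))
```

Since a boundary position is always at least 2, the window start advances on every iteration, so the loop terminates and the collected division points are strictly increasing.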

The trend line generation unit 5 generates a trend line for the time-series data by computing regression lines of the time-series data stored in the time-series data storage unit 20, using the division points stored in the division point storage unit 21 as boundaries. FIG. 4 illustrates this processing. In the example of FIG. 4, the division point extraction unit 4 has extracted three division points, t3, t9, and t13, which are stored in the division point storage unit 21. The trend line generation unit 5 uses these three points as boundaries and computes a regression line of the time-series data on each side of each boundary.

That is, the trend line generation unit 5 computes four regression lines in total: regression line 1 over t1 to t3, regression line 2 over t4 to t9, regression line 3 over t10 to t13, and regression line 4 over t14 to t17. These four regression lines, or the overall line obtained by connecting them, constitute the trend line of the time-series data consisting of the 17 data points at t1 to t17. The trend line generation unit 5 stores the trend line generated in this way in the trend line storage unit 22.
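The segment-wise fit can be sketched with an ordinary least-squares line per segment. The function names and the dictionary layout are hypothetical, and the 17 sample values are made up so that the slopes are easy to check by eye. Each segment is assumed to contain at least two points.

```python
def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for the points (xs, ys)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def trend_line(series, division_points):
    """Fit one regression line per segment.

    division_points are 1-based time indices; each one is the last point of
    its segment, so [3, 9, 13] on 17 points gives t1-t3, t4-t9, t10-t13, t14-t17.
    """
    bounds = [0] + sorted(division_points) + [len(series)]
    segments = []
    for lo, hi in zip(bounds, bounds[1:]):
        xs = list(range(lo + 1, hi + 1))  # 1-based time indices of the segment
        slope, intercept = fit_line(xs, series[lo:hi])
        segments.append({"start": lo + 1, "end": hi,
                         "slope": slope, "intercept": intercept})
    return segments


# Made-up data: 3 rising, 6 flat, 4 falling, 4 rising points.
series = [1, 2, 3, 3, 3, 3, 3, 3, 3, 10, 8, 6, 4, 0, 5, 10, 15]
for segment in trend_line(series, [3, 9, 13]):
    print(segment)
```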

When the period designation receiving unit 1 receives multiple parameters w (w1, w2, ...), the regression line generation unit 2, the index calculation unit 3, the division point extraction unit 4, and the trend line generation unit 5 perform the above processing for each of w1, w2, .... As a result, one trend line per parameter is generated and stored in the trend line storage unit 22.

The second index calculation unit 6 calculates, for each of the trend lines stored in the trend line storage unit 22, a second index that evaluates the size of the error between the trend line and the actual values of the time-series data together with the complexity of the linear model. A known information criterion, which expresses the goodness of a statistical model, can again be used as this second index. This embodiment uses the Akaike information criterion AIC_all given by the following (Equation 3).

In (Equation 3), i is the ordinal number, counted from the beginning, of each regression line contained in the trend line, and n_i is the position of the division point counted from the start of the i-th regression line, that is, the number of data points that line covers. In the example of FIG. 4, n_1 = 3 for regression line 1, n_2 = 6 for regression line 2, n_3 = 4 for regression line 3, and n_4 = 4 for regression line 4. S_ei on the right-hand side of (Equation 3) is the residual sum of squares of the errors between the i-th regression line and the actual values of the time-series data.

The optimum trend line specifying unit 7 identifies, among the trend lines stored in the trend line storage unit 22, the one whose second index calculated by the second index calculation unit 6 is best, meaning the one with the smallest value of AIC_all from (Equation 3). It stores the identified optimum trend line in the trend line storage unit 22 in a form that distinguishes it from the other trend lines.
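Choosing among trend lines produced with different values of w could be sketched as below. The body of (Equation 3) is not reproduced in this text, so `aic_all` assumes a sum of textbook Gaussian AIC terms over the segments, with three parameters per line. The function names, the `rss_floor` clamp, and the sample (n_i, S_ei) pairs are all hypothetical illustration values, not results from the patent.

```python
import math


def aic_all(segments, rss_floor=1e-12):
    """Assumed second index: sum of per-line Gaussian AIC terms.

    segments is a list of (n_i, rss_i) pairs: the number of points covered
    by the i-th regression line and its residual sum of squares.
    """
    total = 0.0
    for n, rss in segments:
        rss = max(rss, rss_floor)  # avoid log(0) on exact fits
        total += n * (math.log(2 * math.pi * rss / n) + 1) + 2 * 3
    return total


def best_window(candidates):
    """Return the parameter w whose trend line has the smallest second index."""
    return min(candidates, key=lambda w: aic_all(candidates[w]))


# Hypothetical trend lines for two window sizes on the same 17 points.
candidates = {
    5: [(3, 0.5), (6, 1.0), (4, 0.8), (4, 0.6)],  # four short, tight lines
    8: [(9, 40.0), (8, 35.0)],                    # two long, loose lines
}
print(best_window(candidates))
```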

The optimum trend line specifying unit 7 also reads the time-series data corresponding to the optimum trend line from the time-series data storage unit 20 and stores the optimum trend line and its time-series data, associated with each other, in the search target data storage unit 23. Each time an optimum trend line is obtained by running the above processing on a different set of time-series data, that trend line and its time-series data are stored in the search target data storage unit 23. The pairs of time-series data and trend lines accumulated in this way are used as search target data for pattern recognition, as described later.

When the period designation receiving unit 1 receives only one parameter w, only one trend line is generated and stored in the trend line storage unit 22. In that case, neither the calculation of the second index by the second index calculation unit 6 nor the identification of the optimum trend line by the optimum trend line specifying unit 7 is needed. The optimum trend line specifying unit 7 stores the single trend line held in the trend line storage unit 22, associated with its time-series data, in the search target data storage unit 23.

The increase/decrease trend specifying unit 8 determines the increasing or decreasing trend of the time-series data from the slope of the trend line generated by the trend line generation unit 5 and stored in the trend line storage unit 22 (the optimum trend line when multiple parameters w1, w2, ... were designated). For example, the unit receives an arbitrary time point t as a parameter and determines the trend of the time-series data at that time point: if the slope of the trend line at t is positive, the trend is judged to be increasing; if negative, decreasing.
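The slope test at a designated time point is simple enough to sketch directly. The function name `trend_at` and the tuple layout are hypothetical. The source does not say how a slope of exactly zero is treated, so the "flat" result is an assumption.

```python
def trend_at(segments, t):
    """Classify the trend at 1-based time index t.

    segments is a list of (start, end, slope) tuples describing the
    regression lines that make up the trend line.
    """
    for start, end, slope in segments:
        if start <= t <= end:
            if slope > 0:
                return "increasing"
            if slope < 0:
                return "decreasing"
            return "flat"  # assumption: not covered by the source
    raise ValueError(f"time index {t} is outside the trend line")


# Hypothetical trend line with the segment layout of FIG. 4.
segments = [(1, 3, 1.0), (4, 9, 0.0), (10, 13, -2.0), (14, 17, 5.0)]
print(trend_at(segments, 2), trend_at(segments, 11))
```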

The singular point specifying unit 9 identifies points of sudden increase or sudden decrease in the time-series data as singular points. The candidates are the division points recorded in the division point storage unit 21 as boundaries between regression lines. For each division point extracted by the division point extraction unit 4 and stored in the division point storage unit 21, the unit calculates the difference between the actual values of the time-series data at the points before and after the division point, and identifies as a sudden-increase or sudden-decrease point any division point where that difference is at or beyond a predetermined value. For example, the unit computes the standard deviation σ of the errors between the trend line and the actual values of the time-series data; a division point is judged a sudden-increase point if the difference is an increase of 2σ or more, and a sudden-decrease point if it is a decrease of 2σ or more (a difference of -2σ or less).
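The 2σ test can be sketched as follows. Two points are interpretations rather than statements from the source: "the points before and after the division point" is read as the samples at p - 1 and p + 1, and σ is taken as the population standard deviation of the residuals. The function name, the labels, and the sample values are hypothetical.

```python
import statistics


def singular_points(series, fitted, division_points, k=2.0):
    """Label division points where the series jumps by at least k * sigma.

    series and fitted hold the actual values and the trend-line values at
    t1, t2, ...; division_points are 1-based time indices.
    """
    residuals = [y - f for y, f in zip(series, fitted)]
    sigma = statistics.pstdev(residuals)  # assumption: population std dev
    labels = {}
    for p in division_points:
        if p < 2 or p >= len(series):
            continue  # no neighbour on one side; skipped by assumption
        # Interpretation: compare the values at p - 1 and p + 1.
        diff = series[p] - series[p - 2]
        if diff >= k * sigma:
            labels[p] = "sudden increase"
        elif diff <= -k * sigma:
            labels[p] = "sudden decrease"
    return labels


# Made-up example: a jump up after t3 and a drop after t6.
series = [1.0, 2.1, 2.9, 10.0, 11.1, 11.9, 4.0, 3.1, 1.9]
fitted = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0, 4.0, 3.0, 2.0]
print(singular_points(series, fitted, [3, 6]))
```

When the trend line fits closely, σ is small and most division points pass the test, so in practice the multiplier k or the predetermined value would need tuning to the data.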

The similar data search unit 10 searches the search target data storage unit 23, by pattern recognition, for trend lines similar to the trend line generated by the trend line generation unit 5 (the optimum trend line when multiple parameters w1, w2, ... were designated), and extracts from the search target data storage unit 23 the time-series data corresponding to the trend lines found. In other words, it retrieves other time-series data whose trend is similar to that of the time-series data currently being analyzed.

FIG. 5 is a flowchart showing an example of the operation of the time-series data analysis device according to this embodiment, specifically the operation of generating a trend line from time-series data. The flowchart starts when the time-series data to be analyzed has been read from the time-series data storage unit 20 and the user has designated the value of the parameter w through the operation unit (not shown).

In FIG. 5, the regression line generation unit 2 acquires the designated parameter w (step S1) and sets a predetermined period of width w in the time-series data (step S2). The period is initially placed at the beginning of the time-series data. The unit then generates multiple patterns of regression lines by regression analysis of the in-period time-series data, taken from the time-series data read from the time-series data storage unit 20 (step S3).

The index calculation unit 3 calculates the indexes given by the Akaike information criteria AIC_sum and AIC_term of (Equation 1) and (Equation 2) for each of the patterns generated by the regression line generation unit 2 (step S4). The division point extraction unit 4 then identifies the pattern with the best index among those patterns and extracts the division point of the regression lines from it (step S5).

The division point extraction unit 4 then determines whether a division point was extracted from the period under analysis (step S6). If not, processing moves to step S8. If so, the unit stores the division point in the division point storage unit 21 (step S7). In either case it notifies the regression line generation unit 2: of the division point if one was extracted, or of the fact that none was.

On receiving this notification, the regression line generation unit 2 determines whether the predetermined period has been set all the way to the end of the time-series data and the analysis has been completed (step S8). If the analysis has not yet reached the end of the time-series data, the process returns to step S2, where the predetermined period is moved according to the notification result. The processing from step S3 onward is then performed in the same way for the moved predetermined period.
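
The loop over steps S2 to S8 can be sketched as below. How the predetermined period is moved "according to the notification result" is not spelled out in this passage, so the sketch assumes one plausible rule: when a division point is found, the next window starts at that point, and otherwise the window advances by a fixed step. The function names `scan_windows` and `largest_jump`, the `step` parameter, and the toy detector itself are hypothetical stand-ins for the index-based extraction of steps S3 to S5.

```python
def largest_jump(window, threshold=5.0):
    """Toy split detector (an assumption for illustration only): returns the
    index just after the largest adjacent jump, or None if it is too small."""
    jumps = [abs(b - a) for a, b in zip(window, window[1:])]
    i = max(range(len(jumps)), key=jumps.__getitem__)
    return i + 1 if jumps[i] >= threshold else None


def scan_windows(series, w, find_split, step=None):
    """Slide a window of width w over the series and collect division points.

    find_split(window) returns an index local to the window, or None.
    """
    step = step or max(1, w // 2)  # assumed default advance when nothing is found
    splits = []
    start = 0
    while start + w <= len(series):
        local = find_split(series[start:start + w])
        if local is not None and 0 < local < w:
            splits.append(start + local)  # convert to a global index (step S7)
            start += local                # assumed rule: restart at the split
        else:
            start += step                 # assumed rule: plain advance
    return splits
```

In this sketch a tail shorter than `w` is left unanalyzed; a real implementation would need a policy for the final partial window.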

On the other hand, when the analysis has reached the end of the time-series data, the trend line generation unit 5 generates a trend line of the time-series data by obtaining regression lines of the time-series data with the plurality of division points stored in the division point storage unit 21 as boundaries (step S9). It then stores the trend line in the trend line storage unit 22 (step S10).
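
Step S9 amounts to a piecewise regression with the stored division points as boundaries. The following is a minimal sketch, assuming the time axis is the sample index and each segment is fitted by ordinary least squares; the names `fit_line` and `trend_line` and the tuple layout of the result are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx else 0.0
    return slope, my - slope * mx


def trend_line(ys, split_points):
    """Fit one regression line per segment bounded by the division points.

    Returns a list of (start, end, slope, intercept), with end exclusive.
    """
    bounds = [0] + sorted(set(split_points)) + [len(ys)]
    segments = []
    for lo, hi in zip(bounds, bounds[1:]):
        if hi <= lo:
            continue  # ignore degenerate boundaries at the ends of the data
        slope, intercept = fit_line(list(range(lo, hi)), ys[lo:hi])
        segments.append((lo, hi, slope, intercept))
    return segments
```

For example, `trend_line([0, 1, 2, 3, 6, 5, 4, 3], [4])` yields one rising segment over indices 0 to 3 and one falling segment over indices 4 to 7.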

Subsequently, the regression line generation unit 2 determines whether all the parameters w designated by the user have been processed (step S11). If the user designated only one parameter w, all designated parameters have been processed at this point. If the user designated a plurality of parameters w, the unit checks whether any remain unprocessed; if none remain, all designated parameters w have been processed.

If an unprocessed parameter w remains, the process returns to step S1, and the processing from step S2 onward is performed in the same way for the newly acquired parameter w. When all designated parameters w have been processed, the second index calculation unit 6 determines whether a plurality of parameters w were designated (step S12). If so, a plurality of trend lines are stored in the trend line storage unit 22, and the second index calculation unit 6 calculates the second index for each of them (step S13).

Subsequently, the optimum trend line specifying unit 7 identifies, among the plurality of trend lines stored in the trend line storage unit 22, the trend line with the best second index calculated by the second index calculation unit 6 (step S14). It then stores the identified optimum trend line in the trend line storage unit 22 in a state that distinguishes it from the other trend lines. The optimum trend line specifying unit 7 also reads the time-series data corresponding to the identified optimum trend line from the time-series data storage unit 20 and stores the optimum trend line, in association with that time-series data, in the search target data storage unit 23 (step S15).

When only one parameter w was designated, only one trend line is stored in the trend line storage unit 22. In this case, steps S13 and S14 are skipped. The optimum trend line specifying unit 7 reads the time-series data corresponding to the single trend line stored in the trend line storage unit 22 from the time-series data storage unit 20 and stores that trend line, in association with the time-series data, in the search target data storage unit 23 (step S15). This completes the processing of the flowchart shown in FIG. 5.
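
The selection in steps S12 to S14 can be sketched as follows. The exact definition of the second index is not given in this passage; the sketch assumes an AIC-style score over the whole series that penalizes each additional segment, which is one way to weigh fit error against the complexity of the line model. The names `segment_rss`, `second_index`, `pick_best` and the mapping from each parameter w to its division points are hypothetical.

```python
import math


def segment_rss(ys, lo, hi):
    """Residual sum of squares of a least-squares line fitted to ys[lo:hi]."""
    xs = range(lo, hi)
    seg = ys[lo:hi]
    n = hi - lo
    mx = sum(xs) / n
    my = sum(seg) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, seg))
    slope = sxy / sxx if sxx else 0.0
    return sum((y - my - slope * (x - mx)) ** 2 for x, y in zip(xs, seg))


def second_index(ys, splits):
    """Assumed AIC-style score for a whole trend line (lower is better)."""
    bounds = [0] + sorted(splits) + [len(ys)]
    rss = sum(segment_rss(ys, lo, hi) for lo, hi in zip(bounds, bounds[1:]))
    k = 2 * (len(bounds) - 1) + 1  # slope and intercept per segment, plus variance
    n = len(ys)
    return n * math.log(max(rss, 1e-12) / n) + 2 * k


def pick_best(ys, splits_by_w):
    """Return the parameter w whose trend line has the best second index."""
    return min(splits_by_w, key=lambda w: second_index(ys, splits_by_w[w]))
```

With a single designated w there is nothing to compare, which matches the flowchart skipping steps S13 and S14 in that case.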

As described in detail above, according to the present embodiment the tendency of the time-series data is expressed by straight lines. The increase/decrease tendency of the time-series data can therefore be analyzed from the slope of each line, and points where the slope changes greatly can be analyzed as singular points. For example, if the trend line generated by the trend line generation unit 5 is displayed as a graph on a display, the user can intuitively grasp the increase/decrease tendency and the singular points of the time-series data. In addition, by having the increase/decrease tendency specifying unit 8 or the singular point specifying unit 9 process the trend line generated by the trend line generation unit 5, the increase/decrease tendency and the singular points of the time-series data can be identified by a computer.
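
The kind of processing attributed to the increase/decrease tendency specifying unit 8 and the singular point specifying unit 9 can be sketched as below. The sketch follows the claims in spirit: tendency is read from the sign of each segment's slope, and a division point is flagged when the actual values on either side of it differ by at least a predetermined value. Taking "before and after" to mean the samples immediately adjacent to the division point is an assumption, as are the names `classify_segments` and `sudden_changes` and the `flat_tol` and `threshold` parameters.

```python
def classify_segments(segments, flat_tol=1e-9):
    """Label each (start, end, slope) segment by the sign of its slope."""
    labels = []
    for start, end, slope in segments:
        if slope > flat_tol:
            label = "increase"
        elif slope < -flat_tol:
            label = "decrease"
        else:
            label = "flat"
        labels.append((start, end, label))
    return labels


def sudden_changes(ys, split_points, threshold):
    """Flag division points where the actual value jumps by >= threshold."""
    events = []
    for p in split_points:
        if not 0 < p < len(ys):
            continue  # a division point needs a sample on both sides
        diff = ys[p] - ys[p - 1]  # assumed: compare immediately adjacent samples
        if abs(diff) >= threshold:
            kind = "sudden increase" if diff > 0 else "sudden decrease"
            events.append((p, kind))
    return events
```

The threshold plays the role of the "predetermined value" and would be chosen per data set.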

In the above embodiment, the user can designate the width of the predetermined period as the parameter w, but the width may instead be a fixed value. Making the width variable as the parameter w, however, has the advantage that the time-series data can be analyzed flexibly. That is, a larger value of w allows the broad tendency of the time-series data to be analyzed, while a smaller value of w allows the tendency of fine portions of the time-series data to be analyzed.

In the above embodiment, either one or a plurality of parameters w can be designated, but the designation may be limited to one. In that case, the second index calculation unit 6 and the optimum trend line specifying unit 7 are unnecessary. Designating a plurality of parameters w and performing the processing of the second index calculation unit 6 and the optimum trend line specifying unit 7, however, has the advantage that a trend line representing the characteristics of the time-series data more faithfully can be generated.

In the above embodiment, the increase/decrease tendency specifying unit 8, the singular point specifying unit 9, and the similar data search unit 10 are provided, but these are not essential components and may be omitted. Alternatively, only one or two of them may be provided. For example, even if the increase/decrease tendency specifying unit 8 and the singular point specifying unit 9 are omitted, the user can still intuitively grasp the increase/decrease tendency and the singular points of the time-series data if the trend line generated by the trend line generation unit 5 is displayed as a graph on a display.

In the above embodiment, the trend line generated by the trend line generation unit 5 and its corresponding time-series data are stored in the search target data storage unit 23 and used later as search target data when pattern recognition is performed on other time-series data, but the present invention is not limited to this. For example, several patterns of time-series data to be searched, together with their trend lines, may be generated in advance and stored in the search target data storage unit 23.

In the above embodiment, the Akaike information criterion is used as an example of the index, but the present invention is not limited to this. For example, another information criterion, such as a Bayesian criterion, may be used.
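
For reference, the standard textbook definitions of the two criteria are given below. They are not quoted from (Equation 1) or (Equation 2) of this document, and reading the Bayesian criterion mentioned above as the Bayesian information criterion (BIC) is an assumption. Here \hat{L} is the maximized likelihood, k is the number of estimated parameters, and n is the number of data points.

```latex
\mathrm{AIC} = -2\ln\hat{L} + 2k, \qquad
\mathrm{BIC} = -2\ln\hat{L} + k\ln n
```

Because \ln n > 2 whenever n \ge 8, BIC penalizes each additional parameter more heavily than AIC for all but very short windows, so under this reading it would tend to select patterns with fewer division points.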

In addition, each of the above embodiments is merely one concrete example of carrying out the present invention, and the technical scope of the present invention must not be construed restrictively on their account. That is, the present invention can be implemented in various forms without departing from its gist or its main features.

DESCRIPTION OF SYMBOLS
1 Period designation receiving unit
2 Regression line generation unit
3 Index calculation unit
4 Division point extraction unit
5 Trend line generation unit
6 Second index calculation unit
7 Optimum trend line specifying unit
8 Increase/decrease tendency specifying unit
9 Singular point specifying unit
10 Similar data search unit

Claims

An apparatus for analyzing time-series data, comprising:
a regression line generation unit that, for a predetermined period set within time-series data composed of a plurality of data, generates a plurality of patterns of regression lines from in-period time-series data, which is the time-series data within the predetermined period;
an index calculation unit that calculates, for each of the plurality of patterns, an index that evaluates the magnitude of the error between the regression lines generated by the regression line generation unit and the actual values of the in-period time-series data, and the complexity of the linear model;
a division point extraction unit that identifies the pattern with the best index calculated by the index calculation unit and extracts a division point of the regression lines from the identified pattern; and
a trend line generation unit that generates a trend line of the time-series data by obtaining regression lines of the time-series data with, as boundaries, a plurality of division points extracted by performing the processing of the regression line generation unit, the index calculation unit, and the division point extraction unit a plurality of times while sequentially moving the predetermined period from the beginning to the end of the time-series data.

The time-series data analysis apparatus according to claim 1, wherein the regression line generation unit generates, as the plurality of patterns, patterns in which two regression lines are generated before and after a time point within the predetermined period taken as a boundary, and a pattern in which a single regression line is generated without setting a boundary within the predetermined period.

The time-series data analysis apparatus according to claim 1, further comprising an increase/decrease tendency specifying unit that specifies an increase/decrease tendency of the time-series data based on the slope of the trend line generated by the trend line generation unit.

The time-series data analysis apparatus according to claim 1, further comprising a singular point specifying unit that calculates, for each of the plurality of division points extracted by the division point extraction unit, the difference between the actual values of the time-series data at the points before and after the division point, and identifies a division point for which that difference is equal to or greater than a predetermined value as a sudden increase point or a sudden decrease point.

The time-series data analysis apparatus according to claim 1, further comprising:
a search target data storage unit that stores, as search target data, a plurality of sets each consisting of arbitrary time-series data and its trend line; and
a similar data search unit that searches the search target data storage unit, by pattern recognition, for a trend line similar to the trend line generated by the trend line generation unit, and extracts the time-series data corresponding to the retrieved trend line from the search target data storage unit.

The time-series data analysis apparatus according to claim 1, further comprising a period designation receiving unit that receives designation of the size of the predetermined period as a parameter,
wherein the regression line generation unit generates, for the predetermined period determined based on the parameter received by the period designation receiving unit, the plurality of patterns of regression lines from the in-period time-series data, which is the time-series data within the predetermined period.

When the period designation receiving unit receives designation of a plurality of parameters, the regression line generation unit, the index calculation unit, the division point extraction unit, and the trend line generation unit perform their processing for each of the plurality of parameters to generate a plurality of trend lines,
A second index calculation unit that calculates, for each of the plurality of trend lines, a second index that evaluates the magnitude of the error between the trend line and the actual values of the time-series data, and the complexity of the linear model; and
7. An optimum trend straight line specifying unit for specifying a best trend straight line for the second index calculated by the second index calculating unit among the plurality of trend straight lines. Analyzing device for time series data described in 1.

A program for analyzing time-series data, the program causing a computer to function as:
regression line generating means for generating, for a predetermined period set within time-series data composed of a plurality of data, a plurality of patterns of regression lines from in-period time-series data, which is the time-series data within the predetermined period;
index calculating means for calculating, for each of the plurality of patterns, an index that evaluates the magnitude of the error between the regression lines generated by the regression line generating means and the actual values of the in-period time-series data, and the complexity of the linear model;
division point extracting means for identifying the pattern with the best index calculated by the index calculating means and extracting a division point of the regression lines from the identified pattern; and
trend line generating means for generating a trend line of the time-series data by obtaining regression lines of the time-series data with, as boundaries, a plurality of division points extracted by performing the processing of the regression line generating means, the index calculating means, and the division point extracting means a plurality of times while sequentially moving the predetermined period from the beginning to the end of the time-series data.