JP2014211827A

JP2014211827A - Derivation device, derivation method and derivation program

Info

Publication number: JP2014211827A
Application number: JP2013088839A
Authority: JP
Inventors: 合原　一幸; Kazuyuki Aihara; 一幸合原; 洛南陳; Rakunan Chin; 歓飛馬; Huan Fei Ma
Original assignee: Japan Science and Technology Agency
Current assignee: Japan Science and Technology Agency
Priority date: 2013-04-19
Filing date: 2013-04-19
Publication date: 2014-11-13
Also published as: WO2014171544A1

Abstract

PROBLEM TO BE SOLVED: To provide a derivation device, a derivation method and a derivation program for, on the basis of observation data of m points observed in time series for each of n-dimensional items related to would-be which changes with time, deriving time series data at a point of time not included in the observation data.SOLUTION: A derivation device 1 acquires the observation data of m points arranged in time series for each of n-dimensional items (S101), and accepts the designation of an item to be derived (S103), and derives a conversion map for searching, as mapping from a start region matrix consisting of n rows×m columns in which items are arranged so as to be aligned in a row direction and time series are arranged so as to be aligned in a column direction with the acquired observation data as elements, an end region matrix consisting of L rows×m columns in which the observation data of m+1-i (i is a natural number) points and the derivation data of i-1 points are arranged in a time sequential order in the i-th row with the observation data and derivation data of the designated item as elements (S105 to S108), and derives the derivation data by the derived conversion map (S109).

Description

本発明は、経時的に変化する事象に関するｎ次元の項目それぞれについて時系列で観測したｍ点の観測データに基づいて、前記観測データに含まれていない時点のデータを導出する導出装置、導出方法及び導出プログラムに関する。 The present invention relates to a derivation device and a derivation method for deriving data at a time point not included in the observation data based on observation data of m points observed in time series for each of n-dimensional items related to events that change over time. And a derivation program.

株価、為替、物価等の金融市場、気象、電力消費量、生命分子レベルの進展データ、複雑な疾患の進展等の経時的に変化する事象について、時系列の変化を予測することは、産業界だけでなく科学研究分野においても重要である。従来の時系列予測方法は、一般的に十分なサンプル数が存在することを前提として開発されている。代表的な時系列モデルとしては、ＡＲモデル、ＭＡモデル、ＡＲＭＡモデルを例示することができる。また、Takensの埋め込み理論（非特許文献１）を応用してアトラクタの再構成を行うことで時系列予測を行う方法もある。このような時系列予測において、予測モデルを構築するためには、十分なサンプル数を含む長期的時系列データが必要となる。 Predicting time-series changes in time-varying events such as financial markets such as stock prices, exchange rates, prices, weather, electricity consumption, biomolecular progress data, and complicated disease progress It is important not only in scientific research. Conventional time series prediction methods are generally developed on the assumption that a sufficient number of samples exist. As a typical time series model, an AR model, an MA model, and an ARMA model can be exemplified. There is also a method of performing time series prediction by reconstructing an attractor by applying Takens' embedding theory (Non-Patent Document 1). In such time series prediction, in order to construct a prediction model, long-term time series data including a sufficient number of samples is required.

一方、近年では、データ測定技術がめざましい発展を遂げており、少ないサンプルから膨大な項目のデータを高速に得ることが容易となってきている。例えば、遺伝子研究に用いられるハイスループット・バイオデータ、即ち、マイクロアレイデータでは、数十個のサンプルから、１つのチップで何万個もの遺伝子プローブを同時に測定することが可能である。少ないサンプルから多くの項目のデータを観測することができる事象としては、様々な統計レポート、気象情報、金融市場情報、天体観測情報等の情報を例示することができる。 On the other hand, in recent years, data measurement technology has made remarkable progress, and it has become easy to obtain enormous items of data from a small number of samples at high speed. For example, in high-throughput biodata used for genetic research, that is, microarray data, it is possible to simultaneously measure tens of thousands of gene probes from one tens of samples on one chip. Examples of events that can observe data of many items from a small sample include various statistical reports, weather information, financial market information, astronomical observation information, and the like.

タケンス・エフ（Takens,F. ）著,「動的システムと乱流（in Dynamical Systems and Turbulence），シュプリンガー・フェアラーク・ニューヨーク（Springer-Verlag New York），１９８１年，ｐ．３６６−３８１Takens, F., “In Dynamical Systems and Turbulence, Springer-Verlag New York, 1981, pp. 366-381.

しかしながら、従来の時系列予測方法は、予測モデルを構築するために、十分長い長期的時系列データが必要であるにも関わらず、実際の系（システム）では、長期的時系列データを得ることが難しい場合も多いという問題がある。例えば、分子生物学の実験においては、各種の制約から、一般的に、数十点の時系列データしか得られず、長期的時系列データを得ることは難しい。また、市場における株式データの場合も、データ数は多く得られるが、条件依存度又は時変モデルのため、時系列データのうち予測に有用なのは直近のデータに限られるので、実質的には短期的なデータと等価である。 However, the conventional time-series prediction method obtains long-term time-series data in an actual system even though long-term time-series data that is sufficiently long is necessary to construct a prediction model. There is a problem that is often difficult. For example, in molecular biology experiments, due to various limitations, generally only a few tens of time series data can be obtained, and it is difficult to obtain long-term time series data. In addition, in the case of stock data in the market, a large number of data can be obtained, but because of the degree of condition dependence or time-varying model, only the most recent data is useful for forecasting because of time-dependent data. Is equivalent to typical data.

一方、多くの項目を有する短期的時系列データは、あまり時系列予測には用いられていない。その理由は、短期的時系列データは、事象の予測に用いるには、情報が不足していると考えられているからである。 On the other hand, short-term time-series data having many items is not often used for time-series prediction. The reason is that short-term time-series data is considered to lack information to be used for event prediction.

本発明は斯かる事情に鑑みてなされたものであり、多くの項目を有する高次元時系列データを事象の予測に用いるものである。本願発明者らは、短期的であっても多くの項目を有する高次元時系列データは、事象の様々な面を反映したデータであるため、各変数間の相互作用情報を豊富に含んでいるという観点から、事象に蓄積された動特性を反映していると考える。このような発想から、本願発明者らは、Takensの埋め込み理論を拡張して、高次元時系列データを長期的低次元の時系列データに変換する方法、即ち、Takensの埋め込み理論による変換の逆変換法を見つけ出した。この方法を用いることにより、高次元時系列データを長期的低次元の時系列データに変換することで、事象の一部の項目の動特性を予測することが可能となる。即ち、本発明は、このような発想を具現化したものであり、事象の観測データに基づいて、未知の時系列データを推測することが可能な導出装置、導出方法及び導出プログラムを提供することを目的とする。 The present invention has been made in view of such circumstances, and uses high-dimensional time-series data having many items for event prediction. The inventors of the present application have abundant interaction information between variables because high-dimensional time-series data having many items even in the short term reflects various aspects of the event. From this point of view, I think that it reflects the dynamic characteristics accumulated in the event. From such an idea, the present inventors extended the Takens embedding theory to convert high-dimensional time-series data into long-term low-dimensional time-series data, that is, the inverse of the conversion by Takens embedding theory. I found a conversion method. By using this method, it is possible to predict the dynamic characteristics of some items of an event by converting high-dimensional time-series data into long-term low-dimensional time-series data. That is, the present invention embodies such an idea, and provides a derivation device, a derivation method, and a derivation program capable of estimating unknown time-series data based on event observation data. With the goal.

上記目的を達成するために、本発明に係る導出装置は、経時的に変化する事象に関するｎ次元（ｎは２以上の整数）の項目それぞれについて時系列で観測して得られたｍ点（ｍは２以上の整数）の観測データに基づいて、前記観測データに含まれていない時点のデータを導出データとして導出する導出装置であって、ｎ次元の項目それぞれについてのｍ点の観測データを取得する取得部と、導出する項目の指定を受け付ける項目受付手段と、前記取得部が取得した観測データを要素とし、項目が行方向に並び、時系列が列方向に並ぶように配置したｎ行×ｍ列の始域行列を記録する記録部と、指定された項目の観測データ及び導出データを要素とし、ｉ行目（ｉは自然数）にｍ＋１−ｉ点の観測データ及びｉ−１点の導出データを時系列順に配置したＬ行×ｍ列（Ｌは２以上の整数）の終域行列を、前記始域行列からの写像とする変換マップを導出する手段と、導出した変換マップにより、前記記録部に記録した始域行列からの写像として前記終域行列を導出する手段と、導出した終域行列に含まれる導出データを出力する出力部とを備えることを特徴とする。 In order to achieve the above object, the deriving device according to the present invention provides m points (m) obtained by observing each item of n dimensions (n is an integer of 2 or more) in time series regarding events that change over time. Is a derivation device for deriving data at a time point not included in the observation data as derived data based on observation data of 2 or more), and obtaining m-point observation data for each n-dimensional item An acquisition unit that receives the designation of an item to be derived, and observation data acquired by the acquisition unit as elements, n rows arranged such that items are arranged in a row direction and time series are arranged in a column direction × A recording unit for recording an m-column start matrix, and observation data and derived data of a specified item as elements, and derivation of observation data of m + 1-i points and i-1 points in the i-th row (i is a natural number). Data in chronological order An L row × m column (L is an integer of 2 or more) placed end region matrix is recorded in the recording unit by means for deriving a conversion map that is a mapping from the start region matrix, and the derived conversion map. It is characterized by comprising means for deriving the end region matrix as a map from the start region matrix, and an output unit for outputting the derived data included in the derived end region matrix.

また、本発明に係る導出装置は、経時的に変化する事象に関するｎ次元（ｎは２以上の整数）の項目それぞれについて時系列に観測して得られたｍ点（ｍは２以上の整数）の観測データに基づいて、前記観測データに含まれていない時点のデータを導出データとして導出する導出装置であって、ｎ次元の項目それぞれについてのｍ点の観測データを取得する取得部と、導出する項目の指定を受け付ける項目受付手段と、ｎ個の係数及び観測データ並びに指定された項目についてのｍ＋１−ｉ点の観測データ（ｉ＝１，２，... ，Ｌ）及びｉ−１点の導出データに係る変数にて構成されるｍ＋１−ｉ個の数式を生成する数式生成手段と、生成した数式に基づいて、導出データを導出する導出手段と、導出した導出データを出力する出力部とを備えることを特徴とする。 In addition, the deriving device according to the present invention provides m points (m is an integer equal to or greater than 2) obtained by observing each of n-dimensional (n is an integer equal to or greater than 2) items relating to events that change over time. A deriving device for deriving, as derived data, data at a time point not included in the observation data based on the observed data, an acquisition unit for acquiring m-point observation data for each of n-dimensional items, and a deriving device Item accepting means for accepting designation of an item to be performed, n coefficients and observation data, and m + 1-i point observation data (i = 1, 2,..., L) and i-1 point for the designated item Formula generating means for generating m + 1-i formulas composed of variables related to the derived data, derivation means for deriving derived data based on the generated formulas, and an output unit for outputting the derived data And with It is characterized by that.

また、本発明に係る導出装置は、前記数式生成手段は、ｎ個の係数及びｎ項目の観測データの積、並びに指定された項目の観測データ及び導出データに係る変数にて構成される線形性を有する数式をｉ個生成し、前記導出手段は、前記数式生成手段が生成した数式にて求められるｎ個の係数の値を用いて、前記変数に係る導出データを導出することを特徴とする。 In the derivation device according to the present invention, the mathematical expression generation means is a linearity composed of a product of n coefficients and n items of observation data, and variables relating to observation data and derivation data of specified items. I is generated, and the derivation means derives derived data related to the variable by using the values of n coefficients obtained from the mathematical expression generated by the mathematical expression generation means. .

また、本発明に係る導出装置は、非線形関数の指定及び指定された非線形関数の展開次数を受け付ける関数指定部と、前記数式生成手段は、前記関数指定部により指定された非線形関数及び展開次数に基づいて、ｉ個の数式として、それぞれ観測データ及び導出データを変数とするｎ個の非線形関数を生成することを特徴とする。 Further, the derivation device according to the present invention includes a function designating unit that receives designation of a non-linear function and an expansion order of the designated non-linear function, and the mathematical expression generating means uses the non-linear function and the expansion order designated by the function designating unit. On the basis of this, n nonlinear functions are generated as i equations, each of which has observation data and derived data as variables.

また、本発明に係る導出装置は、前記導出データは、ｍ番目の観測データより後のＬ−１点分のデータ、又は１番目の観測データより前のＬ−１点分のデータであることを特徴とする。 In the derivation device according to the present invention, the derivation data is data for L-1 points after the m-th observation data, or data for L-1 points before the first observation data. It is characterized by.

本発明に係る導出方法は、経時的に変化する事象に関するｎ次元（ｎは２以上の整数）の項目それぞれについて時系列に観測して得られたｍ点（ｍは２以上の整数）の観測データに基づいて、前記観測データに含まれていない時点のデータを導出データとして導出する導出装置を用いた導出方法であって、前記導出装置は、ｎ次元の項目それぞれについてのｍ点の観測データを取得するステップと、導出する項目を決定するステップと、ｎ個の係数及び観測データ並びに指定された項目についてのｍ＋１−Ｌ点の観測データ（Ｌ＝１，２，... ，ｉ）及びＬ−１点の導出データに係る変数にて構成されるｉ個の数式を生成するステップと、生成した数式に基づいて、導出データを導出するステップとを実行することを特徴とする。 In the derivation method according to the present invention, observation of m points (m is an integer of 2 or more) obtained by observing each of n-dimensional (n is an integer of 2 or more) items in time series regarding events that change over time. A derivation method using a derivation device that derives data at a time point not included in the observation data as derivation data based on data, the derivation device comprising m points of observation data for each of n-dimensional items , Determining the item to be derived, n coefficients and observation data, and observation data (L = 1, 2,..., I) of m + 1-L points for the specified item and A step of generating i formulas composed of variables related to the derived data of L-1 points and a step of deriving derived data based on the generated formulas are performed.

本発明に係る導出プログラムは、コンピュータに、経時的に変化する事象に関するｎ次元（ｎは２以上の整数）の項目それぞれについて時系列に観測して得られたｍ点（ｍは２以上の整数）の観測データに基づいて、前記観測データに含まれていない時点のデータを導出データとして導出させる導出プログラムであって、コンピュータに、ｎ次元の項目それぞれについてのｍ点の観測データを取得するステップと、導出する項目を決定するステップと、ｎ個の係数及び観測データ並びに指定された項目についてのｍ＋１−Ｌ点の観測データ（Ｌ＝１，２，... ，ｉ）及びＬ−１点の導出データに係る変数にて構成されるｉ個の数式を生成するステップと、生成した数式に基づいて、導出データを導出するステップとを実行させることを特徴とする。 The derivation program according to the present invention allows a computer to obtain m points (m is an integer greater than or equal to 2) obtained by observing each of n-dimensional items (n is an integer greater than or equal to 2) relating to an event that changes over time. ) Is a derivation program for deriving data at a time point not included in the observation data as derivation data, and the computer obtains m-point observation data for each of the n-dimensional items. Determining items to be derived, n coefficients and observation data, and m + 1-L point observation data (L = 1, 2,..., I) and L−1 points for the specified item. Generating i mathematical formulas composed of variables related to the derived data, and deriving derived data based on the generated mathematical formulas. The

上記特徴を有する導出装置、導出方法及び導出プログラムは、高次元の観測データから未知の導出データを導出することにより、指定された項目についての将来又は過去の未知のデータを推測することが可能である。 The derivation device, derivation method, and derivation program having the features described above can estimate future or past unknown data for a specified item by deriving unknown derived data from high-dimensional observation data. is there.

本発明によれば、経時的に変化する事象に関するｎ次元の項目それぞれについて時系列で観測して得られたｍ点の観測データに基づいて、未知の時点のデータを導出することができる。これにより、ｎ次元の項目の観測データ相互の関係から未知の時点のデータを導出し、事象を推定することが可能である等、優れた効果を奏する。 According to the present invention, unknown point-in-time data can be derived based on m-point observation data obtained by observing each n-dimensional item related to an event that changes over time in time series. As a result, it is possible to derive data at an unknown time point from the relationship between the observation data of n-dimensional items, and to obtain an excellent effect such as being able to estimate an event.

Takensの埋め込み理論による動的システムの再構築を概念的に示す説明図である。It is explanatory drawing which shows notionally the reconstruction of the dynamic system by Takens embedding theory. 本発明に係る導出装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the derivation | leading-out apparatus based on this invention. 本発明に係る導出装置の線形性時系列データの導出処理の一例を示すフローチャートである。It is a flowchart which shows an example of the derivation | leading-out process of the linearity time series data of the derivation | leading-out apparatus based on this invention. 本発明に係る導出装置の非線形性時系列データの導出処理の一例を示すフローチャートである。It is a flowchart which shows an example of the derivation | leading-out process of the nonlinear time series data of the derivation | leading-out apparatus based on this invention. 本発明に係る導出装置が取得する観測データの一例を示す説明図である。It is explanatory drawing which shows an example of the observation data which the derivation device concerning the present invention acquires. 本発明に係る導出装置が出力する導出データの一例を示す説明図である。It is explanatory drawing which shows an example of the derivation data which the derivation | leading-out apparatus based on this invention outputs.

＜時系列データの推測＞
まず、本発明の前提として、時系列データについて説明する。経時的に変化する事象をｎ次元（ｎは２以上の整数）の項目を含む動的システム（系）と捉えた場合、動的システムは、ｎ次元の状態ベクトルを示す下記の式（１）として表すことができる。 <Estimation of time series data>
First, time series data will be described as a premise of the present invention. When an event that changes over time is regarded as a dynamic system (system) including items of n dimensions (n is an integer of 2 or more), the dynamic system represents the following equation (1) indicating an n-dimensional state vector: Can be expressed as

ｘ（ｔ）＝（ｘ₁（ｔ），ｘ₂ （ｔ），... ，ｘ_n （ｔ））^T∈Ｒⁿ …式（１） x (t) = (x ₁ (t), x ₂ (t),..., x _n (t)) ^T ∈ R ⁿ (1)

式（１）において、ｘ_j （ｔ），（ｊ＝１，２，... ，ｎ）は動的システムの状態変数、ｎは系の次元、そしてｔは観測時刻等の時点をそれぞれ示している。また、Ｒⁿは、ｎ次元のユークリッド空間を示している。状態変数ｘ_j（ｔ）は経時的に変化する変数であり、連続システムの場合、状態変数の変化はｄｘ（ｔ）／ｄｔ＝Ｆ（ｘ（ｔ））と表され、離散システムの場合、状態変数の変化はｘ（ｔ＋１）＝Ｆ（ｘ（ｔ））と表される。 In Expression (1), x _j (t), (j = 1, 2,..., N) are state variables of the dynamic system, n is the dimension of the system, and t is a time point such as an observation time. ing. R ⁿ represents an n-dimensional Euclidean space. The state variable x _j (t) is a variable that changes with time. In the case of a continuous system, the change in the state variable is expressed as dx (t) / dt = F (x (t)). The change of the state variable is expressed as x (t + 1) = F (x (t)).

動的システムの構成、性質、外部からの影響等の要因が明確で数式化することができる場合、Ｆ（ｘ）を直接求めることにより、系の動的特性（状態変数ｘ_j（ｔ）（ｔ＝０→Ｔ）の値）を導出することができる。一方、動的システムの構成、性質、外部からの影響等の要因が不明確で数式化することができない場合、系の動的特性を導出することができない。そこで、過去の一定期間（ｔ＝ｔ−τ，ｔ−２τ，... ，ｔ−ｋτ）（τは時間間隔）に観測された状態変数ｘ_j（ｔ）の一部又は全部のデータに基づいて、特定の状態変数ｘ_s （ｔ）の将来の一定期間（ｔ＝ｔ＋τ，ｔ＋２τ，... ，ｔ＋（Ｌ−１）τ）のデータを推測する動的システムにおける時系列データの推測が行われる。 When factors such as the structure, properties, and external influences of a dynamic system are clear and can be mathematically expressed, the dynamic characteristics of the system (state variable x _j (t) ( The value of t = 0 → T)) can be derived. On the other hand, when factors such as the configuration, properties, and external influences of the dynamic system are unclear and cannot be mathematically expressed, the dynamic characteristics of the system cannot be derived. Therefore, a part or all of the data of the state variable x _j (t) observed in the past fixed period (t = t−τ, t−2τ,..., T−kτ) (τ is a time interval) Based on the estimation of time series data in a dynamic system that estimates data of a specific state variable x _s (t) for a certain period in the future (t = t + τ, t + 2τ,..., T + (L−1) τ) Is done.

また、動的システムの時系列データは、期間及び次元数に基づいて、長期的低次元時系列データ、短期的高次元時系列データ等の区分に分類することができる。長期的低次元時系列データとは、比較的少数の状態変数（例えば、ｘ₁（ｔ），ｘ₂（ｔ），ｘ₃（ｔ）)について、比較的長い期間の多数の時点（例えば、ｔ＝ｔ₁，ｔ₂，... ，ｔ_k）におけるデータ（［ｘ₁ （ｔ₁ ），ｘ₁ （ｔ₂ ），... ｘ₁ （ｔ_k ）］^T ，［ｘ₂ （ｔ₁ ），ｘ₂ （ｔ₂ ），... ｘ₂ （ｔ_k ）］^T，［ｘ₃ （ｔ₁ ），ｘ₃ （ｔ₂ ），... ｘ₃ （ｔ_k ）］^T）である。短期的高次元時系列データとは、比較的多数の状態変数（例えば、ｘ₁（ｔ），ｘ₂（ｔ），... ，ｘ_n（ｔ）)について、比較的短い期間の少数の時点（例えば、ｔ＝ｔ₁，ｔ₂，... ，ｔ_m）（ｍ≪ｎ）におけるデータ（［ｘ₁ （ｔ₁ ），ｘ₁ （ｔ₂ ），... ｘ₁ （ｔ_m ）］^T ，［ｘ₂ （ｔ₁ ），ｘ₂ （ｔ₂ ），... ｘ₂ （ｔ_m ）］^T，... ，［ｘ_n （ｔ₁ ），ｘ_n （ｔ₂ ），... ｘ_n （ｔ_m ）］^T）である。 In addition, the time series data of the dynamic system can be classified into categories such as long-term low-dimensional time series data and short-term high-dimensional time series data based on the period and the number of dimensions. Long-term low-dimensional time-series data means that a relatively small number of state variables (eg, x ₁ (t), x ₂ (t), x ₃ (t)) t = t ₁ , t ₂ ,..., t _k ) ([x ₁ (t ₁ ), x ₁ (t ₂ ),... x ₁ (t _k )] ^T , [x ₂ (t ₁ ), x ₂ (t ₂ ), ... x ₂ (t _k )] ^T , [x ₃ (t ₁ ), x ₃ (t ₂ ), ... x ₃ (t _k )] ^T ) is there. Short-term high-dimensional time-series data means that a relatively small number of state variables (eg, x ₁ (t), x ₂ (t),..., X _n (t)) Data ([x ₁ (t ₁ ), x ₁ (t ₂ ),... X ₁ (t _m ) at a time point (for example, t = t ₁ , t ₂ ,..., T _m ) ( _m << _n ) )] ^T , [x ₂ (t ₁ ), x ₂ (t ₂ ), ... x ₂ (t _m )] ^T , ..., [x _n (t ₁ ), x _n (t ₂ ), ... it is a _{_{^{x n (t m)] T}}} ).

従来の時系列データの予測は、過去に観測された長期的低次元時系列データを用いて、特定の状態変数についての将来の時系列データを推測する方法である。それに対して、本発明では、過去に観測された短期的高次元時系列データを用いて、特定の状態変数の将来の時系列データを推測することに適している。なお、時系列データの推測は、状態変数が観測された期間より後の将来の時系列データに限らず、状態変数が観測された期間より前の過去の時系列データについても行うことが可能である。即ち、既知の短期的高次元時系列データを、未知の時系列データを含む長期的低次元時系列データに変換することに適している。しかも、推測の対象となる状態変数を変更することにより、全ての状態変数について未知の時系列データを推測することも可能である。 Conventional prediction of time-series data is a method for estimating future time-series data for a specific state variable using long-term low-dimensional time-series data observed in the past. On the other hand, the present invention is suitable for estimating future time series data of a specific state variable using short-term high-dimensional time series data observed in the past. Note that time-series data can be estimated not only for future time-series data after the period when the state variable is observed, but also for past time-series data before the period when the state variable is observed. is there. That is, it is suitable for converting known short-term high-dimensional time-series data into long-term low-dimensional time-series data including unknown time-series data. Moreover, it is possible to estimate unknown time-series data for all the state variables by changing the state variables to be estimated.

＜短期的高次元時系列データと長期的低次元時系列データとの関係＞
次に短期的高次元時系列データと長期的低次元時系列データとの関係について説明する。図１は、Takensの埋め込み理論による動的システムの再構築を概念的に示す説明図である。Takensの埋め込み理論は、動的システムにおいて、長期的低次元時系列データから動的特性を再構築することが可能であることを示している。 <Relationship between short-term high-dimensional time-series data and long-term low-dimensional time-series data>
Next, the relationship between short-term high-dimensional time-series data and long-term low-dimensional time-series data will be described. FIG. 1 is an explanatory diagram conceptually showing reconstruction of a dynamic system based on Takens' embedding theory. Takens' embedding theory shows that in dynamic systems, it is possible to reconstruct dynamic properties from long-term low-dimensional time-series data.

図１（ａ）は、状態変数ｘ₁ ，ｘ₂ ，ｘ₃ からなる３次元の動的システムのアトラクタを概念的に示している。図１（ａ）に示す動的システムに対し、十分な長さを有する期間Ｔの間、３個の状態変数のうちのｘ₁を観測して得られた時系列データをｙ＝ｘ₁（ｔ）として示したものが、図１（ｂ）に示すグラフである。即ち、図１（ｂ）は、３次元の動的システムを観測して得られた長期的低次元時系列データを示している。 FIG. 1A conceptually shows an attractor of a three-dimensional dynamic system composed of state variables x ₁ , x ₂ and x ₃ . For the dynamic system shown in FIG. 1A, time series data obtained by observing x ₁ of the three state variables during a period T having a sufficient length is expressed as y = x ₁ ( What is shown as t) is the graph shown in FIG. That is, FIG. 1B shows long-term low-dimensional time series data obtained by observing a three-dimensional dynamic system.

図１（ｂ）に示す観測された長期的低次元時系列データを基に、時間遅延量「τ」についての遅延埋め込みにより生成した期間「Ｔ−２τ」の時系列データを示したものが、図１（ｃ）に示すグラフである。図１（ｃ）は、ｚ₁（ｔ）＝ｙ（ｔ）、ｚ₂（ｔ）＝ｙ（ｔ＋τ）及びｚ₃（ｔ）＝ｙ（ｔ＋２τ）として示される３次元時系列データを示している。 Based on the observed long-term low-dimensional time-series data shown in FIG. 1B, the time-series data of the period “T-2τ” generated by delay embedding for the time delay amount “τ” is shown. It is a graph shown in FIG.1 (c). FIG. 1C shows three-dimensional time-series data represented as z ₁ (t) = y (t), z ₂ (t) = y (t + τ) and z ₃ (t) = y (t + 2τ). Yes.

図１（ｃ）に示した３次元時系列データ（ｚ₁，ｚ₂，ｚ₃）からなる位相空間として、Ｐｎ＝（ｚ₁ （ｔ_n ），ｚ₂ （ｔ_n ），ｚ₃ （ｔ_n ））として示される動的システムのアトラクタを再構築したものが図１（ｄ）に示す概念図である。図１（ｄ）は、３次元の時系列データｚ₁ （ｔ_n ），ｚ₂ （ｔ_n ），ｚ₃ （ｔ_n ）からなる位相空間において、Ｐｎ＝（ｚ₁ （ｔ_n ），ｚ₂ （ｔ_n ），ｚ₃ （ｔ_n ））として示される動的システムのアトラクタを概念的に示している。図１（ｄ）に示すアトラクタは、図１（ａ）に示すアトラクタと位相幾何学的に同相である。 As a phase space composed of the three-dimensional time series data (z ₁ , z ₂ , z ₃ ) shown in FIG. 1C, Pn = (z ₁ (t _n ), z ₂ (t _n ), z ₃ (t _The conceptual diagram shown in FIG. 1 (d) is obtained by reconstructing the attractor of the dynamic system shown as _n )). FIG. 1 (d) shows Pn = (z ₁ (t _n ), z in a phase space consisting of three-dimensional time series data z ₁ (t _n ), z ₂ (t _n ), z ₃ (t _n ). _{_{_{2 (t n), z 3}}} (t n)) conceptually illustrates the attractor dynamic system shown as. The attractor shown in FIG. 1 (d) is topologically in phase with the attractor shown in FIG. 1 (a).

図１を用いて説明したようにTakensの埋め込み理論は、長期的低次元時系列データから短期的高次元時系列データを導出する方法を提供している。本願では、高次元の時系列データ、例えば短期的高次元時系列データを、長期的低次元時系列データに変換する変換マップを用いる方法を開示する。変換マップは、各変数間の相互作用情報を抽出し、更にその相互作用情報を時系列情報に変換することができるので、短期的高次元時系列データから長期的低次元時系列データを導出することが可能となる。このように、本願では、例えば、Takensの埋め込み理論を用いて行われた長期的低次元時系列データから短期的高次元時系列データへの変換の逆変換として、従来は実現されていなかった短期的高次元時系列データから長期的低次元時系列データへの変換をも可能とする方法を開示するものである。 As described with reference to FIG. 1, Takens' embedding theory provides a method for deriving short-term high-dimensional time-series data from long-term low-dimensional time-series data. The present application discloses a method using a conversion map for converting high-dimensional time-series data, for example, short-term high-dimensional time-series data into long-term low-dimensional time-series data. The transformation map can extract the interaction information between each variable, and further convert the interaction information into time-series information, so that long-term low-dimensional time-series data is derived from short-term high-dimensional time-series data. It becomes possible. Thus, in the present application, for example, as a reverse conversion of conversion from long-term low-dimensional time-series data to short-term high-dimensional time-series data performed using Takens embedding theory, short-term that has not been realized in the past Disclosed is a method that enables conversion from static high-dimensional time-series data to long-term low-dimensional time-series data.

＜逆埋め込み理論（逆変換）の概要＞
時系列データの変換に用いる変換マップΦについて説明する。ユークリッド空間Ｒⁿ の開部分集合ＵにおけるフローＦ、ボックス次元がｄである開部分集合Ｕのコンパクト部分集合Ａ、及び整数Ｌ（Ｌ＞２ｄ）を規定し、時間間隔τ（τ＞０）が適切な値をとるものとする。この場合、開部分集合Ｕにおけるほとんど全てのスムーズ関数ｙ及び帰結遅延座標ｚ＝（ｙ（ｘ），ｙ（Ｆ_T （ｘ）），... ，ｙ（Ｆ_(L-1)T（ｘ）））^T （ｘ∈Ａ）に対して、変換マップΦ：Ｒⁿ →Ｒ^L が存在する（下付添字のT 、(L-1)Tは、τ、(L-1) τを示す。）。このときΦ（ｘ）＝ｚは、一対一であり、コンパクト部分集合Ａに含まれる滑らかな多様体の任意のコンパクト部分集合上でのはめ込みである。そして、変換マップΦは、コンパクト部分集合Ａに含まれる状態ｘをフローＦに沿って、将来の状態ｙ（Ｆ_iT（ｘ）），（ｉ＝１，２，... ，Ｌ−１）を示すスカラー関数に変換することができる（下付添字のiTはi τを示す）。 <Outline of reverse embedding theory (inverse transformation)>
The conversion map Φ used for converting time series data will be described. The flow F in the open subset U of the Euclidean space R ⁿ , the compact subset A of the open subset U whose box dimension is d, and the integer L (L> 2d) are defined, and the time interval τ (τ> 0) is Appropriate values shall be taken. In this case, almost all smooth functions y and resulting delay coordinates z = (y (x), y (F _T (x)),..., Y (F _{(L−1) T} (x ))) For ^T (x∈A), there is a transformation map Φ: R ⁿ → ^RL (subscripts T and (L-1) T denote τ and (L-1) τ) .) At this time, Φ (x) = z is one-to-one, and is an inset on an arbitrary compact subset of a smooth manifold included in the compact subset A. Then, the transformation map Φ shows the state x included in the compact subset A along the flow F and the future state y (F _iT (x)), (i = 1, 2,..., L−1). (Subscript iT indicates i τ).

逆埋め込み理論に基づく短期的高次元時系列データから長期的低次元時系列データへの変換について詳述する。前述の式（１）を用いてｎ次元の状態ベクトルとして示した動的システムの系ｘ（ｔ）には、ボックス次元ｄのコンパクト部分集合Ａで表されるアトラクタが存在する。また、状態変数ｘ_i（ｔ）を用いた関数としてｙ（ｔ）を規定する。ここでは、簡単にするために、ｙ（ｔ）＝ｘ_i （ｔ）とする。さらに、ｚ₁（ｔ）＝ｙ（ｔ），ｚ₂（ｔ）＝ｙ（ｔ＋τ），... ，ｚ_L（ｔ）＝ｙ（ｔ＋（Ｌ−１）τ）（τ：遅延定数）のようにｙ（ｔ）の遅延埋め込みを行うことにより、下記式（２）として示される新たな系を構築することができる。 The conversion from short-term high-dimensional time-series data to long-term low-dimensional time-series data based on reverse embedding theory will be described in detail. In the system x (t) of the dynamic system shown as an n-dimensional state vector using the above equation (1), there is an attractor represented by a compact subset A of the box dimension d. Further, y (t) is defined as a function using the state variable x _i (t). Here, for simplicity, y (t) = x _i (t). Further, z ₁ (t) = y (t), z ₂ (t) = y (t + τ),..., Z _L (t) = y (t + (L−1) τ) (τ: delay constant) By performing delay embedding of y (t) as in the following, a new system represented by the following formula (2) can be constructed.

ｚ（ｔ）＝（ｚ₁ （ｔ）, ｚ₂ （ｔ）,... ,ｚ_L （ｔ））^T ∈Ｒ^L ・・・式（２） z (t) = (z ₁ (t), z ₂ (t),..., z _L (t)) ^T ∈ R ^L (2)

式（２）においてｚ（ｔ）は再構築された系∈Ｒ^Lの状態変数を示し、Ｌは当該系の次元を示している。 Z (t) in Equation (2) shows a state variable of the system ∈R ^L reconstructed, L is shows the dimensions of the system.

非特許文献１として示したTakensの埋め込み理論及びSauer 、Yorke 、Casdagliのフラクタルアトラクタに対する一般化された埋め込み理論によると、Ｌ＞２ｄの場合再構築された系ｚ（ｔ）のアトラクタは元の系ｘ（ｔ）と位相幾何学的に同相である。したがって、系ｘ（ｔ）のユークリッド空間Ｒⁿ と系ｚ（ｔ）のユークリッド空間Ｒ^Lとの間にマップΦ（Φ（ｘ（ｔ）））＝ｚ（ｔ））が存在する。マップΦは、系ｘ（ｔ）を写像して系ｚ（ｔ）に変換する変換行列である。マップΦ下では、位相幾何学的に同相という観点から、一次元の時系列データｙ（ｔ）は、ｘ（ｔ）のｍ点の時点のデータと同様のアトラクタ情報を保有している。したがって、当該マップΦによって、系ｘ（ｔ）∈Ｒⁿは系ｚ（ｔ）∈Ｒ^Lに変換される。即ち、マップΦは、始域ｘ（ｔ）∈Ｒⁿから終域ｚ（ｔ）∈Ｒ^Lへの写像である。 According to Takens' embedding theory and the generalized embedding theory for Sauer, Yorke and Casdagli's fractal attractor shown as Non-Patent Document 1, the attractor of the reconstructed system z (t) is the original system when L> 2d. Topologically in phase with x (t). Therefore, a map Φ (Φ (x (t))) = z (t)) exists between the Euclidean space R ⁿ of the system x (t) and the Euclidean space R ^L of the system z (t). The map Φ is a transformation matrix that maps the system x (t) and transforms it into the system z (t). Under the map Φ, from the viewpoint of topologically in-phase, the one-dimensional time series data y (t) has the same attractor information as the data at the point m of x (t). Thus, by the map [Phi, system x (t) ∈R ⁿ are converted to the system z (t) ∈R ^L. That is, the map Φ is a mapping from the start region x (t) εR ⁿ to the end region z (t) εR ^L.

マップΦについて更に詳述する。時間間隔τでｍ点観測したｎ次元の時系列データを、状態変数ｘ（ｔ）∈Ｒⁿを用いてｘ_j ＝ｘ（ｔ＋（ｊ−１）τ），（ｊ＝１，２，... ，ｍ）で表すものとすると、マップΦは、Φ（ｘ₁，ｘ₂，... ，ｘ_m）又は下記の式（３）として得ることができる。 The map Φ will be further described in detail. The n-dimensional time series data observed at m points at time interval τ is ^expressed as x _j = x (t + (j−1) τ), (j = 1, 2,...) Using state variable x (t) ∈R ⁿ . .., M), the map Φ can be obtained as Φ (x ₁ , x ₂ ,..., X _m ) or the following equation (3).

上記式（３）において、ｘ_ij＝ｘ_i （ｔ＋（ｊ−１）τ）、ｚ_ij＝ｚ_i （ｔ＋（ｊ−１）τ）であり、ｚ_ij＝ｙ（ｔ＋（ｉ＋ｊ−２）τ）であることから、マップΦは、下記の式（４）に書き換えることができる。 In the above equation (3), x _ij = x _i (t + (j−1) τ), z _ij = z _i (t + (j−1) τ), and z _ij = y (t + (i + j−2)) Since τ), the map Φ can be rewritten as the following equation (4).

Φ：Ｘ→Ｙ・・・式（４） Φ: X → Y (4)

式（４）において、行列Ｘ及びＹは下記の通りである。 In Equation (4), the matrices X and Y are as follows.

以上のように、マップΦによって、ＸがＹに変換される。式（４）として示したように、Ｘは、ｎ次元の状態変数ｘ（ｔ）のそれぞれのｍ点の時点での観測データからなる高次元行列である。また、Ｙは、形式上Ｌ×ｍの行列であるが、状態変数ｘ（ｔ）のうちのいずれか１つの状態変数ｙ（ｔ）の（ｍ＋Ｌ−１）個の時点のデータを、列ごとに時点を１つ前方へずらして配列した行列である。このように配列されたＹは、状態変数ｙ（ｔ）として示される一次元の時系列データを複数集めたものとみなすこともできる。即ち、マップΦは、高次元時系列データＸを一次元（低次元）時系列データＹに変換する写像としての性質を有するということができる。 As described above, X is converted to Y by the map Φ. As shown in Expression (4), X is a high-dimensional matrix composed of observation data at the time of each m point of the n-dimensional state variable x (t). Y is a matrix of L × m in form, and the data at (m + L−1) time points of any one state variable y (t) of the state variables x (t) The matrix is arranged by shifting the time points forward by one. Y arranged in this way can be regarded as a collection of a plurality of one-dimensional time-series data indicated as the state variable y (t). That is, it can be said that the map Φ has a property as a map for converting the high-dimensional time-series data X into the one-dimensional (low-dimensional) time-series data Y.

状態変数ｙ（ｔ）は、状態変数ｘ（ｔ）として示される複数の次元（項目）の観測データのうちから選択された任意の一の次元（項目）について導出した長期間の一次元時系列データである。また、他の任意の次元（項目）の状態変数ｘ（ｔ）を選択し、新たな系ｚ（ｔ）を再構築することも可能である。従って、Ｘを複数のＹに変換することが可能であり、理論的にはＸに含まれる全ての状態変数ｘ（ｔ）について、一次元時系列データＹを導出することが可能である。 The state variable y (t) is a long-term one-dimensional time series derived for any one dimension (item) selected from observation data of a plurality of dimensions (items) indicated as the state variable x (t). It is data. It is also possible to select a state variable x (t) of any other dimension (item) and reconstruct a new system z (t). Therefore, it is possible to convert X into a plurality of Ys, and theoretically, it is possible to derive the one-dimensional time series data Y for all the state variables x (t) included in X.

元の系についての観測データを示す状態変数ｘ（ｔ）の時系列的に観測して得られた点の数がｍ点の場合、変換された一次元時系列データｙ（ｔ）に含まれる要素の個数は、（ｍ＋Ｌ−１）個となる。上述のようにＬ＞２ｄの場合、再構築された系のアトラクタは元の系のアトラクタと位相幾何学的に同相である。このため状態変数ｘ（ｔ）に係る観測データの数より変換後の時系列データｙ（ｔ）に含まれる要素の個数の方が多くなる。従って、マップΦにより、ｎ個の状態変数についてｍ点分の時点で観測した得られた短期的高次元時系列データは、１個の状態変数についてｍ＋Ｌ−１点分の時点における長期的低次元時系列データに変換することが可能であるということができる。本願では、このような変換を「逆埋め込み」と称する。 When the number of points obtained by time series observation of the state variable x (t) indicating the observation data for the original system is m, it is included in the converted one-dimensional time series data y (t). The number of elements is (m + L−1). As described above, when L> 2d, the attractor of the reconstructed system is topologically in phase with the attractor of the original system. For this reason, the number of elements included in the converted time-series data y (t) is larger than the number of observation data related to the state variable x (t). Accordingly, the obtained short-term high-dimensional time series data observed at the time of m points for the n state variables by the map Φ is the long-term low dimension at the time of m + L−1 points for one state variable. It can be said that it can be converted into time-series data. In the present application, such conversion is referred to as “reverse embedding”.

＜マップΦの決定方法＞
次に、マップΦの決定方法について説明する。上述の逆埋め込み理論による変換を行うためには、マップΦを決定しなければならない。マップΦの決定は、系ｘ（ｔ）∈Ｒⁿの各変数のｍ点の時点において観測された値ｘ_j＝ｘ（ｔ＋（ｊ−１）τ），（ｊ＝１，２，... ，ｍ）を用いて、Φ₁（ｘ_j）＝ｙ_j，（ｊ＝１，２，... ，ｍ）及びΦ_i（ｘ_j）＝Φ_i-1 （ｘ_j+1 ），（ｉ＝２，... ，Ｌ），（ｊ＝１，... ，ｍ−１）という（ｍ＋（Ｌ−１）×（ｍ−１））個の制約条件下で、Φ＝（Φ₁，Φ₂，... ，Φ_L）^T を導出することにより行われる。本願では、予測の精度を高めるために、系ｘ（ｔ）∈Ｒⁿの性質に応じて、マップΦが線形性を有する線形マップである場合と線形性を有しない非線形マップである場合とについて、それぞれのマップΦの決定方法について説明する。 <Determination method of map Φ>
Next, a method for determining the map Φ will be described. In order to perform the conversion based on the reverse embedding theory described above, the map Φ must be determined. The map Φ is determined by the values x _j = x (t + (j−1) τ), (j = 1, 2,...) Observed at the point m of each variable of the system x (t) ∈R ^n. , M), Φ ₁ (x _j ) = y _j , (j = 1, 2,..., M) and Φ _i (x _j ) = Φ _i−1 (x _{j + 1} ), Under (m + (L−1) × (m−1)) constraint conditions (i = 2,..., L), (j = 1,..., M−1), Φ = ( Φ ₁ , Φ ₂ , ... , Φ _L ) ^T. In the present application, in order to increase the accuracy of prediction, depending on the properties of the system x (t) εR ⁿ , the case where the map Φ is a linear map having linearity and the case of being a nonlinear map having no linearity A method for determining each map Φ will be described.

＜線形マップΦの決定＞
系ｘ（ｔ）∈Ｒⁿの変化に線形性がある場合、又は線形近似できる場合、変換マップΦとして線形性を有する線形マップΦを用いる。線形マップΦによる写像Φ（Ｘ）は、Ｌ行×ｎ列の行列Ａとして、下記の式（５）にて表現することができる。 <Determination of linear map Φ>
When the change of the system x (t) εR ⁿ has linearity or can be linearly approximated, the linear map Φ having linearity is used as the transformation map Φ. The mapping Φ (X) by the linear map Φ can be expressed by the following equation (5) as a matrix A of L rows × n columns.

Φ（Ｘ）＝ＡＸ＝Ｙ・・・式（５） Φ (X) = AX = Y (5)

式（５）において、行列Ｘ及びＹは、前述の式（４）として示した通りであり、行列Ａは下記の通りである。 In the equation (5), the matrices X and Y are as shown as the above equation (4), and the matrix A is as follows.

Ｘは、各状態変数ｘ（ｔ）∈Ｒⁿのｍ点の時点で観測された観測データからなるｎ行×ｍ列の行列である。Ｙは、ｘ（ｔ）のうちのいずれか１つの状態変数ｙ（ｔ）のｍ＋Ｌ−１点分の時点のデータを一列ごとに時点を前方へ１つずらして配列したＬ行×ｍ列の行列である。さらに、式（５）として示したＡＸ＝Ｙは、下記の式（６）として表すことができる。 X is a matrix of n rows × m columns composed of observation data observed at m points of each state variable x (t) ∈R ⁿ . Y is an L row × m column in which data of m + L−1 points of any one state variable y (t) in x (t) is arranged by shifting the time point forward by one for each column. It is a matrix. Furthermore, AX = Y shown as the equation (5) can be expressed as the following equation (6).

式（６）において、ａⁱ ＝（ａ_i1，ａ_i2，... ，ａ_in），（ｉ＝１，２，... ，Ｌ）、ｘ_j ＝（ｘ_1j，ｘ_1j，... ，ｘ_1j）^T ，（ｊ＝１，２，... ，ｍ）である。式（６）から、ｘ₁，ｘ₂，... ，ｘ_mについて得られた観測データｙ₁ ，ｙ₂ ，... ，ｙ_m に対して、ｍ個の線形方程式ａⁱ ・ｘ_j ＝ｙ_j ，（ｊ＝１，２，... ，ｍ）と、（ｍ−１）×（Ｌ−１）個の線形方程式ａⁱ ・ｘ_j −ａ^i-1 ・ｘ_j+1 ＝０，（ｉ＝２，３，... ，ｎ、ｊ＝１，２，... ，ｍ−１）を得ることができる。ｙ（ｔ）において、ｙ₁ ，ｙ₂ ，... ，ｙ_m が既知の観測データであり、ｙ_m+1 ，ｙ_m+2 ... ，ｙ_L+m-1 が推測の対象となる未知の時系列データである。従って、式（６）として示した式から、ｙ（ｔ）に含まれる未知の時系列データを算出することにより、ｘ（ｔ）から選択された状態変数ｙ（ｔ）の時点ｍより後の時系列データを算出することができる。即ち、選択された次元（項目）に係る状態変数ｘ（ｔ）の未知の時系列データを導出することが可能となり、導出した時系列データに基づいて観測対象の状態変化を推測することが可能となる。 In equation (6), a ⁱ = (a _i1 , a _i2 ,..., A _in ), (i = 1, 2,..., L), x _j = (x _1j , x _1j ,... , X _1j ) ^T , (j = 1, 2,..., M). From equation _{(6), x 1, x} 2, ..., observations were obtained for x _m data y _1, y _2, ..., with respect to y _m, m pieces of linear equations a ⁱ · x _j = Y _j , (j = 1, 2,..., M) and (m−1) × (L−1) linear equations a ⁱ · x _j −a ⁱ⁻¹ · x _{j + 1} = 0, (i = 2, 3,..., N, j = 1, 2,..., M−1) can be obtained. In _{y (t), y 1,} y 2, ..., y m are known observation _{_{data, y m + 1, y m}} + 2 ..., and the target of y _{L + m-1} is presumed This is unknown time-series data. Therefore, by calculating unknown time-series data included in y (t) from the equation shown as equation (6), the state variable y (t) selected from x (t) is later than time m. Time series data can be calculated. That is, it is possible to derive unknown time-series data of the state variable x (t) related to the selected dimension (item), and it is possible to estimate the state change of the observation target based on the derived time-series data. It becomes.

次に線形マップΦに基づく計算アルゴリズムについて説明する。先ず、観測された時系列データｘ₁，ｘ₂ ，... ，ｘ_m を取得し、選択された項目に係る状態変数ｘ_s （ｔ）に係る観測データに、時系列データを示す変数ｙ₁，ｙ₂ ，... ，ｙ_mを割り付ける。これらの時系列データから下記の式（７）に示す数式を生成し、生成した数式を用いたａ¹ についての一次方程式を解いて、ａ¹ を求める（手順Ｓａ１）。 Next, a calculation algorithm based on the linear map Φ will be described. First, the observed time series data x ₁ , x ₂ ,..., X _m are acquired, and the variable y indicating the time series data is added to the observed data related to the state variable x _s (t) related to the selected item. _1, y _2, ..., allocates y _m. A mathematical formula shown in the following formula (7) is generated from these time series data, and a linear equation for a ¹ using the generated mathematical formula is solved to obtain a ¹ (procedure Sa1).

ａ¹ ・ｘ_j ＝ｙ_j ・・・式（７）
但し、ｊ＝１，２，... ,ｍ a ¹ · x _j = y _j (7)
Where j = 1, 2,..., M

次に、ｉ＝２，３，... ,Ｌについて示した下記の式（８）に基づいて、ａⁱ についての一次方程式をｉ＝２，３，... ，Ｌの順に生成し、生成した数式を用いたａⁱ についての一次方程式を解く手順を繰り返し行いａⁱ を求める（手順Ｓａ２）。 Then, i = 2,3, ..., based on the equation (8) below shows the L, and the primary equation for a ^{i i} = 2,3, ..., generated in the order of L, The procedure for solving the linear equation for a ⁱ using the generated mathematical expression is repeated to obtain a ⁱ (procedure Sa2).

ａⁱ ・ｘ_j ＝ｙ_i+j-1 ・・・式（８）
但し、ｊ＝１，２，... ，ｍ−１ a ⁱ · x _j = y _{i + j−1} Expression (8)
Where j = 1, 2,..., M−1.

式（７）及び（８）に基づき算出した係数ａⁱ ，（ｉ＝１，２，... ，Ｌ）を用いて、ｍ以降の時点における時系列データｙ_m+i-1 を下記の式（９）により算出する（手順Ｓａ３）。 Using the coefficients a ⁱ , (i = 1, 2,..., L) calculated based on the equations (7) and (8), the time series data y _{m + i−1} at the time after m is expressed as follows: Calculation is performed according to equation (9) (procedure Sa3).

ｙ_m+i-1 ＝ａⁱ ・ｘ_m ・・・式（９） y _{m + i-1} = a ⁱ · x _m (9)

式（９）により算出された時系列データｙ_m+i-1 が、推測すべき未知の時系列データである。 The time series data ym _{+ i-1} calculated by the equation (9) is unknown time series data to be estimated.

このような線形マップΦに基づく計算アルゴリズムでは、ｎ×Ｌ個の未知数に対して、（ｍ−１）×Ｌ＋１個の線形方程式が存在し、これらの線形方程式を解くために繰り返し演算が実施される。時系列データＸとして、短期的高次元時系列データＸを用いる場合、ｎ≫ｍであり、ｎ×Ｌ≫（ｍ−１）×Ｌ＋１である。通常、未知の変数の数が線形方程式の数よりも遙かに多い場合、解を求めることができない。本願では、元の系の次元が高く、しかも一般的にｍ点の時点におけるデータが系の漸近的アトラクタから観測されたものであるため、その次元ｄが低く、ｄ≪ｎとなる系に適用することを前提としている。Ｌはｄより若干大きい整数であるのでＬ≪ｎとなる。従って、ｎ×Ｌ個の未知の変数には線形的に独立である変数が少ない状態となる。従って、一定の条件を満足すれば、解を求めることが可能である。 In such a calculation algorithm based on the linear map Φ, there are (m−1) × L + 1 linear equations for n × L unknowns, and iterative operations are performed to solve these linear equations. The When the short-term high-dimensional time-series data X is used as the time-series data X, n >> m and n × L >> (m−1) × L + 1. Usually, if the number of unknown variables is much larger than the number of linear equations, a solution cannot be found. In this application, since the dimension of the original system is high and the data at the point m is generally observed from the asymptotic attractor of the system, the dimension d is low and applied to a system where d << n. It is assumed that Since L is an integer slightly larger than d, L << n. Therefore, there are few variables that are linearly independent of n × L unknown variables. Therefore, a solution can be obtained if a certain condition is satisfied.

次に、解を求める条件について説明する。上述の式（８）は、下記の式（１０）として表すことができる。 Next, conditions for obtaining a solution will be described. The above equation (8) can be expressed as the following equation (10).

ａⁱ ・ｘ_j ＝ｙ_i+j-1 ＝ａ^i-1 ・ｘ_j+1 ・・・式（１０） a ⁱ · x _j = y _{i + j -1} = a ^{i -1} · x _{j +1} (10)

式（１０）において、ａ^i-1 には、決定すべきＬ個の変数が含まれている。線形方程式理論に基づき、係数行列のランクによって、線形方程式の解を求められるか否かが決まる。ここで、下記の式（１１）に示すようにＸ~ （Ｘ~ は、Ｘの上方にチルダ、以降の説明において同じ。）を規定するものとする。 In equation (10), a ^i-1 includes L variables to be determined. Based on the theory of linear equations, the rank of the coefficient matrix determines whether or not the solution of the linear equations can be obtained. Here, as shown in the following formula (11), X ~ (X ~ is a tilde above X, the same in the following description) is defined.

上述の計算アルゴリズムにおいて、ｒａｎｋ（Ｘ~ ）＞Ｌの場合、解を求めることができない。一方、ｒａｎｋ（Ｘ~ ）≦Ｌの場合、解を求めることが可能であり、特に、ｒａｎｋ（Ｘ~ ）＝Ｌの場合、解を正確に求めることができる。当該条件を満足するように観測する時点の数ｍを適切な値に設定することにで、解を求めることが可能となり、特にｍ＝Ｌ＋１となるように設定することで、線形マップΦを正確に求めることが可能となる。 In the above calculation algorithm, when rank (X ~)> L, a solution cannot be obtained. On the other hand, when rank (X˜) ≦ L, a solution can be obtained. In particular, when rank (X˜) = L, the solution can be obtained accurately. It is possible to obtain a solution by setting the number m of observation time points to an appropriate value so as to satisfy the condition. In particular, by setting m = L + 1, the linear map Φ can be accurately set. It becomes possible to ask for.

＜非線形マップΦの決定＞
系ｘ（ｔ）∈Ｒⁿの変化に線形性が無く、また線形近似することができない程度に非線形性の強い系に対しては、上述の線形マップΦを用いても、高精度な予測を行うことは困難である。このような非線形性の強い系に対して対応することが可能な非線形マップΦを決定する方法について説明する。非線形マップΦの各要素は、下記の式（１２）を用いて表される。 <Determination of nonlinear map Φ>
System x (t) ∈R ⁿ without linearity in a change in, and for the highly nonlinear system to the extent that they can not be linearly approximated, even using a linear map Φ described above, a highly accurate prediction It is difficult to do. A method of determining a nonlinear map Φ that can be applied to such a highly nonlinear system will be described. Each element of the nonlinear map Φ is expressed using the following equation (12).

式（１２）において、ｇ_l （ｘ），ｌ＝１，２，... ，ｋは、非線形マップΦにおける基底関数であり、ａ_li，（ｌ＝１，２，... ，ｋ、ｉ＝１，２，... ，Ｌ）は、それぞれの係数である。基底関数ｇ_l （ｘ）としては、多次元方程式、三角関数、フーリエ級数に係る関数、ニューラルネットワークの近似関数、ファジー集合に係る関数、各種フラクタル関数等の様々な非線形関数を用いることができる。 In Expression (12), g _l (x), l = 1, 2,..., K is a basis function in the nonlinear map Φ, and a _li , (l = 1, 2,..., K, i = 1, 2,..., L) are the respective coefficients. As the basis function g _l (x), various nonlinear functions such as a multidimensional equation, a trigonometric function, a function related to a Fourier series, an approximate function of a neural network, a function related to a fuzzy set, and various fractal functions can be used.

このような式（１２）を用いることにより、前述の線形マップΦの求め方を応用して、非線形マップΦを求めることが可能となる。ただし、ｋの数は、観測する時点ｍの数と比べて遙かに多くなりがちであるため、式（１２）に基づいて非線形マップΦを求めることが困難な場合も生じる。このような場合でも、非線形マップΦを求めることは可能である。本願では、非線形マップΦを求める方法として下記の方法を開示する。 By using the equation (12), the nonlinear map Φ can be obtained by applying the above-described method for obtaining the linear map Φ. However, since the number of k tends to be much larger than the number of observation points m, it may be difficult to obtain the nonlinear map Φ based on the equation (12). Even in such a case, it is possible to obtain the nonlinear map Φ. In the present application, the following method is disclosed as a method for obtaining the nonlinear map Φ.

非線形マップΦに基づく計算アルゴリズムについて説明する。先ず、観測された時系列データｘ₁，ｘ₂ ，... ，ｘ_m を取得し、選択された項目に係る状態変数ｘ_s （ｔ）に係る観測データに、時系列データを示す変数ｙ₁，ｙ₂ ，... ，ｙ_mを割り付ける。そして、基底関数ｇ_l （ｘ），ｌ＝１，２，... ，ｋ及びその展開次数ｋを選択する（手順Ｓｂ１）。 A calculation algorithm based on the nonlinear map Φ will be described. First, the observed time series data x ₁ , x ₂ ,..., X _m are acquired, and the variable y indicating the time series data is added to the observed data related to the state variable x _s (t) related to the selected item. _1, y _2, ..., allocates y _m. Then, a basis function g _l (x), l = 1, 2,..., K and its expansion order k are selected (step Sb1).

次に、Φ_i （ｘ_j ）＝ｘ~ _j ^T ・α_i となるようにｘ~ _j ＝［ｇ₁ （ｘ_j ），ｇ₂ （ｘ_j ），... ，ｇ₃ （ｘ_j ），］^T （但し、ｊ＝１，２，... ，ｍ）及びα_i を構築する（手順Ｓｂ２）。 Next, x _j = [g ₁ (x _j ), g ₂ (x _j ),..., G ₃ (x _j ) so that Φ _i (x _j ) = _x˜j ^T · α _i. ,] ^T (where j = 1, 2,..., M) and α _i are constructed (procedure Sb2).

次に、圧縮センシング手法により下記の式（１３）に示す数式を生成し、生成した数式を解いてα₁ を求める（手順Ｓｂ３）。 Next, the mathematical formula shown in the following formula (13) is generated by the compression sensing technique, and α ₁ is obtained by solving the generated mathematical formula (procedure Sb3).

ｘ~ _j ^T ・α₁ ＝ｙ_j ・・・式（１３）
但し、ｊ＝１，２，... ，ｍ x ~ _j ^T · α ₁ = y _j Expression (13)
Where j = 1, 2,..., M

次に、ｉ＝２，３，... ,Ｌについて示した下記の式（１４）に基づいて、α_i についての数式をｉ＝２，３，... ,Ｌの順に生成し、生成した数式を用いたα_i についての数式を圧縮センシング手法により解く手順を繰り返し行いα_i を求める（手順Ｓｂ４）。 Next, based on the following formula (14) shown for i = 2, 3,..., L, formulas for α _i are generated in the order of i = 2, 3,. The procedure for solving the mathematical expression for α _i using the above mathematical formula is repeated by the compression sensing technique to obtain α _i (procedure Sb4).

ｘ~ _j ^T ・α_i ＝ｙ_i+j-1 ・・・式（１４）
但し、ｊ＝１，２，... ,ｍ−１ _{^{_{x ~ j T · α i =}}} y i + j-1 ··· formula (14)
Where j = 1, 2,..., M−1.

式（１４）に基づき算出した係数α_i を用いて、ｍ以降の時点における時系列データｙ_m+i-1 を下記の式（１５）により算出する（手順Ｓｂ５）。 Using the coefficient α _i calculated based on the equation (14), time series data ym _{+ i−1} at the time point after _{m is} calculated by the following equation (15) (procedure Sb5).

ｙ_m+i-1 ＝ｘ~ _m ^T ・α_i ・・・式（１５） y _{m + i-1} = x ~ _m ^T · α _i (15)

式（１５）により算出された時系列データｙ_m+i-1 が、推測すべき未知の時系列データである。このようにして、非線形マップΦを求めることが可能となる。 The time series data ym _{+ i-1} calculated by the equation (15) is unknown time series data to be estimated. In this way, the nonlinear map Φ can be obtained.

＜導出装置＞
以上詳述した未知の時系列データを推測する方法は、コンピュータを用いた導出装置として応用することにより、産業上利用可能な発明として具現化することができる。本発明に係る導出装置は、経時的に変化する事象に関するｎ次元（ｎは２以上の整数）の項目それぞれについて時系列で観測して得られたｍ点（ｍは２以上の整数）の観測データに基づいて、前記観測データに含まれていない時点のデータを導出データとして導出する。図２は、本発明に係る導出装置の構成例を示すブロック図である。図２に示す導出装置１は、パーソナルコンピュータ、サーバコンピュータ、それに接続されるクライアントコンピュータ、その他各種コンピュータ等のコンピュータを用いて実現される。導出装置１は、制御部１０、記録部１１、記憶部１２、入力部１３、出力部１４、取得部１５等の各種機構を備えている。 <Deriving device>
The method for estimating unknown time-series data described in detail above can be embodied as an industrially applicable invention by applying it as a derivation device using a computer. The derivation device according to the present invention observes m points (m is an integer equal to or greater than 2) obtained by observing each of n-dimensional (n is an integer equal to or greater than 2) items related to events that change over time. Based on the data, data at a time point not included in the observation data is derived as derived data. FIG. 2 is a block diagram illustrating a configuration example of the derivation device according to the present invention. The derivation device 1 shown in FIG. 2 is realized using a computer such as a personal computer, a server computer, a client computer connected thereto, and other various computers. The deriving device 1 includes various mechanisms such as a control unit 10, a recording unit 11, a storage unit 12, an input unit 13, an output unit 14, and an acquisition unit 15.

制御部１０は、ＣＰＵ（Central Processing Unit ）等の回路を用いて構成され、検出装置１全体を制御する機構である。 The control unit 10 is configured using a circuit such as a CPU (Central Processing Unit), and is a mechanism that controls the entire detection apparatus 1.

記録部１１は、ＨＤＤ（Hard Disk Drive）等の磁気記録機構、ＳＳＤ（Solid State Disk）等の不揮発性半導体記録機構等の不揮発性の補助記録機構である。記録部１１には、本発明に係る導出プログラム１１ａ等の各種プログラム及びデータが記録されている。 The recording unit 11 is a non-volatile auxiliary recording mechanism such as a magnetic recording mechanism such as an HDD (Hard Disk Drive) or a non-volatile semiconductor recording mechanism such as an SSD (Solid State Disk). In the recording unit 11, various programs and data such as the derivation program 11a according to the present invention are recorded.

記憶部１２は、ＳＤＲＡＭ（Synchronous Dynamic Random Access Memory）、ＳＲＡＭ（Static Random Access Memory）等の揮発性の主記憶機構である。 The storage unit 12 is a volatile main storage mechanism such as an SDRAM (Synchronous Dynamic Random Access Memory) or an SRAM (Static Random Access Memory).

入力部１３は、キーボード、マウス等のハードウェア、及びドライバ等のソフトウェアを備える入力機構である。 The input unit 13 is an input mechanism including hardware such as a keyboard and a mouse, and software such as a driver.

出力部１４は、モニタ、プリンタ等のハードウェア、及びドライバ等のソフトウェアを備える出力機構である。 The output unit 14 is an output mechanism including hardware such as a monitor and a printer, and software such as a driver.

取得部１５は、外部から観測データ等のデータを取得するＬＡＮボード、各種メモリスロット、無線通信デバイス、画像読取用デバイス等のハードウェア、及びドライバ等のソフトウェアを備えるインターフェース機構である。なお、入力部１３を、取得部１５として使用することも可能である。 The acquisition unit 15 is an interface mechanism including a LAN board that acquires data such as observation data from the outside, various memory slots, wireless communication devices, hardware such as an image reading device, and software such as a driver. The input unit 13 can be used as the acquisition unit 15.

そして、記録部１１に記録されている導出プログラム１１ａを、記録部１２に記憶し、制御部１０の制御に基づき実行することにより、コンピュータは、導出プログラム１１ａに係る各種手順を実行し、本発明の導出装置１として機能する。なお、便宜上、記録部１１及び記憶部１２として区別しているが、両者とも各種情報の記録という同様の機能を有するものであり、装置の仕様、運用形態等に応じていずれの機構に記録させるかは、適宜決定することが可能である。同様に制御部１０が備えるレジスタを記録部１１として用いても良い。 Then, by storing the derivation program 11a recorded in the recording unit 11 in the recording unit 12 and executing it based on the control of the control unit 10, the computer executes various procedures related to the derivation program 11a. Functions as a derivation device 1 for For the sake of convenience, the recording unit 11 and the storage unit 12 are distinguished from each other, but both have the same function of recording various types of information, and which mechanism is used for recording according to the specifications of the apparatus, the operation mode, and the like. Can be determined as appropriate. Similarly, a register provided in the control unit 10 may be used as the recording unit 11.

次に、本発明に係る導出装置１の処理について説明する。図３は、本発明に係る導出装置１の線形性時系列データの導出処理の一例を示すフローチャートである。導出装置１の制御部１０は、取得部１５により、ｎ次元の項目それぞれについて時系列に並ぶｍ点の観測データを取得する（Ｓ１０１）。なお、観測データは、前述のように、取得部１５又は入力部１３にて取得するが、記録部１１に予め観測データを記録しておき、記録部１１からの観測データの読み取りをもって観測データの取得としても良い。また、観測データとしては、各種測定装置による測定より得られたデータ、株価等の経済上のデータの他、画像データ等の様々なデータを用いることが可能である。観測データとして画像データを用いる場合、例えば、画像を構成する各画素又は所定の領域を構成する複数の画素の代表値について、明度、又はＲ（Ｒｅｄ）値、Ｇ（Ｇｒｅｅｎ）値及びＢ（Ｂｌｕｅ）値の階調値を取得し、観測データとして用いることが可能である。画像データそのものは、外部からの入力を受け付けて取得しても良く、また画像読取機能を備えるスキャナ等の取得部１５を用いて画像を読み取ることで取得しても良い。 Next, the process of the derivation device 1 according to the present invention will be described. FIG. 3 is a flowchart showing an example of the derivation process of the linearity time series data of the derivation device 1 according to the present invention. The control unit 10 of the derivation device 1 uses the acquisition unit 15 to acquire observation data of m points arranged in time series for each of the n-dimensional items (S101). As described above, the observation data is acquired by the acquisition unit 15 or the input unit 13. However, the observation data is recorded in the recording unit 11 in advance, and the observation data is read by reading the observation data from the recording unit 11. It is good as acquisition. As observation data, it is possible to use various data such as image data in addition to data obtained by measurement by various measuring devices and economic data such as stock prices. When image data is used as the observation data, for example, with respect to representative values of each pixel constituting the image or a plurality of pixels constituting a predetermined region, brightness, R (Red) value, G (Green) value, and B (Blue) ) The tone value of the value can be acquired and used as observation data. The image data itself may be acquired by receiving an input from the outside, or may be acquired by reading an image using an acquisition unit 15 such as a scanner having an image reading function.

制御部１０は、取得した観測データを記録部１１に記録する（Ｓ１０２）。記録部１１には、取得部１５が取得した観測データを要素とし、項目が行方向に並び、時系列が列方向に並ぶように配置したｎ行×ｍ列の行列Ｘ（始域行列）が記録される。なお、記録部１１への実際の記録形式は、必ずしも行列形式で記録する必要は無く、行列形式に配置可能なように行及び列を示すアドレス、又はそれに代替される識別符号が付されていれば良い。 The control unit 10 records the acquired observation data in the recording unit 11 (S102). The recording unit 11 includes an n-row × m-column matrix X (starting matrix) in which the observation data acquired by the acquisition unit 15 is an element, items are arranged in the row direction, and time series are arranged in the column direction. To be recorded. The actual recording format in the recording unit 11 does not necessarily have to be recorded in a matrix format, and an address indicating a row and a column, or an identification code substituted for it, is attached so that it can be arranged in the matrix format. It ’s fine.

読み取った観測データを記録後、操作者は、入力部１３を操作して、推測の対象となる項目を、導出する項目として指定し、更に推測すべき時点の数Ｌを指定する。制御部１０は、入力部１３から導出する項目及び導出する時点数の指定を受け付ける（Ｓ１０３）。ステップＳ１０３では、観測データに係る複数の項目ｘ（ｔ）のうちから指定された項目ｘ_s （ｔ）の指定を受け付ける。前述した特定の状態変数ｘ_s （ｔ）の選択に対応する処理である。また、時点数の指定とは、予測の対象となる時点の数、即ち前述したＬの値を受け付ける処理である。ステップＳ１０３の処理としては、予め設定されている項目及び時点数を読み取ることで指定の受け付けとしても良い。また、指定する項目は、複数であっても良く、更には、全項目を指定しても良い。 After recording the read observation data, the operator operates the input unit 13 to specify an item to be estimated as an item to be derived, and further specifies the number L of points to be estimated. The control unit 10 accepts designation of items to be derived from the input unit 13 and the number of time points to be derived (S103). In step S103, designation of an item x _s (t) designated from among a plurality of items x (t) related to observation data is accepted. This process corresponds to the selection of the specific state variable x _s (t) described above. The designation of the number of time points is a process of receiving the number of time points to be predicted, that is, the value of L described above. As the processing in step S103, the designation may be accepted by reading preset items and the number of points in time. Further, there may be a plurality of items to be specified, and all items may be specified.

制御部１０は、観測データ及び導出データを変数に割り付ける（Ｓ１０４）。ステップＳ１０４において、観測データに対しては、変数ｙ₁ ，ｙ₂ ，... ，ｙ_m を割り付けて記録部１１に記録する。また、ステップＳ１０２にて指定された時点の数Ｌに対応する時点ｍ＋１，ｍ＋２，... ，ｍ＋Ｌ−１に対応する変数ｙ_m+1 ，ｙ_m+2 ，... ，ｙ_m+L-1 を定義する。定義された変数ｙ_m+1 ，ｙ_m+2 ，... ，ｙ_m+L-1 は、導出データを割り付ける変数として記録部１１に記録される。記録部１１に記録される変数列ｙ₁ ，ｙ₂ ，... ，ｙ_m ，ｙ_m+1 ，ｙ_m+2 ，... ，ｙ_m+L-1 は、以降の処理により、１次元の行列Ｙ（終域行列）として扱われる。 The control unit 10 assigns observation data and derived data to variables (S104). In step S104, for the observed data, variables y _1, y _2, ..., by assigning y _m recorded in the recording unit 11. Also, variables ym _{+ 1} , ym _{+ 2} ,..., Ym _{+ L} corresponding to the time points m + 1, m + 2,..., M + L-1 corresponding to the number L of time points specified in step S102. Define _-1 . The defined variables y _{m + 1} , y _{m + 2} ,..., Y _{m + L-1} are recorded in the recording unit 11 as variables to which the derived data is assigned. Variable column y _1, y ₂ which are recorded in the recording unit _{11, ..., y m, y} m + 1, y m + 2, ..., y m + L-1 is the subsequent processing, 1 Treated as a dimensional matrix Y (final matrix).

制御部１０は、変数に割り付けた観測データ及び導出データから、線形マップΦの１行目の要素ａ₁ に係る数式を生成し（Ｓ１０５）、生成した数式に基づいて要素ａ₁ を算出する（Ｓ１０６）。ステップＳ１０５〜１０６では、前述の線形マップΦの決定として説明した方法のうち、手順Ｓａ１に対応する処理を実行する。 The control unit 10 generates a mathematical expression related to the element a ₁ in the first row of the linear map Φ from the observation data and the derived data assigned to the variable (S105), and calculates the element a ₁ based on the generated mathematical expression (S105). S106). In steps S105 to S106, processing corresponding to the procedure Sa1 is executed among the methods described as the determination of the linear map Φ described above.

制御部１０は、ステップＳ１０６にて算出したａ₁ を元に、線形マップΦのｉ行目の要素ａ_i に係る数式を逐次生成し（Ｓ１０７）、生成した数式に基づいて要素ａ_i を逐次算出する（Ｓ１０８）。ステップＳ１０７〜Ｓ１０８では、前述の線形マップΦの決定として説明した処理のうち、手順Ｓａ２に対応する処理を実行する。要素ａ_i は、ａ₂ ，ａ₃ ，... ，の順で数式の生成及び演算を繰り返すことにより、即ち、ステップＳ１０７〜Ｓ１０８の処理を繰り返すことにより、順次算出される。 Based on a ₁ calculated in step S106, the control unit 10 sequentially generates a mathematical formula related to the element a _i in the i-th row of the linear map Φ (S107), and sequentially generates the element a _i based on the generated mathematical formula. Calculate (S108). In steps S107 to S108, a process corresponding to the procedure Sa2 is executed among the processes described as the determination of the linear map Φ. The elements a _i are sequentially calculated by repeating the generation and calculation of mathematical expressions in the order of a ₂ , a ₃ ,..., That is, by repeating the processes of steps S107 to S108.

制御部１０は、算出したａ_i を要素とする線形マップΦと、観測データｘ_m とに基づく演算により、時系列データｙ_m+i-1 を導出する（Ｓ１０９）。ステップＳ１０９では、前述の線形マップΦの決定として説明した方法のうち、手順Ｓａ３に対応する処理を実行する。 The control unit 10 derives time-series data y _{m + i-1} by calculation based on the linear map Φ having the calculated a _i as an element and the observation data x _m (S109). In step S109, the process corresponding to the procedure Sa3 is executed among the methods described as the determination of the linear map Φ.

制御部１０は、ステップＳ１０３にて指定された項目全てについて時系列データの導出を行ったか否かを判定する（Ｓ１１０）。 The control unit 10 determines whether time-series data has been derived for all items specified in step S103 (S110).

ステップＳ１１０において、時系列データを導出していない項目があると判定した場合（Ｓ１１０：ＮＯ）、制御部１０は、ステップＳ１０４に戻り、以降の処理を繰り返すことで、導出していない項目についての時系列データを導出する。 If it is determined in step S110 that there is an item for which time-series data has not been derived (S110: NO), the control unit 10 returns to step S104, and repeats the subsequent processing to determine the items that have not been derived. Deriving time series data.

ステップＳ１１０において、全ての項目について時系列データを導出したと判定した場合（Ｓ１１０：ＹＥＳ）、制御部１０は、導出した各項目についての時系列データｙ_m+i-1 を出力部１４から出力する（Ｓ１１１）。ステップＳ１１１における出力としては、モニタへの表示、プリンタからの印刷の他、外部の装置への送信、記録部１１への記録等の様々な方法にて行われる。 If it is determined in step S110 that time series data has been derived for all items (S110: YES), the control unit 10 outputs time series data ym _{+ i-1} for each derived item from the output unit 14. (S111). The output in step S111 is performed by various methods such as display on a monitor, printing from a printer, transmission to an external apparatus, recording on the recording unit 11, and the like.

以上のようにして、線形性時系列データの導出処理が実行される。 As described above, the derivation process of the linearity time series data is executed.

次に、非線形性の時系列データの導出処理について説明する。図４は、本発明に係る導出装置１の非線形性時系列データの導出処理の一例を示すフローチャートである。導出装置１の制御部１０は、取得部１５により、ｎ次元の項目それぞれについてｍ点の観測データを取得し（Ｓ２０１）、取得した観測データを記録部１１に記録する（Ｓ２０２）。 Next, a process for deriving nonlinear time-series data will be described. FIG. 4 is a flowchart showing an example of the derivation process of the nonlinear time series data of the derivation device 1 according to the present invention. The control unit 10 of the derivation device 1 acquires m-point observation data for each of the n-dimensional items by the acquisition unit 15 (S201), and records the acquired observation data in the recording unit 11 (S202).

読み取った観測データを記録後、操作者は、入力部１３を操作して、推測の対象となる項目を、導出する項目として指定し、更に推測すべき時点の数Ｌを指定する。制御部１０は、入力部１３から導出する項目及び導出する時点数の指定を受け付ける（Ｓ２０３）。 After recording the read observation data, the operator operates the input unit 13 to specify an item to be estimated as an item to be derived, and further specifies the number L of points to be estimated. The control unit 10 accepts designation of items to be derived from the input unit 13 and the number of points to be derived (S203).

さらに、操作者は、入力部１３を操作して、基底関数ｇ₁ （ｘ）及び次数ｋの指定を受ける（Ｓ２０４）。ステップＳ２０４では、前述の非線形マップΦの決定として説明した方法のうち、手順Ｓｂ１に対応する処理を実行する。なお、ステップＳ２０４の処理としては、予め設定されている基底関数及び次数を読み取ることで指定の受け付けとしても良い。 Further, the operator operates the input unit 13 and receives designation of the basis function g ₁ (x) and the order k (S204). In step S204, the process corresponding to the procedure Sb1 is executed among the methods described as the determination of the nonlinear map Φ. In addition, as a process of step S204, it is good also as designation | designated reception by reading the basis function and order set beforehand.

制御部１０は、非線形マップΦ_i （ｘ_j ）を構成する基底関数ｇ_i （ｘ）の行列ｘ~ _j ^T 及び係数α_i を構築する（Ｓ２０５）。ステップＳ２０５では、前述の非線形マップΦの決定として説明した方法のうち、手順Ｓｂ２に対応する処理を実行する。 Control unit 10 constructs a matrix x ~ _j ^T and coefficients alpha _i of the nonlinear map [Phi _i basis functions constituting the _{_{(x j) g i (x}} ) (S205). In step S205, the process corresponding to the procedure Sb2 is executed among the methods described as the determination of the nonlinear map Φ.

制御部１０は、観測データ及び導出データの変数を割り付ける（Ｓ２０６）。更に、制御部１０は、変数に割り付けた観測データ及び導出データから、非線形マップΦの１行目の要素α¹ に係る数式を生成し（Ｓ２０７）、生成した数式に基づいて要素α¹ を算出する（Ｓ２０８）。ステップＳ２０７〜Ｓ２０８では、前述の非線形マップΦの決定として説明した方法のうち、手順Ｓｂ３に対応する処理を実行する。 The control unit 10 assigns observation data and derived data variables (S206). Further, the control unit 10 generates a mathematical formula related to the element α ¹ in the first row of the nonlinear map Φ from the observation data and the derived data assigned to the variable (S207), and calculates the element α ¹ based on the generated mathematical formula. (S208). In steps S207 to S208, processing corresponding to the procedure Sb3 is executed among the methods described as the determination of the nonlinear map Φ.

制御部１０は、ステップＳ２０８にて算出したα¹ を元に、非線形マップΦのｉ行目の要素αⁱ に係る数式を逐次生成し（Ｓ２０９）、生成した数式に基づいて要素αⁱ を逐次算出する（Ｓ２１０）。ステップＳ２０９〜Ｓ２１０では、前述の非線形マップΦの決定として説明した方法のうち、手順Ｓｂ４に対応する処理を実行する。要素αⁱは、α²，α³ ，... ，の順で数式の生成及び演算を繰り返すことにより、即ち、ステップＳ２０９〜Ｓ２１０の処理を繰り返すことにより、順次算出される。 Based on α ¹ calculated in step S208, the control unit 10 sequentially generates mathematical expressions related to the element α ⁱ in the i-th row of the nonlinear map Φ (S209), and sequentially generates the elements α ⁱ based on the generated mathematical expressions. Calculate (S210). In steps S209 to S210, the process corresponding to the procedure Sb4 is executed among the methods described as the determination of the nonlinear map Φ. The element α ⁱ is sequentially calculated by repeating the generation and calculation of mathematical expressions in the order of α ² , α ³ ,..., That is, by repeating the processing of steps S209 to S210.

制御部１０は、算出したαⁱ を要素とする非線形マップΦと、観測データｘ_m とに基づく演算により、時系列データｙ_m+i-1 を導出する（Ｓ２１１）。ステップＳ２１１では、前述の非線形マップΦの決定として説明した方法のうち、手順Ｓｂ５に対応する処理を実行する。 The control unit 10 derives time-series data y _{m + i-1} by calculation based on the nonlinear map Φ having the calculated α ⁱ as an element and the observation data x _m (S211). In step S211, the process corresponding to the procedure Sb5 is executed among the methods described as the determination of the nonlinear map Φ described above.

制御部１０は、ステップＳ２０３にて指定された項目全てについて時系列データの導出を行ったか否かを判定する（Ｓ２１２）。 The control unit 10 determines whether time-series data has been derived for all items specified in step S203 (S212).

ステップＳ２１２において、時系列データを導出していない項目があると判定した場合（Ｓ２１２：ＮＯ）、制御部１０は、ステップＳ２０６に戻り、以降の処理を繰り返すことで、導出していない項目についての時系列データを導出する。 In step S212, when it is determined that there is an item for which time-series data has not been derived (S212: NO), the control unit 10 returns to step S206, and repeats the subsequent processing to determine the items that have not been derived. Deriving time series data.

ステップＳ２１２において、全ての項目について時系列データを導出したと判定した場合（Ｓ２１２：ＹＥＳ）、制御部１０は、導出した各項目についての時系列データｙ_m+i-1 を出力部１４から出力する（Ｓ２１２）。 If it is determined in step S212 that time series data has been derived for all items (S212: YES), the control unit 10 outputs time series data ym _{+ i-1} for each derived item from the output unit 14. (S212).

以上のようにして、非線形性時系列データの導出処理が実行される。 As described above, the process of deriving nonlinear time series data is executed.

＜実際の動的システムへの適用例＞
以上、詳述したように、本発明の導出装置１により、時系列データの導出処理として、未知の時系列データの推測を行うことができる。本発明の導出装置１にて実行される未知の時系列データの導出は、経時的に変換するｎ次元の項目を有する時系列データ、特に短期的高次元時系列データに基づいて、長期的低次元の時系列データの推測に有用である。その応用範囲は広く、株価、為替、物価等の金融市場、気象、電力消費量、生命分子レベルの進展データ、複雑な疾患の進展等の様々な分野に適用することができる。 <Example of application to an actual dynamic system>
As described above in detail, the deriving device 1 of the present invention can estimate unknown time-series data as the time-series data deriving process. The derivation of unknown time-series data executed by the derivation device 1 of the present invention is based on time-series data having n-dimensional items to be converted over time, particularly on the basis of short-term high-dimensional time-series data. Useful for estimating time-series data of dimensions. Its application range is wide, and it can be applied to various fields such as financial markets such as stock prices, exchange rates, prices, weather, power consumption, biomolecular level progress data, and complicated disease progress.

本発明を気象予測に適用する例について説明する。図５は、本発明に係る導出装置１が取得する観測データの一例を示す説明図である。図５は、気象衛星により太平洋南西部付近を撮像して得られた可視画像であり、台風が発生している状況を確認することができる。導出装置１は、図５に例示する可視画像から画素データを観測データとして取得する。取得する画素データは、可視画像を構成する全ての画素、画像中の特定の座標における画素、予め設定された領域毎の画素の代表値（例えば、平均値）等、適宜設定することができる。また画素データに用いる数値としては、モノクロ画像からは階調値、カラー画像からはＲ値、Ｇ値及びＢ値の各色成分における階調値等適宜設定することができる。 An example in which the present invention is applied to weather prediction will be described. FIG. 5 is an explanatory diagram showing an example of observation data acquired by the deriving device 1 according to the present invention. FIG. 5 is a visible image obtained by imaging the vicinity of the southwestern Pacific with a meteorological satellite, and it is possible to confirm a situation where a typhoon is generated. The deriving device 1 acquires pixel data as observation data from the visible image illustrated in FIG. The pixel data to be acquired can be set as appropriate, such as all pixels constituting the visible image, pixels at specific coordinates in the image, and representative values (for example, average values) of pixels for each preset region. The numerical values used for the pixel data can be set as appropriate, such as gradation values for monochrome images and gradation values for R, G, and B color components for color images.

今回は本発明に係る疑似試験として、図５に例示する可視画像に含まれる３００点の座標位置に対応する画素のＲ値、Ｇ値及びＢ値それぞれについて２５６段階に分級された階調値を９００点の観測データとして取得した。更に、図５に示す台風の渦の中心、所謂、台風の目の位置を示すＸ座標及びＹ座標を観測データとして取得した。即ち、気象衛星による可視画像を、９０２次元の状態変数を含む系として捉え、系に含まれる９０２項目の観測データを取得した。また、観測データを取得する同様の処理を、所定の時間間隔で撮像して得られた２１点の可視画像に対して行った。このような方法により、経時的に変化する系（動的システム）から一定の時間間隔で２１点の時点における多次元の観測データを取得した。そして、導出する項目として、台風の目の位置を示すＸ座標及びＹ座標を指定し、線形マップΦに基づき、観測データに係る時点より後の８点の時点について、指定したＸ座標及びＹ座標の値を導出し、出力した。 This time, as a pseudo test according to the present invention, gradation values classified into 256 levels for each of the R value, G value, and B value of the pixels corresponding to the coordinate positions of 300 points included in the visible image illustrated in FIG. Obtained as 900 points of observation data. Further, the X-coordinate and Y-coordinate indicating the center of the typhoon vortex shown in FIG. That is, a visible image from a meteorological satellite was regarded as a system including a 902-dimensional state variable, and 902 items of observation data included in the system were acquired. In addition, a similar process for acquiring observation data was performed on 21 visible images obtained by imaging at predetermined time intervals. By such a method, multi-dimensional observation data at 21 points in time was acquired from a system that changes over time (dynamic system) at regular time intervals. Then, as the items to be derived, the X coordinate and the Y coordinate indicating the position of the typhoon eye are specified, and the specified X coordinate and Y coordinate for the eight time points after the time point related to the observation data based on the linear map Φ. The value of was derived and output.

図６は、本発明に係る導出装置１が出力する導出データの一例を示す説明図である。図６では、台風の目の位置を示す座標を、２１点の観測データ、８点の導出データ及び８点の実際のデータについて地図上に示した説明図である。地図上の位置を明確にすべく陸地については海岸線を示している。図中左下に示された陸地がルソン島等のフィリピン諸島であり、左上に示された陸地が台湾島である。また、図中「△」で示した点が観測データにより示される台風の目の実際の位置、「○」で示した点が導出データにより示される台風の目の推測位置、そして「◇」で示した点が導出データに対応する時点における実際の台風の目の実際の位置を示している。 FIG. 6 is an explanatory diagram showing an example of derived data output by the deriving device 1 according to the present invention. FIG. 6 is an explanatory diagram showing coordinates indicating the position of the typhoon eye on the map with respect to 21 points of observation data, 8 points of derived data, and 8 points of actual data. In order to clarify the position on the map, the coastline is shown for the land. The land shown in the lower left of the figure is the Philippine Islands such as Luzon, and the land shown in the upper left is Taiwan Island. In addition, the point indicated by “△” in the figure is the actual position of the typhoon eye indicated by the observation data, the point indicated by “◯” is the estimated position of the typhoon eye indicated by the derived data, and “◇” The indicated points indicate the actual positions of the actual typhoon eyes at the time corresponding to the derived data.

図６に示すように、２１点の観測データから導出された８点の導出データにより示される台風の中心の位置の経時変化、即ち、台風の予測進路は、実際の進路と近似していることを確認することができる。現在の気象に係る数値予測では、各地点の各高度における温度、湿度、気圧、風向き等の気象データを元に台風の進路予測を行っている。これに対して本発明は、気象衛星による可視画像のみからでも台風の進路予測を行うことが可能であることを示している。実際に本発明を適用する際には、各種気象データを観測データとすることにより、更には、これらの気象データと可視画像、赤外画像等の画像データから得られるデータとを観測データとして併用することにより、更に正確な進路予測の実現を見込むことができる。 As shown in FIG. 6, the temporal change in the position of the typhoon center indicated by the eight derived data derived from the 21 observation data, that is, the predicted course of the typhoon approximates the actual course. Can be confirmed. In the current numerical prediction related to weather, the course of typhoon is predicted based on weather data such as temperature, humidity, atmospheric pressure, and wind direction at each altitude at each point. On the other hand, the present invention shows that it is possible to predict the course of a typhoon from only a visible image from a weather satellite. When actually applying the present invention, various meteorological data are used as observation data. Furthermore, these meteorological data and data obtained from image data such as visible images and infrared images are used in combination as observation data. By doing so, it is possible to expect a more accurate course prediction.

前記実施の形態では、本発明の無数に存在する実施例の一部を開示したに過ぎず、マップΦの概念を用いて時系列データを導出（推測）する様々な方法に適用することが可能である。また、マップΦを算出する演算手順についても上述した方法に限らず、様々な方法を適用することが可能である。例えば、マップΦの要素に係る数式の生成及び要素の算出は、行毎に逐次算出するのではなく、複数の要素を一度に算出するアルゴリズムを適用しても良い。 In the above embodiment, only a part of the infinite number of examples of the present invention is disclosed, and it can be applied to various methods for deriving (estimating) time series data using the concept of the map Φ. It is. The calculation procedure for calculating the map Φ is not limited to the method described above, and various methods can be applied. For example, an algorithm for calculating a plurality of elements at a time may be applied to the generation of mathematical formulas and the calculation of elements related to the elements of the map Φ instead of sequentially calculating each line.

１導出装置
１０制御部
１１記録部
１１ａ導出プログラム
１２記憶部
１３入力部
１４出力部
１５取得部 DESCRIPTION OF SYMBOLS 1 Derivation apparatus 10 Control part 11 Recording part 11a Derivation program 12 Storage part 13 Input part 14 Output part 15 Acquisition part

Claims

Based on the observation data of m points (m is an integer of 2 or more) obtained by observing each of n-dimensional items (n is an integer of 2 or more) in time series regarding events that change over time, the observation data A derivation device for deriving data at a time not included in
an acquisition unit for acquiring observation data of m points for each of n-dimensional items;
Item accepting means for accepting designation of an item to be derived;
A recording unit that records observation data acquired by the acquisition unit as an element, records an n-row × m-column start matrix arranged such that items are arranged in a row direction and a time series is arranged in a column direction;
L rows × m columns (with m + 1-i point observation data and i−1 point derivation data arranged in chronological order in the i-th row (i is a natural number) with the observation data and derivation data of the specified item as elements. Means for deriving a transformation map in which L is an integer of 2 or more) and a mapping from the start matrix;
Means for deriving the end-region matrix as a mapping from the start-region matrix recorded in the recording unit by the derived transformation map;
An derivation device comprising: an output unit that outputs derivation data included in the derived end-region matrix.

Based on observation data of m points (m is an integer of 2 or more) obtained by observing each of n-dimensional items (n is an integer of 2 or more) in time series regarding events that change over time, the observation data A derivation device for deriving data at a time not included in
an acquisition unit for acquiring observation data of m points for each of n-dimensional items;
Item accepting means for accepting designation of an item to be derived;
Consists of n coefficients and observation data, and m + 1-i point observation data (i = 1, 2,..., L) for the specified item and i-1 point derived data. formula generating means for generating m + 1-i formulas;
Derivation means for deriving derived data based on the generated mathematical formula;
An derivation device comprising: an output unit that outputs the derived data.

The derivation device according to claim 2, wherein
The mathematical expression generating means generates i mathematical expressions having linearity composed of a product of n coefficients and observation data of n items, and variables relating to observation data and derived data of designated items,
The derivation device, wherein the derivation unit derives the derivation data related to the variable using the values of n coefficients obtained from the mathematical formula generated by the mathematical formula generation unit.

The derivation device according to claim 2, wherein
A function specification unit that receives the specification of the nonlinear function and the expansion order of the specified nonlinear function;
The mathematical expression generating means generates n nonlinear functions using the observation data and the derived data as variables, respectively, as i mathematical expressions based on the nonlinear function and the expansion order specified by the function specifying unit. A derivation device.

The derivation device according to any one of claims 1 to 4, wherein
The derivation device, wherein the derivation data is data for L-1 points after the m-th observation data or data for L-1 points before the first observation data.

Based on observation data of m points (m is an integer of 2 or more) obtained by observing each of n-dimensional items (n is an integer of 2 or more) in time series regarding events that change over time, the observation data A derivation method using a derivation device for deriving data at a time not included in the derivation data,
The derivation device includes:
obtaining observation data at m points for each of the n-dimensional items;
Determining the item to derive;
n coefficients and observation data, and m + 1-L point observation data (L = 1, 2,..., i) for the specified item and L-1 point derived data variables. generating i mathematical expressions;
And a step of deriving derived data based on the generated mathematical formula.

Based on observation data of m points (m is an integer of 2 or more) obtained by observing the computer in time series for each item of n dimensions (n is an integer of 2 or more) related to events that change over time, A derivation program for deriving data at a time point not included in the observation data as derived data,
On the computer,
obtaining observation data at m points for each of the n-dimensional items;
Determining the item to derive;
n coefficients and observation data, and m + 1-L point observation data (L = 1, 2,..., i) for the specified item and L-1 point derived data variables. generating i mathematical expressions;
And a step of deriving derived data based on the generated mathematical formula.