JP2018190280A

JP2018190280A - Data processing apparatus, data processing method, program, and data structure

Info

Publication number: JP2018190280A
Application number: JP2017094047A
Authority: JP
Inventors: 理基鈴木; Masaki Suzuki
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2017-05-10
Filing date: 2017-05-10
Publication date: 2018-11-29
Anticipated expiration: 2037-05-10
Also published as: JP6815931B2

Abstract

PROBLEM TO BE SOLVED: To provide a data processing apparatus or the like capable of performing analysis for multidimensional and multi-scale data.SOLUTION: A statistics data processing apparatus includes a statistics data processing part for generating a statistics data group having statistics data based on data included in each combination of a point of view and a scale with respect to a plurality of different points of view and a plurality of different scales for each point of view.SELECTED DRAWING: Figure 1

Description

本発明は、データ処理装置、データ処理方法、プログラムおよびデータ構造に関する。 The present invention relates to a data processing device, a data processing method, a program, and a data structure.

交通あるいはヘルスケアなど様々な分野において、所定の対象の時系列データを検出することが行われている。このような時系列データから検出対象の異常を即座に発見することが重要な場合がある。
また、時系列データのサンプル数が膨大になりかつ常に次々とデータが発生するストリームデータを処理の対象とする場合も多い。 In various fields such as traffic and healthcare, time-series data of a predetermined target is detected. In some cases, it is important to immediately detect an abnormality to be detected from such time-series data.
Further, there are many cases where stream data in which the number of samples of time series data is enormous and data is always generated one after another is processed.

異常を検出する方法としては、事前知識を用いて異常を検出する方法と、異常として外れ値を検出する方法とに大別される。
事前知識を用いて異常を検出する方法では、特定の条件を満たす時系列データが発生したときに異常の発生として当該異常を検出するが、未知の事象に対応することができない。また、事前知識を用いて異常を検出する方法では、前提条件が仮定されている場合が多いため、前提条件が変化するときには、その都度、特定の条件を修正する必要がある。また、この方法では、事前知識を学習する必要がある。このような点を補うために、機械学習を利用することも考えられるが、十分に高速な処理は確立されていない。 The method for detecting an abnormality is roughly classified into a method for detecting an abnormality using prior knowledge and a method for detecting an outlier as an abnormality.
In the method of detecting an abnormality using prior knowledge, when time-series data satisfying a specific condition occurs, the abnormality is detected as the occurrence of the abnormality, but an unknown event cannot be dealt with. In addition, in the method of detecting an abnormality using prior knowledge, a precondition is often assumed, and therefore, when the precondition changes, it is necessary to correct a specific condition each time. This method also requires prior knowledge to be learned. In order to compensate for this point, it may be possible to use machine learning, but a sufficiently high-speed process has not been established.

こうした事情から、ストリームデータを対象として、外れ値を検出する方法の確立および高度化が重要となる。
例えば、外れ値を検出する技術は従前から存在しているが、特にストリームデータを対象とするときに、十分に高速な処理が行われない場合があった。異常が発生するときを予測することが困難な状況では、常に異常の発生の有無を監視する必要があり、時系列データを構成するデータが次々と発生する速度と比べて十分に高速な処理を実現する必要があった。 Under these circumstances, it is important to establish and enhance a method for detecting outliers for stream data.
For example, techniques for detecting outliers have existed for some time, but there have been cases where sufficiently high-speed processing has not been performed, particularly when stream data is targeted. In situations where it is difficult to predict when an anomaly will occur, it is necessary to constantly monitor whether an anomaly has occurred, and processing that is sufficiently faster than the rate at which the data constituting the time-series data occurs one after another It was necessary to realize.

一例として、非特許文献１では、ｔ−ｄｉｇｅｓｔと呼ばれる技術が提案されている。ｔ−ｄｉｇｅｓｔは、値の集合に対してその分布を推定するデータ構造を有し、例えば、ストリームデータを対象としてその分布を推定することが可能である。
しかしながら、ｔ−ｄｉｇｅｓｔだけでは、十分に高速な処理を実現することが困難な場合があった。 As an example, Non-Patent Document 1 proposes a technique called t-digest. The t-digest has a data structure for estimating the distribution of a set of values. For example, the distribution can be estimated for stream data.
However, there are cases where it is difficult to realize sufficiently high-speed processing with only t-digest.

ＴＥＤＤＵＮＮＩＮＧＡＮＤＯＴＭＡＲＥＲＴＬ、“ＣＯＭＰＵＴＩＮＧＥＸＴＲＥＭＥＬＹＡＣＣＵＲＡＴＥＱＵＡＮＴＩＬＥＳＵＳＩＮＧｔ−ＤＩＧＥＳＴＳ”、［ｏｎｌｉｎｅ］、［平成２９年４月１２日検索］、インターネット＜ＵＲＬ：https://github.com/tdunning/t-digest/blob/master/docs/t-digest-paper/histo.pdf＞TED DUNING AND OTMAR ERTL, "COMPUTING EXTREME ACCURATE QUANTILES USING t-DIESTS", [online], [April 12, 2017 search], Internet <URL: https://github.com/tdunning/t-gest blob / master / docs / t-digest-paper / histo.pdf>

従来では、多次元およびマルチスケールのデータ解析を行うことが困難である場合があった。なお、ｔ−ｄｉｇｅｓｔでは、多次元およびマルチスケールについては想定されていなかった。 Conventionally, it has been difficult to perform multidimensional and multiscale data analysis. In t-digest, multidimensional and multiscale were not assumed.

本発明は、このような事情を考慮してなされたもので、多次元およびマルチスケールのデータ解析を行うことを可能とするデータ処理装置、データ処理方法、プログラムおよびデータ構造を提供することを課題とする。 The present invention has been made in view of such circumstances, and it is an object of the present invention to provide a data processing device, a data processing method, a program, and a data structure that enable multidimensional and multiscale data analysis. And

一構成例として、複数の異なる観点および前記観点ごとの複数の異なるスケールについて、前記観点と前記スケールとの組み合わせごとに含まれるデータに基づく統計量データを有する統計量データ群を生成する統計量データ処理部を備える、統計量データ処理装置である。
一構成例として、統計量データ処理装置において、前記統計量データ処理部は、前記統計量データ群に対して新たなデータを追加した前記統計量データ群を生成する、構成が用いられてもよい。
一構成例として、統計量データ処理装置において、前記統計量データ処理部は、前記統計量データ群から抽出範囲に含まれるデータに基づく前記統計量データ群を生成する、構成が用いられてもよい。
一構成例として、統計量データ処理装置において、前記統計量データ処理部は、それぞれの前記観点について１個の前記スケールが特定された統計量データである単位統計量データを生成する単位統計量データ生成装置により生成された前記単位統計量データを使用して、前記統計量データ群を生成する、構成が用いられてもよい。 As one configuration example, for a plurality of different viewpoints and a plurality of different scales for each of the viewpoints, statistical data for generating a statistical data group having statistical data based on data included for each combination of the viewpoint and the scale It is a statistics data processing apparatus provided with a process part.
As one configuration example, in the statistics data processing device, a configuration may be used in which the statistics data processing unit generates the statistics data group in which new data is added to the statistics data group. .
As one configuration example, in the statistics data processing device, a configuration may be used in which the statistics data processing unit generates the statistics data group based on data included in an extraction range from the statistics data group. .
As one configuration example, in the statistics data processing device, the statistics data processing unit generates unit statistics data that is unit statistics data in which one scale is specified for each of the viewpoints. The structure which produces | generates the said statistics data group using the said unit statistics data produced | generated by the production | generation apparatus may be used.

一構成例として、統計量データ処理部が、複数の異なる観点および前記観点ごとの複数の異なるスケールについて、前記観点と前記スケールとの組み合わせごとに含まれるデータに基づく統計量データを有する統計量データ群を生成する、統計量データ処理方法である。
一構成例として、統計量データ処理部が、複数の異なる観点および前記観点ごとの複数の異なるスケールについて、前記観点と前記スケールとの組み合わせごとに含まれるデータに基づく統計量データを有する統計量データ群を生成するステップ、をコンピュータに実行させるためのプログラムである。
一構成例として、統計量データ群のデータ構造であって、複数の異なる観点および前記観点ごとの複数の異なるスケールについて、前記観点と前記スケールとの組み合わせごとに、前記組み合わせごとに含まれるデータに基づく統計量データを有する、データ構造である。 As one configuration example, the statistic data processing unit includes statistic data based on data included in each combination of the viewpoint and the scale with respect to a plurality of different viewpoints and a plurality of different scales for each viewpoint. This is a statistical data processing method for generating groups.
As one configuration example, the statistic data processing unit includes statistic data based on data included in each combination of the viewpoint and the scale with respect to a plurality of different viewpoints and a plurality of different scales for each viewpoint. A program for causing a computer to execute a step of generating a group.
As one configuration example, a data structure of a statistic data group, including a plurality of different viewpoints and a plurality of different scales for each of the viewpoints, for each combination of the viewpoint and the scale, the data included in each combination A data structure with statistical data based thereon.

本発明によれば、多次元およびマルチスケールのデータ解析を行うことを可能とすることができる。 According to the present invention, it is possible to perform multidimensional and multiscale data analysis.

本発明の一実施形態（第１実施形態）に係るデータ処理システムの概略的な構成を示すブロック図である。1 is a block diagram showing a schematic configuration of a data processing system according to an embodiment (first embodiment) of the present invention. 本発明の一実施形態に係る統計量データ処理装置の概略的な構成を示すブロック図である。It is a block diagram which shows the schematic structure of the statistic data processing apparatus which concerns on one Embodiment of this invention. 本発明の一実施形態に係る統計量データ群の一例のデータ構造を模式的に示す図である。It is a figure which shows typically the data structure of an example of the statistics data group which concerns on one Embodiment of this invention. 本発明の一実施形態に係る統計量データ群に対する追加処理の一例を説明するための図である。It is a figure for demonstrating an example of the addition process with respect to the statistics data group which concerns on one Embodiment of this invention. 本発明の一実施形態に係る統計量データ群に対する抽出処理の一例を説明するための図である。It is a figure for demonstrating an example of the extraction process with respect to the statistics data group which concerns on one Embodiment of this invention. 本発明の一実施形態に係る統計量データ処理装置において行われる統計量データ群を生成するための処理の手順の一例を示す図である。It is a figure which shows an example of the procedure of the process for producing | generating the statistics data group performed in the statistics data processing apparatus which concerns on one Embodiment of this invention. 本発明の一実施形態に係る統計量データ処理装置において行われる統計量データ群に対してデータを追加するための処理の手順の一例を示す図である。It is a figure which shows an example of the procedure of the process for adding data with respect to the statistics data group performed in the statistics data processing apparatus which concerns on one Embodiment of this invention. 本発明の一実施形態に係る統計量データ処理装置において行われる統計量データ群からデータを抽出するための処理の手順の一例を示す図である。It is a figure which shows an example of the procedure of the process for extracting data from the statistics data group performed in the statistics data processing apparatus which concerns on one Embodiment of this invention. 本発明の一実施形態（第２実施形態）に係るデータ処理システムの概略的な構成を示すブロック図である。It is a block diagram which shows the schematic structure of the data processing system which concerns on one Embodiment (2nd Embodiment) of this invention.

以下、図面を参照し、本発明の実施形態について説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

（第１実施形態）
［データ処理システム］
図１は、本発明の一実施形態（第１実施形態）に係るデータ処理システム１の概略的な構成を示すブロック図である。
データ処理システム１は、ｎ（ｎは２以上の整数とする。）個の端末装置１１−１〜１１−ｎと、統計量データ処理装置１２と、データベース１３と、ネットワーク２１を備える。
ネットワーク２１としては、有線または無線の任意のネットワークが用いられてもよく、例えば、インターネットあるいはＷｉ−Ｆｉ（登録商標）のネットワークなどが用いられてもよい。
データベース１３は、本実施形態では、統計量データ処理装置１２とは別に備えられるが、他の例として、統計量データ処理装置１２に一体化されていてもよい。 (First embodiment)
[Data processing system]
FIG. 1 is a block diagram showing a schematic configuration of a data processing system 1 according to an embodiment (first embodiment) of the present invention.
The data processing system 1 includes n (n is an integer of 2 or more) terminal devices 11-1 to 11-n, a statistic data processing device 12, a database 13, and a network 21.
As the network 21, a wired or wireless arbitrary network may be used. For example, the Internet or a Wi-Fi (registered trademark) network may be used.
Although the database 13 is provided separately from the statistic data processing device 12 in this embodiment, as another example, the database 13 may be integrated with the statistic data processing device 12.

統計量データ処理装置１２は、端末装置１１−１〜１１−ｎに関する所定の対象のデータを取得して解析する。本実施形態では、当該データは、所定の対象の時系列データについて、サンプル数が膨大（ビックデータ）になりかつ常に次々とデータが発生するストリームデータであるとする。 The statistic data processing device 12 acquires and analyzes predetermined target data regarding the terminal devices 11-1 to 11-n. In the present embodiment, it is assumed that the data is stream data in which the number of samples is enormous (big data) and data is always generated one after another with respect to predetermined target time-series data.

所定の対象のデータとしては、任意のデータが用いられてもよく、例えば、ＩｏＴ（ＩｎｔｅｒｎｅｔｏｎＴｈｉｎｇｓ）に関するデータが用いられてもよく、あるいは、他のデータが用いられてもよい。
具体例として、所定の対象のデータとしては、任意のシステムにおけるデータが用いられてもよく、例えば、車両などの交通システムに関するデータ、人などのヘルスケアシステムに関するデータ、製品を生産などする工場システムに関するデータ、証券などの金融システムに関するデータ、有線または無線の通信システムに関するデータなどが用いられてもよい。また、任意のシステムにおいて、所定の対象のデータとしては、様々なデータが用いられてもよく、例えば、温度に関するデータ、湿度に関するデータ、速度に関するデータ、加速度に関するデータ、画像に関するデータ、酸素などの物質の濃度に関するデータ、品質に関するデータ、株価に関するデータ、通信信号に関するデータ、端末装置（本実施形態では、端末装置１１−１〜１１−ｎ）の位置に関するデータ、端末装置が存在する領域に関するデータなどが用いられてもよい。 Arbitrary data may be used as the predetermined target data. For example, data on IoT (Internet on Things) may be used, or other data may be used.
As a specific example, data in an arbitrary system may be used as the predetermined target data. For example, data related to transportation systems such as vehicles, data related to healthcare systems such as people, and factory systems that produce products Data relating to financial systems such as securities, data relating to wired or wireless communication systems, and the like may be used. In addition, in an arbitrary system, various data may be used as predetermined target data. For example, data related to temperature, data related to humidity, data related to speed, data related to acceleration, data related to images, data related to oxygen, etc. Data relating to substance concentration, data relating to quality, data relating to stock prices, data relating to communication signals, data relating to positions of terminal devices (terminal devices 11-1 to 11-n in the present embodiment), data relating to areas where terminal devices exist. Etc. may be used.

また、所定の対象のデータを検出する手法としては、様々な手法が用いられてもよい。
一例として、それぞれの端末装置１１−１〜１１−ｎにおいて、当該それぞれの端末装置１１−１〜１１−ｎに関する所定の対象のデータを検出する構成が用いられてもよい。この構成では、それぞれの端末装置１１−１〜１１−ｎは、所定の対象のデータを検出する検出部を備え、当該検出部により検出されたデータを、ネットワーク２１を介して、統計量データ処理装置１２に送信する。当該検出部は、例えば、センサー、あるいは、撮像装置（カメラ）であってもよい。ここで、撮像装置（カメラ）も、センサーの一例であると捉えられてもよい。
なお、それぞれの端末装置１１−１〜１１−ｎは、ＩｏＴの端末装置であってもよく、あるいは、他の端末装置であってもよい。 Various methods may be used as a method for detecting predetermined target data.
As an example, in each of the terminal devices 11-1 to 11-n, a configuration for detecting predetermined target data related to each of the terminal devices 11-1 to 11-n may be used. In this configuration, each of the terminal devices 11-1 to 11-n includes a detection unit that detects data of a predetermined target, and the data detected by the detection unit is subjected to statistical data processing via the network 21. Transmit to device 12. The detection unit may be, for example, a sensor or an imaging device (camera). Here, the imaging device (camera) may also be regarded as an example of a sensor.
Each of the terminal devices 11-1 to 11-n may be an IoT terminal device or may be another terminal device.

他の例として、データ処理システム１は、端末装置１１−１〜１１−ｎとは別の検出装置（図示を省略）を備えてもよい。この構成では、当該検出装置は、それぞれの端末装置１１−１〜１１−ｎに関する所定の対象のデータを検出する検出部を有しており、当該検出部により検出されたデータを、ネットワーク２１を介して、統計量データ処理装置１２に送信する。 As another example, the data processing system 1 may include a detection device (not shown) different from the terminal devices 11-1 to 11-n. In this configuration, the detection device includes a detection unit that detects predetermined target data regarding each of the terminal devices 11-1 to 11-n, and the data detected by the detection unit is transmitted to the network 21. To the statistic data processing device 12.

当該検出装置は、例えば、所定の領域の画像を撮像する撮像装置（カメラ）を備えてもよく、当該画像に基づいて、当該所定の領域に存在する端末装置１１−１〜１１−ｎの数のデータなどを所定の対象のデータとして検出してもよい。
当該検出装置は、例えば、所定の領域に存在する端末装置１１−１〜１１−ｎと無線または有線で通信する通信部を備えてもよく、この通信の結果に基づいて、当該所定の領域に存在する端末装置１１−１〜１１−ｎの数のデータなどを所定の対象のデータとして検出してもよい。
当該検出装置は、例えば、端末装置１１−１〜１１−ｎから発信される信号を取得する信号取得部を備えてもよく、当該信号の状況に基づいて、当該信号の発生頻度あるいは遅延度などを表すデータなどを所定の対象のデータとして検出してもよい。 The detection device may include, for example, an imaging device (camera) that captures an image of a predetermined region, and the number of terminal devices 11-1 to 11-n existing in the predetermined region based on the image. Or the like may be detected as predetermined target data.
The detection apparatus may include, for example, a communication unit that communicates with the terminal devices 11-1 to 11-n existing in a predetermined area wirelessly or by wire. Based on the result of the communication, the detection apparatus Data of the number of existing terminal apparatuses 11-1 to 11-n may be detected as predetermined target data.
The detection device may include, for example, a signal acquisition unit that acquires signals transmitted from the terminal devices 11-1 to 11-n. Based on the state of the signal, the frequency of occurrence of the signal or the degree of delay, etc. May be detected as predetermined target data.

本実施形態では、それぞれの端末装置１１−１〜１１−ｎは、ネットワーク２１を介して、他の装置（例えば、統計量データ処理装置１２など）と通信することが可能である。なお、それぞれの端末装置１１−１〜１１−ｎは、有線または無線により、ネットワーク２１との間で通信接続する。
他の例として、それぞれの端末装置１１−１〜１１−ｎは、通信機能を備えなくてもよい。 In the present embodiment, each of the terminal devices 11-1 to 11-n can communicate with other devices (for example, the statistics data processing device 12) via the network 21. Each of the terminal devices 11-1 to 11-n is connected for communication with the network 21 by wire or wireless.
As another example, each of the terminal devices 11-1 to 11-n may not have a communication function.

また、それぞれの端末装置１１−１〜１１−ｎとしては、例えば、同じ構成を有する端末装置が用いられてもよく、あるいは、異なる構成を有する端末装置が含まれてもよい。
また、それぞれの端末装置１１−１〜１１−ｎは、例えば、物に付加あるいは装着などされてもよく、あるいは、人により携帯あるいは装着などされてもよい。当該物としては、任意のものであってもよく、例えば、自動車などの車両、あるいは、電化製品などであってもよい。 Moreover, as each terminal device 11-1 to 11-n, for example, a terminal device having the same configuration may be used, or a terminal device having a different configuration may be included.
Each of the terminal devices 11-1 to 11-n may be added to or attached to an object, or may be carried or attached by a person. As the said thing, arbitrary things may be sufficient, for example, vehicles, such as a motor vehicle, or an electric appliance.

また、端末装置１１−１〜１１−ｎとは別の検出装置は、例えば、統計量データ処理装置１２に備えられてもよい。この構成では、統計量データ処理装置１２は、当該検出装置により検出されたデータを取得して解析する。 In addition, a detection device other than the terminal devices 11-1 to 11-n may be provided in the statistic data processing device 12, for example. In this configuration, the statistic data processing device 12 acquires and analyzes the data detected by the detection device.

［統計量データ処理装置］
図２は、本発明の一実施形態に係る統計量データ処理装置１２の概略的な構成を示すブロック図である。
統計量データ処理装置１２は、入力部１１１と、出力部１１２と、記憶部１１３と、通信部１１４と、制御部１１５を備える。
制御部１１５は、データ取得部１３１と、統計量データ処理部１３２と、データ出力制御部１３３を備える。
統計量データ処理部１３２は、観点設定部１５１と、スケール設定部１５２と、統計量データ群生成部１５３と、追加部１５４と、抽出部１５５を備える。 [Statistics data processor]
FIG. 2 is a block diagram showing a schematic configuration of the statistic data processing apparatus 12 according to an embodiment of the present invention.
The statistics data processing device 12 includes an input unit 111, an output unit 112, a storage unit 113, a communication unit 114, and a control unit 115.
The control unit 115 includes a data acquisition unit 131, a statistic data processing unit 132, and a data output control unit 133.
The statistic data processing unit 132 includes a viewpoint setting unit 151, a scale setting unit 152, a statistic data group generation unit 153, an addition unit 154, and an extraction unit 155.

入力部１１１は、外部から情報を入力する。入力部１１１は、例えば、ユーザ（人）により行われる操作を受け付ける操作部を有し、当該操作部により受け付けられた操作に応じた情報を入力する。入力部１１１は、例えば、外部の装置（例えば、記録媒体など）と接続されて当該外部の装置から出力される情報を入力する。
出力部１１２は、情報を出力する。出力部１１２は、例えば、画面を有しており、情報を画面に表示（出力）する。出力部１１２は、例えば、外部の装置（例えば、記録媒体など）と接続されて当該外部の装置に情報を出力する。
記憶部１１３は、情報を記憶する。なお、本実施形態では、記憶部１１３とデータベース１３とは、任意に使い分けられてもよい。
通信部１１４は、情報を通信する。本実施形態では、通信部１１４は、ネットワーク２１を介して、他の装置（例えば、端末装置１１−１〜１１−ｎあるいは別の検出装置）と情報を通信する。 The input unit 111 inputs information from the outside. The input unit 111 includes, for example, an operation unit that receives an operation performed by a user (person), and inputs information according to the operation received by the operation unit. For example, the input unit 111 is connected to an external device (for example, a recording medium) and inputs information output from the external device.
The output unit 112 outputs information. The output unit 112 has a screen, for example, and displays (outputs) information on the screen. The output unit 112 is connected to, for example, an external device (for example, a recording medium) and outputs information to the external device.
The storage unit 113 stores information. In the present embodiment, the storage unit 113 and the database 13 may be arbitrarily used.
The communication unit 114 communicates information. In the present embodiment, the communication unit 114 communicates information with other devices (for example, the terminal devices 11-1 to 11-n or another detection device) via the network 21.

制御部１１５は、統計量データ処理装置１２における各種の制御を行う。
本実施形態では、記憶部１１３は、所定の制御プログラムおよびそのパラメーターの情報を記憶する。また、制御部１１５は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）を用いて構成される。そして、制御部１１５では、ＣＰＵが記憶部１１３に記憶された制御プログラムを、記憶部１１３に記憶されたパラメーターを使用して実行することで、各種の制御を行う。 The control unit 115 performs various controls in the statistic data processing apparatus 12.
In the present embodiment, the storage unit 113 stores information on a predetermined control program and its parameters. The control unit 115 is configured by using a CPU (Central Processing Unit). In the control unit 115, the CPU performs various types of control by executing the control program stored in the storage unit 113 using the parameters stored in the storage unit 113.

なお、統計量データ処理装置１２について、図２に示した各処理部１１１〜１１５を備える構成は一例であり、他の構成が用いられてもよい。例えば、各処理部１１１〜１１５の機能の区分は、説明の便宜上のものであり、必ずしも図２に示した構成に限定されない。 In addition, about the statistics data processing apparatus 12, the structure provided with each process part 111-115 shown in FIG. 2 is an example, and another structure may be used. For example, the classification of the functions of the processing units 111 to 115 is for convenience of explanation, and is not necessarily limited to the configuration shown in FIG.

制御部１１５の機能について説明する。
データ取得部１３１は、解析対象となるデータとして、所定の対象のデータを取得する。
一例として、データ取得部１３１は、通信部１１４により他の装置（例えば、端末装置１１−１〜１１−ｎあるいは別の検出装置）から受信されたデータを、解析対象となるデータとして取得してもよい。
他の例として、データ取得部１３１は、過去に取得されたデータを順次データベース１３に記憶しておき、当該データの処理を行うときに、当該データベース１３から当該データを解析対象となるデータとして取得してもよい。
また、他の例として、所定の対象のデータが統計量データ処理装置１２を経由せずにデータベース１３に記憶される構成が用いられてもよく、この場合、データ取得部１３１は、当該データの処理を行うときに、当該データベース１３から当該データを解析対象となるデータとして取得してもよい。 The function of the control unit 115 will be described.
The data acquisition unit 131 acquires predetermined target data as data to be analyzed.
As an example, the data acquisition unit 131 acquires data received from another device (for example, the terminal devices 11-1 to 11-n or another detection device) by the communication unit 114 as data to be analyzed. Also good.
As another example, the data acquisition unit 131 sequentially stores data acquired in the past in the database 13 and acquires the data from the database 13 as data to be analyzed when processing the data. May be.
As another example, a configuration in which predetermined target data is stored in the database 13 without passing through the statistics data processing device 12 may be used. In this case, the data acquisition unit 131 When performing processing, the data may be acquired from the database 13 as data to be analyzed.

統計量データ処理部１３２は、データ取得部１３１により取得されたデータについて統計的な処理を行い、その結果のデータ（本実施形態において、「統計量データ」ともいう。）を取得する。
なお、統計量データ処理部１３２では、例えば、データ取得部１３１によりリアルタイムで次々と取得されるデータ（新たに増えていくデータのまとまり）について処理を行う場合があってもよく、また、データ取得部１３１により取得された過去のデータ（新たに増えないデータのまとまり）について処理を行う場合があってもよい。 The statistic data processing unit 132 performs a statistical process on the data acquired by the data acquisition unit 131, and acquires the resulting data (also referred to as “statistical data” in the present embodiment).
Note that the statistic data processing unit 132 may process, for example, data that is successively acquired in real time by the data acquisition unit 131 (a collection of newly increasing data). There may be a case where processing is performed on past data (a group of data that does not newly increase) acquired by the unit 131.

データ出力制御部１３３は、出力対象となるデータを出力部１１２により出力する制御を行う。この出力としては、例えば、文字、図形あるいはグラフなどの表示出力が用いられる。
出力対象となるデータとしては、任意のデータが用いられてもよく、例えば、データ取得部１３１により取得されたデータ、あるいは、統計量データ処理部１３２による処理により得られた結果のデータなどが用いられてもよい。また、統計量データ処理部１３２による処理により得られた結果のデータとしては、例えば、統計量データ群生成部１５３による処理により得られた結果のデータ、追加部１５４による処理により得られた結果のデータ、あるいは、抽出部１５５による処理により得られた結果のデータのうちの１以上が用いられてもよい。 The data output control unit 133 controls the output unit 112 to output data to be output. As this output, for example, display output of characters, figures, graphs or the like is used.
Arbitrary data may be used as the data to be output. For example, data acquired by the data acquisition unit 131 or data obtained as a result of processing by the statistic data processing unit 132 is used. May be. Further, as the result data obtained by the process by the statistic data processing unit 132, for example, the result data obtained by the process by the statistic data group generation unit 153, the result obtained by the process by the addition unit 154, One or more of data or result data obtained by processing by the extraction unit 155 may be used.

統計量データ処理部１３２について説明する。
観点設定部１５１は、解析対象のデータについて、観点（項目）を設定する。観点設定部１５１は、解析対象のデータについて、２個以上の観点を設定してもよい。
ここで、観点としては、任意の観点が用いられてもよく、例えば、時間（時刻）、領域（地域）、デバイス種別などが用いられてもよい。
また、観点設定部１５１は、例えば、あらかじめ定められた観点を設定してもよく、あるいは、ユーザなどから指示された観点を設定してもよい。観点があらかじめ定められる場合には、例えば、当該観点を特定する情報が記憶部１１３に記憶される。 The statistic data processing unit 132 will be described.
The viewpoint setting unit 151 sets viewpoints (items) for the analysis target data. The viewpoint setting unit 151 may set two or more viewpoints for the data to be analyzed.
Here, any viewpoint may be used as the viewpoint, and for example, time (time), area (region), device type, or the like may be used.
In addition, the viewpoint setting unit 151 may set a predetermined viewpoint, or may set a viewpoint instructed by a user or the like. When the viewpoint is determined in advance, for example, information specifying the viewpoint is stored in the storage unit 113.

スケール設定部１５２は、それぞれの観点について、スケール（粒度）を設定する。スケール設定部１５２は、それぞれの観点について、２個以上のスケールを設定してもよい。
ここで、スケールとしては、任意の大きさを有するスケールが用いられてもよい。
例えば、時間のスケールとして、１秒のスケール、１分のスケール、１時間のスケール、１日のスケール、あるいは、他の任意の大きさのスケールが用いられてもよい。
例えば、領域のスケールとして、地区のスケール、市区町村のスケール、都道府県のスケール、全国のスケール、あるいは、他の任意の大きさのスケールが用いられてもよい。
例えば、デバイス種別のスケールとして、機種のスケール、メーカーのスケール、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）のスケール、あるいは、他の任意の種別（属性）のスケールが用いられてもよい。
スケール設定部１５２は、例えば、それぞれの観点について、あらかじめ定められたスケールを設定してもよく、あるいは、ユーザなどから指示されたスケールを設定してもよい。スケールがあらかじめ定められる場合には、例えば、当該スケールを特定する情報が記憶部１１３に記憶される。 The scale setting unit 152 sets a scale (granularity) for each viewpoint. The scale setting unit 152 may set two or more scales for each viewpoint.
Here, as the scale, a scale having an arbitrary size may be used.
For example, a scale of 1 second, a scale of 1 minute, a scale of 1 hour, a scale of 1 day, or any other scale may be used as the time scale.
For example, an area scale, a city scale, a prefecture scale, a national scale, or any other scale may be used as the area scale.
For example, as the device type scale, a model scale, a manufacturer scale, an OS (Operating System) scale, or another arbitrary type (attribute) scale may be used.
For example, the scale setting unit 152 may set a predetermined scale for each viewpoint, or may set a scale instructed by a user or the like. When the scale is determined in advance, for example, information for specifying the scale is stored in the storage unit 113.

統計量データ群生成部１５３は、データ取得部１３１により取得されたデータについて、観点設定部１５１により設定された観点およびスケール設定部１５２により設定されたスケールに基づいて、統計的な処理を行うことで、複数の統計量データの集合（本実施形態において、「統計量データ群」ともいう。）を生成する。
本実施形態では、統計量データ群生成部１５３は、多次元およびマルチスケールのデータ解析を行った結果のデータを、統計量データ群（本実施形態において、「多次元マルチスケール統計量データ群」ともいう。）として生成する。
ここで、本実施形態では、次元は観点を表わしており、多次元は複数の観点があることを表わしている。
また、本実施形態では、マルチスケールは、複数のスケールがあることを表わしている。 The statistics data group generation unit 153 performs statistical processing on the data acquired by the data acquisition unit 131 based on the viewpoint set by the viewpoint setting unit 151 and the scale set by the scale setting unit 152. Thus, a set of a plurality of statistical data (also referred to as “statistical data group” in the present embodiment) is generated.
In the present embodiment, the statistic data group generation unit 153 converts the data obtained as a result of multidimensional and multiscale data analysis into a statistic data group (in this embodiment, “multidimensional multiscale statistic data group”). It is also generated as
Here, in this embodiment, the dimension represents a viewpoint, and the multidimension represents a plurality of viewpoints.
In the present embodiment, the multi-scale indicates that there are a plurality of scales.

追加部１５４は、既に生成された統計量データ群（例えば、多次元マルチスケール統計量データ群）に対して、データ取得部１３１により取得されたデータ（新たなデータ）を追加する場合に、観点設定部１５１により設定された観点およびスケール設定部１５２により設定されたスケールに基づいて、統計的な処理を行うことで、データ追加後の統計量データ群（例えば、多次元マルチスケール統計量データ群）を生成する。 When the adding unit 154 adds the data (new data) acquired by the data acquiring unit 131 to the already generated statistical data group (for example, the multidimensional multiscale statistical data group), the viewpoint By performing statistical processing based on the viewpoint set by the setting unit 151 and the scale set by the scale setting unit 152, a statistical data group after data addition (for example, a multi-dimensional multi-scale statistical data group) ) Is generated.

例えば、複数の装置（検出装置など）から全体として時系列となるデータが独立に送信されるような場合に、データ取得部１３１により取得される当該データの順序が時系列ではなくなる場合があり得る。この場合、統計量データ処理装置１２では、このような時系列データのすべてが取得されてから解析を行うと、その処理結果の出力が遅くなることがあり得る。このようなときに、既に取得されているデータに基づいて統計量データ群生成部１５３により統計量データ群を生成して出力し、その後に該当するデータが新たに到着した後に、追加部１５４によりデータ追加後の統計量データ群を取得して出力（例えば、出力内容を更新）すると、効果的である。 For example, when data in time series as a whole is independently transmitted from a plurality of devices (such as detection devices), the order of the data acquired by the data acquisition unit 131 may not be in time series. . In this case, if the statistic data processing device 12 performs analysis after all such time-series data has been acquired, the output of the processing result may be delayed. In such a case, the statistic data group generation unit 153 generates and outputs a statistic data group based on the already acquired data, and then the corresponding data newly arrives thereafter. It is effective to obtain and output (for example, update the output contents) the statistical data group after the data is added.

抽出部１５５は、既に生成された統計量データ群（例えば、多次元マルチスケール統計量データ群）に基づいて、所定の抽出条件に合ったデータ（例えば、統計量データでもよく、あるいは、統計量データ群でもよい。）を抽出する。 The extraction unit 155 may be data (for example, statistical data, or statistical data that meets a predetermined extraction condition based on a statistical data group that has already been generated (for example, a multidimensional multiscale statistical data group). It may be a data group).

［統計量データ群のデータ構造の例］
図３は、本発明の一実施形態に係る統計量データ群２０１の一例のデータ構造を模式的に示す図である。
図３の例では、当該統計量データ群２０１は、統計量データ処理部１３２により時系列データが処理された結果である多次元マルチスケール統計量データ群となっている。当該統計量データ群２０１は、時系列データを対象として多次元およびマルチスケールに拡張された分布データ（本実施形態では、統計量データ）の構造を有する。
図３の例では、複数の観点として、時間、領域、デバイス種別が用いられている。また、それぞれの観点について、複数のスケール（マルチスケール）が用いられている。 [Example of data structure of statistical data group]
FIG. 3 is a diagram schematically showing an example of the data structure of the statistic data group 201 according to the embodiment of the present invention.
In the example of FIG. 3, the statistical data group 201 is a multidimensional multiscale statistical data group that is a result of processing time-series data by the statistical data processing unit 132. The statistic data group 201 has a structure of distribution data (in this embodiment, statistic data) extended to multi-dimensional and multi-scale for time series data.
In the example of FIG. 3, time, region, and device type are used as a plurality of viewpoints. Further, for each viewpoint, a plurality of scales (multiscale) are used.

図３の例では、時間に関する複数のスケールとして、時間スケールｓ０、時間スケールｓ１、時間スケールｓ２が用いられている。一例として、時間スケールｓ０は「１０秒」であり、時間スケールｓ１は「２０秒」であり、時間スケールｓ２は「４０秒」である。
また、領域に関する複数のスケールとして、領域スケールＡ１、領域スケールＡ２、領域スケールＡ３が用いられている。一例として、領域スケールＡ１は「東京」であり、領域スケールＡ２は「関東」であり、領域スケールＡ３は「日本（全国）」である。
また、デバイス種別に関する複数のスケールとして、デバイス種別スケールＤ１、デバイス種別スケールＤ２が用いられている。一例として、デバイス種別スケールＤ１は「特定の機種Ａ１」であり、デバイス種別スケールＤ２は「特定のメーカーＢ１」である。 In the example of FIG. 3, a time scale s0, a time scale s1, and a time scale s2 are used as a plurality of time scales. As an example, the time scale s0 is “10 seconds”, the time scale s1 is “20 seconds”, and the time scale s2 is “40 seconds”.
Further, as a plurality of scales related to the region, a region scale A1, a region scale A2, and a region scale A3 are used. As an example, the area scale A1 is “Tokyo”, the area scale A2 is “Kanto”, and the area scale A3 is “Japan (nationwide)”.
Further, a device type scale D1 and a device type scale D2 are used as a plurality of scales related to the device type. As an example, the device type scale D1 is “specific model A1”, and the device type scale D2 is “specific manufacturer B1”.

また、図３の例では、時刻を表す軸の方向（矢印）を示してある。時刻の範囲（時間の区分）として、時刻範囲ｔ０、時刻範囲ｔ１、時刻範囲ｔ２、時刻範囲ｔ３が用いられている。時刻範囲ｔ０は「０秒以上１０秒未満」であり、時刻範囲ｔ１は「１０秒以上２０秒未満」であり、時刻範囲ｔ２は「２０秒以上３０秒未満」であり、時刻範囲ｔ３は「３０秒以上４０秒未満」である。
なお、時刻の初期値（本実施形態では、０秒）としては、任意のタイミングが用いられてもよい。 In the example of FIG. 3, the direction (arrow) of the axis representing the time is shown. As a time range (time division), a time range t0, a time range t1, a time range t2, and a time range t3 are used. The time range t0 is “0 seconds to less than 10 seconds”, the time range t1 is “10 seconds to less than 20 seconds”, the time range t2 is “20 seconds to less than 30 seconds”, and the time range t3 is “ 30 seconds or more and less than 40 seconds ".
An arbitrary timing may be used as the initial time value (0 seconds in the present embodiment).

ここで、１個の時間スケールと、１個の領域スケールと、１個のデバイス種別スケールが特定されて、当該時間スケールに応じた１個の時刻範囲が特定されると、単位となる統計量データ（本実施形態において、「単位統計量データ」ともいう。）が特定される。
当該時間スケールに応じた１個の時刻範囲としては、本実施形態では、当該時刻範囲の上限値が当該時間スケールの整数倍に相当する値にある時刻範囲である。具体例として、時間スケールｓ０（＝１０秒）の場合には、時刻範囲ｔ０（＝０秒以上１０秒未満）、時刻範囲ｔ１（＝１０秒以上２０秒未満）、時刻範囲ｔ２（＝２０秒以上３０秒未満）、または、時刻範囲ｔ３（＝３０秒以上４０秒未満）のいずれかとなる。時間スケールｓ１（＝２０秒）の場合には、時刻範囲ｔ１（＝１０秒以上２０秒未満）、または、時刻範囲ｔ３（＝３０秒以上４０秒未満）のいずれかとなる。時間スケールｓ２（＝４０秒）の場合には、時刻範囲ｔ３（＝３０秒以上４０秒未満）となる。 Here, when one time scale, one region scale, and one device type scale are specified, and one time range corresponding to the time scale is specified, a unit statistic Data (also referred to as “unit statistic data” in the present embodiment) is specified.
In the present embodiment, the time range corresponding to the time scale is a time range in which the upper limit value of the time range is a value corresponding to an integral multiple of the time scale. As a specific example, in the case of the time scale s0 (= 10 seconds), the time range t0 (= 0 second to less than 10 seconds), the time range t1 (= 10 seconds to less than 20 seconds), the time range t2 (= 20 seconds). Or less than 30 seconds) or time range t3 (= 30 seconds or more and less than 40 seconds). In the case of the time scale s1 (= 20 seconds), it is either the time range t1 (= 10 seconds or more and less than 20 seconds) or the time range t3 (= 30 seconds or more and less than 40 seconds). In the case of the time scale s2 (= 40 seconds), the time range is t3 (= 30 seconds to less than 40 seconds).

１個の時間スケールと、１個の領域スケールと、１個のデバイス種別スケールと、当該時間スケールに応じた１個の時刻範囲によって特定される単位統計量データは、当該時刻範囲の最大の時刻から当該時間スケールだけ過去に遡った時刻までの間に属し、かつ、当該領域スケールに属し、かつ、当該デバイス種別スケールに属するデータについて、統計量データ処理部１３２によって統計的な演算を行うことにより得られた統計量データに相当する。 The unit statistic data specified by one time scale, one area scale, one device type scale, and one time range corresponding to the time scale is the maximum time in the time range. The statistical data processing unit 132 performs a statistical calculation on the data belonging to the time scale and the time that goes back to the past by the time scale and belonging to the area scale and the device type scale. It corresponds to the obtained statistical data.

図３の例では、時間スケールｓ０かつ時刻範囲ｔ３に該当する６個の単位統計量データ２１１〜２１３、２２１〜２２３と、時間スケールｓ１かつ時刻範囲ｔ３に該当する６個の単位統計量データ３１１〜３１３、３２１〜３２３と、時間スケールｓ２かつ時刻範囲ｔ３に該当する６個の単位統計量データ４１１〜４１３、４２１〜４２３だけに符号を付してあり、他の単位統計量データについては符号を省略してある。 In the example of FIG. 3, six unit statistical data 211 to 213 and 221 to 223 corresponding to the time scale s0 and the time range t3, and six unit statistical data 311 corresponding to the time scale s1 and the time range t3. .., 313, 321-323, and the six unit statistical data 411-413, 421-423 corresponding to the time scale s2 and the time range t3 are provided with symbols, and the other unit statistical data are encoded. Is omitted.

一例として、単位統計量データ２１１は、時間スケールＳ０（１０秒）、領域スケールＡ１（東京）、デバイス種別スケールＤ１（特定の機種Ａ１）、時刻範囲ｔ３（３０秒以上４０秒未満）に該当する。そして、当該単位統計量データ２１１は、時刻が３０秒以上４０秒未満に属し、かつ、領域が東京に属し、かつ、デバイス種別が特定の機種Ａ１に属するデータに基づいて得られた統計量データである。すなわち、当該データは、時刻が３０秒以上４０秒未満に発生し、東京に存在する端末装置１１−１〜１１−ｎにおいて発生し、デバイス種別が特定の機種Ａ１である当該端末装置１１−１〜１１−ｎにおいて発生したデータであることを意味する。当該単位統計量データは、このようなデータの集合を用いて得られた統計量データである。 As an example, the unit statistics data 211 corresponds to the time scale S0 (10 seconds), the area scale A1 (Tokyo), the device type scale D1 (specific model A1), and the time range t3 (30 seconds to less than 40 seconds). . The unit statistical data 211 is statistical data obtained based on data belonging to a time belonging to 30 seconds or more and less than 40 seconds, a region belonging to Tokyo, and a device type belonging to a specific model A1. It is. That is, the data is generated in the terminal devices 11-1 to 11-n that exist in Tokyo at a time of 30 seconds or more and less than 40 seconds, and the terminal device 11-1 whose device type is the specific model A1. Means that the data occurred in -11-n. The unit statistical data is statistical data obtained using such a set of data.

他の例として、単位統計量データ３２２は、時間スケールＳ１（２０秒）、領域スケールＡ２（関東）、デバイス種別スケールＤ２（特定のメーカーＢ１）、時刻範囲ｔ３（３０秒以上４０秒未満）に該当する。そして、当該単位統計量データ３２２は、時刻が２０秒以上４０秒未満に属し、かつ、領域が関東に属し、かつ、デバイス種別が特定のメーカーＢ１に属するデータに基づいて得られた統計量データである。すなわち、当該データは、時刻が２０秒以上４０秒未満に発生し、関東に存在する端末装置１１−１〜１１−ｎにおいて発生し、デバイス種別が特定のメーカーＢ１である当該端末装置１１−１〜１１−ｎにおいて発生したデータであることを意味する。当該単位統計量データは、このようなデータの集合を用いて得られた統計量データである。 As another example, the unit statistic data 322 includes a time scale S1 (20 seconds), an area scale A2 (Kanto), a device type scale D2 (specific manufacturer B1), and a time range t3 (30 seconds to less than 40 seconds). Applicable. The unit statistical data 322 is statistical data obtained based on data belonging to a time belonging to 20 seconds or more and less than 40 seconds, a region belonging to Kanto, and a device type belonging to a specific manufacturer B1. It is. That is, the data is generated at a time of 20 seconds or more and less than 40 seconds, is generated at the terminal devices 11-1 to 11-n existing in Kanto, and the terminal device 11-1 whose device type is the specific manufacturer B1. Means that the data occurred in -11-n. The unit statistical data is statistical data obtained using such a set of data.

他の例として、単位統計量データ４１３は、時間スケールＳ２（４０秒）、領域スケールＡ３（日本）、デバイス種別スケールＤ１（特定の機種Ａ１）、時刻範囲ｔ３（３０秒以上４０秒未満）に該当する。そして、当該単位統計量データ４１３は、時刻が０秒以上４０秒未満に属し、かつ、領域が日本に属し、かつ、デバイス種別が特定の機種Ａ１に属するデータに基づいて得られた統計量データである。すなわち、当該データは、時刻が０秒以上４０秒未満に発生し、日本に存在する端末装置１１−１〜１１−ｎにおいて発生し、デバイス種別が特定の機種Ａ１である当該端末装置１１−１〜１１−ｎにおいて発生したデータであることを意味する。当該単位統計量データは、このようなデータの集合を用いて得られた統計量データである。 As another example, the unit statistic data 413 includes a time scale S2 (40 seconds), an area scale A3 (Japan), a device type scale D1 (specific model A1), and a time range t3 (30 seconds to less than 40 seconds). Applicable. The unit statistical data 413 is statistical data obtained based on data belonging to a time belonging to 0 seconds or more and less than 40 seconds, a region belonging to Japan, and a device type belonging to a specific model A1. It is. That is, the data is generated at a time between 0 seconds and less than 40 seconds, is generated in the terminal devices 11-1 to 11-n existing in Japan, and the terminal device 11-1 whose device type is the specific model A1. Means that the data occurred in -11-n. The unit statistical data is statistical data obtained using such a set of data.

ここで、単位統計量データとしては、任意の統計量のデータが用いられてもよく、例えば、順序統計に関する任意の値のデータが用いられてもよく、あるいは、平均値のデータが用いられてもよい。
順序統計に関する値としては、例えば、中央値が用いられてもよい。なお、一般に、処理対象となる複数のデータが同じである場合、平均値を取得（演算）する処理よりも、中央値を取得する処理の方が、処理時間が短くなると考えられる。
また、順序統計に関する値としては、例えば、累積分布関数（ＣＤＦ：ＣｕｍｕｌａｔｉｖｅＤｉｓｔｒｉｂｕｔｉｏｎＦｕｎｃｔｉｏｎ）の値が用いられてもよく、あるいは、確率分布関数（ＰＤＦ：ＰｒｏｂａｂｉｌｉｔｙＤｅｎｓｉｔｙＦｕｎｃｔｉｏｎ）の値が用いられてもよい。 Here, as the unit statistic data, data of an arbitrary statistic may be used. For example, data of an arbitrary value related to order statistics may be used, or data of an average value may be used. Also good.
As a value related to order statistics, for example, a median value may be used. In general, when a plurality of pieces of data to be processed are the same, it is considered that the processing time for the median value is shorter than the processing for acquiring (calculating) the average value.
In addition, as a value related to order statistics, for example, a value of a cumulative distribution function (CDF) may be used, or a value of a probability distribution function (PDF: Probability Density Function) may be used. .

図３の例では、それぞれの四角（直方体あるいは立方体）の単位が単位統計量データ（単位統計量データ２１１〜２１３、２２１〜２２３、３１１〜３１３、３２１〜３２３、４１１〜４１３、４２１〜４２３など）に相当する。
なお、統計量データ群生成部１５３は、任意の手法を用いて、単位統計量データを取得してもよく、例えば、既存の技術であるｔ−ｄｉｇｅｓｔの技術（例えば、非特許文献１など参照。）を用いて単位統計量データを演算して取得してもよい。本実施形態では、統計量データ群は、複数の単位統計量データを含んで構成される。 In the example of FIG. 3, the unit of each square (cuboid or cube) is unit statistical data (unit statistical data 211-213, 221-223, 311-313, 321-323, 411-413, 421-423, etc. ).
Note that the statistic data group generation unit 153 may acquire unit statistic data using an arbitrary technique. For example, the t-digest technique (for example, see Non-Patent Document 1 or the like) that is an existing technique. .) May be used to calculate and obtain unit statistics data. In the present embodiment, the statistical data group includes a plurality of unit statistical data.

また、１個の観点について用いられる複数のスケールとしては、例えば、すべてについて互いに包含関係にある複数のスケールが用いられてもよく、あるいは、すべてについて互いに包含関係にない複数のスケールが用いられてもよく、あるいは、一部のみについて包含関係にある複数のスケールが用いられてもよい。
すべてについて互いに包含関係にある複数のスケールとしては、領域のスケールを例とすると、例えば、「東京」、「関東」、「日本」がある。
すべてについて互いに包含関係にない複数のスケールとしては、領域のスケールを例とすると、例えば、「東京」、「千葉」、「茨城」がある。
一部のみについて包含関係にある複数のスケールとしては、領域のスケールを例とすると、例えば、「東京」、「関東」（東京を含む。）、「大阪」がある。 In addition, as a plurality of scales used for one viewpoint, for example, a plurality of scales that are inclusive relation to each other may be used, or a plurality of scales that are not inclusive relation to each other are used. Alternatively, a plurality of scales that are inclusive in only part of them may be used.
As a plurality of scales that are inclusive of each other, for example, the scale of the region is “Tokyo”, “Kanto”, and “Japan”.
Examples of the plurality of scales that are not inclusive of each other include, for example, “Tokyo”, “Chiba”, and “Ibaraki”, taking the scale of the region as an example.
As a plurality of scales that are inclusive in only a part, taking the scale of the area as an example, there are, for example, “Tokyo”, “Kanto” (including Tokyo), and “Osaka”.

［追加処理の例］
図４は、本発明の一実施形態に係る統計量データ群２０１に対する追加処理の一例を説明するための図である。
追加処理は、例えば、新たな時系列データが発生した場合に、適切な単位統計量データにそのサンプル（当該データ）を加える処理である。追加処理は、例えば、ｔ−ｄｉｇｅｓｔの技術（例えば、非特許文献１など参照。）におけるデータ構造が有するＡＤＤの処理に相当してもよい。ＡＤＤの処理では、新たに発生した時系列データを各次元から評価し、そのデータが属する部分集合を列挙し、列挙された各部分集合に対してそのデータを加えるオペレーションを実行する。 [Example of additional processing]
FIG. 4 is a diagram for explaining an example of the addition process for the statistic data group 201 according to the embodiment of the present invention.
For example, when new time-series data is generated, the addition process is a process of adding the sample (the data) to appropriate unit statistic data. The additional process may correspond to, for example, an ADD process included in a data structure in a t-digest technique (see, for example, Non-Patent Document 1). In the ADD process, newly generated time-series data is evaluated from each dimension, a subset to which the data belongs is listed, and an operation of adding the data to each listed subset is executed.

図４の例では、図３に示された統計量データ群２０１と同じものを示してある。
追加部１５４は、統計量データ群２０１に対して新たなデータを追加する場合、当該新たなデータが属するすべての単位統計量データを演算し直すことで生成し直し、その結果を反映した統計量データ群（統計量データ群２０１を更新したデータ）を生成する。
図４の例では、新たなデータが、時刻範囲ｔ２、領域スケールＡ１、デバイス種別スケールＤ２に属するとする。この場合、追加部１５４は、当該新たなデータについて、時間スケールＳ０に属する単位統計量データ５１１と、時間スケールＳ１に属する単位統計量データ５１２と、時間スケールＳ２に属する単位統計量データ５１３のそれぞれを生成し直す。
本実施形態では、追加部１５４は、観点設定部１５１により設定された観点およびスケール設定部１５２により設定されたスケールに基づいて、追加処理を行う。 In the example of FIG. 4, the same data as the statistical data group 201 shown in FIG. 3 is shown.
When adding new data to the statistic data group 201, the adding unit 154 re-generates all the unit statistic data to which the new data belongs by recalculating the statistic reflecting the result. A data group (data obtained by updating the statistic data group 201) is generated.
In the example of FIG. 4, it is assumed that new data belongs to the time range t2, the area scale A1, and the device type scale D2. In this case, for the new data, the adding unit 154 includes unit statistics data 511 belonging to the time scale S0, unit statistics data 512 belonging to the time scale S1, and unit statistics data 513 belonging to the time scale S2. Is regenerated.
In the present embodiment, the adding unit 154 performs an adding process based on the viewpoint set by the viewpoint setting unit 151 and the scale set by the scale setting unit 152.

ここで、追加部１５４により統計量データ群２０１に対して新たなデータを追加する処理は、統計量データ処理装置１２において、例えば、データ取得部１３１により新たなデータを取得したことに応じて（自動的に）行われる構成が用いられてもよく、あるいは、ユーザなどによる所定の指示が入力部１１１により受け付けられたことに応じて行われる構成が用いられてもよい。 Here, the process of adding new data to the statistic data group 201 by the adding unit 154 is performed in response to the acquisition of new data by, for example, the data acquisition unit 131 in the statistic data processing device 12 ( A configuration that is performed automatically) may be used, or a configuration that is performed in response to a predetermined instruction from the user or the like being received by the input unit 111 may be used.

［抽出処理の例］
図５は、本発明の一実施形態に係る統計量データ群２０１に対する抽出処理の一例を説明するための図である。
抽出処理は、例えば、１個以上の単位統計量データからデータ（例えば、統計量データでもよく、あるいは、統計量データ群でもよい。）を抽出する処理である。抽出処理は、例えば、ｔ−ｄｉｇｅｓｔの技術（例えば、非特許文献１など参照。）におけるデータ構造が有するＧＥＴの処理に相当してもよい。ＧＥＴの処理では、抽出元の範囲（例えば、母集団のなかにおける部分集合）を指定して、当該範囲に含まれるデータについて当該範囲における統計量を取得するオペレーションを実行する。 [Example of extraction processing]
FIG. 5 is a diagram for explaining an example of extraction processing for the statistic data group 201 according to an embodiment of the present invention.
The extraction processing is processing for extracting data (for example, statistical data or a statistical data group) from one or more unit statistical data. The extraction process may correspond to, for example, a GET process included in the data structure in the t-digest technique (see, for example, Non-Patent Document 1). In the GET processing, an extraction source range (for example, a subset in the population) is designated, and an operation for acquiring a statistic in the range is executed for data included in the range.

図５の例では、図３に示された統計量データ群２０１と同じものを示してある。
抽出部１５５は、統計量データ群２０１から抽出対象の範囲（図５の例では、抽出範囲６１１）に含まれるデータ（所定の対象のデータ）を抽出し、抽出されたデータに基づいて単位統計量データを演算することで生成し、生成された単位統計量データの集合である統計量データ群を生成する。
本実施形態では、抽出部１５５は、観点設定部１５１により設定された観点およびスケール設定部１５２により設定されたスケールに基づいて、抽出処理を行う。 In the example of FIG. 5, the same data as the statistical data group 201 shown in FIG. 3 is shown.
The extraction unit 155 extracts data (predetermined target data) included in the extraction target range (extraction range 611 in the example of FIG. 5) from the statistical data group 201, and unit statistics based on the extracted data A statistic data group that is a set of generated unit statistic data is generated by calculating the quantity data.
In the present embodiment, the extraction unit 155 performs extraction processing based on the viewpoint set by the viewpoint setting unit 151 and the scale set by the scale setting unit 152.

ここで、本実施形態では、複数の単位統計量データのまとまりと他の１個以上の単位統計量データのまとまりとが１対１で対応している場合、抽出部１５５は、これら複数の単位統計量データに基づいて抽出処理を行ってもよく、あるいは、当該他の１個以上の単位統計量データに基づいて抽出処理を行ってもよく、いずれにおいても同じ結果が得られる。具体的に、図５の例では、複数の単位統計量データのまとまりが、時刻範囲ｔ２、ｔ３かつ時間スケールｓ０かつ領域スケールＡ１、Ａ２、Ａ３かつデバイス種別スケールＤ１、Ｄ２に含まれる１２個の単位統計量データのまとまりに相当し、また、他の１個以上の単位統計量データのまとまりが、時刻範囲ｔ２、ｔ３かつ時間スケールｓ１かつ領域スケールＡ１、Ａ２、Ａ３かつデバイス種別スケールＤ１、Ｄ２に含まれる６個の単位統計量データのまとまりに相当する。抽出部１５５では、通常、抽出対象の単位統計量データの数が少ない方が、処理の効率化が図られる。 Here, in the present embodiment, when a group of a plurality of unit statistics data and a group of one or more other unit statistics data are in a one-to-one correspondence, the extraction unit 155 determines the plurality of units. Extraction processing may be performed based on the statistical data, or extraction processing may be performed based on the one or more other unit statistical data, and the same result is obtained in any case. Specifically, in the example of FIG. 5, a plurality of unit statistic data sets include 12 time ranges t2, t3, time scale s0, area scales A1, A2, A3, and device type scales D1, D2. It corresponds to a group of unit statistic data, and another group of one or more unit statistic data includes time ranges t2, t3, time scale s1, area scales A1, A2, A3, and device type scales D1, D2. Corresponds to a group of six unit statistic data included in. In the extraction unit 155, the processing efficiency is usually improved when the number of unit statistics data to be extracted is smaller.

なお、抽出部１５５により統計量データ群２０１から抽出対象の範囲（抽出範囲）に含まれるデータを抽出する処理は、統計量データ処理装置１２において、例えば、制御プログラムなどにあらかじめ定められた条件が満たされたことを判定したことに応じて（自動的に）行われる構成が用いられてもよく、あるいは、ユーザなどによる所定の指示が入力部１１１により受け付けられたことに応じて行われる構成が用いられてもよい。 Note that the process of extracting data included in the extraction target range (extraction range) from the statistics data group 201 by the extraction unit 155 is performed under the condition set in advance in, for example, a control program in the statistics data processing device 12. A configuration that is (automatically) performed in response to determining that the condition is satisfied may be used, or a configuration that is performed in response to a predetermined instruction from the user or the like being received by the input unit 111. May be used.

［統計量データ処理装置において行われる処理の例］
図６は、本発明の一実施形態に係る統計量データ処理装置１２において行われる統計量データ群を生成するための処理の手順の一例を示す図である。
なお、本例は一例であり、他の任意の処理手順が用いられてもよい。 [Example of processing performed in statistical data processing apparatus]
FIG. 6 is a diagram illustrating an example of a processing procedure for generating a statistic data group performed in the statistic data processing apparatus 12 according to an embodiment of the present invention.
In addition, this example is an example and other arbitrary processing procedures may be used.

（ステップＳ１）
データ取得部１３１は、解析対象のデータを取得する。
（ステップＳ２）
観点設定部１５１は、解析に使用する観点を設定する。
（ステップＳ３）
スケール設定部１５２は、それぞれの観点について、解析に使用するスケールを設定する。
（ステップＳ４）
統計量データ群生成部１５３は、データ取得部１３１により取得されたデータについて、観点設定部１５１により設定された観点およびスケール設定部１５２により設定されたスケールに基づいて、統計量データ群を生成する。
（ステップＳ５）
データ出力制御部１３３は、このような処理に関する任意のデータ（例えば、統計量データ群）を画面に出力するように制御する。 (Step S1)
The data acquisition unit 131 acquires data to be analyzed.
(Step S2)
The viewpoint setting unit 151 sets a viewpoint used for analysis.
(Step S3)
The scale setting unit 152 sets a scale used for analysis for each viewpoint.
(Step S4)
The statistic data group generation unit 153 generates a statistic data group for the data acquired by the data acquisition unit 131 based on the viewpoint set by the viewpoint setting unit 151 and the scale set by the scale setting unit 152. .
(Step S5)
The data output control unit 133 performs control so that arbitrary data (for example, a statistic data group) related to such processing is output on the screen.

ここで、統計量データ群を生成するための処理を行うタイミングとしては、任意のタイミングが用いられてもよく、例えば、あらかじめ定められた一定周期のタイミングが用いられてもよく、あるいは、ユーザなどから受け付けられた指示に応じたタイミングが用いられてもよい。
また、統計量データ群を生成するための処理は、例えば、任意の１個以上の観点について、スケールごとに異なるタイミングで行われてもよい。一例として、観点「領域」についてスケール「東京」およびスケール「大阪」が用いられる場合、スケール「東京」に該当する単位統計量データを生成する処理のタイミングと、スケール「大阪」に該当する単位統計量データを生成する処理のタイミングとを異ならせて、これら両方の処理が完了した後に、これら両方の処理の結果を組み合わせた統計量データ群を生成する構成が用いられてもよい。 Here, any timing may be used as the timing for performing the process for generating the statistic data group. For example, a predetermined cycle timing may be used, or a user or the like may be used. The timing according to the instruction received from may be used.
Moreover, the process for generating the statistical data group may be performed at different timings for each scale, for example, for any one or more viewpoints. As an example, when the scale “Tokyo” and the scale “Osaka” are used for the viewpoint “area”, the timing of processing for generating unit statistical data corresponding to the scale “Tokyo” and the unit statistics corresponding to the scale “Osaka” A configuration may be used in which a statistical data group is generated by combining the results of both processes after the completion of both processes with different timings for generating the quantity data.

図７は、本発明の一実施形態に係る統計量データ処理装置１２において行われる統計量データ群に対してデータを追加するための処理の手順の一例を示す図である。
なお、本例は一例であり、他の任意の処理手順が用いられてもよい。 FIG. 7 is a diagram illustrating an example of a processing procedure for adding data to a statistical data group performed in the statistical data processing apparatus 12 according to an embodiment of the present invention.
In addition, this example is an example and other arbitrary processing procedures may be used.

（ステップＳ１１）
追加部１５４は、データを追加する先となる統計量データ群を取得する。
（ステップＳ１２）
データ取得部１３１は、追加する解析対象のデータ（追加対象データ）を取得する。
（ステップＳ１３）
観点設定部１５１は、解析に使用する観点を設定する。
（ステップＳ１４）
スケール設定部１５２は、それぞれの観点について、解析に使用するスケールを設定する。
（ステップＳ１５）
追加部１５４は、取得された統計量データ群に対して、データ取得部１３１により取得されたデータを、観点設定部１５１により設定された観点およびスケール設定部１５２により設定されたスケールに基づいて、追加することで、統計量データ群（データ追加後の統計量データ群）を生成する。
（ステップＳ１６）
データ出力制御部１３３は、このような処理に関する任意のデータ（例えば、データ追加後の統計量データ群）を画面に出力するように制御する。 (Step S11)
The adding unit 154 acquires a statistic data group to which data is added.
(Step S12)
The data acquisition unit 131 acquires analysis target data (addition target data) to be added.
(Step S13)
The viewpoint setting unit 151 sets a viewpoint used for analysis.
(Step S14)
The scale setting unit 152 sets a scale used for analysis for each viewpoint.
(Step S15)
The adding unit 154 adds the data acquired by the data acquisition unit 131 to the acquired statistical data group based on the viewpoint set by the viewpoint setting unit 151 and the scale set by the scale setting unit 152. By adding, a statistic data group (statistic data group after data addition) is generated.
(Step S16)
The data output control unit 133 performs control so that arbitrary data related to such processing (for example, a statistic data group after data addition) is output to the screen.

ここで、統計量データ群に対してデータを追加するための処理を行うタイミングとしては、任意のタイミングが用いられてもよく、例えば、あらかじめ定められた一定周期のタイミングが用いられてもよく、あるいは、ユーザなどから受け付けられた指示に応じたタイミングが用いられてもよい。
一例として、まず、図６に示される統計量データ群を生成するための処理を行い、その後に、一定周期のタイミングで繰り返して、図７に示される当該統計量データ群に対してデータを追加するための処理を行う構成が用いられてもよい。この構成では、まず、既に取得されているデータに基づいて統計量データ群を生成し、その後、一定周期ごとに、新たに取得されたデータを用いて当該統計量データ群を更新していくことができる。 Here, as the timing for performing the process for adding data to the statistical data group, any timing may be used, for example, a predetermined period may be used. Or the timing according to the instruction | indication received from the user etc. may be used.
As an example, first, processing for generating the statistical data group shown in FIG. 6 is performed, and then data is added to the statistical data group shown in FIG. The structure which performs the process for doing may be used. In this configuration, first, a statistic data group is generated based on already acquired data, and thereafter, the statistic data group is updated using newly acquired data at regular intervals. Can do.

図８は、本発明の一実施形態に係る統計量データ処理装置１２において行われる統計量データ群からデータを抽出するための処理の手順の一例を示す図である。
なお、本例は一例であり、他の任意の処理手順が用いられてもよい。 FIG. 8 is a diagram illustrating an example of a processing procedure for extracting data from a statistical data group performed in the statistical data processor 12 according to an embodiment of the present invention.
In addition, this example is an example and other arbitrary processing procedures may be used.

（ステップＳ２１）
抽出部１５５は、データを抽出する元となる統計量データ群を取得する。
（ステップＳ２２）
抽出部１５５は、抽出範囲を設定する。当該抽出範囲は、例えば、時刻範囲と、それぞれの観点ごとのスケールを用いて、特定される。
（ステップＳ２３）
観点設定部１５１は、解析に使用する観点を設定する。
（ステップＳ２４）
スケール設定部１５２は、それぞれの観点について、解析に使用するスケールを設定する。
（ステップＳ２５）
抽出部１５５は、取得された統計量データ群から、設定された抽出範囲に含まれるデータを抽出する。そして、抽出部１５５は、抽出されたデータについて、観点設定部１５１により設定された観点およびスケール設定部１５２により設定されたスケールに基づいて、統計量データ群（抽出範囲における統計量データ群）を生成する。なお、抽出範囲に含まれるデータは、例えば、当該抽出範囲から抽出されてもよく、あるいは、他の範囲から該当するデータ（実質的に同じデータ）が抽出されてもよい。
（ステップＳ２６）
データ出力制御部１３３は、このような処理に関する任意のデータ（例えば、抽出範囲における統計量データ群）を画面に出力するように制御する。 (Step S21)
The extraction unit 155 acquires a statistical data group from which data is extracted.
(Step S22)
The extraction unit 155 sets an extraction range. The extraction range is specified using, for example, a time range and a scale for each viewpoint.
(Step S23)
The viewpoint setting unit 151 sets a viewpoint used for analysis.
(Step S24)
The scale setting unit 152 sets a scale used for analysis for each viewpoint.
(Step S25)
The extraction unit 155 extracts data included in the set extraction range from the acquired statistical data group. The extraction unit 155 then extracts a statistic data group (a statistic data group in the extraction range) for the extracted data based on the viewpoint set by the viewpoint setting unit 151 and the scale set by the scale setting unit 152. Generate. The data included in the extraction range may be extracted from the extraction range, for example, or corresponding data (substantially the same data) may be extracted from another range.
(Step S26)
The data output control unit 133 performs control so that arbitrary data relating to such processing (for example, a statistic data group in the extraction range) is output on the screen.

ここで、統計量データ群からデータを抽出するための処理を行うタイミングとしては、任意のタイミングが用いられてもよく、例えば、あらかじめ定められた一定周期のタイミングが用いられてもよく、あるいは、ユーザなどから受け付けられた指示に応じたタイミングが用いられてもよい。 Here, as timing for performing processing for extracting data from the statistic data group, any timing may be used, for example, a predetermined cycle timing may be used, or The timing according to the instruction | indication received from the user etc. may be used.

［第１実施形態のまとめ］
以上のように、本実施形態に係るデータ処理システム１では、統計量データ処理装置１２において、多次元およびマルチスケールのデータ解析を行うことを可能とすることができる。
本実施形態に係る統計量データ処理装置１２は、時系列データの母集団における多次元かつマルチスケールな部分集合に対して、値の分布（統計量）を計算して取得することが可能な時系列データ、あるいは、取得された統計量（単位統計量データ）、あるいは、統計量データ群、のうちの１以上をデータベース１３（あるいは、記憶部１１３）に記憶して保持する。 [Summary of First Embodiment]
As described above, in the data processing system 1 according to the present embodiment, the statistic data processing device 12 can perform multidimensional and multiscale data analysis.
When the statistic data processing apparatus 12 according to the present embodiment is capable of calculating and obtaining a value distribution (statistic) for a multidimensional and multiscale subset in the population of time-series data. One or more of the series data, the acquired statistics (unit statistics data), or the statistics data group is stored and held in the database 13 (or the storage unit 113).

本実施形態に係る統計量データ群のデータ構造では、例えば、多次元およびマルチスケールで、統計量データを把握すること、あるいは、統計量データを検索することなどが可能である。
本実施形態に係る統計量データ群のデータ構造では、例えば、複数の観点および複数のスケールで、異常の発生などの事象を監視して検出することなどが可能であり、様々な観点および様々なスケールの事象を並列に監視して検出することなどが可能である。この場合に、本実施形態に係る統計量データ群のデータ構造では、例えば、発生した事象がいずれの観点およびいずれのスケールでの事象であるかを判定することが可能である。具体例として、広い領域のスケールで事象が発生した場合には、広い領域にわたる原因による事象であると推定することができ、また、特定の領域のスケールで事象が発生した場合には、当該特定の領域に限られた原因による事象であると推定することができる。 In the data structure of the statistics data group according to the present embodiment, for example, it is possible to grasp the statistics data in multi-dimensional and multi-scale, or to search the statistics data.
In the data structure of the statistic data group according to the present embodiment, for example, it is possible to monitor and detect an event such as the occurrence of an abnormality from a plurality of viewpoints and a plurality of scales. It is possible to monitor and detect scale events in parallel. In this case, in the data structure of the statistic data group according to the present embodiment, for example, it is possible to determine which viewpoint and at which scale the generated event is an event. As a specific example, if an event occurs in a wide area scale, it can be estimated that the event is caused by a cause in a wide area, and if an event occurs in a specific area scale, It can be presumed that the event is caused by a cause limited to the region.

ここで、本実施形態では、端末装置１１−１〜１１−ｎに関する値について統計量データを処理する構成としたが、他の任意の値について統計量データを処理する構成が実施されてもよい。 Here, in this embodiment, although it was set as the structure which processes statistics data about the value regarding the terminal devices 11-1 to 11-n, the structure which processes statistics data about another arbitrary value may be implemented. .

（第２実施形態）
［データ処理システム］
図９は、本発明の一実施形態（第２実施形態）に係るデータ処理システム１００１の概略的な構成を示すブロック図である。
データ処理システム１００１は、ｎ個の端末装置１１−１〜１１−ｎと、統計量データ処理装置１０１１と、データベース１０１２と、単位統計量データ生成装置１０２１と、ネットワーク２１を備える。
ここで、端末装置１１−１〜１１−ｎと、ネットワーク２１は、図１に示されるものと同様であり、説明の便宜上、同じ符号を付してある。
また、データベース１０１２は、図１に示されるデータベース１３と同様に、データを記憶する機能を有する。 (Second Embodiment)
[Data processing system]
FIG. 9 is a block diagram showing a schematic configuration of a data processing system 1001 according to one embodiment (second embodiment) of the present invention.
The data processing system 1001 includes n terminal devices 11-1 to 11-n, a statistic data processing device 1011, a database 1012, a unit statistic data generation device 1021, and a network 21.
Here, the terminal devices 11-1 to 11-n and the network 21 are the same as those shown in FIG. 1, and are given the same reference numerals for convenience of explanation.
Further, the database 1012 has a function of storing data, similar to the database 13 shown in FIG.

本実施形態に係るデータ処理システム１００１について、図１に示されるデータ処理システム１との相違点について説明する。
図９の例では、図１の例と比べて、単位統計量データ生成装置１０２１を備えている点と、統計量データ処理装置１０１１により行われる処理の一部が、相違する。 Differences between the data processing system 1001 according to the present embodiment and the data processing system 1 shown in FIG. 1 will be described.
The example of FIG. 9 is different from the example of FIG. 1 in that the unit statistics data generation device 1021 is provided and a part of the processing performed by the statistics data processing device 1011 is different.

単位統計量データ生成装置１０２１は、ｔ−ｄｉｇｅｓｔの技術（例えば、非特許文献１など参照。）により実行することが可能な処理の全部または一部を行う機能を有している。
本実施形態では、単位統計量データ生成装置１０２１は、ｔ−ｄｉｇｅｓｔの技術を用いて、解析対象のデータ、観点を特定する情報、および、それぞれの観点のスケールを特定する情報に基づいて、単位統計量データを生成する機能を有する。
また、本実施形態では、単位統計量データ生成装置１０２１は、ｔ−ｄｉｇｅｓｔの技術を用いて、単位統計量データに対してデータを追加する機能を有している。
また、本実施形態では、単位統計量データ生成装置１０２１は、ｔ−ｄｉｇｅｓｔの技術を用いて、単位統計量データからデータを抽出する機能を有している。当該機能として、例えば、ｔ−ｄｉｇｅｓｔにおける複数分布探索機能が利用されてもよい。
なお、単位統計量データ生成装置１０２１としては、例えば、統計量データ処理装置１０１１を管理する者により管理されてもよく、あるいは、他の者によって提供される単位統計量データ生成装置１０２１を利用する構成が用いられてもよい。 The unit statistic data generation apparatus 1021 has a function of performing all or part of processing that can be executed by the t-digest technique (for example, see Non-patent Document 1).
In the present embodiment, the unit statistic data generation apparatus 1021 uses the t-digest technique to determine the unit based on the data to be analyzed, the information for specifying the viewpoint, and the information for specifying the scale of each viewpoint. It has a function to generate statistical data.
In this embodiment, the unit statistic data generation apparatus 1021 has a function of adding data to the unit statistic data using the t-digest technique.
In the present embodiment, the unit statistic data generation apparatus 1021 has a function of extracting data from the unit statistic data by using the t-digest technique. As this function, for example, a multiple distribution search function in t-digest may be used.
The unit statistic data generation device 1021 may be managed by a person who manages the statistic data processing device 1011 or the unit statistic data generation device 1021 provided by another person is used. A configuration may be used.

統計量データ処理装置１０１１は、図１に示される統計量データ処理装置１２との相違点として、単位統計量データ生成装置１０２１により行われる処理については当該単位統計量データ生成装置１０２１に当該処理を要求して処理結果を受ける構成としてある。 The statistic data processing device 1011 is different from the statistic data processing device 12 shown in FIG. 1 in that the unit statistic data generating device 1021 performs the processing for the processing performed by the unit statistic data generating device 1021. It is configured to request and receive processing results.

ここで、説明の便宜上から、図２に示される機能ブロックを利用して説明する。
本実施形態に係る統計量データ処理装置１０１１は、概略的には、図２に示されるものと同様な機能ブロックを有する。 Here, for convenience of explanation, description will be made using the functional blocks shown in FIG.
The statistic data processing apparatus 1011 according to the present embodiment schematically has functional blocks similar to those shown in FIG.

本実施形態では、統計量データ群生成部１５３は、単位統計量データを生成する処理を要求する信号を、通信部１１４によりネットワーク２１を介して、単位統計量データ生成装置１０２１に送信する。当該信号には、単位統計量データを生成するために必要な情報が含まれ、例えば、解析対象のデータ（または、それを特定する情報）、１個以上の観点を特定する情報、および、それぞれの観点のスケールを特定する情報が含まれる。
単位統計量データ生成装置１０２１は、このような要求の信号を受信した場合、当該要求に応じて単位統計量データを生成し、生成された単位統計量データを含む信号を、ネットワーク２１を介して、統計量データ処理装置１０１１に送信する。
統計量データ群生成部１５３は、単位統計量データ生成装置１０２１から通信部１１４により受信された単位統計量データを使用（利用）して、統計量データ群を生成する。 In the present embodiment, the statistic data group generation unit 153 transmits a signal requesting processing for generating unit statistic data to the unit statistic data generation apparatus 1021 via the network 21 by the communication unit 114. The signal includes information necessary for generating unit statistic data, for example, data to be analyzed (or information for specifying it), information for specifying one or more viewpoints, and each Contains information that identifies the scale of the viewpoint.
When the unit statistic data generation apparatus 1021 receives such a request signal, the unit statistic data generation apparatus 1021 generates unit statistic data in response to the request, and sends a signal including the generated unit statistic data via the network 21. And sent to the statistics data processing apparatus 1011.
The statistic data group generation unit 153 uses (uses) the unit statistic data received by the communication unit 114 from the unit statistic data generation apparatus 1021 to generate a statistic data group.

本実施形態では、追加部１５４は、単位統計量データにデータを追加する処理を要求する信号を、通信部１１４によりネットワーク２１を介して、単位統計量データ生成装置１０２１に送信する。当該信号には、単位統計量データにデータを追加するために必要な情報が含まれ、例えば、追加する先の単位統計量データ（または、それを特定する情報）、および、追加対象データ（または、それを特定する情報）が含まれる。
単位統計量データ生成装置１０２１は、このような要求の信号を受信した場合、当該要求に応じてデータ追加後の単位統計量データを生成し、生成された単位統計量データを含む信号を、ネットワーク２１を介して、統計量データ処理装置１０１１に送信する。
追加部１５４は、単位統計量データ生成装置１０２１から通信部１１４により受信された単位統計量データを使用（利用）して、データ追加後の統計量データ群を生成する。 In the present embodiment, the adding unit 154 transmits a signal requesting processing for adding data to the unit statistics data to the unit statistics data generation apparatus 1021 via the network 21 by the communication unit 114. The signal includes information necessary for adding data to the unit statistic data. For example, unit statistic data to be added (or information for specifying the data) and data to be added (or , Information that identifies it).
When the unit statistic data generation apparatus 1021 receives such a request signal, the unit statistic data generation apparatus 1021 generates unit statistic data after adding data in response to the request, and transmits the signal including the generated unit statistic data to the network. 21 to the statistic data processing apparatus 1011.
The adding unit 154 uses (uses) the unit statistic data received from the unit statistic data generating apparatus 1021 by the communication unit 114, and generates a statistic data group after data addition.

本実施形態では、抽出部１５５は、単位統計量データ（または、それに含まれるデータ）を抽出する処理を要求する信号を、通信部１１４によりネットワーク２１を介して、単位統計量データ生成装置１０２１に送信する。当該信号には、単位統計量データ（または、それに含まれるデータ）を抽出するために必要な情報が含まれ、例えば、抽出する単位統計量データ（または、それに含まれるデータ）を特定する情報（例えば、抽出範囲を特定する情報）が含まれる。
単位統計量データ生成装置１０２１は、このような要求の信号を受信した場合、当該要求に応じて抽出された単位統計量データ（または、それに含まれるデータ）を含む信号を、ネットワーク２１を介して、統計量データ処理装置１０１１に送信する。
抽出部１５５は、単位統計量データ生成装置１０２１から通信部１１４により受信された単位統計量データを使用（利用）して、抽出範囲の統計量データ群を生成する。 In the present embodiment, the extraction unit 155 sends a signal requesting processing for extracting unit statistics data (or data included therein) to the unit statistics data generation apparatus 1021 via the network 21 by the communication unit 114. Send. The signal includes information necessary for extracting the unit statistic data (or data included in the unit statistic data). For example, information specifying the unit statistic data to be extracted (or data included in the data) ( For example, information specifying the extraction range) is included.
When the unit statistic data generation apparatus 1021 receives such a request signal, the unit statistic data generation apparatus 1021 transmits a signal including unit statistic data (or data included therein) extracted in response to the request via the network 21. And sent to the statistics data processing apparatus 1011.
The extraction unit 155 uses (uses) the unit statistic data received from the unit statistic data generation apparatus 1021 by the communication unit 114, and generates a statistic data group of the extraction range.

［第２実施形態のまとめ］
以上のように、第１実施形態と同様に、本実施形態に係る統計量データ処理装置１０１１において、多次元およびマルチスケールのデータ解析を行うことを可能とすることができる。 [Summary of Second Embodiment]
As described above, similarly to the first embodiment, the statistic data processing apparatus 1011 according to this embodiment can perform multidimensional and multiscale data analysis.

（以上の実施形態のまとめ）
一構成例として、複数の異なる観点および観点ごとの複数の異なるスケールについて、観点とスケールとの組み合わせごとに含まれるデータに基づく統計量データを有する統計量データ群を生成する統計量データ処理部（図２の例では、統計量データ処理部１３２）を備える、統計量データ処理装置（図１、図９の例では、統計量データ処理装置１２、１０１１）である。
一構成例として、統計量データ処理部は、統計量データ群に対して新たなデータを追加した統計量データ群を生成する（図２の例では、追加部１５４の機能）。
一構成例として、統計量データ処理部は、統計量データ群から抽出範囲に含まれるデータに基づく統計量データ群を生成する（図２の例では、抽出部１５５の機能）。
一構成例として、統計量データ処理部は、それぞれの観点について１個のスケールが特定された統計量データである単位統計量データを生成する単位統計量データ生成装置（図９の例では、単位統計量データ生成装置１０２１）により生成された単位統計量データを使用して、統計量データ群を生成する。
一構成例として、統計量データ処理部が、複数の異なる観点および観点ごとの複数の異なるスケールについて、観点とスケールとの組み合わせごとに含まれるデータに基づく統計量データを有する統計量データ群を生成する、統計量データ処理方法である（図１、図９の例では、統計量データ処理装置１２、１０１１により行われる処理の方法）。
一構成例として、統計量データ処理部が、複数の異なる観点および観点ごとの複数の異なるスケールについて、観点とスケールとの組み合わせごとに含まれるデータに基づく統計量データを有する統計量データ群を生成するステップ、をコンピュータ（図１、図９の例では、統計量データ処理装置１２、１０１１を構成するコンピュータ）に実行させるためのプログラムである。
一構成例として、統計量データ群のデータ構造であって、複数の異なる観点および観点ごとの複数の異なるスケールについて、観点とスケールとの組み合わせごとに、当該組み合わせごとに含まれるデータに基づく統計量データを有する、データ構造（図３の例では、統計量データ群２０１のデータ構造）である。 (Summary of the above embodiments)
As one configuration example, for a plurality of different viewpoints and a plurality of different scales for each viewpoint, a statistic data processing unit that generates a statistic data group having statistical data based on data included for each combination of viewpoints and scales ( In the example of FIG. 2, it is a statistic data processing device (the statistic data processing devices 12, 1011 in the examples of FIGS. 1 and 9) that includes the statistic data processing unit 132).
As one configuration example, the statistic data processing unit generates a statistic data group in which new data is added to the statistic data group (in the example of FIG. 2, the function of the adding unit 154).
As one configuration example, the statistic data processing unit generates a statistic data group based on data included in the extraction range from the statistic data group (in the example of FIG. 2, the function of the extraction unit 155).
As one configuration example, the statistic data processing unit is a unit statistic data generation device that generates unit statistic data that is statistic data for which one scale is specified for each viewpoint (in the example of FIG. A statistical data group is generated using the unit statistical data generated by the statistical data generator 1021).
As one configuration example, the statistics data processing unit generates a statistics data group having statistics data based on data included in each combination of viewpoints and scales for a plurality of different viewpoints and a plurality of different scales for each viewpoint. (In the example of FIGS. 1 and 9, the method of processing performed by the statistic data processing devices 12, 1011).
As one configuration example, the statistics data processing unit generates a statistics data group having statistics data based on data included in each combination of viewpoints and scales for a plurality of different viewpoints and a plurality of different scales for each viewpoint. Is a program for causing a computer (in the example of FIGS. 1 and 9, a computer configuring the statistic data processing apparatuses 12 and 1011) to execute.
As an example of the configuration, a statistics structure based on data included in each combination of viewpoints and scales with respect to a plurality of different viewpoints and a plurality of different scales for each viewpoint. A data structure having data (in the example of FIG. 3, the data structure of the statistic data group 201).

以上に示した実施形態に係る各装置（例えば、統計量データ処理装置１２、１０１１など）の機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体（記憶媒体）に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することにより、処理を行ってもよい。
なお、ここでいう「コンピュータシステム」とは、オペレーティング・システムあるいは周辺機器等のハードウェアを含むものであってもよい。
また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、フラッシュメモリ等の書き込み可能な不揮発性メモリ、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。
さらに、「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークあるいは電話回線等の通信回線を介してプログラムが送信された場合のサーバあるいはクライアントとなるコンピュータシステム内部の揮発性メモリ（例えばＤＲＡＭ（ＤｙｎａｍｉｃＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ））のように、一定時間プログラムを保持しているものも含むものとする。
また、上記のプログラムは、このプログラムを記憶装置等に格納したコンピュータシステムから、伝送媒体を介して、あるいは、伝送媒体中の伝送波により他のコンピュータシステムに伝送されてもよい。ここで、プログラムを伝送する「伝送媒体」は、インターネット等のネットワーク（通信網）あるいは電話回線等の通信回線（通信線）のように情報を伝送する機能を有する媒体のことをいう。
また、上記のプログラムは、前述した機能の一部を実現するためのものであってもよい。さらに、上記のプログラムは、前述した機能をコンピュータシステムに既に記録されているプログラムとの組み合わせで実現できるもの、いわゆる差分ファイル（差分プログラム）であってもよい。 A program for realizing the function of each device (for example, the statistic data processing device 12, 1011, etc.) according to the embodiment described above is recorded on a computer-readable recording medium (storage medium), and this recording medium Processing may be performed by causing the computer system to read and execute the program recorded in the above.
Here, the “computer system” may include hardware such as an operating system or peripheral devices.
The “computer-readable recording medium” means a flexible disk, a magneto-optical disk, a ROM (Read Only Memory), a writable non-volatile memory such as a flash memory, a portable medium such as a DVD (Digital Versatile Disc), A storage device such as a hard disk built in a computer system.
Furthermore, the “computer-readable recording medium” refers to a volatile memory (for example, DRAM (DRAM) inside a computer system that becomes a server or a client when a program is transmitted via a network such as the Internet or a communication line such as a telephone line. Dynamic Random Access Memory)) that holds a program for a certain period of time is also included.
The program may be transmitted from a computer system storing the program in a storage device or the like to another computer system via a transmission medium or by a transmission wave in the transmission medium. Here, the “transmission medium” for transmitting the program refers to a medium having a function of transmitting information, such as a network (communication network) such as the Internet or a communication line (communication line) such as a telephone line.
Further, the above program may be for realizing a part of the functions described above. Further, the above program may be a so-called difference file (difference program) that can realize the above-described functions in combination with a program already recorded in the computer system.

以上、本発明の実施形態について図面を参照して詳述したが、具体的な構成はこの実施形態に限られるものではなく、本発明の要旨を逸脱しない範囲の設計変更等も含まれる。 As mentioned above, although embodiment of this invention was explained in full detail with reference to drawings, the concrete structure is not restricted to this embodiment, The design change etc. of the range which does not deviate from the summary of this invention are included.

１、１００１…データ処理システム、１１−１〜１１−ｎ…端末装置、１２、１０１１…統計量データ処理装置、１３、１０１２…データベース、２１…ネットワーク、１１１…入力部、１１２…出力部、１１３…記憶部、１１４…通信部、１１５…制御部、１３１…データ取得部、１３２…統計量データ処理部、１３３…データ出力制御部、１５１…観点設定部、１５２…スケール設定部、１５３…統計量データ群生成部、１５４…追加部、１５５…抽出部、２０１…統計量データ群、２１１〜２１３、２２１〜２２３、３１１〜３１３、３２１〜３２３、４１１〜４１３、４２１〜４２３、５１１〜５１３…単位統計量データ、６１１…抽出範囲、１０２１…単位統計量データ生成装置 DESCRIPTION OF SYMBOLS 1,1001 ... Data processing system, 11-1 to 11-n ... Terminal device, 12, 1011 ... Statistics data processing device, 13, 1012 ... Database, 21 ... Network, 111 ... Input part, 112 ... Output part, 113 ... storage unit 114 ... communication unit 115 ... control unit 131 ... data acquisition unit 132 ... statistic data processing unit 133 ... data output control unit 151 ... viewpoint setting unit 152 ... scale setting unit 153 ... statistics Quantity data group generation unit, 154... Addition part, 155... Extraction part, 201... Statistic data group, 211 to 213, 221 to 223, 311 to 313, 321 to 323, 411 to 413, 421 to 423, 511 to 513 ... Unit statistics data, 611 ... Extraction range, 1021 ... Unit statistics data generator

Claims

For a plurality of different viewpoints and a plurality of different scales for each of the viewpoints, a statistic data processing unit that generates a statistic data group having statistical data based on data included for each combination of the viewpoint and the scale,
Statistics data processing device.

The statistic data processing unit generates the statistic data group obtained by adding new data to the statistic data group.
The statistic data processing apparatus according to claim 1.

The statistic data processing unit generates the statistic data group based on data included in an extraction range from the statistic data group.
The statistic data processing apparatus according to claim 1.

The statistic data processing unit generates the unit statistic data generated by a unit statistic data generation device that generates unit statistic data that is statistic data in which one scale is specified for each of the viewpoints. To generate the statistics data group,
The statistics data processing apparatus according to any one of claims 1 to 3.

The statistic data processing unit generates a statistic data group having statistic data based on data included in each combination of the viewpoint and the scale for a plurality of different viewpoints and a plurality of different scales for each of the viewpoints.
Statistics data processing method.

A step in which a statistic data processing unit generates a statistic data group having statistic data based on data included in each combination of the viewpoint and the scale for a plurality of different viewpoints and a plurality of different scales for each of the viewpoints; ,
A program that causes a computer to execute.

A data structure of a statistical data group,
With respect to a plurality of different viewpoints and a plurality of different scales for each of the viewpoints, for each combination of the viewpoint and the scale, statistics data based on data included for each of the combinations,
data structure.