JP2012174212A

JP2012174212A - Window processing device, method, and program

Info

Publication number: JP2012174212A
Application number: JP2011038656A
Authority: JP
Inventors: Tatsuya Asai; 達哉浅井; Hiroaki Morikawa; 裕章森川; Shinichiro Tako; 真一郎多湖; Hiroya Inakoshi; 宏弥稲越; Nobuhiro Yugami; 伸弘湯上
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2011-02-24
Filing date: 2011-02-24
Publication date: 2012-09-10
Anticipated expiration: 2031-02-24
Also published as: JP5637896B2

Abstract

PROBLEM TO BE SOLVED: To provide a window processing device capable of reducing computation time and a calculation area for window processing.SOLUTION: A window processing device 1 comprises: a data stream acquisition unit 12 that acquires a data stream including elements having the inclusion relation; a data stream analysis unit 13 that sequentially detects a position of a window element and a position of an arithmetic target element related to the window element from a data series, and stores a set of arithmetic processing values corresponding to the most recently detected arithmetic target element and a window element start symbol in a buffer 14 for every detection of the window element; a solution output unit 16 that performs window processing on the set of arithmetic processing values stored in the buffer 14.

Description

本発明は，包含関係を持つ要素を含むデータのストリームに対するウィンドウ処理技術に関する。 The present invention relates to a window processing technique for a stream of data including elements having an inclusion relationship.

近年，センサデータ，ＰＯＳ（ＰｏｉｎｔｏｆＳａｌｅｓ）データ，ブログやツィッター（ｔｗｉｔｔｅｒ）などのウェブデータなど，時々刻々と発生し続ける大量データを，リアルタイムに処理することを目的としたストリーム処理（ＳｔｒｅａｍＰｒｏｃｅｓｓｉｎｇ）が注目を集めている。 In recent years, stream processing (Stream Processing) for the purpose of processing in real time a large amount of data that is constantly generated, such as sensor data, POS (Point of Sales) data, and web data such as blogs and Twitter. Has attracted attention.

ストリーム処理では，リアルタイム性が要求されるため，限られた計算領域を用いて，できるだけ少ない処理量で計算する技術が必須である。 In stream processing, real-time performance is required, so a technique for calculating with as little processing as possible using a limited calculation area is essential.

これまでは，大量データの処理にはＤＢＭＳ（データベース管理システム）を用いるのが主流であった。しかし，ＤＢＭＳでは，データ格納や補助データ作成に膨大な処理時間が必要となるため，リアルタイム処理には不向きである。 Until now, DBMS (database management system) has been the mainstream for processing large amounts of data. However, since DBMS requires a huge amount of processing time for data storage and auxiliary data creation, it is not suitable for real-time processing.

なお，現在の商用またはオープンソースのストリーム処理エンジンは，データ種としてＣＳＶのような固定長のデータを想定しており，より柔軟なデータ表現が可能なＸＭＬ形式のデータを効率よく扱うことはできない。 In addition, the current commercial or open source stream processing engine assumes fixed-length data such as CSV as the data type, and cannot efficiently handle data in XML format that allows more flexible data representation. .

ストリーム処理における最も基本的な処理の１つとして，ウィンドウ処理がある。ウィンドウ処理では，決められた処理期間内のデータのみを逐次的に処理することによって，無限長のデータストリームに対して，決められた計算領域だけを用いてリアルタイムで処理することができる。 One of the most basic processes in stream processing is window processing. In window processing, only data within a predetermined processing period is sequentially processed, so that an infinitely long data stream can be processed in real time using only a predetermined calculation area.

上記のウィンドウ処理を，ＸＭＬデータのストリームに適用するために，Ｘｑｕｅｒｙ拡張言語およびその実行システムが提案されている（第1の既存技術）。 In order to apply the above window processing to a stream of XML data, an Xquery extension language and its execution system have been proposed (first existing technology).

図１６は，ＸＭＬデータのストリームの例を説明するための図である。 FIG. 16 is a diagram for explaining an example of an XML data stream.

図１６に示す例では，ある要素をルートにして包含関係を持つ構成要素を含むＸＭＬデータが次々と発生してストリームを構成しているとする。 In the example shown in FIG. 16, it is assumed that XML data including constituent elements having an inclusion relationship with a certain element as a root is generated one after another to form a stream.

図１６に示すＸＭＬデータのストリームは，ＰＯＳシステムで生成されたデータである。このＰＯＳ要素をルートとするＸＭＬデータ（以下，ＰＯＳ木と呼ぶ）は，ｃｕｓｔ要素とｓｈｏｐ要素とを持つ。また，レジ担当者（店員）が変わるまでは，新しいＰＯＳ木は発生しないものとする。 The XML data stream shown in FIG. 16 is data generated by the POS system. The XML data having the POS element as a root (hereinafter referred to as a POS tree) has a custom element and a shop element. Further, it is assumed that a new POS tree does not occur until the cashier person (store clerk) changes.

ｃｕｓｔ要素は，顧客の買い物データを表わす。ｃｕｓｔ要素は，必須要素として，購入品の価格を表すｐｒｉｃ要素を持ち，任意要素として，購入者名を表すｎａｍｅ要素を持つ。ｃｕｓｔ要素は，１つのＰＯＳ木に何回現れてもよい。 The custom element represents customer shopping data. The “cust” element has a “pric” element representing the price of the purchased product as an essential element, and a “name” element representing the purchaser name as an optional element. The custom element may appear any number of times in a single POS tree.

ｓｈｏｐ要素は，レジ担当の店員データを表わす。ｓｈｏｐ要素は，必須要素として，店員ＩＤを表すｉｄ要素を持つ。ｓｈｏｐ要素は，ＰＯＳ木に正確に１回だけ現れるものとする。 The shop element represents the clerk data in charge of the cash register. The shop element has an id element representing a clerk ID as an essential element. The shop element shall appear exactly once in the POS tree.

図１６に示すＰＯＳ木に対するウィンドウ処理として，以下のようなＸＭＬウィンドウ問い合わせが与えられたとする。 Assume that the following XML window query is given as window processing for the POS tree shown in FIG.

ｆｏｒｗｉｎｄｏｗ＄ｗｉｎ／ｐｏｓ／ｃｕｓｔ
ｒａｎｇｅ３
ｓｌｉｄｉｎｇ１
ｒｅｔｕｒｎＳＵＭ（＄ｗ／ｐｒｉｃ）
上記のＸＭＬウィンドウ問い合わせは，ウィンドウ処理として，ウィンドウサイズ（ｒａｎｇｅ）＝３，スライド幅（ｓｌｉｄｉｎｇ）＝１のスライディングウィンドウが設定されること，パス式“／ｐｏｓ／ｃｕｓｔ”で指定される部分木をウィンドウ構成要素とし，各ウィンドウ＄ｗに対して，パス式“＄ｗ／ｐｒｉｃ”で指定される演算対象要素（ｐｒｉｃ要素）に対応する演算対象要素値の総和を出力することを示す。 for window $ w in / pos / cust
range 3
sliding 1
return SUM ($ w / pric)
In the above XML window query, a sliding window with window size (range) = 3, slide width (sliding) = 1 is set as window processing, and a subtree specified by the path expression “/ pos / cust” is set. This indicates that the sum of the calculation target element values corresponding to the calculation target element (pric element) specified by the path expression “$ w / pric” is output for each window $ w.

図１７は，ＸＭＬデータ系列に対するウィンドウ処理を説明するための図である。 FIG. 17 is a diagram for explaining window processing for an XML data series.

図１７に示すように，ウィンドウ処理では，ＸＭＬデータのストリームから，１スキャン目の処理でウィンドウ構成要素（ｃｕｓｔ要素から始まる部分木）が１つ切り出され，ウィンドウバッファＢに格納される。そして，２スキャン目の処理で，ウィンドウバッファＢ内のウィンドウ構成要素から演算対象要素値が取得されて集計装置Ｍ内のバッファに格納される。その後，集計装置Ｍにより，バッファ内に格納された演算対象要素値が集計される。 As shown in FIG. 17, in the window processing, one window component (partial tree starting from a custom element) is cut out from the XML data stream in the first scan and stored in the window buffer B. Then, in the process of the second scan, the calculation target element value is acquired from the window constituent element in the window buffer B and stored in the buffer in the counting device M. Thereafter, the calculation target element values stored in the buffer are totaled by the totaling device M.

また，別の既存技術として，ストリーム型のＸｐａｔｈ照合が提案されている（第２の既存技術）。 As another existing technology, stream-type Xpath matching has been proposed (second existing technology).

ストリーム型のＸｐａｔｈ照合は，ＸＭＬデータを１回スキャンしてＸｐａｔｈ問合せを照合する手法に基づいた，ストリーム型のＸｐａｔｈ照合エンジンにより実現される。 The stream type Xpath collation is realized by a stream type Xpath collation engine based on a method of collating an Xpath query by scanning XML data once.

図１８は，ＸＭＬデータ系列に対するストリーム型のＸｐａｔｈ照合を説明するための図である。 FIG. 18 is a diagram for explaining stream-type Xpath verification for an XML data series.

ストリーム型のＸｐａｔｈ照合では，１スキャン目で，ウィンドウ構成要素を指定したパス（Ｘｐａｔｈ：／ｐｏｓ／ｃｕｓｔ）が照合され，さらに，２スキャン目で，演算対象要素を指定したパス（Ｘｐａｔｈ：／ｐｏｓ／ｃｕｓｔ／ｐｒｉｃ）が照合されて，演算対象要素に対応付けられた演算対象要素値が特定される。 In the stream type Xpath collation, the path (Xpath: / pos / cus) specifying the window component is collated at the first scan, and the path (Xpath: / pos) specifying the operation target element at the second scan. / Cust / pric) is collated, and the calculation target element value associated with the calculation target element is specified.

ＸＭＬデータから単に演算対象要素（ｐｒｉｃ）を検出するだけであれば，Ｘｐａｔｈが指定するＸＭＬの部分木を１スキャンで取得することができる。しかし，ＸＭＬデータは，ウィンドウ構成要素に包含される演算対象要素が不特定であることなど，柔軟な表現が可能である。そのため，包含関係にある演算対象要素をウィンドウ構成要素による部分木と関連付けて取得するためには，２スキャンが必要であった。 If the calculation target element (prix) is simply detected from the XML data, the XML subtree specified by Xpath can be acquired in one scan. However, the XML data can be expressed flexibly, for example, the calculation target element included in the window component is unspecified. For this reason, two scans are required in order to acquire the calculation target element in the inclusive relation in association with the subtree by the window constituent element.

具体的には，上記のＸＭＬウィンドウ問い合わせの場合のように演算を各ウィンドウ構成要素単位（ＸＰ１）で行いたい場合には，図１８に示すように，演算対象要素ＸＰ２＿２，ＸＰ２＿３が，包含関係にあるウィンドウ構成要素ＸＰ１＿２に関連付けられて取得されなければならなかった。すなわち，ＸＭＬデータからのウィンドウ構成要素の照合と各ウィンドウ構成要素からの演算対象要素の照合とのために，２スキャンが必要であった。 Specifically, when it is desired to perform an operation in units of each window component (XP1) as in the case of the above XML window inquiry, as shown in FIG. 18, the operation target elements XP2_2 and XP2_3 are in an inclusion relationship. It had to be acquired in association with a certain window component XP1_2. That is, two scans are required for collating the window constituent elements from the XML data and collating the calculation target elements from the respective window constituent elements.

Ａ．Ａｒａｓｕ，ｅｔａｌ， “ＴｈｅＣＱＬＣｏｎｔｉｎｕｏｕｓＱｕｅｒｙＬａｎｇｕａｇｅ：ＳｅｍａｎｔｉｃＦｏｕｎｄａｔｉｏｎｓａｎｄＱｕｅｒｙＥｘｅｃｕｔｉｏｎ”，ＶＬＤＢＪ．，Ｖｏｌ．１５（２），ｐｐ．１２１−１４２，２００６年A. Arasu, et al, “The CQL Continuous Query Language: Semantic Foundations and Query Execution”, VLDB J. et al. , Vol. 15 (2), pp. 121-142, 2006 Ｉ．Ｂｏｔａｎ，ｅｔａｌ， “ＥｘｔｅｎｄｉｎｇＸＱｕｅｒｙｗｉｔｈＷｉｎｄｏｗＦｕｎｃｔｉｏｎｓ”，Ｐｒｏｃ．ＶＬＤＢ２００７，ｐｐ．７５−８６，２００７年I. Botan, et al, “Extending XQuery with Window Functions”, Proc. VLDB 2007, pp. 75-86, 2007 Ｉ．Ａｖｉｌａ−Ｃａｍｐｉｌｌｏ，ｅｔａｌ， “ＸＭＬＴＫ：ＡｎＸＭＬＴｏｏｌｋｉｔｆｏｒＳｃａｌａｂｌｅＸＭＬＳｔｒｅａｍＰｒｏｃｅｓｓｉｎｇ”，Ｐｒｏｃ．ＰＬＡＮＸ’０２，２００２年I. Avila-Camillo, et al, “XMLTK: An XML Tool for Scalable XML Stream Processing”, Proc. PLANX'02, 2002 Ｙ．Ｄｉａｏ，ｅｔａｌ， “ＰａｔｈＳｈａｒｉｎｇａｎｄＰｒｅｄｉｃａｔｅＥｖａｌｕａｔｉｏｎｆｏｒＨｉｇｈ−ＰｅｒｆｏｒｍａｎｃｅＸＰａｔｈＦｉｌｔｅｒｉｎｇ”，Ｐｒｏｃ．ＡＣＭＴＯＤＳ，２８（４），４６７−５１６，２００３年Y. Diao, et al, “Path Sharing and Predicate Evaluation for High-Performance XPath Filtering”, Proc. ACM TODS, 28 (4), 467-516, 2003

上記で第１の既存技術として説明した，ＸＭＬデータ系列に対するウィンドウ処理では，ウィンドウ構成要素の取得に１スキャン，さらに演算対象要素値の取得に１スキャンの合計２回のスキャンの実行が必要であり，処理が非効率であるという問題があった。 In the window processing for the XML data series described above as the first existing technology, it is necessary to execute two scans in total, that is, one scan for acquiring the window component and one scan for acquiring the calculation target element value. There was a problem that the processing was inefficient.

また，第２の既存技術として説明した，ＸＭＬデータ系列に対するストリーム型Ｘｐａｔｈの照合処理では，互いに包含関係にある部分木を関連付けて取得するために，ウィンドウ構成要素と演算対象要素のそれぞれの照合に２スキャンの実行が必要であり，同様に，処理が非効率であるという問題があった。 Also, in the collation processing of the stream type Xpath for the XML data series described as the second existing technology, in order to obtain the subtrees in an inclusive relationship in association with each other, the collation between the window constituent element and the calculation target element is performed. Two scans are required to be executed, and similarly, there is a problem that the processing is inefficient.

本発明の目的は，例えばＸＭＬデータのように，包含関係を持つ要素を含むデータ系列に対するウィンドウ処理において，ウィンドウ構成要素と関連付けた演算対象要素を１回のスキャンで取得できる処理技術を提供することである。 An object of the present invention is to provide a processing technique capable of acquiring a calculation target element associated with a window constituent element in a single scan in a window process for a data series including elements having an inclusive relation such as XML data. It is.

本発明の一態様として開示されるウィンドウ処理装置は，１）データを格納するバッファと，２）包含関係を持つ要素を含むデータ系列を取得するデータ取得部と，３）前記データ系列から，ウィンドウ処理単位を定めるウィンドウ構成要素の位置および前記ウィンドウ構成要素に包含される演算対象要素の位置を前記ウィンドウ構成要素に関連付けて逐次的に検出し，前記ウィンドウ構成要素を検出する度に，直前に検出したウィンドウ構成要素と関連付けた演算対象要素に対応する演算対象要素値の集合を前記バッファに格納するデータストリーム解析部と，４）前記バッファに格納された前記ウィンドウ構成要素に関連付けられた演算処理値の集合に対してウィンドウ処理を行うウィンドウ処理部とを備える。 A window processing device disclosed as one aspect of the present invention includes 1) a buffer for storing data, 2) a data acquisition unit for acquiring a data series including elements having inclusion relations, and 3) a window from the data series. The position of the window component that defines the processing unit and the position of the operation target element included in the window component are sequentially detected in association with the window component, and are detected immediately before each window component is detected. A data stream analysis unit that stores in the buffer a set of operation target element values corresponding to the operation target element associated with the window component, and 4) an operation processing value associated with the window component stored in the buffer A window processing unit for performing window processing on the set of

また，本発明の別の態様として開示されるウィンドウ処理方法は，コンピュータが，上記ウィンドウ処理装置の各処理部で実現される各処理に対応する処理ステップを実行するものである。 In addition, in a window processing method disclosed as another aspect of the present invention, a computer executes processing steps corresponding to each process realized by each processing unit of the window processing apparatus.

また，本発明の別の態様として開示されるウィンドウ処理プログラムは，コンピュータに，上記ウィンドウ処理装置の各処理部で実現される各処理に対応する処理ステップを実行させるものである。 A window processing program disclosed as another aspect of the present invention causes a computer to execute processing steps corresponding to each processing realized by each processing unit of the window processing device.

上記した手段によれば，データ系列からウィンドウ構成要素と演算対象要素とを１回のスキャンで同時に行うことができ，ウィンドウ処理の計算時間と計算領域との効率化を実現することができる。 According to the above-described means, the window constituent element and the operation target element can be simultaneously performed from the data series by one scan, and the efficiency of the calculation time and calculation area of the window processing can be realized.

本発明の一態様として開示するウィンドウ処理装置の一実施例における構成例を示す図である。It is a figure which shows the structural example in one Example of the window processing apparatus disclosed as 1 aspect of this invention. データストリーム解析部の構成例を示す図である。It is a figure which shows the structural example of a data stream analysis part. ＸＭＬストリームＸＳのデータ構造の表現例を示す図である。It is a figure which shows the example of expression of the data structure of XML stream XS. ウィンドウクエリＱの例を示す図である。It is a figure which shows the example of the window query Q. ウィンドウ処理装置の処理フローを示す図である。It is a figure which shows the processing flow of a window processing apparatus. ウィンドウクエリ取得処理のより詳細な処理フロー例を示す図である。It is a figure which shows the example of a more detailed process flow of a window query acquisition process. データストリーム解析処理の処理フロー例を示す図である。It is a figure which shows the example of a processing flow of a data stream analysis process. データＣＸの例を示す図である。It is a figure which shows the example of the data CX. ステップＳ２２のストリーム解析のより詳細な処理フローを示す図である。It is a figure which shows the more detailed process flow of the stream analysis of step S22. ＸＭＬストリームのＸＭＬデータとパス照合によりバッファに出力されるデータとの関係例を示す図である。It is a figure which shows the example of a relationship between the XML data of an XML stream, and the data output to a buffer by path | pass verification. 解出力処理の処理フロー例を示す図である。It is a figure which shows the example of a processing flow of a solution output process. バッファのデータと，解出力処理により出力シーケンスＳｅｑに出力されるデータとの関係を模式的に示す図（その１）である。FIG. 6 is a diagram (part 1) schematically illustrating a relationship between buffer data and data output to an output sequence Seq by a solution output process. バッファのデータと，解出力処理により出力シーケンスＳｅｑに出力されるデータとの関係を模式的に示す図（その２）である。FIG. 10 is a diagram (part 2) schematically illustrating a relationship between buffer data and data output to an output sequence Seq by a solution output process. バッファのデータと，解出力処理により出力シーケンスＳｅｑに出力されるデータとの関係を模式的に示す図（その３）である。FIG. 11 is a diagram (No. 3) schematically illustrating a relationship between buffer data and data output to an output sequence Seq by a solution output process. ＸＭＬストリームＸＳの各ＸＭＬデータと，バッファに出力されるデータとの関係を模式的に示す図である。It is a figure which shows typically the relationship between each XML data of the XML stream XS, and the data output to a buffer. ＸＭＬデータのストリームの例を説明するための図である。It is a figure for demonstrating the example of the stream of XML data. ＸＭＬデータ系列に対するウィンドウ処理を説明するための図である。It is a figure for demonstrating the window process with respect to an XML data series. ＸＭＬデータ系列に対するストリーム型のＸｐａｔｈ照合を説明するための図である。It is a figure for demonstrating the stream type Xpath collation with respect to an XML data series.

本発明の一態様として開示するウィンドウ処理装置は，既知のストリーム型のＸＰａｔｈ照合機能に，ＸＭＬストリームからウィンドウ構成要素に関連付けられた演算対象要素を検出し，検出済みの演算対象要素に対応する演算対象要素値を取得して，ウィンドウ構成要素の開始位置を検出する度に，ウィンドウ構成要素開始を表わす記号と検出済みの演算対象要素に対応する演算対象値の集合とをバッファに送信する機能を有する。 A window processing apparatus disclosed as one aspect of the present invention detects a calculation target element associated with a window constituent element from an XML stream using a known stream type XPath collation function, and performs a calculation corresponding to the detected calculation target element. A function that acquires a target element value and detects the start position of the window component, and sends a symbol representing the start of the window component and a set of calculation target values corresponding to the detected calculation target element to the buffer. Have.

図１は，本発明の一態様として開示するウィンドウ処理装置１の一実施例における構成例を示す図である。 FIG. 1 is a diagram illustrating a configuration example in an embodiment of a window processing apparatus 1 disclosed as one aspect of the present invention.

ウィンドウ処理装置１は，包含関係を持つ要素を含むデータ系列，例えばＸＭＬデータのストリーム（以下，ＸＭＬストリーム）に対するウィンドウ処理を実行する。より詳しくは，ウィンドウ処理装置１は，ユーザが使用するクエリ装置２からＸＭＬストリームＸＳに対するウィンドウ問い合わせ（クエリ）Ｑを受け付け，データ送信装置３からネットワークＮを介して送信されたＸＭＬストリームＸＳに対するウィンドウ処理を実行する。ウィンドウ処理として，ウィンドウクエリＱで設定されたウィンドウサイズｗおよびスライド幅ｓをもとに，直近のウィンドウ集計値ａを，ウィンドウがスライドするたびに出力し，その出力をウィンドウクエリＱの解（集計値ａ）Ａとして送信する。 The window processing apparatus 1 executes window processing on a data series including elements having an inclusion relationship, for example, an XML data stream (hereinafter, XML stream). More specifically, the window processing device 1 receives a window inquiry (query) Q for the XML stream XS from the query device 2 used by the user, and performs window processing for the XML stream XS transmitted from the data transmission device 3 via the network N. Execute. As window processing, based on the window size w and slide width s set in the window query Q, the latest window total value a is output each time the window slides, and the output is the solution of the window query Q (total) Value a) Send as A.

ウィンドウ処理装置１は，上記の機能を実現するため，ウィンドウクエリ受付部１１，データストリーム取得部１２，データストリーム解析部１３，バッファ１４，バッファ集計部１５，および解出力部１６を備える。 The window processing device 1 includes a window query reception unit 11, a data stream acquisition unit 12, a data stream analysis unit 13, a buffer 14, a buffer totaling unit 15, and a solution output unit 16 in order to realize the above functions.

ウィンドウクエリ受付部１１は，ユーザのクエリ装置２から，ウィンドウ単位であるウィンドウ構成要素ＸＰ１，演算対象を示す演算対象要素ＸＰ２，ウィンドウサイズｗ（ｗ≧１）およびスライド幅ｓ（ｓ≧１）の指定を含むウィンドウクエリＱを受け付ける。 The window query reception unit 11 receives a window component element XP1, which is a window unit, an operation target element XP2, an operation target element XP2, a window size w (w ≧ 1), and a slide width s (s ≧ 1) from the user query device 2. A window query Q including designation is accepted.

データストリーム取得部１２は，ＸＭＬデータの発生源であるデータ送信装置３から，ネットワークＮを解してＸＭＬストリームＸＳを受信する。 The data stream acquisition unit 12 receives the XML stream XS through the network N from the data transmission device 3 that is the generation source of the XML data.

データストリーム解析部１３は，逐次的に，受信したＸＭＬストリームＸＳから，ウィンドウ構成要素ＸＰ１の位置を検出し，さらにウィンドウ構成要素ＸＰ１に包含される演算対象要素ＸＰ２の位置をウィンドウ構成要素ＸＰ１に関連付けて検出する。そして，データストリーム解析部１３は，検出した演算対象要素ＸＰ２に対応する演算対象要素値の集合を保持し，ウィンドウ構成要素ＸＰ１の開始位置を検出する度に，保持している演算対象要素値の集合，すなわち直前に検出した演算対象要素ＸＰ２に対応する演算対象要素値の集合をバッファ１４に格納する。 The data stream analyzer 13 sequentially detects the position of the window component XP1 from the received XML stream XS, and further associates the position of the operation target element XP2 included in the window component XP1 with the window component XP1. To detect. Then, the data stream analysis unit 13 holds a set of calculation target element values corresponding to the detected calculation target element XP2, and every time the start position of the window component element XP1 is detected, the data stream analysis unit 13 A set, that is, a set of calculation target element values corresponding to the calculation target element XP2 detected immediately before is stored in the buffer 14.

バッファ１４は，ウィンドウ構成要素ＸＰ１の検出，および検出したウィンドウ構成要素ＸＰ１に関連付けられた演算対象要素値の集合である値リストＶを保持する。 The buffer 14 holds a value list V that is a set of element values to be calculated associated with the detection of the window component XP1 and the detected window component XP1.

バッファ集計部１５は，ウィンドウサイズｗおよびスライド幅ｓにもとづいて，バッファ１４に格納されているウィンドウサイズｗ分の演算対象要素値の集合を集計してウィンドウ集計値ａを得る。バッファ集計部１５は，ウィンドウがスライドする度に，直近に得られたウィンドウ集計値ａを解出力部１６へ出力する。 Based on the window size w and the slide width s, the buffer tabulation unit 15 tabulates a set of calculation target element values for the window size w stored in the buffer 14 to obtain a window tabulation value a. The buffer totalization unit 15 outputs the window totalization value a obtained most recently to the solution output unit 16 each time the window slides.

解出力部１６は，出力されたウィンドウ集計値ａを，ウィンドウクエリの解Ａとしてクエリ装置２へ送る。 The solution output unit 16 sends the output window total value a to the query device 2 as the solution A of the window query.

なお，上記バッファ集計部１５，解出力部１６およびウィンドウクエリ受付部１１は，ウィンドウ処理部の一実施例である。 The buffer totaling unit 15, the solution output unit 16, and the window query receiving unit 11 are examples of a window processing unit.

図２は，データストリーム解析部１３の構成例を示す図である。 FIG. 2 is a diagram illustrating a configuration example of the data stream analysis unit 13.

データストリーム解析部１３は，ストリーム型パス照合部１３１，ウィンドウ構成要素位置検出部１３２，演算対象要素位置検出部１３３，演算対象要素値検出部１３５，演算対象要素値取得部１３６，ウィンドウ構成要素開始記号送信部１３７，および演算対象要素値リスト送信部１３８を備える。 The data stream analysis unit 13 includes a stream type path matching unit 131, a window component position detection unit 132, a calculation target element position detection unit 133, a calculation target element value detection unit 135, a calculation target element value acquisition unit 136, and a window component start A symbol transmission unit 137 and a calculation target element value list transmission unit 138 are provided.

ストリーム型パス照合部１３１は，受け付けたＸＭＬストリームＸＳの各ＸＭＬデータに対してストリーム型のＸｐａｔｈ照合を行って，ＸＭＬデータから要素を検出する。 The stream type path verification unit 131 performs stream type Xpath verification on each XML data of the received XML stream XS, and detects elements from the XML data.

ウィンドウ構成要素位置検出部１３２は，ＸＭＬデータから，ウィンドウ構成要素ＸＰ１の開始位置または終了位置を検出する。ここで，ウィンドウ構成要素ＸＰ１の開始位置および終了位置は，ウィンドウ構成要素ＸＰ１の開始／終了を示すタグまたはイベントとする。 The window component position detector 132 detects the start position or the end position of the window component XP1 from the XML data. Here, the start position and the end position of the window component XP1 are tags or events indicating the start / end of the window component XP1.

演算対象要素位置検出部１３３は，ＸＭＬデータから，演算対象要素ＸＰ２の開始位置または終了位置を検出する。ここで，演算対象要素ＸＰ２の開始位置および終了位置は，演算対象要素ＸＰ２を示す開始／終了を示すタグまたはイベントとする。 The calculation target element position detection unit 133 detects the start position or end position of the calculation target element XP2 from the XML data. Here, the start position and end position of the calculation target element XP2 are a tag or event indicating the start / end indicating the calculation target element XP2.

演算対象要素値検出部１３５は，検出した演算対象要素ＸＰ２に対応する演算対象要素値を検出する。 The calculation target element value detection unit 135 detects a calculation target element value corresponding to the detected calculation target element XP2.

演算対象要素値取得部１３６は，ＸＭＬデータから，ウィンドウ構成要素に関連付けられて検出された演算対象要素ＸＰ２に対応する演算対象要素値を取得して，値リストＶに追加する。 The calculation target element value acquisition unit 136 acquires a calculation target element value corresponding to the calculation target element XP2 detected in association with the window component from the XML data, and adds the calculation target element value to the value list V.

ウィンドウ構成要素開始記号送信部１３７は，ウィンドウ構成要素ＸＰ１の開始位置が検出される度に，検出されたウィンドウ構成要素ＸＰ１の開始位置を示すフラグ（＄）をバッファ１４に書き出す。 The window component start symbol transmission unit 137 writes a flag ($) indicating the start position of the detected window component XP1 to the buffer 14 every time the start position of the window component XP1 is detected.

演算対象要素値リスト送信部１３８は，ウィンドウ構成要素ＸＰ１の開始位置が検出された後に演算対象要素ＸＰ２の終了位置が検出された場合に，検出済みの演算対象要素値のリストＶをバッファ１４に書き出す。 The calculation target element value list transmission unit 138 stores the list V of detected calculation target element values in the buffer 14 when the end position of the calculation target element XP2 is detected after the start position of the window component element XP1 is detected. Write out.

図３（Ａ）〜（Ｃ）は，ＸＭＬストリームＸＳのデータ構造の表現例を示す図である。 FIGS. 3A to 3C are diagrams illustrating examples of the representation of the data structure of the XML stream XS.

データストリーム取得部１２が受信するＸＭＬストリームＸＳにおける各ＸＭＬデータは，ＰＯＳシステムで生成されたデータであり，ＰＯＳ要素をルートとして，包含関係を持つ要素を含む。ＸＭＬデータは，ｃｕｓｔ要素とｓｈｏｐ要素とを持ち，レジ担当者（店員）が変わるまでは，新しいＸＭＬデータ（ＰＯＳ要素）は発生しないものとする。 Each XML data in the XML stream XS received by the data stream acquisition unit 12 is data generated by the POS system, and includes elements having an inclusion relationship with the POS element as a root. The XML data has a custom element and a shop element, and new XML data (POS element) is not generated until the cashier person (store clerk) changes.

顧客の買い物データを示すｃｕｓｔ要素は，必須要素として，購入品の価格を表すｐｒｉｃ要素を持ち，任意要素として，購入者名を表すｎａｍｅ要素を持つ。ｃｕｓｔ要素は，１つのＸＭＬデータに何回出現してもよい。 The custom element indicating the customer's shopping data has a price element indicating the price of the purchased item as an essential element and a name element indicating the purchaser name as an optional element. The custom element may appear any number of times in a piece of XML data.

レジ担当の店員データを示すｓｈｏｐ要素は，必須要素として，店員ＩＤを表すｉｄ要素を持ち，ＸＭＬデータ内に１回だけ出現するものとする。 The shop element indicating the clerk data in charge of the cash register has an id element representing the clerk ID as an essential element, and appears only once in the XML data.

図３（Ａ）は，上記したＸＭＬデータのデータ構造を木構造で表現したものである。図３（Ａ）に示すＸＭＬデータの木構造表現は，図３（Ｂ）に示すように，タグを用いたテキスト表現に変換することができる。さらに，図３（Ｂ）のＸＭＬデータのテキスト表現は，図３（Ｃ）に示すようなイベント列の表現として扱うことができる。ＸＭＬデータのイベント列表現の代表例として，Ｗ３Ｃ標準のＳＡＸがある。 FIG. 3A shows the tree structure of the data structure of the XML data described above. The tree structure representation of the XML data shown in FIG. 3A can be converted into a text representation using tags as shown in FIG. Further, the text representation of the XML data in FIG. 3B can be treated as an event string representation as shown in FIG. As a representative example of the event string representation of XML data, there is the W3C standard SAX.

本実施例において，データストリーム取得部１２は，ＸＭＬストリームＸＳとして，ＸＭＬデータのテキスト表現またはイベント表現のストリームを取得することができる。 In the present embodiment, the data stream acquisition unit 12 can acquire a text expression or event expression stream of XML data as the XML stream XS.

また，本実施例において，データストリーム解析部１３は，ＸＭＬデータのイベント列表現に対して動作するが，ＸＭＬデータのテキスト表現を直接入力して，逐次的にイベントを検出しながら処理を行うように構成されてもよい。 In the present embodiment, the data stream analysis unit 13 operates on the event string representation of the XML data. However, the data stream analysis unit 13 directly inputs the text representation of the XML data and performs processing while sequentially detecting the event. It may be configured.

図４は，ウィンドウクエリＱの例を示す図である。 FIG. 4 is a diagram illustrating an example of the window query Q.

図４に示すウィンドウクエリＱは，ウィンドウ構成要素ＸＰ１を指定するパス式“／ｐｏｓ／ｃｕｓｔ”，演算対象要素ＸＰ２を指定するパス式“／ｐｏｓ／ｃｕｓｔ／ｐｒｉｃ”，ウィンドウサイズｒａｎｇｅ＝３，およびスライド幅ｓｌｉｄｉｎｇ＝１の指定が含まれている。 The window query Q shown in FIG. 4 includes a path expression “/ pos / cust” that specifies the window component XP1, a path expression “/ pos / cust / pric” that specifies the operation target element XP2, a window size range = 3, and The designation of slide width sliding = 1 is included.

以下，ウィンドウ処理装置１の処理の流れを説明する。 Hereinafter, the process flow of the window processing apparatus 1 will be described.

図５は，ウィンドウ処理装置１の処理フローを示す図である。 FIG. 5 is a diagram showing a processing flow of the window processing apparatus 1.

ステップＳ１：ウィンドウクエリ取得処理
ウィンドウクエリ受付部１１がウィンドウクエリ取得処理を実行する。 Step S1: Window Query Acquisition Processing The window query reception unit 11 executes window query acquisition processing.

図６は，ウィンドウクエリ取得処理のより詳細な処理フロー例を示す図である。 FIG. 6 is a diagram illustrating a more detailed processing flow example of the window query acquisition processing.

ウィンドウクエリ受付部１１は，ユーザのクエリ装置２からウィンドウクエリＱを受け取る（ステップＳ１１）。 The window query reception unit 11 receives the window query Q from the user query device 2 (step S11).

ウィンドウクエリ受付部１１は，受信したウィンドウクエリＱから，ウィンドウ構成要素ＸＰ１および演算対象要素ＸＰ２のパス指定を抽出する。そして，ウィンドウクエリ受付部１１は，データストリーム解析部１３へＸｐａｔｈ照合で用いる指定パス，すなわち，ウィンドウ構成要素ＸＰ１を指定するＸｐａｔｈＸＰ１＝／ｐｏｓ／ｃｕｓｔ，演算対象要素ＸＰ２を指定するＸｐａｔｈＸＰ２＝／ｐｏｓ／ｃｕｓｔ／ｐｒｉｃを送信する（ステップＳ１２）。 The window query reception unit 11 extracts path designations of the window component element XP1 and the calculation target element XP2 from the received window query Q. Then, the window query reception unit 11 specifies the specified path used for the Xpath collation to the data stream analysis unit 13, that is, Xpath XP1 = / pos / cus for specifying the window component element XP1, and Xpath XP2 = / for specifying the operation target element XP2. pos / cust / pric is transmitted (step S12).

ウィンドウクエリ受付部１１は，受信したウィンドウクエリＱから，ウィンドウサイズｗ（＝３），スライド幅ｓ（＝１）を抽出し，バッファ集計部１５へ送る（ステップＳ１３）。 The window query accepting unit 11 extracts the window size w (= 3) and the slide width s (= 1) from the received window query Q, and sends them to the buffer totaling unit 15 (step S13).

ステップＳ２：データストリーム解析処理
データストリーム解析部１３は，ＸＭＬストリームＸＳに対するＸｐａｔｈ照合を行って，ウィンドウ構成要素ＸＰ１，演算対象要素ＸＰ２を抽出する。 Step S2: Data Stream Analysis Processing The data stream analysis unit 13 performs Xpath collation on the XML stream XS, and extracts the window component element XP1 and the operation target element XP2.

図７は，データストリーム解析処理の処理フロー例を示す図である。 FIG. 7 is a diagram illustrating a processing flow example of the data stream analysis processing.

データストリーム解析部１３は，ＸＭＬストリームＸＳを受け取る。データストリーム解析部１３は，ＸＭＬストリームＸＳでの処理対象とするＸＭＬデータを示すＣＸを用意し，初期化として，受け取ったＸＭＬストリームＸＳの最初のＸＭＬデータをデータＣＸとする（ステップＳ２１）。図８は，データＣＸの例を示す図である。データストリーム解析部１３は，ＸＭＬデータのイベント列表現を受け取る。 The data stream analysis unit 13 receives the XML stream XS. The data stream analysis unit 13 prepares CX indicating the XML data to be processed in the XML stream XS, and sets the first XML data of the received XML stream XS as data CX as initialization (step S21). FIG. 8 is a diagram illustrating an example of the data CX. The data stream analysis unit 13 receives an event string representation of XML data.

次に，データストリーム解析部１３は，データＣＸに対してパス照合処理を行う（ステップＳ２２）。 Next, the data stream analysis unit 13 performs a path matching process on the data CX (step S22).

データストリーム解析部１３は，ＸＭＬストリームＸＳのデータＣＸの次にＸＭＬデータが存在するか判定する（ステップＳ２３）。データストリーム解析部１３は，ＸＭＬストリームＸＳのデータＣＸの次にＸＭＬデータが存在すれば（ステップＳ２３のＹ），次のＸＭＬデータをデータＣＸとし（ステップＳ２４），ＸＭＬストリームＸＳに次のＸＭＬデータが存在しなければ（ステップＳ２３のＮ），処理を終了する。 The data stream analysis unit 13 determines whether XML data exists after the data CX of the XML stream XS (step S23). If there is XML data next to the data CX of the XML stream XS (Y in step S23), the data stream analysis unit 13 sets the next XML data as the data CX (step S24), and the next XML data in the XML stream XS. Is not present (N in step S23), the process is terminated.

図９は，ステップＳ２２のパス照合処理のより詳細な処理フローを示す図である。 FIG. 9 is a diagram showing a more detailed processing flow of the path matching process in step S22.

データストリーム解析部１３のストリーム型パス照合部１３１は，初期化として，スキャン位置を示すカーソルＣをデータＣＸ（イベント列）の最初のイベントに設定し，演算対象要素フラグＦＡ＝０，値リストＶを空リストとする（ステップＳ２２１）。そして，ストリーム型パス照合部１３１は，パス照合を行い，データＣＸのカーソルＣがあるイベントの種別を判定する（ステップＳ２２２）。 The stream type path matching unit 131 of the data stream analyzing unit 13 sets the cursor C indicating the scan position as the first event of the data CX (event sequence) as an initialization, the calculation target element flag FA = 0, the value list V Is an empty list (step S221). Then, the stream type path matching unit 131 performs path matching and determines the type of event where the cursor C of the data CX is present (step S222).

ステップＳ２２２の処理で，ウィンドウ構成要素位置検出部１３２により，そのイベントが「ウィンドウ構成要素ＸＰ１の開始」であると判定された場合に，ウィンドウ構成要素開始記号送信部１３７が，ウィンドウ構成要素開始記号“＄”をバッファ１４に出力する（ステップＳ２２３）。ウィンドウ構成要素位置検出部１３２により，イベントが「ウィンドウ構成要素ＸＰ１の終了」であると判定された場合に，演算対象要素値リスト送信部１３８が，値リストＶをバッファ１４に出力する（ステップＳ２２４）。 In the process of step S222, when the window component position detection unit 132 determines that the event is “start of window component XP1”, the window component start symbol transmission unit 137 displays the window component start symbol. “$” Is output to the buffer 14 (step S223). When the window component position detecting unit 132 determines that the event is “end of window component XP1”, the calculation target element value list transmitting unit 138 outputs the value list V to the buffer 14 (step S224). ).

演算対象要素位置検出部１３３により，イベントが「演算対象要素ＸＰ２の開始」であると判定された場合に，演算対象要素位置検出部１３３が，演算対象要素フラグＦＡ＝１を設定する（ステップＳ２２５）。イベントが「演算対象要素ＸＰ２の終了」であると判定された場合に，演算対象要素位置検出部１３３が，演算対象要素フラグＦＡ＝０を設定する（ステップＳ２２６）。 When the calculation target element position detection unit 133 determines that the event is “start of calculation target element XP2”, the calculation target element position detection unit 133 sets the calculation target element flag FA = 1 (step S225). ). When it is determined that the event is “completion of the calculation target element XP2”, the calculation target element position detection unit 133 sets the calculation target element flag FA = 0 (step S226).

また，演算対象要素値検出部１３５により，イベントが「テキスト」のノードであると判定された場合に，さらに，演算対象要素値取得部１３６が，演算対象要素フラグＦＡ＝１であるかを判定する（ステップＳ２２７）。そして，演算対象要素フラグＦＡ＝１であれば（ステップＳ２２７のＹ），演算対象要素値取得部１３６が，カーソルＣが位置するイベントのテキスト値をリストＶに追加する（ステップＳ２２８）。一方，演算対象要素フラグＦＡ＝１でなければ（ステップＳ２２７のＮ），ステップＳ２２９の処理へ進む。 When the calculation target element value detection unit 135 determines that the event is a “text” node, the calculation target element value acquisition unit 136 further determines whether the calculation target element flag FA = 1. (Step S227). If the calculation target element flag FA = 1 (Y in step S227), the calculation target element value acquisition unit 136 adds the text value of the event where the cursor C is located to the list V (step S228). On the other hand, if the calculation target element flag FA is not 1 (N in step S227), the process proceeds to step S229.

ステップＳ２２３〜Ｓ２２６およびＳ２２８の処理後，または，ステップＳ２２２の処理においてカーソルＣが位置するイベントが上記のいずれの場合にも該当しない場合（それ以外），または，ステップＳ２２７の処理で演算対象要素フラグＦＡ＝１でない場合に，ストリーム型パス照合部１３１は，データＣＸで，カーソルＣが位置するイベントの次にイベントが存在するかを判定する（ステップＳ２２９）。カーソルＣが位置するイベントの次にイベントが存在する場合に（ステップＳ２２９のＹ），ストリーム型パス照合部１３１は，カーソルＣを次のイベントに移し（ステップＳ２２１０），ステップＳ２２２の処理へ戻す。 After the processing of steps S223 to S226 and S228, or when the event where the cursor C is located in the processing of step S222 does not correspond to any of the above cases (other than that), or in the processing of step S227, the calculation target element flag When FA is not 1, the stream type path matching unit 131 determines whether there is an event after the event where the cursor C is located in the data CX (step S229). If there is an event next to the event where the cursor C is located (Y in step S229), the stream type path matching unit 131 moves the cursor C to the next event (step S2210), and returns to the processing in step S222.

カーソルＣが位置するイベントの次にイベントが存在しない場合に（ステップＳ２２９のＮ），演算対象要素値リスト送信部１３８は，値リストＶをバッファ１４に出力する（ステップＳ２２１１）。 When there is no event next to the event where the cursor C is located (N in step S229), the calculation target element value list transmission unit 138 outputs the value list V to the buffer 14 (step S2211).

図１０は，ＸＭＬストリームのＸＭＬデータと，パス照合によりバッファ１４に出力されるデータとの関係例を示す図である。 FIG. 10 is a diagram illustrating an example of the relationship between XML data of an XML stream and data output to the buffer 14 by path verification.

図１０に示すＸＭＬストリームＸＳにおいて，データＣＸ_１を破線矩形で示している。図１０のデータＣＸ_１は，図８に示すイベント列に対応している。 In XML stream XS shown in FIG. 10 shows a data CX ₁ by a broken line rectangle. The data CX _{1 in} FIG. 10 corresponds to the event sequence shown in FIG.

データストリーム解析部１３により，データＣＸ_１において，ＸＰ１＝／ｐｏｓ／ｃｕｓｔとの照合によりパス／ｐｏｓ／ｃｕｓｔが検出されると（図８（ａ）），ウィンドウ構成要素ＸＰ１“ｃｕｓｔ”の開始位置と判断されて，ウィンドウ構成要素開始記号“＄”がバッファ１４に出力される。 When the data stream analysis unit 13 detects the path / pos / cus in the data CX ₁ by collating with XP1 = / pos / cus (FIG. 8A), the start position of the window component XP1 “cust” The window component start symbol “$” is output to the buffer 14.

その後に，ＸＰ２＝／ｐｏｓ／ｃｕｓｔ／ｐｒｉｃとの照合によりパス／ｐｏｓ／ｃｕｓｔ／ｐｒｉｃが検出されると（図８（ｂ）），演算対象要素“ｐｒｉｃ”の開始位置と判断されて，ＦＡ＝１が設定され，対応する演算対象要素値（５００）が抽出されて値リストＶに格納される（図８（ｃ））。続いて，パス／ｐｏｓ／ｃｕｓｔ／ｐｒｉｃの終了が検出されると（図８（ｄ）），演算対象要素“ｐｒｉｃ”の終了位置と判断されて，ＦＡ＝０と変更される。 After that, when the path / pos / cus / pric is detected by collating with XP2 = / pos / cust / pric (FIG. 8 (b)), it is determined as the start position of the calculation target element “pric”, and FA = 1 is set, the corresponding element value (500) to be calculated is extracted and stored in the value list V (FIG. 8 (c)). Subsequently, when the end of the path / pos / cust / pric is detected (FIG. 8D), it is determined that the calculation target element “pric” ends, and FA = 0 is changed.

その後に，パス／ｐｏｓ／ｃｕｓｔの終了が検出されると（図８（ｅ）），ウィンドウ構成要素ＸＰ１“ｃｕｓｔ”の終了位置と判断されて，値リストＶ＝｛５００｝がバッファ１４に出力される。 Thereafter, when the end of the path / pos / cus is detected (FIG. 8 (e)), it is determined as the end position of the window component XP1 “cust”, and the value list V = {500} is output to the buffer 14 Is done.

図１０に示すＸＭＬストリームＸＳにおいて，データＣＸ_２を破線矩形で示している。 In XML stream XS shown in FIG. 10 shows a data CX ₂ by a broken line rectangle.

データＣＸ_１の場合と同様に，データストリーム解析部１３によって，ＸＰ１＝／ｐｏｓ／ｃｕｓｔとのパス照合によってウィンドウ構成要素ＸＰ１“ｃｕｓｔ”の開始位置が検出されると，ウィンドウ構成要素開始記号“＄”がバッファ１４に出力される。 Similarly to the case of the data CX ₁ , when the data stream analysis unit 13 detects the start position of the window component XP1 “cust” by path verification with XP1 = / pos / cus, the window component start symbol “$” "Is output to the buffer 14.

その後に，ＸＰ２＝／ｐｏｓ／ｃｕｓｔ／ｐｒｉｃとのパス照合によって演算対象要素“ｐｒｉｃ”の開始位置が検出されると，ＦＡ＝１が設定され，対応する演算対象要素値（３００）がリストＶに格納される。演算対象要素“ｐｒｉｃ”の終了位置が検出されると，ＦＡ＝０と変更されるが，データＸ_２では，続いて，演算対象要素“ｐｒｉｃ”の開始が検出されて，ＦＡ＝１が再設定され，対応する演算対象要素値（２００）がリストＶに追加される。 Thereafter, when the start position of the calculation target element “pric” is detected by path matching with XP2 = / pos / cust / pric, FA = 1 is set, and the corresponding calculation target element value (300) is displayed in the list V. Stored in When the end position of the calculation target element “pric” is detected, FA = 0 is changed. However, in the data X ₂ , the start of the calculation target element “prix” is subsequently detected and FA = 1 is reset. The calculation target element value (200) is set and added to the list V.

その後，データＣＸ_１の場合と同様に，ウィンドウ構成要素ＸＰ１“ｃｕｓｔ”の終了が検出されると，リストＶ＝｛３００，２００｝がバッファ１４に出力される。 Thereafter, as in the case of the data CX ₁ , when the end of the window component XP 1 “cust” is detected, the list V = {300, 200} is output to the buffer 14.

ステップＳ３：解出力処理
図１１は，解出力処理の処理フロー例を示す図である。 Step S3: Solution Output Processing FIG. 11 is a diagram illustrating a processing flow example of the solution output processing.

バッファ集計部１５は，初期化として，バッファ１４でのデータの位置を示すカーソルＣＢを，バッファ１４の最初のデータに設定し，ウィンドウサイズカウンタＣＷ＝０を設定し，出力シーケンスＳｅｑを空列とする（ステップＳ３１）。 As an initialization, the buffer totaling unit 15 sets the cursor CB indicating the position of data in the buffer 14 to the first data in the buffer 14, sets the window size counter CW = 0, and sets the output sequence Seq as an empty string. (Step S31).

バッファ集計部１５は，バッファ１４で，カーソルＣＢが設定されたデータの種類を判定する（ステップＳ３２）。データの種類が，ウィンドウ構成要素開始記号“＄”であれば（ステップＳ３２の「記号＄」），バッファ集計部１５は，ウィンドウサイズカウンタＣＷを１増やして（ステップＳ３３），ステップＳ３９に進み，データの種類が，値リストＶであれば（ステップＳ３２の「値リスト」），ステップＳ３４に進む。ステップＳ３４の処理で，バッファ集計部１５は，ＣＷ≧０ならば，カーソルＣＢが設定された値リストＶを出力シーケンスＳｅｑの末尾に追加する。 The buffer totaling unit 15 determines the type of data for which the cursor CB is set in the buffer 14 (step S32). If the data type is the window component start symbol “$” (“symbol $” in step S32), the buffer tabulation unit 15 increments the window size counter CW by 1 (step S33), and proceeds to step S39. If the data type is the value list V (“value list” in step S32), the process proceeds to step S34. In the process of step S34, if CW ≧ 0, the buffer totaling unit 15 adds the value list V in which the cursor CB is set to the end of the output sequence Seq.

続いて，バッファ集計部１５は，ウィンドウサイズカウンタＣＷがウィンドウサイズｗより小さいか（ＣＷ＜ｗ）を判定して（ステップＳ３５），ウィンドウサイズカウンタＣＷがウィンドウサイズｗより小さくなく，等しい（ＣＷ＝ｗ）場合に（ステップＳ３５のＮ），出力シーケンスＳｅｑ内のデータ（値リストＶ）の集計値ａを求めて，解出力部１６へ出力する（ステップＳ３６）。解出力部１６は，上記のステップＳ３６の処理により，集計値ａを逐次受け取り，ウィンドウクエリＱの解Ａとしてクエリ装置２へ送る。 Subsequently, the buffer totalization unit 15 determines whether the window size counter CW is smaller than the window size w (CW <w) (step S35), and the window size counter CW is not smaller than the window size w and is equal (CW = w) In the case (N in step S35), the total value a of the data (value list V) in the output sequence Seq is obtained and output to the solution output unit 16 (step S36). The solution output unit 16 sequentially receives the total value a by the processing of step S36 described above, and sends it to the query device 2 as the solution A of the window query Q.

さらに，バッファ集計部１５は，出力シーケンスＳｅｑに格納されたデータの先頭からｓ件のデータを削除する。なお，スライド幅ｓがウィンドウサイズｗより大きい（ｓ＞ｗ）場合は，出力シーケンスＳｅｑに格納されたデータを全件削除する。バッファ集計部１５は，ウィンドウサイズカウンタＣＷを，ＣＷ−ｓの値に置き換える（ステップＳ３７）。 Furthermore, the buffer totalization unit 15 deletes s data from the head of the data stored in the output sequence Seq. When the slide width s is larger than the window size w (s> w), all data stored in the output sequence Seq is deleted. The buffer totalization unit 15 replaces the window size counter CW with the value of CW-s (step S37).

その後，バッファ集計部１５は，バッファ１４内でカーソルＣＢが設定されたデータの次にデータが存在するかを判定する（ステップＳ３８）。バッファ１４内に次のデータが存在する場合に（ステップＳ３８のＹ），バッファ集計部１５は，カーソルＣＢを次のデータ（値リスト）に移し（ステップＳ３９），ステップＳ３２の処理へ戻る。バッファ１４内に次のデータが存在しない場合に（ステップＳ３８のＮ），処理を終了する。 Thereafter, the buffer totaling unit 15 determines whether data exists after the data for which the cursor CB is set in the buffer 14 (step S38). When the next data exists in the buffer 14 (Y in step S38), the buffer totalization unit 15 moves the cursor CB to the next data (value list) (step S39), and returns to the process in step S32. If the next data does not exist in the buffer 14 (N in step S38), the process is terminated.

図１２〜図１４は，バッファ１４のデータと，解出力処理により出力シーケンスＳｅｑに出力されるデータとの関係を模式的に示す図である。図１２〜図１４に示す処理は，ウィンドウサイズｗ＝３，スライド幅ｓ＝１の場合のものである。 12 to 14 are diagrams schematically showing the relationship between the data in the buffer 14 and the data output to the output sequence Seq by the solution output process. The processing shown in FIGS. 12 to 14 is for the case where the window size w = 3 and the slide width s = 1.

バッファ集計部１５によって，バッファ１４内で，先頭のデータにカーソルＣＢが設定され，ウィンドウサイズカウンタＣＷ＝０に初期化される。 The buffer tabulation unit 15 sets the cursor CB to the first data in the buffer 14 and initializes the window size counter CW = 0.

図１２（Ａ）に示すように，ＣＢが位置するデータがウィンドウ構成要素開始記号“＄”であり，ウィンドウサイズカウンタＣＷが１増加され（ＣＷ＝１），カーソルＣＢが次のデータに移される。そして，図１２（Ｂ）に示すように，ＣＢが移動したデータが，演算対象要素値の値リストＶであれば，値リストＶ＝｛５００｝が出力シーケンスＳｅｑに格納され，カーソルＣＢはさらに次のデータに移される。 As shown in FIG. 12A, the data in which CB is located is the window element start symbol “$”, the window size counter CW is incremented by 1 (CW = 1), and the cursor CB is moved to the next data. . Then, as shown in FIG. 12B, if the data moved by CB is the value list V of the element values to be calculated, the value list V = {500} is stored in the output sequence Seq, and the cursor CB further Move to next data.

そして，図１３（Ａ）に示すように，カーソルＣＢが移動したデータがウィンドウ構成要素開始記号“＄”であれば，ウィンドウサイズカウンタＣＷ＝２となり，カーソルＣＢが次のデータに移される。そして，図１３（Ｂ）に示すように，カーソルＣＢが移動したデータの値リストＶ＝｛３００，２００｝が出力シーケンスＳｅｑに格納され，さらにカーソルＣＢが次のデータに移される。 Then, as shown in FIG. 13A, if the data that the cursor CB has moved is the window element start symbol “$”, the window size counter CW = 2, and the cursor CB is moved to the next data. Then, as shown in FIG. 13B, the data value list V = {300, 200} to which the cursor CB has moved is stored in the output sequence Seq, and the cursor CB is moved to the next data.

同様にして，カーソルＣＢが移動したデータがウィンドウ構成要素開始記号“＄”であれば，ＣＷ＝３となり，カーソルＣＢが次のデータに移される。 Similarly, if the data to which the cursor CB has moved is the window component start symbol “$”, CW = 3, and the cursor CB is moved to the next data.

その後，図１４（Ａ）に示すように，カーソルＣＢが移動したデータが値リストであれば，その値リストＶ＝｛１００｝が出力シーケンスＳｅｑに格納される。ここで，ウィンドウサイズカウンタＣＷがウィンドウサイズｗ（＝３）と等しくなるので，出力シーケンスＳｅｑに格納された値リストが集計されて，集計値ａ（１１００）が得られる。さらに，図１４（Ｂ）に示すように，出力シーケンスＳｅｑに格納されたデータ（値リスト）の先頭から，ｓ＝１件のデータ（値リスト）が削除され，ウィンドウサイズカウンタＣＷがＣＷ−ｓ（＝２）の値に変更される。なお，カーソルＣＢは，同様にして次のデータに移される。 Thereafter, as shown in FIG. 14A, if the data to which the cursor CB has moved is a value list, the value list V = {100} is stored in the output sequence Seq. Here, since the window size counter CW is equal to the window size w (= 3), the value list stored in the output sequence Seq is totaled to obtain the total value a (1100). Further, as shown in FIG. 14B, s = 1 data (value list) is deleted from the head of the data (value list) stored in the output sequence Seq, and the window size counter CW is set to CW-s. The value is changed to (= 2). The cursor CB is similarly moved to the next data.

図１５は，ＸＭＬストリームＸＳの各ＸＭＬデータと，バッファに出力されるデータとの関係を模式的に示す図である。 FIG. 15 is a diagram schematically illustrating a relationship between each XML data of the XML stream XS and data output to the buffer.

ウィンドウ処理装置１は，ウィンドウ構成要素ＸＰ１の開始位置を検出する度に，直前に検出された演算対象要素ＸＰ２に対応する演算対象要素値の集合とウィンドウ構成要素開始記号“＄”とをバッファ１４に格納する。 Each time the window processing device 1 detects the start position of the window component XP1, the buffer 14 stores a set of calculation target element values corresponding to the calculation target element XP2 detected immediately before and the window component start symbol “$”. To store.

ウィンドウ処理装置１は，１つのウィンドウ構成要素ＸＰ１に含まれる演算対象要素ＸＰ２が不特定数であっても，包含される演算対象要素ＸＰ２をすべてウィンドウ構成要素ＸＰ１に関連付けて検出し，演算対象要素ＸＰ２に対応する演算対象値をリストにまとめて保持する。 The window processing apparatus 1 detects all the included calculation target elements XP2 in association with the window component XP1 even if the calculation target element XP2 included in one window component XP1 is an unspecified number. The calculation target values corresponding to XP2 are held together in a list.

したがって，バッファ１４には，１回のスキャンで，ウィンドウ構成要素ＸＰ１の開始位置を示すウィンドウ構成要素開始記号“＄”と，ウィンドウ構成要素ＸＰ１に含まれる演算対象要素ＸＰ２に対応する演算対象要素値の集合（値リストＶ）とが交互に格納される。そのため，解集計処理において，ウィンドウ構成要素を単位とした演算対象要素値の集計が可能となる。 Therefore, in the buffer 14, the window component element start symbol “$” indicating the start position of the window component element XP1 and the calculation object element value corresponding to the operation element XP2 included in the window element element XP1 are stored in the buffer 14 once. Are stored alternately (value list V). Therefore, in the solution aggregation process, it is possible to aggregate the calculation target element values in units of window components.

本実施例において，ウィンドウ処理装置１は，ステップＳ２のデータストリーム解析処理とステップＳ３の解出力処理を，ＸＭＬストリームＸＳのＸＭＬデータＸごとに実行し，ＸＭＬストリームＸＳのＸＭＬデータＸの取得に従ってループして実行する。しかし，ＸＭＬストリームＸＳが有限の系列であれば，ウィンドウ処理装置１は，ＸＭＬストリームＸＳについて，ステップＳ２のデータストリーム解析処理とステップＳ３の解出力処理をバッチ的に実行するように構成されてもよい。 In the present embodiment, the window processing apparatus 1 executes the data stream analysis process of step S2 and the solution output process of step S3 for each XML data X of the XML stream XS, and loops according to the acquisition of the XML data X of the XML stream XS. And run. However, if the XML stream XS is a finite sequence, the window processing apparatus 1 may be configured to execute the data stream analysis process in step S2 and the solution output process in step S3 in batch for the XML stream XS. Good.

ウィンドウ処理装置１は，ＣＰＵおよびメモリ等を有するハードウェアと，ソフトウェアプログラムとを備えるコンピュータ・システム，または専用ハードウェアによって実現することができる。 The window processing device 1 can be realized by a computer system including hardware having a CPU, a memory, and the like, and a software program, or dedicated hardware.

ウィンドウ処理装置１は，演算装置（ＣＰＵ），一時記憶装置（ＤＲＡＭ，フラッシュメモリ等）および永続性記憶装置（ＨＤＤ，フラッシュメモリ等）を有し，外部とデータの入出力をするコンピュータによって実施することができる。また，ウィンドウ処理装置１は，このコンピュータが実行可能なプログラムによっても実施することができる。この場合に，ウィンドウ処理装置１が有すべき機能の処理内容を記述したプログラムが提供される。提供されたプログラムを上記コンピュータが実行することによって，上記説明したウィンドウ処理装置１の処理機能がコンピュータ上で実現される。なお，上記コンピュータは，可搬型記録媒体から直接プログラムを読み取り，そのプログラムに従った処理を実行することもできる。さらに，上記プログラムは，コンピュータで読み取り可能な記録媒体に記録しておくことができる。 The window processing apparatus 1 includes a processing unit (CPU), a temporary storage device (DRAM, flash memory, etc.), and a permanent storage device (HDD, flash memory, etc.), and is implemented by a computer that inputs and outputs data to the outside. be able to. The window processing apparatus 1 can also be implemented by a program executable by this computer. In this case, a program describing the processing contents of the functions that the window processing apparatus 1 should have is provided. When the computer executes the provided program, the processing functions of the window processing apparatus 1 described above are realized on the computer. The computer can also read a program directly from a portable recording medium and execute processing according to the program. Furthermore, the program can be recorded on a computer-readable recording medium.

以上説明したように，開示したウィンドウ処理装置１によれば，次のような効果がある。 As described above, the disclosed window processing apparatus 1 has the following effects.

すなわち，ウィンドウ処理装置１によれば，ウィンドウ構成要素に含まれる演算対象の要素が不特定数であるようなデータのストリームに対しても１回のスキャンによるウィンドウ処理が可能となり，処理時間および格納領域を軽減することができる。 In other words, the window processing device 1 enables window processing by one scan even for a data stream in which an operation target element included in a window component is an unspecified number, and processing time and storage The area can be reduced.

従来技術では，ウィンドウ構成要素の開始位置と演算対象要素位置を検出するために，ＸＭＬストリームにおける各ＸＭＬデータを，２回スキャンする必要があった。しかし，ウィンドウ処理装置１によれば，ウィンドウ構成要素の開始位置と演算対象要素位置を１回のスキャンで同時に検出するため，計算時間を削減できる。 In the prior art, in order to detect the start position of the window constituent element and the calculation target element position, it is necessary to scan each XML data in the XML stream twice. However, according to the window processing apparatus 1, since the start position of the window constituent element and the calculation target element position are detected simultaneously in one scan, the calculation time can be reduced.

さらに，副次的効果として，ウィンドウ処理装置１では，ウィンドウ構成要素のすべてをバッファ１４に保持するタイプの従来技術とは異なり，集計処理に必要最低限のデータ（値リスト）を取得してバッファ１４に格納する。その結果として，ウィンドウ処理装置１によれば，バッファ１４に保持するデータ量を削減できるため，計算時間と計算領域とを削減できる。 Further, as a secondary effect, the window processing apparatus 1 obtains the minimum data (value list) necessary for the totaling processing and obtains the buffer, unlike the conventional technique in which all the window components are held in the buffer 14. 14. As a result, according to the window processing apparatus 1, the amount of data held in the buffer 14 can be reduced, so that the calculation time and the calculation area can be reduced.

１ウィンドウ処理装置
１１ウィンドウクエリ受付部
１２データストリーム取得部
１３データストリーム解析部
１３１ストリーム型パス照合部
１３２ウィンドウ構成要素位置検出部
１３３演算対象要素位置検出部
１３５演算対象要素値検出部
１３６演算対象要素値取得部
１３７ウィンドウ構成要素開始記号送信部
１３８演算対象要素値リスト送信部
１４バッファ
１５バッファ集計部
１６解出力部
２クエリ装置
３データ送信装置
DESCRIPTION OF SYMBOLS 1 Window processing apparatus 11 Window query reception part 12 Data stream acquisition part 13 Data stream analysis part 131 Stream type | mold path collation part 132 Window component element position detection part 133 Calculation object element position detection part 135 Calculation object element value detection part 136 Calculation object element Value acquisition unit 137 Window component element start symbol transmission unit 138 Operation target element value list transmission unit 14 Buffer 15 Buffer aggregation unit 16 Solution output unit 2 Query device 3 Data transmission device

Claims

A window processing device for performing window processing on a data series,
A buffer for storing data,
A data acquisition unit for acquiring a data series including elements having inclusion relations;
From the data series, the position of the window component defining the window processing unit and the position of the operation target element included in the window component are sequentially detected in association with the window component, and the window component is detected. A data stream analysis unit that stores a set of calculation target element values corresponding to the calculation target element associated with the window component detected immediately before, in the buffer;
A window processing device, comprising: a window processing unit that performs window processing on a set of operation processing values associated with the window component stored in the buffer.

The data stream analysis unit holds a set of calculation target element values corresponding to the detected calculation target element, and each time the start position of the window constituent element is detected, the held set of calculation target element values And the start position symbol of the detected window component in the buffer,
The window processing device according to claim 1, wherein the window processing unit performs the window processing based on a start position symbol of a window component stored in the buffer.

A window processing method for a data series executed by a computer,
A processing step for obtaining a data series including elements having inclusion relations;
A step of sequentially detecting, from the data series, a position of a window component that defines a window processing unit and a position of an operation target element included in the window component in association with the window component;
A step of storing, in the buffer, a set of operation target element values corresponding to the operation target elements associated with the window configuration element detected immediately before each detection of the window component;
A window processing method comprising: performing a window process on a set of operation processing values associated with the window component stored in the buffer.

A window processing program for causing a computer to perform window processing on a data series,
A process of obtaining a data series including elements having an inclusion relation in the computer;
A process of sequentially detecting, from the data series, a position of a window component that defines a window processing unit and a position of an operation target element included in the window component in association with the window component;
A process of storing, in the buffer, a set of operation target element values corresponding to the operation target element associated with the window component detected immediately before each detection of the window component;
A window processing program for executing a window processing on a set of operation processing values associated with the window component stored in the buffer.