JP2012194950A

JP2012194950A - Structured document management device, method, and program

Info

Publication number: JP2012194950A
Application number: JP2011060371A
Authority: JP
Inventors: Minoru Inada; 稔稲田; Masakazu Hattori; 雅一服部
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2011-03-18
Filing date: 2011-03-18
Publication date: 2012-10-11
Anticipated expiration: 2031-03-18
Also published as: JP5296128B2

Abstract

PROBLEM TO BE SOLVED: To provide a structured document management device capable of performing structure collation processing at high speed, a method, and a program.SOLUTION: In an inventive structured document management device, when inputted query data includes a first condition designating a hierarchical relation of layers of the logical structure of structured document data and a second condition designating an order relation of elements specified by element IDs, query data decomposition means decomposes the query data into first partial query data including only the first condition and second partial query data including means for performing joint operation of a collation result by the first partial query data according to the second condition. Structure collation processing means collates a data set of the structured document data with the first partial query data and outputs a collation result. Joint operation processing means performs joint operation processing of the collation result outputted from the structure collation processing means according to a procedure of joint operation included in the second partial query data.

Description

本発明の実施形態は、構造化文書管理装置、方法およびプログラムに関する。 Embodiments described herein relate generally to a structured document management apparatus, method, and program.

従来、ＸＭＬ（ＥｘｔｅｎｓｉｂｌｅＭａｒｋｕｐＬａｎｇｕａｇｅ）などで記述された構造化文書データを記憶・検索するための構造化文書管理装置が知られている。構造化文書管理装置における構造化文書データの検索のために、ＲＤＢＭＳ（ＲｅｌａｔｉｏｎａｌＤａｔａｂａｓｅＭａｎａｇｅｍａｎｔＳｙｓｔｅｍ）における問い合わせ言語ＳＱＬのように、ＸＭＬデータに対する問い合わせ言語ＸＱｕｅｒｙ（ＸＭＬＱｕｅｒｙＬａｎｇｕａｇｅ）が策定されており、多くの構造化文書管理装置でサポートされている。 Conventionally, a structured document management apparatus for storing / retrieving structured document data described in XML (Extensible Markup Language) or the like is known. In order to search for structured document data in the structured document management apparatus, a query language XQuery (XML Query Language) for XML data has been formulated, such as the query language SQL in RDBMS (Relational Database Management System). Supported by structured document management devices.

ＸＱｕｅｒｙは、ＸＭＬデータ集合をデータベースのように扱うための言語であり、条件に合致するデータ集合の取り出しや集計・分析を行うための手段が提供されている。ＸＭＬデータは親子や兄弟などの要素が組み合わさった階層化された論理構造（階層構造）を持つため、条件にはこの階層構造に関する条件（構造条件）を指定することができる。 XQuery is a language for handling an XML data set like a database, and provides means for extracting, summing up, and analyzing a data set that matches a condition. Since XML data has a hierarchical logical structure (hierarchical structure) in which elements such as parents and siblings are combined, a condition (structural condition) relating to this hierarchical structure can be specified as a condition.

構造条件の処理には、構造化文書管理装置が記憶している構造化文書データが条件に合致する構造を持つかを照合する構造照合処理を行う必要がある。この構造照合処理は、構造条件が階層の上下関係を指定する条件のみであれば比較的高速に処理することが可能であるが、構造条件の中にＸＭＬデータに含まれる要素の順序関係を指定する条件が含まれる場合は、高速に処理することが難しい。 In the process of the structure condition, it is necessary to perform a structure matching process for checking whether the structured document data stored in the structured document management apparatus has a structure that matches the condition. This structure matching process can be processed at relatively high speed if the structure condition is only a condition that specifies the hierarchical relationship of the hierarchy, but the order relation of the elements included in the XML data is specified in the structure condition. If the conditions to be included are included, it is difficult to process at high speed.

特開２００７−２２６４５２号公報JP 2007-226452 A

本発明が解決しようとする課題は、構造照合処理を高速に行うことができる構造化文書管理装置、方法およびプログラムを提供することである。 The problem to be solved by the present invention is to provide a structured document management apparatus, method, and program capable of performing structure collation processing at high speed.

実施形態の構造化文書管理装置は、構造化文書データ受付手段と、識別子付与手段と、構造化文書データ記憶手段と、クエリデータ受付手段と、クエリデータ分解手段と、構造照合処理手段と、結合演算処理手段と、を備える。構造化文書データ受付手段は、階層化された論理構造を有する構造化文書データの入力を受け付ける。識別子付与手段は、入力された前記構造化文書データに出現する要素に該構造化文書データ内での出現順序が要素間で比較可能な識別子を付与する。構造化文書データ記憶手段は、前記要素に前記識別子が付与された前記構造化文書データを記憶する。クエリデータ受付手段は、クエリデータの入力を受け付ける。クエリデータ分解手段は、入力されたクエリデータが、前記構造化文書データの論理構造における階層の上下関係を指定する第１条件と、前記識別子で特定される前記要素の順序関係を指定する第２条件とを含む場合に、該クエリデータを、前記第１条件のみを含む第１の部分クエリデータと、前記第１の部分クエリデータによる照合結果を前記第２条件に応じて結合演算する手順を含む第２の部分クエリデータとに分解する。構造照合処理手段は、前記構造化文書データ記憶手段が記憶する前記構造化文書データのデータ集合に対して、前記第１の部分クエリデータによる照合を行い、照合結果を出力する。結合演算処理手段は、前記照合結果を、前記第２の部分クエリデータに含まれる結合演算の手順に従って結合演算処理する。 The structured document management apparatus according to the embodiment includes a structured document data receiving unit, an identifier assigning unit, a structured document data storage unit, a query data receiving unit, a query data decomposing unit, a structure matching processing unit, Arithmetic processing means. The structured document data accepting unit accepts input of structured document data having a hierarchical logical structure. The identifier assigning means assigns an identifier whose appearance order in the structured document data can be compared between the elements to the element appearing in the input structured document data. The structured document data storage unit stores the structured document data in which the identifier is assigned to the element. The query data accepting unit accepts input of query data. The query data decomposing means has a second condition in which the input query data specifies a first condition specifying the hierarchical relationship in the logical structure of the structured document data and an order relationship of the elements specified by the identifier. Including a condition, the query data is combined with the first partial query data including only the first condition and the collation result of the first partial query data according to the second condition. It decomposes | disassembles into the 2nd partial query data containing. The structure matching processing means performs matching with the first partial query data against the data set of the structured document data stored in the structured document data storage means, and outputs a matching result. The join operation processing means performs a join operation process on the collation result according to a join operation procedure included in the second partial query data.

構造化文書管理システムのシステム構築例を示す模式図。The schematic diagram which shows the system construction example of a structured document management system. サーバおよびクライアント端末のモジュール構成図。The module block diagram of a server and a client terminal. 第１の実施形態におけるサーバおよびクライアント端末の概略構成を示すブロック図。The block diagram which shows schematic structure of the server and client terminal in 1st Embodiment. 構造化文書データの一例を示す説明図。Explanatory drawing which shows an example of structured document data. 図４に例示した構造化文書データに対して要素ＩＤを付与した要素ＩＤ付与済み構造化文書データの一例を示す説明図。FIG. 5 is an explanatory diagram illustrating an example of structured document data with an element ID assigned to the structured document data illustrated in FIG. 4. クエリデータの一例を示す説明図。Explanatory drawing which shows an example of query data. 検索処理部による検索処理の流れを示すフローチャート。The flowchart which shows the flow of the search process by a search process part. クエリデータ解析処理の流れを示すフローチャート。The flowchart which shows the flow of a query data analysis process. 図６に例示したクエリデータについてクエリデータ解析処理を行った結果である第１の部分クエリデータと第２の部分クエリデータの一例を示す説明図。Explanatory drawing which shows an example of the 1st partial query data and 2nd partial query data which are the result of having performed the query data analysis process about the query data illustrated in FIG. 構造照合処理の概略を示す説明図。Explanatory drawing which shows the outline of a structure collation process. 図９に例示した第１の部分クエリデータを用いて図５に例示した要素ＩＤ付与済み構造化文書データについて構造照合処理を行った結果である構造照合処理結果データの一例を示す説明図。FIG. 10 is an explanatory diagram illustrating an example of structure matching processing result data that is a result of performing structure matching processing on the structured document data with element IDs exemplified in FIG. 5 using the first partial query data illustrated in FIG. 9; 図９に例示した第２の部分クエリデータを用いて図１１に例示した構造照合処理結果データについて結合演算処理を行う場合の説明図。Explanatory drawing in the case of performing a joint calculation process about the structure collation process result data illustrated in FIG. 11 using the 2nd partial query data illustrated in FIG. 図９に例示したクエリデータの結果データを示す説明図。Explanatory drawing which shows the result data of the query data illustrated in FIG. 第２の実施形態におけるサーバおよびクライアント端末の概略構成を示すブロック図。The block diagram which shows schematic structure of the server and client terminal in 2nd Embodiment. 構造ガイドデータの一例を示す説明図。Explanatory drawing which shows an example of structure guide data. クエリデータの一例を示す説明図。Explanatory drawing which shows an example of query data. 第２の実施形態におけるクエリデータ解析処理の流れを示すフローチャート。The flowchart which shows the flow of the query data analysis process in 2nd Embodiment. 図１６に例示したクエリデータについて図１７のステップＳ２１５までの処理を行った結果である第１の部分クエリデータと第２の部分クエリデータの一例を示す説明図。Explanatory drawing which shows an example of the 1st partial query data and 2nd partial query data which are the result of having performed the process to step S215 of FIG. 17 about the query data illustrated in FIG. 構造条件書き換え処理の流れを示すフローチャート。The flowchart which shows the flow of a structural condition rewriting process. 図１８に例示した第１の部分クエリデータについて構造条件書き換え処理を行った結果を示す説明図。FIG. 19 is an explanatory diagram illustrating a result of the structural condition rewriting process performed on the first partial query data illustrated in FIG. 18. 図２０に例示した構造条件書き換え処理後の第１の部分クエリデータを用いて図５に例示した要素ＩＤ付与済み構造化文書データについて構造照合処理を行った結果である構造照合処理結果データの一例を示す説明図。An example of structure matching process result data that is a result of performing structure matching processing on the structured document data with element IDs exemplified in FIG. 5 using the first partial query data after the structure condition rewriting process exemplified in FIG. FIG. 図２０に例示した第２の部分クエリデータを用いて図２１に例示した構造照合処理結果データについて結合演算処理を行う場合の説明図。FIG. 22 is an explanatory diagram in a case where a join operation process is performed on the structure matching process result data illustrated in FIG. 21 using the second partial query data illustrated in FIG. 20. 図１６に例示したクエリデータの結果データを示す説明図。Explanatory drawing which shows the result data of the query data illustrated in FIG. クエリデータの一例を示す説明図。Explanatory drawing which shows an example of query data. 第３の実施形態におけるクエリデータ解析処理の流れを示すフローチャート。The flowchart which shows the flow of the query data analysis process in 3rd Embodiment. 図２４に例示したｐｏｓｉｔｉｏｎ関数を含むクエリデータについてクエリデータ解析処理を行った結果である第１の部分クエリデータと第２の部分クエリデータの一例を示す説明図。FIG. 25 is an explanatory diagram illustrating an example of first partial query data and second partial query data, which is a result of performing query data analysis processing on query data including the position function illustrated in FIG. 24. 図２４に例示したｌａｓｔ関数を含むクエリデータについてクエリデータ解析処理を行った結果である第１の部分クエリデータと第２の部分クエリデータの一例を示す説明図。FIG. 25 is an explanatory diagram illustrating an example of first partial query data and second partial query data, which is a result of query data analysis processing performed on query data including the last function illustrated in FIG. 24. 図２６に例示した第１のクエリデータを用いて図５に例示した要素ＩＤ付与済み構造化文書データについて構造照合処理を行った結果である構造照合処理結果データの一例を示す説明図。FIG. 27 is an explanatory diagram illustrating an example of structure matching processing result data that is a result of performing a structure matching process on the structured document data with element IDs exemplified in FIG. 5 using the first query data exemplified in FIG. 26; 図２６に例示した第２の部分クエリデータを用いて図２８に例示した構造照合処理結果データについて結合演算処理を行う場合の説明図。FIG. 28 is an explanatory diagram in a case where a join operation process is performed on the structure matching process result data illustrated in FIG. 28 using the second partial query data illustrated in FIG. 図２７に例示した第１のクエリデータを用いて図５に例示した要素ＩＤ付与済み構造化文書データについて構造照合処理を行った結果である構造照合処理結果データの一例を示す説明図。FIG. 28 is an explanatory diagram illustrating an example of structure matching processing result data that is a result of performing structure matching processing on the structured document data with element IDs exemplified in FIG. 5 using the first query data illustrated in FIG. 27; 図２７に例示した第２の部分クエリデータを用いて図３０に例示した構造照合処理結果データについて結合演算処理を行う場合の説明図。FIG. 28 is an explanatory diagram in a case where a join operation process is performed on the structure matching process result data illustrated in FIG. 30 using the second partial query data illustrated in FIG. 27; 図２４に例示したｐｏｓｉｔｉｏｎ関数を含むクエリデータの結果データを示す説明図。FIG. 25 is an explanatory diagram illustrating result data of query data including the position function illustrated in FIG. 24. 図２４に例示したｌａｓｔ関数を含むクエリデータの結果データを示す説明図。FIG. 25 is an explanatory diagram illustrating result data of query data including the last function illustrated in FIG. 24. 第４の実施形態におけるサーバおよびクライアント端末の概略構成を示すブロック図。The block diagram which shows schematic structure of the server and client terminal in 4th Embodiment. クエリデータの一例を示す説明図。Explanatory drawing which shows an example of query data. 第４の実施形態における検索処理の流れを示すフローチャート。The flowchart which shows the flow of the search process in 4th Embodiment. 第４の実施形態におけるクエリデータ解析処理の流れを示すフローチャート。The flowchart which shows the flow of the query data analysis process in 4th Embodiment. 図３５に例示したクエリデータについてクエリデータ解析処理を行った結果である第１の部分クエリデータの一例を示す説明図。Explanatory drawing which shows an example of the 1st partial query data which are the results of having performed the query data analysis process about the query data illustrated in FIG. 図３８に例示した第１の部分クエリデータを用いて図５に例示した要素ＩＤ付与済み構造化文書データについて構造照合処理を行った結果である構造照合処理結果データの一例を示す説明図。FIG. 40 is an explanatory diagram illustrating an example of structure matching processing result data that is a result of performing structure matching processing on the structured document data with element IDs illustrated in FIG. 5 using the first partial query data illustrated in FIG. 38; 図３５に例示したクエリデータの結果データを示す説明図。Explanatory drawing which shows the result data of the query data illustrated in FIG.

以下、実施形態の構造化文書管理装置、方法およびプログラムを、図面を参照して説明する。 Hereinafter, a structured document management apparatus, method, and program according to embodiments will be described with reference to the drawings.

［第１の実施形態］
まず、第１の実施形態について、図１乃至図１３を参照して説明する。図１は、第１の実施形態にかかる構造化文書管理システムのシステム構築例を示す模式図である。ここでは、実施形態の構造化文書管理システムとして、図１に示すように、構造化文書管理装置であるサーバコンピュータ（以下、サーバという。）１に、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）等のネットワーク２を介して、クライアントコンピュータ（以下、クライアント端末という。）３が複数台接続されたサーバクライアントシステムを想定する。 [First Embodiment]
First, a first embodiment will be described with reference to FIGS. FIG. 1 is a schematic diagram illustrating a system construction example of the structured document management system according to the first embodiment. Here, as a structured document management system of the embodiment, as shown in FIG. 1, a network 2 such as a LAN (Local Area Network) is connected to a server computer (hereinafter referred to as a server) 1 which is a structured document management apparatus. A server client system to which a plurality of client computers (hereinafter referred to as client terminals) 3 are connected is assumed.

図２は、サーバ１およびクライアント端末３のモジュール構成図である。サーバ１およびクライアント端末３は、例えば、通常のコンピュータを利用したハードウェア構成を有している。すなわち、サーバ１およびクライアント端末３は、情報処理を行うＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）１０１、ＢＩＯＳなどを記憶した読出し専用メモリであるＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）１０２、各種データを書き換え可能に記憶するＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）１０３、各種データベースとして機能するとともに各種のプログラムを格納するＨＤＤ（ＨａｒｄＤｉｓｃＤｒｉｖｅ）１０４、記憶媒体１１０を用いて情報を保管したり外部に情報を配布したり外部から情報を入手するためのＣＤ−ＲＯＭドライブ等の媒体駆動装置１０５、ネットワーク２を介して外部の他のコンピュータと通信により情報を伝達するための通信制御装置１０６、処理経過や結果等を操作者に表示するＣＲＴ（ＣａｔｈｏｄｅＲａｙＴｕｂｅ）やＬＣＤ（ＬｉｑｕｉｄＣｒｙｓｔａｌＤｉｓｐｌａｙ）等の表示部１０７、並びに操作者がＣＰＵ１０１に命令や情報等を入力するためのキーボードやマウス等の入力部１０８等を備えた構成であり、これらの各部間で送受信されるデータをバスコントローラ１０９が調停して動作する。 FIG. 2 is a module configuration diagram of the server 1 and the client terminal 3. The server 1 and the client terminal 3 have a hardware configuration using, for example, a normal computer. That is, the server 1 and the client terminal 3 include a CPU (Central Processing Unit) 101 that performs information processing, a ROM (Read Only Memory) 102 that is a read-only memory storing BIOS, and a RAM (RAM) that stores various data in a rewritable manner. Random Access Memory (103), HDD (Hard Disc Drive) 104 that functions as various databases and stores various programs, and storage medium 110 for storing information, distributing information outside, and obtaining information from outside Medium drive device 105 such as a CD-ROM drive for communication, communication control device 106 for communicating information with other external computers via network 2, processing progress and results, etc. A display unit 107 such as a CRT (Cathode Ray Tube) or LCD (Liquid Crystal Display) to be displayed to a user, and an input unit 108 such as a keyboard and a mouse for an operator to input commands and information to the CPU 101 In this configuration, the bus controller 109 operates by arbitrating data transmitted and received between these units.

このようなサーバ１およびクライアント端末３では、ユーザが電源を投入するとＣＰＵ１０１がＲＯＭ１０２内のローダーというプログラムを起動させ、ＨＤＤ１０４よりＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）というコンピュータのハードウェアとソフトウェアとを管理するプログラムをＲＡＭ１０３に読み込み、このＯＳを起動させる。このようなＯＳは、ユーザの操作に応じてプログラムを起動したり、情報を読み込んだり、保存を行ったりする。ＯＳのうち代表的なものとしては、Ｗｉｎｄｏｗｓ（登録商標）、ＵＮＩＸ（登録商標）等が知られている。これらのＯＳ上で動作するプログラムをアプリケーションプログラムと呼んでいる。なお、アプリケーションプログラムは、所定のＯＳ上で動作するものに限らず、後述の各種処理の一部の実行をＯＳに肩代わりさせるものであってもよいし、所定のアプリケーションソフトやＯＳなどを構成する一群のプログラムファイルの一部として含まれているものであってもよい。 In the server 1 and the client terminal 3, when the user turns on the power, the CPU 101 activates a program called a loader in the ROM 102, and a program for managing the hardware and software of the computer called OS (Operating System) from the HDD 104 is stored in the RAM 103. To start this OS. Such an OS activates a program, reads information, and stores information in accordance with a user operation. As typical OSes, Windows (registered trademark), UNIX (registered trademark), and the like are known. Programs that run on these OSs are called application programs. The application program is not limited to one that runs on a predetermined OS, and may be one that causes the OS to execute some of the various processes described below, or constitutes predetermined application software, an OS, or the like. It may be included as part of a group of program files.

ここで、サーバ１は、アプリケーションプログラムとして、構造化文書管理プログラムをＨＤＤ１０４に記憶している。この意味で、ＨＤＤ１０４は、構造化文書管理プログラムを記憶する記憶媒体として機能する。また、一般的には、サーバ１のＨＤＤ１０４にインストールされるアプリケーションプログラムは、ＣＤ−ＲＯＭやＤＶＤなどの各種の光ディスク、各種光磁気ディスク、フレキシブルディスクなどの各種磁気ディスク、半導体メモリ等の各種方式のメディア等の記憶媒体１１０に記録されて提供される。このため、ＣＤ−ＲＯＭ等の光情報記録メディアやＦＤ等の磁気メディア等の可搬性を有する記憶媒体１１０も、構造化文書管理プログラムを記憶する記憶媒体となり得る。さらには、構造化文書管理プログラムは、例えば通信制御装置１０６を介して外部から取り込まれ、ＨＤＤ１０４にインストールされてもよい。 Here, the server 1 stores a structured document management program in the HDD 104 as an application program. In this sense, the HDD 104 functions as a storage medium that stores the structured document management program. In general, application programs installed in the HDD 104 of the server 1 are various systems such as various optical disks such as CD-ROM and DVD, various magnetic disks such as various magneto-optical disks and flexible disks, and semiconductor memories. It is recorded on a storage medium 110 such as a medium and provided. Therefore, the portable storage medium 110 such as an optical information recording medium such as a CD-ROM or a magnetic medium such as an FD can also be a storage medium that stores the structured document management program. Further, the structured document management program may be imported from the outside via the communication control device 106 and installed in the HDD 104, for example.

サーバ１は、ＯＳ上で動作する構造化文書管理プログラムが起動すると、この構造化文書管理プログラムに従い、ＣＰＵ１０１が各種の演算処理を実行して各部を集中的に制御する。一方、クライアント端末３は、ＯＳ上で動作するアプリケーションプログラムが起動すると、このアプリケーションプログラムに従い、ＣＰＵ１０１が各種の演算処理を実行して各部を集中的に制御する。サーバ１およびクライアント端末３のＣＰＵ１０１が実行する各種の演算処理のうち、実施形態の構造化文書管理システムにおいて特徴的な処理について、以下に説明する。 In the server 1, when a structured document management program operating on the OS is started, the CPU 101 executes various arithmetic processes according to the structured document management program and centrally controls each unit. On the other hand, in the client terminal 3, when an application program operating on the OS is activated, the CPU 101 executes various arithmetic processes according to the application program, and controls each unit intensively. Of various types of arithmetic processing executed by the CPU 101 of the server 1 and the client terminal 3, processing characteristic in the structured document management system of the embodiment will be described below.

図３は、第１の実施形態におけるサーバ１およびクライアント端末３の概略構成を示すブロック図である。図３に示すように、クライアント端末３は、アプリケーションプログラムにより実現される機能構成として、構造化文書登録部１１と、検索部１２とを備える。 FIG. 3 is a block diagram showing a schematic configuration of the server 1 and the client terminal 3 in the first embodiment. As illustrated in FIG. 3, the client terminal 3 includes a structured document registration unit 11 and a search unit 12 as a functional configuration realized by an application program.

構造化文書登録部１１は、入力部１０８から入力された構造化文書データやクライアント端末３のＨＤＤ１０４に予め記憶された構造化文書データを、後述するサーバ１の構造化文書データベース（構造化文書ＤＢ）２１に登録するためのものである。この構造化文書登録部１１は、登録すべき構造化文書データとともに格納要求をサーバ１に送信する。 The structured document registration unit 11 stores the structured document data input from the input unit 108 and the structured document data stored in advance in the HDD 104 of the client terminal 3 into a structured document database (structured document DB) of the server 1 described later. ) 21 for registration. The structured document registration unit 11 transmits a storage request to the server 1 together with the structured document data to be registered.

図４は、構造化文書データの一例を示したものである。構造化文書データを記述するための代表的な言語としてＸＭＬ（ＥｘｔｅｎｓｉｂｌｅＭａｒｋｕｐＬａｎｇｕａｇｅ）が挙げられる。図４に示す構造化文書データは、ＸＭＬで記述されたものである。ＸＭＬでは、文書構造を構成する個々のパーツを「要素」（エレメント：Ｅｌｅｍｅｎｔ）と呼び、要素はタグ（ｔａｇ）を使って記述する。具体的には、要素の始まりを示すタグ（開始タグ）と、終わりを示すタグ（終了タグ）の２つのタグでデータを挟み込んで、１つの要素を表現している。なお、開始タグと終了タグで挟み込まれたテキストデータは、当該開始タグと終了タグで表された１つの要素に含まれるテキスト要素である。 FIG. 4 shows an example of structured document data. XML (Extensible Markup Language) is a typical language for describing structured document data. The structured document data shown in FIG. 4 is described in XML. In XML, individual parts constituting a document structure are called “elements” (elements), and elements are described using tags. Specifically, one element is expressed by sandwiching data between two tags, a tag indicating the start of an element (start tag) and a tag indicating the end (end tag). Note that the text data sandwiched between the start tag and the end tag is a text element included in one element represented by the start tag and the end tag.

図４に示す例では、＜ｂｏｏｋｓ＞というタグで囲まれたルート要素が存在する。この＜ｂｏｏｋｓ＞要素は、＜ｂｏｏｋ＞のタグで囲まれた３つの子要素を包含する。＜ｂｏｏｋ＞要素は、＜ｔｉｔｌｅ＞、＜ａｕｔｈｏｒ＞の各タグで囲まれた複数の子要素を包含する。＜ｔｉｔｌｅ＞要素は、「ＸＭＬデータベース」などのテキスト要素をもつ。 In the example shown in FIG. 4, there is a root element surrounded by a tag <books>. The <books> element includes three child elements surrounded by <book> tags. The <book> element includes a plurality of child elements surrounded by <title> and <author> tags. The <title> element has a text element such as “XML database”.

１番目の＜ｂｏｏｋ＞要素は、２つの＜ａｕｔｈｏｒ＞要素を持ち、これら２つの＜ａｕｔｈｏｒ＞要素が＜ｔｉｔｌｅ＞要素の後に出現する順序であるが、２，３番目の＜ｂｏｏｋ＞要素は、１つの＜ａｕｔｈｏｒ＞要素のみを持ち、＜ａｕｔｈｏｒ＞要素の後に＜ｔｉｔｌｅ＞要素が出現している。 The first <book> element has two <author> elements, and these two <author> elements are in the order in which they appear after the <title> element, but the second and third <book> elements are: It has only one <author> element, and a <title> element appears after the <author> element.

検索部１２は、ユーザにより入力部１０８から入力された指示に従って、構造化文書ＤＢ２１から所望のデータを検索するための検索条件などが記述されたクエリデータを作成し、当該クエリデータを含む検索要求をサーバ１へ送信する。また、検索部１２は、サーバ１から送信された当該検索要求に対応する結果データを受け取り、これを表示部１０７に表示する。 The search unit 12 creates query data describing a search condition for searching for desired data from the structured document DB 21 according to an instruction input from the input unit 108 by the user, and a search request including the query data. Is transmitted to the server 1. In addition, the search unit 12 receives result data corresponding to the search request transmitted from the server 1 and displays the result data on the display unit 107.

一方、サーバ１は、構造化文書管理プログラムにより実現される機能構成として、格納処理部２２と、検索処理部２３とを備える。また、サーバ１は、ＨＤＤ１０４などの記憶装置を利用した構造化文書ＤＢ２１を備える。 On the other hand, the server 1 includes a storage processing unit 22 and a search processing unit 23 as functional configurations realized by the structured document management program. The server 1 also includes a structured document DB 21 that uses a storage device such as the HDD 104.

格納処理部２２は、クライアント端末３からの格納要求を受けて、クライアント端末３から送信された構造化文書データを構造化文書ＤＢ２１に格納する処理を行う。この格納処理部２２は、格納インタフェース部２４と、要素ＩＤ付与部２５とを備えている。 Upon receiving a storage request from the client terminal 3, the storage processing unit 22 performs a process of storing the structured document data transmitted from the client terminal 3 in the structured document DB 21. The storage processing unit 22 includes a storage interface unit 24 and an element ID assigning unit 25.

格納インタフェース部２４は、構造化文書データの入力を受け付けて（構造化文書データ受付手段）、構造化文書データを構造化文書ＤＢ２１に格納するために要素ＩＤ付与部２５を呼び出す。 The storage interface unit 24 receives the input of structured document data (structured document data receiving unit) and calls the element ID adding unit 25 to store the structured document data in the structured document DB 21.

要素ＩＤ付与部２５は、識別子付与手段として機能するものであって、クライアント端末３から送信された構造化文書データを構文解析し、そこに出現する要素に要素間で出現順序が比較可能な識別子（以下、要素ＩＤという。）を付与した上で、要素ＩＤが付与された構造化文書データ（以下、要素ＩＤ付与済み構造化文書データという。）を構造化文書ＤＢ２１（構造化文書データ記憶手段）に格納する。 The element ID assigning unit 25 functions as an identifier assigning unit, and parses the structured document data transmitted from the client terminal 3 and can compare the appearance order between the elements appearing there. (Hereinafter referred to as element ID), and the structured document data to which the element ID has been assigned (hereinafter referred to as element ID-added structured document data) is designated as the structured document DB 21 (structured document data storage means). ).

ここで、要素ＩＤは、その大小で構造化文書データ中での要素の出現順序が判定できるように付与される。図４に例示した構造化文書データに対して要素ＩＤを付与した要素ＩＤ付与済み構造化文書データの例を図５に示す。図５では、要素ＩＤを付与する方法の一例として、ルート要素より要素の出現順に従ってＥ１、Ｅ２、Ｅ３、・・・と付与している。このように付与すれば、要素間の出現順序を要素ＩＤの比較で判定することができる。 Here, the element ID is assigned so that the appearance order of the elements in the structured document data can be determined based on the size of the element ID. FIG. 5 shows an example of structured document data with an element ID assigned to the structured document data illustrated in FIG. In FIG. 5, as an example of a method for assigning element IDs, E1, E2, E3,... If given in this way, the appearance order between elements can be determined by comparing element IDs.

例えば、図４に例示した構造化文書データにおける＜ｂｏｏｋｓ＞の１番目の＜ｂｏｏｋ＞要素は、その中に２つの＜ａｕｔｈｏｒ＞要素を持ち、これら２つの＜ａｕｔｈｏｒ＞要素の出現順序は、子に＜ｆｉｒｓｔ＞要素を持つ＜ａｕｔｈｏｒ＞要素が先に出現し、子に＜ｌａｓｔ＞要素を持つ＜ａｕｔｈｏｒ＞要素が後に出現する順となっている。ここで、図５でそれぞれに付与された要素ＩＤを比較すると、Ｅ５＜Ｅ８であり、要素ＩＤとしてＥ８を付与された要素である、子に＜ｌａｓｔ＞要素を持つ＜ａｕｔｈｏｒ＞要素が、要素ＩＤとしてＥ８を付与された要素である、子に＜ｆｉｒｓｔ＞要素を持つ＜ａｕｔｈｏｒ＞要素よりも後に出現することが判定できる。また、図５では、要素ＩＤ付与済み構造化文書データの形式が図４の構造化文書データとほぼ同一であるが、先に述べたように、その大小で要素の出現順序が判定できるように要素ＩＤが付与されていれば、要素ＩＤ付与済み構造化文書データの形式は特に限定されるものではない。 For example, the first <book> element of <books> in the structured document data illustrated in FIG. 4 has two <author> elements therein, and the appearance order of these two <author> elements is a child. The <author> element having the <first> element appears first, and the <author> element having the <last> element as a child appears later. Here, when comparing the element IDs assigned to each in FIG. 5, an <author> element having a <last> element as a child, which is an element to which E5 <E8 and E8 is assigned as an element ID, It can be determined that the element appears after the <author> element having a <first> element as a child, which is an element assigned E8 as an ID. In FIG. 5, the format of the structured document data with element IDs assigned is almost the same as the structured document data of FIG. 4, but as described above, the appearance order of the elements can be determined based on the size. If the element ID is assigned, the format of the structured document data with the element ID assigned is not particularly limited.

検索処理部２３は、クライアント端末３からの検索要求を受けて、クエリデータにより指定された条件に合致するデータを構造化文書ＤＢ２１から探し出し、この探し出したデータを結果データとして返す処理を行う。この検索処理部２３は、検索インタフェース部２６と、クエリデータ分解部２７と、構造照合処理部２８と、結合演算処理部２９とを備えている。 Upon receiving a search request from the client terminal 3, the search processing unit 23 searches the structured document DB 21 for data that matches the condition specified by the query data, and returns the found data as result data. The search processing unit 23 includes a search interface unit 26, a query data decomposition unit 27, a structure matching processing unit 28, and a join operation processing unit 29.

検索インタフェース部２６は、クエリデータの入力を受け付けて（クエリデータ受付手段）、受け付けたクエリデータにより指定された条件を満足する結果データを得るためにクエリデータ分解部２７を呼び出す。 The search interface unit 26 receives input of query data (query data receiving unit) and calls the query data decomposition unit 27 to obtain result data that satisfies the conditions specified by the received query data.

クエリデータ分解部２７は、クエリデータ分解手段として機能するものであって、クライアント端末３から送信され、検索インタフェース部２６を介して入力されたクエリデータ（以下、入力クエリデータという。）を構文解析し、この入力クエリデータが、構造化文書データの論理構造における階層の上下関係を指定する条件（第１条件）と、要素ＩＤで特定される要素の順序関係を指定する条件（第２条件）とを含む場合に、入力クエリデータを、第１条件のみを含む第１の部分クエリデータと、第１の部分クエリデータによる照合結果を第２条件に応じて結合演算する手順を含む第２の部分クエリデータとに分解する。 The query data decomposition unit 27 functions as query data decomposition means, and parses query data (hereinafter referred to as input query data) transmitted from the client terminal 3 and input via the search interface unit 26. The input query data includes a condition (first condition) for specifying the hierarchical relationship in the logical structure of the structured document data (first condition) and a condition (second condition) for specifying the order relation of the elements specified by the element ID. Including a procedure of combining the input query data with the first partial query data including only the first condition and the collation result based on the first partial query data according to the second condition. Break it down into partial query data.

構造照合処理部２８は、構造照合処理手段として機能するものであって、構造化文書ＤＢ２１に格納されている要素ＩＤ付与済み構造化文書データのデータ集合に対して、第１の部分クエリデータによる構造条件の照合処理を行い、その照合結果を構造照合処理結果データとして出力する。 The structure collation processing unit 28 functions as a structure collation processing unit, and uses the first partial query data for the data set of the structured document data with element IDs stored in the structured document DB 21. A structure condition matching process is performed, and the matching result is output as structure matching process result data.

結合演算処理部２９は、結合演算処理手段として機能するものであって、構造照合処理部２８から出力された第１の部分クエリデータによる照合結果である構造照合処理結果データに対して、第２のクエリデータに含まれる結合演算の手順に従って結合演算を行い、結合演算結果データを出力する。なお、結合演算処理部２９は、第２の部分クエリデータが空ならば、構造照合処理部２８から出力された構造照合処理結果データをそのまま出力するか、または自身の処理を省略する。 The join operation processing unit 29 functions as a join operation processing means, and applies a second to the structure matching processing result data that is the matching result by the first partial query data output from the structure matching processing unit 28. The join operation is performed according to the join operation procedure included in the query data, and the join operation result data is output. If the second partial query data is empty, the join operation processing unit 29 outputs the structure matching processing result data output from the structure matching processing unit 28 as it is or omits its own processing.

検索インタフェース部２６は、結合演算処理部２９から出力された結合演算処理結果データを、検索の結果データとしてクライアント端末３へ返却する。 The search interface unit 26 returns the join operation processing result data output from the join operation processing unit 29 to the client terminal 3 as search result data.

図６は、クエリデータの一例を示す説明図である。ＸＭＬでは、Ｗ３Ｃで提案されているＸＱｕｅｒｙという問合せ言語があり、図６に示すクエリデータは、このＸＱｕｅｒｙに基づいた問合せ記述方法に則っている。図６には、下記のような意味の複雑な階層構造に関する条件（構造条件）を含むクエリデータＱ１が示されている。
Ｑ１：構造化文書ＤＢ２１の各構造化文書データについて、階層のどこかに「ｂｏｏｋ」という要素があり、その「ｂｏｏｋ」という要素は、その中に「ａｕｔｈｏｒ」という要素を持ち、さらにこの「ａｕｔｈｏｒ」という要素の中に、「ｆｉｒｓｔ」と「ｌａｓｔ」という２つの要素を持つような「ｂｏｏｋ」要素であり、さらにその「ｂｏｏｋ」要素よりも後に出現する「ｂｏｏｋ」要素について、その中にある「ｔｉｔｌｅ」の一覧を返す。 FIG. 6 is an explanatory diagram illustrating an example of query data. In XML, there is a query language called XQuery proposed by W3C, and the query data shown in FIG. 6 conforms to a query description method based on this XQuery. FIG. 6 shows query data Q1 including conditions (structure conditions) regarding a complex hierarchical structure having the following meanings.
Q1: For each structured document data in the structured document DB 21, there is an element “book” somewhere in the hierarchy, the element “book” has an element “author” in the element, and this “author” Is a “book” element that has two elements “first” and “last”, and a “book” element that appears after the “book” element. Returns a list of “titles”.

図７は、サーバ１の検索処理部２３による検索処理の流れを示すフローチャートである。まず、検索インタフェース部２６により、クライアント端末３からネットワーク２経由で送信されたクエリデータの入力が受け付けられる（ステップＳ１）。 FIG. 7 is a flowchart showing the flow of search processing by the search processing unit 23 of the server 1. First, the search interface unit 26 accepts input of query data transmitted from the client terminal 3 via the network 2 (step S1).

次に、クエリデータ分解部２７により、入力クエリデータについてのクエリデータ解析処理が行われる（ステップＳ２）。クエリデータ分解部２７によるクエリデータ解析処理の一例を、図８を参照して説明する。 Next, the query data analysis unit 27 performs query data analysis processing on the input query data (step S2). An example of the query data analysis process by the query data decomposition unit 27 will be described with reference to FIG.

図８は、クエリデータ解析処理の流れを示すフローチャートである。クエリデータ分解部２７は、はじめに、入力クエリデータのすべてを便宜的に第１の部分クエリデータとする（ステップＳ２０１）。このとき、第２の部分クエリデータは空としておく。 FIG. 8 is a flowchart showing the flow of the query data analysis process. The query data decomposing unit 27 first sets all of the input query data as first partial query data for convenience (step S201). At this time, the second partial query data is left empty.

次に、クエリデータ分解部２７は、第１の部分クエリデータをチェックして、第１の部分クエリデータ（ここでは入力クエリデータと同じ）に、ある構造を持つ要素間の順序関係に関する構造条件、つまり、要素ＩＤで特定される要素の順序関係を指定するような条件が含まれるかどうかを判定する（ステップＳ２０２）。そして、クエリデータ分解部２７は、そのような構造条件が含まれていれば（ステップＳ２０２：Ｙｅｓ）、第１の部分クエリデータを、階層の上下関係を指定する条件のみを含む構造条件、つまり、順序関係に関する構造条件での照合の対象となる構造それぞれを指定するような構造条件に分解し（ステップＳ２０３）、分解された構造条件すべてを第１の部分クエリデータとする（ステップＳ２０４）。そして、クエリデータ分解部２７は、分解された各構造条件の構造照合処理結果間で、そこに含まれる要素ＩＤの結合演算を指示する内容を、第２の部分クエリデータとし（ステップＳ２０５）、クエリデータ解析処理を終了する。 Next, the query data decomposing unit 27 checks the first partial query data, and the first partial query data (here, the same as the input query data) has a structural condition relating to the order relationship between elements having a certain structure. That is, it is determined whether or not a condition for specifying the order relationship of the elements specified by the element ID is included (step S202). Then, if such a structural condition is included (step S202: Yes), the query data decomposition unit 27 converts the first partial query data into a structural condition that includes only a condition that specifies the hierarchical relationship of the hierarchy, that is, Then, it is decomposed into structural conditions that specify each of the structures to be collated with the structural conditions related to the order relationship (step S203), and all the decomposed structural conditions are set as the first partial query data (step S204). Then, the query data decomposing unit 27 sets, as the second partial query data, the contents instructing the join operation of the element IDs included in the structure matching processing results of the decomposed structural conditions (step S205). Terminates the query data analysis process.

一方、ステップＳ２０２の判定で、第１の部分クエリデータ（ここでは入力クエリデータと同じ）に上述した順序関係に関する構造条件が含まれなければ（ステップＳ２０２：Ｎｏ）、クエリデータ分解部２７は、ステップＳ２０３からステップＳ２０５の処理を行うことなく、クエリデータ解析処理を終了する。 On the other hand, if it is determined in step S202 that the first partial query data (here, the same as the input query data) does not include the structural condition related to the order relationship described above (step S202: No), the query data decomposition unit 27 The query data analysis process is terminated without performing the processes from step S203 to step S205.

図９は、図６に例示したクエリデータＱ１についてのクエリデータ解析処理の結果である第１の部分クエリデータＱ１＿Ａと第２の部分クエリデータＱ１＿Ｂの一例を示す図である。図９において、第１の部分クエリデータＱ１＿Ａは、構造条件ＰＰ１とＰＰ２を含んでいる。また、第２の部分クエリデータＱ１＿Ｂは、Ｑ１＿Ａによる構造照合処理の照合結果を、上述した順序関係に関する構造条件に応じて結合演算する手順を含んでいる。図６に例示したクエリデータＱ１は、順序関係に関する構造条件「ｆｏｌｌｏｗｉｎｇ−ｓｉｂｌｉｎｇ」が含まれているため、クエリデータＱ１は、この構造条件での照合の対象となる構造を指定する構造条件ＰＰ１，ＰＰ２に分解され、これらＰＰ１，ＰＰ２による構造照合処理の照合結果を「ｆｏｌｌｏｗｉｎｇ−ｓｉｂｌｉｎｇ」に応じて結合演算する手順が、第２の部分クエリデータＱ１＿Ｂとされる。なお、図中のＱ１＿ＢにあるＴ１，Ｔ２は、それぞれＰＰ１，ＰＰ２を構造照合処理した結果を識別する記号である。また、図中のＱ１＿Ａ，Ｑ１＿Ｂにある［１］，［２］，［３］は、Ｔ１，Ｔ２に含まれる要素ＩＤ群を識別する記号である。 FIG. 9 is a diagram illustrating an example of the first partial query data Q1_A and the second partial query data Q1_B, which are the results of the query data analysis process for the query data Q1 illustrated in FIG. In FIG. 9, the first partial query data Q1_A includes structural conditions PP1 and PP2. Further, the second partial query data Q1_B includes a procedure for performing a join operation on the collation result of the structure collation processing by Q1_A according to the structural condition related to the order relation described above. Since the query data Q1 illustrated in FIG. 6 includes a structural condition “following-sibling” related to the order relationship, the query data Q1 includes a structural condition PP1, which specifies a structure to be collated with the structural condition. The procedure of decomposing into PP2 and performing a join operation on the collation result of the structure collation processing by PP1 and PP2 according to “following-sibling” is the second partial query data Q1_B. Note that T1 and T2 in Q1_B in the figure are symbols for identifying the results of the structure matching processing for PP1 and PP2, respectively. [1], [2], and [3] in Q1_A and Q1_B in the figure are symbols for identifying element ID groups included in T1 and T2.

クエリデータ分解部２７によるクエリデータ解析処理が終了すると、次に、構造照合処理部２８により、第１の部分クエリデータに含まれる構造条件の構造照合処理が行われる（ステップＳ３）。ここで、構造照合処理とは、構造化文書データについて、指定された構造条件をＸＱｕｅｒｙで定められた仕様で解釈して照合し、結果として構造条件に合致する構造化文書データまたは構造化文書データ中の要素を得る処理をいう。 When the query data analysis process by the query data decomposing unit 27 is completed, the structure matching process unit 28 performs a structure matching process for the structure condition included in the first partial query data (step S3). Here, the structure matching process refers to structured document data or structured document data that conforms to the structure condition as a result of interpreting and collating the specified structure condition with the specification defined by XQuery. The process to get the elements inside.

ここで、図５に例示した要素ＩＤ付与済み構造化文書データと、図９に例示した第１の部分クエリデータＱ１＿Ａに含まれる構造条件ＰＰ１，ＰＰ２とを用いて、一般的な構造照合処理を行った場合の処理の概要を、図１０を参照して説明する。以下では、構造条件中の／で区切られた部分を単にパスと呼称する。 Here, using the structured document data with element IDs exemplified in FIG. 5 and the structural conditions PP1 and PP2 included in the first partial query data Q1_A exemplified in FIG. 9, a general structure matching process is performed. An outline of the processing when it is performed will be described with reference to FIG. Hereinafter, the part delimited by / in the structural condition is simply referred to as a path.

第１の部分クエリデータＱ１＿Ａに含まれる構造条件ＰＰ１の照合について、構造条件ＰＰ１を左から見て、まず先頭のパス／／ｂｏｏｋ（／ｄｅｓｃｅｎｄａｎｔ−ｏｒ−ｓｅｌｆ：：ｂｏｏｋ）の照合を行う。これは、ＸＱｕｅｒｙ仕様で、「ルート要素以下のどこかの階層構造にある要素で、名前がｂｏｏｋである要素を選択する」ことを意味する。これに従い、図５に例示した要素ＩＤ付与済み構造化文書データ中の要素の構造を照合すると、ルート要素＜ｂｏｏｋｓ＞以下の３つの＜ｂｏｏｋ＞要素、Ｅ２，Ｅ１３，Ｅ２１が選択される（１．１）。 Regarding the collation of the structural condition PP1 included in the first partial query data Q1_A, the head path // book (/ descendant-or-self :: book) is collated first when the structural condition PP1 is viewed from the left. This means that, in the XQuery specification, “the element in the hierarchical structure somewhere below the root element and having the name“ book ”is selected”. In accordance with this, when the structure of the element in the structured document data with element ID given as an example in FIG. 5 is collated, three <book> elements, E2, E13, and E21 below the root element <books> are selected (1). .1).

次の［ａｕｔｈｏｒ［ｆｉｒｓｔａｎｄｌａｓｔ］］とは、「前の構造照合で得られた要素について、その子にａｕｔｈｏｒという要素があり、さらにそのａｕｔｈｏｒ要素は、その子としてｆｉｒｓｔとｌａｓｔという要素を両方含んでいるような要素のみをさらに選択する」ことを意味する。これに従い各＜ｂｏｏｋ＞要素より下位の要素の構造を照合して＜ｂｏｏｋ＞要素を選択すると、結果として、要素ＩＤがＥ１３，Ｅ２１であるｂｏｏｋ要素が得られる（１．２）。 The next [author [first and last]] is: “With respect to the element obtained in the previous structure matching, its child has an element called author, and the author element contains both first and last elements as its children. Means to select only such elements. " If the <book> element is selected by collating the structure of elements below each <book> element in accordance with this, a book element having element IDs E13 and E21 is obtained as a result (1.2).

同様に、第１の部分クエリデータＱ１＿Ａに含まれる構造条件ＰＰ２の照合について、先頭のパス／／ｂｏｏｋについては先のＰＰ１の例と同じ意味であり、３つの＜ｂｏｏｋ＞要素、Ｅ２，Ｅ１３，Ｅ２１が選択される（２．１）。次に続くパス／ｔｉｔｌｅ（／ｃｈｉｌｄ：：ｔｉｔｌｅ）とは、「前のパス照合で得られた要素に対して子供の階層構造の位置にある要素で、名前がｔｉｔｌｅである要素を選択する」ことを意味する。これに従い、前の構造照合で得られた各＜ｂｏｏｋ＞要素に対して子供の構造位置にある要素を照合すると、要素ＩＤがＥ２の＜ｂｏｏｋ＞要素からは要素ＩＤがＥ３の＜ｔｉｔｌｅ＞要素が得られ、要素ＩＤがＥ１３の＜ｂｏｏｋ＞要素からは要素ＩＤがＥ１９の＜ｔｉｔｌｅ＞要素が得られ、要素ＩＤがＥ２１の＜ｂｏｏｋ＞要素からは要素ＩＤがＥ２７の＜ｔｉｔｌｅ＞要素が得られる（２．２）。 Similarly, with respect to the collation of the structural condition PP2 included in the first partial query data Q1_A, the first path // book has the same meaning as the previous PP1, and three <book> elements, E2, E13, E21 is selected (2.1). The next path / title (/ child :: title) is “select an element at the position of the child's hierarchical structure with respect to the element obtained in the previous path collation and whose name is title”. Means that. Accordingly, when the element at the child structure position is compared with each <book> element obtained by the previous structure matching, the <title> element with an element ID of E3 starts from the <book> element with an element ID of E2. The <title> element with the element ID E19 is obtained from the <book> element with the element ID E13, and the <title> element with the element ID E27 is obtained from the <book> element with the element ID E21. (2.2).

なお、ここで概説した構造条件およびパスの構造照合処理の方法は一例であり、構造条件およびパスの構造照合処理の方法には、特許文献１に記載されているようなものをはじめ、様々なものがある。本実施形態における構造照合処理部２８は、入力された構造条件に合致する構造を持つ構造化文書データ中の要素を取得するものであれば、内部の具体的な処理方法については特に限定されるものではない。 Note that the structure condition and the path structure matching method outlined here are merely examples, and the structure condition and the path structure matching method include various methods including those described in Patent Document 1. There is something. As long as the structure matching processing unit 28 in the present embodiment acquires an element in structured document data having a structure that matches the input structure condition, a specific internal processing method is particularly limited. It is not a thing.

図５に例示した要素ＩＤ付与済み構造化文書データについて、図９に例示した第１の部分クエリデータＱ１＿Ａに含まれる構造条件ＰＰ１，ＰＰ２の構造照合処理を行った結果である構造照合処理結果データＲ１＿Ａを図１１に示す。図中のＴ１，Ｔ２は、それぞれ構造条件ＰＰ１，ＰＰ２の構造照合処理の結果であり、Ｔ１としては構造条件ＰＰ１に合致する構造を持つ要素の要素ＩＤ群［１］が得られ、Ｔ２としては構造条件ＰＰ２に合致する構造を持つ要素の要素ＩＤ群［２］，［３］が得られる。なお、図１１ではＴ１，Ｔ２を表で示しているが、構造照合処理結果データの具体的なデータ構造は特に限定されるものではない。 The structure collation processing result data, which is the result of performing the structure collation processing of the structural conditions PP1 and PP2 included in the first partial query data Q1_A illustrated in FIG. 9 for the structured document data with element IDs illustrated in FIG. R1_A is shown in FIG. T1 and T2 in the figure are the results of the structure matching processing of the structural conditions PP1 and PP2, respectively. As T1, an element ID group [1] of elements having a structure matching the structural condition PP1 is obtained. Element ID groups [2] and [3] of elements having a structure that matches the structure condition PP2 are obtained. In FIG. 11, T1 and T2 are shown in a table, but the specific data structure of the structure matching process result data is not particularly limited.

構造照合処理部２８による構造照合処理が終了すると、次に、結合演算処理部２９により、構造照合処理部２８の処理結果である構造照合処理結果データについて、第２の部分クエリデータに含まれる手順に従って結合演算処理が行われる（ステップＳ４）。 When the structure matching processing by the structure matching processing unit 28 is completed, the procedure included in the second partial query data is next performed on the structure matching processing result data, which is the processing result of the structure matching processing unit 28, by the join operation processing unit 29. In step S4, a join operation process is performed.

図１２は、図９に例示した第２の部分クエリデータＱ１＿Ｂを用いて図１１に例示した構造照合処理結果データＲ１＿Ａについて結合演算処理を行う場合の処理の概要を説明する図である。結合演算処理は、従来のＲＤＢ（ＲｅｌａｔｉｏｎａｌＤａｔａｂａｓｅ）で行われている結合演算（ＪＯＩＮ）などと同じである。 FIG. 12 is a diagram for explaining an overview of processing when the join operation processing is performed on the structure matching processing result data R1_A illustrated in FIG. 11 using the second partial query data Q1_B illustrated in FIG. The join operation process is the same as the join operation (JOIN) performed in the conventional RDB (Relational Database).

図１２の例では、Ｔ１［１］＜Ｔ２［２］という順序関係に関する条件に従ってＴ１とＴ２との結合演算が行われ、Ｔ３が得られる。そして、Ｔ３について［３］のみを取り出すことにより、結合演算処理の結果として、図１２に示すような中間結果Ｒ１＿Ｂが得られる。この中間結果Ｒ１＿Ｂは、図６に例示したクエリデータＱ１を、上述したようなクエリデータ解析処理を行わずにそのまま処理した場合の結果と一致する。 In the example of FIG. 12, the join operation of T1 and T2 is performed according to the condition relating to the order relationship of T1 [1] <T2 [2], and T3 is obtained. Then, by extracting only [3] for T3, an intermediate result R1_B as shown in FIG. 12 is obtained as a result of the join operation process. This intermediate result R1_B matches the result of processing the query data Q1 illustrated in FIG. 6 as it is without performing the query data analysis process as described above.

結合演算処理部２９による結合演算処理が終了すると、最後に、検索インタフェース部２６により、結合演算処理部２９による結合演算処理の結果（中間結果）として得られる要素ＩＤが、それに対応する構造化文書データとして文字列化され、結果データとしてクライアント端末３に返却される（ステップＳ５）。図１２に示した中間結果Ｒ１＿Ｂが得られた場合には、この中間結果Ｒ１＿Ｂに含まれる要素ＩＤであるＥ２７に対応する構造化文書データが文字列化され、クエリデータＱ１の結果データＲ１として、図１３に示すデータがクライアント端末３に返却される。 When the join operation processing by the join operation processing unit 29 is finished, finally, the element ID obtained as a result (intermediate result) of the join operation processing by the join operation processing unit 29 by the search interface unit 26 is the corresponding structured document. It is converted into a character string as data and returned to the client terminal 3 as result data (step S5). When the intermediate result R1_B shown in FIG. 12 is obtained, the structured document data corresponding to the element ID E27 included in the intermediate result R1_B is converted into a character string, and as the result data R1 of the query data Q1, Data shown in FIG. 13 is returned to the client terminal 3.

以上、具体的な例を挙げながら説明したように、本実施形態によれば、サーバ１が、構造化文書データの登録時に、その構造化文書データに出現する各要素に要素間で出現順序が比較可能な要素ＩＤを付与して、要素ＩＤ付与済み構造化文書データを構造化文書ＤＢ２１に格納する。また、サーバ１は、構造化文書データの検索時には、クライアント端末３からの入力クエリデータを構文解析して、入力クエリデータが、構造化文書データの論理構造における階層の上下関係を指定する第１条件と、要素ＩＤで特定される要素の順序関係を指定する第２条件とを含む場合に、その入力クエリデータを、第１条件のみを含む第１の部分クエリデータと、第１の部分クエリデータによる照合結果を、第２条件に応じて結合演算する手順を含む第２の部分クエリデータとに分解して処理するようにしている。したがって、入力クエリデータが複雑な構造条件を含む場合でも、この入力クエリデータを単純な構造条件の構造照合処理と結合演算処理とで処理することで、構造照合処理の高速化を実現し、複雑な構造条件を含むクエリデータによる検索を高速に実行することができる。 As described above with reference to specific examples, according to this embodiment, when the structured document data is registered, the server 1 has the appearance order between the elements appearing in the structured document data. A comparable element ID is assigned, and the structured document data with the element ID assigned is stored in the structured document DB 21. Further, the server 1 parses the input query data from the client terminal 3 when searching for the structured document data, and the input query data designates the first hierarchical relationship in the logical structure of the structured document data. In the case of including a condition and a second condition that specifies the order relationship of the elements specified by the element ID, the input query data includes the first partial query data including only the first condition, and the first partial query. The collation result based on the data is processed by being decomposed into second partial query data including a procedure for performing a join operation in accordance with the second condition. Therefore, even if the input query data includes complex structural conditions, the input query data is processed by the simple structure condition structure matching process and the join operation process, thereby speeding up the structure matching process. Search by query data including various structural conditions can be executed at high speed.

なお、上記の具体例では、入力クエリデータに、要素ＩＤで特定される要素の順序関係を指定する条件（順序関係に関する構造条件）として「ｆｏｌｌｏｗｉｎｇ−ｓｉｂｌｉｎｇ」が含まれる場合を例示して説明したが、要素の順序関係を指定する他の条件、例えば「ｆｏｌｌｏｗｉｎｇ」や「ｐｒｅｃｅｅｄｉｎｇ」、「ｐｒｅｃｅｅｄｉｎｇ−ｓｉｂｌｉｎｇ」などが含まれる場合であっても、上記の具体例と同様に処理することができる。 In the above specific example, the case where “following-sibling” is included in the input query data as a condition (structural condition related to the order relation) for specifying the order relation of the elements specified by the element ID has been described. However, even when other conditions for specifying the order relation of elements, for example, “following”, “preceding”, “presenting-sibling”, and the like are included, the processing can be performed in the same manner as the above specific example.

［第２の実施形態］
次に、第２の実施形態について、図５、図７、図１４乃至図２３を参照して説明する。本実施形態は、第１の部分クエリデータが、構造化文書データの論理構造における階層を下位から上位へと辿る条件を含む場合に、その条件を、構造化文書データの論理構造における階層を下位から上位へと辿る条件に書き換えるようにした例である。なお、以下の説明において、上述した第１の実施形態と共通の構成については同一の符号を付し、重複した説明を省略する。 [Second Embodiment]
Next, a second embodiment will be described with reference to FIGS. 5, 7, and 14 to 23. In the present embodiment, when the first partial query data includes a condition for tracing the hierarchy in the logical structure of the structured document data from the lower level to the upper level, the condition is set to the lower level in the logical structure of the structured document data. In this example, the condition is rewritten to the condition of tracing from top to bottom. In the following description, the same reference numerals are given to the same components as those in the first embodiment described above, and duplicate descriptions are omitted.

図１４は、本実施形態におけるサーバ１’およびクライアント端末３の概略構成を示すブロック図である。本実施形態では、サーバ１’の格納処理部２２’に構造解析部３０が設けられている。また、サーバ１’の構造化文書ＤＢ２１’には、要素ＩＤ付与済み構造化文書データとともに、構造化文書ＤＢ２１’に格納された要素ＩＤ付与済み構造化文書データそれぞれの階層化された論理構造を集約した情報である構造ガイドデータが格納されている（構造ガイドデータ記憶手段）。また、サーバ１’の検索処理部２３’には、構造条件書き換え部３１が設けられている。なお、クライアント端末３の構成は第１の実施形態と同じである。 FIG. 14 is a block diagram showing a schematic configuration of the server 1 ′ and the client terminal 3 in the present embodiment. In the present embodiment, the structure analysis unit 30 is provided in the storage processing unit 22 ′ of the server 1 ′. In addition, the structured document DB 21 ′ of the server 1 ′ includes the structured logical document data with the element IDs and the hierarchical logical structures of the structured document data with the element IDs stored in the structured document DB 21 ′. Structure guide data that is aggregated information is stored (structure guide data storage means). Further, a structural condition rewriting unit 31 is provided in the search processing unit 23 ′ of the server 1 ′. The configuration of the client terminal 3 is the same as that in the first embodiment.

構造解析部３０は、構造ガイドデータ更新手段として機能するものであり、クライアント端末３から送信された構造化文書データの階層化された論理構造を解析し、その論理構造が構造ガイドデータに反映されるように、構造化文書ＤＢ２１’に格納されている構造ガイドデータを更新する。 The structure analysis unit 30 functions as a structure guide data update unit, analyzes the hierarchical logical structure of the structured document data transmitted from the client terminal 3, and reflects the logical structure in the structure guide data. Thus, the structure guide data stored in the structured document DB 21 ′ is updated.

構造ガイドデータは、構造化文書ＤＢ２１’に格納された要素ＩＤ付与済み構造化文書データそれぞれの階層化された論理構造を集約した情報であり、要素ＩＤ付与済み構造化文書データ中に出現する一意な階層構造に関する情報を保持するものである。図４に例示した構造化文書データに対応する構造ガイドデータは、例えば図１５に示すようなものとなる。構造解析部３０は、まず、クライアント端末３から送信された構造化文書データの階層化された論理構造を解析して図１５に示すような構造ガイドデータを新規に生成する。そして、構造解析部３０は、新規に生成した構造ガイドデータを構造化文書ＤＢ２１’に格納されている構造ガイドデータ（つまり、構造化文書ＤＢ２１’に格納された要素ＩＤ付与済み構造化文書データの論理構造を集約した構造ガイドデータ）と比較し、新規に生成した構造ガイドデータが、構造化文書ＤＢ２１’に格納されている構造ガイドデータにはない新たな階層構造に関する情報を含む場合に、その新たな階層構造に関する情報を、構造化文書ＤＢ２１’に格納されている構造ガイドデータに追加するかたちで、構造化文書ＤＢ２１’に格納されている構造ガイドデータを更新する。 The structure guide data is information obtained by aggregating the hierarchical logical structures of each of the structured document data with element IDs stored in the structured document DB 21 ′, and is unique in the structured document data with element IDs. It holds information related to a hierarchical structure. Structure guide data corresponding to the structured document data illustrated in FIG. 4 is, for example, as shown in FIG. First, the structure analysis unit 30 analyzes the hierarchical logical structure of the structured document data transmitted from the client terminal 3, and newly generates structure guide data as shown in FIG. Then, the structure analysis unit 30 creates the structure guide data newly generated from the structure guide data stored in the structured document DB 21 ′ (that is, the structured document data with element IDs stored in the structured document DB 21 ′. When the newly generated structure guide data includes information on a new hierarchical structure that is not included in the structure guide data stored in the structured document DB 21 ′. The structure guide data stored in the structured document DB 21 ′ is updated in such a manner that information on the new hierarchical structure is added to the structure guide data stored in the structured document DB 21 ′.

構造条件書き換え部３１は、条件書き換え手段として機能するものであって、クエリデータ分解部２７によって入力クエリデータを分解して得られた第１の部分クエリデータが、構造化文書データの論理構造における階層を下位から上位へと辿る条件、つまり構造化文書データのリーフ要素（葉）からルート要素（根）方向に構造を照合するような条件を含む場合に、該条件を、構造化文書ＤＢ２１’に格納されている構造ガイドデータに基づいて、構造化文書データの論理構造における階層を上位から下位へと辿る条件、つまり構造化文書データのルート要素（根）からリーフ要素（葉）方向に構造を照合するような条件に書き換える。 The structural condition rewriting unit 31 functions as a condition rewriting unit, and the first partial query data obtained by decomposing the input query data by the query data decomposing unit 27 is in the logical structure of the structured document data. When a condition for tracing the hierarchy from lower to higher, that is, a condition for collating the structure from the leaf element (leaf) to the root element (root) of the structured document data, the condition is designated as the structured document DB 21 ′. Based on the structure guide data stored in the, the condition to follow the hierarchy in the logical structure of the structured document data from the top to the bottom, that is, the structure from the root element (root) of the structured document data to the leaf element (leaf) direction To a condition that matches.

図１６は、本実施形態で想定するクエリデータの一例を示す説明図である。この図１６に示すクエリデータＱ２は、第１の実施形態で説明したクエリデータＱ１と同じくＸＱｕｅｒｙで記述されており、下記のような意味の複雑な階層構造に関する条件（構造条件）を含んでいる。
Ｑ２：構造化文書ＤＢ２１’の各構造化文書データについて、階層のどこかに「ｂｏｏｋ」という要素があり、その「ｂｏｏｋ」という要素はその中に「ａｕｔｈｏｒ」という要素を持ち、さらにこの「ａｕｔｈｏｒ」という要素の中に、「ｆｉｒｓｔ」と「ｌａｓｔ」という２つの要素を持つような「ｂｏｏｋ」要素であり、さらにその「ｂｏｏｋ」要素よりも後に出現する「ｂｏｏｋ」要素の子である「ｔｉｔｌｅ」要素の、親要素の子であるような「ａｕｔｈｏｒ」要素の一覧を返す。 FIG. 16 is an explanatory diagram illustrating an example of query data assumed in the present embodiment. The query data Q2 shown in FIG. 16 is described in XQuery similarly to the query data Q1 described in the first embodiment, and includes conditions (structural conditions) regarding a complicated hierarchical structure having the following meanings. .
Q2: For each structured document data in the structured document DB 21 ′, there is an element “book” somewhere in the hierarchy, the element “book” has an element “author” in the element, and this “author” "Book" element having two elements "first" and "last", and "title" which is a child of the "book" element appearing after the "book" element Returns a list of “author” elements that are children of the parent element.

クエリデータＱ２では、クエリデータＱ１に加えて「／ｐａｒｅｎｔ」というパスが出現している。ＸＱｕｅｒｙでの／ｄｅｓｃｅｎｄａｎｔ、／ｄｅｓｃｅｎｄａｎｔ−ｏｒ−ｓｅｌｆや／（ｃｈｉｌｄ）が、構造化文書のルート要素（根）からリーフ要素（葉）方向に構造を照合するパスであるのに対して、／ｐａｒｅｎｔや／ａｎｃｅｓｔｏｒは構造化文書データのリーフ要素（葉）からルート要素（根）方向に、階層構造内での親や先祖要素を照合するパスである。 In the query data Q2, a path “/ parent” appears in addition to the query data Q1. In XQuery, / descendant, / descendant-or-self, and / (child) are paths that collate structures from the root element (root) to the leaf element (leaf) direction of the structured document. “/ Ancestor” is a path for collating the parent and ancestor elements in the hierarchical structure from the leaf element (leaf) to the root element (root) of the structured document data.

本実施形態おける検索処理部２３’による検索処理の流れは、図７に示した第１の実施形態のものと同様である。ただし、本実施形態では、ステップＳ２のクエリデータ解析処理の中で、構造条件書き換え部３１による構造条件書き換え処理が実施される。 The flow of search processing by the search processing unit 23 'in this embodiment is the same as that in the first embodiment shown in FIG. However, in the present embodiment, the structural condition rewriting process by the structural condition rewriting unit 31 is performed in the query data analysis process in step S2.

図１７は、本実施形態におけるクエリデータ解析処理の流れを示すフローチャートである。この図１７のフローチャートにおいて、ステップＳ２１１〜ステップＳ２１５までの処理は、第１の実施形態で説明した図８のステップＳ２０１〜ステップＳ２０５と同様であるため、説明を省略する。 FIG. 17 is a flowchart showing the flow of query data analysis processing in this embodiment. In the flowchart of FIG. 17, the processing from step S211 to step S215 is the same as step S201 to step S205 of FIG. 8 described in the first embodiment, and thus the description thereof is omitted.

本実施形態におけるクエリデータ解析処理では、ステップＳ２１５の処理の次に、構造条件書き換え部３１が、これまでの処理で作成された第１の部分クエリデータに、構造化文書データの階層構造を葉から根方向へ照合するような構造条件が含まれるかどうかを判定する（ステップＳ２１６）。そして、構造条件書き換え部３１は、第１の部分クエリデータに、構造化文書データの階層構造を葉から根方向へ照合するような構造条件が含まれていれば（ステップＳ２１６：Ｙｅｓ）、構造条件書き換え処理を行って、構造化文書データの階層構造を葉から根方向へ照合する構造条件を、根から葉方向へ照合する構造条件に書き換える（ステップＳ２１７）。 In the query data analysis processing in the present embodiment, after the processing in step S215, the structural condition rewriting unit 31 leaves the hierarchical structure of the structured document data in the first partial query data created by the processing so far. It is determined whether or not a structural condition that matches in the root direction is included (step S216). If the first partial query data includes a structural condition such that the hierarchical structure of the structured document data is collated from the leaves to the root direction (step S216: Yes), the structural condition rewriting unit 31 Condition rewriting processing is performed to rewrite the structural condition for collating the hierarchical structure of the structured document data from the leaf to the root direction to the structural condition for collating from the root to the leaf direction (step S217).

一方、ステップＳ２１６の判定で、第１の部分クエリデータに、構造化文書データの階層構造を葉から根方向へ照合するような構造条件が含まれなければ（ステップＳ２１６：Ｎｏ）、ステップＳ２１７の構造条件書き換え処理を行うことなく、クエリデータ解析処理を終了する。 On the other hand, if it is determined in step S216 that the first partial query data does not include a structural condition for collating the hierarchical structure of the structured document data from the leaf to the root direction (step S216: No), the process proceeds to step S217. The query data analysis process is terminated without performing the structural condition rewriting process.

図１８は、図１６に例示したクエリデータＱ２について、図１７のステップＳ２１５までの処理を行った結果である第１の部分クエリデータＱ２＿Ａと第２の部分クエリデータＱ２＿Ｂの例を示す図である。図１８において、第１の部分クエリデータＱ２＿Ａは、構造条件ＰＰ１とＰＰ２を含んでいる。また、第２の部分クエリデータＱ２＿Ｂは、Ｑ２＿Ａによる構造照合処理の照合結果を、上述した順序関係に関する構造条件に応じて結合演算する手順を含んでいる。この図１８の例では、第１の部分クエリデータＱ２＿Ａの構造条件ＰＰ２が、／ｐａｒｅｎｔのパスを含んでいる。このため、構造条件書き換え部３１による構造条件書き換え処理が行われることになる。 18 is a diagram illustrating an example of the first partial query data Q2_A and the second partial query data Q2_B, which are the results of performing the processing up to step S215 in FIG. 17 for the query data Q2 illustrated in FIG. . In FIG. 18, the first partial query data Q2_A includes structural conditions PP1 and PP2. Further, the second partial query data Q2_B includes a procedure for performing a join operation on the collation result of the structure collation processing by Q2_A according to the structural condition related to the order relation described above. In the example of FIG. 18, the structure condition PP2 of the first partial query data Q2_A includes the path / parent. For this reason, the structural condition rewriting process by the structural condition rewriting unit 31 is performed.

図１９は、図１７のステップＳ２１７で行われる構造条件書き換え処理を示すフローチャートである。構造条件書き換え部３１は、まず、第１の部分クエリデータのうち、構造化文書データの階層構造を葉から根方向へ照合するパスが含まれる構造条件（以下、入力構造条件という。）の中で、葉から根方向へ照合するパス部分を特定する（ステップＳ２１７１）。次に、構造条件書き換え部３１は、入力構造条件を、特定したパス部分より前のパスと、特定したパス部分より後のパスと、構造ガイドデータとを参照して、葉から根方向へ照合するパス部分を含まない同じ意味のパスに書き換える（ステップＳ２１７２）。 FIG. 19 is a flowchart showing the structural condition rewriting process performed in step S217 of FIG. First, the structural condition rewriting unit 31 in the first partial query data includes a structural condition (hereinafter referred to as an input structural condition) including a path for collating the hierarchical structure of the structured document data from the leaf to the root direction. The path portion to be collated from the leaf to the root direction is identified (step S2171). Next, the structural condition rewriting unit 31 refers to the input structural condition from the leaf to the root direction with reference to the path before the identified path part, the path after the identified path part, and the structure guide data. To a path having the same meaning that does not include the path portion to be performed (step S2172).

ここで、図１８に例示した第１の部分クエリデータＱ２＿Ａにある構造条件ＰＰ２を、図１５に例示した構造ガイドデータを参照して書き換える場合を例に挙げて、ステップＳ２１７２での処理の概要を説明する。なお、ここでは説明を簡単にするために、図１５に例示した構造ガイドデータを参照しているが、実際には構造化文書ＤＢ２１’に格納された構造ガイドデータが参照される。 Here, an example of rewriting the structural condition PP2 in the first partial query data Q2_A illustrated in FIG. 18 with reference to the structural guide data illustrated in FIG. 15 will be given as an overview of the processing in step S2172. explain. In order to simplify the description here, the structure guide data illustrated in FIG. 15 is referred to. However, the structure guide data stored in the structured document DB 21 ′ is actually referred to.

第１の部分クエリデータＱ２＿Ａにある構造条件ＰＰ２の／ｐａｒｅｎｔ：：ｎｏｄｅ（）部分について、その直前のパスは／ｔｉｔｌｅである。図１５に例示した構造ガイドデータを参照すると、ｔｉｔｌｅを末尾に持つ構造は／ｂｏｏｋｓ／ｂｏｏｋ／ｔｉｔｌｅである。／ｐａｒｅｎｔ：：ｎｏｄｅ（）は親要素を指定するため、／ｐａｒｅｎｔ：：ｎｏｄｅ（）の指定する構造は／ｂｏｏｋｓ／ｂｏｏｋとわかる。さらに、／ｐａｒｅｎｔ：：ｎｏｄｅ（）の次のパスである／ａｕｔｈｏｒについても、ここまでで特定された構造は／ｂｏｏｋｓ／ｂｏｏｋであるから、／ｂｏｏｋｓ／ｂｏｏｋの後に／ａｕｔｈｏｒが続く構造があるかを構造ガイドデータで確認すると、存在するので、構造条件ＰＰ２の書き換え結果として／ｂｏｏｋｓ／ｂｏｏｋ／ａｕｔｈｏｒが得られる。 For the / parent :: node () part of the structural condition PP2 in the first partial query data Q2_A, the path immediately before is / title. Referring to the structure guide data illustrated in FIG. 15, the structure having a title at the end is / books / book / title. Since / parent :: node () designates a parent element, the structure designated by / parent :: node () is known as / books / book. Furthermore, regarding / author, which is the next path of / parent :: node (), since the structure specified so far is / books / book, is there any structure in which / books / book is followed by / author? Is confirmed by the structure guide data, and / books / book / author is obtained as a rewrite result of the structure condition PP2.

図２０は、図１８に例示した第１の部分クエリデータＱ２＿Ａについて、構造条件書き換え処理を行った結果を示す図である。なお、図中の第２の部分クエリデータＱ２＿Ｂは、図１８に例示したものと同じである。 FIG. 20 is a diagram illustrating a result of the structural condition rewriting process performed on the first partial query data Q2_A illustrated in FIG. Note that the second partial query data Q2_B in the figure is the same as that illustrated in FIG.

本実施形態では、構造条件書き換え部３１による構造条件書き換え処理を経て、構造照合処理部２８による構造照合処理が行われる（ステップＳ３）。このとき、第１の部分クエリデータに含まれる構造条件は、葉から根方向へ照合するパス部分を含まないものに書き換えられているため、構造照合処理部２８による構造照合処理は、すべて根から葉方向への照合のみで実現することができ、極めて高速な処理が可能である。 In the present embodiment, after the structural condition rewriting process by the structural condition rewriting unit 31, the structural verification process by the structural verification process unit 28 is performed (step S3). At this time, since the structural condition included in the first partial query data has been rewritten so as not to include the path portion to be collated from the leaf to the root direction, all the structural collation processing by the structural collation processing unit 28 is performed from the root. This can be realized only by collation in the leaf direction, and extremely high-speed processing is possible.

図５に例示した要素ＩＤ付与済み構造化文書データについて、図２０に例示した構造条件書き換え処理後の第１の部分クエリデータＱ２＿Ａに含まれる構造条件ＰＰ１，ＰＰ２による構造照合処理を行った結果である構造照合処理結果データＲ２＿Ａを図２１に示す。図中のＴ１，Ｔ２は、それぞれ構造条件ＰＰ１，ＰＰ２の構造照合処理の結果であり、Ｔ１としては構造条件ＰＰ１に合致する構造を持つ要素の要素ＩＤ群［１］が得られ、Ｔ２としては構造条件ＰＰ２に合致する構造を持つ要素の要素ＩＤ群［２］，［３］が得られる。 As a result of performing the structure matching process using the structural conditions PP1 and PP2 included in the first partial query data Q2_A after the structural condition rewriting process illustrated in FIG. Certain structure matching processing result data R2_A is shown in FIG. T1 and T2 in the figure are the results of the structure matching processing of the structural conditions PP1 and PP2, respectively. As T1, an element ID group [1] of elements having a structure matching the structural condition PP1 is obtained. Element ID groups [2] and [3] of elements having a structure that matches the structure condition PP2 are obtained.

本実施形態においても、構造照合処理部２８による構造照合処理が終了すると、結合演算処理部２９により、構造照合処理部２８の処理結果である構造照合処理結果データについて、第２の部分クエリデータに含まれる手順に従って結合演算処理が行われる（ステップＳ４）。 Also in this embodiment, when the structure matching processing by the structure matching processing unit 28 is completed, the join operation processing unit 29 converts the structure matching processing result data, which is the processing result of the structure matching processing unit 28, into the second partial query data. A join calculation process is performed according to the included procedure (step S4).

図２２は、図２０に例示した第２の部分クエリデータＱ２＿Ｂを用いて図２１に例示した構造照合処理結果データＲ２＿Ａについて結合演算処理を行う場合の処理の概要を説明する図である。図２２の例では、Ｔ１［１］＜Ｔ２［２］という順序関係に関する条件に従ってＴ１とＴ２との結合演算が行われ、Ｔ３が得られる。そして、Ｔ３について［３］のみを取り出すことにより、結合演算処理の結果として、図２２に示すような中間結果Ｒ２＿Ｂが得られる。この中間結果Ｒ２＿Ｂは、図１６に例示したクエリデータＱ２を、上述したようなクエリデータ解析処理を行わずにそのまま処理した場合の結果と一致する。 FIG. 22 is a diagram for explaining the outline of the process when the join operation process is performed on the structure matching process result data R2_A illustrated in FIG. 21 using the second partial query data Q2_B illustrated in FIG. In the example of FIG. 22, a join operation between T1 and T2 is performed according to a condition relating to the order relationship of T1 [1] <T2 [2], and T3 is obtained. Then, by extracting only [3] for T3, an intermediate result R2_B as shown in FIG. 22 is obtained as a result of the join operation process. This intermediate result R2_B matches the result when the query data Q2 illustrated in FIG. 16 is directly processed without performing the query data analysis process as described above.

最後に、検索インタフェース部２６により、結合演算処理部２９による結合演算処理の結果（中間結果）として得られる要素ＩＤが、それに対応する構造化文書データとして文字列化され、結果データとしてクライアント端末３に返却される（ステップＳ５）。図２２に示した中間結果Ｒ２＿Ｂが得られた場合には、この中間結果Ｒ２＿Ｂに含まれる要素ＩＤであるＥ２２に対応する構造化文書データが文字列化され、クエリデータＱ２の結果データＲ２として、図２３に示すデータがクライアント端末３に返却される。 Finally, the search interface unit 26 converts the element ID obtained as a result (intermediate result) of the join operation processing by the join operation processing unit 29 into a character string as the corresponding structured document data, and the client terminal 3 as the result data (Step S5). When the intermediate result R2_B shown in FIG. 22 is obtained, the structured document data corresponding to the element ID E22 included in the intermediate result R2_B is converted into a character string, and as the result data R2 of the query data Q2, Data shown in FIG. 23 is returned to the client terminal 3.

以上、具体的な例を挙げながら説明したように、本実施形態によれば、第１の部分クエリデータが構造化文書データの論理構造における階層を下位から上位へと辿る条件、つまり構造化文書データのリーフ要素（葉）からルート要素（根）方向に構造を照合するような条件を含む場合に、該条件を、構造化文書ＤＢ２１’に格納されている構造ガイドデータに基づいて、構造化文書データのルート要素（根）からリーフ要素（葉）方向に構造を照合するような条件に書き換える。そして、入力クエリデータを単純な構造条件の構造照合処理と結合演算処理とで処理するようにしている。したがって、入力クエリデータが構造化文書データのリーフ要素（葉）からルート要素（根）方向に構造を照合するような条件を含む複雑なものであっても、構造照合処理の高速化を実現し、複雑な構造条件を含むクエリデータによる検索を高速に実行することができる。 As described above with reference to specific examples, according to the present embodiment, the condition that the first partial query data follows the hierarchy in the logical structure of the structured document data from the lower level to the higher level, that is, the structured document. When a condition for collating the structure from the leaf element (leaf) to the root element (root) is included, the condition is structured based on the structure guide data stored in the structured document DB 21 ′. The conditions are rewritten so that the structure is collated from the root element (root) to the leaf element (leaf) direction of the document data. Then, the input query data is processed by a simple structure condition structure matching process and a join operation process. Therefore, even if the input query data is complex including conditions that match the structure from the leaf element (leaf) to the root element (root) of the structured document data, the structure matching process can be accelerated. Thus, it is possible to execute a search using query data including complicated structural conditions at high speed.

なお、上記の具体例では、第１の部分クエリデータに、構造化文書データの論理構造における階層を下位から上位へと辿る条件（構造化文書データのリーフ要素（葉）からルート要素（根）方向に構造を照合するような条件）として「ｐａｒｅｎｔ」が含まれる場合を例示して説明したが、構造化文書データの論理構造における階層を下位から上位へと辿る他の条件、例えば「ａｎｃｅｓｔｏｒ」や「ａｎｃｅｓｔｏｒ−ｏｒ−ｓｅｌｆ」などが含まれる場合であっても、上記の具体例と同様に処理することができる。 In the above specific example, the first partial query data includes a condition for tracing the hierarchy in the logical structure of the structured document data from lower to higher (from the leaf element (leaf) to the root element (root) of the structured document data). As an example, a case where “parent” is included as a condition for checking the structure in the direction) has been described, but other conditions for tracing the hierarchy in the logical structure of structured document data from the lower to the higher, for example, “ancestor” Even when “ancestor-or-self” is included, it can be processed in the same manner as in the above specific example.

［第３の実施形態］
次に、第３の実施形態について、図３、図５、図７、図２４乃至図３３を参照して説明する。本実施形態は、入力クエリデータに含まれる要素の順序関係を指定する条件が、ｐｏｓｉｔｉｏｎ関数やｌａｓｔ関数の場合の例である。本実施形態におけるサーバ１およびクライアント端末３の構成は、図３に示した第１の実施形態のものと同様である。なお、以下の説明において、上述した第１の実施形態と共通の構成については同一の符号を付し、重複した説明を省略する。 [Third Embodiment]
Next, a third embodiment will be described with reference to FIGS. 3, 5, 7, and 24 to 33. FIG. This embodiment is an example when the condition for specifying the order relation of elements included in input query data is a position function or a last function. The configurations of the server 1 and the client terminal 3 in this embodiment are the same as those in the first embodiment shown in FIG. In the following description, the same reference numerals are given to the same components as those in the first embodiment described above, and duplicate descriptions are omitted.

図２４は、本実施形態で想定するクエリデータの一例を示す説明図である。この図２４に示すクエリデータＱ３＿１，Ｑ３＿２は、第１の実施形態で説明したクエリデータＱ１や第２の実施形態で説明したクエリデータＱ２と同じくＸＱｕｅｒｙで記述されており、下記のような意味の複雑な階層構造に関する条件（構造条件）を含んでいる。
Ｑ３＿１：構造化文書ＤＢ２１の各構造化文書データについて、階層のどこかに「ｂｏｏｋ」という要素があり、その「ｂｏｏｋ」という要素の中にある「ａｕｔｈｏｒ」要素のうち、１番目に出現するものの一覧を返す。
Ｑ３＿２：構造化文書ＤＢ２１の各構造化文書データについて、階層のどこかに「ｂｏｏｋ」という要素があり、その「ｂｏｏｋ」という要素の中にある「ａｕｔｈｏｒ」要素のうち、最後に出現するものの一覧を返す。 FIG. 24 is an explanatory diagram illustrating an example of query data assumed in the present embodiment. The query data Q3_1 and Q3_2 shown in FIG. 24 are described in XQuery like the query data Q1 described in the first embodiment and the query data Q2 described in the second embodiment, and have the following meanings: It includes conditions (structure conditions) related to complex hierarchical structures.
Q3_1: For each structured document data in the structured document DB 21, there is an element “book” somewhere in the hierarchy, and among the “author” elements in the element “book”, the element that appears first Returns a list.
Q3_2: For each structured document data in the structured document DB 21, there is an element “book” somewhere in the hierarchy, and among the “author” elements in the element “book”, a list of the last appearing elements return it.

クエリデータＱ３＿１には、［ｐｏｓｉｔｉｏｎ（）＝１］という表現がある。これは、ＸＱｕｅｒｙで、［ｐｏｓｉｔｉｏｎ（）＝１］が付与されているパスに該当するノードのうち、１番目にあるもののみを選択する指定である。一方、クエリデータＱ３＿２には、［ｌａｓｔ（）］という表現がある。これは、［ｌａｓｔ（）］が付与されているパスに該当するノードのうち、最後のものを選択する指定となる。 The query data Q3_1 has an expression [position () = 1]. This is an instruction to select only the first node among the nodes corresponding to the path to which [position () = 1] is assigned in XQuery. On the other hand, the query data Q3_2 has an expression [last ()]. This is a specification for selecting the last node among the nodes corresponding to the path to which [last ()] is assigned.

本実施形態おける検索処理部２３による検索処理の流れは、図７に示した第１の実施形態のものと同様である。ただし、本実施形態では、ステップＳ２のクエリデータ解析処理の内容が第１の実施形態と相違している。 The flow of search processing by the search processing unit 23 in this embodiment is the same as that of the first embodiment shown in FIG. However, in the present embodiment, the contents of the query data analysis process in step S2 are different from those in the first embodiment.

図２５は、本実施形態におけるクエリデータ解析処理の流れを示すフローチャートである。クエリデータ分解部２７は、第１の実施形態と同様に、はじめに、入力クエリデータのすべてを便宜的に第１の部分クエリデータとする（ステップＳ２２１）。このとき、第２の部分クエリデータは空としておく。 FIG. 25 is a flowchart showing the flow of query data analysis processing in this embodiment. As in the first embodiment, the query data decomposing unit 27 first sets all input query data as first partial query data for convenience (step S221). At this time, the second partial query data is left empty.

次に、クエリデータ分解部２７は、第１の部分クエリデータをチェックして、第１の部分クエリデータ（ここでは入力クエリデータと同じ）に、［ｐｏｓｉｔｉｏｎ（）＝ｎ］や［ｌａｓｔ（）］といった条件が含まれるかどうかを判定する（ステップＳ２２２）。そして、クエリデータ分解部２７は、そのような構造条件が含まれていれば（ステップＳ２２２：Ｙｅｓ）、第１の部分クエリデータ（ここでは入力クエリデータと同じ）から［ｐｏｓｉｔｉｏｎ（）＝ｎ］や［ｌａｓｔ（）］といった条件を取り除いたものを、第１の部分クエリデータとする（ステップＳ２２３）。そして、クエリデータ分解部２７は、ステップＳ２２３で取り除いた条件が［ｐｏｓｉｔｉｏｎ（）＝ｎ］であれば、第１の部分クエリデータによる構造照合処理結果からｎ番目に出現する要素を選択する演算指示を第２の部分クエリデータとし、ステップＳ２２３で取り除いた条件が［ｌａｓｔ（）］であれば、第１の部分クエリデータによる構造照合処理結果から最後に出現する要素を選択する演算指示を第２の部分クエリデータとして（ステップＳ２２４）、クエリデータ解析処理を終了する。 Next, the query data decomposition unit 27 checks the first partial query data, and adds [position () = n] or [last () to the first partial query data (here, the same as the input query data). ] Is included (step S222). Then, if such a structural condition is included (step S222: Yes), the query data decomposition unit 27 starts from the first partial query data (here, the same as the input query data) [position () = n]. The data obtained by removing the conditions such as [last ()] is used as the first partial query data (step S223). Then, if the condition removed in step S223 is [position () = n], the query data decomposing unit 27 selects the nth element that appears from the result of the structure matching process using the first partial query data. Is the second partial query data, and if the condition removed in step S223 is [last ()], the operation instruction for selecting the element that appears last from the structure matching processing result by the first partial query data is the second. As the partial query data (step S224), the query data analysis process is terminated.

一方、ステップＳ２２２の判定で、第１の部分クエリデータ（ここでは入力クエリデータと同じ）に［ｐｏｓｉｔｉｏｎ（）＝ｎ］や［ｌａｓｔ（）］といった条件が含まれなければ（ステップＳ２２２：Ｎｏ）、クエリデータ分解部２７は、ステップＳ２２３およびステップＳ２２４の処理を行うことなく、クエリデータ解析処理を終了する。 On the other hand, if it is determined in step S222 that the first partial query data (here, the same as the input query data) does not include a condition such as [position () = n] or [last ()] (step S222: No) The query data decomposition unit 27 ends the query data analysis process without performing the processes of steps S223 and S224.

図２６は、図２４に例示したクエリデータＱ３＿１についてのクエリデータ解析処理の結果である第１の部分クエリデータＱ３＿１＿Ａと第２の部分クエリデータＱ３＿１＿Ｂの一例を示す図である。また、図２７は、図２４に例示したクエリデータＱ３＿２についてのクエリデータ解析処理の結果である第１の部分クエリデータＱ３＿２＿Ａと第２の部分クエリデータＱ３＿２＿Ｂの一例を示す図である。ここで、“ＧＲＯＵＰＢＹ（Ｘ）”という演算指示は、（Ｘ）で指定した部分の要素ＩＤが同一のものをグループ化する演算指示である。また、“ＦＩＬＴＥＲ（Ｘ）（Ｙ）（Ｚ）”という演算指示は、（Ｘ）で指定するグループについて、（Ｙ）で指定する部分の要素ＩＤが、（Ｚ）の順番にあるもののみを残す、という演算指示である。図２６に例示した第２の部分クエリデータＱ３＿１＿Ｂにあるように（Ｚ）が１のときは、（Ｙ）で指定する部分の要素ＩＤが最も小さなものを残すという演算指示となり、図２７に例示した第２の部分クエリデータＱ３＿２＿Ｂにあるように（Ｚ）がＬＡＳＴのときは、（Ｙ）で指定する部分の要素ＩＤが最も大きなものを残すという演算指示となる。 FIG. 26 is a diagram illustrating an example of the first partial query data Q3_1_A and the second partial query data Q3_1_B, which are results of the query data analysis process for the query data Q3_1 illustrated in FIG. FIG. 27 is a diagram illustrating an example of the first partial query data Q3_2_A and the second partial query data Q3_2_B, which are results of the query data analysis process for the query data Q3_2 illustrated in FIG. Here, the operation instruction “GROUP BY (X)” is an operation instruction for grouping elements having the same element ID in the part specified in (X). In addition, the calculation instruction “FILTER (X) (Y) (Z)” is only for those in which the element ID of the portion specified by (Y) is in the order of (Z) for the group specified by (X). The calculation instruction is to leave. As shown in the second partial query data Q3_1_B illustrated in FIG. 26, when (Z) is 1, the calculation instruction is to leave the element ID of the portion specified by (Y) being the smallest, and is illustrated in FIG. As shown in the second partial query data Q3_2_B, when (Z) is LAST, the calculation instruction is to leave the element with the largest element ID specified by (Y).

クエリデータ分解部２７によるクエリデータ解析処理が終了すると、第１の実施形態と同様に、構造照合処理部２８により、第１の部分クエリデータに含まれる構造条件の構造照合処理が行われる（ステップＳ３）。 When the query data analysis processing by the query data decomposing unit 27 is completed, the structure matching processing unit 28 performs the structure matching processing of the structural condition included in the first partial query data, as in the first embodiment (steps). S3).

図５に例示した要素ＩＤ付与済み構造化文書データについて、図２６に例示した第１の部分クエリデータＱ３＿１＿Ａに含まれる構造条件ＰＰ１による構造照合処理を行った結果である構造照合処理結果データＲ３＿１＿Ａを図２８に示す。また、図５に例示した要素ＩＤ付与済み構造化文書データについて、図２７に例示した第１の部分クエリデータＱ３＿２＿Ａに含まれる構造条件ＰＰ１による構造照合処理を行った結果である構造照合処理結果データＲ３＿２＿Ａを図３０に示す。図中のＴ１は構造条件ＰＰ１の構造照合処理の結果であり、図２８の構造照合処理結果データＲ３＿１＿Ａと、図３０の構造照合処理結果データＲ３＿２＿Ａのいずれにおいても、構造条件ＰＰ１に合致する構造を持つ要素の要素ＩＤ群［１］，［２］が得られる。 For the structured document data with element IDs given as an example in FIG. 5, the structure matching process result data R3_1_A, which is the result of performing the structure matching process according to the structure condition PP1 included in the first partial query data Q3_1_A exemplified in FIG. As shown in FIG. In addition, with respect to the structured document data with element ID given as an example in FIG. 5, the structure matching process result data that is the result of performing the structure matching process according to the structure condition PP1 included in the first partial query data Q3_2_A exemplified in FIG. R3_2_A is shown in FIG. T1 in the figure is the result of the structure matching process of the structure condition PP1, and in both the structure matching process result data R3_1_A of FIG. 28 and the structure matching process result data R3_2_A of FIG. Element ID groups [1] and [2] of the possessed elements are obtained.

構造照合処理部２８による構造照合処理が終了すると、第１の実施形態と同様に、結合演算処理部２９により、構造照合処理部２８の処理結果である構造照合処理結果データについて、第２の部分クエリデータに含まれる手順に従って結合演算処理が行われる（ステップＳ４）。 When the structure matching processing by the structure matching processing unit 28 is completed, the second operation is performed on the structure matching processing result data, which is the processing result of the structure matching processing unit 28, by the join operation processing unit 29, as in the first embodiment. A join operation process is performed according to the procedure included in the query data (step S4).

図２９は、図２６に例示した第２の部分クエリデータＱ３＿１＿Ｂを用いて図２８に例示した構造照合処理結果データＲ３＿１＿Ａについて結合演算処理を行う場合の処理の概要を説明する図である。また、図３１は、図２７に例示した第２の部分クエリデータＱ３＿２＿Ｂを用いて図３０に例示した構造照合処理結果データＲ３＿２＿Ａについて結合演算処理を行う場合の処理の概要を説明する図である。 FIG. 29 is a diagram for explaining an outline of processing when the join operation processing is performed on the structure matching processing result data R3_1_A illustrated in FIG. 28 using the second partial query data Q3_1_B illustrated in FIG. FIG. 31 is a diagram for explaining an overview of processing when the join operation processing is performed on the structure matching processing result data R3_2_A illustrated in FIG. 30 using the second partial query data Q3_2_B illustrated in FIG.

図２９の例および図３１の例では、“ＧＲＯＵＰＢＹＴ１［１］”という演算指示に従って、Ｔ１の要素ＩＤ群の中で同一の要素ＩＤがグループ化され、Ｔ２が得られる。そして、図２９の例では、“ＦＩＬＴＥＲＴ２［１］Ｔ２［２］１”という演算指示に従って、Ｔ２［１］について、Ｔ２［２］の要素ＩＤが最も小さいものを残す演算処理によりＴ３が得られ、Ｔ３について［２］のみを取り出すことにより、結合演算処理の結果として、図２９に示すような中間結果Ｒ３＿１＿Ｂが得られる。また、図３１の例では、“ＦＩＬＴＥＲＴ２［１］Ｔ２［２］ＬＡＳＴ”という演算指示に従って、Ｔ２［１］について、Ｔ２［２］の要素ＩＤが最も大きいものを残す演算処理によりＴ３が得られ、Ｔ３について［２］のみを取り出すことにより、結合演算処理の結果として、図３１に示すような中間結果Ｒ３＿２＿Ｂが得られる。これらの中間結果Ｒ３＿１＿Ｂ，Ｒ３＿２＿Ｂは、図２４に例示したクエリデータＱ３＿１，Ｑ３＿２を、上述したようなクエリデータ解析処理を行わずにそのまま処理した場合の結果と一致する。 In the example of FIG. 29 and the example of FIG. 31, the same element IDs are grouped in the element ID group of T1 according to the calculation instruction “GROUP BY T1 [1]”, and T2 is obtained. In the example of FIG. 29, T3 is obtained by the arithmetic processing that leaves the smallest element ID of T2 [2] for T2 [1] according to the arithmetic instruction “FILTER T2 [1] T2 [2] 1”. Then, by extracting only [2] for T3, an intermediate result R3_1_B as shown in FIG. 29 is obtained as a result of the join operation processing. In the example of FIG. 31, T3 is obtained by the calculation process that leaves the element with the largest element ID of T2 [2] for T2 [1] according to the calculation instruction “FILTER T2 [1] T2 [2] LAST”. Then, by extracting only [2] for T3, an intermediate result R3_2_B as shown in FIG. 31 is obtained as a result of the join operation processing. These intermediate results R3_1_B and R3_2_B coincide with the results of processing the query data Q3_1 and Q3_2 illustrated in FIG. 24 as they are without performing the query data analysis process as described above.

最後に、検索インタフェース部２６により、結合演算処理部２９による結合演算処理の結果（中間結果）として得られる要素ＩＤが、それに対応する構造化文書データとして文字列化され、結果データとしてクライアント端末３に返却される（ステップＳ５）。図２９に示した中間結果Ｒ３＿１＿Ｂが得られた場合には、この中間結果Ｒ３＿１＿Ｂに含まれる要素ＩＤであるＥ５，Ｅ１４，Ｅ２２に対応する構造化文書データが文字列化され、クエリデータＱ３＿１の結果データＲ３＿１として、図３２に示すデータがクライアント端末３に返却される。また、図３１に示した中間結果Ｒ３＿２＿Ｂが得られた場合には、この中間結果Ｒ３＿２＿Ｂに含まれる要素ＩＤであるＥ８，Ｅ１４，Ｅ２２に対応する構造化文書データが文字列化され、クエリデータＱ３＿２の結果データＲ３＿２として、図３３に示すデータがクライアント端末３に返却される。 Finally, the search interface unit 26 converts the element ID obtained as a result (intermediate result) of the join operation processing by the join operation processing unit 29 into a character string as the corresponding structured document data, and the client terminal 3 as the result data (Step S5). When the intermediate result R3_1_B shown in FIG. 29 is obtained, the structured document data corresponding to the element IDs E5, E14, and E22 included in the intermediate result R3_1_B is converted into a character string, and the result of the query data Q3_1 is obtained. The data shown in FIG. 32 is returned to the client terminal 3 as the data R3_1. When the intermediate result R3_2_B shown in FIG. 31 is obtained, the structured document data corresponding to the element IDs E8, E14, and E22 included in the intermediate result R3_2_B is converted into a character string, and the query data Q3_2 is obtained. As the result data R3_2, data shown in FIG. 33 is returned to the client terminal 3.

以上、具体的な例を挙げながら説明したように、本実施形態によれば、入力クエリデータが要素の順序関係を指定する条件としてｐｏｓｉｔｉｏｎ関数やｌａｓｔ関数を含む場合に、その入力クエリデータを、階層の上下関係を指定する条件のみを含む第１の部分クエリデータと、第１の部分クエリデータによる照合結果を、ｐｏｓｉｔｉｏｎ関数やｌａｓｔ関数で示される条件に応じて結合演算する手順を含む第２の部分クエリデータとに分解して処理するようにしている。したがって、入力クエリデータがｐｏｓｉｔｉｏｎ関数やｌａｓｔ関数を含む複雑なものであっても、この入力クエリデータを単純な構造条件の構造照合処理と結合演算処理とで処理することで、構造照合処理の高速化を実現し、複雑な構造条件を含むクエリデータによる検索を高速に実行することができる。 As described above with reference to a specific example, according to the present embodiment, when the input query data includes the position function or the last function as a condition for specifying the order relation of elements, the input query data is A second step including a step of performing a join operation on the first partial query data including only a condition specifying the hierarchical relationship of the hierarchy and a matching result based on the first partial query data according to the condition indicated by the position function or the last function. The data is broken down into partial query data. Therefore, even if the input query data is complicated including the position function and the last function, the input query data is processed by the simple structure condition structure matching process and the join operation process, thereby speeding up the structure matching process. The search using query data including complicated structural conditions can be executed at high speed.

［第４の実施形態］
次に、第４の実施形態について、図５、図１９、図３４乃至図４０を参照して説明する。本実施形態は、第２の実施形態と同様の構造条件の書き換えを行うが、入力クエリデータに要素の順序関係を指定する条件（第２条件）が含まれておらず、第２のクエリデータに基づく結合演算処理を行わない例である。なお、以下の説明において、上述した第２の実施形態と共通の構成については同一の符号を付し、重複した説明を省略する。 [Fourth Embodiment]
Next, a fourth embodiment will be described with reference to FIGS. 5, 19, and 34 to 40. FIG. In the present embodiment, the same structural condition as in the second embodiment is rewritten, but the input query data does not include a condition (second condition) for specifying the order relation of elements, and the second query data This is an example in which the join operation processing based on is not performed. In the following description, the same reference numerals are given to the same components as those in the above-described second embodiment, and a duplicate description is omitted.

図３４は、本実施形態におけるサーバ１’’およびクライアント端末３の概略構成を示すブロック図である。本実施形態では、サーバ１’’の検索処理部２３’’にクエリデータ分解部２７が設けられておらず、クライアント端末３から送信された入力クエリデータは、構造条件書き換え部３１’に入力される。また、サーバ１’’の検索処理部２３’’に結合演算処理部２９が設けられておらず、構造照合処理部２８による構造照合処理の結果である構造照合処理結果データが、そのまま検索インタフェース部２６に渡される。なお、クライアント端末３の構成は第２の実施形態と同じである。 FIG. 34 is a block diagram showing a schematic configuration of the server 1 ″ and the client terminal 3 in the present embodiment. In the present embodiment, the query data decomposition unit 27 is not provided in the search processing unit 23 ″ of the server 1 ″, and the input query data transmitted from the client terminal 3 is input to the structural condition rewriting unit 31 ′. The Further, the search processing unit 23 ″ of the server 1 ″ is not provided with the join operation processing unit 29, and the structure matching process result data, which is the result of the structure matching processing by the structure matching processing unit 28, is directly used as the search interface unit. 26. The configuration of the client terminal 3 is the same as that in the second embodiment.

図３５は、本実施形態で想定するクエリデータの一例を示す説明図である。この図３５に示すクエリデータＱ４は、第１乃至第３の実施形態で説明したクエリデータと同じくＸＱｕｅｒｙで記述されており、下記のような意味の複雑な階層構造に関する条件（構造条件）を含んでいる。
Ｑ４：構造化文書ＤＢ２１’の各構造化文書データについて、階層のどこかに「ｔｉｔｌｅ」という要素があり、その先祖の「ｂｏｏｋ」という要素の子要素である「ｐｕｂｌｉｓｈｅｒ」という要素の一覧を返す。 FIG. 35 is an explanatory diagram showing an example of query data assumed in the present embodiment. The query data Q4 shown in FIG. 35 is described in XQuery similarly to the query data described in the first to third embodiments, and includes conditions (structure conditions) relating to a complicated hierarchical structure having the following meanings. It is out.
Q4: For each structured document data in the structured document DB 21 ′, there is an element “title” somewhere in the hierarchy, and a list of elements “publisher” which is a child element of the element “book” of the ancestor is returned. .

クエリデータＱ４では、第２の実施形態で説明した／ａｎｃｅｓｔｏｒという構造化文書データのリーフ要素（葉）からルート要素（根）方向に、階層構造内での先祖要素を照合するパスが含まれているが、第１の実施形態で説明したクエリデータＱ１に含まれる／ｆｏｌｌｏｗｉｎｇ−ｓｉｂｌｉｎｇのような、ある構造を持つ要素間の順序関係に関する構造を照合するパスは含まれていない。 The query data Q4 includes a path for collating ancestor elements in the hierarchical structure from the leaf element (leaf) to the root element (root) of the structured document data called / ancestor described in the second embodiment. However, a path for collating a structure related to an order relationship between elements having a certain structure, such as / following-sibling included in the query data Q1 described in the first embodiment, is not included.

図３６は、本実施形態におけるサーバ１’’の検索処理部２３’’による検索処理の流れを示すフローチャートである。まず、検索インタフェース部２６により、クライアント端末３からネットワーク２経由で送信されたクエリデータの入力が受け付けられる（ステップＳ１）。このクエリデータは、構造条件書き換え部３１’に入力される。 FIG. 36 is a flowchart showing the flow of search processing by the search processing unit 23 ″ of the server 1 ″ in this embodiment. First, the search interface unit 26 accepts input of query data transmitted from the client terminal 3 via the network 2 (step S1). This query data is input to the structural condition rewriting unit 31 '.

次に、構造条件書き換え部３１’により、入力クエリデータについてのクエリデータ解析処理が行われる（ステップＳ２）。構造条件書き換え部３１’によるクエリデータ解析処理の一例を、図３７を参照して説明する。 Next, the query data analysis process for the input query data is performed by the structural condition rewriting unit 31 '(step S2). An example of the query data analysis process by the structural condition rewriting unit 31 'will be described with reference to FIG.

図３７は、構造条件書き換え部３１’によるクエリデータ解析処理の流れを示すフローチャートである。構造条件書き換え部３１’は、はじめに、入力クエリデータを第１の部分クエリデータとする（ステップＳ２３１）。次に、構造条件書き換え部３１’は、第１の部分クエリデータをチェックして、第１の部分クエリデータに、構造化文書データの階層構造を葉から根方向へ照合するような構造条件が含まれるかどうかを判定する（ステップＳ２３２）。そして、構造条件書き換え部３１’は、第１の部分クエリデータに、構造化文書データの階層構造を葉から根方向へ照合するような構造条件が含まれていれば（ステップＳ２３２：Ｙｅｓ）、構造条件書き換え処理を行って、構造化文書データの階層構造を葉から根方向へ照合する構造条件を、根から葉方向へ照合する構造条件に書き換える（ステップＳ２３３）。 FIG. 37 is a flowchart showing the flow of query data analysis processing by the structural condition rewriting unit 31 ′. The structural condition rewriting unit 31 ′ first sets the input query data as first partial query data (step S <b> 231). Next, the structural condition rewriting unit 31 ′ checks the first partial query data, and the first partial query data has a structural condition such that the hierarchical structure of the structured document data is collated from the leaf to the root direction. It is determined whether it is included (step S232). If the first partial query data includes a structural condition that collates the hierarchical structure of the structured document data from the leaf to the root direction (step S232: Yes), A structural condition rewriting process is performed to rewrite the structural condition for collating the hierarchical structure of the structured document data from the leaf to the root direction to the structural condition for collating from the root to the leaf direction (step S233).

一方、ステップＳ２３２の判定で、第１の部分クエリデータに、構造化文書データの階層構造を葉から根方向へ照合するような構造条件が含まれなければ（ステップＳ２３２：Ｎｏ）、ステップＳ２３３の構造条件書き換え処理を行うことなく、クエリデータ解析処理を終了する。 On the other hand, if it is determined in step S232 that the first partial query data does not include a structural condition for collating the hierarchical structure of the structured document data from the leaf to the root direction (step S232: No), the process proceeds to step S233. The query data analysis process is terminated without performing the structural condition rewriting process.

ステップＳ２３３で行われる構造条件書き換え処理は第２の実施形態と同様であり、処理内容は図１９に示したフローチャートの通りである。ここでは、図３５に例示したクエリデータＱ４を、図１５に例示した構造ガイドデータを参照して書き換える処理の概要を説明する。なお、ここでは説明を簡単にするために、図１５に例示した構造ガイドデータを参照しているが、実際には構造化文書ＤＢ２１’に格納された構造ガイドデータが参照される。 The structural condition rewriting process performed in step S233 is the same as that of the second embodiment, and the processing content is as shown in the flowchart of FIG. Here, an outline of processing for rewriting the query data Q4 illustrated in FIG. 35 with reference to the structure guide data illustrated in FIG. 15 will be described. In order to simplify the description here, the structure guide data illustrated in FIG. 15 is referred to. However, the structure guide data stored in the structured document DB 21 ′ is actually referred to.

クエリデータＱ４の／ａｎｃｅｓｔｏｒ：：ｂｏｏｋ部分について、その直前のパスは／ｔｉｔｌｅである。ここで、図１５に例示した構造ガイドデータを参照すると、ｔｉｔｌｅを末尾に持つ構造は／ｂｏｏｋｓ／ｂｏｏｋ／ｔｉｔｌｅである。／ａｎｃｅｓｔｏｒは先祖要素を指定するため、／ａｎｃｅｓｔｏｒの指定する構造は／ｂｏｏｋｓ／ｂｏｏｋ、／ｂｏｏｋｓであるが、：：ｂｏｏｋとｂｏｏｋ要素を指定しているので、／ａｎｃｅｓｔｏｒ：：ｂｏｏｋの指定する構造は／ｂｏｏｋｓ／ｂｏｏｋとなる。さらに、／ａｎｃｅｓｔｏｒ：：ｂｏｏｋの次のパスである／ｐｕｂｌｉｓｈｅｒについても、ここまでで特定された構造は／ｂｏｏｋｓ／ｂｏｏｋであるから、／ｂｏｏｋｓ／ｂｏｏｋの後に／ｐｕｂｌｉｓｈｅｒが続く構造があるかを構造ガイドデータで確認すると、存在するので、クエリデータＱ４に対する構造条件書き換え処理の結果として、／ｂｏｏｋｓ／ｂｏｏｋ／ｐｕｂｌｉｓｈｅｒが得られる。 For the / ancestor :: book part of the query data Q4, the path immediately before is / title. Here, referring to the structure guide data illustrated in FIG. 15, the structure having the title at the end is / books / book / title. Since / ancestor specifies an ancestor element, the structure specified by / ancestor is / books / book, / books, but because :: book and book elements are specified, the structure specified by / ancestor :: book Becomes / books / book. Furthermore, for / publisher, which is the next path of / ancestor :: book, since the structure specified so far is / books / book, whether there is a structure that follows / books / book or / publisher Since it exists when confirmed by the guide data, / books / book / publisher is obtained as a result of the structural condition rewriting process for the query data Q4.

図３８は、図３５に例示したクエリデータＱ４について、構造条件書き換え処理を行った結果の第１の部分クエリデータＱ４＿Ａを示す図である。図３５に例示したクエリデータＱ４は、構造条件書き換え処理により、／ｂｏｏｋｓ／ｂｏｏｋ／ｐｕｂｌｉｓｈｅｒといった、構造化文書データの階層構造を根から葉方向へ照合する構造条件に書き換えられる。 FIG. 38 is a diagram illustrating first partial query data Q4_A obtained as a result of performing the structural condition rewriting process on the query data Q4 illustrated in FIG. The query data Q4 illustrated in FIG. 35 is rewritten by a structural condition rewriting process into a structural condition such as / books / book / publisher that collates the hierarchical structure of structured document data from the root to the leaf direction.

構造条件書き換え部３１’によるクエリデータ解析処理が終了すると、第１乃至第３の実施形態と同様に、構造照合処理部２８により、第１の部分クエリデータに含まれる構造条件の構造照合処理が行われる（ステップＳ３）。 When the query data analysis processing by the structural condition rewriting unit 31 ′ is finished, the structural verification processing of the structural condition included in the first partial query data is performed by the structural verification processing unit 28, as in the first to third embodiments. Performed (step S3).

図５に例示した要素ＩＤ付与済み構造化文書データについて、図３８に例示した第１の部分クエリデータＱ４＿Ａによる構造照合処理を行った結果である構造照合処理結果データＲ４＿Ａを図３９に示す。図中のＴ１は／ｂｏｏｋｓ／ｂｏｏｋ／ｐｕｂｌｉｓｈｅｒという構造条件の構造照合処理の結果であり、このような構造を持つ要素の要素ＩＤ群［１］が得られる。 FIG. 39 shows structure matching process result data R4_A, which is the result of performing the structure matching process using the first partial query data Q4_A exemplified in FIG. 38 for the structured document data with element ID given in FIG. T1 in the figure is the result of the structure matching process under the structure condition of / books / book / publisher, and an element ID group [1] of elements having such a structure is obtained.

本実施形態では、この構造照合処理部２８による構造照合処理の結果がそのまま中間結果として検索インタフェース部２６に渡される。そして、最後に、検索インタフェース部２６により、構造照合処理部２８による構造照合処理の結果（中間結果）として得られる要素ＩＤが、それに対応する構造化文書データとして文字列化され、結果データとしてクライアント端末３に返却される（ステップＳ５）。図３９に示した中間結果Ｒ４＿Ａが得られた場合には、この中間結果Ｒ４＿Ａに含まれる要素ＩＤであるＥ１１，Ｅ２９に対応する構造化文書データが文字列化され、クエリデータＱ４の結果データＲ４として、図４０に示すデータがクライアント端末３に返却される。 In the present embodiment, the result of the structure matching process by the structure matching processing unit 28 is directly passed to the search interface unit 26 as an intermediate result. Finally, the element ID obtained as a result (intermediate result) of the structure matching processing by the structure matching processing unit 28 is converted into a character string as the corresponding structured document data by the search interface unit 26, and the result data is the client. Returned to the terminal 3 (step S5). When the intermediate result R4_A shown in FIG. 39 is obtained, the structured document data corresponding to the element IDs E11 and E29 included in the intermediate result R4_A is converted into a character string, and the result data R4 of the query data Q4 As shown in FIG. 40, the data shown in FIG.

以上、具体的な例を挙げながら説明したように、本実施形態によれば、入力クエリデータが、要素ＩＤで特定される要素の順序関係を指定する条件（第２条件）を含まず、構造化文書データの論理構造における階層を下位から上位へと辿る条件、つまり構造化文書データのリーフ要素（葉）からルート要素（根）方向に構造を照合するような条件を含む場合に、該条件を、構造化文書ＤＢ２１’に格納されている構造ガイドデータに基づいて、構造化文書データのルート要素（根）からリーフ要素（葉）方向に構造を照合するような条件に書き換える。そして、入力クエリデータを単純な構造条件の構造照合処理のみで処理するようにしている。したがって、入力クエリデータが構造化文書データのリーフ要素（葉）からルート要素（根）方向に構造を照合するような条件を含む複雑なものであっても、構造照合処理の高速化を実現し、複雑な構造条件を含むクエリデータによる検索を高速に実行することができる。 As described above, as described with specific examples, according to the present embodiment, the input query data does not include a condition (second condition) that specifies the order relationship of elements specified by the element ID, and the structure If there is a condition for tracing the hierarchy in the logical structure of structured document data from lower to higher, that is, a condition for collating the structure from the leaf element (leaf) to the root element (root) of the structured document data Is rewritten to a condition such that the structure is collated from the root element (root) of the structured document data to the leaf element (leaf) direction based on the structure guide data stored in the structured document DB 21 ′. Then, the input query data is processed only by the structure matching process with a simple structure condition. Therefore, even if the input query data is complex including conditions that match the structure from the leaf element (leaf) to the root element (root) of the structured document data, the structure matching process can be accelerated. Thus, it is possible to execute a search using query data including complicated structural conditions at high speed.

なお、上記の具体例では、入力クエリデータに含まれる、構造化文書データの論理構造における階層を下位から上位へと辿る条件（構造化文書データのリーフ要素（葉）からルート要素（根）方向に構造を照合するような条件）として「ａｎｃｅｓｔｏｒ」を例示したが、構造化文書データの論理構造における階層を下位から上位へと辿る他の条件、例えば「ｐａｒｅｎｔ」や「ａｎｃｅｓｔｏｒ−ｏｒ−ｓｅｌｆ」などが入力クエリデータに含まれる場合であっても、上記の具体例と同様に処理することができる。 In the above specific example, the condition for following the hierarchy in the logical structure of the structured document data included in the input query data from the lower order to the higher order (from the leaf element (leaf) to the root element (root) direction of the structured document data) “Ancestor” is exemplified as a condition for checking the structure in FIG. 5. However, other conditions for tracing the hierarchy in the logical structure of structured document data from the lower to the higher, such as “parent” and “ancestor-or-self”, are exemplified. Can be processed in the same manner as in the above specific example.

以上説明した第１乃至第４の実施形態におけるサーバ１、サーバ１’、サーバ１’’の機能は、例えば、コンピュータの演算装置であるＣＰＵ１０１が、アプリケーションプログラムとして実装された構造化文書管理プログラムを実行することにより実現される。 The functions of the server 1, server 1 ′, and server 1 ″ in the first to fourth embodiments described above are, for example, structured document management programs implemented as application programs by the CPU 101 that is a computing device of a computer. It is realized by executing.

第１乃至第４の実施形態におけるサーバ１、サーバ１’、サーバ１’’で実行される構造化文書管理プログラムは、例えば、インストール可能な形式又は実行可能な形式のファイルでＣＤ−ＲＯＭ、フレキシブルディスク（ＦＤ）、ＣＤ−Ｒ、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）などのコンピュータで読み取り可能な記憶媒体１１０に記録されて提供される。 The structured document management program executed by the server 1, the server 1 ′, and the server 1 ″ in the first to fourth embodiments is, for example, a CD-ROM, a flexible file in an installable format or an executable format file. The program is recorded on a computer-readable storage medium 110 such as a disc (FD), CD-R, or DVD (Digital Versatile Disc).

また、第１乃至第４の実施形態におけるサーバ１、サーバ１’、サーバ１’’で実行される構造化文書管理プログラムを、インターネットなどのネットワーク２に接続されたコンピュータ上に格納し、ネットワーク２経由でダウンロードさせることにより提供するように構成してもよい。また、第１乃至第４の実施形態におけるサーバ１、サーバ１’、サーバ１’’で実行される構造化文書管理プログラムを、インターネットなどのネットワーク２経由で提供または配布するように構成してもよい。さらに、第１乃至第４の実施形態におけるサーバ１、サーバ１’、サーバ１’’で実行される構造化文書管理プログラムを、ＲＯＭ１０２などに予め組み込んで提供するように構成してもよい。 Further, the structured document management program executed by the server 1, the server 1 ′, and the server 1 ″ in the first to fourth embodiments is stored on a computer connected to the network 2 such as the Internet, and the network 2 You may comprise so that it may provide by making it download via. In addition, the structured document management program executed by the server 1, the server 1 ′, and the server 1 ″ in the first to fourth embodiments may be configured to be provided or distributed via the network 2 such as the Internet. Good. Further, the structured document management program executed by the server 1, the server 1 ′, and the server 1 ″ in the first to fourth embodiments may be provided by being incorporated in the ROM 102 in advance.

第１乃至第４の実施形態におけるサーバ１、サーバ１’、サーバ１’’で実行される構造化文書管理プログラムは、格納インタフェース部２４、要素ＩＤ付与部２５、検索インタフェース部２６、クエリデータ分解部２７、構造照合処理部２８、結合演算処理部２９、構造解析部３０、構造条件書き換え部３１，３１’などを含むモジュール構成となっており、実際のハードウェアとしてはＣＰＵ（プロセッサ）１０１がＨＤＤ１０４などから構造化文書管理プログラムを読み出して実行することにより上記各部が主記憶装置（例えばＲＡＭ１０３）上にロードされ、格納インタフェース部２４、要素ＩＤ付与部２５、検索インタフェース部２６、クエリデータ分解部２７、構造照合処理部２８、結合演算処理部２９、構造解析部３０、構造条件書き換え部３１，３１’などが主記憶装置上に生成されるようになっている。 The structured document management program executed by the server 1, the server 1 ′, and the server 1 ″ in the first to fourth embodiments includes a storage interface unit 24, an element ID assigning unit 25, a search interface unit 26, and query data decomposition. Unit 27, structure collation processing unit 28, join operation processing unit 29, structure analysis unit 30, structural condition rewriting units 31, 31 ′, etc., and the actual hardware includes CPU (processor) 101. By reading and executing the structured document management program from the HDD 104 or the like, the above units are loaded onto the main storage device (for example, the RAM 103), and the storage interface unit 24, the element ID adding unit 25, the search interface unit 26, and the query data decomposing unit. 27, structure verification processing unit 28, join operation processing unit 29, structure analysis unit 30, structure Such matter rewriter 31, 31 'is adapted to be generated on the main memory.

以上述べた少なくとも一つの実施形態にかかる構造化文書管理システムによれば、入力クエリデータを単純な構造条件に変えて構造照合処理を実行するようにしているので、入力クエリデータが複雑な構造条件を含む場合でも、構造照合処理の高速化を実現し、複雑な構造条件を含むクエリデータによる検索を高速に実行することができる。 According to the structured document management system according to at least one embodiment described above, the input query data is changed to a simple structure condition and the structure matching process is executed. Even in the case of including, it is possible to speed up the structure matching process, and to execute the search by the query data including the complicated structure condition at high speed.

なお、本発明のいくつかの実施形態を説明したが、これらの実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、請求の範囲に記載された発明とその均等の範囲に含まれる。 In addition, although some embodiment of this invention was described, these embodiment is shown as an example and is not intending limiting the range of invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the scope of the invention. These embodiments and modifications thereof are included in the scope and gist of the invention, and are included in the invention described in the claims and the equivalents thereof.

１，１’，１’’サーバ
２４格納インタフェース部
２５要素ＩＤ付与部
２６検索インタフェース部
２７クエリデータ分解部
２８構造照合処理部
２９結合演算処理部
３０構造解析部
３１，３１’構造条件書き換え部 1, 1 ′, 1 ″ server 24 Storage interface unit 25 Element ID assigning unit 26 Search interface unit 27 Query data decomposition unit 28 Structure collation processing unit 29 Join operation processing unit 30 Structure analysis unit 31, 31 ′ structure condition rewriting unit

Claims

Structured document data receiving means for receiving input of structured document data having a hierarchical logical structure;
Identifier assigning means for assigning to the element appearing in the structured document data input an identifier whose appearance order in the structured document data can be compared between the elements;
Structured document data storage means for storing the structured document data in which the identifier is assigned to the element;
Query data receiving means for receiving input of query data;
When the input query data includes a first condition that specifies the hierarchical relationship of the hierarchy in the logical structure of the structured document data, and a second condition that specifies the order relationship of the elements specified by the identifier , A second partial query including a procedure for combining the query data with the first partial query data including only the first condition and a matching result based on the first partial query data according to the second condition Query data decomposition means for decomposing data,
Structure collation processing means for performing collation with the first partial query data for the data set of the structured document data stored in the structured document data storage means, and outputting a collation result;
A join operation processing means for performing a join operation processing on the collation result according to a join operation procedure included in the second partial query data;
A structured document management apparatus comprising:

Structure guide data storage means for storing structure guide data that is information obtained by aggregating the hierarchical logical structures of the structured document data stored in the structured document data storage means;
Structure guide data updating means for updating the structure guide data so that the hierarchical logical structure of the input structured document data is reflected in the structure guide data;
Condition rewriting means for rewriting the condition to a condition for tracing the hierarchy from the top to the bottom based on the structure guide data when the first partial query data includes a condition for tracing the hierarchy from the bottom to the top. The structured document management apparatus according to claim 1, further comprising:

Structured document data receiving means for receiving input of structured document data having a hierarchical logical structure;
Identifier assigning means for assigning to the element appearing in the structured document data input an identifier whose appearance order in the structured document data can be compared between the elements;
Structured document data storage means for storing the structured document data in which the identifier is assigned to the element;
Structure guide data storage means for storing structure guide data that is information obtained by aggregating the hierarchical logical structures of the structured document data stored in the structured document data storage means;
Structure guide data updating means for updating the structure guide data so that the hierarchical logical structure of the input structured document data is reflected in the structure guide data;
Query data receiving means for receiving input of query data;
When the input query data includes a condition for tracing the hierarchy in the logical structure of the structured document data from the lower level to the upper level, the condition is traced from the upper level to the lower level based on the structure guide data. Condition rewriting means for rewriting conditions,
A structural collation processing unit that collates the data set of the structured document data stored in the structured document data storage unit according to the condition rewritten by the condition rewriting unit and outputs a collation result. A featured structured document management device.

Receiving input of structured document data having a hierarchical logical structure;
A step of assigning to an element appearing in the input structured document data an identifier whose appearance order in the structured document data is comparable between the elements and storing the identifier in a storage device;
Accepting query data input,
When the input query data includes a first condition that specifies the hierarchical relationship of the hierarchy in the logical structure of the structured document data, and a second condition that specifies the order relationship of the elements specified by the identifier , A second partial query including a procedure for combining the query data with the first partial query data including only the first condition and a matching result based on the first partial query data according to the second condition Breaking it into data,
Collating the first partial query data with respect to the data set of the structured document data stored in the storage device, and outputting a collation result;
A structured document management method comprising: performing a join operation process on the collation result according to a join operation procedure included in the second partial query data.

On the computer,
A function for receiving input of structured document data having a hierarchical logical structure;
A function of assigning to an element appearing in the input structured document data an identifier in which the order of appearance in the structured document data can be compared between the elements and storing it in a storage device;
A function that accepts input of query data,
When the input query data includes a first condition that specifies the hierarchical relationship of the hierarchy in the logical structure of the structured document data, and a second condition that specifies the order relationship of the elements specified by the identifier , A second partial query including a procedure for combining the query data with the first partial query data including only the first condition and a matching result based on the first partial query data according to the second condition The ability to break it down into data,
A function of collating the first partial query data against the data set of the structured document data stored in the storage device and outputting a collation result;
A structured document management program that realizes a function of processing the collation result in accordance with a join operation procedure included in the second partial query data.