JP2013003695A - Distributed database retrieval device, distributed database retrieval method and program

Info

Publication number: JP2013003695A
Application number: JP2011131854A
Authority: JP
Inventors: Yosuke Kuroda; 洋介黒田
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2011-06-14
Filing date: 2011-06-14
Publication date: 2013-01-07
Anticipated expiration: 2031-06-14
Also published as: CN102831138B; CN102831138A; JP5318155B2

Abstract

PROBLEM TO BE SOLVED: To provide a distributed database retrieval device that realizes efficient search without complicating the mechanism on the master server side. SOLUTION: A distributed database retrieval device is configured by connecting a plurality of slave servers, each having a database, to a master server that searches the databases based on an input query. Each slave server comprises: a second transmitter-receiver that exchanges data with the master server; a local plan candidate generation part that generates local plan candidates based on a received distributed plan; and a local plan selection part that determines, from the generated local plan candidates, the local plan with the lowest computing cost.

Description

Embodiments described herein relate generally to a distributed database search device, a distributed database search method, and a program.

Distributed database search devices composed of a plurality of servers exist to handle large amounts of data in formats such as tables and XML. A distributed database search device usually consists of a master server that interacts with users and slave servers that actually manage the data.

The slave servers may all be database search devices of the same architecture, or they may be database search devices of different architectures.

In general, when a search expression (hereinafter referred to as a query) is input to a distributed database search device, the master server receives it. The master server analyzes the query and divides it into parts that each slave server executes internally and parts that must be computed across servers. For the parts executed within a slave server, each slave server generates an optimal local plan. For the parts computed across servers, the master server generates an optimal distributed plan. Here, a local plan is a plan by which a slave server searches the data it holds, and a distributed plan is a plan for searching the entire data held by the target distributed database.

When a distributed plan is generated, the order of the operations and the servers that execute them are determined so that the search response time is minimized. These operations include join operations on data across servers such as JOIN, set operations over multiple servers such as SORT, and join operations on the divided partial queries. In addition, the method and format for transferring data to the server that executes each operation are determined.

For distributed databases, there is a demand to improve search performance by strengthening the optimization of distributed plans, thereby reducing the cost of transferring data between databases and the cost of computing on data across servers.

Conventionally, optimization of the distributed plan has been performed entirely by the master server. However, having the master server determine the entire distributed plan raises many problems.

First, as noted above, a distributed plan covers a very wide range of decisions: the ranges into which the query is divided, the order and execution location of inter-server operations, the method of joining the divided queries, and so on. Many candidate plans therefore arise, and a large amount of information is needed to find the optimal plan among them. The master server must therefore closely acquire, maintain, and manage indexes, statistical information, and the like from each slave server. As a result, the mechanism of the master server is complicated and incurs a high management cost.

Moreover, even if the master server obtains all the information it wants, the optimal behavior may differ for each slave server when statistics differ greatly between slave servers or when their architectures differ. In such cases, a distributed plan that is uniform across all slave servers may let the execution speed of some slave servers become a bottleneck and degrade overall performance. However, generating a distributed plan that allows each slave server to operate optimally complicates the master server's mechanism for generating distributed plans. In other words, improving the optimization function of the master server's distributed plan processing unit complicates the master server's optimization mechanism. It is therefore difficult for the master server to generate a distributed plan suited to the state of each slave server.

JP 2001-331485 A
Japanese Patent Application Laid-Open No. 07-141399

The problem to be solved by the present invention is to provide a distributed database search device that realizes efficient search without complicating the mechanism on the master server side.

The distributed database search device of the embodiment is configured by connecting a master server, which searches databases based on an input query, and a plurality of slave servers, each of which has a database. Each slave server includes a second transmission/reception unit that exchanges data with the master server, a local plan candidate generation unit that generates local plan candidates based on a received distributed plan, and a local plan selection unit that determines, based on the generated local plan candidates, the local plan with the lowest calculation cost.

An example of the overall configuration of the distributed database search device according to the first embodiment.
A schematic diagram showing an example of XML data, which is one kind of data registered in the database according to the first embodiment.
A schematic diagram showing an example of XML data, which is one kind of data registered in the database according to the first embodiment.
A schematic diagram showing an example of the database information held by a slave server according to the first embodiment.
A schematic diagram showing an example of the slave server group information held by the master server according to the first embodiment.
A flowchart showing an example of the distributed database search process according to the first embodiment.
A schematic diagram showing an example of XQuery, a query language for XML, according to the first embodiment.
A diagram showing an example of the partial queries generated by the query division unit according to the first embodiment.
A diagram showing an example of the distributed plan generated by the distributed plan generation unit according to the first embodiment.
A flowchart showing an example of the distributed plan modification process performed by the divided query join operation addition unit according to the first embodiment.
A schematic diagram showing an example of a distributed plan on which the divided query join operation addition process according to the first embodiment has been performed.
A schematic diagram showing an example of the local plan generated by the local plan selection unit according to the first embodiment.
A flowchart showing an example of the local plan candidate generation process according to the first embodiment.
A flowchart showing an example of the local plan candidate generation process according to the first embodiment.
A flowchart showing an example of the local plan candidate generation process according to the first embodiment.
A schematic diagram showing an example of a local plan generated by the local plan candidate generation unit according to the first embodiment.
A schematic diagram showing an example of a local plan generated by the local plan candidate generation unit according to the first embodiment.
A schematic diagram showing an example of a local plan generated by the local plan candidate generation unit according to the first embodiment.
A schematic diagram showing an example of a local plan generated by the local plan candidate generation unit according to the first embodiment.
A schematic diagram showing an example of a local plan generated by the local plan candidate generation unit according to the first embodiment.
A schematic diagram showing an example of a local plan generated by the local plan candidate generation unit according to the first embodiment.
A flowchart showing an example of the parameters used when the local plan candidate generation unit according to the first embodiment calculates the calculation cost.
A flowchart showing an example of the process by which the distributed plan update unit according to the first embodiment updates the distributed plan.
A schematic diagram showing an example of the distributed plan updated by the distributed plan update unit according to the first embodiment.
An example of the overall configuration of the distributed database search device according to the second embodiment.
A schematic diagram showing an example of a schema before it is changed by the schema change unit according to the second embodiment.
A flowchart showing an example of the distributed database search process according to the second embodiment.
A schematic diagram showing an example of a local plan on which the local plan order determination process according to the second embodiment has been performed.
A flowchart showing an example of the process by which the schema generation unit according to the second embodiment generates the schema of a distributed plan.
A schematic diagram showing an example of a new schema changed by the schema change unit according to the second embodiment.
A flowchart showing an example of the process by which the schema change unit according to the second embodiment updates the schema of the data input to the operations of a distributed plan.
A diagram showing an example of the input/output schema obtained as a result of the schema generation process performed by the schema change unit according to the second embodiment.

Hereinafter, a distributed database search device according to an embodiment will be described with reference to the drawings.

(First embodiment)
FIG. 1 is a configuration diagram showing the functional configuration of the distributed database search device according to the first embodiment. The distributed database search device of this embodiment performs a search based on a search expression input by the user (hereinafter referred to as an input query) and outputs the search result.

As shown in FIG. 1, the distributed database search device of this embodiment is configured by connecting a computer 0, which functions as the master server, and computers 1 to N, which function as slave servers.

The computer 0, which is the master server, includes a syntax analysis unit 11, a query division unit 12, a distributed plan generation unit 13, a divided query join operation addition unit 14, a distributed plan update unit 15, a distributed plan execution unit 16, a transmission/reception unit 17 (first transmission/reception unit), and an information storage unit 20 (first storage device).

The information storage unit 20 includes a slave server group information storage unit 21 that stores slave server group information and a distributed plan storage unit 22 that stores the distributed plan described later. The slave server group information is information about the databases held by the slave servers, such as the names and locations of all slave servers and their registration counts, and is a part of the database information held by the slave servers. Based on this slave server group information, the master server determines which data to send to the slave servers.

The syntax analysis unit 11 parses the input query 51 given by the user.

The query division unit 12 has the function of dividing the input query 51. Based on the result of parsing the input query 51 by the syntax analysis unit 11 and on the slave server group information table 21, it divides the input query 51 into units of intra-server operations and inter-server operations. Each piece of the divided input query 51 is called a partial query. An intra-server operation is processing of the input query 51 within an individual slave server. An inter-server operation is processing in which data is collected from a plurality of slave servers and computed within the master server.

The distributed plan generation unit 13 functions as distributed plan generation means and generates a distributed plan based on the partial queries obtained by the query division unit 12 and the information in the slave server group information table 21. Specifically, based on the names, locations, registration counts, and the like of the slave servers stored in the slave server group information storage unit 21, it generates a distributed plan that fixes, for the divided intra-server and inter-server operations, the order of operations, where each operation is executed, and the content of the data exchanged between servers. The content of the data exchanged between servers is, for example, the data of a slave server, a part of that data, the result of an operation on a slave server's data such as an average value, or the result of an inter-server operation such as a maximum value or a combination of data from multiple slave servers.

The distributed plan generation unit 13 includes the divided query join operation addition unit 14 and the distributed plan update unit 15.

The divided query join operation addition unit 14 modifies the distributed plan obtained by the distributed plan generation unit 13 according to the divided query join operation addition process shown in the flowchart of FIG. 10. Hereinafter, the modified plan is referred to as the modified distributed plan. The divided query join operation addition process will be described later.

The distributed plan update unit 15 functions as distributed plan update means and performs a distributed plan update process, which updates the modified distributed plan based on the local plans sent from the local plan selection units of the slave servers. The distributed plan update process will be described later.

The distributed plan execution unit 16 executes operations based on the distributed plan updated by the distributed plan update unit 15.

The transmission/reception unit 17 functions as data transmission/reception means.

Next, the computers 1 to N, which are the slave servers, will be described.

Each of the computers 1 to N functions as a slave server and includes a local plan selection unit 31, a local plan candidate generation unit 32, a local plan execution unit 33, a transmission/reception unit 34 (second transmission/reception unit), and an information storage unit 40 (second storage device).

The information storage unit 40 includes a database information storage unit 41 that holds database information, that is, information about the data stored in the database such as schema information and statistical information (for example, record counts); a local plan storage unit 42 that holds local plans; and a stored data storage unit 43 that holds the actual data to be searched (hereinafter referred to as stored data).

The local plan selection unit 31 functions as local plan generation means and includes the local plan candidate generation unit 32. From the modified distributed plan obtained by the divided query join operation addition unit 14, the local plan selection unit 31 generates a local plan consisting of the partial plans that concern its own server. Based on this local plan, the local plan candidate generation unit 32 creates further local plan candidates. For the local plan it created itself and for each candidate created by the local plan candidate generation unit 32, the local plan selection unit 31 calculates a calculation cost consisting of an estimated execution time or an estimated amount of computation. It then determines the plan with the lowest calculated cost as the local plan.
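The selection step described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `PlanCandidate` class, the plan names, and the cost values are all hypothetical, and the cost is assumed to be a single number such as an estimated execution time.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class PlanCandidate:
    """A local plan with its estimated cost (hypothetical structure)."""
    name: str
    estimated_cost: float  # assumed: estimated execution time or computation amount


def select_local_plan(initial: PlanCandidate,
                      candidates: list[PlanCandidate]) -> PlanCandidate:
    """Return the lowest-cost plan among the initial plan and the candidates."""
    return min([initial, *candidates], key=lambda plan: plan.estimated_cost)


# Hypothetical plans and costs, for illustration only.
initial_plan = PlanCandidate("initial", 120.0)
alternatives = [
    PlanCandidate("use-numeric-index", 35.0),
    PlanCandidate("full-scan", 200.0),
]
chosen = select_local_plan(initial_plan, alternatives)
print(chosen.name)
```

Including the initial plan in the comparison mirrors the description: the plan created by the selection unit itself competes with the generated candidates, so it is kept whenever no candidate is cheaper.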

The local plan execution unit 33 executes operations based on the local plan obtained by the local plan selection unit 31. The transmission/reception unit 34 has the same function as the transmission/reception unit 17 of the master server.

FIGS. 2 and 3 show examples of the data registered in the stored data storage unit 43 of a slave server. In this embodiment, the data registered in the stored data storage unit 43 is described in XML format. The data shown in FIG. 2 concerns the publication year, title, author, and price of books. The data shown in FIG. 3 concerns the award year of a certain prize, the winner, the title of the winning book, and the gender of the winner.

FIG. 4 shows an example of the database information held by the database information storage unit 41 of a slave server. As shown in FIG. 4, the database information is held as a database information table 111 having the items "registered node" 112, "registration count" 113, and "index information" 114.

"Registered node" 112 indicates the name of a node contained in the XML data registered in the stored data storage unit 43 of the slave server; here the name is written together with the node under which it appears. A node is an element, attribute, or the like that makes up the XML data.

"Registration count" 113 indicates the number of times each registered node 112 appears in the XML data registered in the stored data storage unit 43 of the slave server.

"Index information" 114 describes the type of index set for the registered node 112. The index type is, for example, a numeric index or a character index. FIG. 4 shows, as an example, the database information tables 111-1 to 111-4 of the computers 1 to 4.

FIG. 5 shows an example of the slave server group information 21 stored in the slave server group information storage unit 21 held by the computer 0, which is the master server. As shown in FIG. 5, the slave server group information is held as a slave server group information table 121 having the items "server name" 122, "Collection information" 123, and "registered document count" 124.

"Server name" 122 stores the name of a slave server. "Collection information" 123 stores the name of the location where the registered XML data is stored (hereinafter referred to as the Collection name). Because the distributed database of this embodiment allows different slave servers to have the same Collection information 123, the user can search within a specific set of XML data by specifying a Collection name. "Registered document count" 124 stores the number of XML data items registered in the Collection.

The processing of the distributed database search device of this embodiment will now be described with reference to FIGS. 6 to 25. FIG. 6 is a flowchart showing an example of the search process of the distributed database search device of this embodiment. It is assumed that the database information shown in FIG. 4 is stored in the database information storage units 41 of the computers 1 to 4.

First, the user inputs an input query 51, which is a search expression, to the master server (step S1).

FIG. 7 shows an example of the input query 51 entered by the user. The input query 51 shown in FIG. 7 is written in XQuery, a query language for XML data. The input query 51 shown in FIG. 7 means "Among the books written by male authors who have previously won the prize, output the title and price of those published in or after 1990."

The statement beginning with for on line 1 of the input query 51 takes, among the nodes named book registered in the Collection "publisher", those whose attribute node year has a numeric value of 1990 or greater, and stores them in the variable $x. This obtains the list of books published in or after 1990. In XQuery, a variable is written as a character string beginning with "$".

Next, the statement beginning with for on line 2 selects, among the nodes named prizeWinner registered in the Collection "prizeWinners", those whose child node gender, converted to a string, equals the string "male", and then stores their child node name in the variable $y. This obtains the list of names of male authors who have won the prize. The statements beginning with let on lines 3, 4, and 5 store the author, title, and price nodes, which are child nodes of the book obtained on line 1, in the variables $z, $u, and $v, respectively. This obtains the author name, title, and price of each book. The statement beginning with where on line 6 obtains the combinations in which the name of a male author matches the author name of a book. Finally, the statement beginning with return on line 7 builds, for each combination obtained on line 6, XML enclosed in a node named List and returns it to the user. This obtains the titles and prices of the books that satisfy the conditions.
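To make the meaning of the query concrete, the following Python sketch mirrors the seven lines described above on in-memory data. It is only an illustration of the query semantics, not the XQuery of FIG. 7 itself: the sample records, the dictionary layout, and the function name `titles_and_prices` are assumptions introduced here.

```python
# Hypothetical sample data standing in for the two Collections.
books = [
    {"year": 1995, "title": "Book A", "author": "Author One", "price": 30.0},
    {"year": 1985, "title": "Book B", "author": "Author One", "price": 25.0},
    {"year": 2001, "title": "Book C", "author": "Author Two", "price": 40.0},
]
prize_winners = [
    {"name": "Author One", "gender": "male"},
    {"name": "Author Two", "gender": "female"},
]


def titles_and_prices(books, prize_winners, min_year=1990):
    """Mirror the described query: filter, filter, join, project."""
    # Line 1 ($x): books published in or after min_year.
    recent_books = [b for b in books if b["year"] >= min_year]
    # Line 2 ($y): names of male prize winners.
    male_names = [w["name"] for w in prize_winners if w["gender"] == "male"]
    # Lines 3-5 ($z, $u, $v) read author, title, and price from each book;
    # line 6 keeps the pairs whose names match; line 7 returns title and price.
    return [
        {"title": b["title"], "price": b["price"]}
        for b in recent_books
        for name in male_names
        if b["author"] == name
    ]


result = titles_and_prices(books, prize_winners)
print(result)
```

Only the comparison on line 6 needs values from both data sets at once, which is why that step becomes the cross-server part of the plan when the two Collections live on different computers.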

When the input query 51 shown in FIG. 7 is input to the master server, the syntax analysis unit 11 of the master server parses the input query 51 (step S2). The parsing result is sent to the query division unit 12 of the master server.

Having received the parsing result, the query division unit 12 performs a division process on the input query 51 based on the information in the slave server group information table 21. The process divides the query into partial queries in units of intra-server operations, which are processed within each slave server, and inter-server operations, which collect data from a plurality of slave servers and compute on it (step S3).

That is, the query division unit 12 divides the input query 51 into partial queries based on the parsing result of the syntax analysis unit 11. Referring to the slave server group information table 121, the query division unit 12 determines, for each partial query, whether its content is an inter-server operation or an intra-server operation. FIG. 8 shows an example of a partial query list table 131, which lists the partial queries produced by the query division unit 12.

The partial query list table 131 shown in FIG. 8 has the following items: "number" 132, which stores the number assigned sequentially to each partial query; "partial query content" 133, which stores the partial query obtained by the division; "inter-server/intra-server operation" 134, which stores whether the partial query is an intra-server operation or an inter-server operation; and "corresponding server" 135, which stores the names of the computers that hold the data needed for the operation.

The query division process performed by the query dividing unit 12 is described concretely below with reference to FIGS. 7 and 8.

Based on the slave server group information table 121, the query dividing unit 12 identifies the computer that holds the data used by each partial query. In FIG. 8, attention is focused on the two pieces of Collection information in the query 51 of FIG. 7, Collection("publisher") and Collection("prizeWinner"). That is, the slave server group information table 121 shown in FIG. 5 is searched using these two pieces of Collection information.

As a result, the query dividing unit 12 determines that Collection("publisher") resides on computers 1 to 3 and that Collection("prizeWinner") resides on computer 4.

The query dividing unit 12 then looks at the XQuery operations applied to each Collection, such as "/", "//", ">=", and "=", and determines whether each operation needs values from multiple different computers. An operation that needs values from multiple different computers is classified as an inter-server operation. An operation that uses values from a single computer is classified as an intra-server operation.

As shown in FIG. 8, the operations "/", "//", and ">= 1990" in the query 51 can all be carried out on the computer that holds their input data, so the data computed for $x, $z, $u, and $v is all stored on that same computer. Intra-server operations are divided into several operation units. In FIG. 8 the division is made at each point where a binding clause such as "for" or "let" occurs, which yields the partial queries numbered 1 to 5. Dividing at "for" and "let" clauses is only one example; in practice the division may use finer or coarser operation units.

In contrast, in the operation "where $y = $z", $y is data on computer 4 and $z is data on computers 1 to 3, so the operation is classified as an inter-server operation.

The next operation, "return <List>{$u}{$v}</List>", returns the final result. The query dividing unit 12 therefore determines that it requires an inter-server operation that collects the data from computers 1 to 3, which hold $u and $v, and computes on it.
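The classification rule described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the `COLLECTION_HOSTS` dictionary is a hypothetical stand-in for the slave server group information table 121, and the function name `classify` and the `gathers_results` flag are introduced here and do not appear in the embodiment.

```python
# Hypothetical stand-in for the slave server group information table 121:
# which computers hold each Collection.
COLLECTION_HOSTS = {
    "publisher": frozenset({"computer1", "computer2", "computer3"}),
    "prizeWinner": frozenset({"computer4"}),
}


def classify(collections, gathers_results=False):
    """Classify one operation as 'intra-server' or 'inter-server'.

    collections     -- non-empty list of Collection names the operands come from
    gathers_results -- True when the operation must combine data held on
                       different computers (for example the final return)
    """
    host_sets = {COLLECTION_HOSTS[name] for name in collections}
    if len(host_sets) > 1:
        # Operands live on different computers, e.g. "where $y = $z".
        return "inter-server"
    (hosts,) = host_sets
    if gathers_results and len(hosts) > 1:
        # One Collection spread over several computers has to be merged.
        return "inter-server"
    return "intra-server"
```

With this sketch, a path step on Collection("publisher") is intra-server, the comparison of $y and $z is inter-server, and the final return over computers 1 to 3 is inter-server because it gathers results.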

Next, the distributed plan generation unit 13 generates a distributed plan based on the partial query list table 131 of FIG. 8 and the slave server group information table 121 of FIG. 5 (step S4). The generated distributed plan is stored in the distributed plan table 141.

FIG. 9 shows an example of the distributed plan table 141.

The distributed plan table 141 has the items "operation number" 142, "partial query number" 143, "operation content" 144, "pre-execution operation number" 145, "execution location" 146, "transmission location" 147, "input variable" 148, and "output variable" 149.

"Operation number" 142 is the number assigned to each operation. "Partial query number" 143 stores the number assigned under the "number" 132 item of the partial query list table 131 shown in FIG. 8. If no such number has been assigned, the field is left blank.

"Operation content" 144 stores the content of each operation. The distributed plan generation unit 13 carries over each intra-server operation in the partial query list table 131 of FIG. 8 as an intra-server operation without change, and for each inter-server operation in that table it writes operation content that names the concrete operation. In addition, data transmission and data reception operations are needed before and after each inter-server operation in the partial query list table 131, so these are newly added.

"Pre-execution operation number" 145 stores the number of any operation that must be executed before this operation. "Execution location" 146 stores the location (computer) where the operation is executed. For data transmission and data reception operations, the destination of the result data is stored in "transmission location" 147. That is, the execution location 146 is the computer that sends the data, and the transmission location 147 is the computer that receives it.

"Input variable" 148 is filled in when the operation needs input data, and stores the list of names of the variables that hold that data. "Output variable" 149 stores, when the operation creates new values, the list of names of the variables in which they are stored.
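One way to picture a row of the distributed plan table 141 is as a small record with one field per item. The sketch below is a hypothetical Python representation: the class name, field names, and the two sample rows are assumptions made for illustration and do not reproduce the actual contents of FIG. 9.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class PlanOperation:
    number: int                                   # "operation number" 142
    partial_query: Optional[int]                  # "partial query number" 143, None if blank
    content: str                                  # "operation content" 144
    prerequisites: List[int] = field(default_factory=list)   # "pre-execution operation number" 145
    execution_location: str = ""                  # "execution location" 146
    transmission_location: Optional[str] = None   # "transmission location" 147
    inputs: List[str] = field(default_factory=list)          # "input variable" 148
    outputs: List[str] = field(default_factory=list)         # "output variable" 149


# Illustrative rows only (hypothetical values, not copied from FIG. 9).
send = PlanOperation(6, None, "data transmission", [5], "slave", "master", ["$z"], ["$z"])
recv = PlanOperation(7, None, "data reception", [6], "master", None, ["$z"], ["$z"])
```

Keeping the prerequisites as a list of operation numbers makes it straightforward to follow the dependency chain between rows.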

Operation numbers 1 to 5 in the distributed plan table 141 of FIG. 9 correspond to numbers 1 to 5 in the partial query list table 131 of FIG. 8. Operation numbers 10 and 11 in the distributed plan table 141 correspond to numbers 6 and 7 in the partial query list table 131.

For partial query number 6, which the partial query list table 131 marks as "inter-server operation" in item 134, the operation content 144 stores "inter-server JOIN". That is, the concrete operation derived from the partial query content 133 is stored. Similarly, for partial query number 7 the operation content 144 stores the concrete operation "return data creation". An "inter-server JOIN" is an operation that keeps only the combinations with equal values among the data stored in two variables, and "return data creation" is an operation that builds new XML data from the input data.

The distributed plan generation unit 13 looks only at the inter-server operations when deciding the execution order and execution locations. Specifically, in the distributed plan table 141 of FIG. 9 the inter-server operations are of two kinds, inter-server JOIN and return data creation. The unit judges that running return data creation after the inter-server JOIN is more efficient, because less data remains to be processed.

In this embodiment, the execution location is decided on the premise that inter-server operations are performed on the master server. To execute an inter-server operation on the master server, the data held by each slave server must first be collected. The distributed plan generation unit 13 therefore adds operation numbers 6 and 8 to the distributed plan table 141 and stores "data transmission", the operation in which a slave server sends data to the master server, in the operation content 144 of each. Then, so that the master server can receive the data sent in operations 6 and 8, it adds operation numbers 7 and 9 and stores "data reception", the operation in which the master server receives data from a slave server, in the operation content 144 of each.

Although this embodiment assigns inter-server operations to the master server, they may instead be assigned elsewhere, for example to multiple slave servers.

The distributed plan generation unit 13 fills in the input variables 148 and output variables 149 based on the partial query content 133 shown in FIG. 8. For example, the variable written in the "~" position of "for ~ in" or "let ~ :=" is an output variable 149, and a variable written anywhere else is an input variable 148. In a data transmission operation, the input variables 148 become the output variables 149 unchanged. In a data reception operation, the output variables 149 of the corresponding data transmission operation become both the input variables 148 and the output variables 149.
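The rule for separating output variables from input variables can be sketched with two regular expressions. This is a minimal Python illustration, not the embodiment's implementation: the function name `split_variables` is an assumption, the sample strings are hypothetical partial queries rather than the actual text of FIG. 7, and the pattern only recognizes a single "for ... in" or "let ... :=" binding at the start of a partial query.

```python
import re

VARIABLE = re.compile(r"\$\w+")
# Matches the bound variable of a leading "for $v in" or "let $v :=" clause.
BINDING = re.compile(r"^\s*(?:for\s+(\$\w+)\s+in|let\s+(\$\w+)\s*:=)")


def split_variables(partial_query):
    """Return (inputs, outputs) for one partial query string."""
    match = BINDING.match(partial_query)
    outputs = [name for name in (match.groups() if match else ()) if name]
    inputs = []
    for name in VARIABLE.findall(partial_query):
        # Every variable that is not bound here counts as an input.
        if name not in outputs and name not in inputs:
            inputs.append(name)
    return inputs, outputs
```

For instance, `split_variables("let $z := $x/author")` yields `$x` as the input and `$z` as the output, while a "where" clause has inputs only.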

The distributed plan generation unit 13 places the intra-server operations in any order that preserves their ordering dependencies. In this way, the distributed plan generation unit 13 of this embodiment examines only the "inter-server operations" in the partial query list table 131. Because this limits the range to be examined, the distributed plan generation unit 13 can generate a distributed plan easily.

Next, the split query join operation adding unit 14 performs a split query join operation addition process, which adds "split query join" operations to the distributed plan generated by the distributed plan generation unit 13 and thereby modifies the plan (step S5).

The operation of the split query join operation adding unit 14 when it performs this process on the distributed plan stored in the distributed plan table 141 of FIG. 9 is described concretely below with reference to FIG. 10.

The split query join operation addition process uses two variables, i and j. The variable i is an integer of at least 1 and at most the largest operation number in the target distributed plan (1 ≤ i ≤ maximum operation number of the distributed plan). At the start of the process, i = 1. The maximum operation number of the distributed plan is called "max". The operation whose operation number 142 is i is called operation E. The variable j has the same properties as i, and the operation whose operation number 142 is j is called operation S.

On receiving the distributed plan from the distributed plan generation unit 13, the split query join operation adding unit 14 first initializes i to 1 and max to the maximum operation number of the distributed plan (step S10).

The split query join operation adding unit 14 fetches operation E, the operation with operation number i, from the distributed plan table 141 of FIG. 9 (step S20). It then determines whether operation E is an inter-server operation and i ≠ max (step S30).

If operation E is an inter-server operation and i ≠ max, that is, operation E is not the last operation (Yes in step S30), j is set to 1 (step S40).

Next, operation S, the operation with operation number j, is fetched from the distributed plan table 141 (step S50). Referring to the distributed plan table 141 of FIG. 9, the split query join operation adding unit 14 determines whether the operation content 144 of operation S is data transmission and the computer stored in the transmission location 147 is a slave server (step S60).

If the operation content 144 of operation S is "data transmission" and the computer at the transmission location 147 is a slave server (Yes in step S60), the split query join operation adding unit 14 creates a list (hereinafter varList) of the variables that appear in both the input variables 148 of operation S and the input variables 148 of operation E (step S70). It then creates a list (hereinafter newVarList) of the input variables 148 of operation S with the variables in varList removed (step S80). Finally, it checks three conditions: that varList is not empty, that the variables in varList are not identical to the input variables 148 of operation S, and that the operation that outputs the varList variables and the operation that outputs the newVarList variables can be executed in parallel (step S90).
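Steps S70 and S80, together with the first two conditions of step S90, amount to simple list operations. The following Python sketch shows them under the assumption that variables are plain strings; the function names are hypothetical, and the parallel-execution condition of step S90 is deliberately left out here.

```python
def split_transmitted_variables(s_inputs, e_inputs):
    """Steps S70 and S80: split the inputs of operation S into two lists.

    var_list     -- variables shared by operation S and operation E
    new_var_list -- remaining inputs of operation S
    """
    var_list = [v for v in s_inputs if v in e_inputs]
    new_var_list = [v for v in s_inputs if v not in var_list]
    return var_list, new_var_list


def passes_variable_conditions(var_list, s_inputs):
    """First two conditions of step S90: varList is non-empty and is not
    identical to the full input variable list of operation S."""
    return bool(var_list) and set(var_list) != set(s_inputs)
```

When every input of operation S is also needed by operation E, newVarList is empty and nothing is gained by splitting the transmission, which is why the second condition rejects that case.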

The method for deciding, in the distributed plan table 141 of FIG. 9, whether the operation that outputs the varList variables and the operation that outputs the newVarList variables can be executed in parallel is as follows. Starting from one operation, its pre-execution operation numbers 145 are examined, then the pre-execution operation numbers 145 of those operations, and so on repeatedly. If the other operation never appears among them, the two operations are judged to be executable in parallel. Conversely, if the operation number of operation B appears among the pre-execution operation numbers 145 of operation A, operation A must run after operation B, so the two are judged not to be executable in parallel.
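The parallel-execution test is essentially a reachability check over the pre-execution operation numbers. A minimal Python sketch follows, assuming the dependencies are given as a dictionary from operation number to the list of its pre-execution operation numbers. The function names are hypothetical, and the sketch checks the dependency in both directions, which is one reading of the description.

```python
def all_prerequisites(op, prereqs):
    """Collect every operation that must run before `op`, following the
    pre-execution operation numbers transitively."""
    seen = set()
    stack = list(prereqs.get(op, ()))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(prereqs.get(current, ()))
    return seen


def can_run_in_parallel(a, b, prereqs):
    """Two operations can run in parallel when neither one appears in the
    other's chain of pre-execution operations."""
    return (b not in all_prerequisites(a, prereqs)
            and a not in all_prerequisites(b, prereqs))
```

For example, with dependencies `{2: [1], 3: [2], 4: [1]}`, operations 3 and 4 are independent of each other, whereas operation 3 depends on operation 1 through operation 2.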

Returning to step S90 of FIG. 10: if varList is not empty, the variables in varList are not identical to the input variables 148 of operation S, and the operation that outputs the varList variables and the operation that outputs the newVarList variables can be executed in parallel (Yes in step S90), the following five operations are inserted after operation E, and i is then increased by 5 (i := i + 5) (step S100).

The first operation has operation content 144 "data transmission"; its input variables 148 and output variables 149 are the variables in varList; its execution location 146 is the transmission location 147 of operation S; and its transmission location 147 is the execution location 146 of operation S.

The second operation has operation content 144 "data reception"; its input variables 148 and output variables 149 are the variables in varList; its execution location 146 is the transmission location 147 of operation S; and its transmission location 147 is the execution location 146 of operation S.

The third operation has operation content 144 "split query join"; its input variables 148 are the variables in varList; and its execution location 146 is the execution location 146 of operation S. A "split query join" is an operation that merges back into one the results of processing carried out separately and in parallel on a given variable.

The fourth operation has operation content 144 "data transmission"; its input variables 148 and output variables 149 are the variables in newVarList; its execution location 146 is the execution location 146 of operation S; and its transmission location 147 is the transmission location 147 of operation S.

The fifth operation has operation content 144 "data reception"; its input variables 148 and output variables 149 are the variables in newVarList; its execution location 146 is the execution location 146 of operation S; and its transmission location 147 is the transmission location 147 of operation S.
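The five inserted operations can be generated mechanically from operation S and the two variable lists. The Python sketch below uses plain dictionaries with hypothetical key names. Leaving the output variables of the split query join empty is an assumption, since the description does not state them.

```python
def build_inserted_operations(s_exec, s_dest, var_list, new_var_list):
    """Build the five operations of step S100, in insertion order.

    s_exec -- execution location 146 of operation S
    s_dest -- transmission location 147 of operation S
    """
    def op(content, variables, exec_loc, dest, has_output=True):
        return {
            "content": content,
            "inputs": list(variables),
            # Assumption: the split query join lists no output variables.
            "outputs": list(variables) if has_output else [],
            "execution_location": exec_loc,
            "transmission_location": dest,
        }

    return [
        op("data transmission", var_list, s_dest, s_exec),
        op("data reception", var_list, s_dest, s_exec),
        op("split query join", var_list, s_exec, None, has_output=False),
        op("data transmission", new_var_list, s_exec, s_dest),
        op("data reception", new_var_list, s_exec, s_dest),
    ]
```

The first two operations carry the varList variables in the direction opposite to operation S, while the last two carry the newVarList variables in the same direction as operation S.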

Next, three steps are carried out to change the variables sent by operation S (step S110). First, the input variables 148 and output variables 149 of operation S are changed to the contents of varList. Second, operation R, the operation that follows operation S, is fetched; operation R is the "data reception" operation that corresponds to operation S. Third, the input variable 148 and output variable 149 lists of operation R are changed to the contents of varList. The process then proceeds to step S120.

If operation S is not a data transmission, or the computer at the execution location 146 is not a slave server (No in step S60), the process also proceeds to step S120.

Likewise, if varList is empty, or is identical to the input variable list of operation S, or the operation that outputs the varList variables and the operation that outputs the newVarList variables cannot be executed in parallel (No in step S90), the process proceeds to step S120.

In step S120, the split query join operation adding unit 14 increases j by 1 (j := j + 1). Step S120 is thus reached when step S60 is No, when step S90 is No, or after step S110.

After step S120, the split query join operation adding unit 14 determines whether j is smaller than i (step S130). If j is smaller than i (Yes in step S130), the process returns to step S50 and repeats. If j is not smaller than i (No in step S130), the process proceeds to step S140, where i is increased by 1 (i := i + 1).

If operation E is not an inter-server operation, or i = max (No in step S30), the process proceeds to step S140. Step S140 is thus reached when step S130 is No or when step S30 is No.

After step S140, the split query join operation adding unit 14 determines whether i is less than or equal to max (step S150). That is, it determines whether the split query join operation addition process has been applied to every operation in the plan.

If i is less than or equal to max (Yes in step S150), the split query join operation adding unit 14 returns to step S20 and repeats the process. If i is greater than max (No in step S150), the split query join operation adding unit 14 ends the process.

FIG. 11 shows an example of the distributed plan that results when the split query join operation adding unit 14 applies the process described above to the distributed plan of FIG. 9 (hereinafter, the modified distributed plan).

In the distributed plan recorded in the distributed plan table 141 of FIG. 9, operations 1 to 9 are not inter-server operations, so the split query join operation adding unit 14 increments i by 1 at a time until i reaches 10 (steps S20, S30, S140, and S150 of FIG. 10).

When i = 10, operation E is an inter-server operation (Yes in step S30), so 1 is assigned to the variable j (step S40). Whether operation E is an inter-server operation is determined by referring to the distributed plan table 141 based on the partial query number of operation E.

Until j reaches 6, operation S is not a data transmission, so j is incremented by 1 each time (step S50, No in step S60, step S120, step S130).

When j reaches 6, the operation content of operation S is data transmission and the computer at its execution location 146 is a slave server (Yes in step S60). A variable list varList is therefore created that holds $z, the variable common to the input variables 148 of operation number 6 and the input variables 148 of operation number 10 (step S70).

A variable list newVarList is then created that holds the variables $u and $y, obtained by removing the varList variables from the input variables 148 of operation number 6 (step S80).

Because varList is not empty, is not identical to the input variable list of operation number 6, and the operations that output the varList and newVarList variables can be executed in parallel, the five operations numbered 11 to 15 in FIG. 11 are added, and i is increased by 5 to 15 (steps S90 and S100). Next, the varList value, the variable $z, is assigned to the input variables 148 and output variables 149 of the data transmission of operation number 6 and the data reception of operation number 7 (step S110).

Until j reaches 8, the operation fetched is not a data transmission, so j is simply incremented by 1 (steps S50, S60, S120, and S130). When j is 8, the operation content 144 is data transmission and the execution location 146 is a slave server, so a variable list varList is obtained that holds $y, the variable common to the input variables of operation number 8 and the input variable list of operation number 10 (step S70).

The varList obtained is not empty, but it is identical to the input variable list of operation number 8 (No in step S90), so j is increased by 1 (step S120). There are no further data transmission operations, so i is then increased by 1 to 16 (steps S120, S130, S140, and S150).

The operation at i = 16 is an inter-server operation, but because i equals max (Yes in step S150), the process returns to step S20 and repeats. Once the value of i exceeds max (No in step S150), the split query join operation adding unit 14 ends the process. The result is the modified distributed plan, shown as table 151 in FIG. 11. In other words, the split query join operation adding unit 14 rewrites the plan so that inter-server operations and other operations run in parallel as far as possible and their results are joined afterwards.

Once the split query join operation adding unit 14 has modified the distributed plan as described above, the transmission/reception unit 17 of the master server sends the modified distributed plan to the slave servers. The modified distributed plan may be sent to all slave servers, or, by referring to the execution locations, only to the slave servers concerned.

The local plan selection unit 31 of a slave server generates a local plan candidate for the part of the received distributed plan, as modified by the split query join operation adding unit 14, that concerns its own server (step S6).

FIG. 12 illustrates a local plan candidate generated by the local plan selection unit 31. The local plan candidate 2 shown in FIG. 12 is an example of a local plan candidate for computer 2, generated by the local plan selection unit 31 of the slave server that is computer 2, based on the distributed plan of FIG. 11 and the database information 111 of FIG. 4.

As shown in FIG. 12, the local plan candidate 2 has the items "operation number" 302, "partial query number" 303, "operation content" 304, "pre-execution operation number" 305, "execution location" 306, "transmission location" 307, "input variable" 308, and "output variable" 309. The items 302 to 309 of the local plan candidates of this embodiment, and of the local plan described later, are the same as the items 142 to 149 of the distributed plan table of this embodiment.

In the local plan candidate 2 of FIG. 12, computer 2 has, according to the database information of FIG. 4, a numeric index defined on the year attribute node under book. The first operation, operation number 1, therefore implements the processing for partial query number 1 with a "numeric index" operation. A numeric index is an index over the numeric form of the values held by nodes. For nodes on which such an index is defined, the "numeric index" operation quickly retrieves the nodes that satisfy a comparison condition against a given number, or the first node of each document that owns such a node.

Specifically, the local plan selection unit 31 looks up in the numeric index the book nodes whose year attribute node has a value of 1990 or greater, and stores them in the variable $x.
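The idea behind the numeric index lookup can be illustrated with a sorted list and binary search. The Python sketch below uses the standard `bisect` module. The class name `NumericIndex`, the node identifiers, and the sample years are hypothetical, and the actual index structure used by the slave server is not described in this passage.

```python
import bisect


class NumericIndex:
    """Sorted (value, node id) pairs supporting a 'value >= threshold' lookup."""

    def __init__(self, entries):
        # entries: iterable of (numeric value, node identifier) pairs
        self._entries = sorted(entries)
        self._keys = [value for value, _ in self._entries]

    def at_least(self, threshold):
        """Return the node ids whose indexed value is >= threshold."""
        start = bisect.bisect_left(self._keys, threshold)
        return [node for _, node in self._entries[start:]]


# Hypothetical year attribute values for three book nodes.
index = NumericIndex([(2001, "book3"), (1985, "book1"), (1990, "book2")])
matching_books = index.at_least(1990)
```

Because the keys are kept sorted, the lookup finds the first qualifying entry in logarithmic time instead of scanning every book node.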

Operation numbers 2, 4, and 5 each perform a "TRAVERSE" operation, which processes partial query numbers 3, 4, and 5 respectively. "TRAVERSE" is an operation that follows the XML structure from one node (the input) to another node (the output). Specifically, from the book nodes stored in $x by operation number 1, the child nodes author, title, and price are retrieved and stored in the variables $z, $u, and $v respectively. Operation numbers 3, 6, 7, and 8 are the data transmission, data reception, and split query join operations created in the distributed plan, carried over unchanged.
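A TRAVERSE step from book nodes to one kind of child node can be sketched with Python's standard `xml.etree.ElementTree` module. The function name `traverse` and the sample document are assumptions made for illustration; the real data layout is given by the database information of FIG. 4, which is not reproduced here.

```python
import xml.etree.ElementTree as ET


def traverse(nodes, child_tag):
    """Follow each input node to its direct children named `child_tag`."""
    return [child for node in nodes for child in node.findall(child_tag)]


# Hypothetical sample data shaped like the publisher Collection.
doc = ET.fromstring(
    "<publisher>"
    "<book year='1995'><author>A</author><title>T1</title><price>10</price></book>"
    "<book year='2001'><author>B</author><title>T2</title><price>20</price></book>"
    "</publisher>"
)
books = doc.findall("book")          # plays the role of $x
authors = traverse(books, "author")  # plays the role of $z
```

Calling `traverse` again with "title" and "price" would give the counterparts of $u and $v.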

Next, with reference to FIG. 13, the local plan candidate generation process (step S7 in FIG. 6), in which the local plan candidate generation unit 32 generates further local plan candidates from local plan candidate 2 in FIG. 12, will be described. FIG. 13 is a flowchart showing an example of the local plan candidate generation process.

This local plan candidate generation process uses a variable i. The variable i is at least 1 and at most the number of elements in the list of local plan candidates generated by the local plan selection unit 31 (hereinafter, inputPlanList). That is, 1 ≤ i ≤ (number of elements in inputPlanList).

First, the local plan candidate generation unit 32 acquires the local plan candidates shown in FIG. 12 (step S200). It then sets i = 1 and max = (number of elements in inputPlanList) (step S210).

Subsequently, the local plan candidate generation unit 32 acquires the i-th local plan candidate, plan, from inputPlanList (step S220). Based on the acquired plan, it performs the variable combination pattern generation process for the split query join, described later, and creates a new local plan candidate list that includes plan (step S230). This process generates various local plan candidates by changing which operations are performed before and after the split query join. Its details are described later with reference to the flowchart of FIG. 14.

The local plan candidate generation unit 32 acquires the new local plan candidate list obtained by the variable combination pattern generation process for the split query join (hereinafter, nextPlanList) (step S240). A variable j is used here. The variable j is at least 1 and at most the number of elements in nextPlanList. That is, 1 ≤ j ≤ (number of elements in nextPlanList). At this point, j = 1 and finalMax = (number of elements in nextPlanList) are set (step S250).

Next, the local plan candidate generation unit 32 acquires the j-th local plan candidate, nextPlan, from nextPlanList (step S260). Based on the acquired nextPlan, it performs the execution location pattern generation process for the split query join and creates a new local plan candidate list that includes nextPlan (step S270). This process generates various local plan candidates by changing where the split query join operation is executed. Its details are described later with reference to the flowchart of FIG. 15.

The local plan candidate generation unit 32 creates a new local plan candidate list, finalPlanList, from the result of the execution location pattern generation process for the split query join (step S280). It adds the local plan candidates in finalPlanList to the output candidate plan list outputList, which is the final output (step S290).

Next, j + 1 is assigned to j (j := j + 1) (step S300). The local plan candidate generation unit 32 determines whether j is less than or equal to finalMax (step S310). If so (Yes in step S310), the process returns to step S260 and repeats.

If j is greater than finalMax (No in step S310), i + 1 is assigned to i (i := i + 1) (step S320).

Next, it is determined whether i is less than or equal to max (step S330). If so (Yes in step S330), the process returns to step S220 and repeats. If i is greater than max, the process ends. The final output is the local plan candidate list stored in outputList, from which the local plan selection unit 31 ultimately selects one local plan.
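The double loop of FIG. 13 can be summarized in Python as below. This is a sketch under stated assumptions: the function name and the two callback parameters are hypothetical and stand in for the processes of FIG. 14 and FIG. 15, and the string labels in the usage lines are placeholders for plans (the candidate names follow the worked example later in the description).

```python
def generate_local_plan_candidates(input_plan_list, variable_patterns, location_patterns):
    """Collect every candidate produced by the two pattern generators."""
    output_list = []
    for plan in input_plan_list:                      # S220, S320, S330
        next_plan_list = variable_patterns(plan)      # S230, S240
        for next_plan in next_plan_list:              # S260, S300, S310
            final_plan_list = location_patterns(next_plan)  # S270, S280
            output_list.extend(final_plan_list)       # S290
    return output_list


# Placeholder generators: each returns its input plus the derived candidates.
by_variables = {"2": ["2", "2-1", "2-2", "2-3"]}
by_location = {"2": ["2", "2-4"], "2-1": ["2-1", "2-5"],
               "2-2": ["2-2", "2-6"], "2-3": ["2-3"]}

output = generate_local_plan_candidates(
    ["2"], by_variables.__getitem__, by_location.__getitem__)
```

Both generators return lists that include their input plan, which is why the original candidate also appears in the final output list.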

Next, the variable combination pattern generation process for the split query join, performed by the local plan candidate generation unit 32 in step S230 of FIG. 13, will be described with reference to the flowchart of FIG. 14. This process uses a variable i. The variable i is at least 1 and at most the largest operation number in the input local plan candidate plan. That is, 1 ≤ i ≤ (maximum operation number of plan).

First, the local plan candidate generation unit 32 registers plan in the temporary candidate list nextPlanList, which is the final output of this process (step S400). It also sets i = 1 and max = (maximum operation number of plan) (step S410).

Subsequently, the local plan candidate generation unit 32 acquires the operation E with operation number i from the input local plan candidate plan (step S420). If E is a split query join operation (Yes in step S430), it acquires the data transmission operation S at operation number i + 1 of plan, and then acquires the list varPatternList of all combination patterns of the input variables of S (step S440).
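One way to build such a list of combination patterns is to enumerate every non-empty subset of the input variables. The sketch below assumes that reading, which agrees with the worked example later in the description, where the input variables ($u, $v) yield three patterns. The function name is hypothetical.

```python
from itertools import combinations


def all_combination_patterns(variables):
    """Return every non-empty subset of variables, smallest subsets first."""
    return [
        combo
        for size in range(1, len(variables) + 1)
        for combo in combinations(variables, size)
    ]


patterns = all_combination_patterns(["$u", "$v"])
```

For n input variables this produces 2^n - 1 patterns, so the number of generated candidates grows quickly with the number of variables sent by operation S.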

Next, j := 1 and nextMax = (number of elements in nextPlanList) are set (step S450). The j-th plan in nextPlanList, nextPlan, is acquired (step S460). Then k := 1 and vtMax = (number of elements in varPatternList) are set (step S470). A new local plan candidate, newPlan, is created as a copy of nextPlan (step S480).

Next, the local plan candidate generation unit 32 performs the following three processes (step S490). The first acquires targetVars, the k-th element of varPatternList. The second acquires the data transmission operation AS at operation number i + 1 of newPlan. The third acquires the data reception operation AR at operation number i - 1 of newPlan.

Next, it is determined whether the input variable list of operation AS matches the contents of targetVars (step S500).

If they match, so that AS is left with no input variables (Yes in step S500), operations E, AS, and AR are unnecessary, and the content of each is changed to dummy (step S510). A dummy is an operation that does nothing and is deleted later. It is not deleted at this point because doing so would shift the operation numbers.
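The reason for marking operations as dummy instead of deleting them can be seen in a small Python sketch. The list of operation kinds below is a hypothetical stand-in for a plan, with operation number n stored at position n - 1.

```python
# Hypothetical eight-operation plan; operation number n is ops[n - 1].
ops = ["index", "traverse", "send", "traverse", "traverse", "recv", "join", "send"]

# Mark operations 6, 7, and 8 as dummy. Operation numbers stay valid.
marked = list(ops)
for number in (6, 7, 8):
    marked[number - 1] = "dummy"

# Deleting is deferred to a final pass, after all number-based lookups are done.
compacted = [op for op in marked if op != "dummy"]
```

While the placeholders are in place, a lookup such as marked[3 - 1] still returns the data transmission operation with operation number 3, which later steps rely on.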

Next, so that the variables excluded from the split query join are sent in advance, the following four processes are performed to change the input/output variables and the pre-execution operation numbers of the data transmission operation that precedes operation E (step S520). The first acquires the data transmission operation BS whose input variable is the input variable of operation E. The second adds the variables contained in targetVars to the input/output variable list of BS. The third acquires preExeList, the list of operation numbers of the operations that output the variables in targetVars. The fourth assigns the values in preExeList to the pre-execution operation numbers of BS.

The local plan candidate generation unit 32 adds the newPlan processed in step S520 to newPlanList (step S530).

Then k + 1 is assigned to k (k := k + 1) (step S540), and it is determined whether k is less than or equal to vtMax (step S550). If so (Yes in step S550), the process returns to step S480 and repeats. If k is greater than vtMax (No in step S550), j + 1 is assigned to j (j := j + 1) (step S560).

The local plan candidate generation unit 32 determines whether j is less than or equal to nextMax (step S570). If so (Yes in step S570), the process returns to step S460 and repeats. If j is greater than nextMax (No in step S570), all elements in newPlanList are moved to nextPlanList (step S580). Then i + 1 is assigned to i (i := i + 1) (step S590).

If operation E is not a split query join (No in step S430), the process likewise proceeds to step S590 and assigns i + 1 to i (i := i + 1). In other words, step S590 follows either a No result in step S430 or step S580.

The local plan candidate generation unit 32 determines whether i is less than or equal to max (step S600). If so (Yes in step S600), the process returns to step S420 and repeats. If i is greater than max, the process ends. The final output is the local plan candidate list stored in nextPlanList, and processing by the local plan candidate generation unit 32 resumes at step S240 of FIG. 13.
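The flow of FIG. 14 can be sketched in Python as follows. Everything here is an assumption made for illustration: the Op data model, the field names, the rule used to find BS (the first data transmission before E that shares an input variable with E), and the sample plan, which only imitates the operation kinds and variables mentioned in the text and is not the content of FIG. 12. The sketch also assumes that every split query join has a data reception directly before it and a data transmission directly after it, and it appends to the pre-execution numbers as the worked example does.

```python
import copy
from dataclasses import dataclass, field
from itertools import combinations


@dataclass
class Op:
    kind: str                                      # "index", "traverse", "send", "recv", "join", "dummy"
    inputs: list = field(default_factory=list)
    outputs: list = field(default_factory=list)
    pre_exec: list = field(default_factory=list)   # operation numbers to run first


def variable_patterns(plan):
    """plan is a list of Op; operation number n is plan[n - 1]."""
    next_plans = [plan]                                          # S400
    for i in range(1, len(plan) + 1):                            # S410, S590, S600
        if plan[i - 1].kind != "join":                           # S420, S430
            continue
        send_inputs = plan[i].inputs                             # S440: operation i + 1
        var_patterns = [list(c) for n in range(1, len(send_inputs) + 1)
                        for c in combinations(send_inputs, n)]
        new_plans = []
        for next_plan in next_plans:                             # S450-S460, S560-S570
            for target_vars in var_patterns:                     # S470, S540-S550
                new_plan = copy.deepcopy(next_plan)              # S480
                e, a_s, a_r = new_plan[i - 1], new_plan[i], new_plan[i - 2]  # S490
                if a_s.inputs == target_vars:                    # S500
                    for op in (e, a_s, a_r):                     # S510
                        op.kind = "dummy"
                # S520: send the excluded variables ahead of the join.
                bs = next(op for op in new_plan[: i - 1]
                          if op.kind == "send" and set(op.inputs) & set(e.inputs))
                bs.inputs += [v for v in target_vars if v not in bs.inputs]
                pre = [n for n, op in enumerate(new_plan, 1)
                       if set(op.outputs) & set(target_vars)]
                bs.pre_exec += [n for n in pre if n not in bs.pre_exec]
                new_plans.append(new_plan)                       # S530
        next_plans.extend(new_plans)                             # S580
    return next_plans


plan2 = [
    Op("index", outputs=["$x"]),        # 1
    Op("traverse", ["$x"], ["$z"]),     # 2
    Op("send", ["$z"]),                 # 3
    Op("traverse", ["$x"], ["$u"]),     # 4
    Op("traverse", ["$x"], ["$v"]),     # 5
    Op("recv"),                         # 6
    Op("join", ["$z"]),                 # 7
    Op("send", ["$u", "$v"]),           # 8
]
candidates = variable_patterns(plan2)
```

With this sample input the function returns four plans: the unchanged input and one new plan per combination pattern, the last of which has operations 6, 7, and 8 marked as dummy.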

Next, the execution location pattern generation process for the split query join will be described with reference to the flowchart of FIG. 15. This process uses a variable i. The variable i is at least 1 and at most the largest operation number in the input local plan candidate nextPlan. That is, 1 ≤ i ≤ (maximum operation number of nextPlan).

The local plan candidate generation unit 32 registers nextPlan in the final candidate list finalPlanList, which is the final output of this process (step S700).

At the start of the process, i = 1 and max = (maximum operation number of nextPlan) are set (step S710).

First, the process acquires the operation E with operation number i from the input local plan candidate nextPlan (step S720). If E is a split query join operation (Yes in step S730), the operation S at operation number i + 1 of nextPlan is acquired (step S740). Next, j := 1 and finalMax = (number of elements in finalPlanList) are set (step S750). The j-th plan in finalPlanList, finalPlan, is acquired (step S760). A new local plan candidate, newPlan, is created as a copy of finalPlan (step S770).

Next, the local plan candidate generation unit 32 performs the following five processes to change the execution location of the split query join to the master server (step S780). The first acquires the data transmission operation S immediately after the split query join operation E. The second acquires the data reception operation R immediately before E. The third changes operation R: its operation content becomes a data transmission, the input variables of S are added to its input/output variables, and its execution location becomes "destination of S → source of S". The fourth changes the operation content of S, which is no longer needed, to dummy. The fifth changes the execution location of E to the source of S.

Next, the newPlan modified in step S780 is added to the new plan list newPlanList (step S790). Then j + 1 is assigned to j (j := j + 1) (step S800), and it is determined whether j is less than or equal to finalMax (step S810). If so (Yes in step S810), the process returns to step S760 and repeats. If j is greater than finalMax (No in step S810), all elements in newPlanList are moved to finalPlanList (step S820). The process then proceeds to step S830.

If operation E is not a split query join (No in step S730), the process likewise proceeds to step S830, where i + 1 is assigned to i (i := i + 1). In other words, step S830 follows either a No result in step S730 or step S820.

Next, it is determined whether i is less than or equal to max (step S840). If so (Yes in step S840), the process returns to step S720 and repeats. If i is greater than max, the process proceeds to step S850.

In step S850, every operation whose content is dummy is deleted from all local plan candidates in finalPlanList, and the process ends. The final output is the local plan candidate list stored in finalPlanList, and processing by the local plan candidate generation unit 32 resumes at step S280 of FIG. 13.
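The flow of FIG. 15 can be sketched in the same style. As before, the Op data model, the tuple encoding of execution locations, and the three-operation sample plan are hypothetical; the sample only imitates the receive, join, and send operations discussed in the text, with computer 0 standing for the master server and computer 2 for the slave server.

```python
import copy
from dataclasses import dataclass, field


@dataclass
class Op:
    kind: str                                   # "send", "recv", "join", "dummy", ...
    inputs: list = field(default_factory=list)
    location: tuple = ()                        # (computer,) or (source, destination)


def location_patterns(next_plan):
    """next_plan is a list of Op; operation number n is next_plan[n - 1]."""
    final_plans = [next_plan]                                    # S700
    for i in range(1, len(next_plan) + 1):                       # S710, S830, S840
        if next_plan[i - 1].kind != "join":                      # S720, S730
            continue
        new_plans = []
        for final_plan in final_plans:                           # S750-S760, S800-S810
            new_plan = copy.deepcopy(final_plan)                 # S770
            # S780: move the join to the source of the following transmission.
            r, e, s = new_plan[i - 2], new_plan[i - 1], new_plan[i]
            source, destination = s.location
            r.kind = "send"
            r.inputs += [v for v in s.inputs if v not in r.inputs]
            r.location = (destination, source)
            s.kind = "dummy"
            e.location = (source,)
            new_plans.append(new_plan)                           # S790
        final_plans.extend(new_plans)                            # S820
    # S850: drop the dummy placeholders only after all lookups by number are done.
    return [[op for op in plan if op.kind != "dummy"] for plan in final_plans]


tail = [
    Op("recv", [], (0, 2)),              # data reception before the join
    Op("join", ["$z"], (2,)),            # split query join on computer 2
    Op("send", ["$u", "$v"], (0, 2)),    # data transmission after the join
]
result = location_patterns(tail)
```

The function returns the input plan plus one variant in which the former data reception has become a transmission from computer 2 to computer 0 and the join runs on computer 0.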

FIGS. 16 to 21 show the new local plan candidates 2-1 to 2-6 obtained when the local plan candidate generation process described above is applied to local plan candidate 2 of FIG. 12, taken as the local plan candidate obtained by the local plan selection unit 31.

The process by which the local plan candidate generation unit 32 generates local plan candidates 2-1 to 2-6 will now be described concretely. First, local plan candidate 2 of FIG. 12 is passed as input to the variable combination pattern generation process for the split query join (steps S200, S210, S220, and S230 in FIG. 13).

Next, in the variable combination pattern generation process for the split query join, the input local plan candidate 2 of FIG. 12 (hereinafter, plan) is registered in the temporary candidate list (hereinafter, nextPlanList) (step S400 in FIG. 14). In plan, no operation is a split query join until i reaches 7, so the variable i is incremented by 1 each time (steps S420, S430, S590, and S600). When i is 7, operation E is a split query join, so the operation S with operation number 8 is acquired, and the three patterns ($u), ($v), and ($u, $v), which are all combinations of the input variables ($u, $v) of S, are stored in varPatternList (step S440). Next, with j := 1, the plan of FIG. 12, registered earlier as the first element of nextPlanList, is acquired as nextPlan (steps S450 and S460). A newPlan is then created as a copy of nextPlan (step S480).

Next, the first element ($u) of varPatternList is acquired, together with the data transmission operation AS at operation number 8 and the data reception operation AR at operation number 6 of newPlan (step S490). Because the input variables ($u, $v) of AS do not match ($u), the data transmission operation with operation number 3, which sends the input variable of the split query join, is acquired, and the variable $u, now excluded from the split query join, is added to its input/output variables. In addition, operation number 4, which outputs $u, is added to the pre-execution operation numbers of operation number 3 (steps S500 and S520). The resulting newPlan is added to newPlanList (step S530). FIG. 16 shows this newPlan, local plan candidate 2-1. In FIG. 16, operations whose content is dummy have already been deleted. Next, 2 is assigned to k, and a newPlan is again created as a copy of the plan of FIG. 12 (step S480).

Next, the second element ($v) of varPatternList is acquired, together with the data transmission operation AS at operation number 8 and the data reception operation AR at operation number 6 of newPlan (step S490). Because the input variables ($u, $v) of AS do not match ($v), the data transmission operation with operation number 3, which sends the input variable $z of the split query join, is acquired, and the variable $v, now excluded from the split query join, is added to its input/output variables. In addition, operation number 5, which outputs $v, is added to the pre-execution operation numbers of operation number 3 (steps S500 and S520). The resulting newPlan is added to newPlanList (step S530). FIG. 17 shows this newPlan, local plan candidate 2-2. In FIG. 17, operations whose content is dummy have already been deleted. Next, 3 is assigned to k, and a newPlan is again created as a copy of the plan of FIG. 12 (step S480).

Next, the third element ($u, $v) of varPatternList is acquired, together with the data transmission operation AS at operation number 8 and the data reception operation AR at operation number 6 of newPlan (step S490). Because the input variables ($u, $v) of AS match the third element ($u, $v) of varPatternList exactly, the operation content of E, AS, and AR is changed to dummy (step S510). Then the data transmission operation with operation number 3, which sends the input variable of the split query join, is acquired, and the variables $u and $v, now excluded from the split query join, are added to its input/output variables. In addition, operation numbers 4 and 5, which output $u and $v, are added to the pre-execution operation numbers of operation number 3 (step S520). The resulting newPlan is added to newPlanList (step S530). FIG. 18 shows this newPlan, local plan candidate 2-3. In FIG. 18, operations whose content is dummy have already been deleted. Next, 4 is assigned to k (step S540).

Because varPatternList has only three elements, 2 is then assigned to j (steps S550 and S560). Because nextPlanList has only one element, the three local plan candidates accumulated in newPlanList, shown in FIGS. 16, 17, and 18, are moved to nextPlanList (steps S570 and S580). At this point nextPlanList contains the local plan candidates of FIG. 12 and FIGS. 16 to 18. No split query join exists for values of i from 8 onward, so the variable combination pattern generation process for the split query join ends (steps S590, S600, S420, and S430).

Next, nextPlanList, which holds the local plan candidates of FIG. 12 and FIGS. 16 to 18, is acquired as the output list of the variable combination pattern generation process for the split query join (step S240 in FIG. 13). Then 1 is assigned to j, and the local plan candidate of FIG. 12, the first element of nextPlanList, is acquired as nextPlan (steps S250 and S260). This nextPlan is passed as input to the execution location pattern generation process for the split query join (step S270).

Next, the local plan candidate of FIG. 12, passed as nextPlan to the execution location pattern generation process for the split query join, is registered in the final candidate list finalPlanList (step S700 in FIG. 15). Then 1 is assigned to i (step S710).

In nextPlan, no operation is a split query join until i reaches 7, so the variable i is incremented by 1 each time (steps S720, S730, S830, and S840). When i is 7, operation E is a split query join, so the data transmission operation S with operation number 8 is acquired (step S740). Next, 1 is assigned to j, and the local plan candidate of FIG. 12, registered earlier as the first element of finalPlanList, is acquired as finalPlan (steps S750 and S760). A newPlan is then created as a copy of finalPlan (step S770).

The following processing is performed to change the execution location of the split query join to the master server (step S780). First, the data transmission operation S with operation number 8 and the data reception operation R with operation number 6 are acquired. Next, R is changed to a data transmission operation, the input variables $u and $v of S are added to its input/output variables, and its execution location is changed to "computer 2 → 0". Then the operation content of S is changed to dummy. Finally, the execution location of E is changed to computer 0.

Next, the resulting newPlan is added to newPlanList (step S790). FIG. 19 shows this newPlan, local plan candidate 2-4. In FIG. 19, operations whose content is dummy have already been deleted.

Next, 2 is assigned to j, but because finalPlanList has only one element, the single local plan candidate 2-4 accumulated in newPlanList is moved to finalPlanList (steps S800, S810, and S820). No split query join exists for values of i from 8 onward, so the operations whose content is dummy are deleted from each local plan candidate in finalPlanList, and the execution location pattern generation process for the split query join ends (steps S830, S840, and S850).

Next, finalPlanList, which holds the local plan candidates of FIGS. 12 and 19, is acquired as the output list of the execution location pattern generation process for the split query join (step S280 in FIG. 13). The local plan candidates of FIGS. 12 and 19 in finalPlanList are then moved to the output candidate plan list outputList (step S290). Next, 2 is assigned to j, and the local plan candidate of FIG. 16, the second element of nextPlanList, is acquired as nextPlan (steps S300, S310, and S260). This nextPlan is passed as input to the execution location pattern generation process for the split query join (step S270).

In the execution location pattern generation process for the split query join, local plan candidate 2-1 of FIG. 16 has almost the same plan shape as local plan candidate 2 of FIG. 12, differing only in the input/output variables 149, and is processed in the same way as when local plan candidate 2 is the input. The details for this input are therefore omitted. The output list finalPlanList of the process contains the local plan candidates of FIGS. 16 and 20.

Next, the local plan candidates of FIGS. 16 and 20 in finalPlanList are moved to the output candidate plan list outputList (step S290). Then 3 is assigned to j, and the local plan candidate of FIG. 17, the third element of nextPlanList, is acquired as nextPlan (steps S300, S310, and S260). This nextPlan is passed as input to the execution location pattern generation process for the split query join (step S270).

In the split query join execution location pattern generation process, the local plan candidate 2-2 shown in FIG. 17 has almost the same plan shape as the local plan candidate 2 shown in FIG. 12, apart from the input/output variables 149, and the process behaves in the same way as when the local plan candidate 2 of FIG. 12 is the input. The details for the case where the local plan candidate 2-2 of FIG. 17 is the input are therefore omitted. The output list finalPlanList of the process contains the local plan candidates of FIGS. 17 and 21.

The local plan candidates of FIGS. 17 and 21 held in finalPlanList are moved to the output candidate plan list outputList (step S290). Next, 4 is assigned to j, and the local plan candidate nextPlan of FIG. 18, which is the fourth element of nextPlanList, is obtained (steps S300, S310, and S260). nextPlan is then passed as input to the split query join execution location pattern generation process (step S270).

The local plan candidate nextPlan of FIG. 18, passed as input to the split query join execution location pattern generation process, is registered in the final candidate list finalPlanList (step S700 in FIG. 15). Next, 1 is assigned to i (step S720). The variable i is then incremented by 1 at a time over nextPlan, but no split query join operation is found through to the end, so the process moves to step S850 (steps S720, S730, S830, and S840). In step S850, the operations whose content is dummy are deleted from the local plan candidate 2-3 of FIG. 18 registered in finalPlanList, and the split query join execution location pattern generation process ends (step S850). The output list finalPlanList of the process contains the local plan candidate of FIG. 19.

The local plan candidate 2-4 of FIG. 19 held in finalPlanList is moved to the output candidate plan list outputList (step S290). Next, 5 is assigned to j (step S300). Because nextPlanList has only four registered elements, 2 is assigned to i (steps S310 and S320). Because inputPlanList has only one element, the local plan candidate generation process ends. The local plan candidates registered in the resulting local plan candidate list outputList are those of FIGS. 16 to 21.

In the local plan candidate 2 of FIG. 12, the variable $z, which is transmitted by the data transmission operation of operation number 3 because it is required for the inter-server operation, and the variables $u and $v, which are obtained by operations 4 and 5 executed in parallel with the inter-server operation, are combined on the computer 2 by the split query join of operation number 7.

The local plan candidate 2-1 shown in FIG. 16 differs from the local plan candidate 2 shown in FIG. 12 in that operation number 4 is not executed in parallel with the inter-server operation. The variables $z and $u are therefore transmitted by the data transmission operation of operation number 3, and the split query join of operation number 7 joins them with $v.

The local plan candidate 2-2 shown in FIG. 17 differs from the local plan candidate 2 shown in FIG. 12 in that operation number 5 is not executed in parallel with the inter-server operation. The variables $z and $v are therefore transmitted by the data transmission operation of operation number 3, and the split query join of operation number 7 joins them with $u.

The local plan candidate 2-3 shown in FIG. 18 differs from the local plan candidate 2 shown in FIG. 12 in that operation numbers 4 and 5 are not executed in parallel with the inter-server operation. The variables $z, $v, and $u are all transmitted by the data transmission operation of operation number 3, and no split query join is performed.

The local plan candidate 2-4 shown in FIG. 19 is the variant of the local plan candidate 2 shown in FIG. 12 in which the execution location is the computer 0 instead of the computer 2.

The local plan candidate 2-5 shown in FIG. 20 is the variant of the local plan candidate 2-1 shown in FIG. 16 in which the execution location is the computer 0 instead of the computer 2.

The local plan candidate 2-6 shown in FIG. 21 is the variant of the local plan candidate 2-2 shown in FIG. 17 in which the execution location is the computer 0 instead of the computer 2.

As described above, the local plan candidates shown in FIG. 12 and FIGS. 16 to 21 cover both the case with and the case without a split query join operation. Where a split query join is performed, they cover every combination of the variables to be joined and of whether the join is executed on the master server or on the slave server.
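The coverage described here can be illustrated with a short enumeration. The following is a minimal sketch, not the patented procedure: it assumes the only choices are which of the parallel-computable variables take part in the split query join and where that join runs. The function name `enumerate_candidates` and the tuple layout are hypothetical.

```python
from itertools import combinations

MASTER = 0  # the walkthrough refers to the master server as computer 0


def enumerate_candidates(parallel_vars, slave_id):
    """Return (joined_vars, sent_vars, join_location) tuples.

    joined_vars: variables computed in parallel and combined by the split query join
    sent_vars:   variables sent to the master together with $z instead
    join_location: slave_id, MASTER, or None when no split query join remains
    """
    candidates = []
    for size in range(len(parallel_vars), -1, -1):
        for joined in combinations(parallel_vars, size):
            sent = tuple(v for v in parallel_vars if v not in joined)
            if joined:
                # A remaining join can run on the slave or on the master.
                for location in (slave_id, MASTER):
                    candidates.append((joined, sent, location))
            else:
                # Nothing left to join, so there is no location to choose.
                candidates.append((joined, sent, None))
    return candidates


candidates = enumerate_candidates(("$u", "$v"), slave_id=2)
```

With two variables and one slave this yields seven tuples, which matches the count of local plan candidates 2 and 2-1 to 2-6.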

The local plan selection unit 31 calculates, for each of these local plan candidates, a cost consisting of the estimated execution time or the estimated amount of computation, and selects the local plan with the lowest cost (step S8).

An example of calculating the estimated execution time of a local plan candidate is described here. The estimated execution time is calculated from, for example, the per-operation processing time parameters listed in FIG. 22 and the operations included in the local plan.

The parameters for each operation are, for example, "numeric index: 0.001 msec per output-variable item", "TRAVERSE: 1 msec per input-variable item", "split query join: 1 msec per input-variable item", "inter-server JOIN: 1 msec per input-variable item", "data transmission: 5 msec + (0.001 msec per input-variable item) × number of variables", and "data reception: 5 msec + (0.001 msec per input-variable item) × number of variables". The number of items stored in an output variable is set in advance according to the operation content.
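These parameters can be written down as two small functions. This is only a sketch using the example values quoted above; the names `operation_msec` and `transfer_msec` and the dictionary keys are hypothetical and do not appear in the source.

```python
# Example per-item costs in msec, taken from the parameter list in the text.
PER_ITEM_MSEC = {
    "numeric_index": 0.001,     # per output-variable item
    "traverse": 1.0,            # per input-variable item
    "split_query_join": 1.0,    # per input-variable item
    "inter_server_join": 1.0,   # per input-variable item
}
TRANSFER_FIXED_MSEC = 5.0       # fixed part of data transmission / reception
TRANSFER_PER_ITEM_MSEC = 0.001  # per input-variable item, per variable


def operation_msec(kind, items):
    """Estimated time of a non-transfer operation over `items` items."""
    return PER_ITEM_MSEC[kind] * items


def transfer_msec(items, variables):
    """Estimated time of a data transmission or reception."""
    return TRANSFER_FIXED_MSEC + TRANSFER_PER_ITEM_MSEC * items * variables
```

For instance, `transfer_msec(1000, 1)` gives about 6 msec for sending one variable holding 1000 items.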

An example of calculating the estimated time for the local plan candidate 2 shown in FIG. 12 is described below. FIG. 4 shows that 10,000 XML documents are registered on the computer 2, and the estimated execution time of each operation is calculated from this figure.

(1) When the ">=" operation is used with a numeric index, 10% of all items are estimated to hit. With 10,000 items, 1000 hits are therefore estimated. The estimated time of the numeric index is thus 0.001 msec per output-variable item × 1000 items = 1 msec.

(2) The number of input-variable items for TRAVERSE is estimated at 1000 from the result of (1). The output is assumed to be unchanged and is estimated at 1000 items. The estimated time of TRAVERSE is thus 1 msec per input-variable item × 1000 items = 1000 msec.

(3) The number of input-variable items for data transmission is estimated at 1000 from the result of (2). The estimated time of data transmission is thus 5 msec + 0.001 msec per input-variable item × 1000 items × 1 variable = 6 msec.

(4) The number of input-variable items for TRAVERSE is estimated at 1000 from the result of (2). The output is assumed to be unchanged and is estimated at 1000 items. The estimated time of TRAVERSE is thus 1 msec per input-variable item × 1000 items = 1000 msec.

(5) The number of input-variable items for TRAVERSE is estimated at 1000 from the result of (4). The output is assumed to be unchanged and is estimated at 1000 items. The estimated time of TRAVERSE is thus 1 msec per input-variable item × 1000 items = 1000 msec.

(6) The number of input-variable items for data reception is estimated at 400, because the inter-server JOIN executed on the master server reduces the data to 40%. The estimated time of data reception is thus 5 msec + 0.001 msec per input-variable item × 400 items × 1 variable ≈ 6 msec.

(7) The number of input items for the split query join is estimated at 1400 in total from (5) and (6). The estimated time of the split query join is thus 1 msec per input-variable item × 1400 items = 1400 msec.

(8) After the data is transmitted to the master server in (3), the master server executes the inter-server JOIN of operation number 10 in the distributed plan of FIG. 11. Here, the estimated time of the inter-server JOIN is assumed to have been calculated by the distributed plan generation unit 13, in the same way as the estimated time of a local plan candidate, and sent to the slave servers together with the distributed plan. The distributed plan generation unit 13 calculates on the assumption that the number of input items for the inter-server JOIN is about one third of the number of items registered on the slave servers. The estimated time of the inter-server JOIN is thus 1 msec per input-variable item × 11250 items ÷ 3 = 3750 msec.

In FIG. 11, the intra-server operations of operation numbers 4 and 5, which the split query join operation addition unit 14 identified as executable in parallel with the inter-server JOIN of operation number 10, correspond to the TRAVERSE operations of operation numbers 4 and 5 in the local plan candidate 2. Operation numbers 4 and 5 therefore run in parallel with operation numbers 3 and 6 and the inter-server JOIN, so the estimated time of the local plan candidate 2 is the longer of the following two estimates.

(1) + (2) + (3) + (8) + (6) + (7) = 6163 msec
(1) + (2) + (4) + (5) + (7) = 4401 msec
The estimate is therefore 6163 msec.

As another example, when the local plan candidate 2-3 is executed on the computer 2, none of the processing can run in parallel, so the estimated time is the sum of operations (1), (2), (3), (4), and (5) and the inter-server JOIN. That is,
1 msec + 1000 msec + 1000 msec + 1000 msec + 6 msec + 3750 msec = 6757 msec,
so the local plan candidate 2 is calculated to be faster once the inter-server JOIN is included. By adding the estimated time of the inter-server operation executed on the master server to the estimated time of a local plan candidate in this way, it can be determined whether the optimization by the split query join added by the split query join operation addition unit is effective.
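The arithmetic for the two candidates can be checked with a few lines of code. This sketch simply reuses the per-step figures quoted in steps (1) to (8), including the rounded 6 msec for data reception. The variable names are hypothetical, and taking the minimum at the end is an illustration of the selection in step S8, not the source's implementation.

```python
# Per-step estimates in msec for local plan candidate 2, as quoted in (1)-(8).
step_msec = {1: 1, 2: 1000, 3: 6, 4: 1000, 5: 1000, 6: 6, 7: 1400, 8: 3750}


def path_msec(steps):
    """Total estimated time of a chain of steps executed one after another."""
    return sum(step_msec[s] for s in steps)


# Two chains run in parallel; the slower one determines the estimate.
via_master = path_msec([1, 2, 3, 8, 6, 7])   # send, inter-server JOIN, receive, join
local_only = path_msec([1, 2, 4, 5, 7])      # TRAVERSE steps done on the slave

estimates = {
    "2": max(via_master, local_only),
    # Candidate 2-3 has no parallel part: every step is summed.
    "2-3": 1 + 1000 + 1000 + 1000 + 6 + 3750,
}
best = min(estimates, key=estimates.get)
```

Here `estimates` evaluates to 6163 msec for candidate 2 and 6757 msec for candidate 2-3, so `best` is candidate 2.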

In this embodiment, it is assumed that the computer 1 has selected the local plan candidate 2-3 shown in FIG. 18, the computer 2 the local plan candidate 2 shown in FIG. 12, and the computer 3 the local plan candidate 2-4 shown in FIG. 19 as their respective local plans.

The local plan selection unit 31 of each computer transmits the selected local plan to the distributed plan update unit 15 of the master server. Once the local plan selection unit 31 has transmitted the selected local plan to the distributed plan update unit 15, the slave server executes that local plan (step S9).

In parallel with step S9 on the slave servers, the distributed plan update unit 15 of the master server, having received the local plans, performs the distributed plan update process (step S10).

The distributed plan update process is described here for the case where, for the distributed plan of FIG. 11, the computers 1, 2, and 3 have selected the local plan candidates of FIGS. 18, 12, and 19, respectively, and the distributed plan update unit 15 of the master server updates the distributed plan according to the flowchart of FIG. 23. This process uses a variable i, which is at least 1 and at most the number of target slave servers (1 ≤ i ≤ number of slave servers).

At the start of the process, i = 1. The number of slave servers is set to max (step S900).

The distributed plan update unit 15 obtains the local plan LPlan of computer number i (step S910). It then checks the differences between the distributed plan and the local plan with respect to the split query join (step S920). From this difference check, it determines whether the split query join has been completely deleted in LPlan (step S930).

If the split query join has been completely deleted in LPlan (Yes in step S930), the computer i is deleted from the execution locations of the split query join in the distributed plan and of the data transmission and data reception operations before and after it (step S940). It is then determined whether the execution location of the split query join operation is empty as a result of the deletion (step S950).

If the execution location of the split query join operation is empty (Yes in step S950), the split query join in the distributed plan and the data transmission and data reception operations before and after it are deleted (step S960). The process then proceeds to step S970.

If the execution location of the split query join operation is not empty (No in step S950), the process proceeds to step S970. Likewise, if the split query join has not been completely deleted in LPlan (No in step S930), the process proceeds to step S970. In other words, step S970 is performed when step S930 is No, when step S950 is No, or following step S960. In step S970, it is determined whether the execution location of the split query join has been changed in LPlan.

If the execution location of the split query join has been changed in LPlan (Yes in step S970), the computer i is deleted from the execution locations of the split query join in the distributed plan and of the data transmission and data reception operations before and after it (step S980). It is then determined whether the execution location of the split query join operation is empty as a result of the deletion (step S990).

If the execution location of the split query join operation is empty (Yes in step S990), the split query join in the distributed plan and the data transmission and data reception operations before and after it are deleted (step S1000). The process then proceeds to step S1010.

If the execution location of the split query join operation is not empty (No in step S990), the process proceeds to step S1010. In other words, step S1010 is performed when step S990 is No or following step S1000. In step S1010, it is determined whether a split query join with the execution location changed as in LPlan already exists in the distributed plan.

If no split query join with the changed execution location exists in the distributed plan yet (No in step S1010), the split query join with the changed execution location and the data transmission and data reception operations before and after it are added to the distributed plan (step S1030). The process then proceeds to step S1040.

If a split query join with the changed execution location already exists in the distributed plan (Yes in step S1010), the computer i is added to its execution location (step S1020). The process then proceeds to step S1040.

If the execution location of the split query join has not been changed in LPlan (No in step S970), the process proceeds to step S1040. In other words, step S1040 is performed when step S970 is No, following step S1020, or following step S1030. In step S1040, i + 1 is assigned to i (i := i + 1).

Next, it is determined whether i is less than or equal to max (step S1050). If i is less than or equal to max (Yes in step S1050), the process returns to step S910 and repeats. If i is greater than max, the process ends.
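The loop of FIG. 23 can be condensed into a short sketch. It rests on strong simplifying assumptions that are not in the source: the distributed plan is reduced to a mapping from a join site ("slave" or "master") to the set of computers taking part in the split query join there, and each local plan is reduced to one of three hypothetical labels, "keep", "delete", or "move_to_master". The function name `update_distributed_plan` is likewise hypothetical.

```python
from typing import Dict, Set


def update_distributed_plan(join_sites: Dict[str, Set[int]],
                            choices: Dict[int, str]) -> Dict[str, Set[int]]:
    """Return an updated copy of join_sites after applying every slave's choice."""
    plan = {site: set(members) for site, members in join_sites.items()}
    for computer in sorted(choices):          # S900, S1040, S1050: i = 1 .. max
        choice = choices[computer]            # S910, S920: inspect LPlan
        if choice == "keep":                  # S930 No and S970 No
            continue
        if choice not in ("delete", "move_to_master"):
            raise ValueError(f"unknown choice: {choice}")
        members = plan.get("slave")
        if members is not None:
            members.discard(computer)         # S940 / S980: drop computer i
            if not members:                   # S950 / S990: nobody left?
                del plan["slave"]             # S960 / S1000: drop the join itself
        if choice == "move_to_master":
            # S1010-S1030: reuse the relocated join if present, else create it.
            plan.setdefault("master", set()).add(computer)
    return plan


updated = update_distributed_plan(
    {"slave": {1, 2, 3}},
    {1: "delete", 2: "keep", 3: "move_to_master"},
)
```

With computer 1 dropping the join, computer 2 keeping it, and computer 3 moving it to the master, the original join keeps only computer 2 and a relocated join is added for computer 3.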

FIG. 24 shows the distributed plan obtained by applying the distributed plan update process described above to the distributed plan of FIG. 11 when the local plans of the computers 1, 2, and 3 are the local plan candidates 2-3, 2, and 2-4, respectively.

The distributed plan 3 shown in FIG. 24 has the items "operation number" 312, "partial query number" 313, "operation content" 314, "pre-execution operation number" 315, "execution location" 316, "transmission location" 317, "input variable" 318, and "output variable" 319. The items 312 to 319 of the distributed plan 3 are the same as the items 142 to 149 of the distributed plan table 141 and the items 302 to 309 of the local plan candidates 2 and 2-1 to 2-6.

First, 1 is assigned to i, and the differences with respect to the split query join are checked between the local plan candidate 2-3, which is the local plan selected by the computer 1, and the distributed plan of FIG. 11 (steps S900, S910, and S920). In the local plan candidate 2-3, the split query join of the computer 1 has been completely deleted, so the computer 1 is deleted from the execution locations of the split query join in the distributed plan of FIG. 11 and of the data transmission and data reception operations before and after it (operation numbers 11-15 in FIG. 24) (step S940).

Next, since the execution location of the split query join is not empty, the distributed plan update unit 15 checks the local plan candidate 2-3 to see whether the execution location of the split query join has been changed (steps S950 and S970). Since it has not been changed, 2 is assigned to i (steps S1040 and S1050).

Next, with i = 2, the differences with respect to the split query join are checked between the distributed plan of FIG. 11 and the local plan candidate 2, which is the local plan selected by the computer 2 (step S920). In the local plan candidate 2 shown in FIG. 12, the split query join of the computer 2 has neither been deleted nor had its execution location changed, so 3 is assigned to i (steps S930, S970, S1040, and S1050).

Next, with i = 3, the differences with respect to the split query join are checked between the distributed plan of FIG. 11 and the local plan candidate 2-4, which is the local plan selected by the computer 3 (step S920). In the local plan candidate 2-4, the split query join of the computer 3 has not been deleted, so it is next checked whether the execution location of the split query join has been changed (steps S930 and S970).

In the local plan candidate 2-4, the execution location of the split query join has been changed, so the computer 3 is deleted from the execution locations of the split query join in the distributed plan and of the data transmission and data reception operations before and after it (operation numbers 11-15 in FIG. 24) (step S980). Since the execution location of the split query join is not empty, the distributed plan update unit 15 next checks whether a split query join with the changed execution location has already been inserted into the distributed plan (steps S980 and S1010). It has not, so the split query join with the changed execution location and the data transmission and data reception operations before and after it are inserted into the distributed plan (operation numbers 14, 15, and 16 in FIG. 24) (step S1030). Next, 4 is assigned to i, but since there are only three computers, the process ends (steps S1040 and S1050). The distributed plan 3 of FIG. 24 is obtained as the final result.

The distributed plan execution unit 16 executes the operations of the distributed plan 3 (step S11 in FIG. 6). The execution of the distributed plan 3 and the execution by the local plan execution unit 33 of each slave server depend on each other through data transmission and reception: at some points they run in parallel, and at others one side waits for data to be sent by the other. In other words, the distributed plan is executed while the transmission/reception unit 17 of the master server and the transmission/reception unit 34 of the slave server exchange data.
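The interplay between running in parallel and waiting for the other side can be pictured with two threads and two queues. This is a toy sketch only: the queues stand in for the transmission/reception units, the filter stands in for the inter-server JOIN, the doubling stands in for the slave's parallel work, and every name in it is hypothetical.

```python
import queue
import threading


def run_exchange(rows):
    """Simulate one slave and the master exchanging data while both run."""
    to_master = queue.Queue()
    to_slave = queue.Queue()
    result = {}

    def slave():
        to_master.put(list(rows))           # data transmission to the master
        local = [r * 2 for r in rows]       # work done while the master is busy
        reduced = to_slave.get()            # blocks until the master replies
        result["slave"] = (local, reduced)

    def master():
        received = to_master.get()          # blocks until the slave has sent
        joined = [r for r in received if r % 2 == 0]  # stand-in for the JOIN
        to_slave.put(joined)                # data sent back to the slave

    threads = [threading.Thread(target=slave), threading.Thread(target=master)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return result["slave"]


local_part, reduced_part = run_exchange([1, 2, 3, 4])
```

The slave does its local work without waiting, and only blocks at the point where it needs the master's reduced result.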

As described above, according to this embodiment, for a query input to the master server by a user, the master server optimizes only the part related to inter-server operations, excluding the join processing of partial queries, and the slave servers optimize the joining of partial queries. Because the master server can exclude the partial query join processing from the scope of its optimization, it can be implemented with a simpler mechanism than when everything is optimized on the master server side.

That is, the master server provisionally determines the partial query join processing in a form that does not depend on the partial query join processing decided by the slave servers. Each slave server optimizes the partial query join processing based on its own database information. The slave servers then notify the master server of the result, which allows the master server to generate an efficient distributed plan. In addition, because the slave server side optimizes the split query join operation, an efficient plan suited to each slave server can be generated.

Therefore, according to the embodiment of the present invention, for the query 51, the operations related to the split query join are removed from the scope of the distributed plan determined by the master server. This simplifies the distributed plan optimization mechanism of the master server, while each slave server optimizes that removed scope in the form best suited to itself. Efficient query processing can thereby be realized.

(Second Embodiment)
A distributed database search device according to a second embodiment will be described with reference to the drawings. Components identical to those of the distributed database search device of the first embodiment are given the same reference numerals, and their description is omitted.

FIG. 25 is a configuration diagram showing the functional configuration of the distributed database search device of the second embodiment. As shown in FIG. 25, the distributed database search device of this embodiment adds schema generation units 18 and 35 and schema change units 19 and 36 to the configuration of the distributed database search device shown in FIG. 1.

A schema is a data structure that holds the types and position information of the data to be stored.

The schema generation units 18 and 35 generate the schemas of the data input to and output from the operations performed on each server. For example, the schema generation unit 18 of the master server generates the schema of the input/output data of each operation of the distributed plan, and the schema generation unit 35 of a slave server generates the schema of the input/output data of each operation of the local plan.

The schema change units 19 and 36 change the schema of the data obtained by each operation of the distributed plan or the local plan when that data is passed to the next operation. The schema change unit 19 of the master server changes the schema when data output by an operation of the distributed plan or the local plan is passed to the input of the next operation of the distributed plan. The schema change unit 36 of a slave server changes the schema when data output by an operation of the distributed plan or the local plan is passed to the next operation of the local plan.

FIG. 26 shows an example of the items included in a schema before it is changed by the schema change unit 19 in this embodiment. The schema before the change is the schema generated by the schema generation unit 18, described later. As shown in FIG. 26, the schema in this embodiment has the items "variable area" 401, "variable ~" 402-i (i is an integer of 1 or more), and "extension variable $#e(V1, ..., Vn)" 403.

In "extension variable $#e(V1, ..., Vn)" 403, "$#e" is a unique character string that does not match the "~" of any "variable ~" 402-i, and "(V1, ..., Vn)" holds n arbitrary, mutually unique character strings separated by ",". The "variable area" 401 is an item that stores data for absorbing differences in the number of columns when data is sent between servers. "Variable ~" 402-i is an item that stores the data of one variable, and "extension variable $#e(V1, ..., Vn)" 403 is an item that stores the data of all variables from "V1" to "Vn". Which variables appear as "variable ~" 402-i differs for each piece of input/output data.
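The item layout described above can be pictured with a minimal sketch. The function names, the `"variable area"` label, and the list representation are illustrative assumptions and do not appear in the specification; the ordering (variable area first, extension variable last) follows the schema generation processing described later in this embodiment.

```python
VARIABLE_AREA = "variable area"  # stands in for item 401; the label is illustrative


def extension_item(ext_vars, name="$#e"):
    """Render item 403, e.g. $#e($u,$v); an empty list gives $#e()."""
    return f"{name}({','.join(ext_vars)})"


def schema_items(variables, ext_vars=()):
    """Item 401 first, one item 402-i per variable, item 403 last."""
    return [VARIABLE_AREA, *variables, extension_item(ext_vars)]


print(schema_items(["$x", "$z"]))
print(schema_items(["$z"], ["$u", "$v"]))
```

The second call illustrates a schema in which two variables are carried inside the extension variable rather than as separate variable items.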

An example of the search processing of the distributed database search device of this embodiment is now described with reference to FIG. 27. FIG. 27 is a flowchart showing an example of the search processing of the distributed database search device of this embodiment. The processing of steps S1 to S8 and S10 is the same as in the flowchart shown in FIG. 6, so its description is omitted.

After the local plan has been determined in steps S1 to S8 of FIG. 27, the local plan selection unit 31 performs local plan order determination processing, which fixes the portions of the determined local plan whose execution order is non-deterministic (step S12). A portion whose execution order is non-deterministic is a set of operations in the local plan that share the same prior execution number. Because no order is specified among them, these operations may be executed in any order. The local plan order determination processing uniquely determines their execution order.

FIG. 28 shows the result of applying the local plan order determination processing (step S12) to local plan candidate 2 shown in FIG. 12, so that the order of its non-deterministic set of operations is uniquely determined. In local plan candidate 2 of FIG. 12, the operations with operation numbers 2, 4, and 5 all have prior execution number 1, so their execution order is non-deterministic. In FIG. 28, they are set to execute in the order of operation numbers 2, 4, 5: the prior execution number of operation number 4 is rewritten from 1 to 2, and that of operation number 5 from 1 to 4.
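The rewriting of prior execution numbers in this example can be sketched as follows. The specification does not state how the order among the operations is chosen; chaining them in ascending operation number is an assumption that happens to reproduce the example, and `determine_order` and the dictionary representation are hypothetical names introduced only for illustration.

```python
def determine_order(priors):
    """priors maps an operation number to its list of prior execution numbers.

    Operations that share a single prior execution number are chained in
    ascending operation number (an assumed rule, not taken from the text).
    """
    groups = {}
    for op in sorted(priors):
        if len(priors[op]) == 1:
            groups.setdefault(priors[op][0], []).append(op)
    fixed = {op: list(p) for op, p in priors.items()}
    for siblings in groups.values():
        # Each later sibling now waits for the sibling just before it.
        for prev, op in zip(siblings, siblings[1:]):
            fixed[op] = [prev]
    return fixed


# Only the operations named in the text: 2, 4, and 5 all follow operation 1.
before = {1: [], 2: [1], 4: [1], 5: [1]}
after = determine_order(before)
print(after)
```

With this input, operation 4 is rewritten to follow operation 2 and operation 5 to follow operation 4, matching the rewriting described for FIG. 28.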

After the local plan order determination processing in step S12, the local plan whose operation order has been determined is transmitted to the master server by the transmission unit 34. When the master server receives the local plan, the distributed plan update unit 10 performs distributed plan update processing (step S10). In parallel with step S10, the schema generation unit 35 of the slave server generates an input schema and an output schema (hereinafter, input/output schemas) for each operation in the local plan (step S14). An input schema is the schema of the data input to an operation; an output schema is the schema of the data the operation outputs.

The operation of the schema generation unit 35 when it performs the schema generation processing (step S14) on the local plan shown in FIG. 28 is now described in detail with reference to FIG. 29.

This schema generation processing uses a variable i. i is an integer of 1 or more and no greater than the largest operation number in the target local plan (1 ≤ i ≤ maximum operation number of the local plan); i = 1 when the schema generation processing starts. The maximum operation number of the local plan is denoted "max", and the operation of the local plan whose operation number 301 is i is denoted operation S.

On receiving the local plan from the local plan selection unit 31, the schema generation unit 35 first initializes i = 1 and max = the maximum operation number of the local plan (step S1100). It then determines whether i is less than or equal to max (step S1110).

If i is less than or equal to max (Yes in step S1110), the operation S with operation number i is acquired (step S1120). It is then determined whether operation S has an output variable (step S1130).

If operation S has an output variable (Yes in step S1130), the output variables of operation S are added to its output schema and to the variable list VL, and a reaching operation number list RL and a usage operation number list UL are prepared for each variable (step S1140). Next, a list AL consisting of the operation number of S and the operation numbers executed after S is obtained from the prior execution numbers of the operations (step S1150). Then, for each output variable of operation S, the operation numbers stored in list AL are registered in that variable's reaching operation number list RL (step S1160). The process then proceeds to step S1170. If operation S has no output variable (No in step S1130), the process also proceeds to step S1170.

In step S1170, it is determined whether operation S has an input variable.

If operation S has an input variable (Yes in step S1170), the prior execution numbers are traced back from operation S to obtain a list BL of the operation numbers executed before S (step S1180). Then, for each input variable of operation S, the operation numbers stored in list BL are registered in that variable's usage operation number list UL (step S1200). The process then proceeds to step S1210. If operation S has no input variable (No in step S1170), the process also proceeds to step S1210.

In step S1210, i is incremented (i := i + 1). The process then returns to step S1110.

If i is greater than max (No in step S1110), the process proceeds to step S1220.

From this point a variable j is used. j is 1 or more and no greater than the number of elements in the variable list VL (1 ≤ j ≤ number of elements in VL). In step S1220, j is set to 1 and vmax is set to the number of elements in VL. It is then determined whether j is less than or equal to vmax (step S1230).

If j is less than or equal to vmax (Yes in step S1230), the j-th variable var is taken from the variable list VL (step S1240). Next, the list CL of operation numbers that appear in both the reaching operation number list RL and the usage operation number list UL of var is obtained (step S1250). The variable var is then added to the output schema of each operation whose operation number is in list CL (step S1260). Next, j is incremented (j := j + 1) (step S1270), and the process returns to step S1230.

If j is greater than vmax (No in step S1230), i is set to 1 (i := 1) (step S1280). It is then determined whether i is less than or equal to max (step S1290).

If i is less than or equal to max (Yes in step S1290), the operation S with operation number i is acquired (step S1300). Next, the output schema of the operation indicated by each prior execution number of S is copied into the input schema of S (step S1310). The variable area is then added at the head of the output schema of S, and the extension variable $#e() at its tail (step S1320). Next, i is incremented (i := i + 1) (step S1330), and the process returns to step S1290.

If i is greater than max (No in step S1290), the processing ends. The flowchart of FIG. 29 applies equally to the schema generation processing S15 in the schema generation unit 18 of the master server; the only difference is that the target is the distributed plan rather than the local plan.
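The three passes of FIG. 29 can be sketched in Python as below. This is an illustrative reading of the flowchart, not the implementation of the embodiment: the `PLAN` table is reconstructed from the worked description of the FIG. 28 local plan later in this section (the figure itself is not reproduced here), and all function and variable names are assumptions.

```python
from collections import defaultdict

VARIABLE_AREA, EXTENSION = "variable area", "$#e()"

# operation number -> (prior execution numbers, input variables, output variables)
PLAN = {
    1: ([], [], ["$x"]),
    2: ([1], ["$x"], ["$z"]),
    3: ([2], ["$z"], ["$z"]),
    4: ([2], ["$x"], ["$u"]),
    5: ([4], ["$x"], ["$v"]),
    6: ([3], ["$z"], ["$z"]),
    7: ([5, 6], ["$z"], []),
    8: ([7], ["$u", "$v"], ["$u", "$v"]),
}


def executed_before(plan, op):
    """Operations reached by tracing prior execution numbers back from op (list BL)."""
    seen, stack = set(), list(plan[op][0])
    while stack:
        p = stack.pop()
        if p not in seen:
            seen.add(p)
            stack.extend(plan[p][0])
    return seen


def add(items, var):
    if var not in items:
        items.append(var)


def generate_schemas(plan):
    ops = sorted(plan)
    before = {op: executed_before(plan, op) for op in ops}
    after = {op: {o for o in ops if op in before[o]} for op in ops}
    out_schema = {op: [] for op in ops}
    var_list, reach, use = [], defaultdict(set), defaultdict(set)  # VL, RL, UL

    for op in ops:                                  # first pass: S1110 to S1210
        _, inputs, outputs = plan[op]
        for var in outputs:
            add(out_schema[op], var)                # S1140
            add(var_list, var)
            reach[var] |= {op} | after[op]          # S1150, S1160 (list AL)
        for var in inputs:
            use[var] |= before[op]                  # S1180, S1200 (list BL)

    for var in var_list:                            # second pass: S1230 to S1270
        for op in sorted(reach[var] & use[var]):    # list CL
            add(out_schema[op], var)                # S1260

    in_schema = {}
    for op in ops:                                  # third pass: S1290 to S1330
        in_schema[op] = [list(out_schema[p]) for p in plan[op][0]]     # S1310
        out_schema[op] = [VARIABLE_AREA, *out_schema[op], EXTENSION]   # S1320
    return in_schema, out_schema


in_schema, out_schema = generate_schemas(PLAN)
for op in sorted(PLAN):
    print(op, in_schema[op], out_schema[op])
```

Intersecting RL and UL keeps a variable in an output schema only at operations that lie both at or after a point where it is produced and before a point where it is consumed, so variables that are no longer needed are not carried along. The printed table is what this sketch produces for the reconstructed plan; it is not claimed to reproduce FIG. 30 item for item.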

FIG. 30 shows an example of the input/output schemas obtained by applying the schema generation processing described above to the local plan produced by the local plan order determination processing shown in FIG. 28. The input/output schema table 500 shown in FIG. 30 has the items operation number 501, input schema 1, input schema 2, and output schema 504.

The operation number 501 of the input/output schema table 500 stores the operation number corresponding to that of the plan on which the schema generation processing was performed.

Input schema 1 stores the first input schema generated by the schema generation processing, and input schema 2 stores the second input schema generated by the schema generation processing.

The output schema 504 stores the output schema generated by the schema generation processing.

The processing by which the schema generation unit 35 computes this input/output schema table 500 is now described concretely. First, the local plan of FIG. 28 is passed as the input of the schema generation processing. Next, 1 is assigned to i and 8 to max (step S1100). Since i is less than or equal to max, the operation S with operation number 1 is acquired (steps S1110, S1120). Operation S has the output variable $x, so $x is added to the output schema 504 and to the variable list VL, and a reaching operation number list RL and a usage operation number list UL are prepared for $x (steps S1130, S1140). Next, by tracing the prior execution numbers, operation number 1 of operation S and operation numbers 2 to 8, which are executed after operation S, are stored in the RL (steps S1150, S1160). Since operation S has no input variable, 2 is assigned to i and the operation S with operation number 2 is acquired (steps S1210, S1110, S1120).

Operation S has the output variable $z, so $z is added to the output schema 504 and to the variable list VL, and a reaching operation number list RL and a usage operation number list UL are prepared for $z (steps S1130, S1140). Next, by tracing the prior execution numbers, operation number 2 of operation S and operation numbers 3 to 8, which are executed after operation S, are stored in the RL (steps S1150, S1160). Since operation S has the input variable $x, operation number 1, which is executed before S, is registered in the UL of $x (steps S1170, S1180, S1200). Next, the operation S with operation number 3 is acquired (steps S1210, S1110, S1120).

Operation S has the output variable $z, so $z is added to the output schema 504 and to the variable list VL, and an RL and a UL are prepared for $z (steps S1130, S1140). Next, by tracing the prior execution numbers, operation number 3 of operation S and operation numbers 6 to 8, which are executed after operation S, are stored in the RL (steps S1150, S1160). Since operation S has the input variable $z, operation numbers 1 and 2, which are executed before S, are registered in the UL of $z (steps S1130, S1170, S1180, S1200). Next, the operation S with operation number 4 is acquired (steps S1210, S1110, S1120).

Operation S has the output variable $u, so $u is added to the output schema 504 and to the variable list VL, and an RL and a UL are prepared for $u (steps S1130, S1140). Next, by tracing the prior execution numbers, operation number 4 of operation S and operation numbers 5, 7, and 8, which are executed after operation S, are stored in the RL (steps S1150, S1160). Since operation S has the input variable $x, operation numbers 1 and 2, which are executed before S, are registered in the UL of $x (steps S1170, S1180, S1200). Next, the operation S with operation number 5 is acquired (steps S1210, S1110, S1120).

Operation S has the output variable $v, so $v is added to the output schema 504 and to the variable list VL, and an RL and a UL are prepared for $v (steps S1130, S1140). Next, by tracing the prior execution numbers, operation number 5 of operation S and operation numbers 7 and 8, which are executed after operation S, are stored in the RL (steps S1150, S1160). Since operation S has the input variable $x, operation numbers 1, 2, and 4, which are executed before S, are registered in the UL of $x (steps S1170, S1180, S1200). Next, the operation S with operation number 6 is acquired (steps S1210, S1110, S1120).

Operation S has the output variable $z, so $z is added to the output schema 504 and to the variable list VL, and an RL and a UL are prepared for $z (steps S1130, S1140). Next, by tracing the prior execution numbers, operation number 6 of operation S and operation numbers 7 and 8, which are executed after operation S, are stored in the RL (steps S1150, S1160). Since operation S has the input variable $z, operation numbers 1 to 3, which are executed before S, are registered in the UL of $z (steps S1130, S1170, S1180, S1200). Next, the operation S with operation number 7 is acquired (steps S1210, S1110, S1120).

Operation S has no output variable but has the input variable $z, so operation numbers 1 to 6, which are executed before S, are registered in the UL of $z (steps S1130, S1170, S1180, S1200). Next, the operation S with operation number 8 is acquired (steps S1210, S1110, S1120).

Operation S has the output variables $u and $v, so $u and $v are added to the output schema 504 and to the variable list VL, and an RL and a UL are prepared for $u and $v (steps S1130, S1140). Next, by tracing the prior execution numbers, operation number 8 of operation S is stored in the RL (steps S1150, S1160). Since operation S has the input variables $u and $v, operation numbers 1 to 7, which are executed before S, are registered in the UL of $u and of $v (steps S1130, S1170, S1180, S1200). Since i now exceeds max, 1 is assigned to j and 4, the number of elements in VL, is assigned to vmax (steps S1210, S1110, S1220). Next, the variable $x, the first element of the variable list, is acquired (steps S1230, S1240).

The UL of the variable $x contains 1, 2, and 4 and its RL contains 1 to 8, so $x is added to the output schema 504 of operation numbers 1, 2, and 4, which appear in both lists (steps S1250, S1260). Next, the second variable of VL, $z, is acquired (steps S1270, S1230, S1240).

The UL of the variable $z contains 1 to 6 and its RL contains 2 to 8, so $z is added to the output schema 504 of operation numbers 2 to 6, which appear in both lists (steps S1250, S1260). Next, the third variable of VL, $u, is acquired (steps S1270, S1230, S1240).

The UL of the variable $u contains 1 to 7 and its RL contains 4, 5, 7, and 8, so $u is added to the output schema 504 of operation numbers 4, 5, and 7, which appear in both lists (steps S1250, S1260). Next, the fourth variable of VL, $v, is acquired (steps S1270, S1230, S1240).

The UL of the variable $v contains 1 to 7 and its RL contains 5, 7, and 8, so $v is added to the output schema 504 of operation numbers 5 and 7, which appear in both lists (steps S1250, S1260). Next, 1 is assigned to i and the operation S with operation number 1 is acquired (steps S1270, S1230, S1280).
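The four intersections in this pass can be checked with a few lines of Python. The sets below are copied from the lists stated in the text; the `span` helper and the dictionary names are introduced only for this check.

```python
def span(first, last):
    """Inclusive range of operation numbers, e.g. span(2, 4) -> {2, 3, 4}."""
    return set(range(first, last + 1))


# Usage lists (UL) and reaching lists (RL) as stated for each variable.
UL = {"$x": {1, 2, 4}, "$z": span(1, 6), "$u": span(1, 7), "$v": span(1, 7)}
RL = {"$x": span(1, 8), "$z": span(2, 8), "$u": {4, 5, 7, 8}, "$v": {5, 7, 8}}

# CL: operation numbers appearing in both lists (step S1250).
CL = {var: sorted(UL[var] & RL[var]) for var in UL}
print(CL)
```

Each entry of `CL` lists the operations whose output schema receives the variable in step S1260.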

Operation number 1 has no input, so the variable area and the extension variable $#e() are added to its output schema 504, and the operation S with operation number 2 is acquired (steps S1300 to S1320, S1330, S1290).

The output schema 504 of operation number 1 is copied into input schema 1 of operation number 2, the variable area and the extension variable $#e() are added to the output schema 504, and the operation S with operation number 3 is acquired (steps S1300 to S1320, S1330, S1290).

The output schema 504 of operation number 2 is copied into input schema 1 of operation number 3, the variable area and the extension variable $#e() are added to the output schema 504, and the operation S with operation number 4 is acquired (steps S1300 to S1320, S1330, S1290).

The output schema 504 of operation number 2 is copied into input schema 1 of operation number 4, the variable area and the extension variable $#e() are added to the output schema 504, and the operation S with operation number 5 is acquired (steps S1300 to S1320, S1330, S1290).

The output schema 504 of operation number 4 is copied into input schema 1 of operation number 5, the variable area and the extension variable $#e() are added to the output schema 504, and the operation S with operation number 6 is acquired (steps S1300 to S1320, S1330, S1290).

The output schema 504 of operation number 3 is copied into input schema 1 of operation number 6, the variable area and the extension variable $#e() are added to the output schema 504, and the operation S with operation number 7 is acquired (steps S1300 to S1320, S1330, S1290).

The output schema 504 of operation number 5 is copied into input schema 1 of operation number 7 and the output schema 504 of operation number 6 into its input schema 2, the variable area and the extension variable $#e() are added to the output schema 504, and the operation S with operation number 8 is acquired (steps S1300 to S1320, S1330, S1290).

The output schema 504 of operation number 7 is copied into input schema 1 of operation number 8, the variable area and the extension variable $#e() are added to the output schema 504, and the processing ends (steps S1300 to S1320, S1330, S1290). An example of the output of the schema generation processing is the input/output schema table 500 shown in FIG. 30. In FIG. 30, a single input schema is stored in the input schema 1 item, and when there are two, the second is stored in the input schema 2 item; the number of input schema items may be increased or decreased as needed.

The processing of the master server that runs in parallel with step S14 is now described. The distributed plan generation unit 23 performs the distributed plan update processing (step S10) and, after updating the distributed plan, performs distributed plan order determination processing, which fixes the portions of the updated distributed plan whose execution order is non-deterministic (step S13). The distributed plan order determination processing of step S13 is the same as step S12 except that the target plan is the distributed plan rather than the local plan.

Next, the schema generation unit 18 generates the input/output schemas of each operation in the determined distributed plan (step S15). The schema generation processing of step S15 is the same as step S14 except that the target plan is the distributed plan rather than the local plan.

Once the schema generation processing has been performed on the master server and on the slave server, each of them executes its plan based on the generated input/output schemas. That is, the local plan execution unit 33 of the slave server executes the local plan based on the input/output schemas generated by the schema generation processing (step S14) (step S9), and the distributed plan execution unit 16 of the master server executes the distributed plan based on the schemas generated by the schema generation processing (step S15) (step S11).

In the local plan execution (step S9), when the local plan execution unit 33 performs each operation of the local plan, it passes the input data and the prepared input schema to the schema change unit 36, which performs the schema change processing (step S16). Likewise, in the distributed plan execution (step S11), when the distributed plan execution unit 16 performs each operation of the distributed plan, it passes the input data and the prepared input schema to the schema change unit 19, which performs the schema change processing (step S16).

The schema change processing that the schema change unit 19 of the master server performs in step S16 when the distributed plan is executed is now described with reference to FIG. 31.

The schema change unit 19 acquires data D, which has schema S and is the input of the operation that the distributed plan execution unit 16 will execute next, and the input schema T of that operation, prepared in advance by the schema generation unit 18 (step S1400). It then determines whether the variable items of schema S and schema T match (step S1410).

If the variable items of schema S and schema T do not match (No in step S1410), the variable list VL of the extension variable $#e of schema S is acquired, and it is determined whether the list is empty (step S1420).

If the variable list VL of the extension variable $#e of schema S is not empty (No in step S1420), each variable in VL is added to schema S as a variable item, and VL is emptied (step S1430). Next, a list DL of the variable items that exist in schema S but not in schema T is acquired (step S1440), and the process proceeds to step S1450. If the variable list VL of the extension variable $#e of schema S is empty (Yes in step S1420), step S1430 is skipped: the list DL is acquired in the same way (step S1440) and the process proceeds to step S1450.

In step S1450, it is determined whether the variables in the item list DL are arranged non-contiguously in schema S.

If the variables in the item list DL are arranged non-contiguously in schema S (Yes in step S1450), schema S and each record of data D are rewritten so that the variables in DL become contiguous (step S1460). The process then proceeds to step S1470. If the variables in DL are not arranged non-contiguously in schema S (No in step S1450), the process also proceeds to step S1470.

In step S1470, the variables in list DL are removed from the variable items of schema S and added to the variable list of the extension variable $#e. Next, the total size of the variables in list DL is stored in the variable area item of data D, and the processing ends (step S1480). The processing also ends when the variable items of schema S and schema T match (Yes in step S1410). The flowchart of FIG. 31 applies equally when the schema change unit 36 processes an operation of the local plan execution unit 33; the only difference is whether the target is the local plan or the distributed plan.
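The flow of FIG. 31 can be sketched as follows. The data model is an assumption made for illustration: a schema is a list of named variable items plus the list held in $#e(...), and each record of data D is a dictionary with a `variable_area` size and a `cells` mapping from variable name to bytes. The specification does not say where the variables of DL are placed when they are made contiguous in step S1460; moving them to the tail is likewise an assumption, as are the example variable names and values.

```python
from dataclasses import dataclass, field


@dataclass
class Schema:
    variables: list                           # named variable items, in column order
    ext: list = field(default_factory=list)   # variables held in $#e(...)


def change_schema(s, rows, t_vars):
    """Adapt schema s and its rows in place to the target variable items t_vars."""
    if s.variables == t_vars:                                 # S1410: already matching
        return
    if s.ext:                                                 # S1420: list not empty
        s.variables.extend(s.ext)                             # S1430
        s.ext = []
    dl = [v for v in s.variables if v not in t_vars]          # S1440: list DL
    pos = [s.variables.index(v) for v in dl]
    if pos and pos[-1] - pos[0] + 1 != len(pos):              # S1450: non-contiguous
        # S1460 (assumed layout): keep the other variables in order, move DL to the tail.
        s.variables = [v for v in s.variables if v not in dl] + dl
        for row in rows:
            row["cells"] = {v: row["cells"][v] for v in s.variables}
    s.variables = [v for v in s.variables if v not in dl]     # S1470
    s.ext.extend(dl)
    for row in rows:                                          # S1480
        row["variable_area"] = sum(len(row["cells"][v]) for v in dl)


# Hypothetical data: the sender's schema carries $u and $v, the receiver wants only $z.
s = Schema(["$z", "$u", "$v"])
rows = [{"variable_area": 0, "cells": {"$z": b"ab", "$u": b"cde", "$v": b"f"}}]
change_schema(s, rows, ["$z"])
print(s, rows[0]["variable_area"])
```

In this reading, the receiving operation sees only the variable items it expects, while the variable area records how many bytes of extra columns travel inside the extension variable.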

With reference to FIG. 31, the following describes concretely how the schema change unit 19 performs the schema change process on the input schema of operation number 7 of computer 0 and on the schemas of the data transmitted from computer 1, computer 2, and computer 3, all shown in FIG. 26. This schema change process is performed when the receive operation of operation number 7 of the distributed plan is executed, in the case where the distributed plan is that of FIG. 24 and the local plans of computer 1, computer 2, and computer 3 are the local plan candidates shown in FIG. 12, FIG. 18, and FIG. 16, respectively.

First, the input schema T of operation number 7 of computer 0 in FIG. 26 and the schema S of the data transmitted from computer 1 are obtained (step S1400). Because the variable items of schema S and schema T match, the process ends (step S1410).

Next, the input schema T of operation number 7 of computer 0 and the schema S of the data transmitted from computer 2 are obtained (step S1400). The variable items of schema S and schema T do not match, and the variable list of the extension variable $#e of S is empty, so $u and $v are obtained as the variables that exist in schema S but not in schema T (steps S1410, S1420, and S1440). Because the variables $u and $v are arranged contiguously in schema S, they are deleted from the items of schema S and added to the variable list of the extension variable $#e (steps S1450 and S1470). Finally, the total size of the variables $u and $v is stored in the extension area (step S1480).

Next, the input schema T of operation number 7 of computer 0 and the schema S of the data transmitted from computer 3 are obtained (step S1400). The variable items of schema S and schema T do not match, and the variable list of the extension variable $#e of S is empty, so $u is obtained as the variable that exists in schema S but not in schema T (steps S1410, S1420, and S1440). Because the variable $u is arranged contiguously in schema S, it is deleted from the items of schema S and added to the variable list of the extension variable $#e (steps S1450 and S1470). Finally, the total size of the variable $u is stored in the extension area (step S1480). FIG. 32 shows the result of the schema change process, as an example of the items included in the schema after the process. As shown in FIG. 32, in this embodiment the schema after the schema change process has the items "variable area" 401, "variable ~" 402, and "extension variable $#e (V1, ..., Vn)" 403.

The schema change process above was described for the case where data is passed between an operation of the master server's distributed plan and an operation of a slave server's local plan, but it can also be applied when data is passed between operations within the same server or between slave servers. In FIG. 24, the variable $z is to be received before the inter-server JOIN of operation number 6.

However, as can be seen from the local plan candidates in FIG. 18, FIG. 12, and FIG. 19, computer 1 transmits three sets of data for the variables $z, $u, and $v, computer 2 transmits the variable $z, and computer 3 transmits two sets of data for the variables $z and $u. Left as they are, these would have to be handled as tables with different numbers of columns. The schema change unit 19 of the master server therefore changes the schemas of the data of each computer shown in FIG. 26 so that, as shown in FIG. 32, the data of all computers can be regarded as having the same schema. In FIG. 24, the inter-server JOIN of operation number 6 needs only the variable $z, so any other variables that are included are all handled as a single column, the extension variable $#e, which stores one piece of variable-length data. The variable area stores the size of this variable-length region. As a result, all the data can be handled as tables of the same shape when the inter-server JOIN is executed.
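The effect of this uniform shape can be illustrated with a short Python sketch. It is a simplified illustration under stated assumptions, not the method of the embodiment: the function name `to_uniform` is hypothetical, column values are modeled as byte strings, and each row is copied into a new tuple for clarity, whereas the embodiment rewrites only the schema or a partial area of the data.

```python
def to_uniform(columns, row, needed=("$z",)):
    """Return a row shaped as (variable area, needed values..., $#e bytes).

    columns: variable names of the sender's schema, in column order
    row:     one bytes value per column
    needed:  variables the receiving operation actually uses
    """
    kept = [row[columns.index(v)] for v in needed]
    # Every other column is folded into a single variable-length value.
    extra = [row[i] for i, c in enumerate(columns) if c not in needed]
    ext = b"".join(extra)
    # The variable area records the size of the folded variable-length data.
    return (len(ext), *kept, ext)


# Three senders with one, two, and three columns (hypothetical values).
a = to_uniform(["$z", "$u", "$v"], [b"z1", b"u1", b"v1"])
b = to_uniform(["$z"], [b"z2"])
c = to_uniform(["$z", "$u"], [b"z3", b"u3"])
```

All three results have the same three fields, so a join on `$z` can treat them as rows of a single table regardless of how many extra variables each sender included.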

According to the distributed database search device of this embodiment, a local plan is created for each slave server, so part of the distributed plan may differ from one slave server to another. That is, the schema of the data that is sent may differ for each slave server. In such a case, when data is passed, the schema change units 19 and 36 of this embodiment reconcile the difference between the schema of the data on the sending side (input schema) and the schema of the received data (output schema), so that the data can be handled as data of the same schema.

In other words, when an efficient local plan is created for each slave server, data with several different schemas may exist. Creating an efficient local plan for each slave server therefore requires processing that handles data of several different schemas in a unified manner.

As described above, this embodiment makes it possible to handle data of several different schemas in a unified manner. Moreover, in this embodiment this unified handling can be achieved by rewriting only the schema, or only the schema and a partial area of the data.

Embodiments of the present invention have been described above, but these embodiments are presented as examples and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the gist of the invention. These embodiments and their modifications are included in the scope and gist of the invention, and are included in the invention described in the claims and its equivalents.

DESCRIPTION OF SYMBOLS: 11 ... syntax analysis unit, 12 ... query division unit, 13 ... distributed plan generation unit, 14 ... distributed plan join operation addition unit, 15 ... distributed plan update unit, 16 ... distributed plan execution unit, 17 ... transmission/reception unit, 18 ... schema generation unit, 19 ... schema change unit, 31 ... local plan selection unit, 32 ... local plan candidate generation unit, 33 ... local plan execution unit, 34 ... transmission/reception unit, 35 ... schema generation unit, 36 ... schema change unit

Claims

A search method for a distributed database search device in which a plurality of slave servers each having a database for storing data are connected to a master server that stores management information of the databases and searches for the data based on a query, the search method comprising:
generating, based on the query, a distributed plan for retrieving data stored in the plurality of databases, the distributed plan including intra-server operation processing that runs on each of the slave servers, or including the intra-server operation processing and inter-server operation processing that collects data from the plurality of slave servers and runs on the master server;
when the distributed plan contains an operation that can be executed in parallel with the inter-server operation processing and an operation that requires both the result of that operation and the result of the inter-server operation processing, adding to the distributed plan a split query join operation that causes the parallel-executable operation and the inter-server operation to be executed in parallel and joins the data obtained by the parallel-executable operation with the data obtained by the inter-server operation;
extracting a plan related to the slave server from the distributed plan;
generating a local plan candidate in which the split query join operation included in the extracted plan and the data transmission/reception operations related to the split query join operation are changed;
generating a new local plan candidate in which the split query join operation included in the generated local plan candidate and the data transmission/reception operations related to the split query join operation are changed;
calculating an operation cost of each of the generated local plan candidates and the extracted plan, and selecting the plan with the minimum operation cost as a local plan; and
updating the distributed plan based on the selected local plan.

A distributed database search device in which a plurality of slave servers each having a database for storing data are connected to a master server that searches the databases based on a query, wherein
the master server comprises:
a storage unit that stores management information of each database of the slave servers;
a distributed plan generation unit that generates, based on the query, a distributed plan for retrieving data stored in the plurality of databases, the distributed plan including intra-server operation processing that runs on each of the slave servers, or including the intra-server operation processing and inter-server operation processing that collects data from the plurality of slave servers and runs on the master server;
a split query join operation addition unit that, when the distributed plan contains an operation that can be executed in parallel with the inter-server operation processing and an operation that requires both the result of that operation and the result of the inter-server operation processing, adds to the distributed plan a split query join operation that causes the parallel-executable operation and the inter-server operation to be executed in parallel and joins the data obtained by the parallel-executable operation with the data obtained by the inter-server operation; and
a distributed plan update unit that updates the distributed plan based on a local plan received from the slave server, and
the slave server comprises:
a local plan candidate generation unit that extracts a plan related to the slave server from the distributed plan and generates a local plan candidate in which the split query join operation included in the extracted plan and the data transmission/reception operations related to the split query join operation are changed; and
a local plan selection unit that calculates an operation cost of each of the generated local plan candidates and the extracted plan, and selects the plan with the minimum operation cost as a local plan.

The distributed database search device, further comprising a schema change unit that, in said server operation, when the columns of the data transmitted and received differ between the master server and the slave server, treats those columns as a column in which variable-length data is stored.

A master server that is connected to a plurality of slave servers each having a database for storing data and that constitutes a distributed database device for searching the databases based on an input query, the master server comprising:
a storage unit that stores management information of each database of the slave servers;
a distributed plan generation unit that generates, based on the query, a distributed plan for retrieving data stored in the plurality of databases, the distributed plan including intra-server operation processing that runs on each of the slave servers, or including the intra-server operation processing and inter-server operation processing that collects data from the plurality of slave servers and runs on the master server;
a split query join operation addition unit that, when the distributed plan contains an operation that can be executed in parallel with the inter-server operation processing and an operation that requires both the result of that operation and the result of the inter-server operation processing, adds to the distributed plan a split query join operation that causes the parallel-executable operation and the inter-server operation to be executed in parallel and joins the data obtained by the parallel-executable operation with the data obtained by the inter-server operation; and
a distributed plan update unit that updates the distributed plan based on a local plan received from the slave server.

A slave server that has a database and is one of a plurality of slave servers connected to a master server that searches the database based on an input query, thereby constituting a distributed database search system, the slave server comprising:
a local plan candidate generation unit that extracts a plan related to the slave server from the distributed plan received from the master server and generates a local plan candidate in which the split query join operation included in the extracted plan and the data transmission/reception operations related to the split query join operation are changed; and
a local plan selection unit that calculates an operation cost of each of the generated local plan candidates and the extracted plan, and selects the plan with the minimum operation cost as a local plan.

The slave server according to claim 5, wherein the operation cost of the split query join operation added by the split query join operation adding unit is included in the operation cost of the local plan candidate calculated by the local plan selection unit.

A program for a distributed database search device in which a plurality of slave servers each having a database for storing data are connected to a master server that stores management information of the databases and searches for the data based on a query, the program causing a computer to realize:
a function of generating, based on the query, a distributed plan for retrieving data stored in the plurality of databases, the distributed plan including intra-server operation processing that runs on each of the slave servers, or including the intra-server operation processing and inter-server operation processing that collects data from the plurality of slave servers and runs on the master server;
a function of adding to the distributed plan, when the distributed plan contains an operation that can be executed in parallel with the inter-server operation processing and an operation that requires both the result of that operation and the result of the inter-server operation processing, a split query join operation that causes the parallel-executable operation and the inter-server operation to be executed in parallel and joins the data obtained by the parallel-executable operation with the data obtained by the inter-server operation;
a function of updating the distributed plan based on a local plan received from the slave server;
a function of extracting a plan related to the slave server from the distributed plan and generating a local plan candidate in which the split query join operation included in the extracted plan and the data transmission/reception operations related to the split query join operation are changed; and
a function of calculating an operation cost of each of the generated local plan candidates and the extracted plan, and selecting the plan with the minimum operation cost as a local plan.