JP2016095561A

JP2016095561A - Control device, distributed database system, method, and program

Info

Publication number: JP2016095561A
Application number: JP2014229618A
Authority: JP
Inventors: 和世成田; Kazuyo Narita
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2014-11-12
Filing date: 2014-11-12
Publication date: 2016-05-26
Anticipated expiration: 2034-11-12
Also published as: JP6398632B2

Abstract

PROBLEM TO BE SOLVED: To reduce a total processing time for a work load that accompanies issuance of a query.SOLUTION: A control device according to the present invention comprises: query set acceptance means for accepting a query set including one or more queries for an object data group that is a prescribed data group stored separately in a plurality of data storage means; processing cost estimation means for estimating, on the basis of the meta-data of the object data group, a first processing cost that is a processing cost when a query session is executed without redividing/allocating the object data group and a second processing cost that is a processing cost when the query session is executed after redividing/allocating the object data group by a prescribed method; redivision/allocation determination means for determining, on the basis of the estimation result by the processing cost estimation means, whether or not to redivide/allocate the object data group, and, when redividing/allocating the object data group, what method should be used to redivide/allocate the object data group.SELECTED DRAWING: Figure 14

Description

本発明は、分散データベースシステム、データの分割配置（partition）を制御する制御装置、分割配置制御方法および分割配置制御用プログラムに関する。 The present invention relates to a distributed database system, a control device that controls data partitioning, a partition placement control method, and a partition placement control program.

機械学習等を用いて大量のデータを分析する分析処理の前処理として、分析対象のデータ群に対して抽出、集約、整形などを行い、分析処理が処理しやすい形（例えば、行列、計画行列、属性データ付の任意のデータ型等）に整える作業を行う場合がある。このような作業は、一般に属性生成処理（feature engineering）と呼ばれる。 As preprocessing of analysis processing that analyzes a large amount of data using machine learning, etc., the data group to be analyzed is extracted, aggregated, shaped, etc., so that the analysis processing is easy to process (for example, matrix, planning matrix) , Any data type with attribute data, etc.) may be performed. Such work is generally referred to as attribute generation processing (feature engineering).

属性生成処理はデータベースシステムを利用して行われることが多い。データベースシステムを利用する場合、例えば、ユーザまたは属性生成処理を行う手段は、分析対象とされるデータ群を元データとしてデータベースシステムの記憶領域に格納させた上で、該データベースシステムに対してＳＱＬといった一般にクエリと呼ばれる問い合わせ言語を発行して、所望のデータ操作を行う。元データが膨大である場合には、分散データベースシステムが利用される。 The attribute generation process is often performed using a database system. When using the database system, for example, the user or the means for performing attribute generation processing stores the data group to be analyzed as original data in the storage area of the database system, and then executes SQL for the database system. A query language generally called a query is issued to perform desired data manipulation. When the original data is enormous, a distributed database system is used.

分散データベースシステムは、データ群を細かいデータブロックに分割して複数のデータノードに格納する、いわゆる分割配置（partition）を行って、分割配置したデータブロックの各々に対して並列にクエリに基づくデータ操作を行うシステムである。分散データベースシステムに実装されるデータ操作用のオープンソースソフトウェアとして、Ｈｉｖｅ、Ｉｍｐａｌａ、Ｐｒｅｓｔｏなどが広く知られている。 A distributed database system divides a data group into fine data blocks and stores them in a plurality of data nodes, so-called partitioning (partition), and data operation based on a query in parallel for each of the partitioned data blocks It is a system that performs. As open source software for data manipulation implemented in a distributed database system, Hive, Impala, Presto, and the like are widely known.

属性生成処理では、分析処理が処理しやすいデータの形を得るために、一度に大量のクエリが発行される。そのため、属性生成処理は、分析タスクの中でも時間のかかりやすい処理といえる。分析タスク全体の効率化を図るためには、属性生成処理の効率化が求められている。 In the attribute generation process, a large number of queries are issued at a time in order to obtain a form of data that can be easily processed by the analysis process. Therefore, it can be said that the attribute generation process is a time-consuming process in the analysis task. In order to improve the efficiency of the entire analysis task, it is required to improve the efficiency of attribute generation processing.

分散データベースシステムにおけるデータ操作の効率化に関する技術として、分割配置したデータブロックごとに、メタ情報を付与する技術がある。 As a technique for improving the efficiency of data operation in a distributed database system, there is a technique for adding meta information to each divided data block.

Ｈｉｖｅがサポートするデータフォーマットの１つであるＯＲＣＦｉｌｅを例に用いて、該技術の一例を挙げる。ＯＲＣｆｉｌｅでは、Ｈｉｖｅが扱うレコード群（元データ）を複数のレコード（行データ）からなるグループに分割したものをデータブロックとして保持する。すなわち、ＯＲＣｆｉｌｅにおいて、各データブロックには複数のレコードが格納されている。そして、各データブロックは、当該データブロック内での、各列に対するレコードの最大値、最小値、和、カウント値等の統計情報をメタデータとして保持する。以下、このような分割配置されたデータブロックに対して付与される、データブロック内のデータの統計量を示すメタデータを、ブロック統計情報と呼ぶ場合がある。 An example of this technique will be described using ORCFFile, which is one of the data formats supported by Hive, as an example. In the ORC file, a record group (original data) handled by Hive is divided into a group composed of a plurality of records (row data) and held as a data block. That is, in the ORC file, a plurality of records are stored in each data block. Each data block holds statistical information such as the maximum value, the minimum value, the sum, and the count value of the records for each column in the data block as metadata. Hereinafter, the metadata indicating the statistical amount of data in the data block, which is given to the data blocks arranged in a divided manner, may be referred to as block statistical information.

ブロック統計情報は、ある種のデータ操作を行うときに効率性を発揮する。例えば、次のようなクエリを考える。 Block statistics information is efficient when performing certain types of data operations. For example, consider the following query:

クエリ例１：
select * from Table1 where attribute01 <= 100; Query example 1:
select * from Table1 where attribute01 <= 100;

クエリ例１は、“Table1”というテーブルから、“attribute01”という列（属性）名の列の値が１００以下であるレコードをすべて取得するというデータ操作を行わせるためのクエリである。なお、テーブルは、データベース表やリレーションモデルオブジェクトとも呼ばれる。 Query example 1 is a query for performing a data operation of acquiring all records having a column (attribute) name column value “attribute01” of 100 or less from a table “Table1”. The table is also called a database table or a relationship model object.

Ｈｉｖｅを利用した分散データベースシステムは、このようなクエリを受け取ると、まずクエリ内の述語“attribute01 <= 100”で示される条件を基に、各データブロックのブロック統計情報を参照して、列“attribute01”の最小値を確認する。ここで、述語（predicate）は、述語論理における述語であり、クエリ内においては、例えば、ｗｈｅｒｅ句などに指定される論理式の部分に相当する。クエリに用いられる述語のうち分割配置の仕方によって処理コストに変化が生じるものとしては、選択述語と結合述語とがある。上記のクエリ例１に用いられている述語は、選択述語である。 When the distributed database system using Hive receives such a query, first, based on the condition indicated by the predicate “attribute01 <= 100” in the query, the block statistical information of each data block is referred to, and the column “ Check the minimum value of “attribute01”. Here, the predicate is a predicate in the predicate logic, and corresponds to, for example, a part of a logical expression specified in the where clause in the query. Among the predicates used in a query, there are a selection predicate and a join predicate that cause a change in processing cost depending on the way of divided arrangement. The predicate used in the above query example 1 is a selection predicate.

今、仮に、あるデータブロックのブロック統計情報が示す“attribute01”列の最小値が１１０であるとする。その場合、当該データブロックの中には、“attribute01”列の値が１１０以上のレコードしか存在しない。したがって、そのようなデータブロックは、クエリ例１の処理で読み込む必要がない。 Now, suppose that the minimum value of the “attribute01” column indicated by the block statistical information of a certain data block is 110. In that case, only the record whose value of the “attribute01” column is 110 or more exists in the data block. Therefore, such a data block does not need to be read in the process of the query example 1.

Ｈｉｖｅを利用した分散データベースシステムでは、データフォーマットにＯＲＣＦｉｌｅを用いている場合、まず、発行されたクエリ内の述語を基に、各データブロックのブロック統計情報を参照する。そして、該分散データベースシステムは、述語にマッチするレコード、より具体的には述語が示す論理式を満たすレコードがデータブロック内に存在しないことがわかると、当該データブロックの読み込みを回避する。このようにして、記憶領域に対するＩ／Ｏ（Input/Output）の発生を抑え、より高速なデータ操作を実現している。 In the distributed database system using Hive, when ORFFile is used as the data format, first, block statistical information of each data block is referred to based on the predicate in the issued query. Then, when it is found that there is no record in the data block that matches the predicate, more specifically, a record that satisfies the logical expression indicated by the predicate, the distributed database system avoids reading the data block. In this way, the generation of I / O (Input / Output) for the storage area is suppressed, and higher-speed data operation is realized.

分散データベースシステムにおけるデータの分割配置に関する技術として、例えば、非特許文献１に記載された技術がある。非特許文献１には、データベースシステムが、追加データを格納する際に、既に格納されているデータに対して過去に発行されたクエリの履歴に基づいて、追加データをどのように分割配置するかを決定するデータ分割配置方法が記載されている。具体的には、データベースシステムが、クエリの履歴から、過去発行されたクエリに頻出する選択述語の集合を抽出して、抽出された選択述語集合に含まれるいずれか１つ以上の選択述語にマッチしない行を同一グループに選り分ける。このような分割配置を行うことで、ある１つ以上の選択述語にマッチにしないグループに相当するデータブロックを、該当する選択述語のいずれかを含むクエリが発行されたときに読み飛ばせるようにし、記憶領域に対する不要なＩ／Ｏの削減によるデータ操作の効率化を図る。 As a technique related to the divided arrangement of data in the distributed database system, for example, there is a technique described in Non-Patent Document 1. In Non-Patent Document 1, when a database system stores additional data, how to divide and arrange the additional data based on the history of queries issued in the past with respect to the data already stored. A data division arrangement method for determining the data is described. Specifically, the database system extracts from the query history a set of selection predicates that frequently appear in previously issued queries, and matches any one or more selection predicates included in the extracted selection predicate set. Sort the lines that are not in the same group. By performing such a divided arrangement, a data block corresponding to a group that does not match one or more selection predicates can be skipped when a query including any of the corresponding selection predicates is issued. Further, it is possible to improve the efficiency of data operation by reducing unnecessary I / O to the storage area.

また、特許文献１には、分散型のデータベース装置が、指定されたタイミング毎に、アクセス履歴から算出した複数の分割テーブルそれぞれに対するアクセス頻度およびレコードのヒット率に基づいて、複数のテーブル領域のそれぞれに格納されるべき分割テーブルの配置を変更することが記載されている。 Also, in Patent Document 1, each of a plurality of table areas is determined based on the access frequency and the record hit rate for each of a plurality of divided tables calculated from an access history at each designated timing. Is described to change the arrangement of the partition table to be stored.

また、特許文献２には、クエリが発行された段階で、そのクエリを処理するトランザクションが、現在のブロック内のソート状態における次元の順序と問い合わせ結果の出力順序とが一致していない場合に、ブロック内をソートすることが記載されている。 Further, in Patent Document 2, when a query is issued, when the transaction for processing the query does not match the order of dimensions in the sort state in the current block and the output order of the query results, It is described that the block is sorted.

また、特許文献３には、クエリを、選択演算や結合演算などの関係代数を接点とするタスクで構成された処理ツリーに変換し、変換した処理ツリーから、関係代数を他の関係代数の結果に依存しないで実施できるタスク群を取り出して、データベースへのアクセス領域ごとにグループ分けすることが記載されている。また、特許文献３には、グループ分けされたタスクに対してマルチプロセッション・プロセッシングを行う際に、処理コストの小さい順にタスクの処理を行うことや、共通のブロックへのアクセスを一度に行うことなどが記載されている。 Patent Document 3 discloses that a query is converted into a processing tree composed of tasks having a relational algebra such as a selection operation or a join operation as a contact point, and the relational algebra is converted into a result of another relational algebra from the converted processing tree. It describes that a group of tasks that can be executed without depending on the database is taken out and grouped for each access area to the database. Further, in Patent Document 3, when performing multi-processing and processing for grouped tasks, tasks are processed in ascending order of processing costs, and access to a common block is performed at once. Etc. are described.

特開２００９−２８８９７９号公報JP 2009-288879 A 特開平１１−００３２６０号公報JP-A-11-003260 特開２００８−１６５６２２号公報JP 2008-165622 A

L. Sun, M. J. Franklin, S. Krishnan, and R. S. Xin, "Fine-grained partitioning for aggressive data skipping.", SIGMOD 2014, June 2014, p.1115-1126.L. Sun, M. J. Franklin, S. Krishnan, and R. S. Xin, "Fine-grained partitioning for aggressive data skipping.", SIGMOD 2014, June 2014, p.1115-1126.

ブロック統計情報により不要なＩ／Ｏを抑制する効果を得るためには、データブロックが適切に分割配置されていなければならない。すなわち、分散データベースシステムにおけるデータ操作の効率化には、元のデータが不要なＩ／Ｏを抑制できるようなデータブロックに分割されていることが重要である。 In order to obtain the effect of suppressing unnecessary I / O by the block statistical information, the data block must be appropriately divided and arranged. That is, in order to improve the efficiency of data operation in the distributed database system, it is important that the original data is divided into data blocks that can suppress unnecessary I / O.

例えば、クエリ例１のような“attribute01”列に関する述語を含むクエリが頻繁に発行されることが事前に分かる場合、 “Table1”の各レコードを“attribute01”列の値に基づいて分割配置することにより、効果的に記憶領域に対する不要なＩ／Ｏを抑制できる。 For example, if it is known in advance that a query including a predicate related to the “attribute01” column as in Query Example 1 is frequently issued, each record of “Table1” is divided and arranged based on the value of the “attribute01” column Thus, unnecessary I / O to the storage area can be effectively suppressed.

しかし、分析者が試行錯誤を行いながら分析処理を行うような分析タスクでは、属性生成処理で生成したい属性もその都度異なってくる。このため、属性生成処理で発行されるクエリに使用される列の傾向を事前につかむことは難しい。 However, in an analysis task in which an analyst performs an analysis process while performing trial and error, attributes to be generated in the attribute generation process are also different each time. For this reason, it is difficult to grasp in advance the tendency of the columns used in the query issued in the attribute generation process.

例えば、特許文献１に記載された方法のように、過去のアクセス履歴からであれば、アクセス頻度が高いにも関わらずヒット率が低い分割テーブルを、さらに２以上の分割テーブルに分割することは可能である。しかし、新規に発行されたクエリセットがアクセス履歴に追従しない場合には、そのようなアクセス頻度やヒット率に基づく方法では、データ操作の効率化は図れない。 For example, as in the method described in Patent Document 1, if a past access history is used, a partition table having a low hit rate despite a high access frequency may be further divided into two or more partition tables. Is possible. However, when a newly issued query set does not follow the access history, the method based on such access frequency and hit rate cannot improve the efficiency of data operation.

すなわち、アクセス履歴により示されるアクセス頻度やレコードのヒット率は、似た傾向のクエリが発行されるデータベースシステムにおいて定期的に行われるデータの分割配置の見直しにおけるデータの分割時の基準として適していたとしても、新規のクエリ集合に対して、その実行前にデータの再分割配置の是非を問うための判定基準としては適さない。 In other words, the access frequency and record hit rate indicated by the access history were suitable as the criteria for data division in the review of the data division arrangement periodically performed in database systems that issue queries with similar trends. However, it is not suitable as a criterion for asking whether or not to re-divide data before executing a new query set.

なお、非特許文献１に記載された方法のように、クエリ内の述語の頻度に基づいてパーティションキーを設定する方法であれば、新規のクエリ集合に対しても適用できるかもしれない。 Note that, as in the method described in Non-Patent Document 1, any method that sets a partition key based on the frequency of predicates in a query may be applicable to a new query set.

しかし、非特許文献１に記載された方法には、次のような問題がある。第一に、非特許文献１に記載された方法は、クエリ集合における述語の頻度のみを考慮しており、述語そのものがデータ操作に与える処理コストを考慮していない。このため、処理コストが小さく、問題にならないような述語が頻繁にクエリ履歴内に現れた場合に、真に処理コストのボトルネックとなっている述語が無視される可能性がある。真に処理コストのボトルネッックとなっている述語が無視された場合、効率化効果が見込めない場合がある。 However, the method described in Non-Patent Document 1 has the following problems. First, the method described in Non-Patent Document 1 considers only the frequency of predicates in the query set, and does not consider the processing cost that the predicates themselves give to data operations. For this reason, when a predicate that has a low processing cost and does not cause a problem frequently appears in the query history, the predicate that is truly a bottleneck of the processing cost may be ignored. If a predicate that is truly a bottleneck in processing costs is ignored, the efficiency improvement effect may not be expected.

第二に、非特許文献１に記載された方法は、選択述語のみを考慮しており、結合述語を無視している。結合述語は、複数のテーブル間の結合（join）演算を要する述語である。選択述語は、複数のテーブル間の結合演算を要せず、１つのテーブルから条件にマッチする行を抽出する選択（selection）演算を要する述語である。 Secondly, the method described in Non-Patent Document 1 considers only the selection predicate and ignores the combined predicate. A join predicate is a predicate that requires a join operation between a plurality of tables. The selection predicate is a predicate that does not require a join operation between a plurality of tables and requires a selection operation for extracting a row that matches a condition from one table.

クエリ例２：
select * from T,R where T.name=R.name ∧ T.age>30; Query example 2:
select * from T, R where T.name = R.name ∧ T.age>30;

例えば、上記のクエリ例２には、“T.name = R.name”という結合述語と、“T.age > 30”という選択述語の２つの述語の論理積が用いられている。なお、論理積であるので、“T.name=R.name ∧ T.age>30”をまとめて１つの結合述語とみなしてもよい。 For example, in the above query example 2, a logical product of two predicates of a join predicate “T.name = R.name” and a selection predicate “T.age> 30” is used. Since it is a logical product, “T.name = R.name∧T.age> 30” may be collectively regarded as one join predicate.

結合演算には計算量の異なるアルゴリズムがいくつか存在し、例えば、上記のクエリ例２の場合、要する演算は選択演算と結合演算の組み合わせとなるが、事前にデータが適切にソートされて分割配置されている方が、結合演算で効率のいいアルゴリズムを選択できる。 There are several algorithms with different computational complexity in the join operation. For example, in the case of the above query example 2, the required operation is a combination of the selection operation and the join operation, but the data is appropriately sorted in advance and divided. The more efficient algorithm can be selected in the join operation.

しかし、非特許文献１に記載された方法は、結合述語については何ら開示されておらず、選択述語のみを対象にしているため、結合述語が用いられた場合に適切な分割配置を行うことができない。一般に、選択演算よりも結合演算の処理コストの方が大きくなる傾向があるため、選択述語のみを対象にしている非特許文献１に記載された方法では、処理効率化の大きな機会損失が生じている。 However, the method described in Non-Patent Document 1 does not disclose any join predicates, and only targets selection predicates. Therefore, when a join predicate is used, an appropriate division arrangement can be performed. Can not. In general, since the processing cost of the join operation tends to be larger than the selection operation, the method described in Non-Patent Document 1 that targets only the selection predicate causes a large opportunity loss in processing efficiency. Yes.

第三に、非特許文献１に記載された方法は、新しく追加されたデータに対して、初めて分割配置を行うことを想定しており、既にデータベース内に存在するデータに対して、適切な基準を与えてデータの分割配置をやり直すことを想定していない。属性生成処理では、一度に大量のクエリが発行されることが多い。そのような大量のクエリを一連の処理要求であるクエリセットとして見た場合、当該クエリセットの実行には多くの時間が掛かる。このため、大量のクエリを含むクエリセットを受け付けた場合には、その実行前に当該クエリセットに対応した適切な基準でデータの分割配置をやり直したほうが、総実行時間の短縮が可能な場合がある。しかし、データの分割配置をやり直すことで余計に時間が掛かる場合も当然あるため、データの分割配置を行うかどうかを判断する仕組みが必要だが、非特許文献１にはそのような仕組みについては何ら考慮されていない。一般に、データの分割配置をやり直す処理、すなわちデータが複数のデータノードに分割されて配置されている状態でそれとは異なる分割方法を用いてデータを分割して配置しなおす処理は、データの再分割配置（re-partition）と呼ばれる。 Thirdly, the method described in Non-Patent Document 1 assumes that the newly added data is divided for the first time, and is suitable for data already existing in the database. It is not assumed that data is divided and arranged again. In attribute generation processing, a large number of queries are often issued at a time. When such a large number of queries are viewed as a query set that is a series of processing requests, it takes a lot of time to execute the query set. For this reason, when a query set including a large number of queries is accepted, it may be possible to reduce the total execution time by re-splitting the data according to an appropriate standard corresponding to the query set before the execution. is there. However, since it may naturally take extra time to redo the data divisional arrangement, a mechanism for determining whether to perform the data divisional arrangement is necessary. However, Non-Patent Document 1 does not describe any such mechanism. Not considered. In general, the process of redoing and re-dividing data, that is, the process of dividing and re-allocating data using a different division method when the data is divided and arranged in a plurality of data nodes, This is called re-partition.

なお、特許文献２に記載された方法は、ブロックごとに当該ブロック内のソート状態を示す情報を保持しておくことにより、各ブロックに対して同じソート処理が繰り返されるのを防ごうというものであって、データ全体の分割配置の最適化については何ら考慮されていない。 Note that the method described in Patent Document 2 is intended to prevent the same sort process from being repeated for each block by holding information indicating the sort state in the block for each block. Therefore, no consideration is given to the optimization of the divided arrangement of the entire data.

また、特許文献３に記載された方法は、クエリ内の演算処理に対応するタスクを、アクセス先のデータ領域ごとにグループ化して処理の効率化を図ろうというものであって、データの分割配置の最適化については何ら考慮されていない。 In addition, the method described in Patent Document 3 is a method for grouping tasks corresponding to arithmetic processing in a query for each data area of an access destination to improve processing efficiency. No consideration is given to the optimization of.

そこで、本発明は、属性生成処理のような複数のクエリや述語の演算に時間のかかるクエリの発行を伴うワークロードの総処理時間を縮小できる制御装置、分散データベースシステム、分割配置制御方法および分散配置制御用プログラムを提供することを目的とする。 Therefore, the present invention provides a control device, a distributed database system, a divided arrangement control method, and a distribution that can reduce the total processing time of a workload that involves issuing multiple queries such as attribute generation processing and queries that require time for predicate operations. An object is to provide a program for arrangement control.

本発明による制御装置は、複数のデータ格納手段に分割されて格納されている所定のデータ群である対象データ群に対する、１つ以上のクエリを含むクエリセットを受け付けるクエリセット受付手段と、対象データ群のメタデータに基づいて、対象データ群を再分割配置せずにクエリセットを実行した場合の処理コストである第１の処理コストと、対象データ群を所定の方法で再分割配置した後にクエリセットを実行した場合の処理コストである第２の処理コストとを推定する処理コスト推定手段と、処理コスト推定手段による推定結果に基づいて、クエリセットを実行する前に、対象データ群を再分割配置するか否か、および再分割配置する場合にはどのような方法で再分割配置するかを判断する再分割配置判断手段とを備えたことを特徴とする。 A control device according to the present invention includes a query set receiving unit that receives a query set including one or more queries for a target data group that is a predetermined data group that is divided and stored in a plurality of data storage units, and target data Based on the group metadata, the first processing cost, which is the processing cost when the query set is executed without subdividing the target data group, and the query after subdividing the target data group by a predetermined method Based on the processing cost estimation means for estimating the second processing cost, which is the processing cost when the set is executed, and the estimation result by the processing cost estimation means, the target data group is subdivided before executing the query set Re-division arrangement determining means for determining whether or not to arrange, and in the case of sub-division arrangement, how to re-divide arrangement. To.

また、本発明による分散データベースシステムは、所定のデータ群を複数のデータブロックに分割して格納する複数のデータ格納手段と、所定のデータ群である対象データ群に対する、１つ以上のクエリを含むクエリセットを受け付けるクエリセット受付手段と、対象データ群のメタデータを記憶するメタデータ格納手段と、メタデータに基づいて、対象データ群を再分割配置せずにクエリセットを実行した場合の処理コストである第１の処理コストと、対象データ群を再分割配置した後でクエリセットを実行した場合の処理コストである第２の処理コストとを推定する処理コスト推定手段と、処理コスト推定手段による推定結果に基づいて、クエリセットを実行する前に、対象データ群を再分割配置するか否か、および再分割配置する場合にはどのような方法で再分割配置するかを判断する再分割配置判断手段と、再分割配置判断手段によって対象データ群を再分割配置すると判断された場合に、クエリセットを実行する前に、対象データ群の再分割配置を実行する再分割配置実行手段とを備えたことを特徴とする。 The distributed database system according to the present invention includes a plurality of data storage means for dividing a predetermined data group into a plurality of data blocks and storing the data, and one or more queries for the target data group that is the predetermined data group. Query set receiving means for receiving a query set, metadata storage means for storing metadata of the target data group, and processing cost when executing the query set without subdividing the target data group based on the metadata A processing cost estimation unit that estimates the first processing cost and the second processing cost that is the processing cost when the query set is executed after the target data group is subdivided and arranged, and the processing cost estimation unit Based on the estimation result, whether or not to subdivide the target data group and execute subdivision arrangement before executing the query set Is a subdivision arrangement determination unit that determines how to perform subdivision arrangement, and when the target data group is determined to be subdivision arrangement by the subdivision arrangement determination unit, before executing the query set, And a subdivision arrangement executing means for executing subdivision arrangement of the data group.

また、本発明によるデータ分割配置制御方法は、制御装置が、複数のデータ格納手段に分割されて格納されている所定のデータ群である対象データ群に対する、１つ以上のクエリを含むクエリセットを受け付け、制御装置が、クエリセットを実行する前に、対象データ群のメタデータに基づいて、対象データ群を再分割配置せずにクエリセットを実行した場合の処理コストと、対象データ群を所定の方法で再分割配置した後にクエリセットを実行した場合の処理コストとを推定し、制御装置が、クエリセットを実行する前に、推定結果に基づいて、クエリセットを実行する前に、対象データ群を再分割配置するか否か、および再分割配置する場合にはどのような方法で再分割配置するかを判断することを特徴とする。 Further, in the data division arrangement control method according to the present invention, the control device includes a query set including one or more queries for a target data group that is a predetermined data group that is divided and stored in a plurality of data storage means. Receiving and processing device when the control device executes the query set without subdividing the target data group based on the metadata of the target data group, and the target data group are determined in advance before executing the query set The processing cost is estimated when the query set is executed after subdivision placement by the above method, and before the controller executes the query set, the target data before executing the query set is executed based on the estimation result. It is characterized in that it is determined whether or not a group is to be subdivided and in what manner the group is to be subdivided.

また、本発明による分割配置制御用プログラムは、コンピュータに、複数のデータ格納手段に分割されて格納されている所定のデータ群である対象データ群に対する、１つ以上のクエリを含むクエリセットを受け付ける処理、対象データ群のメタデータに基づいて、対象データ群を再分割配置せずにクエリセットを実行した場合の処理コストと、対象データ群を所定の方法で再分割配置した後にクエリセットを実行した場合の処理コストとを推定する処理、およびクエリセットを実行する前に、推定結果に基づいて、対象データ群を再分割配置するか否か、再分割配置する場合にはどのような方法で再分割配置するかを判断する処理を実行させることを特徴とする。 In addition, the divided arrangement control program according to the present invention accepts a query set including one or more queries for a target data group, which is a predetermined data group that is divided and stored in a plurality of data storage means, in a computer. Based on the processing and metadata of the target data group, the processing cost when executing the query set without subdividing the target data group, and executing the query set after subdividing the target data group by a predetermined method Whether to re-divide the target data group based on the estimation result before executing the query set and the processing cost, and what method to re-divide It is characterized in that a process for determining whether to divide and arrange is executed.

本発明によれば、属性生成処理のようなクエリの発行を伴うワークロードの総処理時間を縮小できる。 According to the present invention, it is possible to reduce the total processing time of a workload that involves issuing a query such as attribute generation processing.

本発明の実施形態にかかる分散データベースシステムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the distributed database system concerning embodiment of this invention. 処理コスト推定手段１２の構成例を示すブロック図である。3 is a block diagram illustrating a configuration example of a processing cost estimation unit 12. FIG. 本発明の実施形態にかかる分散データベースシステムの動作例を示すフローチャートである。It is a flowchart which shows the operation example of the distributed database system concerning embodiment of this invention. 処理コスト推定手段１２および再分割配置判断手段１３の動作例を示すフローチャートである。It is a flowchart which shows the operation example of the process cost estimation means 12 and the subdivision arrangement | positioning judgment means 13. クエリセットの例を示す説明図である。It is explanatory drawing which shows the example of a query set. 述語の抽出結果の例を示す説明図である。It is explanatory drawing which shows the example of the extraction result of a predicate. 列集合の抽出結果の例を示す説明図である。It is explanatory drawing which shows the example of the extraction result of a column set. クエリのサブセットＱ_ｉ’の例を示す説明図である。Is an explanatory diagram showing an example of a subset Q _{i 'queries.} 対象データ群の分割配置例を示す説明図である。It is explanatory drawing which shows the example of a division | segmentation arrangement | positioning of an object data group. 対象データ群の他の分割配置例を示す説明図である。It is explanatory drawing which shows the other division | segmentation arrangement | positioning example of a target data group. 対象データ群の他の分割配置例を示す説明図である。It is explanatory drawing which shows the other division | segmentation arrangement | positioning example of a target data group. クエリ実行計画の例を示す説明図である。It is explanatory drawing which shows the example of a query execution plan. メタデータの一例を示す説明図である。It is explanatory drawing which shows an example of metadata. 本発明の概要を示すブロック図である。It is a block diagram which shows the outline | summary of this invention.

以下、本発明の実施形態を図面を参照して説明する。図１は、本発明の実施形態にかかる分散データベースシステムの構成例を示すブロック図である。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing a configuration example of a distributed database system according to an embodiment of the present invention.

図１に示す分散データベースシステムは、クエリセット受付手段１１と、処理コスト推定手段１２と、再分割配置判断手段１３と、再分割配置実行手段１４と、クエリ実行手段１５と、複数のデータ格納手段２１と、メタデータ格納手段２２とを備える。図１において、各コンポーネントは、ネットワークやバス等を介して互いに通信可能に接続されている。 The distributed database system shown in FIG. 1 includes a query set receiving unit 11, a processing cost estimating unit 12, a subdivision arrangement determining unit 13, a subdivision arrangement execution unit 14, a query execution unit 15, and a plurality of data storage units. 21 and metadata storage means 22. In FIG. 1, each component is connected to be communicable with each other via a network, a bus, or the like.

データ格納手段２１の各々は、クエリの処理対象とされるデータ群を分割したデータブロックを格納する。以下、クエリの処理対象とされるデータ群を、単に対象データ群という。 Each of the data storage means 21 stores a data block obtained by dividing a data group to be processed by a query. Hereinafter, a data group to be processed by a query is simply referred to as a target data group.

本実施形態において対象データ群は、１つ以上の列の各々に割り当てられる値を含む１つ以上の行データからなる表データである。本発明では、表データを、テーブルの実体という意味で用いている。また、対象データ群は、表データにおける１つ以上の任意の列の集合である列集合をパーティションキーとして、１つ以上の行データを各々有する複数のデータブロックに分割されて、データ格納手段２１に格納される。対象データ群は、より具体的には、各行データに含まれるパーティションキーに指定された列集合の値に基づいて、複数のデータブロックに分割されてデータ格納手段２１に格納される。なお、パーティションキーは、表データを分割する際の基準とされるキーである。また、分散データベースシステムは、各行データがどのデータブロックに所属するかを示す情報を保持している。 In the present embodiment, the target data group is table data composed of one or more row data including values assigned to each of one or more columns. In the present invention, the table data is used in the sense of a table entity. The target data group is divided into a plurality of data blocks each having one or more row data using a column set, which is a set of one or more arbitrary columns in the table data, as a partition key, and the data storage means 21 Stored in More specifically, the target data group is divided into a plurality of data blocks and stored in the data storage means 21 based on the value of the column set designated by the partition key included in each row data. The partition key is a key used as a reference when dividing table data. Further, the distributed database system holds information indicating to which data block each row data belongs.

メタデータ格納手段２２は、対象データ群のメタデータを格納する。対象データ群のメタデータは、後述する処理コスト推定手段１２がクエリセットの処理コストを算出できる情報を含んでいればよい。対象データ群のメタデータは、例えば、対象データ群のデータサイズ（行数、バイトサイズ等）、各データブロックのデータサイズ（行数、バイトサイズ等）といった基本的な情報だけでなく、データの各種カテゴリ（列や行やそれらの集合等）に関する統計量、特に、対象データ群の列ごとの値のカーディナリティ（cardinality）、選択率（selectivity）、ヒストグラムといった対象データ群の表データにおける列の値に関する統計量を含んでいてもよい。 The metadata storage unit 22 stores the metadata of the target data group. The metadata of the target data group only needs to include information that allows the processing cost estimation unit 12 described later to calculate the processing cost of the query set. The metadata of the target data group includes, for example, basic information such as the data size (number of rows, byte size, etc.) of the target data group and the data size (number of rows, byte size, etc.) of each data block, Statistics related to various categories (columns, rows, collections, etc.), especially column values in the table data of the target data group such as cardinality, selectivity, and histogram of the value of each column of the target data group Statistics may be included.

本実施形態において、クエリセットの処理コストの計算方法は特に限定されないが、対象データ群のメタデータは、対象データ群とされる表データにおける列の値に関する統計量であって、列中の占有度合いや選択度合い（選択されやすさ又は選択されにくさ）に関する統計量を含むことが好ましい。 In this embodiment, the calculation method of the processing cost of the query set is not particularly limited, but the metadata of the target data group is a statistic regarding the value of the column in the table data set as the target data group, and the occupation in the column It is preferable to include a statistic regarding the degree and the degree of selection (ease of selection or difficulty of selection).

クエリセット受付手段１１は、外部から対象データ群に対するクエリセットを受け付ける。クエリセットは、１つ以上のクエリを含んでいればよい。クエリセットの発行元は、例えば、一連の処理として実行させたい１つ以上のクエリをクエリセットとして入力すればよい。 The query set accepting unit 11 accepts a query set for the target data group from the outside. The query set only needs to include one or more queries. For example, the query set publisher may input one or more queries to be executed as a series of processes as a query set.

処理コスト推定手段１２は、メタデータ格納手段２２に記憶されている対象データ群のメタデータに基づいて、対象データ群を再分割配置せずにクエリセットを実行した場合の処理コストである第１の処理コストと、対象データ群を所定の方法で再分割配置した後にクエリセットを実行した場合の処理コストである第２の処理コストとを推定（算出）する。処理コスト推定手段１２は、例えば、再分割配置判断手段１３からの要求に応じて、第１の処理コストおよび第２の処理コストを推定してもよい。 The processing cost estimation unit 12 is the first processing cost when the query set is executed without subdividing the target data group based on the metadata of the target data group stored in the metadata storage unit 22. And a second processing cost that is a processing cost when the query set is executed after the target data group is re-divided and arranged by a predetermined method. For example, the processing cost estimation unit 12 may estimate the first processing cost and the second processing cost in response to a request from the subdivision arrangement determination unit 13.

本実施形態において、処理コストは、指定された処理を完了するまでにかかる時間である処理時間に関連する指標であって、処理コストを比較することによって処理時間の大小比較ができる指標であれば特に問わない。 In the present embodiment, the processing cost is an index related to the processing time, which is the time taken to complete the specified processing, and can be compared with the processing time by comparing the processing costs. It doesn't matter.

また、処理コスト推定手段１２は、対象データ群のメタデータだけでなく、さらに各データ格納手段２１を制御する計算機のディスクバンド幅や当該計算機間を接続するネットワークバンド幅といった分散データベースシステムの構成要素の処理性能に関する情報に基づいて、第１の処理コストおよび第２の処理コストを推定してもよい。なお、ここでは、バンド幅を、単位時間あたりのデータ転送容量という意味で用いている。 The processing cost estimation means 12 is not only the metadata of the target data group, but also the components of the distributed database system such as the disk bandwidth of the computer that controls each data storage means 21 and the network bandwidth that connects the computers. The first processing cost and the second processing cost may be estimated based on the information regarding the processing performance. Here, the bandwidth is used to mean the data transfer capacity per unit time.

図２は、処理コスト推定手段１２の構成例を示すブロック図である。図２に示すように、処理コスト推定手段１２は、列集合抽出手段１２１と、第１の処理コスト算出手段１２２と、分割配置方法格納手段１２３と、第２の処理コスト算出手段１２４とを含んでいてもよい。 FIG. 2 is a block diagram illustrating a configuration example of the processing cost estimation unit 12. As shown in FIG. 2, the processing cost estimation unit 12 includes a column set extraction unit 121, a first processing cost calculation unit 122, a divided arrangement method storage unit 123, and a second processing cost calculation unit 124. You may go out.

列集合抽出手段１２１は、クエリセットに含まれるクエリの述語（より具体的には、選択述語と結合述語）に出現する列集合を抽出する。列集合抽出手段１２１により抽出された列集合は、再分割配置におけるパーティションキーの候補とされるとともに、処理コストの算出対象とするクエリを限定するために用いられる。列集合抽出手段１２１は、処理コストの計算時間をさらに短縮するため、クエリセットに含まれるクエリの選択述語または結合述語に出現する列集合のうち所定の頻度以上で出現する列集合を抽出してもよい。 The column set extraction unit 121 extracts a column set that appears in a query predicate (more specifically, a selection predicate and a join predicate) included in the query set. The column set extracted by the column set extraction unit 121 is used as a partition key candidate in the subdivision arrangement, and is used to limit queries for which processing costs are to be calculated. The column set extraction unit 121 extracts column sets that appear at a predetermined frequency or higher from column sets that appear in the selection predicates or join predicates of the queries included in the query set in order to further reduce the processing cost calculation time. Also good.

第１の処理コスト算出手段１２２は、第１の処理コストを算出する。 The first processing cost calculation unit 122 calculates a first processing cost.

分割配置方法格納手段１２３は、第２の処理コストを算出する際の分割配置方法を特定する情報を格納する。分割配置方法格納手段１２３は、分割配置方法を特定する情報を複数格納してもよい。分割配置方法の例としては、パーティションキーとされた列集合の値が同じである行データが同一データブロックに多く含まれるように対象データ群を分割する分割方法や、全データブロックに渡りパーティションキーとされた列集合の値で行データがソートされるように対象データ群を分割する分割方法などが挙げられる。同じ分割方法であっても異なる分割数を指定する場合などは異なる分割配置方法として扱ってもよい。分割配置方法格納手段１２３に格納される情報は、再分割配置実行手段が対応可能な分割配置方法を特定する情報であれば、特に問わない。なお、再分割配置実行手段１４が対応可能な分割配置方法が１つしかないなどで第２の処理コストを算出する際の分割配置方法が固定される場合には、分割配置方法格納手段１２３は省略されてもよい。以下、分割配置方法格納手段１２３に格納されている情報によって特定される分割配置方法を、単に、指定の分割配置方法という場合がある。 The divided arrangement method storage unit 123 stores information for specifying a divided arrangement method used when calculating the second processing cost. The divided arrangement method storage unit 123 may store a plurality of pieces of information specifying the divided arrangement method. Examples of partitioning methods include partitioning methods that divide the target data group so that many row data with the same column set value as the partition key are included in the same data block, and partition keys that span all data blocks. For example, there is a division method that divides the target data group so that the row data is sorted by the value of the column set. Even when the same division method is used, when a different number of divisions is designated, it may be handled as a different division arrangement method. The information stored in the divided arrangement method storage unit 123 is not particularly limited as long as it is information specifying a divided arrangement method that can be handled by the subdivision arrangement execution unit. If the division arrangement method for calculating the second processing cost is fixed, for example, because there is only one division arrangement method that the subdivision arrangement execution unit 14 can handle, the division arrangement method storage unit 123 It may be omitted. Hereinafter, the division arrangement method specified by the information stored in the division arrangement method storage unit 123 may be simply referred to as a designated division arrangement method.

第２の処理コスト算出手段１２４は、第２の処理コストを算出する。第２の処理コスト算出手段１２４は、列集合抽出手段１２１によって抽出された列集合の各々をパーティションキーの候補として、候補ごとに、分割配置方法格納手段１２３に格納されている情報によって特定される分割配置方法で対象データ群を再分割配置した場合の第２の処理コストを算出する。このとき、第２の処理コスト算出手段１２４は、当該候補の列集合を使用しているクエリのみを対象にして、第２の処理コストを算出してもよい。なお、その場合、第１の処理コスト算出手段１２２も、上記の候補に対して、当該候補の列集合を使用しているクエリのみを対象にして、第１の処理コストを算出する。 The second processing cost calculation unit 124 calculates a second processing cost. The second processing cost calculation unit 124 identifies each column set extracted by the column set extraction unit 121 as a partition key candidate, and is specified for each candidate by information stored in the divided arrangement method storage unit 123. A second processing cost is calculated when the target data group is subdivided and arranged by the division arrangement method. At this time, the second processing cost calculation unit 124 may calculate the second processing cost only for queries that use the candidate column set. In this case, the first processing cost calculation unit 122 also calculates the first processing cost for only the query that uses the candidate column set for the candidate.

第１の処理コスト算出手段１２２および第２の処理コスト算出手段１２４は、処理コストを算出する際に、クエリの実行アルゴリズムとして所定の最適化アルゴリズムが適用されると仮定して、処理コストを算出してもよい。所定の最適化アルゴリズムは、例えば、結合演算を含むクエリのオペレーションの順序を最適化した実行アルゴリズムであってもよい。 The first processing cost calculation unit 122 and the second processing cost calculation unit 124 calculate the processing cost on the assumption that a predetermined optimization algorithm is applied as the query execution algorithm when calculating the processing cost. May be. The predetermined optimization algorithm may be, for example, an execution algorithm that optimizes the order of operations of a query including a join operation.

図示省略しているが、分散データベースシステムは、後述するクエリ実行手段１５がどのようなアルゴリズムに対応しているかを示す情報を格納するアルゴリズム格納手段や、受け付けたクエリセットを格納するクエリセット格納手段を有していてもよい。 Although not shown, the distributed database system includes an algorithm storage unit that stores information indicating what algorithm the query execution unit 15 described later corresponds to, and a query set storage unit that stores an accepted query set. You may have.

なお、第１の処理コスト算出手段１２２および第２の処理コスト算出手段１２４は、これら処理コストの算出をクエリセットの実行前に行う。 Note that the first processing cost calculation unit 122 and the second processing cost calculation unit 124 calculate these processing costs before executing the query set.

再分割配置判断手段１３は、処理コスト推定手段１２によって推定された第１の処理コストおよび第２の処理コストに基づいて、クエリセットの実行前に、対象データ群を再分割配置するか否か、および再分割配置する場合にはどのような方法で再分割配置するかを判断する。 The subdivision arrangement determining unit 13 determines whether to subdivide the target data group before executing the query set based on the first processing cost and the second processing cost estimated by the processing cost estimation unit 12. In the case of subdivision arrangement, it is determined how to perform subdivision arrangement.

再分割配置判断手段１３は、例えば、第２の処理コストのいずれかが第１の処理コストよりも小さかった場合に、最も小さい第２の処理コストが算出されたパーティションキーの候補と分割配置方法とを用いて対象データ群を再分割配置すると判断してもよい。 For example, the re-division arrangement determination unit 13 determines the partition key candidate and the division arrangement method for which the smallest second processing cost is calculated when any of the second processing costs is smaller than the first processing cost. May be used to determine that the target data group is to be subdivided.

再分割配置実行手段１４は、再分割配置判断手段１３によって対象データ群を再分割配置すると判断された場合に、クエリセットを実行する前に、対象データ群の再分割配置を実行する。 The subdivision arrangement execution unit 14 executes subdivision arrangement of the target data group before executing the query set when the subdivision arrangement determination unit 13 determines that the target data group is subdivided.

再分割配置実行手段１４は、例えば、再分割配置判断手段１３からの要求に応じて、指定されたパーティションキーおよび分割配置方法を用いて対象データ群の再分割配置を実行してもよい。再分割配置実行手段１４は、より具体的には、表データにおけるパーティションキーとして指定された列集合に関する値の集合を基に、表データに含まれる各行データを複数のデータブロックのいずれかに割り振ってもよい。このとき、再分割配置実行手段１４は、例えば、各行データに対して当該行データがどのデータブロックに所属するかを示すパーティション情報を生成してもよい。そして、再分割配置実行手段１４は、生成したパーティション情報を基に、行データの移動を指示するクエリを発行するなどして、各行データを、当該行データの所属先とされたデータブロックを格納するデータ格納手段２１に移動させる。また、再分割配置実行手段１４は、再分割配置を実行した後、新たな分割配置における各データブロック内のデータの統計量を示すメタデータ（ブロック統計情報）を生成して、メタデータ格納手段２２に格納してもよい。 For example, in response to a request from the subdivision arrangement determination unit 13, the subdivision arrangement execution unit 14 may execute subdivision arrangement of the target data group using a designated partition key and division arrangement method. More specifically, the subdivision arrangement execution unit 14 allocates each row data included in the table data to one of the plurality of data blocks based on a set of values related to the column set designated as the partition key in the table data. May be. At this time, for example, the subdivision arrangement execution unit 14 may generate partition information indicating which data block the row data belongs to for each row data. Then, the subdivision arrangement execution means 14 stores each row data in the data block to which the row data belongs by issuing a query instructing the movement of the row data based on the generated partition information. Data storage means 21 to be moved. The re-division arrangement executing unit 14 generates metadata (block statistical information) indicating the statistical amount of data in each data block in the new division arrangement after executing the sub-division arrangement, and stores the metadata. 22 may be stored.

クエリ実行手段１５は、指定されたクエリを実行する。 The query execution means 15 executes the designated query.

本実施形態において、クエリセット受付手段１１は、例えば、マウスやキーボード、ネットワークカードといったデータの入出力を行うデータ入出力装置と、記憶手段に記憶されているプログラムに従って動作するＣＰＵ等の情報処理装置とによって実現される。また、処理コスト推定手段１２、再分割配置判断手段１３、再分割配置実行手段１４およびクエリ実行手段１５は、例えば、記憶手段に記憶されているプログラムに従って動作するＣＰＵ等の情報処理装置とによって実現される。また、データ格納手段２１、メタデータ格納手段２２およびその他の格納手段（上記のアルゴリズム情報格納手段やクエリセット格納手段等）は、例えば、記憶装置により実現される。 In the present embodiment, the query set accepting unit 11 includes, for example, a data input / output device that inputs and outputs data such as a mouse, a keyboard, and a network card, and an information processing device such as a CPU that operates according to a program stored in the storage unit. And realized by. Further, the processing cost estimation unit 12, the subdivision arrangement determination unit 13, the subdivision arrangement execution unit 14, and the query execution unit 15 are realized by an information processing apparatus such as a CPU that operates according to a program stored in the storage unit, for example. Is done. In addition, the data storage unit 21, the metadata storage unit 22, and other storage units (the algorithm information storage unit, the query set storage unit, and the like) are realized by a storage device, for example.

次に、本実施形態の動作を説明する。図３は、本実施形態の分散データベースシステムの動作例を示すフローチャートである。図３に示す例では、まずクエリセット受付手段１１が、外部からクエリセットを受け付ける（ステップＳ１１）。 Next, the operation of this embodiment will be described. FIG. 3 is a flowchart showing an operation example of the distributed database system of the present embodiment. In the example shown in FIG. 3, first, the query set receiving unit 11 receives a query set from the outside (step S11).

次に、処理コスト推定手段１２が、受け付けたクエリセットの第１の処理コストと第２の処理コストとを算出する（ステップＳ１２）。なお、処理コストの算出方法については後述する。 Next, the processing cost estimation means 12 calculates the first processing cost and the second processing cost of the accepted query set (step S12). The processing cost calculation method will be described later.

次に、再分割配置判断手段１３が、算出された第１の処理コストと第２の処理コストとに基づいて、対象データ群を再分割配置するか否か、および再分割配置する場合にはいずれの方法で再分割配置するかを判断する（ステップＳ１３）。 Next, the subdivision arrangement determination unit 13 determines whether or not to redivide the target data group based on the calculated first processing cost and second calculation cost, and when subdivision arrangement is performed. It is determined which method is used for re-division arrangement (step S13).

ステップＳ１３では、再分割配置判断手段１３は、算出された第１の処理コストと第２の処理コストとを比較して、第２の処理コストのいずれかが第１の処理コストよりも小さい場合に（ステップＳ１３のＹｅｓ）、最も小さい第２の処理コストが算出された方法（パーティションキーの候補と分割配置方法）を用いて再分割配置すると判断する。また、再分割配置判断手段１３は、算出された第１の処理コストと第２の処理コストとを比較して、第２の処理コストのいずれも第１の処理コストよりも小さくなければ（ステップＳ１３のＮｏ）、再分割配置しないと判断する（ステップＳ１５に進む）。 In step S13, the re-division arrangement determining unit 13 compares the calculated first processing cost with the second processing cost, and if any of the second processing costs is smaller than the first processing cost. (Yes in step S13), it is determined that the subdivision arrangement is performed using the method (partition key candidate and division arrangement method) in which the smallest second processing cost is calculated. Further, the subdivision arrangement determination unit 13 compares the calculated first processing cost with the second processing cost, and if neither of the second processing costs is smaller than the first processing cost (step No in S13), it is determined that the subdivision arrangement is not performed (proceeds to step S15).

再分割配置判断手段１３が再分割配置すると判断した場合（ステップＳ１３のＹｅｓ）、再分割配置実行手段１４は、指定された方法で再分割配置を実行する（ステップＳ１４）。 When the subdivision arrangement determination unit 13 determines that subdivision arrangement is to be performed (Yes in step S13), the subdivision arrangement execution unit 14 executes subdivision arrangement by a designated method (step S14).

最後に、クエリ実行手段１５は、受け付けたクエリセットを実行して、実行結果を出力する（ステップＳ１５、ステップＳ１６）。 Finally, the query execution means 15 executes the accepted query set and outputs the execution result (step S15, step S16).

一方、再分割配置判断手段１３が再分割配置しないと判断した場合（ステップＳ１３のＮｏ）には、現在の分割配置のまま、クエリ実行手段１５が、受け付けたクエリセットを実行して、実行結果を出力する（ステップＳ１５、ステップＳ１６）。 On the other hand, when the subdivision arrangement determination unit 13 determines not to perform subdivision arrangement (No in step S13), the query execution unit 15 executes the accepted query set with the current division arrangement, and the execution result. Is output (step S15, step S16).

次に、具体例を用いて本実施形態の動作を説明する。図４に示すフローチャートは、処理コスト推定手段１２および再分割配置判断手段１３の動作である上記のステップＳ１２〜ステップＳ１４のより詳細な処理フローの一例である。 Next, the operation of this embodiment will be described using a specific example. The flowchart shown in FIG. 4 is an example of a more detailed processing flow of the above-described steps S12 to S14 that are operations of the processing cost estimation unit 12 and the subdivision arrangement determination unit 13.

図４に示す例では、まず処理コスト推定手段１２の第１の処理コスト算出手段１２２が、クエリセット受付手段１１が受け付けたクエリセット（以下、クエリセットＱという）のクエリｑ_ｊ∈Ｑの各々について、再分割配置を行わない場合に当該クエリｑ_ｊを実行するのにかかる処理コストである第１のクエリ実行コストとして、ｑｕｅｒｙ＿ｃｏｓｔ＿ｎｏ_ｊを算出する（ステップＳ１０１）。 In the example illustrated in FIG. 4, first, the first processing cost calculation unit 122 of the processing cost estimation unit 12 uses each query q _j εQ of the query set (hereinafter referred to as query set Q) received by the query set reception unit 11. Query_cost_no _j is calculated as the first query execution cost, which is the processing cost for executing the query q _j when the subdivision arrangement is not performed (step S101).

図５は、クエリセットＱの例を示す説明図である。例えば、クエリセット受付手段１１は、図５に示すような５つのクエリ（ｑ１〜ｑ５）を含むクエリセットＱを受け付けたとする。そのような場合、第１の処理コスト算出手段１２２は、ｑ_ｊ＝ｑ１〜ｑ４の各々について、ｑｕｅｒｙ＿ｃｏｓｔ＿ｎｏ_ｊを算出する。本例では、ｊ＝１〜４の整数である。 FIG. 5 is an explanatory diagram showing an example of the query set Q. For example, it is assumed that the query set accepting unit 11 accepts a query set Q including five queries (q1 to q5) as shown in FIG. In such a case, the first processing cost calculation unit 122 calculates query_cost_no _j for each of q _j = q1 to q4. In this example, j is an integer of 1 to 4.

次に、処理コスト推定手段１２の列集合抽出手段１２１は、クエリセットＱのクエリｑ_ｊから、選択述語または結合述語のいずれかにおいて所定の頻度以上で出現する列集合Ｃ_ｉを全て抽出し、列集合Ｃ_ｉの集合｛Ｃ_１，Ｃ_２，．．．，ＣＩ｝を得る（ステップＳ１０２）。なお、Ｉは抽出された列集合Ｃ_ｉの数（種類数）である。なお、列集合抽出手段１２１は、抽出された列集合Ｃ_ｉの数が所定数に満たなければ、最も多く出現した列集合Ｃ_ｉから順に所定数の列集合Ｃ_ｉを抽出してもよい。また、列集合抽出手段１２１は、抽出された列集合Ｃ_ｉの数がゼロである場合には、現在の分割配置におけるパーティションキーそのままに異なる分割配置方法を用いた場合の第２の処理コストを第２の処理コスト算出手段１２４に算出させてもよい。 Next, the column set extraction unit 121 of the processing cost estimation unit 12 extracts all column sets C _i that appear at a predetermined frequency or higher in either the selection predicate or the join predicate from the query q _j of the query set Q. set set of columns _{_{_{C i {C 1, C 2}}} ,. . . , CI} is obtained (step S102). Note that I is the number (number of types) of the extracted column set C _i . Incidentally, the column set extraction unit 121, if the number of the extracted set of columns C _i is not less than the predetermined number may be extracted a predetermined number of column sets C _i from the most emerging set of columns C _i in order. Further, the column set extraction means 121, when the number of extracted column sets C _i is zero, calculates the second processing cost when using a different divided arrangement method as it is with the partition key in the current divided arrangement. The second processing cost calculation unit 124 may calculate it.

以下、列集合Ｃ_ｉの単純な抽出法を例に挙げる。なお、抽出法は、クエリセットＱと頻度の閾値θを与えられて、θ以上の頻度でクエリセットＱ内のクエリの選択述語または結合述語のいずれかに現れる列集合の全てを列挙するものであれば、方式やアルゴリズムは問わない。 Below, include a simple extraction method set of columns C _i as an example. The extraction method enumerates all the column sets that appear in either the selection predicate or the join predicate of the query in the query set Q with a query set Q and a frequency threshold θ. Any method or algorithm can be used.

単純な抽出法では、列集合抽出手段１２１は、まずクエリセットＱを走査し、ｗｈｅｒｅ句から述語を抜き出す。そして、列集合抽出手段１２１は、抜き出した述語に用いられている列の組合せを全て抽出して、抽出した列の組合せが異なるごとに一つの列集合として生成する。図５に示すクエリセットＱによる例を示す。まず、列集合抽出手段１２１は、クエリセットＱ内の各クエリ（ｑ１〜ｑ４）を走査し、ｗｈｅｒｅ句から述語を抜き出す。このとき、列集合抽出手段１２１は、クエリｑ３のように論理和（∨）で連結されている述語があった場合は、該述語を分割して別々の述語として扱う。 In a simple extraction method, the column set extraction unit 121 first scans the query set Q and extracts predicates from the where clause. The column set extraction unit 121 extracts all column combinations used in the extracted predicates, and generates a single column set whenever the extracted column combinations are different. The example by the query set Q shown in FIG. 5 is shown. First, the column set extraction unit 121 scans each query (q1 to q4) in the query set Q and extracts a predicate from the where clause. At this time, if there is a predicate connected by logical sum (∨) as in the query q3, the column set extraction unit 121 divides the predicate and handles it as separate predicates.

図６は、述語の抽出結果の例を示す説明図である。図６には、図５に示すクエリセットＱから抽出される述語が一例として示されている。上記の方法によれば、図５に示すクエリセットＱから、５つの述語（ｐ１〜ｐ５）が抽出される。 FIG. 6 is an explanatory diagram illustrating an example of a predicate extraction result. FIG. 6 shows an example of predicates extracted from the query set Q shown in FIG. According to the above method, five predicates (p1 to p5) are extracted from the query set Q shown in FIG.

続いて、列集合抽出手段１２１は、抽出した述語の各々に現れる列の組合せからなる列集合を抽出するとともに、抽出した列集合が、抽出した全述語内に現れる頻度を計算する。このとき、列集合抽出手段１２１は、１つの述語内に同じテーブルに存在する複数の列が共起している場合には、それら複数の列で作られるあらゆる組合せを列集合として抽出して、抽出した列集合の各々に対して頻度を計算する。例えば、述語ｐ１には、“T.name”と“R.name”と“T.age”という３つの列が現れている。“T.name”と“T.age”とは同じテーブルに存在する列である。これら同じテーブルに存在する複数の列で作られるあらゆる列の組合せとしては｛T.name｝，｛T.age｝，｛T.name, T.age｝の３通りが考えられ、これら３通りの組合せの各々が１つの列集合として抽出される。なお、述語ｐ１からは、さらにテーブルの異なる列｛R.name｝が追加されて、｛T.name｝，｛T.age｝，｛T.name, T.age｝，｛R.name｝の計４つの列集合が抽出される。なお、これら列集合の全ては述語ｐ１に出現するため、これら４つの列集合の各々に対して出現述語数が＋１される。列集合抽出手段１２１は、抽出した述語ｐ１〜ｐ５について同様の処理を行う。なお、次の述語から抽出した列集合が、既に抽出された列集合と完全一致した場合には、新たな列集合とせずに、完全一致した列集合に対して出現述語数を＋１すればよい。 Subsequently, the column set extraction unit 121 extracts a column set including a combination of columns appearing in each of the extracted predicates, and calculates the frequency at which the extracted column set appears in all the extracted predicates. At this time, when a plurality of columns existing in the same table co-occur in one predicate, the column set extraction unit 121 extracts all combinations created by the plurality of columns as a column set, Calculate the frequency for each of the extracted column sets. For example, in the predicate p1, three columns “T.name”, “R.name”, and “T.age” appear. “T.name” and “T.age” are columns existing in the same table. There are three possible combinations of columns created by multiple columns in the same table: {T.name}, {T.age}, {T.name, T.age}. Each combination is extracted as one set of columns. From the predicate p1, a different column {R.name} of the table is further added, and {T.name}, {T.age}, {T.name, T.age}, {R.name} A total of four column sets are extracted. Since all of these column sets appear in the predicate p1, the number of appearance predicates is incremented by 1 for each of these four column sets. The column set extraction unit 121 performs the same processing on the extracted predicates p1 to p5. If the column set extracted from the next predicate completely matches the already extracted column set, the number of appearance predicates may be incremented by 1 with respect to the completely matched column set instead of a new column set. .

図７は、列集合の抽出結果の例を示す説明図である。図７には、図６に示す述語集合から抽出される列集合およびその頻度が示されている。図７に示すように、本例では、５通りの列集合Ｃ_ｉ＝Ｃ_１〜Ｃ_５が抽出されて、抽出された列集合Ｃ_ｉの各々について頻度（出現述語数）が算出される。今、θ＝３とすると、列集合抽出手段１２１は、クエリセットＱに頻出する列集合として、列集合Ｃ_１とＣ_２とを抽出する。なお、列集合抽出手段１２１は、再分割配置判断手段１３が再分割配置する際の方法としてパーティションキーのみを指定する場合には、現在の分割配置におけるパーティションキーと同じ列集合を抽出対象から除外してもよい。 FIG. 7 is an explanatory diagram illustrating an example of column set extraction results. FIG. 7 shows a column set extracted from the predicate set shown in FIG. 6 and its frequency. As shown in FIG. 7, in this example, five column sets C _i = C _{1 to} C ₅ are extracted, and the frequency (number of appearance predicates) is calculated for each of the extracted column sets C _i . Now, assuming that θ = 3, the column set extraction unit 121 extracts column sets C ₁ and C ₂ as column sets that frequently appear in the query set Q. Note that the column set extraction unit 121 excludes the same column set as the partition key in the current division arrangement from the extraction target when the subdivision arrangement determination unit 13 designates only the partition key as a method for the subdivision arrangement. May be.

なお、図７に示す例では、頻度を、対象の列集合が現れる述語の数すなわち出現述語数と定義しているが、頻度の定義はこれに限られない。例えば、抽出された述語の総数に対する出現述語数の割合を頻度としてもよいし、述語の数ではなく対象の列集合が述語に現れているクエリの数を頻度としてもよいし、その他任意に定義された指標を用いてもよい。 In the example shown in FIG. 7, the frequency is defined as the number of predicates in which the target column set appears, that is, the number of appearance predicates, but the definition of the frequency is not limited to this. For example, the ratio of the number of appearance predicates to the total number of extracted predicates may be the frequency, the number of queries in which the target column set appears in the predicate instead of the number of predicates may be the frequency, or any other definition The measured index may be used.

次に、列集合抽出手段１２１は、抽出された列集合Ｃ_ｉの各々について、クエリセットＱに含まれるクエリｑ_ｊの中から、当該列集合Ｃ_ｉをカバーしているクエリｑ_ｊを抽出して、抽出したクエリｑ_ｊからなるサブセットＱ_ｉ’を得る（ステップＳ１０３）。ここで、列集合Ｃ_ｉをカバーしているクエリは、具体的には、選択述語または結合述語を含むクエリであって、当該列集合Ｃ_ｉの要素となっている列が当該１つのクエリ内に全て現れているクエリである。以下、サブセットＱ_ｉ’に含まれるクエリｑ_ｊの各々をクエリｑ_ｊ’と記す場合がある。ここで、ｊ’はクエリセットＱに含まれるクエリｑ_ｊの識別子であるｊのうち指定された列集合Ｃ_ｉをカバーするとの条件を満たすものを表わしている。 Next, the column set extraction unit 121, for each of the extracted set of columns C _i, from the query q _j included in the query set Q, it extracts the query q _j that covers the column set C _i Thus, a subset Q _i ′ consisting of the extracted query q _j is obtained (step S103). Here, the query covering the column set C _i is specifically a query including a selection predicate or a join predicate, and a column that is an element of the column set C _i is included in the one query. It is a query that appears in all. Hereinafter may _{be 'each} query q _j included in the query q _j' subset Q _i referred to. Here, j ′ represents that which satisfies the condition of covering the designated column set C _i among the identifiers _j of the queries q _j included in the query set Q.

図８は、クエリのサブセットＱ_ｉ’の例を示す説明図である。図８には、図５に示すクエリセットＱから抽出された列集合Ｃ_ｉである列集合Ｃ_１およびＣ_２について、クエリセットＱから抽出されたクエリのサブセットＱ_１’およびＱ_２’の例が示されている。 FIG. 8 is an explanatory diagram illustrating an example of a subset Q _i ′ of a query. FIG. 8 shows an example of the subsets Q ₁ ′ and Q ₂ ′ of the query extracted from the query set Q for the column sets C ₁ and C ₂ that are the column set C _i extracted from the query set Q shown in FIG. It is shown.

例えば、列集合抽出手段１２１は、頻出する列集合として列集合Ｃ_１とＣ_２とが抽出されると、再びクエリセットＱを走査して、列集合Ｃ_１または列集合Ｃ_２がｗｈｅｒｅ句の述語に現れているクエリを特定し、特定したクエリの識別子ｊを、該当する列集合の識別子ｉと対応づければよい。 For example, when the column sets C ₁ and C ₂ are extracted as frequent column sets, the column set extraction unit 121 scans the query set Q again, and the column set C ₁ or the column set C ₂ is the where clause. The query appearing in the predicate is specified, and the identifier j of the specified query may be associated with the identifier i of the corresponding column set.

本例では、列集合抽出手段１２１は、列集合Ｃ_１＝｛T.age｝が全て現れているクエリのサブセットＱ_１’として、Ｑ_１’＝｛ｑ１，ｑ２，ｑ３｝を得る。また、列集合抽出手段１２１は、列集合Ｃ_２＝｛T.name｝が全て現れているクエリのサブセットＱ_２’として、Ｑ_２’＝｛ｑ１，ｑ２，ｑ５｝を得る。 In this example, the column set extraction unit 121 obtains Q ₁ ′ = {q1, q2, q3} as a subset Q ₁ ′ of queries in which all column sets C ₁ = {T.age} appear. Further, the column set extraction unit 121 obtains Q ₂ ′ = {q1, q2, q5} as a subset Q ₂ ′ of queries in which all column sets C ₂ = {T.name} appear.

次に、処理コスト推定手段１２の第２の処理コスト算出手段１２４は、メタデータ格納手段２２に格納されている対象データ群のメタデータと、分割配置方法格納手段１２３に格納されている情報に基づいて、ステップＳ１０２で抽出された列集合をパーティションキーとして再分割配置した場合の第２の処理コストを算出する（ステップＳ１０４）。 Next, the second processing cost calculation unit 124 of the processing cost estimation unit 12 uses the metadata of the target data group stored in the metadata storage unit 22 and the information stored in the divided arrangement method storage unit 123. Based on this, a second processing cost is calculated when the column set extracted in step S102 is re-divided and arranged as a partition key (step S104).

ステップＳ１０４では、第２の処理コスト算出手段１２４は、抽出された列集合Ｃ_ｉをそれぞれパーティションキーとした場合の再分割配置の実行にかかる処理コストである分割配置コストとして、ｐａｒｔｉｔｉｏｎ＿ｃｏｓｔ_ｉを算出するとともに、再分割配置後の状態においてクエリｑ_ｊ’∈Ｑ_ｉ’の各々の実行にかかる処理コストである第２のクエリ実行コストとして、ｑｕｅｒｙ＿ｃｏｓｔ＿ｐａｒｔ_ｉｊ’を算出する。 In step S104, the second processing cost calculating unit 124 calculates partition_cost _i as a divided arrangement cost that is a processing cost for executing the re-divided arrangement when each of the extracted column sets C _i is a partition key. At the same time, query_cost_part _{ij ′} is calculated as the second query execution cost that is the processing cost for executing each of the queries q _{j ′} ∈Q _i ′ in the state after the subdivision arrangement.

本例では、１つの列集合Ｃ_ｉに対して、ｐａｒｔｉｔｉｏｎ＿ｃｏｓｔ_ｉと、クエリｑ_ｊ’∈Ｑ_ｉ’の全てのｊ’についてのｑｕｅｒｙ＿ｃｏｓｔ＿ｐａｒｔ_ｉｊ’とを足したものが第２の処理コストとなる。また、クエリｑ_ｊ∈Ｑのうちｊ＝ｊ’となる全てのｊについてのｑｕｅｒｙ＿ｃｏｓｔ＿ｎｏ_ｊを足したものが、当該列集合Ｃ_ｉの第２の処理コストと比較される第１の処理コストとなる。 In the present example, the second processing cost is obtained by adding partition_cost _i and query_cost_part _{ij ′} for all j ′ of the query q _{j ′} ∈Q _i ′ to one column set C _i . Also, the sum of query_cost_no _j for all j with j = j ′ in the query q _j εQ is the first processing cost compared with the second processing cost of the column set C _i . .

第１の処理コストおよび第２の処理コストが算出されると、再分割配置判断手段１３は、これらを比較して、再分割配置するか否か、および再分割配置する場合には再分割配置時に用いるパーティションキーを決定する（ステップＳ１０５〜ステップＳ１１２）。 When the first processing cost and the second processing cost are calculated, the subdivision arrangement determination unit 13 compares these, and determines whether or not subdivision arrangement is performed. A partition key used at times is determined (steps S105 to S112).

本例では、再分割配置判断手段１３は、再分割配置することで、しないときよりも総処理コストが小さくなる列集合Ｃ_ｉがあるかどうかを以下の条件式（１）が真となるｃｏｓｔ＿ｃｕｔ_ｉが存在するかどうかで判断する。なお、条件式（１）におけるｃｏｓｔ＿ｃｕｔ_ｉは、指定の列集合Ｃ_ｉでの再分割配置によるコスト削減量の見積もりに相当する。 In this example, the subdivision arrangement determining unit 13 determines whether there is a column set C _i whose total processing cost is smaller than that when the subdivision arrangement is not performed, and cost_cut in which the following conditional expression (1) is true Judgment is made based on whether _i exists. Note that cost_cut _i in the conditional expression (1) corresponds to an estimation of the amount of cost reduction by subdivision arrangement in the designated column set C _i .

図４に示す例では、再分割配置判断手段１３は、まず列集合Ｃ_ｉの識別子ｉを初期化するとともに、条件式（１）が真となったＣ_ｉを格納するバッファＢを初期化する（ステップＳ１０５）。ステップＳ１０５では、ｉ＝１とし、またＢ＝Φとする。ここで、Φは空集合を表す記号である。なお、本例では、わかりやすいよう識別子ｉの値を１基準としているが、識別子の形式は特に限定されない。 In the example shown in FIG. 4, subdivided rearrangement determining means 13 first is initialized identifier i set of columns C _i, conditional expressions (1) initializes the buffer B for storing a C _i became true (Step S105). In step S105, i = 1 and B = Φ. Here, Φ is a symbol representing an empty set. In this example, the value of the identifier i is set to 1 for easy understanding, but the format of the identifier is not particularly limited.

次に、再分割配置判断手段１３は、識別子ｉが示す列集合Ｃ_ｉについて、上記の条件式（１）を判定する（ステップＳ１０６）。再分割配置判断手段１３は、具体的には、上記の条件式（１）に従い、ｃｏｓｔ＿ｃｕｔ_ｉを算出して、算出されたｃｏｓｔ＿ｃｕｔ_ｉが０以上であるか否かを判定する。条件式（１）が真であれば（ステップＳ１０６のＹｅｓ）、バッファＢに、ｃｏｓｔ＿ｃｕｔ_ｉの算出に用いた列集合Ｃ_ｉまたはその識別子ｉを、算出したｃｏｓｔ＿ｃｕｔ_ｉの値と一緒に格納する（ステップＳ１０７）。一方、偽であれば（ステップＳ１０６のＮｏ）、そのままステップＳ１０８に進む。 Next, the subdivision arrangement determination unit 13 determines the conditional expression (1) for the column set C _i indicated by the identifier i (step S106). Specifically, the subdivision arrangement determination unit 13 calculates cost_cut _i according to the conditional expression (1), and determines whether the calculated cost_cut _i is 0 or more. If conditional expression (1) is true (Yes in step S106), the buffer B, and the column set _{C i,} or the identifier i used to calculate the Cost_cut _i, is stored with the calculated value of the Cost_cut _i ( Step S107). On the other hand, if false (No in step S106), the process directly proceeds to step S108.

ステップＳ１０８では、ｉがインクリメントされる。次いで、再分割配置判断手段１３は、ステップＳ１０９で、ｉが列集合Ｃ_ｉの最大数であるＩを超えたか否か、すなわちステップＳ１０２で抽出された全ての列集合Ｃ_ｉについてステップＳ１０６〜ステップＳ１０７の処理を完了したか否かを判断する。 In step S108, i is incremented. Next, the subdivision arrangement determination means 13 determines whether or not _i exceeds I, which is the maximum number of column sets C _i in step S109, that is, all column sets C _i extracted in step S102. It is determined whether or not the processing of S107 is completed.

再分割配置判断手段１３は、全ての列集合Ｃ_ｉについてステップＳ１０６〜ステップＳ１０７の処理を完了すると（ステップＳ１０９のＹｅｓ）、バッファＢが空でないかを判定する（ステップＳ１１０）。ステップＳ１１０の判定式が真、すなわちバッファＢが空でなければ、再分割配置すると判断して、ステップＳ１１１に進む。一方、ステップＳ１１０の判定式が偽、すなわちバッファＢが空であれば、再分割配置しないと判断する（ステップＳ１１２）。再分割配置判断手段１３は、例えば、再分割配置しないと判断した場合、クエリ実行手段１５にクエリセットＱを渡し、クエリセットＱを実行する命令を発行してもよい。 Subdividing the rearrangement determining means 13 has completed the processing of step S106~ step S107 for all columns set _{C i} (Yes in step S109), determines whether the buffer B is not empty (step S110). If the determination formula in step S110 is true, that is, if the buffer B is not empty, it is determined that subdivision arrangement is performed, and the process proceeds to step S111. On the other hand, if the determination formula in step S110 is false, that is, if the buffer B is empty, it is determined that the subdivision arrangement is not performed (step S112). For example, when it is determined that the subdivision arrangement is not performed, the subdivision arrangement determination unit 13 may pass the query set Q to the query execution unit 15 and issue an instruction for executing the query set Q.

ステップＳ１１１では、再分割配置判断手段１３は、再分割配置すると判断した場合、バッファＢを参照してｃｏｓｔ＿ｃｕｔ_ｉが最大値をとる列集合Ｃ_ｉを選択することにより、選択した列集合Ｃ_ｉをパーティションキーに用いて再分割配置することを決定する。 In step S111, when the subdivision arrangement determination unit 13 determines that subdivision arrangement is to be performed, the column set C _i having the maximum value of cost_cut _i is selected with reference to the buffer B, thereby selecting the selected column set C _i . It is decided to repartition using the partition key.

再分割配置判断手段１３は、例えば、再分割配置実行手段１４にパーティションキーとする列集合の情報とともに再分割配置を実行する命令を発行してもよい。 The subdivision arrangement determination unit 13 may issue, for example, an instruction for executing subdivision arrangement together with column set information as a partition key to the subdivision arrangement execution unit 14.

なお、図４に示す例は、指定の分割配置方法が１つの場合の例であるが、指定の分割配置方法は複数であってもよい。そのような場合には、ステップＳ１０４で、第２の処理コスト算出手段１２４が、列集合Ｃ_ｉと分割配置方法Ｍ_ｈの組ごとに、当該列集合Ｃ_ｉをパーティションキーとし、かつ分割配置方法Ｍ_ｈを用いた場合の分割配置コストであるｐａｒｔｉｔｉｏｎ＿ｃｏｓｔ_ｉｈと、そのような方法を用いた再分割配置後の状態におけるクエリｑ_ｊ’∈Ｑ_ｉ’の各々についての第２のクエリ実行コストであるｑｕｅｒｙ＿ｃｏｓｔ＿ｐａｒｔ_ｉｈｊ’とを算出すればよい。 The example shown in FIG. 4 is an example in which there is one designated divided arrangement method, but there may be a plurality of designated divided arrangement methods. In such a case, in step S104, the second processing cost calculation unit 124 uses the column set C _i as a partition key for each set of the column set C _i and the divided arrangement method M _h and uses the divided arrangement method. Partition_cost _ih , which is the divisional arrangement cost when using M _h, and the second query execution cost for each of the queries q _{j ′} ∈Q _i ′ in the state after subdivision arrangement using such a method query_cost_part _{ihj ′} may be calculated.

その場合、再分割配置判断手段１３は、再分割配置することで、しないときよりも総処理コストが小さくなる列集合Ｃ_ｉと分割配置方法Ｍ_ｈの組があるかどうかを以下の条件式（２）が真となるｃｏｓｔ＿ｃｕｔ_ｉｈが存在するかどうかで判断すればよい。条件式（１）におけるｃｏｓｔ＿ｃｕｔ_ｉｈは、指定の列集合Ｃ_ｉおよび分割配置方法Ｍ_ｈでの再分割配置によるコスト削減量の見積もりに相当する。 In that case, the subdivision arrangement determination means 13 determines whether there is a set of the column set C _i and the subdivision arrangement method M _h that reduces the total processing cost by performing the subdivision arrangement as compared with the case where it is not. The determination may be made based on whether or not cost_cut _{ih in} which 2) is true exists. Cost_cut _ih the conditional expression (1) corresponds to estimation of cost reductions by subdivision located in the specified set of columns _{C i} and the division arrangement method _{M h.}

そして、再分割配置判断手段１３は、条件式（２）が真となるｃｏｓｔ＿ｃｕｔ_ｉｈが存在した場合には、ｃｏｓｔ＿ｃｕｔ_ｉｈが最大値をとる列集合Ｃ_ｉと分割配置方法Ｍ_ｈの組を選択することにより、選択した列集合Ｃ_ｉをパーティションキーに用い、かつ選択した分割配置方法Ｍ_ｈを用いて再分割配置することを決定する。 The subdivision rearrangement determining means 13, when the Cost_cut _ih the conditional expression (2) is true is present, the Cost_cut _ih selects the set of column sets _{C i} and split arrangement method _{M h} having the maximum value As a result, it is determined that the selected column set C _i is used as the partition key and re-divided using the selected divided arrangement method M _h .

なお、第２の処理コスト算出手段１２４は、上記の方法以外にも、例えば、予め所定の分割方法で固定して第２のクエリ実行コストを算出し、再分割配置判断手段１３が、コスト削減量に基づいて、再分割配置するか否かを決定するとともに、再分割配置する場合には、算出された第２のクエリ実行コストの中から高コストとなった述語の演算の種類およびクエリ内におけるそれら演算の出現頻度とに基づいて、パーティションキーと分割配置方法とを決定してもよい。 In addition to the above method, the second processing cost calculation unit 124 calculates the second query execution cost by fixing in advance with a predetermined division method, for example, and the subdivision arrangement determination unit 13 reduces the cost. Based on the amount, it is determined whether or not subdivision placement is to be performed, and in the case of subdivision placement, the type of predicate operation that has become expensive from the calculated second query execution cost and within the query The partition key and the division arrangement method may be determined based on the appearance frequency of those operations.

例えば、再分割配置判断手段１３は、クエリセット内のクエリにある列集合に対して一致条件（＝や≠）で選択演算を行う際に高コストとなるクエリが、他の演算を行う際に高コストとなるクエリと比べて多いとき、パーティションキーとして当該列集合と、列値が同じ多くの行データを同一ブロックに配置する分割法とを選択してもよい。また、例えば、再分割配置判断手段１３は、クエリセット内のクエリにある列集合に対して範囲条件（＞や＜）で選択演算を行う際に高コストとなるクエリが、他の演算を行う際に高コストとなるクエリと比べて多いとき、パーティションキーとして当該列集合と、全ブロックに渡り列値で全行データがソートされるような分割方法とを選択してもよい。また、例えば、再分割配置判断手段１３は、クエリセット内のクエリにある列集合に対して結合演算を行う高コストとなるクエリが、他の演算を行う際に高コストとなるクエリと比べて多いとき、パーティションキーとして当該列集合と、全ブロックに渡り列値で全行データがソートされるような分割方法とを選択してもよい。 For example, the subdivision arrangement determination unit 13 is used when a query that is expensive when performing a selection operation on a set of columns in a query in a query set with a matching condition (= or ≠) performs another operation. When there are a large number of queries compared to a high-cost query, the partition set and a partitioning method in which many row data having the same column value are arranged in the same block may be selected. Further, for example, the subdivision arrangement determining unit 13 performs another operation when a query that is expensive when performing a selection operation with a range condition (> or <) on a column set in a query in a query set. When the number of queries is high compared to a high-cost query, the column set may be selected as a partition key and a partitioning method in which all row data is sorted by column values over all blocks. Further, for example, the subdivision arrangement determination unit 13 compares the query that is expensive to perform the join operation on the column set in the query in the query set with the cost that is expensive when performing other operations. In many cases, the column set and a partitioning method that sorts all row data by column values over all blocks may be selected as a partition key.

また、上記の例では、列集合抽出手段１２１が、パーティションキーの候補とされる列集合を抽出する処理（ステップＳ１０２）で、クエリセット内の述語（選択述語または結合述語）への出現頻度が所定の頻度以上であることを要件の１つとしたが、クエリセット内の述語への出現頻度を見ずに、クエリセット内の選択述語または結合述語に出現している列集合全てを抽出することも可能である。そのような場合には、θ＝０とすればよい。 In the above example, the column set extraction unit 121 extracts a column set that is a partition key candidate (step S102), and the appearance frequency of the predicate (selection predicate or join predicate) in the query set is as follows. One of the requirements is that the frequency is equal to or higher than a predetermined frequency, but all column sets that appear in the selection predicates or join predicates in the query set are extracted without looking at the frequency of appearance in the predicates in the query set. Is also possible. In such a case, θ = 0 may be set.

ただし、列集合の総数が多い場合は計算時間が非常に大きくなるため、適切な閾値が設定されるのが好ましい。ただし、出現頻度の閾値を高くしすぎると、コストの大きな低頻度の列集合を見落とす可能性があることに注意が必要である。 However, when the total number of column sets is large, the calculation time becomes very long, so it is preferable to set an appropriate threshold value. However, it should be noted that if the threshold value of the appearance frequency is set too high, a low-frequency column set having a large cost may be overlooked.

本実施形態では、クエリセットから所定の頻度以上の述語を抽出してパーティションキーの候補を得るのではなく、クエリセットから所定の頻度以上の列集合を抽出してパーティションキーの候補を得ている。このため、コストの大きな低頻度の列集合を見落とす可能性は、述語の頻度に基づく方法よりは低くなると考えられる。一般に、列と述語とでは、列のほうが粒度（granularity）の粗い概念になるため、あるクエリ集合においてある列を含む述語の頻度が極端に低くても、列の頻度は比較的大きくなる傾向がある。また、あるクエリ集合において述語に用いられる列集合の種類数は述語の種類数より小さくなる場合が多い。このため、見落とされる列集合の数は見落とされる述語の数より小さいことが期待できる。 In this embodiment, rather than extracting a predicate having a predetermined frequency or higher from a query set to obtain a partition key candidate, a column set having a predetermined frequency or higher is extracted from the query set to obtain a partition key candidate. . For this reason, it is considered that the possibility of overlooking a low-frequency column set having a large cost is lower than the method based on the predicate frequency. In general, because columns and predicates have a coarser granularity concept, the frequency of columns tends to be relatively large even if the frequency of predicates that include a column in a query set is extremely low. is there. In many query sets, the number of types of column sets used for predicates is smaller than the number of types of predicates. For this reason, it can be expected that the number of column sets that are overlooked is smaller than the number of predicates that are overlooked.

次に、処理コストの算出方法について具体例を用いて説明する。本例は、分割配置方法として、対象データ群全体をパーティションキーで指定された列集合の値でソートした後に指定の数のブロックに等分割して配置する方法（ソート分割法）を採用したときの処理コストの算出方法の例である。本例でも、図５に示すクエリセットＱを例に用いる。 Next, a method for calculating the processing cost will be described using a specific example. In this example, when the entire target data group is sorted by the value of the column set specified by the partition key and then equally divided into the specified number of blocks (sort partition method) This is an example of a method for calculating the processing cost. Also in this example, the query set Q shown in FIG. 5 is used as an example.

図９は、対象データ群の分割配置例を示す説明図である。なお、図９には、対象データ群のうちテーブル“T”に属する行データが２つのデータブロックに分割配置される例が示されている。なお、テーブル“T”は、データ属性に対応する列として、“name”列、“age”列、“sex”列の３つの列を有している。 FIG. 9 is an explanatory diagram illustrating an example of a divided arrangement of the target data group. FIG. 9 shows an example in which row data belonging to the table “T” in the target data group is divided and arranged into two data blocks. The table “T” has three columns, “name” column, “age” column, and “sex” column, as columns corresponding to the data attribute.

今、第１のデータブロックＴＢ１には、列“name”の値がそれぞれ“suzuki”，“yamada”，“aida”である３つの行データが含まれている。また、第２のデータブロックＴＢ２には、列“name”の値が“kuroda”，“hirata”，“satou”である３つの行データが含まれている。また、メタデータ格納手段２２には、各データブロックのブロック統計情報として、当該データブロック内における各列の最小値“min”と最大値“max”が少なくとも格納されている。図９に示す例では、テーブル“T”は、例えば行データが登録された順番のまま２つのデータブロック（ＴＢ１，ＴＢ２）に等分割されて配置されている。 Now, the first data block TB1 includes three row data whose values of the column “name” are “suzuki”, “yamada”, and “aida”, respectively. The second data block TB2 includes three row data whose column “name” values are “kuroda”, “hirata”, and “satou”. The metadata storage unit 22 stores at least the minimum value “min” and the maximum value “max” of each column in the data block as block statistical information of each data block. In the example illustrated in FIG. 9, the table “T” is arranged, for example, by being equally divided into two data blocks (TB1, TB2) in the order in which the row data is registered.

また、図１０および図１１は、対象データ群の他の分割配置例を示す説明図である。図１０には、パーティションキーとして｛T.age｝を用い、かつソート分割法を用いてテーブル“T”を２つのデータブロック（ＴＢ１，ＴＢ２）に分割配置した例が示されている。また、図１１には、パーティションキーとして｛T.sex, T.name｝を用い、かつソート分割法を用いてテーブル“T”を２つのデータブロック（ＴＢ１，ＴＢ２）に分割配置した例が示されている。 FIGS. 10 and 11 are explanatory diagrams illustrating other examples of divided arrangement of the target data group. FIG. 10 shows an example in which {T.age} is used as the partition key and the table “T” is divided and arranged into two data blocks (TB1, TB2) using the sort division method. FIG. 11 shows an example in which {T.sex, T.name} is used as the partition key and the table “T” is divided and arranged into two data blocks (TB1, TB2) using the sort partition method. Has been.

なお、分割配置を行う際の分割方法は、ソート分割法だけではない。例えば、パーティションキーとして指定された列集合の値からハッシュ値を生成し、同値のハッシュ値を持つブロックに行データを割り当てる方法（ハッシュ分割法）なども一般的な方法である。分割配置アルゴリズムごとに分割配置コスト（ｐａｒｔｉｔｉｏｎ_ｃｏｓｔ）は異なる。例えば、ソート分割法では典型的なソートアルゴリズムを用いる場合、表データの行数Ｎに対して、Ｏ（ＮｌｏｇＮ）＋Ｏ（Ｎ）の分割配置コストがかかるが、ハッシュ分割法では、Ｏ（Ｎ）の分割配置コストとなる。 It should be noted that the division method for performing the divided arrangement is not limited to the sort division method. For example, a general method is a method in which a hash value is generated from a column set value designated as a partition key and row data is allocated to a block having the same hash value (hash partitioning method). The division arrangement cost (partition_cost) is different for each division arrangement algorithm. For example, when a typical sort algorithm is used in the sort partition method, a partition arrangement cost of O (NlogN) + O (N) is required for the number of rows N of the table data. In the hash partition method, O (N) It becomes the division arrangement cost.

次に、クエリ実行コストの算出例を示す。各クエリのコスト計算は、データベース技術分野における、コストに基づいたクエリ最適化を行うことと等価である。一般に、コストに基づいたクエリ最適化では、与えられたクエリについて考えられる全てのクエリ実行計画を列挙する。 Next, an example of calculating the query execution cost is shown. Cost calculation for each query is equivalent to performing cost-based query optimization in the database technology field. In general, cost-based query optimization enumerates all possible query execution plans for a given query.

図１２に、図５に示されるクエリｑ１について考えられるクエリ実行計画の例として、３種のクエリ実行計画木を示す。各クエリ実行計画木は、葉ノードにテーブルを持つ。また、各クエリ実行計画木の中間ノードは、対象クエリのオペレーション（選択、結合、射影）に対応している。クエリ実行計画木では、左側の葉ノードから順番に処理が実行され、ルートノードにおいて結果が算出されることを意味する。図中のσ記号はオペレーションが選択演算であることを表している。また、リボン状の記号は、オペレーションが結合演算であることを表している。また、各オペレーション記号の添え字はそれぞれ選択条件、結合条件を表している。 FIG. 12 shows three types of query execution plan trees as examples of possible query execution plans for the query q1 shown in FIG. Each query execution plan tree has a table at a leaf node. Further, the intermediate node of each query execution plan tree corresponds to the operation (selection, combination, projection) of the target query. In the query execution plan tree, this means that processing is executed in order from the left leaf node, and the result is calculated at the root node. The symbol σ in the figure indicates that the operation is a selection operation. A ribbon-like symbol indicates that the operation is a join operation. The subscripts of each operation symbol represent the selection condition and the combination condition, respectively.

第１の処理コスト算出手段１２２および第２の処理コスト算出手段１２４は、このようなクエリ実行計画木を基に、各クエリ実行計画を実行する際のクエリ実行コストを計算して、最も処理コストの小さいクエリ実行計画のクエリ実行コストを、当該クエリのクエリ実行コストに採用してもよい。 The first processing cost calculation unit 122 and the second processing cost calculation unit 124 calculate the query execution cost for executing each query execution plan based on such a query execution plan tree, and obtain the most processing cost. The query execution cost of a small query execution plan may be adopted as the query execution cost of the query.

各クエリのクエリ実行コストは、メタデータ格納手段２２に格納されている列ごとの値の選択率やヒストグラム等のメタデータから算出できる。 The query execution cost of each query can be calculated from metadata such as a value selection rate and a histogram stored for each column stored in the metadata storage unit 22.

例えば、クエリセットＱにおいて頻出な列集合Ｃ_ｉとして、列集合Ｃ_１とＣ_２とが与えられたとする。なお、列集合Ｃ_１＝｛T.age｝をカバーするクエリｑ_ｉｊ’はｑ１，ｑ２，ｑ３である。 For example, it is assumed that column sets C ₁ and C ₂ are given as frequent column sets C _i in the query set Q. Note that the query q _{ij ′} covering the column set C ₁ = {T.age} is q1, q2, and q3.

今、第２の処理コスト算出手段１２４が、列集合Ｃ_１をパーティションキーに用いて再分割配置後の状態におけるクエリｑ１の第２のクエリ実行コストであるｑｕｅｒｙ＿ｃｏｓｔ＿ｐａｒｔ_１１を算出することを考える。図１２に示す例では第３の計画が最も小さなクエリ実行コストとなる。なぜなら結合演算と選択演算が混在する場合、先に選択演算を行うほうが効率的であるため、第１の計画より第２の計画および第３の計画のほうが低コストとなる。 Now, the second processing cost calculation means 124, consider calculating the Query_cost_part ₁₁ is a second query execution cost query q1 in the state after the subdivision placed using a column set _{C 1} in the partition key. In the example shown in FIG. 12, the third plan has the lowest query execution cost. This is because when the combination operation and the selection operation are mixed, it is more efficient to perform the selection operation first, so that the second plan and the third plan are less expensive than the first plan.

また、本例の場合、テーブル“T”は列集合Ｃ_１＝｛T.age｝をパーティションキーとしてソート分割法で再分割配置されている。すなわち、図１０に示す状態にある。すると、選択演算“T.age>30”に対して、事前にブロック統計情報を参照することにより、第１のデータブロックへの不要な読み込みを阻止できる。したがって、第１の計画より第３の計画の方が処理時間が短くなる。 In this example, the table “T” is re-divided and arranged by the sort division method with the column set C ₁ = {T.age} as the partition key. That is, it is in the state shown in FIG. Then, unnecessary reading into the first data block can be prevented by referring to the block statistical information in advance for the selection operation “T.age> 30”. Accordingly, the processing time is shorter in the third plan than in the first plan.

本例では、第３の計画が選ばれ、第３の計画による第２のクエリ実行コストがｑｕｅｒｙ＿ｃｏｓｔ＿ｐａｒｔ_１１として使用される。クエリｑ２，ｑ３についても同様に、ｑｕｅｒｙ＿ｃｏｓｔ＿ｐａｒｔ_１２，ｑｕｅｒｙ＿ｃｏｓｔ＿ｐａｒｔ_１３が求められる。 In this example, the third plan is selected, and the second query execution cost based on the third plan is used as the query_cost_part ₁₁ . Similarly, query_cost_part ₁₂ and query_cost_part ₁₃ are also obtained for the queries q2 and q3.

なお、再分割配置を行わない場合のクエリ実行コストの算出方法も基本的には同様である。すなわち、第１の処理コスト算出手段１２２は、対象クエリｑ_ｊのクエリ実行計画木を基に、現在の再部活配置において最も処理コストの小さいクエリ実行計画のクエリ実行コストを、対象クエリｑ_ｊの第１のクエリ実行コストであるｑｕｅｒｙ＿ｃｏｓｔ＿ｎｏ_ｊに採用する。 Note that the method for calculating the query execution cost when the subdivision arrangement is not performed is basically the same. That is, the first processing cost calculation means 122, based on the query plan tree of the target query q _j, the query execution cost of the most processing cost small query plan for the current re club arrangement, the target query q _j This is adopted as query_cost_no _j which is the first query execution cost.

ただし、第１の処理コスト算出手段１２２は、各クエリ実行計画を実行する際のクエリ実行コストを計算する際、現在の分割配置の状態におけるクエリ実行コストを計算する。本例の場合、第１の処理コスト算出手段１２２は、テーブル“T”が図１０に示す状態にあるとして、クエリ実行計画木のクエリ実行コストを計算する。なお、図１０に示す状態は、テーブル“T”が｛T.age｝でソートされずに分割された例である。このため、例えばクエリｑ１に対しては、図１２に示した計画のうちの第１の計画および第２の計画の２種類が考えられるクエリ実行計画となる。したがって、第２の計画が最も効率的であるとして選択されて、現在の分割配置における第２の計画のクエリ実行コストがｑｕｅｒｙ＿ｃｏｓｔ＿ｎｏ_１となる。第１の処理コスト算出手段１２２は、他のクエリｑ_ｊについても同様に、ｑｕｅｒｙ＿ｃｏｓｔ＿ｎｏ_ｊを計算する。このようにして、列集合Ｃ_１，列集合Ｃ_２に対応したｃｏｓｔ＿ｃｕｔ_ｉやｃｏｓｔ＿ｃｕｔ_ｉｈを算出するための各種の処理コストを得る。 However, the first processing cost calculation unit 122 calculates the query execution cost in the current divided arrangement state when calculating the query execution cost when executing each query execution plan. In the case of this example, the first processing cost calculation unit 122 calculates the query execution cost of the query execution plan tree assuming that the table “T” is in the state shown in FIG. The state shown in FIG. 10 is an example in which the table “T” is divided without being sorted by {T.age}. Therefore, for example, the query q1 is a query execution plan in which two types of plans, the first plan and the second plan, shown in FIG. 12 are considered. Therefore, the second plan is selected as the most efficient, and the query execution cost of the second plan in the current split arrangement is query_cost_no ₁ . The first processing cost calculation unit 122 calculates query_cost_no _j in the same manner for other queries q _j . In this way, various processing costs for calculating cost_cut _i and cost_cut _ih corresponding to the column set C ₁ and the column set C ₂ are obtained.

再分割配置判断手段１３は、このようにして得られた各種の処理コストを、例えば条件式（１）に代入して、列集合Ｃ_ｉの各々についてのｃｏｓｔ＿ｃｕｔ_ｉを得る。 The subdivision arrangement determination unit 13 substitutes the various processing costs obtained in this way into, for example, the conditional expression (1) to obtain cost_cut _i for each column set C _i .

以下に、クエリ実行コストの算出例を示す。本例は、上述したクエリｑ１の第３の計画に対するクエリ実行コストの算出例である。クエリｑ１は選択演算と結合演算からなるクエリであるため、クエリ実行コストは、選択演算の処理コストと結合演算の処理コストの和になる。処理コストを表す指標はＣＰＵコストやＩ／Ｏコストなど複数考えられるが、ここではＩ／Ｏコストのみ考慮する。Ｉ／ＯにはディスクのＩ／ＯとネットワークのＩ／Ｏの２種類があるが、ここではこれらの負荷はデータブロックの格納先にかかわらずみな同量であると仮定し、ディスクのＩ／Ｏコストのみを算出対象する。 An example of calculating the query execution cost is shown below. This example is a calculation example of the query execution cost for the third plan of the query q1 described above. Since the query q1 is a query including a selection operation and a join operation, the query execution cost is the sum of the processing cost of the selection operation and the processing cost of the join operation. There are a plurality of indexes representing the processing cost, such as CPU cost and I / O cost, but only I / O cost is considered here. There are two types of I / O: disk I / O and network I / O. Here, it is assumed that these loads are the same regardless of the storage location of the data block. Only O cost is calculated.

本例では、選択演算および結合演算の処理コストを、当該演算でアクセスされるデータブロックに含まれる行数の和で表す。データブロックにアクセスすることは、ディスクのＩ／Ｏが行われることを意味する。また、本例では、索引（インデックス）は存在しないものとする。 In this example, the processing cost of the selection operation and the join operation is represented by the sum of the number of rows included in the data block accessed by the operation. Accessing the data block means that disk I / O is performed. In this example, it is assumed that no index exists.

クエリｑ１の第３の計画では、まず第１のオペレーションとして選択演算が実行される。より具体的には、テーブル“T”に対して、条件“age>30”を指定した選択演算が実行される。インデックスがない場合、条件に用いられている列集合（本例の場合、｛T.age｝）でソートされていないテーブルへの選択演算では、全てのデータブロックにアクセスする必要がある。すなわち、再分割配置しない場合の選択演算の処理コストは６となる。 In the third plan of query q1, a selection operation is first executed as the first operation. More specifically, the selection calculation specifying the condition “age> 30” is executed for the table “T”. When there is no index, it is necessary to access all data blocks in a selection operation on a table that is not sorted by the column set used in the condition (in this example, {T.age}). In other words, the processing cost of the selection calculation when the subdivision arrangement is not performed is 6.

一方、条件に用いられている列集合をパーティションキーとしてソート分割法で再分割配置した場合の選択演算の処理コストは、次のようになる。条件に用いられている列集合の値の順にソートされた後で分割されている場合、ブロック統計情報を参照して不要なデータブロックへのアクセスを避けることができる。このため、選択演算の処理コストの計算では、まず当該選択演算においてアクセスされるであろうデータブロックの数（以下、アクセスブロック数という。）を見積もる。 On the other hand, the processing cost of the selection operation when the column set used for the condition is repartitioned by the sort partition method with the partition key as follows is as follows. When the data is divided after being sorted in the order of the values of the column set used in the condition, access to unnecessary data blocks can be avoided by referring to the block statistical information. For this reason, in the calculation of the processing cost of the selection calculation, first, the number of data blocks that will be accessed in the selection calculation (hereinafter referred to as the number of access blocks) is estimated.

例えば、第２の処理コスト算出手段１２４は、メタデータとしてメタデータ格納手段２２に格納されている各列の値のヒストグラムに基づいて、条件に合致する列の値を持つ行データが全体の何％を占めるかを計算してもよい。 For example, the second processing cost calculation unit 124 calculates the total number of row data having column values that match the conditions based on a histogram of column values stored in the metadata storage unit 22 as metadata. % May be calculated.

図１３に、本例のテーブル“T”における列”age”のヒストグラムを示す。本例では、メタデータ格納手段２２から、テーブル“T”における列”age”のヒストグラムとして図１３に示されるヒストグラムが得られたとする。図１３に示すヒストグラムによれば、列”age”の値が３０よりも大きい値を持つ行データが全体の５０％存在することを事前に（再分割配置する前に）知ることができる。 FIG. 13 shows a histogram of the column “age” in the table “T” of this example. In this example, it is assumed that the histogram shown in FIG. 13 is obtained from the metadata storage unit 22 as the histogram of the column “age” in the table “T”. According to the histogram shown in FIG. 13, it can be known in advance (before the subdivision arrangement) that 50% of the row data having a value of the column “age” larger than 30 exists.

以下、本例においてこの５０％を選択率と呼び、πで表す。これにより、該選択演算によるアクセスブロック数は、次のような演算により求まる。 Hereinafter, this 50% is referred to as selectivity in this example, and is represented by π. Thereby, the number of access blocks by the selection calculation is obtained by the following calculation.

アクセスブロック数＝ceil(行の総数×選択率π)÷floor(行の総数÷データブロック数)
・・・（３） Number of access blocks = ceil (total number of rows x selection rate π) ÷ floor (total number of rows ÷ number of data blocks)
... (3)

ここで、ｃｅｉｌ（）は小数点以下を切り上げる関数を表している。また、ｆｌｏｏｒ（）は、小数点以下を切り捨てる関数を表している。 Here, ceil () represents a function that rounds up the decimal point. Further, floor () represents a function for truncating after the decimal point.

式（３）によれば、クエリｑ１の第３の計画における第１のオペレーションである選択演算によるアクセスブロック数＝１と見積もられる。 According to Expression (3), it is estimated that the number of access blocks by the selection operation that is the first operation in the third plan of the query q1 = 1.

また、このとき、該選択演算によって選択される行数は、行の総数×選択率π＝３であることも分かる。 Also, at this time, it can be seen that the number of rows selected by the selection calculation is the total number of rows × selectivity π = 3.

第３の計画では次に、第２のオペレーションとして結合演算が実行される。なお、結合演算の処理コストは、｛T.age｝でソート分割されている場合とされていない場合とで変わらない。今、対象データ群に、テーブル“R”の行データとして、列“age”を持つ６個の行データが含まれており、３行ずつ２個のデータブロック（データブロックＲＢ１、データブロックＲＢ２）に分割されて格納されていると仮定する。 Next, in the third plan, the join operation is executed as the second operation. It should be noted that the processing cost of the join operation does not change depending on whether the sorting is performed with {T.age} or not. Now, the target data group includes 6 row data having the column “age” as the row data of the table “R”, and 2 data blocks (data block RB1, data block RB2) every 3 rows. It is assumed that the data is divided and stored.

分散データベースシステムにおいて結合演算でよく知られたアルゴリズムは、Ｍａｐ−ｓｉｄｅ−ｊｏｉｎと呼ばれ、次のような流れで処理を行うアルゴリズムである。 An algorithm that is well-known for join operations in a distributed database system is called Map-side-join, and is an algorithm that performs processing in the following flow.

当該クエリの結合演算を並列に行うノードがｎ個あるとする。一般に、データ格納手段の各々には、当該データ格納手段に格納されているデータブロックに対するデータ操作を取りまとめて行うノード（計算機）が対応づけられている。まず、第１のステップで、データベース管理システムが、より小さなテーブル“T”をｎ個のノード全てにコピーする。第２のステップで、ｎ個のノードが、より大きなテーブル“R”の行データを含むデータブロックを、ノード間で重複がないように読み込み、読み込んだテーブル“R”のデータブロックと、先のステップでコピーされたテーブル“T”とを結合する。第３のステップで、データベース管理システムが、各ノードの結合結果をマージして出力する。 Assume that there are n nodes that perform the join operation of the query in parallel. In general, each data storage means is associated with a node (computer) that collectively performs data operations on the data blocks stored in the data storage means. First, in a first step, the database management system copies a smaller table “T” to all n nodes. In the second step, n nodes read the data block including the row data of the larger table “R” so that there is no duplication between the nodes, and the data block of the read table “R” Join table “T” copied in step. In the third step, the database management system merges and outputs the join results of the nodes.

第３の計画の第２のオペレーションとして、第１のオペレーションによるテーブル“T”についての選択結果と、テーブル“R”との結合を行う場合、ｎ＝２とすると、まず第１のオペレーションによるテーブル“T”についての選択結果である３個の行からなるテーブル“T'”が各ノード（ノードｎ１およびノードｎ２）に読み込まれる。次に、ノードｎ１がテーブル“R”のデータブロックであるデータブロックＲＢ１を読み込んでテーブル“T'”と結合するとともに、ノードｎ２がテーブル“R”のデータブロックであるデータブロックＲＢ２を読み込んでテーブル“T'”と結合する。このとき、結合演算の処理コストは、テーブル“T'”のスキャンにかかる処理コストと、テーブル“R”の最大行数を持つデータブロックをスキャンするのにかかる処理コストである。 As the second operation of the third plan, when the selection result for the table “T” by the first operation is combined with the table “R”, if n = 2, the table by the first operation is first set. A table “T ′” consisting of three rows as a selection result for “T” is read into each node (node n1 and node n2). Next, the node n1 reads the data block RB1 which is a data block of the table “R” and combines it with the table “T ′”, and the node n2 reads the data block RB2 which is a data block of the table “R”. Combine with “T '”. At this time, the processing cost of the join operation is the processing cost for scanning the table “T ′” and the processing cost for scanning the data block having the maximum number of rows in the table “R”.

したがって、クエリｑ１の第３の計画における第２のオペレーションである結合演算による処理コストは、３＋ｃｅｉｌ（６÷２）＝６と見積もられる。 Therefore, the processing cost by the join operation, which is the second operation in the third plan of the query q1, is estimated as 3 + ceil (6 ÷ 2) = 6.

以上から、現状の分割配置すなわち列“age”でテーブル“T”を分割配置されていない状態におけるクエリｑ１のクエリ実行コスト（第１のクエリ実行コスト）であるｑｕｅｒｙ＿ｃｏｓｔ＿ｎｏ_１は６＋６＝１２となる。また、列“age”でテーブル“T”を再分割配置した状態におけるクエリｑ１のクエリ実行コスト（第２のクエリ実行コスト）であるｑｕｅｒｙ＿ｃｏｓｔ＿ｎｏ_１１は、３＋６＝９となる。他のクエリについても同様の方法で、クエリ実行コストを計算できる。 From the above, query_cost_no ₁ which is the query execution cost (first query execution cost) of the query q1 in the state where the table “T” is not divided and arranged in the current divided arrangement, that is, the column “age” is 6 + 6 = 12. Further, the query_cost_no ₁₁ that is the query execution cost (second query execution cost) of the query q1 in a state where the table “T” is rearranged in the column “age” is 3 + 6 = 9. Query execution costs can be calculated for other queries in the same way.

なお、上記の例では、ディスクのＩ／Ｏの負荷およびネットワークのＩ／Ｏの負荷は、データブロックの格納先にかかわらず同量であると仮定したが、データブロックの格納先によってこれらの負荷が異なってくる場合には、各データ格納手段を制御する各計算機のディスクバンド幅やそれら計算機間を接続するネットワークバンド幅等の当該分散データベースの構成要素の処理性能に関する情報を基に、データブロックの格納先に応じた重みを設定して、クエリ実行コストを算出してもよい。 In the above example, it is assumed that the I / O load on the disk and the I / O load on the network are the same regardless of the storage location of the data block, but these loads depend on the storage location of the data block. Data blocks based on information on the processing performance of the constituent elements of the distributed database, such as the disk bandwidth of each computer that controls each data storage means and the network bandwidth that connects these computers. The query execution cost may be calculated by setting a weight according to the storage location.

以上のように、本実施形態によれば、第一に、クエリの述語がデータ操作に与える処理コストを考慮したデータ分割配置が可能となり、演算の処理コストが問題にならないような述語が頻繁にクエリに現れている場合に、真にボトルネックとなっている述語が無視されることが回避できる。第二に、選択述語のみならず結合述語のコストも対象にし、真にコストの大きな述語を効率化するデータ分割配置のためのパーティションキーを設定できる。第三に、データの分割配置をやり直した場合のクエリセットの実行にかかる総処理時間が削減できる見込みを計算することで、再分割配置することで逆に総処理時間が増えてしまう場合を回避できる。したがって、複数のクエリや述語の演算に時間のかかるクエリを実行して初めて１つの仕事が完了する属性生成処理といった、クエリの発行を伴うワークロードの効率化が図れる。 As described above, according to the present embodiment, firstly, it is possible to perform the data division arrangement considering the processing cost given to the data operation by the query predicate, and the predicate in which the processing cost of the operation does not become a problem frequently. It can be avoided that predicates that are truly bottlenecks are ignored when they appear in the query. Secondly, it is possible to set a partition key for data division arrangement that makes the cost of not only the selection predicate but also the join predicate a target and makes the predicate with a truly large cost efficient. Third, by calculating the probability that the total processing time required to execute the query set when the data division arrangement is redone will be avoided, avoiding the case where the total processing time increases conversely due to the subdivision arrangement it can. Therefore, it is possible to improve the efficiency of a workload that involves issuing a query, such as an attribute generation process in which one work is completed only after executing a query that takes a long time to calculate a plurality of queries and predicates.

特に、分散データベースシステムが、分割したデータブロックに対してＯＲＣＦｉｌｅのようなブロック統計情報を所持する場合において、一連のクエリの実行にかかる総処理時間の減少が期待できる。 In particular, when the distributed database system possesses block statistical information such as ORFFile for the divided data blocks, it can be expected that the total processing time required for executing a series of queries will be reduced.

また、本実施形態によれば、例えば、大量クエリの中で１つ以上の列に対する高コストな選択演算や結合演算が出現する場合に、当該１つ以上の列に対してデータの再分割配置を行ってから大量のクエリを実行したほうが再分割配置しない場合に比べて総処理時間が減る見込みがあれば、当該１つ以上の列のうち最もコスト削減量が見込まれる列集合からデータ再分割配置を行うためのパーティションキーを生成できるので、一連のクエリの実行にかかる総処理時間が最小化されることが期待できる In addition, according to the present embodiment, for example, when a high-cost selection operation or join operation for one or more columns appears in a large amount of queries, data subdivision arrangement for the one or more columns If there is a possibility that the total processing time will be reduced when a large number of queries are executed after performing the query, compared to the case where the re-partitioning is not performed, the data is subdivided from the column set that is expected to reduce the most cost among the one or more columns. Since a partition key can be generated for placement, the total processing time required to execute a series of queries can be expected to be minimized.

なぜなら、クエリの述語に頻出する列集合に注目するだけでなく、その列集合を述語に含むクエリのコストに注目し、クエリセットの総処理コストが最小になるような再分割配置を行うからである。そのような再分割配置が行えるのは、式（１）等により算出されるコスト削減量に基づくパーティションキーや分割配置方法の決定による。コスト削減量に基づいて再分割配置のパーティションキーや分割配置方法を決定するので、計算コストが問題にならないような述語が頻繁にクエリに現れている場合に、真に計算のボトルネックとなっている述語が無視される問題が回避できる。また、クエリの述語ではなく述語に登場する列集合に注目することで、選択述語のみならず結合述語のコストも対象にでき、真にコストの大きな述語を効率化するデータ分割配置のためのパーティションキーを抽出できる。また、再分割配置判断手段を設けることで、不要な再分割配置を実行してしまうリスクを回避できる。 This is because not only the column set that frequently appears in the query predicate but also the cost of the query that includes the column set in the predicate, and the subdivision arrangement that minimizes the total processing cost of the query set is performed. is there. Such subdivision arrangement can be performed by determining the partition key and the division arrangement method based on the cost reduction amount calculated by the equation (1) or the like. Since the partition key and the partition placement method for subdivision placement are determined based on the amount of cost reduction, if a predicate that does not pose a problem in computation cost frequently appears in the query, it is truly a bottleneck for computation. The problem that predicates are ignored can be avoided. In addition, by focusing on the column set that appears in the predicate rather than the query predicate, it is possible to target not only the selection predicate but also the cost of the join predicate. The key can be extracted. Moreover, the risk of executing unnecessary subdivision arrangement can be avoided by providing subdivision arrangement judgment means.

なお、上記の実施形態では、分散データベースシステムが、受け付けたクエリセットに対して、データの再分割配置をするか否かおよびその方法を判断する例を示したが、データの再分割配置をするか否かを判断する手段または装置は、分散データベースシステムが備える制御装置でなくてもよい。例えば、ユーザは分散データベースシステムにクエリセットを発行する前に、別途用意されたデータ再分割配置制御装置にクエリセットを発行して、データ再分割配置制御装置が受け付けたクエリセットに対して、分散データベースシステムに格納されているデータ群に対して再分割配置をするか否かおよびその方法を判断して、判断結果に基づいて、分散データベースシステムに分割配置の命令を発行したり、クエリセットを転送してもよい。 In the above embodiment, an example has been described in which the distributed database system determines whether or not to repartition and arrange the data for the accepted query set. The means or device for determining whether or not the distributed database system does not have to be a control device. For example, before issuing a query set to a distributed database system, a user issues a query set to a separately prepared data subdivision arrangement control device and distributes the query set received by the data subdivision arrangement control device. Determine whether and how to re-partition the data group stored in the database system, and issue a split placement instruction to the distributed database system, It may be transferred.

すなわち、上記の実施形態で示したクエリセット受付手段、処理コスト推定手段、再分割配置手段、クエリ実行手段、再分割配置手段、メタデータ記憶手段は、それぞれ別々のユニットとして実現されていてもよい。また、処理コスト推定手段に含まれるとした列集合抽出手段、第１の処理コスト算出手段、分割配置方法格納手段、第２の処理コスト算出手段も、それぞれ別々のユニットとして実現されていてもよい。 That is, the query set receiving means, processing cost estimation means, subdivision arrangement means, query execution means, subdivision arrangement means, and metadata storage means described in the above embodiment may be realized as separate units. . Also, the column set extraction means, the first processing cost calculation means, the divided arrangement method storage means, and the second processing cost calculation means that are included in the processing cost estimation means may be realized as separate units. .

次に、本発明の概要を説明する。図１４は、本発明の概要を示すブロック図である。図１４に示すように、本発明の制御装置は、クエリセット受付手段５１と、処理コスト推定手段５２と、再分割配置判断手段５３とを備えている。 Next, the outline of the present invention will be described. FIG. 14 is a block diagram showing an outline of the present invention. As shown in FIG. 14, the control device of the present invention includes a query set receiving unit 51, a processing cost estimating unit 52, and a subdivision arrangement determining unit 53.

クエリセット受付手段５１（例えば、クエリセット受付手段１１）は、複数のデータ格納手段に分割されて格納されている所定のデータ群である対象データ群に対する、１つ以上のクエリを含むクエリセットを受け付ける。 The query set accepting unit 51 (for example, the query set accepting unit 11) selects a query set including one or more queries for a target data group that is a predetermined data group that is divided and stored in a plurality of data storage units. Accept.

処理コスト推定手段５２（例えば、処理コスト推定手段１２）は、対象データ群のメタデータに基づいて、対象データ群を再分割配置せずにクエリセットを実行した場合の処理コストである第１の処理コストと、対象データ群を所定の方法で再分割配置した後にクエリセットを実行した場合の処理コストである第２の処理コストとを推定する。 The processing cost estimation unit 52 (for example, the processing cost estimation unit 12) is a first processing cost that is a processing cost when a query set is executed without subdividing the target data group based on the metadata of the target data group. A processing cost and a second processing cost that is a processing cost when the query set is executed after the target data group is re-divided and arranged by a predetermined method are estimated.

再分割配置判断手段５３（例えば、再分割配置判断手段１３）は、処理コスト推定手段５２による推定結果に基づいて、クエリセットを実行する前に、対象データ群を再分割配置するか否か、および再分割配置する場合にはどのような方法で再分割配置するかを判断する。 Based on the estimation result by the processing cost estimation unit 52, the subdivision arrangement determination unit 53 (for example, the subdivision arrangement determination unit 13) determines whether to subdivide the target data group before executing the query set. In the case of rearrangement, it is determined how to perform the rearrangement.

このような構成を有することによって、クエリの発行を伴うワークロードの総処理時間を縮小できる。 By having such a configuration, it is possible to reduce the total processing time of a workload that involves issuing a query.

また、対象データ群は、１つ以上の列の各々に割り当てられる値を含む１つ以上の行データからなる表データであり、対象データ群は、表データにおける１つ以上の任意の列の集合である列集合をパーティションキーとして、各行データに含まれるパーティションキーの値に基づいて、１つ以上の行データを各々有する複数のデータブロックに分割されており、対象データ群のメタデータは、列の値に関する統計量を含み、処理コスト推定手段は、第２の処理コストとして、クエリセットに含まれるクエリの述語に出現する列集合をパーティションキーに用いて対象データ群を再分割配置したときのクエリセットの処理コストを、メタデータに基づいて推定し、再分割配置判断手段は、第２の処理コストのいずれかが第１の処理コストよりも小さい場合に、最も小さい第２の処理コストが得られた列集合をパーティションキーに用いて対象データ群を再分割配置すると判断してもよい。 The target data group is table data including one or more row data including values assigned to each of one or more columns, and the target data group is a set of one or more arbitrary columns in the table data. Is divided into a plurality of data blocks each having one or more row data based on the value of the partition key included in each row data, and the metadata of the target data group is a column The processing cost estimation means includes a column set appearing in the query predicate included in the query set as a second key, and the target data group is subdivided and arranged as a partition key. The processing cost of the query set is estimated based on the metadata, and the re-division arrangement determining unit determines whether any of the second processing costs is higher than the first processing cost. If again, it may be determined that subdivide arranged subject data group using a set of columns in which the smallest second processing cost obtained partitioning key.

また、受け付けたクエリセットに含まれているクエリの述語に所定の頻度以上で出現する列集合を抽出する列集合抽出手段を備え、処理コスト推定手段は、列集合抽出手段によって抽出された列集合の各々をパーティションキーの候補として、各候補について、クエリセットのうち当該候補に含まれる列を使用する述語を含むクエリの実行にかかる第１の処理コストおよび第２の処理コストを推定してもよい。 In addition, a column set extraction unit that extracts a column set that appears at a predetermined frequency or more in a query predicate included in the accepted query set is provided, and the processing cost estimation unit includes the column set extracted by the column set extraction unit. As the partition key candidates, the first processing cost and the second processing cost for executing a query including a predicate that uses a column included in the candidate of the query set for each candidate may be estimated. Good.

また、処理コスト推定手段は、パーティションキーの一候補に対して、複数の分割配置方法に対応した複数の第２の処理コストを推定し、再分割配置判断手段は、第２の処理コストのいずれかが第１の処理コストよりも小さい場合に、最も小さい第２の処理コストが得られたパーティションキーの候補と分割配置方法とを用いて対象データ群を再分割配置すると判断してもよい。 Further, the processing cost estimation means estimates a plurality of second processing costs corresponding to a plurality of division arrangement methods for one partition key candidate, and the subdivision arrangement determination means determines which of the second processing costs. If is smaller than the first processing cost, it may be determined that the target data group is subdivided and arranged using the partition key candidate and the division arrangement method that provide the smallest second processing cost.

また、複数の分割配置方法には、パーティションキーとされた列集合の値が同じである行データが同一データブロックに多く含まれるように対象データ群を分割する第１の分割方法と、全データブロックに渡りパーティションキーとされた列集合の値で行データがソートされるように対象データ群を分割する第２の分割方法とが含まれていてもよい。 The plurality of division arrangement methods include a first division method that divides the target data group so that many row data having the same column set values as partition keys are included in the same data block, and all data There may be included a second division method for dividing the target data group so that the row data is sorted by the value of the column set used as the partition key over the block.

処理コスト推定手段は、クエリの実行アルゴリズムとして所定の最適化アルゴリズムを適用して、第１の処理コストおよび第２の処理コストを推定してもよい。 The processing cost estimation means may estimate the first processing cost and the second processing cost by applying a predetermined optimization algorithm as a query execution algorithm.

また、処理コスト推定手段は、さらに、各データ格納手段を制御する計算機のディスクバンド幅または該計算機間を接続するネットワークのバンド幅に基づいて、第１の処理コストおよび第２の処理コストを推定してもよい。 The processing cost estimation means further estimates the first processing cost and the second processing cost based on the disk bandwidth of the computer that controls each data storage means or the bandwidth of the network that connects the computers. May be.

以上、実施形態及び実施例を参照して本願発明を説明したが、本願発明は上記実施形態および実施例に限定されるものではない。本発明の構成や詳細には、本願発明のスコープ内で当業者が理解し得る様々な変更をすることができる。 Although the present invention has been described with reference to the embodiments and examples, the present invention is not limited to the above embodiments and examples. Various changes that can be understood by those skilled in the art can be made to the configuration and details of the present invention within the scope of the present invention.

本発明は、分析用途に限らず、大量のデータを扱う業務システムの効率化や、複雑な計算を行う計算機システムの効率化といった用途にも好適に適用できる。 The present invention can be suitably applied not only to analysis applications, but also to applications such as improving the efficiency of business systems that handle large amounts of data and improving the efficiency of computer systems that perform complex calculations.

１１クエリセット受付手段
１２処理コスト推定手段
１２１列集合抽出手段
１２２第１の処理コスト算出手段
１２３分割配置方法格納手段
１２４第２の処理コスト算出手段
１３再分割配置判断手段
１４再分割配置実行手段
１５クエリ実行手段
２１データ格納手段
２２メタデータ格納手段
５１クエリセット受付手段
５２処理コスト推定手段
５３再分割配置判断手段 DESCRIPTION OF SYMBOLS 11 Query set reception means 12 Processing cost estimation means 121 Column set extraction means 122 First processing cost calculation means 123 Division arrangement method storage means 124 Second processing cost calculation means 13 Subdivision arrangement judgment means 14 Subdivision arrangement execution means 15 Query execution means 21 Data storage means 22 Metadata storage means 51 Query set acceptance means 52 Processing cost estimation means 53 Subdivision arrangement judgment means

Claims

Query set accepting means for accepting a query set including one or more queries for a target data group that is a predetermined data group divided and stored in a plurality of data storage means;
Based on the metadata of the target data group, a first processing cost that is a processing cost when the query set is executed without subdividing the target data group, and the target data group in a predetermined method Processing cost estimation means for estimating a second processing cost that is a processing cost when the query set is executed after the subdivision arrangement;
Based on the estimation result by the processing cost estimation means, whether or not to re-divide the target data group before executing the query set, and in what way to re-division arrangement And a subdivision arrangement determining means for determining whether or not to perform the control.

The target data group is table data composed of one or more row data including a value assigned to each of one or more columns.
The target data group includes a column set that is a set of one or more arbitrary columns in the table data as a partition key, and each set of one or more row data based on the value of the partition key included in each row data. Divided into a plurality of data blocks,
The metadata of the target data group includes a statistic regarding the value of the column,
The processing cost estimation means uses, as a second processing cost, a processing cost of the query set when the target data group is subdivided and arranged using a column set appearing in a query predicate included in the query set as a partition key. , Based on the metadata,
The subdivision arrangement determining means uses the column set from which the smallest second processing cost is obtained as a partition key when any of the second processing costs is smaller than the first processing cost. The control device according to claim 1, wherein the control unit determines that the group is to be rearranged.

Column set extraction means for extracting a column set that appears at a predetermined frequency or more in a query predicate included in the accepted query set,
The processing cost estimation means executes each of the column sets extracted by the column set extraction means as a partition key candidate, and executes a query including a predicate that uses a column included in the candidate in the query set for each candidate. The control device according to claim 2, wherein the first processing cost and the second processing cost are estimated.

The processing cost estimation means estimates a plurality of second processing costs corresponding to a plurality of divided arrangement methods for a partition key candidate,
The re-division arrangement determining unit uses the partition key candidate and the division arrangement method that provide the smallest second processing cost when any of the second processing costs is smaller than the first processing cost. The control device according to claim 2, wherein the target data group is determined to be subdivided.

The plurality of division arrangement methods include a first division method that divides a target data group so that many row data having the same column set value as a partition key are included in the same data block, and all data blocks. 5. The control device according to claim 4, further comprising: a second division method that divides the target data group so that the row data is sorted by the value of the column set that is set as the transition partition key.

The processing cost estimation means estimates a first processing cost and a second processing cost by applying a predetermined optimization algorithm as an execution algorithm of a query. The control device described.

The processing cost estimation means further estimates the first processing cost and the second processing cost based on the disk bandwidth of each computer that controls each data storage means or the bandwidth of the network that connects the computers. The control device according to any one of claims 1 to 6.

A plurality of data storage means for dividing and storing a predetermined data group into a plurality of data blocks;
Query set accepting means for accepting a query set including one or more queries for the target data group which is the predetermined data group;
Metadata storage means for storing metadata of the target data group;
Based on the metadata, a first processing cost that is a processing cost when the query set is executed without subdividing the target data group, and the query after subdividing the target data group A processing cost estimation means for estimating a second processing cost that is a processing cost when the set is executed;
Based on the estimation result by the processing cost estimation means, whether or not to re-divide the target data group before executing the query set, and in what way to re-division arrangement Subdivision arrangement judging means for judging whether to do,
Re-division arrangement execution means for executing re-division arrangement of the target data group before executing the query set when it is determined by the sub-division arrangement determination means that the target data group is re-division arrangement. A distributed database system characterized by this.

The control device accepts a query set including one or more queries for a target data group that is a predetermined data group that is divided and stored in a plurality of data storage means,
Before the execution of the query set, the control device executes the query set without subdividing the target data group based on the metadata of the target data group, and the target Estimating the processing cost when the query set is executed after the data group is re-partitioned by a predetermined method,
Whether the target data group is to be subdivided and arranged before executing the query set based on the estimation result before executing the query set, and Is a divisional placement control method characterized by determining how to re-partition placement.

On the computer,
A process of receiving a query set including one or more queries for a target data group that is a predetermined data group that is divided and stored in a plurality of data storage means;
Based on the metadata of the target data group, the processing cost when the query set is executed without subdividing the target data group, and the query after the target data group is subdivided by a predetermined method A process for estimating the processing cost when the set is executed, and whether or not the target data group is subdivided and arranged based on the estimation result before the query set is executed. Is a split placement control program for executing the process of determining how to re-partition placement.