JP5112137B2

JP5112137B2 - Distributed index join method and system

Info

Publication number: JP5112137B2
Application number: JP2008084576A
Authority: JP
Inventors: 純司寺本; 啓敏須賀; 東潮日高
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2008-03-27
Filing date: 2008-03-27
Publication date: 2013-01-09
Anticipated expiration: 2028-03-27
Also published as: JP2009238008A

Description

本発明は、分散インデックス結合方法及びシステムに係り、特に、ネットワーク上の分散システムにおけるリレーショナル・データベースのインデックス同士を結合するための分散インデックス結合システム及び方法及びプログラムに関する。詳しくは、リレーショナル・データベースのインデックス同士がコンピュータ内部のバス転送速度と比べて遅いネットワークで接続されている分散システムにおける分散インデックス結合方法及びシステムに関する。 The present invention relates to a distributed index joining method and system, and more particularly to a distributed index joining system, method and program for joining indexes of relational databases in a distributed system on a network. More specifically, the present invention relates to a distributed index combination method and system in a distributed system in which indexes of a relational database are connected to each other via a network that is slower than a bus transfer speed inside a computer.

従来、リレーショナル・データベース同士の結合操作を実行するためのアルゴリズムとして、順次２つのデータベースのレコードの突合せを行い、結合条件を満たすものに対して結果のレコードを生成する入れ子ループ結合や、２つのデータベースが結合条件でソートされている場合には、先頭から順次突き合わせてマージするマージ結合や、一方のデータベースのレコードをハッシュ表に展開することにより結合条件を満たすレコードを求めるハッシュやインデックスを用いたハッシュ結合等が知られている。この例を図９に示す。 Conventionally, as an algorithm for performing a join operation between relational databases, a nested loop join in which records of two databases are sequentially matched and a result record is generated for those satisfying the join condition or two databases are used. Are sorted by join conditions, merge join that sequentially matches from the beginning and merge, or hash that uses a hash or index to find records that satisfy the join condition by expanding records from one database into a hash table Bonding and the like are known. An example of this is shown in FIG.

ステップ１０）インデックスＡ，Ｂのレコードを指すポインタを先頭にするように初期化する。 Step 10) Initialization is performed so that the pointers pointing to the records of indexes A and B start.

ステップ１１）インデックスＡのポインタが指すレコードのキー値をマシンＢに送信する。 Step 11) The key value of the record pointed to by the pointer of index A is transmitted to machine B.

ステップ１２）インデックスＡのポインタが指すレコードのキー値でインデックスＢを検索する。 Step 12) The index B is searched with the key value of the record pointed to by the pointer of the index A.

ステップ１３）検索条件を満たすレコードがインデックスＢにあるかを判定し、ある場合には、ステップ１４に移行し、ない場合は、ステップ１５に移行する。 Step 13) It is determined whether or not a record satisfying the search condition exists in the index B. If there is a record, the process proceeds to Step 14, and if not, the process proceeds to Step 15.

ステップ１４）インデックスＡ、ＢのレコードをマシンＪに送り、当該マシンＪで両レコードを結合し、出力する。 Step 14) The records of indexes A and B are sent to the machine J, and both records are combined and output by the machine J.

ステップ１５）インデックスＡのポインタが指すレコードのキー値を超える最小の値をインデックスＢから検索する。 Step 15) The index B is searched for the minimum value that exceeds the key value of the record pointed to by the pointer of the index A.

ステップ１６）検索条件を満たすレコードがインデックスＢにあるかを判定し、ある場合には、ステップ１７に移行し、ない場合には処理を終了する。 Step 16) It is determined whether or not a record satisfying the search condition exists in the index B. If there is a record, the process proceeds to Step 17, and if not, the process ends.

ステップ１７）インデックスＢのポインタを該当するレコードのところに進める。 Step 17) The index B pointer is advanced to the corresponding record.

ステップ１８）インデックスＢのポインタが指すレコードのキー値をマシンＡに送る。 Step 18) The key value of the record pointed to by the pointer of index B is sent to machine A.

ステップ１９）インデックスＢのポインタが指すレコードのキー値でインデックスＡを検索する。 Step 19) The index A is searched with the key value of the record pointed to by the pointer of the index B.

ステップ２０）検索条件を満たすレコードがインデックスＡにあるかを判定し、ある場合は、ステップ２１に移行し、ない場合はステップ２２に移行する。 Step 20) It is determined whether or not a record satisfying the search condition exists in the index A. If there is a record, the process proceeds to Step 21, and if not, the process proceeds to Step 22.

ステップ２１）インデックスＡ，ＢのレコードをマシンＪに送りマシンＪで両レコードを結合して出力する。 Step 21) The records of indexes A and B are sent to machine J, and both records are combined and output by machine J.

ステップ２２）インデックスＢのポインタが指すレコードのキー値を超える最小の値をインデックスＡから検索する。 Step 22) The index A is searched for the minimum value that exceeds the key value of the record pointed to by the pointer of the index B.

ステップ２３）検索条件を満たすレコードがインデックスＡにあるかを判定し、ある場合はステップ２４に移行し、ない場合は、処理を終了する。 Step 23) It is determined whether or not there is a record satisfying the search condition in the index A. If there is a record, the process proceeds to Step 24, and if not, the process ends.

ステップ２４）インデックスＡのポインタを該当するレコードのところに進め、ステップ１１の処理に移行する。 Step 24) The pointer of index A is advanced to the corresponding record, and the process proceeds to step 11.

また、結合操作を別々のジョブに区分し、それらのジョブを複数のプロセッサ間で最適にスケジューリングすることにより、データ・スキューの存在下で並列リレーショナル・データベース環境にある共通フィールド上の２つのデータベース・リレーションをマージ結合する技術がある。この技術は、最小メークスパン最適化技法を使用して、プロセッサ間でジョブの現集合をスケジューリングすることによって、２つのリレーションの自然結合のための全実行時間を最小することができる（例えば、特許文献１参照）。
特開平３−１２６１６９号公報 It also separates join operations into separate jobs and optimally schedules them across multiple processors, thereby allowing two database databases on a common field in a parallel relational database environment in the presence of data skew. There is a technology that merges relations. This technique can minimize the total execution time for the natural combination of two relations by scheduling the current set of jobs between processors using minimum make-span optimization techniques (eg, patents). Reference 1).
JP-A-3-126169

しかしながら、上記の図９に示すアルゴリズムをネットワークにより分散したシステムにそのまま適用すると、通信回数が増大し、性能が現れないという問題がある。 However, if the algorithm shown in FIG. 9 is applied to a system distributed by a network as it is, there is a problem that the number of communications increases and performance does not appear.

また、最小メークスパン最適化技法を使用して、マージ結合する技術は、マルチプロセッサシステムを想定したもので、分散システム間の通信はネットワークよりも高速なインターコネクトを前提としており、インデックス同士がコンピュータ内部のバス転送速度と比べて遅いネットワークで接続された分散システムについては考慮されていない。 In addition, the merge-joining technology using the minimum make-span optimization technique assumes a multiprocessor system, and communication between distributed systems is premised on interconnects that are faster than the network. No consideration is given to a distributed system connected by a network that is slower than the bus transfer speed of the network.

本発明は、上記の点に鑑みなされたもので、インデックス同士がコンピュータ内部のバス転送速度と比べて遅いネットワークで接続された分散システムにおいて、インデックス同士のマージ結合を高速化させ、処理中の通信回数を削減することが可能な分散インデックス結合システム及び方法及びプログラムを提供することを目的とする。 The present invention has been made in view of the above points. In a distributed system in which indexes are connected to each other via a network that is slower than the bus transfer speed inside the computer, the merge connection between the indexes is accelerated, and communication during processing is performed. It is an object of the present invention to provide a distributed index combining system, method and program capable of reducing the number of times.

図１は、本発明の原理を説明するための図である。 FIG. 1 is a diagram for explaining the principle of the present invention.

本発明（請求項１）は、ネットワークに複数のプロセッサが接続されている分散ネットワークシステムにおいて、該ネットワークを経由してインデックス同士をマージする分散インデックスの結合方法であって、
複数のレコードのインデックスからなるインデックスＡ、該インデックスＡを走査するためのＮ個（Ｎ≧２）のポインタからなるポインタリスト及び該ポインタに対応するソートされたレコードのキー値からなるレコードリストを格納する記憶手段Ａを有するプロセッサＡと、
複数のレコードのインデックスからなるインデックスＢ、該インデックスＢを走査するためのＮ個（Ｎ≧２）のポインタからなるポインタリスト及び該ポインタに対応するソートされたレコードのキー値からなるレコードリストを格納する記憶手段Ｂを有するプロセッサＢと、
プロセッサＡのインデックスＡのレコードとプロセッサＢのインデックスＢのレコードを結合するプロセッサＪと、がネットワークを介して接続されているシステムにおいて、
プロセッサＡにおいて、
全てのインデックスＡの先頭から順にＮ個分のレコードリストＡを生成してプロセッサＢに送信する初期レコード送信ステップを行い（ステップ１）、
プロセッサＢにおいて、
プロセッサＡから送信されたレコードリストＡに基づいて、記憶手段ＢからインデックスＢを検索するインデックス検索ステップＢと（ステップ２）、
インデックス検索ステップＢにより検索条件を満たしたインデックスＢのレコードとインデックスＡのレコードとをプロセッサＪに送信する結合レコード送信ステップＢと（ステップ３）、
インデックスＢにおいて、レコードリストＡのＮ番目の値（リスト中の最大値）を超える最小の値から順にＮ個分のレコードリストＢを生成して（ステップ５）プロセッサＡに送信する（ステップ６）レコードリスト送信ステップＢと、を行い、
プロセッサＡにおいて、
プロセッサＢから送信されたレコードリストＢに基づいて、記憶手段ＡからインデックスＡを検索するインデックス検索ステップＡ（ステップ７）と、
インデックス検索ステップＡにより検索条件を満たしたインデックスＡのレコードとインデックスＢのレコードとをプロセッサＪに送信する結合レコード送信ステップＡと（ステップ８）、
インデックスＡにおいて、レコードリストＢのＮ番目の値（リスト中の最大値）を超える最小の値から順にＮ個分のレコードリストＡを生成して（ステップ１０）プロセッサＢに送信する（ステップ１１）レコードリスト送信ステップＡと、を行う。 The present invention (claim 1), in the distributed network system in which a plurality of processors in the network are connected to a coupling method of distributed index merging index each other via the network,
Stores an index A composed of indexes of a plurality of records, a pointer list composed of N (N ≧ 2) pointers for scanning the index A, and a record list composed of key values of sorted records corresponding to the pointers Processor A having storage means A for
An index B composed of indexes of a plurality of records, a pointer list composed of N (N ≧ 2) pointers for scanning the index B, and a record list composed of key values of sorted records corresponding to the pointers are stored. Processor B having storage means B for
In a system in which the processor A that combines the record of the index A of the processor A and the record of the index B of the processor B is connected via a network,
In processor A,
An initial record transmission step of generating N record lists A in order from the top of all indexes A and transmitting them to the processor B is performed (step 1).
In processor B,
Index search step B for searching index B from storage means B based on record list A transmitted from processor A (step 2),
A combined record transmission step B for transmitting the record of the index B and the record of the index A that satisfy the search condition by the index search step B to the processor J (step 3);
In the index B, N record lists B are generated in order from the smallest value exceeding the Nth value (maximum value in the list) of the record list A (step 5) and transmitted to the processor A (step 6). Perform record list transmission step B ,
In processor A,
Index search step A (step 7) for searching index A from storage means A based on record list B transmitted from processor B ;
A combined record transmission step A for transmitting the record of the index A and the record of the index B that satisfy the search condition in the index retrieval step A to the processor J (step 8);
In the index A, N record lists A are generated in order from the smallest value exceeding the Nth value (maximum value in the list) of the record list B (step 10) and transmitted to the processor B (step 11). Record list transmission step A is performed.

図２は、本発明の原理構成図である。 FIG. 2 is a principle configuration diagram of the present invention.

本発明（請求項２）は、ネットワークに複数のプロセッサが接続されている分散ネットワークシステム上で、該ネットワークを経由してインデックス同士をマージする分散インデックスの結合システムであって、
システムは、
複数のレコードのインデックスからなるインデックスＡ、該インデックスＡを走査するためのＮ個（Ｎ≧２）のポインタからなるポインタリスト及び該ポインタに対応するソートされたレコードのキー値からなるレコードリストを格納する記憶手段１３０Ａを有するプロセッサＡと、
複数のレコードのインデックスからなるインデックスＢ、該インデックスＢを走査するためのＮ個（Ｎ≧２）のポインタからなるポインタリスト及び該ポインタに対応するソートされたレコードのキー値からなるレコードリストを格納する記憶手段１３０Ｂを有するプロセッサＢと、
プロセッサＡのインデックスＡのレコードとプロセッサＢのインデックスＢのレコードを結合するプロセッサＪと、有し、
プロセッサＡは、
全てのインデックスＡの先頭から順にＮ個分のレコードリストＡを生成してプロセッサＢに送信する初期レコード送信手段Ａと、
プロセッサＢから送信されたレコードリストＢに基づいて、記憶手段ＡからインデックスＡを検索するインデックス検索手段１１０Ａと、
インデックス検索手段１１０Ａにより検索条件を満たしたインデックスＡのレコードとインデックスＢのレコードとをプロセッサＪに送信する結合レコード送信手段１４１Ａと、
インデックスＡにおいて、レコードリストＢのＮ番目の値（リスト中の最大値）を超える最小の値から順にＮ個分のレコードリストＡを生成してプロセッサＢに送信するレコードリスト送信手段１４０Ａと、を有し、
プロセッサＢは、
プロセッサＡから送信されたレコードリストＡに基づいて、記憶手段１３０ＢからインデックスＢを検索するインデックス検索手段１１０Ｂと、
インデックス検索手段Ｂにより検索条件を満たしたインデックスＢのレコードとインデックスＡのレコードとをプロセッサＪに送信する結合レコード送信手段１４１Ｂと、
インデックスＢにおいて、レコードリストＡのＮ番目の値（リスト中の最大値）を超える最小の値から順にＮ個分のレコードリストＢを生成してプロセッサＡに送信するレコードリスト送信手段１４０Ｂと、を有する。 The present invention (claim 2), on a distributed network system in which a plurality of processors in the network are connected to a coupling system distributed index merging index each other via the network,
the system,
Stores an index A composed of indexes of a plurality of records, a pointer list composed of N (N ≧ 2) pointers for scanning the index A, and a record list composed of key values of sorted records corresponding to the pointers Processor A having storage means 130A for
An index B composed of indexes of a plurality of records, a pointer list composed of N (N ≧ 2) pointers for scanning the index B, and a record list composed of key values of sorted records corresponding to the pointers are stored. Processor B having storage means 130B for
A processor J coupling a record of the index B of the record and processor B Index A processor A, a,
Processor A
Initial record transmitting means A for generating N record lists A in order from the top of all indexes A and transmitting them to the processor B ;
Based on the record list B transmitted from the processor B, the index search means 110A for searching the index A from the storage means A ;
A combined record transmission unit 141A that transmits the record of index A and the record of index B that satisfy the search condition by the index search unit 110A to the processor J;
In the index A, record list transmission means 140A for generating N record lists A in order from the smallest value exceeding the Nth value (maximum value in the list) of the record list B and transmitting it to the processor B. Have
Processor B
Based on the record list A transmitted from the processor A, the index search means 110B for searching the index B from the storage means 130B;
Combined record transmission means 141B for transmitting the record of index B and the record of index A that satisfy the search condition by the index search means B to the processor J;
In the index B, record list transmission means 140B for generating N record lists B in order from the smallest value exceeding the Nth value (maximum value in the list) of the record list A and transmitting it to the processor A. Have.

上記のように本発明によれば、各プロセッサにおいて、インデックスを走査するポインタをＮ個有し、プロセッサ間においてＮ個のポインタからなるポインタリスト単位で、Ｎ個分のポインタに対応するレコード（レコードリスト）を送信することにより、処理負荷が低減される。これは、ネットワーク経由の場合、データ送受信の起動・停止に関わる処理のオーバーヘッドが大きいため、送受信するデータの総量は同じであっても、そのやり取りに要する回数を減らすことにより、処理時間が短縮される。 As described above, according to the present invention, each processor has N pointers for scanning an index, and records (records) corresponding to N pointers in a pointer list unit including N pointers between processors. By transmitting (list), the processing load is reduced. This is because the processing overhead associated with starting / stopping data transmission / reception is large over a network, so even if the total amount of data to be transmitted / received is the same, the processing time can be shortened by reducing the number of times required for the exchange. The

以下、図面と共に本発明の実施の形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

図３は、本発明の一実施の形態における分散インデックス結合システムの構成を示す。 FIG. 3 shows the configuration of a distributed index combining system according to an embodiment of the present invention.

同図に示す分散インデックス結合システムは、インデックスＡを有するプロセッサＡと、インデックスＢを有するプロセッサＢと、プロセッサＡとプロセッサＢのレコードを結合する、プロセッサＡ、Ｂとは別のプロセッサＪから構成される。 The distributed index combination system shown in FIG. 1 includes a processor A having an index A, a processor B having an index B, and a processor J different from the processors A and B that combine the records of the processors A and B. The

プロセッサＡとプロセッサＢは、同様の構成であり、それぞれ、入出力部１００、インデックス検索部１１０、レコードリスト処理部１２０、レコードリスト記憶部１３０、レコードリスト通信部１４０から構成される。 The processor A and the processor B have the same configuration, and include an input / output unit 100, an index search unit 110, a record list processing unit 120, a record list storage unit 130, and a record list communication unit 140, respectively.

レコードリスト記憶部１３０は、図４に示すように、ポインタリストとレコードリストを格納する。以下の説明において、「ポインタ」とは、インデックスのどこを検索しているかを指すポインタであり、複数のポインタをリスト化したものを「ポインタリスト」と呼ぶ。また、ポインタリストで指し示しているインデックスのレコードをレコードリストである。図４の例では、ポインタリストの先頭は、インデックスの先頭のレコードを指しており、当該レコードのキー値「００３４」がレコードリストに設定されている。ポインタリストの２番目はインデックスのｎ番目のレコードを指しており、当該レコードのキー値「０３０５」がレコードリストに設定されている。 As shown in FIG. 4, the record list storage unit 130 stores a pointer list and a record list. In the following description, the “pointer” is a pointer indicating where the index is searched, and a list of a plurality of pointers is referred to as a “pointer list”. The record of the index pointed to by the pointer list is a record list. In the example of FIG. 4, the top of the pointer list points to the top record of the index, and the key value “0034” of the record is set in the record list. The second pointer list points to the nth record in the index, and the key value “0305” of the record is set in the record list.

プロセッサＪは、インデックスＡを持つプロセッサＡ，インデックスＢを持つプロセッサＢとは別の、インデックス同士のマージ結合を統括するプロセッサで、以下のフローチャートで表される処理全体を実行するトリガとなるプロセッサである。プロセッサＪは、マージ処理部２１０とレコード結合部２２０を有し、マージ処理部２１０は、検索条件入力装置３１０と接続され、レコード結合部２２０は、結合結果出力装置３２０と接続されている。 The processor J is a processor that supervises merge join between indexes, which is different from the processor A having the index A and the processor B having the index B, and is a processor that triggers the entire processing represented by the following flowchart. is there. The processor J includes a merge processing unit 210 and a record combining unit 220. The merge processing unit 210 is connected to the search condition input device 310, and the record combining unit 220 is connected to the combining result output device 320.

マージ処理部２１０は、プロセッサＡ，Ｂに対して、検索条件入力装置３１０から入力された検索条件をプロセッサＡ，Ｂに出力する。 The merge processing unit 210 outputs the search condition input from the search condition input device 310 to the processors A and B to the processors A and B.

レコード結合部２２０は、プロセッサＡ及び、プロセッサＢのレコードリスト通信部１４０から送信されたインデックスの検索結果の該当レコードを結合し、結合結果出力装置３２０に出力する。この処理が実行されるということは、両インデックスに共通するキー値を持つレコードがあったということを示している。 The record combination unit 220 combines the corresponding records of the index search results transmitted from the record list communication unit 140 of the processor A and the processor B, and outputs the combined records to the combination result output device 320. Execution of this processing indicates that there is a record having a key value common to both indexes.

以下に、上記の構成における動作を説明する。 The operation in the above configuration will be described below.

図５は、本発明の一実施の形態におけるマージ結合のフローチャートである。 FIG. 5 is a flowchart of merge join in one embodiment of the present invention.

ステップ１１０）プロセッサＢにおいて、入出力部１００Ｂを介してレコードを指すＮ個のポインタリストが入力されると、レコードリスト処理部１２０Ｂは、ポインタがインデックスＢの先頭レコードを指すように、レコードリスト記憶部１３０ＢのＮ個のインデックスＢのポインタリストを初期化する。 Step 110) When N pointer lists indicating records are input via the input / output unit 100B in the processor B, the record list processing unit 120B stores the record list so that the pointer points to the first record of the index B. The pointer list of N indexes B of the unit 130B is initialized.

ステップ１１６）プロセッサＡにおいて、インデックスＡのレコードを指すＮ個のポインタが入出力部１００Ａを介して入力されると、レコードリスト処理部１２０Ａにおいて、インデックスＡの先頭レコードから順にＮ個分のレコードを指すように、Ｎ個のポインタを初期化し、初期化済みのインデックスＡのレコードを指すＮ個のポインタリストを出力する。 Step 116) When N pointers pointing to the records of the index A are input via the input / output unit 100A in the processor A, the record list processing unit 120A selects N records in order from the first record of the index A. N pointers are initialized so as to point, and a list of N pointers indicating the record of the index A that has been initialized is output.

ステップ１２０）レコードリスト処理部１２０Ａにおいて、レコードリスト記憶部１３０ＡからインデックスＡのレコードを指すＮ個のポインタリストを読み込み、Ｎ個のポインタリストが示すインデックスＡの各レコードのキー値を取得してリスト化し、レコードリストとしてレコードリスト記憶部１３０Ａに格納すると共に、レコードリスト通信部１４０Ａを介してプロセッサＢに送信する。 Step 120) In the record list processing unit 120A, the N pointer lists indicating the records of the index A are read from the record list storage unit 130A, and the key value of each record of the index A indicated by the N pointer lists is acquired and the list is acquired. The record list is stored in the record list storage unit 130A as a record list and transmitted to the processor B via the record list communication unit 140A.

ステップ１２５）プロセッサＢのレコードリスト通信部１４０Ｂでレコードリストを受信すると、レコードリスト処理部１２０Ｂは、そのレコードリストをメモリ（図示せず）のプロセッサＡのレコードリスト領域に格納する。そして、インデックス検索部１１０Ｂは、プロセッサＡから取得したインデックスＡのＮ個のレコードリストのレコードのキー値を検索キーとして、レコードリスト記憶部１３０ＢからインデックスＢを検索し、インデックスＢのレコードを指すＮ個のポインタリストを取得する。当該ステップの詳細については後述する。 Step 125) When the record list is received by the record list communication unit 140B of the processor B, the record list processing unit 120B stores the record list in the record list area of the processor A in a memory (not shown). Then, the index search unit 110B searches the index B from the record list storage unit 130B using the key values of the records in the N record lists of the index A acquired from the processor A as the search key, and points N to the record of the index B Get the pointer list. Details of this step will be described later.

ステップ１３０）レコードリスト処理部１２０Ｂは、取得したインデックスＢのレコードを指すＮ個のポインタリストが示すインデックスＢの各レコードのキー値を取得してレコードリストとしてレコードリスト記憶部１３０Ｂに格納すると共に、レコードリスト通信部１４０Ｂを介してプロセッサＡに送信する。 Step 130) The record list processing unit 120B acquires the key value of each record of the index B indicated by the N pointer lists indicating the acquired records of the index B, stores the key value in the record list storage unit 130B as a record list, and The data is transmitted to the processor A via the record list communication unit 140B.

ステップ１３５）プロセッサＡのレコードリスト通信部１４０を介してプロセッサＢからレコードリストを取得すると、レコードリスト処理部１２０において、そのレコードリストをメモリ（図示せず）に格納する。インデックス検索部１１０Ａは、取得したインデックスＢのＮ個のレコードリストのレコードのキー値を検索キーとして、レコードリスト記憶部記憶部１３０ＡからインデックスＡを検索し、インデックスＢのレコードを指すＮ個のポインタリストを取得する。詳細については後述する。 Step 135) When the record list is acquired from the processor B via the record list communication unit 140 of the processor A, the record list processing unit 120 stores the record list in a memory (not shown). The index search unit 110A searches the index A from the record list storage unit 130A using the key values of the records of the N record lists of the index B acquired as search keys, and N pointers pointing to the records of the index B Get a list. Details will be described later.

次に、上記のステップ１２５のインデックスＢの検索処理について詳細に説明する。 Next, the index B search process in step 125 will be described in detail.

図６は、本発明の一実施の形態におけるインデックス検索処理のフローチャートである。 FIG. 6 is a flowchart of the index search process according to the embodiment of the present invention.

ステップ２１０）プロセッサＢのレコードリスト処理部１２０Ｂにおいて、メモリ（図示せず）に、Ｎ個分まとめて処理するためのループカウンタｊを設け、ｊ＝１と初期化する。 Step 210) In the record list processing unit 120B of the processor B, a loop counter j is provided in the memory (not shown) for processing all N pieces and is initialized as j = 1.

ステップ２１５）レコードリスト処理部１２０Ｂは、プロセッサＡから送信されたインデックスＡのＮ個のレコードリストからカウンタｊに相当するレコード（インデックスＡのレコードリスト中ｊ番目のレコード）を取り出す。 Step 215) The record list processing unit 120B extracts a record corresponding to the counter j (jth record in the record list of the index A) from the N record lists of the index A transmitted from the processor A.

ステップ２２５）インデックス検索部１１０Ｂは、ステップ２１５で取り出されたインデックスＡのレコードリスト中ｊ番目のレコードのキー値を用いて、レコードリスト記憶部１３０ＢからインデックスＢを検索する。検索結果が「真」の場合はインデックスＢの検索結果該当レコードを指すポインタが出力される。 Step 225) The index search unit 110B searches the record list storage unit 130B for the index B using the key value of the jth record in the record list of the index A extracted in step 215. When the search result is “true”, a pointer indicating the record corresponding to the search result of index B is output.

ステップ２３０）検索条件を満たすレコードがインデックスＢに存在するか否かを判定する。ステップ２２５における検索結果が「真」の場合はステップ２３５に移行し、「偽」の場合はステップ２６５に移行する。 Step 230) It is determined whether or not a record satisfying the search condition exists in the index B. If the search result in step 225 is “true”, the process proceeds to step 235, and if “false”, the process proceeds to step 265.

ステップ２３５）レコードリスト処理部１２０Ｂは、ステップ２２５で得られたポインタが指すインデックスＢの検索結果該当レコードをレコードリスト記憶部１３０Ｂから取得し、インデックスＡのレコードリスト中ｊ番目のレコードと共に、レコードリスト通信部１４０Ｂを介してプロセッサＪに送信する。 Step 235) The record list processing unit 120B obtains the record corresponding to the search result of the index B indicated by the pointer obtained in Step 225 from the record list storage unit 130B, and records the record list together with the jth record in the record list of the index A. The data is transmitted to the processor J via the communication unit 140B.

プロセッサＪのレコード結合部２２０は、プロセッサＢから送信されたインデックスＢの検索結果該当レコードと、インデックスＡのレコードリスト中ｊ番目のレコードを結合して結合結果出力装置３２０に出力する。 The record combining unit 220 of the processor J combines the record corresponding to the search result of the index B transmitted from the processor B and the jth record in the record list of the index A and outputs the combined result to the combined result output device 320.

ステップ２６５）ループ用のカウンタｊがＮを超えていないかどうか確認する。Ｎを超えていない、すなわち、Ｎ回のループが終了していなければ、ステップ２７０に移行し、Ｎを超えている、すなわち、Ｎ回のループが終了していればステップ２４０に移行する。 Step 265) It is confirmed whether the counter j for the loop does not exceed N. If N is not exceeded, that is, if N loops are not completed, the process proceeds to step 270. If N is exceeded, that is, N loops are completed, the process proceeds to step 240.

ステップ２７０）カウンタｊを１インクリメントし、ステップ２１５に移行する。 Step 270) The counter j is incremented by 1, and the process proceeds to Step 215.

ステップ２４０）インデックス検索部１１０Ｂは、ステップ２６５の条件分岐により、ステップ２１５にて取り出されたインデックスＡのレコードリスト中のＮ番目のレコードのキー値を用いて、レコードリスト記憶部１３０ＢのインデックスＢから、「キー値を超える最小の値」を検索する。検索結果が「真」の場合は、インデックスＢの検索結果該当レコードを指すポインタを出力する。 Step 240) The index search unit 110B uses the key value of the Nth record in the record list of the index A extracted in step 215 by the conditional branch of step 265, and starts from the index B of the record list storage unit 130B. , “Minimum value exceeding key value” is searched. If the search result is “true”, a pointer indicating the record corresponding to the search result of index B is output.

ステップ２４５）ステップ２４０において、プロセッサＪから与えられた検索条件を満たすレコードがインデックスＢに存在するかどうか確認し、存在すれば、ステップ２５１に移行し、存在しなければ処理を終了する。 Step 245) In Step 240, it is confirmed whether or not a record satisfying the search condition given by the processor J exists in the index B. If it exists, the process proceeds to Step 251. If not, the process ends.

ステップ２５１）レコードリスト処理部１２０Ｂは、インデックスＢの検索結果該当レコードを指すポインタリストとインデックスＢのレコードを指すＮ個のポインタリストへ記録する。 Step 251) The record list processing unit 120B records the index B search result in the pointer list indicating the corresponding record and the N pointer list indicating the record in the index B.

図７は、本発明の一実施の形態におけるインデックスＡ検索処理のフローチャートである。 FIG. 7 is a flowchart of the index A search process according to the embodiment of the present invention.

ステップ３１０）プロセッサＡのレコードリスト処理部１２０Ａにおいて、メモリ（図示せず）にＮ個分纏めて処理するためのループカウンタｉを設け、ｉ＝１とする。 Step 310) In the record list processing unit 120A of the processor A, a loop counter i is provided in a memory (not shown) for processing N pieces collectively, and i = 1 is set.

ステップ３１５）レコードリスト処理部１２０Ａは、プロセッサＢから送信されたＮ個のレコードリストから、カウンタｉに相当するレコード（インデックスＢのレコードリスト中ｉ番目のレコード）を取り出す。 Step 315) The record list processing unit 120A extracts a record corresponding to the counter i (i-th record in the record list of the index B) from the N record lists transmitted from the processor B.

ステップ３２６）インデックス検索部１１０Ａは、インデックスＢのレコードリスト中ｉ番目のレコードが入力されると、レコードリスト記憶部１３０Ａから該ｉ番目のレコードのキー値を用いてインデックスＡを検索する。検索結果が「真」の場合はインデックスＡの検索結果の該当レコードを指すポインタを出力する。 Step 326) When the i-th record in the record list of index B is input, the index search unit 110A searches the index A using the key value of the i-th record from the record list storage unit 130A. If the search result is “true”, a pointer indicating the corresponding record in the search result of index A is output.

ステップ３３１）ステップ３２６で出力されたレコードのポインタについて、検索条件を満たすレコードがインデックスＡに存在するかどうかを確認し、存在していればステップ３３５に移行し、存在していなければステップ３６５に移行する。 Step 331) With respect to the pointer of the record output in Step 326, it is confirmed whether or not a record satisfying the search condition exists in the index A. If it exists, the process proceeds to Step 335. If not, the process proceeds to Step 365. Transition.

ステップ３３５）レコードリスト処理部１２０Ａは、レコードリスト記憶部１３０から、ステップ３２５で得られたポインタが指すインデックスＡの検索結果該当レコードを取得し、インデックスＢのレコードリスト中ｉ番目のレコードと共に、プロセッサＪに送信する。ここで、プロセッサＪは、インデックスＡを持つプロセッサＡ、インデックスＢを持つプロセッサＢとは別の、インデックス同士のマージ結合を統括するプロセッサであり、本フローチャートで表される処理全体を実行するトリガとなるプロセッサである。 Step 335) The record list processing unit 120A obtains the record corresponding to the search result of the index A pointed to by the pointer obtained in Step 325 from the record list storage unit 130, and the processor together with the i-th record in the record list of the index B. Send to J. Here, the processor J is a processor that supervises the merge join between indexes, which is different from the processor A having the index A and the processor B having the index B, and a trigger for executing the entire processing represented by this flowchart. Is a processor.

プロセッサＪのレコード結合部２２０は、プロセッサＡから送信されたインデックスＡとインデックスＢのレコードを結合する。この処理が行われるということは、両インデックスに共通するキー値を持つレコードがあったということである。プロセッサＪのレコード結合部２２０は、インデックスＡのｉ番目の分割範囲検索結果該当レコード及びインデックスＢのレコード中ｉ番目のレコードを結果出力装置３２０に出力する。 The record combining unit 220 of the processor J combines the index A and index B records transmitted from the processor A. This processing means that there is a record having a key value common to both indexes. The record combining unit 220 of the processor J outputs the i-th divided range search result corresponding record of the index A and the i-th record among the records of the index B to the result output device 320.

ステップ３６５）ループ用カウンタｉがＮを超えていないかどうか確認し、Ｎを超えていない、すなわち、Ｎ回のループが終了していなければステップ３７０に移行し、Ｎを超えている、すなわち、Ｎ回のループが終了していれば、ステップ３４１に移行する。 Step 365) Check if the loop counter i does not exceed N. If N is not exceeded, that is, if N loops are not completed, the process proceeds to Step 370, where N is exceeded. If N loops have been completed, the process proceeds to step 341.

ステップ３７０）カウンタｉを１インクリメントし、ステップ３１５に移行する。 Step 370) The counter i is incremented by 1, and the process proceeds to Step 315.

ステップ３４１）ステップ３６５の条件分岐により、ステップ３１５にて取得済みのインデックスＢのレコード中Ｎ番目のレコードのキー値を用いて、インデックスＡから「キー値を超える最小の値」を検索し、インデックスＡの検索結果が「真」の場合は、インデックスＡの検索結果該当レコードを指すポインタを出力する。 Step 341) According to the conditional branch in Step 365, the “minimum value exceeding the key value” is searched from the index A using the key value of the Nth record among the records of the index B acquired in Step 315, and the index If the search result of A is “true”, a pointer indicating the record corresponding to the search result of index A is output.

ステップ３４６）ステップ３４１において、プロセッサＪから与えられた検索条件を満たすレコードがインデックスＡに存在するかどうかを判定し、存在すればステップ３５１に移行し、存在しなければ処理を終了する。 Step 346) In step 341, it is determined whether or not a record satisfying the search condition given by the processor J exists in the index A. If it exists, the process proceeds to step 351, and if not, the process ends.

ステップ３５１）レコードリスト処理部１２０Ａは、インデックスＡの検索結果の該当レコードから順にＮ個分のレコードを指すように、インデックスＡのレコードを指すＮ個のポインタリストへ記録する。 Step 351) The record list processing unit 120A records in the N pointer lists indicating the records of the index A so as to indicate the N records in order from the corresponding record of the search result of the index A.

以下に、本発明による手法と、従来の技術による手法で５００万件のデータ同士を全件マージ結合した場合の比較データを図８に示す。モックアップによる試験のため、インデックス読み込み処理等に違いがあり、インデックス読み込み処理時間等を公平に比較することはできないが、本発明により、ネットワーク転送処理時間が減少していることが確認できた。 FIG. 8 shows comparison data when 5 million pieces of data are merged and joined by the method according to the present invention and the method according to the conventional technique. Due to the mock-up test, there is a difference in index reading processing and the like, and the index reading processing time cannot be compared fairly, but it was confirmed that the network transfer processing time was reduced by the present invention.

本発明は、図３に示すプロセッサの構成に基づいて説明したが、この例に限定されることなく、各プロセッサの構成要素の動作をプログラムとして構築し、各プロセッサ（コンピュータ）にインストールして実行させる、または、ネットワークを介して流通させることが可能である。 Although the present invention has been described based on the configuration of the processor shown in FIG. 3, the present invention is not limited to this example, and the operation of each processor component is constructed as a program, installed in each processor (computer), and executed. Or can be distributed via a network.

また、構築されたプログラムをハードディスクや、フレキシブルディスク、ＣＤ−ＲＯＭ等の可搬記憶媒体に格納し、コンピュータにインストールする、または、配布することが可能である。 In addition, the constructed program can be stored in a portable storage medium such as a hard disk, a flexible disk, or a CD-ROM, and can be installed or distributed in a computer.

なお、本発明は、上記の実施の形態に限定されることなく、特許請求の範囲内において種々変更・応用が可能である。 The present invention is not limited to the above-described embodiment, and various modifications and applications can be made within the scope of the claims.

本発明は、ＤＢＭＳ（Data Base Management System）やＲＤＢＭＳ（Relational Data Base Management System）に適用可能である。 The present invention is applicable to DBMS (Data Base Management System) and RDBMS (Relational Data Base Management System).

本発明の原理を説明するための図である。It is a figure for demonstrating the principle of this invention. 本発明の原理構成図である。It is a principle block diagram of this invention. 本発明の一実施の形態における分散インデックス結合システムの構成図である。It is a block diagram of the distributed index coupling | bonding system in one embodiment of this invention. 本発明の一実施の形態におけるレコードリスト記憶部を説明するための図である。It is a figure for demonstrating the record list memory | storage part in one embodiment of this invention. 本発明の一実施の形態におけるマージ結合のフローチャートである。It is a flowchart of the merge connection in one embodiment of this invention. 本発明の一実施の形態におけるインデックスＢ検索処理のフローチャートである。It is a flowchart of the index B search process in one embodiment of this invention. 本発明の一実施の形態におけるインデックスＡ検索処理のフローチャートである。It is a flowchart of the index A search process in one embodiment of this invention. 本発明の手法と従来の手法の比較結果である。It is a comparison result of the method of the present invention and the conventional method. 従来のマージ結合のフローチャートである。It is a flowchart of the conventional merge join.

Explanation of symbols

１００入出力部
１１０インデックス検索手段、インデックス検索部
１２０レコードリスト処理部
１３０記憶手段、レコードリスト記憶部
１４０レコードリスト送信手段、レコードリスト通信部
１４１結合レコード送信手段
２１０マージ処理部
２２０結合手段、レコード結合部
３１０検索条件入力装置
３２０検索結果出力装置 100 I / O Unit 110 Index Search Unit, Index Search Unit 120 Record List Processing Unit 130 Storage Unit, Record List Storage Unit 140 Record List Transmission Unit, Record List Communication Unit 141 Combined Record Transmission Unit 210 Merge Processing Unit 220 Combination Unit, Record Combination 310 Search condition input device 320 Search result output device

Claims

In a distributed network system to network multiple processors are connected to a coupling method of distributed index merging index each other via the network,
Stores an index A composed of indexes of a plurality of records, a pointer list composed of N (N ≧ 2) pointers for scanning the index A, and a record list composed of key values of sorted records corresponding to the pointers Processor A having storage means A for
An index B composed of indexes of a plurality of records, a pointer list composed of N (N ≧ 2) pointers for scanning the index B, and a record list composed of key values of sorted records corresponding to the pointers are stored. Processor B having storage means B for
In the system in which the processor A that combines the record of the index A of the processor A and the record of the index B of the processor B is connected via the network,
In the processor A,
Performing an initial record transmission step of generating N record lists A in order from the top of all indexes A and transmitting them to the processor B;
In the processor B,
Based on the record list A transmitted from the processor A, an index search step B for searching the index B from the storage means B;
A combined record transmission step B for transmitting the index B record and the index A record that satisfy the search condition in the index search step B to the processor J;
A record list transmission step B for generating N record lists B in order from the smallest value exceeding the Nth value (maximum value in the list) of the record list A in the index B and transmitting it to the processor A And
In the processor A,
Based on the record list B transmitted from the processor B, an index search step A for searching the index A from the storage means A;
A combined record transmission step A for transmitting the index A record and the index B record satisfying the search condition in the index retrieval step A to the processor J;
A record list transmission step A for generating N record lists A in order from the smallest value exceeding the Nth value (maximum value in the list) of the record list B in the index A and transmitting it to the processor B And do,
A distributed index combining method characterized by the above.

The network on a distributed network system in which a plurality of processors are connected to a coupling system distributed index merging index each other via the network,
The system
Stores an index A composed of indexes of a plurality of records, a pointer list composed of N (N ≧ 2) pointers for scanning the index A, and a record list composed of key values of sorted records corresponding to the pointers Processor A having storage means A for
An index B composed of indexes of a plurality of records, a pointer list composed of N (N ≧ 2) pointers for scanning the index B, and a record list composed of key values of sorted records corresponding to the pointers are stored. Processor B having storage means B for
A processor J coupling a record of the index B of the said a record index A processor B of the processor A, a,
The processor A is
Initial record transmitting means A for generating N record lists A in order from the top of all indexes A and transmitting them to the processor B ;
Based on the record list B transmitted from the processor B, the index search means A for searching the index A from the storage means A ;
Combined record transmission means A for transmitting the record of index A and the record of index B that satisfy the search condition by the index search means A to the processor J;
Record list transmission means A for generating N record lists A in order from the smallest value exceeding the Nth value (maximum value in the list) of the record list B in the index A and transmitting it to the processor B And having
The processor B
Before SL based on the record list A transmitted from the processor A, the index search unit B to find the index B from the storage unit B,
Combined record transmission means B for transmitting the record of index B and the record of index A that satisfy the search condition by the index search means B to the processor J;
Record list transmission means B for generating N record lists B in order from the smallest value exceeding the Nth value (maximum value in the list) of the record list A in the index B and transmitting it to the processor A When,
A distributed index combining system comprising: