JP2012123544A

JP2012123544A - Load distribution device and program

Info

Publication number: JP2012123544A
Application number: JP2010272830A
Authority: JP
Inventors: Yutaka Kaneko; 金子　　豊; Minxi Hwang; ▲ミン▼錫黄; Shinya Takeuchi; 真也竹内; Yoshinori Izumi; 吉則和泉
Original assignee: Nippon Hoso Kyokai NHK; Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2010-12-07
Filing date: 2010-12-07
Publication date: 2012-06-28
Anticipated expiration: 2030-12-07
Also published as: JP5600573B2

Abstract

PROBLEM TO BE SOLVED: To eliminate deviation in the management range length of data in respective node devices while suppressing the increase of storage capacity used for managing the node device participating in a system, in the system of distributing and managing the data by the plurality of node devices.SOLUTION: Each node device 1 constituting a Key-Value type data management system by a distribution hash table includes a load distribution device 10. When a node table for managing the node device 1 participating in the data management system is updated, the load distribution device 10 writes correspondence of the node ID of the node device 1 and an address to a node table for key retrieval. The load distribution device 10 obtains the management range length of a key in each node device 1 on the basis of the node table for the key retrieval, allocates a virtual node to the node device 1 of the shortest management range length until the management range lengths get nearly equal, and adds the address of the node device 1 and the node ID of the virtual node to the node table for the key retrieval.

Description

本発明は、分散型データベースシステムや分散ファイルシステムにおける負荷分散に関し、特にＤＨＴ（分散ハッシュテーブル、Distributed Hash Table）によるKey-Value型データ管理システムの負荷分散装置及びプログラムに関する。 The present invention relates to load distribution in a distributed database system and a distributed file system, and more particularly to a load distribution apparatus and program for a key-value data management system based on DHT (Distributed Hash Table).

ネットワークで接続された複数のノードを見かけ上１台の蓄積装置とみなしてデータやファイルを管理するシステムとして、分散型データベースシステムや分散ファイルシステムなどがある。このような分散型データベースシステムや分散ファイルシステムでは、「キー」と「値」を組みとしたデータ（以下、「キー」と「値」の組みを（キー、値）と記載する。）を管理するKey-Value型のデータ管理システムが使われている。大規模なKey-Value型のデータ管理システムを実現する方法としては、ＤＨＴが提案されている。ＤＨＴではシステムを構成する複数のノードにおいて、（キー、値）を分散して保管管理する。 There are a distributed database system, a distributed file system, and the like as a system for managing data and files by regarding a plurality of nodes connected via a network as if they are apparently regarded as one storage device. In such a distributed database system or distributed file system, data including a combination of “key” and “value” (hereinafter, a combination of “key” and “value” is described as (key, value)) is managed. A key-value type data management system is used. As a method for realizing a large-scale key-value type data management system, DHT has been proposed. In DHT, (key, value) is distributed and managed in a plurality of nodes constituting the system.

ＤＨＴの代表的な方式としてChordがある（例えば、非特許文献１参照。）。Chordでは、システムに参加するノードは、ハッシュ関数を使ってノードアドレスなどからシステム内で一意なノードＩＤを生成し、ノードＩＤを値の順序で円状に配置したオーバーレイネットワークを構成する。システムに（キー、値）を保管する場合、ノードは、キーからハッシュ関数を使ってキーＩＤを生成し、オーバーレイネットワーク上でそのキーＩＤに時計周りに近いノードＩＤのノードにその（キー、値）を保管する。そして、ノードがキーを検索する場合、キーからハッシュ関数を使ってキーＩＤを生成し、そのキーＩＤに近いノードＩＤのノードを発見し、発見したノードからキーに対応する値を取得する。 Chord is a typical DHT system (see, for example, Non-Patent Document 1). In Chord, a node participating in the system generates a unique node ID in the system from a node address or the like using a hash function, and forms an overlay network in which the node IDs are arranged in a circle in the order of values. When storing (key, value) in the system, the node generates a key ID from the key using a hash function, and the node (key, value) has a node ID close to the key ID in the clockwise direction on the overlay network. ). When a node searches for a key, a key ID is generated from the key using a hash function, a node having a node ID close to the key ID is found, and a value corresponding to the key is acquired from the found node.

例えば、「キー」をシステムで分散管理する対象のデータのデータ名とし、「値」をそのデータを記憶しているノードのアドレスとする。データを記憶しているノードは、データ名からハッシュ関数によりキーＩＤを生成し、オーバーレイネットワーク上でこのキーＩＤに時計周りに近いノードＩＤのノードに、データ名等を示す「キー」と自ノードのノードアドレスを示す「値」との対応付けを登録する。データを利用する利用ノードは、利用したいデータのデータ名からキーＩＤを生成し、このキーＩＤに時計周りに近いノードＩＤのノードに問合せを行なって、「キー」に対応した「値」を取得する。利用ノードは、この取得した「値」により、データを記憶しているノードのアドレスを知ることができる。 For example, “key” is a data name of data to be distributed and managed by the system, and “value” is an address of a node storing the data. The node storing the data generates a key ID from the data name by a hash function, and a node having a node ID close to the key ID in the clockwise direction on the overlay network, a “key” indicating the data name and the own node The association with the “value” indicating the node address is registered. The usage node that uses the data generates a key ID from the data name of the data that it wants to use, and queries the node with a node ID close to the clockwise direction to this key ID to obtain a “value” corresponding to the “key”. To do. The using node can know the address of the node storing the data from the acquired “value”.

ＤＨＴでは、キーＩＤに近いノードＩＤのノードを発見することが必要である。ノードを発見する方式として、システムに参加している各ノードにおいて、システムに参加しているノードの一部を管理するノードテーブルを保有することにより、システム全体でノードテーブルを分散して保管する方式と、システムに参加している全ノードが登録されている完全なノードテーブルを全ノードにおいて共通に持つ方式とがある。 In DHT, it is necessary to find a node having a node ID close to the key ID. As a node discovery method, each node participating in the system has a node table that manages a part of the nodes participating in the system, and the node table is distributed and stored throughout the system. And a method having a complete node table in which all nodes participating in the system are registered in all nodes.

ノードテーブルを分散して保管する方式では、多数のノードから構成される大規模なシステムを構成することが可能であり、完全なノードテーブルを保管する方式は、分散してノードテーブルを保管する方式に比べて大規模なシステムを構成が難しい。一方、ノードテーブルを分散して保管する方式では、キーを検索する場合、複数のノードをルーティングしながら探すため検索時間がかかるが、完全なノードテーブルを持つ方式では、ルーティングの必要がなくO(1)の検索ができる。言い換えれば、定数オーダーの検索ができる。Chordはノードテーブルを各ノードで分散して保管する方式であり、システムに参加しているノードの数をNとすると、キーを管理しているノードを発見するまでのホップ数はO(log₂N)（Ｎの対数のオーダー）となる。 In the method of storing node tables in a distributed manner, it is possible to configure a large-scale system consisting of a large number of nodes. The method of storing complete node tables is a method of storing node tables in a distributed manner. Compared to, it is difficult to configure a large-scale system. On the other hand, in the method of storing the node table in a distributed manner, when searching for a key, it takes time to search while routing a plurality of nodes. However, in the method having a complete node table, there is no need for routing and O ( 1) Search is possible. In other words, a constant order can be searched. Chord is a method of storing the node table in a distributed manner at each node. If the number of nodes participating in the system is N, the number of hops until the node managing the key is found is O (log ₂ N) (logarithm order of N).

完全なノードテーブルを保管する方式として非特許文献２、非特許文献３が提案されている。全ノードが完全なノードテーブルを共通に保管する方式では、システムへのノードの参加や離脱があった場合、全ノードがそれにあわせてノードテーブルを修正する。そのため、非特許文献２では、ハッシュ空間を分割した中に、Slice Leaderおよび、Unit Leaderのノードをそれぞれ割り当て、それらのノードを経由して階層的に全ノードに参加や離脱の通知を行っている。また、各ノードはノードテーブルを正しく保つために、ノードテーブルのノードおよび隣接ノードへの生存確認を定期的に行っている。非特許文献３では、各ノードはノードテーブの中からランダムにノードを選択し、定期的にノードテーブルを交換し合っている。 Non-Patent Document 2 and Non-Patent Document 3 have been proposed as methods for storing a complete node table. In a method in which all nodes store a complete node table in common, when a node joins or leaves the system, all nodes modify the node table accordingly. Therefore, in Non-Patent Document 2, nodes of Slice Leader and Unit Leader are allocated while the hash space is divided, and notification of participation or withdrawal is sent to all nodes hierarchically via those nodes. . Each node periodically checks the existence of the nodes in the node table and the adjacent nodes in order to keep the node table correct. In Non-Patent Document 3, each node randomly selects a node from the node table and periodically exchanges the node table.

ＤＨＴにおけるハッシュ空間内で、各ノードが管理する範囲は、時計回りと逆方向の隣接ノード（predecessor）のＩＤから自ノードＩＤまでとなる。ＳＨＡ−１などのハッシュ関数を使うことによって生成されるノードＩＤはランダムな値であるが、一般的に、生成されるノードＩＤには偏りが生じる。このハッシュ空間上でのノードＩＤの偏りは、各ノードが管理するキーＩＤの管理範囲長の偏りとなり、結果として、システムに参加しているノードが管理する（キー、値）のデータ量に差が生じてしまう。 The range managed by each node in the hash space in DHT is from the ID of an adjacent node (predecessor) in the reverse direction to the clockwise direction to the own node ID. The node ID generated by using a hash function such as SHA-1 is a random value, but in general, the generated node ID is biased. The bias of the node ID on the hash space is a bias of the management range length of the key ID managed by each node, and as a result, the difference in the amount of data (key, value) managed by the nodes participating in the system. Will occur.

これに対処する方式として仮想ノード方式がある。仮想ノード方式では、１つのノードが複数の異なるノードＩＤを生成し、オーバーレイネットワークに参加させることで、各ノードのキー管理範囲長を均等化させる。生成する仮想ノード数は、非特許文献１では１ノード当たりlog(N)個であり、非特許文献３では１ノード当たり６０という経験的な値である。仮想ノードのノードＩＤの生成方法としては、予め決められた方法、例えば、ノードアドレスに１，２，３・・・を順番に追加した値からハッシュ関数により生成する方法や、非特許文献２で提案しているように、元のノードＩＤからハッシュ空間上で等間隔な位置に配置する方法などがある。 There is a virtual node method as a method for dealing with this. In the virtual node method, one node generates a plurality of different node IDs and participates in the overlay network, thereby equalizing the key management range length of each node. The number of virtual nodes to be generated is an empirical value of log (N) per node in Non-Patent Document 1, and 60 per node in Non-Patent Document 3. As a method for generating a node ID of a virtual node, a predetermined method, for example, a method of generating from a value obtained by adding 1, 2, 3,... As proposed, there is a method of arranging the nodes at equidistant positions in the hash space from the original node ID.

仮想ノード方式は、各ノードのキー管理範囲長を均等化するだけではなく、ノードの性能が異なる場合に、その性能に応じてノードのキー管理範囲長を調整することにも利用することができる。すなわち、性能が高いノードの仮想ノード数を増やすことで、そのノードのキー管理範囲長を広くさせることができる。しかし、ノードの性能差に応じて仮想ノード数を制御する場合、システム内のノードの性能差が大きいと仮想ノード数を多くする必要がある。そのため、ノードテーブルサイズの増大や、管理オーバーヘッドの増加などの問題が生じる。これを解決するために、特許文献１では、ハッシュ空間上でノードが管理する境界を、隣接ノード間の性能差に応じて各ノード間で調整している。 The virtual node method can be used not only to equalize the key management range length of each node but also to adjust the key management range length of the node according to the performance when the node performance differs. . That is, by increasing the number of virtual nodes of a node with high performance, the key management range length of that node can be increased. However, when the number of virtual nodes is controlled according to the difference in node performance, it is necessary to increase the number of virtual nodes if the performance difference between the nodes in the system is large. Therefore, problems such as an increase in node table size and an increase in management overhead occur. In order to solve this, in Patent Document 1, a boundary managed by a node on a hash space is adjusted between nodes according to a performance difference between adjacent nodes.

特開２００４−２５２６６３号公報JP 2004-252663 A

I. Stoica他, “Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications,” Proc. ACM SIGCOMM, 2001I. Stoica et al., “Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications,” Proc. ACM SIGCOMM, 2001 金子他, “ノードの局所性と管理の公平性を考慮したOneHop-P2P拡張方式,” FIT2008, no.4, RL-006, 2008Kaneko et al., “OneHop-P2P extension method considering node locality and management fairness,” FIT2008, no.4, RL-006, 2008 前橋他, “大規模データ処理のための分散システムの実装とその応用”, 信学論D, Vol.J93-D, No.7, 2010Maebashi et al., “Implementation and application of distributed systems for large-scale data processing”, IEICE D, Vol.J93-D, No.7, 2010

上述のように、ＤＨＴでは、ハッシュ関数を使って生成したノードＩＤを用いた場合、ハッシュ空間上でのノードの配置に偏りが生じ、その結果として各ノードが管理するキー管理範囲長に差が生じる。そこで、これを解決するために、各ノードに仮想ノードを割り当てる方法が用いられている。しかし、各ノードに固定数の仮想ノードを割り当てると、ノードテーブルサイズの増加や、管理オーバーヘッドの増加などによって必要な記憶容量が増大し、リソースの負荷が生じる。そのため、むやみに多数の仮想ノードを割り当てることには問題がある。
また、特許文献１のように、隣接ノード間の割り当て範囲を調整する方式であっても、ノードの配置がもともと偏っている場合には、割り当て範囲の調整範囲が狭くあるいは広くなりすぎてしまう。そのため、固定的数を割り当てる仮想ノード手法を併用することが必要となり、根本的な解決にはならない。 As described above, in the DHT, when the node ID generated using the hash function is used, there is a bias in the arrangement of the nodes on the hash space, and as a result, there is a difference in the key management range length managed by each node. Arise. In order to solve this problem, a method of assigning a virtual node to each node is used. However, if a fixed number of virtual nodes are allocated to each node, the required storage capacity increases due to an increase in the node table size, an increase in management overhead, and the like, resulting in a resource load. Therefore, there is a problem in unnecessarily allocating a large number of virtual nodes.
Further, even if the allocation range between adjacent nodes is adjusted as in Patent Document 1, if the arrangement of nodes is originally biased, the allocation range adjustment range is too narrow or too wide. For this reason, it is necessary to use a virtual node method for assigning a fixed number, which is not a fundamental solution.

本発明は、このような事情を考慮してなされたもので、複数のノード装置によりデータを分散して管理するシステムにおいて、システムに参加しているノード装置を管理するために使用する記憶容量の増加を抑えながら、各ノード装置におけるデータの管理範囲長に偏りをなくすことができる負荷分散装置及びプログラムを提供する。 The present invention has been made in consideration of such circumstances, and in a system in which data is distributed and managed by a plurality of node devices, the storage capacity used for managing the node devices participating in the system is reduced. Provided are a load distribution device and a program capable of eliminating a bias in a data management range length in each node device while suppressing an increase.

［１］本発明の一態様は、データ管理システムを構成するノード装置を特定するノード特定情報を含んだノードテーブルを記憶するとともに、前記ノード特定情報と、前記ノード特定情報により特定される前記ノード装置に対して生成されたノードＩＤとの対応付けを示すキー検索用ノードテーブルを記憶する記憶部と、前記データ管理システムに参加したノード装置のノード特定情報を前記ノードテーブルに書き込み、前記データ管理システムから離脱したノード装置のノード特定情報を前記ノードテーブルから削除するノードテーブル管理部と、前記キー検索用ノードテーブルに設定されているノードＩＤと、前記ノードＩＤに対応付けられた前記ノード特定情報とに基づいて、各ノード装置が管理対象とするキーの管理範囲長を算出する管理範囲長演算部と、前記ノードテーブル内のノード特定情報と、前記ノード特定情報により特定されるノード装置に対して生成されたノードＩＤとの対応付けを設定したキー検索用ノードテーブルを前記記憶部に書き込み、生成した前記キー検索用ノードテーブルに基づいて前記管理範囲長演算部により算出された各ノード装置の管理範囲長の差異が所定の条件に合致すると判断するまで、前記管理範囲長に基づいて選択した前記ノード装置に対して新たにノードＩＤを生成するとともに、前記ノード装置のノード特定情報と新たに生成した前記ノードＩＤとの対応付けを前記キー検索用ノードテーブルに追加して更新する処理を繰り返す仮想ノード登録部と、前記キー検索用ノードテーブル内のノードＩＤと、指定されたキーに基づいて、前記キーに対応した値を管理している前記ノード装置のノード特定情報を取得するキーテーブル処理部と、を備えることを特徴とする負荷分散装置である。
この態様によれば、キーに対応した値を保持するノード装置を検索するためにノード装置の特定情報とノードＩＤとを対応づけたキー検索用ノードテーブルを用いるKey-Value型データ管理システムにおいて、ノード装置の参加や離脱が発生した際、負荷分散装置は、ノード装置間での管理範囲長の差異が所定条件を満たすまで（一例として、各ノード装置のキーの管理範囲長の差異が所定の範囲内になるまで、あるいは、ノード性能値によって重み付けした各ノード装置のキーの管理範囲長の差異が所定の範囲内になるまで、言い換えれば、各ノード装置のノード性能値に対するキーの管理範囲長の比の差異が所定の範囲内になるまで）、適宜選択されたノード装置（一例として、管理範囲長が最も狭いノード装置）に対して仮想ノードのノードＩＤを割り当ててキーの管理範囲長を増加させる処理を繰り返し、ノード装置の特定情報と当該ノード装置に対して生成されたノードＩＤとからなるキー検索用ノードテーブルを生成する。
なお、ノード装置のキーの管理範囲は、そのノード装置のノード特定情報に対応してキー検索用ノードテーブルに登録されているノードＩＤに基づいて決まる。ノード装置に割り当てるノードＩＤは、そのノード装置のノード特定情報と、当該ノード装置に割り当て済みのノードＩＤ毎に固有の情報とを元に算出されたハッシュ値を用いることができる。
一例として、ノードＩＤに対応した管理範囲は、隣接するノードＩＤから自ノードＩＤまでである。但し、範囲の境界がどちらに属するかは適宜予め定める。
管理範囲長は、上記の管理範囲の長さであり、各ノード装置のキーの管理範囲長は、そのノード装置に割り当てられた各ノードＩＤに対応した管理範囲の長さの総計である。
各ノードのキーの管理範囲長の差異を所定の範囲内とすることは、各ノード装置の負荷を同等に近くするということである。
これにより、各ノード装置のキーの管理範囲長の不均衡を効率的に是正し、システム全体に割り当てる仮想ノードの数を抑えることができる。キーの管理範囲長の不均衡を是正することによって各ノード装置の負荷の偏りも是正され、従って、キー検索用ノードテーブルの増加を抑えながら、各ノード装置の負荷分散を行なうことが可能となる。 [1] One aspect of the present invention stores a node table including node specifying information for specifying a node device constituting a data management system, and the node specified by the node specifying information and the node specifying information. A storage unit for storing a key search node table indicating a correspondence with a node ID generated for the device; and node identification information of a node device participating in the data management system is written to the node table, and the data management is performed. A node table management unit that deletes node specifying information of a node device that has left the system from the node table, a node ID set in the key search node table, and the node specifying information associated with the node ID Based on the above, the management range length of the key to be managed by each node device is calculated. A key search node table in which a correspondence between a management range length calculation unit, node identification information in the node table, and a node ID generated for the node device identified by the node identification information is set. The management range length until it is determined that the difference in management range length of each node device calculated by the management range length calculation unit based on the generated key search node table is written in the storage unit and meets a predetermined condition A new node ID is generated for the node device selected based on the node, and a correspondence between the node specifying information of the node device and the newly generated node ID is added to the key search node table. Based on the virtual node registration unit that repeats the updating process, the node ID in the key search node table, and the specified key A load balancer, characterized in that it comprises a key table processing unit for acquiring node identification information of the node device which manages a value corresponding to the key.
According to this aspect, in the key-value type data management system using the key search node table in which the node device specific information and the node ID are associated with each other in order to search for the node device that holds the value corresponding to the key. When a node device joins or leaves, the load balancer determines that the difference in management range length between node devices satisfies a predetermined condition (for example, the difference in key management range length of each node device is a predetermined value). Until the difference in key management range length of each node device weighted by the node performance value falls within a predetermined range, in other words, the key management range length for the node performance value of each node device Until the difference in the ratios falls within a predetermined range), the node of the virtual node is selected with respect to the appropriately selected node device (for example, the node device having the narrowest management range length). Repeating a process of increasing the control range key length by assigning ID, and generate a key search node table comprising a node ID generated for a particular information and the node device of the node device.
The key device management range is determined based on the node ID registered in the key search node table corresponding to the node specifying information of the node device. As the node ID assigned to the node device, a hash value calculated based on the node identification information of the node device and information unique to each node ID already assigned to the node device can be used.
As an example, the management range corresponding to the node ID is from the adjacent node ID to the own node ID. However, which range boundary belongs to is determined in advance as appropriate.
The management range length is the length of the management range described above, and the management range length of the key of each node device is the total of the lengths of the management ranges corresponding to the respective node IDs assigned to the node device.
Making the difference in the key management range length of each node within a predetermined range means that the load of each node device is made nearly equal.
Thereby, the imbalance of the key management range length of each node apparatus can be corrected efficiently, and the number of virtual nodes allocated to the entire system can be suppressed. Correcting the imbalance in the key management range length also corrects the load imbalance of each node device. Therefore, it is possible to distribute the load of each node device while suppressing an increase in the key search node table. .

［２］本発明の一態様は、上述する負荷分散装置であって、前記管理範囲長演算部は、前記ノード装置の管理範囲長を、前記ノード装置の性能を示す値に応じて重み付けし、前記仮想ノード登録部は、前記ノード装置の性能を示す値に応じて重み付けされた前記管理範囲長の差異が所定の条件に合致するかを判断する、ことを特徴とする。
この態様によれば、負荷分散装置は、ノード性能の指標値によって各ノード装置におけるキーの管理範囲長を重み付けし、重み付けした管理範囲長が同等に近くなるまで、重み付けした管理範囲長が最も狭いノード装置に対して仮想ノードのノードＩＤを割り当てる。一例として、キーの管理範囲長の重み付けは、キーの管理範囲長に、ＣＰＵの性能を示す値の逆数を乗算したり、総ディスク容量の逆数を乗算したりすることにより行なう。言い換えれば、ノードの性能値に対するキーの管理範囲長の比の値を用いる。
従って、ノード装置の処理能力やメモリ容量などに応じた管理範囲長を各ノード装置に割り当てることができる。 [2] One aspect of the present invention is the load distribution device described above, wherein the management range length calculation unit weights the management range length of the node device according to a value indicating the performance of the node device, The virtual node registration unit determines whether a difference in the management range length weighted according to a value indicating the performance of the node device matches a predetermined condition.
According to this aspect, the load distribution device weights the key management range length in each node device by the node performance index value, and the weighted management range length is the narrowest until the weighted management range length becomes nearly equal. A node ID of the virtual node is assigned to the node device. As an example, the key management range length is weighted by multiplying the key management range length by the reciprocal of a value indicating the performance of the CPU or the reciprocal of the total disk capacity. In other words, the value of the ratio of the key management range length to the node performance value is used.
Therefore, it is possible to assign a management range length corresponding to the processing capability and memory capacity of the node device to each node device.

［３］本発明の一態様は、上述する負荷分散装置であって、前記管理範囲長演算部は、前記仮想ノード登録部により前記ノード装置のノード特定情報と新たに生成した前記ノードＩＤとの対応付けが前記キー検索用ノードテーブルに追加された場合、前記キー検索用ノードテーブルに追加されたノードＩＤと隣接するノードＩＤを特定してキーの管理範囲長を算出し、新たにノードＩＤが生成された前記ノード装置の管理範囲長を、算出した前記管理範囲長を加算した値に更新するとともに、前記キー検索用ノードテーブルから前記仮想ノードのノードＩＤに隣接するノードＩＤに対応したノード特定情報を読み出し、読み出したノード特定情報により特定される前記ノード装置のキー管理範囲長を、算出した前記管理範囲長を減算した値に更新する、ことを特徴とする。
この態様によれば、負荷分散装置は、一旦算出した各ノード装置のキーの管理範囲長を記憶しておき、ノード装置に仮想ノードのノードＩＤが追加されてキー検索用ノードテーブルが更新されたときは、追加されたノードＩＤの仮想ノードについてキーの管理範囲長を算出する。そして、記憶していた管理範囲長のうち、仮想ノードが追加されたノード装置と、仮想ノードに隣接するノードに対応したノード装置とについて記憶していたキー管理範囲長のみを変更する。
これにより、各ノード装置のキーの管理範囲長を高速に算出することが可能となる。 [3] One aspect of the present invention is the load distribution apparatus described above, wherein the management range length calculation unit is configured to perform node identification information of the node device and the node ID newly generated by the virtual node registration unit. When the association is added to the key search node table, the node ID adjacent to the node ID added to the key search node table is specified to calculate the key management range length. Update the generated management range length of the node device to a value obtained by adding the calculated management range length, and specify a node corresponding to a node ID adjacent to the node ID of the virtual node from the key search node table Information is read, and the key management range length of the node device identified by the read node identification information is subtracted from the calculated management range length. It is characterized by updating.
According to this aspect, the load distribution device stores the key management range length of each node device once calculated, and the node ID of the virtual node is added to the node device and the key search node table is updated. If so, the key management range length is calculated for the virtual node with the added node ID. Then, of the stored management range length, only the key management range length stored for the node device to which the virtual node is added and the node device corresponding to the node adjacent to the virtual node is changed.
This makes it possible to calculate the key management range length of each node device at high speed.

［４］本発明の一態様は、上述する負荷分散装置であって、前記ノード特定情報は、前記ノード装置のアドレスであることを特徴とする。
この態様によれば、キー検索用ノードテーブルに、ノード特定情報としてノードアドレスが設定される。
これにより、キーテーブル処理部は、キー検索用ノードテーブルからキーを管理するノード装置のアドレスを取得することができるため、読み出したアドレスを宛先に用いてキーに対応する値の問合せを行なうことが可能となる。 [4] One aspect of the present invention is the load distribution apparatus described above, wherein the node specifying information is an address of the node apparatus.
According to this aspect, the node address is set as the node specifying information in the key search node table.
As a result, the key table processing unit can acquire the address of the node device that manages the key from the key search node table, so that the value corresponding to the key can be inquired using the read address as the destination. It becomes possible.

［５］本発明の一態様は、負荷分散装置として用いられるコンピュータを、データ管理システムを構成するノード装置を特定するノード特定情報を含んだノードテーブルを記憶するとともに、前記ノード特定情報と、前記ノード特定情報により特定される前記ノード装置に対して生成されたノードＩＤとの対応付けを示すキー検索用ノードテーブルを記憶する記憶部、前記データ管理システムに参加したノード装置のノード特定情報を前記ノードテーブルに書き込み、前記データ管理システムから離脱したノード装置のノード特定情報を前記ノードテーブルから削除するノードテーブル管理部、前記キー検索用ノードテーブルに設定されているノードＩＤと、前記ノードＩＤに対応付けられた前記ノード特定情報とに基づいて、各ノード装置が管理対象とするキーの管理範囲長を算出する管理範囲長演算部、前記ノードテーブル内のノード特定情報と、前記ノード特定情報により特定されるノード装置に対して生成されたノードＩＤとの対応付けを設定したキー検索用ノードテーブルを前記記憶部に書き込み、生成した前記キー検索用ノードテーブルに基づいて前記管理範囲長演算部により算出された各ノード装置の管理範囲長の差異が所定の条件に合致すると判断するまで、前記管理範囲長に基づいて選択した前記ノード装置に対して新たにノードＩＤを生成するとともに、前記ノード装置のノード特定情報と新たに生成した前記ノードＩＤとの対応付けを前記キー検索用ノードテーブルに追加して更新する処理を繰り返す仮想ノード登録部、前記キー検索用ノードテーブル内のノードＩＤと、指定されたキーに基づいて、前記キーに対応した値を管理している前記ノード装置のノード特定情報を取得するキーテーブル処理部、として機能させることを特徴とするプログラムである。 [5] According to one aspect of the present invention, a computer used as a load distribution apparatus stores a node table including node specification information for specifying a node device constituting a data management system, and the node specification information; A storage unit for storing a key search node table indicating a correspondence with a node ID generated for the node device specified by the node specifying information, and node specifying information of the node device participating in the data management system Corresponds to the node ID that is written in the node table, deletes node identification information of the node device that has left the data management system from the node table, the node ID set in the key search node table, and the node ID Each node device based on the node identification information attached. A management range length calculation unit for calculating a management range length of a key to be managed, a correspondence between node identification information in the node table and a node ID generated for a node device identified by the node identification information A key search node table in which an attachment is set is written in the storage unit, and a difference in management range length of each node device calculated by the management range length calculation unit based on the generated key search node table is a predetermined condition A node ID is newly generated for the node device selected based on the management range length, and the node identification information of the node device is associated with the newly generated node ID Is added to the key search node table, and the virtual node registration unit that repeats the process of updating the key search node table, A program that functions as a key table processing unit that acquires node identification information of the node device that manages a value corresponding to the key based on a node ID and a specified key.

本発明によれば、複数のノード装置によりデータを分散して管理するKey-Value型データ管理システムにおいて、システムに参加しているノード装置を管理するために使用する記憶容量の増加を抑えながら、各ノード装置におけるデータの管理範囲長に偏りをなくすことができる。 According to the present invention, in a key-value type data management system that distributes and manages data by a plurality of node devices, while suppressing an increase in storage capacity used to manage node devices participating in the system, It is possible to eliminate the bias in the data management range length in each node device.

本発明の第１の実施形態によるデータ管理システムの構成図である。1 is a configuration diagram of a data management system according to a first embodiment of the present invention. 同実施形態による負荷分散装置の構成を示すブロック図である。It is a block diagram which shows the structure of the load distribution apparatus by the embodiment. 同実施形態によるノードテーブルのデータ構成例を示す図である。It is a figure which shows the data structural example of the node table by the embodiment. 同実施形態によるキー検索用ノードテーブルのデータ構成例を示す図である。It is a figure which shows the data structural example of the node table for key searches by the embodiment. 同実施形態によるキーテーブルのデータ構成例を示す図である。It is a figure which shows the data structural example of the key table by the embodiment. 同実施形態によるノードテーブル管理部のノードテーブル更新処理フローを示す図である。It is a figure which shows the node table update process flow of the node table management part by the embodiment. 同実施形態による仮想ノード登録部のキー検索用ノードテーブル更新処理フローを示す図である。It is a figure which shows the node table update process flow for a key search of the virtual node registration part by the embodiment. 同実施形態による管理範囲長演算部の管理範囲長算出処理を説明するための図である。It is a figure for demonstrating the management range length calculation process of the management range length calculating part by the embodiment. 同実施形態によるデータ管理システムの実験結果を示す図である。It is a figure which shows the experimental result of the data management system by the embodiment. 第２の実施形態による管理範囲長演算部の管理範囲長算出処理を説明するための図である。It is a figure for demonstrating the management range length calculation process of the management range length calculating part by 2nd Embodiment. 第３の実施形態によるノードテーブルのデータ構成例を示す図である。It is a figure which shows the data structural example of the node table by 3rd Embodiment. 同実施形態によるキー検索用ノードテーブルのデータ構成例を示す図である。It is a figure which shows the data structural example of the node table for key searches by the embodiment. 同実施形態による管理範囲長演算部の管理範囲長算出処理を説明するための図である。It is a figure for demonstrating the management range length calculation process of the management range length calculating part by the embodiment.

以下、図面を参照しながら本発明の実施形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

[１．第１の実施形態]
[１．１全体構成]
図１は、本発明の第１の実施形態によるデータ管理システムの構成図である。本実施形態のデータ管理システムは、ＤＨＴを用いたKey-Value（キー（鍵）−値）型データ管理システムであり、例えば、分散型データベースシステムや、分散ファイルシステムなどである。同図に示すように、データ管理システムは、通信ネットワーク５を介して複数のノード装置１を接続することにより構成される。通信ネットワーク５は、例えば、ルータなどの通信装置やケーブルなどの通信線等の各種ネットワーク装置から構成されるＩＰ（Internet Protocol、インターネットプロトコル）通信網である。ノード装置１は、負荷分散装置１０とデータ記憶部３０とを備える。データ記憶部３０は、データ管理システムにおいて管理対象となるファイル等のデータを記憶する。 [1. First Embodiment]
[1.1 Overall configuration]
FIG. 1 is a configuration diagram of a data management system according to the first embodiment of the present invention. The data management system of this embodiment is a key-value (key (value) -value) type data management system using DHT, for example, a distributed database system or a distributed file system. As shown in the figure, the data management system is configured by connecting a plurality of node devices 1 via a communication network 5. The communication network 5 is an IP (Internet Protocol) communication network composed of various network devices such as communication devices such as routers and communication lines such as cables. The node device 1 includes a load distribution device 10 and a data storage unit 30. The data storage unit 30 stores data such as files to be managed in the data management system.

[１．２負荷分散装置とデータ管理システムとの関係]
負荷分散装置１０は、各ノード装置１内で動作する。ノード装置１は、例えば、ノードを実行するコンピュータ装置であり、負荷分散装置１０は、ノード装置１がアプリケーションプログラムを実行することにより実行されるプロセスに相当する。各負荷分散装置１０は、Key-Value型データ管理システムに参加しているノード装置１を管理するノードテーブルと、キー管理を行っているノード装置１の発見に用いるキー検索用ノードテーブルを保持している。Key-Value型データ管理システムへの新たなノード装置１の参加や、Key-Value型データ管理システムに参加していたノード装置１の離脱が発生した場合に、負荷分散装置１０が動作してノードテーブル及びキー検索用ノードテーブルを修正する。全ての負荷分散装置１０は同じ動作をするため、Key-Value型データ管理システム内の全ての負荷分散装置１０のノードテーブルとキー検索用ノードテーブルは同一に保たれる。 [1.2 Relationship between load balancer and data management system]
The load balancer 10 operates within each node device 1. The node device 1 is, for example, a computer device that executes a node, and the load distribution device 10 corresponds to a process that is executed when the node device 1 executes an application program. Each load balancer 10 holds a node table for managing the node devices 1 participating in the key-value type data management system and a key search node table used for discovery of the node devices 1 performing key management. ing. When a new node device 1 participates in the key-value type data management system or when the node device 1 that has participated in the key-value type data management system is disconnected, the load distribution device 10 operates and the node Modify the table and the key retrieval node table. Since all the load distribution apparatuses 10 perform the same operation, the node table and the key search node table of all the load distribution apparatuses 10 in the key-value type data management system are kept the same.

本実施形態において、ノードテーブルには、Key-Value型データ管理システムに参加しているノード装置１のみがノードとして登録される。一方、キーの登録や検索に使用されるキー検索用ノードテーブルには、ＤＨＴにおける円状のオーバーレイネットワークを構成するノードが登録される。オーバーレイネットワークを構成するノードには、ノード装置１のように実体があるノードと、ノード装置１に対して仮想的に生成された仮想ノードとが含まれる。本実施形態では、キー検索用ノードテーブルに仮想ノードを登録することにより負荷分散を実現する。 In the present embodiment, only the node devices 1 participating in the key-value type data management system are registered as nodes in the node table. On the other hand, nodes constituting a circular overlay network in DHT are registered in the key search node table used for key registration and search. The nodes constituting the overlay network include a node that has an entity like the node device 1 and a virtual node that is virtually generated for the node device 1. In the present embodiment, load distribution is realized by registering virtual nodes in the key search node table.

なお、ＤＨＴを用いたKey-Value型データ管理システムの構成方法や、ノードの参加・離脱の通知方法は、既存のKey-Value型データ管理システムの技術を適用する。 The existing key-value type data management system technology is applied to the configuration method of the key-value type data management system using DHT and the node joining / leaving notification method.

[１．３負荷分散装置の構成]
図２は、本実施形態による負荷分散装置１０の構成を示すブロック図であり、本実施形態と関係する機能ブロックのみ抽出して示している。同図に示すように、負荷分散装置１０は、ノードテーブル管理部１１、仮想ノード登録部１２、ノードＩＤ生成部１３、管理範囲長演算部１４、記憶部１５、及び、キーテーブル処理部１６を備えて構成される。 [1.3 Configuration of load balancer]
FIG. 2 is a block diagram showing a configuration of the load distribution apparatus 10 according to the present embodiment, and only functional blocks related to the present embodiment are extracted and shown. As shown in the figure, the load distribution apparatus 10 includes a node table management unit 11, a virtual node registration unit 12, a node ID generation unit 13, a management range length calculation unit 14, a storage unit 15, and a key table processing unit 16. It is prepared for.

記憶部１５は、ハードディスク装置や半導体メモリなどで実現され、ノードテーブル２１、キー検索用ノードテーブル２２、及び、キーテーブル２３を記録する。ノードテーブル２１は、データ管理システムを構成するノード装置１のノードＩＤとアドレスを対応付けたデータである。キー検索用ノードテーブル２２は、オーバーレイネットワークに参加しているノードのノードＩＤとアドレスを対応付けたデータである。キーテーブル２３は、「キー」と「値」の組みを示すデータである。 The storage unit 15 is realized by a hard disk device, a semiconductor memory, or the like, and records a node table 21, a key search node table 22, and a key table 23. The node table 21 is data in which the node ID and the address of the node device 1 constituting the data management system are associated with each other. The key search node table 22 is data in which node IDs and addresses of nodes participating in the overlay network are associated with each other. The key table 23 is data indicating a set of “key” and “value”.

ノードテーブル管理部１１は、データ管理システムに新たなノード装置１が参加した場合や、データ管理システムに参加していたノード装置１が離脱した場合に、記憶部１５に記憶されているノードテーブル２１を更新する。ノードＩＤ生成部１３は、ノード装置１のアドレスと仮想ノードの数に基づいてノードＩＤを生成する。管理範囲長演算部１４は、キー検索用ノードテーブル２２に設定されているノードＩＤに基づいて各ノード装置１が管理するキーの範囲の広さである管理範囲長を算出する。 The node table management unit 11 stores the node table 21 stored in the storage unit 15 when a new node device 1 participates in the data management system or when the node device 1 that has participated in the data management system leaves. Update. The node ID generation unit 13 generates a node ID based on the address of the node device 1 and the number of virtual nodes. The management range length calculation unit 14 calculates a management range length that is the width of the key range managed by each node device 1 based on the node ID set in the key search node table 22.

仮想ノード登録部１２は、管理範囲長演算部１４が算出した管理範囲長が最も狭いノード装置１に対して仮想ノードを生成し、記憶部１５に記憶されているキー検索用ノードテーブル２２を更新する処理を、各ノード装置１の管理範囲長の差異が所定以下になるか、仮想ノード数が上限に達するまで繰り返す。仮想ノード登録部１２が備えるメモリ１２１は、各ノード装置１の管理範囲長の差異が所定以下であると判断するための条件である管理範囲しきい値Ｔｈと、仮想ノード数の上限を示す最大仮想ノード数Ｖｍａｘを記憶する。管理範囲しきい値Ｔｈは、最も狭い管理範囲長と最も広い管理範囲長の比についてのしきい値である。キーテーブル処理部１６は、記憶部１５に保存されているキーテーブル２３を参照して、キーに対応した値を読み出す。 The virtual node registration unit 12 generates a virtual node for the node device 1 with the shortest management range length calculated by the management range length calculation unit 14 and updates the key search node table 22 stored in the storage unit 15. This process is repeated until the difference in the management range length of each node device 1 becomes equal to or less than a predetermined value or the number of virtual nodes reaches the upper limit. The memory 121 included in the virtual node registration unit 12 includes a management range threshold Th that is a condition for determining that a difference in management range length of each node device 1 is equal to or less than a predetermined value, and a maximum indicating the upper limit of the number of virtual nodes. The virtual node number Vmax is stored. The management range threshold Th is a threshold for the ratio of the narrowest management range length to the widest management range length. The key table processing unit 16 refers to the key table 23 stored in the storage unit 15 and reads a value corresponding to the key.

[１．４データ構成]
図３は、本実施形態によるノードテーブル２１のデータ構成例を示す図である。
同図に示すように、ノードテーブル２１は、データ管理システムを構成するノード装置１のノードＩＤとノードアドレスと対応付けたデータのリストである。ノードテーブル２１は、ノードＩＤの数値順にソートされている。ノードテーブル２１のノードＩＤには、仮想ノードのノードＩＤは含まれない。本実施形態では、ノード装置１のＩＰ（Internet Protocol）アドレスとポート番号の組み合わせをノードアドレスとして用いる。 [1.4 Data structure]
FIG. 3 is a diagram illustrating a data configuration example of the node table 21 according to the present embodiment.
As shown in the figure, the node table 21 is a list of data associated with the node ID and node address of the node device 1 constituting the data management system. The node table 21 is sorted in numerical order of node IDs. The node ID of the node table 21 does not include the node ID of the virtual node. In the present embodiment, a combination of an IP (Internet Protocol) address and a port number of the node device 1 is used as the node address.

図４は、本実施形態によるキー検索用ノードテーブル２２のデータ構成例を示す図である。
同図に示すように、キー検索用ノードテーブル２２は、オーバーレイネットワークを構成するノードのノードＩＤ、ノードアドレス、及び、仮想ノード数を対応付けたデータのリストである。キー検索用ノードテーブル２２は、ノードＩＤの数値順にソートされている。キー検索用ノードテーブル２２のノードＩＤには、ノード装置１のノードＩＤに加え、仮想ノードのノードＩＤも含まれる。ＤＨＴを用いたKey-Value型データ管理システムである本実施形態のデータ管理システムは、キー管理を行うノードの発見にこのキー検索用ノードテーブル２２を用いる。すなわち、キー検索用ノードテーブル２２がＤＨＴにおける円状のオーバーレイネットワークを示している。ノードアドレスは、ノード装置１を特定するノード特定情報として用いることができる。 FIG. 4 is a diagram illustrating a data configuration example of the key search node table 22 according to the present embodiment.
As shown in the figure, the key search node table 22 is a list of data in which node IDs, node addresses, and virtual node numbers of nodes constituting the overlay network are associated with each other. The key search node table 22 is sorted in numerical order of node IDs. In addition to the node ID of the node device 1, the node ID of the key search node table 22 includes the node ID of the virtual node. The data management system of the present embodiment, which is a key-value type data management system using DHT, uses this key search node table 22 for finding a node that performs key management. That is, the key search node table 22 indicates a circular overlay network in DHT. The node address can be used as node specifying information for specifying the node device 1.

図５は、本実施形態によるキーテーブル２３のデータ構成例を示す図である。
同図に示すように、キーテーブル２３は、データ管理システムで管理するキーと値との組み合わせを対応付けたデータのリストである。本実施形態では、キーがデータを特定するデータＩＤであり、値がデータを記憶しているノード装置１のノードアドレスである場合を例に説明する。 FIG. 5 is a diagram illustrating a data configuration example of the key table 23 according to the present embodiment.
As shown in the figure, the key table 23 is a list of data in which combinations of keys and values managed by the data management system are associated. In the present embodiment, a case where the key is a data ID for specifying data and the value is the node address of the node device 1 storing the data will be described as an example.

[１．５負荷分散装置の動作]
次に、負荷分散装置１０の動作について説明する。
まず、ＤＨＴを用いたKey-Value型データ管理システムにおける参加及び離脱の通知手順の一例について説明する。ノード装置１がオーバーレイネットワークに参加する場合、当該ノード装置１内の負荷分散装置１０は、既にオーバーレイネットワークに参加しているいずれかのノード装置１内の負荷分散装置１０に対して参加を要求する。参加の要求を受信した負荷分散装置１０は、ノードの参加をスライス・リーダー（Slice Leader）へ通知し、スライス・リーダーは他のスライス・リーダーへ通知を行い、さらに、スライス・リーダーから自領域内のユニット・リーダー（Unit Leader）への通知を行なう。このように、ノードの参加情報をオーバーレイネットワーク内の全ノードに通知する。なお、負荷分散装置１０は、ノードテーブル２１に登録されているノードＩＤに基づいてスライス・リーダー、ユニット・リーダーなどの通知先を決定する。 [1.5 Operation of load balancer]
Next, the operation of the load distribution apparatus 10 will be described.
First, an example of a joining / leaving notification procedure in a key-value data management system using DHT will be described. When the node device 1 participates in the overlay network, the load distribution device 10 in the node device 1 requests participation from the load distribution device 10 in any of the node devices 1 that have already participated in the overlay network. . The load balancer 10 that has received the request for participation notifies the participation of the node to the slice leader (Slice Leader), and the slice leader notifies other slice leaders. Notification to the unit leader. In this way, the node participation information is notified to all nodes in the overlay network. Note that the load balancer 10 determines a notification destination such as a slice leader or a unit leader based on the node ID registered in the node table 21.

また、負荷分散装置１０がノード装置１の離脱を検出した場合、参加の通知の場合と同様に、ノードの離脱をスライス・リーダーへ通知し、スライス・リーダーは他のスライス・リーダーへ通知を行い、さらに、スライス・リーダーから自領域内のユニット・リーダーへの通知を行なう。負荷分散装置１０は、オーバーレイネットワークから離脱した他のノード装置１内の負荷分散装置１０から離脱の通知を受けるか、定期的に他のノード装置１に送信する生存確認信号に対する応答を受信しないことによって離脱を検出する。なお、生存確認信号は、ノードテーブル２１に登録されているノードアドレスを宛先として送信される。 When the load balancer 10 detects the detachment of the node device 1, the detachment of the node is notified to the slice leader as in the case of the notification of participation, and the slicing leader notifies the other slice leaders. In addition, the slice leader notifies the unit leader in his area. The load balancer 10 does not receive a notification of leaving from the load balancer 10 in the other node device 1 that has left the overlay network or does not receive a response to the survival confirmation signal that is periodically transmitted to the other node device 1. To detect withdrawal. The survival confirmation signal is transmitted with the node address registered in the node table 21 as the destination.

なお、上述した参加・離脱の通知手順は、ＤＨＴを用いたオーバーレイネットワークの構成方法の一つであるOne Hoopの例であり、ＤＨＴを用いたKey-Value型データ管理システムの他の既存の技術により、全ノード装置１に参加や離脱を通知することができる。 The above-described participation / leaving notification procedure is an example of One Hoop, which is one of the overlay network configuration methods using DHT, and other existing technologies of the key-value type data management system using DHT. Thus, it is possible to notify participation and withdrawal to all the node devices 1.

[１．５．１ノードテーブル管理部の動作]
図６は、本実施形態によるノードテーブル管理部１１のノードテーブル更新処理フローを示す図である。
ノードテーブル管理部１１は、他の負荷分散装置１０からノードの参加の通知あるいは離脱の通知を受信する。参加の通知には、「参加」を示す通知種別、及び、参加したノード装置１のノードアドレスが含まれる。また、離脱の通知には、「離脱」を示す通知種別、及び、離脱したノード装置１のノードアドレスが含まれる。ノードテーブル管理部１１は、参加の通知または離脱の通知に含まれるノードアドレスと、仮想ノード数「０」とをノードＩＤ生成部１３に出力し、ノードＩＤの生成を指示する。ノードＩＤ生成部１３はノードアドレスと仮想ノード数の組み合わせからノードＩＤを生成し、ノードテーブル管理部１１に通知する（ステップＳ１１）。 [1.5.1 Operation of node table manager]
FIG. 6 is a diagram showing a node table update processing flow of the node table management unit 11 according to the present embodiment.
The node table management unit 11 receives a node join notification or a leave notification from another load balancer 10. The notification of participation includes a notification type indicating “participation” and the node address of the participating node apparatus 1. The leave notification includes a notification type indicating “leave” and the node address of the node device 1 that has left. The node table management unit 11 outputs the node address included in the notification of participation or the notification of withdrawal and the virtual node number “0” to the node ID generation unit 13 and instructs generation of the node ID. The node ID generation unit 13 generates a node ID from the combination of the node address and the number of virtual nodes, and notifies the node table management unit 11 (step S11).

続いて、ノードテーブル管理部１１は、通知種別が「参加」であると判断した場合（ステップＳ１２：ＹＥＳ）、記憶部１５に記憶されているノードテーブル２１に、ステップＳ１１において管理範囲長演算部１４から受信したノードＩＤと、参加の通知に設定されているノードアドレスとを対応付けたデータを追加する（ステップＳ１３）。 Subsequently, when the node table management unit 11 determines that the notification type is “participation” (step S12: YES), the node table management unit 11 stores the management range length calculation unit in step S11. Data in which the node ID received from 14 is associated with the node address set in the notification of participation is added (step S13).

一方、ノードテーブル管理部１１は、通知種別が「離脱」であると判断した場合（ステップＳ１２：ＮＯ）、記憶部１５に記憶されているノードテーブル２１から、ステップＳ１１において管理範囲長演算部１４から受信したノードＩＤと、当該ノードＩＤに対応付けられたノードアドレスの組のデータを削除する（ステップＳ１４）。 On the other hand, when the node table management unit 11 determines that the notification type is “leave” (step S12: NO), the management table length calculation unit 14 in step S11 from the node table 21 stored in the storage unit 15. The data of the set of the node ID received from the node ID and the node address associated with the node ID is deleted (step S14).

ステップＳ１３またはステップＳ１４の処理の後、ノードテーブル管理部１１は、仮想ノード登録部１２に実行指示を出力し、更新後のノードテーブル２１に基づくキー検索用ノードテーブルの生成を指示する（ステップＳ１５）。 After the process of step S13 or step S14, the node table management unit 11 outputs an execution instruction to the virtual node registration unit 12, and instructs generation of a key search node table based on the updated node table 21 (step S15). ).

なお、ノードテーブル管理部１１は、ノードテーブル２１に登録されているノードアドレスを宛先として定期的に他の負荷分散装置１０に生存確認信号を送信し、応答がなかった場合、その宛先のノードアドレスが設定されたノードの離脱の通知を受信したものとして、上記処理を行なう。
また、ノードテーブル管理部１１は、離脱の通知を受信した場合、ステップＳ１１の処理を行なわず、ステップＳ１３において、離脱の通知に設定されているノードアドレスと、当該ノードアドレスに対応付けられたノードＩＤの組のデータを削除してもよい。 The node table management unit 11 periodically transmits a survival confirmation signal to another load balancer 10 with the node address registered in the node table 21 as a destination, and when there is no response, the node address of the destination The above processing is performed on the assumption that the notification of leaving of the node for which is set is received.
Further, when the node table management unit 11 receives the leave notification, the node table management unit 11 does not perform the process of step S11, and in step S13, the node address set in the leave notification and the node associated with the node address. The data of the ID set may be deleted.

以上の動作により、ＤＨＴを用いたKey-Value型データ管理システムに参加しているノードを記録したノードテーブル２１が維持される。 With the above operation, the node table 21 in which the nodes participating in the key-value type data management system using DHT are recorded is maintained.

[１．５．２仮想ノード登録部の動作]
図７は、本実施形態による仮想ノード登録部１２のキー検索用ノードテーブル更新処理フローを示す図である。このキー検索用ノードテーブル更新処理により、仮想ノード登録部１２は、ノードテーブル２１からキー検索用ノードテーブル２２を生成する。仮想ノード登録部１２のメモリ１２１には、図示しない入力手段などにより設定された管理範囲しきい値Ｔｈと、最大仮想ノード数Ｖｍａｘが予め記憶される。 [1.5.2 Operation of virtual node registration unit]
FIG. 7 is a diagram showing a key search node table update processing flow of the virtual node registration unit 12 according to the present embodiment. By this key search node table update process, the virtual node registration unit 12 generates a key search node table 22 from the node table 21. In the memory 121 of the virtual node registration unit 12, a management range threshold Th set by an input unit (not shown) and the maximum virtual node number Vmax are stored in advance.

仮想ノード登録部１２は、図６のステップＳ１５においてノードテーブル管理部１１が出力したキー検索用ノードテーブル生成の実行指示を受信する。仮想ノード登録部１２は、記憶部１５に現在記憶されているキー検索用ノードテーブル２２の内容を削除し、ノードテーブル２１の現在の設定内容をキー検索用ノードテーブル２２へコピーする（ステップＳ２１）。このとき、仮想ノード登録部１２は、キー検索用ノードテーブル２２にコピーした各データのノードＩＤ及びノードアドレスの組に対応させて、仮想ノード数のフィールドに「０」を設定する。 The virtual node registration unit 12 receives the instruction to generate the key search node table output from the node table management unit 11 in step S15 of FIG. The virtual node registration unit 12 deletes the contents of the key search node table 22 currently stored in the storage unit 15, and copies the current setting contents of the node table 21 to the key search node table 22 (step S21). . At this time, the virtual node registration unit 12 sets “0” in the field of the number of virtual nodes in association with the set of the node ID and the node address of each data copied to the key search node table 22.

続いて、仮想ノード登録部１２は、生成済みの仮想ノード数のカウンタとして利用するパラメータＮに「０」をセットする（ステップＳ２２）。仮想ノード登録部１２は、パラメータＮがメモリ１２１に記憶されている最大仮想ノード数Ｖｍａｘより小さいか否かを判断する（ステップＳ２３）。仮想ノード登録部１２は、パラメータＮが最大仮想ノード数Ｖｍａｘより小さいと判断した場合（ステップＳ２３：ＮＯ）、ＤＨＴを用いたKey-Value型データ管理システムに参加しているノード装置１の管理範囲長のうち、最も大きい管理範囲長である最大管理範囲長Ｌｍａｘと、最も小さい管理範囲長である最小管理範囲長Ｌｍｉｎを出力するよう管理範囲長演算部１４に指示する（ステップＳ２４）。管理範囲長演算部１４は、キー検索用ノードテーブル２２を参照し、最大管理範囲長Ｌｍａｘ、及び、最小管理範囲長Ｌｍｉｎと、最小管理範囲長Ｌｍｉｎのノード装置１（以下、「最小管理範囲長ノード装置」と記載する。）のノードアドレスと、最小管理範囲長ノード装置の仮想ノード数を返送する。 Subsequently, the virtual node registration unit 12 sets “0” to a parameter N used as a counter of the number of generated virtual nodes (step S22). The virtual node registration unit 12 determines whether or not the parameter N is smaller than the maximum virtual node number Vmax stored in the memory 121 (step S23). When the virtual node registration unit 12 determines that the parameter N is smaller than the maximum virtual node number Vmax (step S23: NO), the management range of the node devices 1 participating in the key-value type data management system using DHT The management range length calculator 14 is instructed to output the maximum management range length Lmax, which is the largest management range length, and the minimum management range length Lmin, which is the smallest management range length (step S24). The management range length calculation unit 14 refers to the key search node table 22, and refers to the node device 1 having the maximum management range length Lmax, the minimum management range length Lmin, and the minimum management range length Lmin (hereinafter, “minimum management range length”). The node address of “node device” and the number of virtual nodes of the minimum management range length node device are returned.

仮想ノード登録部１２は、最大管理範囲長Ｌｍａｘ／最小管理範囲長Ｌｍｉｎが管理範囲しきい値Ｔｈより大きいか否かを判断する（ステップＳ２５）。仮想ノード登録部１２は、最大管理範囲長Ｌｍａｘ／最小管理範囲長Ｌｍｉｎが管理範囲しきい値Ｔｈより大きいと判断した場合（ステップＳ２５：ＹＥＳ）、現時点で管理範囲長が最小である最小管理範囲長ノード装置に仮想ノードを１つ追加する（ステップＳ２６）。具体的には、以下のように動作する。 The virtual node registration unit 12 determines whether or not the maximum management range length Lmax / minimum management range length Lmin is greater than the management range threshold Th (step S25). When the virtual node registration unit 12 determines that the maximum management range length Lmax / minimum management range length Lmin is greater than the management range threshold Th (step S25: YES), the minimum management range having the minimum management range length at the present time One virtual node is added to the long node device (step S26). Specifically, it operates as follows.

仮想ノード登録部１２は、ステップＳ２４において受信した仮想ノード数に１を加算して、追加対象の仮想ノード数を算出する。仮想ノード登録部１２は、最小管理範囲長ノード装置のノードアドレスと、追加対象の仮想ノード数とを管理範囲長演算部１４に出力し、ノードＩＤの生成を指示する。管理範囲長演算部１４はノードアドレスと仮想ノード数の組み合わせからノードＩＤを生成し、仮想ノード登録部１２に通知する。仮想ノード登録部１２は、管理範囲長演算部１４から受信した仮想ノードのノードＩＤ、最小管理範囲長ノード装置のノードアドレス、及び、追加対象の仮想ノード数を対応付けたデータを、キー検索用ノードテーブル２２に追加する。 The virtual node registration unit 12 calculates the number of virtual nodes to be added by adding 1 to the number of virtual nodes received in step S24. The virtual node registration unit 12 outputs the node address of the minimum management range length node device and the number of virtual nodes to be added to the management range length calculation unit 14 and instructs generation of a node ID. The management range length calculation unit 14 generates a node ID from the combination of the node address and the number of virtual nodes, and notifies the virtual node registration unit 12 of the node ID. The virtual node registration unit 12 uses a key search for data in which the node ID of the virtual node received from the management range length calculation unit 14, the node address of the minimum management range length node device, and the number of virtual nodes to be added are associated with each other. Add to the node table 22.

仮想ノードの追加後、仮想ノード登録部１２は、現在のパラメータＮの値に１を加算し（ステップＳ２７）、ステップＳ２３からの処理を繰り返す。
そして、最終的に、ステップＳ２３において、仮想ノード数の合計が最大仮想ノード数Ｖｍａｘに達し、パラメータＮが最大仮想ノード数Ｖｍａｘ以上であると判断した場合（ステップＳ２３：ＹＥＳ）、あるいは、ステップＳ２５において、最大管理範囲長Ｌｍａｘ／最小管理範囲長Ｌｍｉｎが管理範囲しきい値Ｔｈ以下であると判断した場合（ステップＳ２５：ＮＯ）、仮想ノード登録部１２は、処理を終了する。 After adding the virtual node, the virtual node registration unit 12 adds 1 to the current value of the parameter N (step S27), and repeats the processing from step S23.
Finally, when it is determined in step S23 that the total number of virtual nodes has reached the maximum number of virtual nodes Vmax and the parameter N is equal to or greater than the maximum number of virtual nodes Vmax (step S23: YES), or step S25 When it is determined that the maximum management range length Lmax / minimum management range length Lmin is equal to or less than the management range threshold Th (step S25: NO), the virtual node registration unit 12 ends the process.

以上のように、本実施形態では、キーの管理範囲長が最小のノード装置１に対して仮想ノードを追加する処理を繰り返す。これにより、効率的に管理範囲長の不均衡を是正し、各ノード装置１のキーの管理範囲長が同程度になるように仮想ノード数を決定することができる。
なお、管理しきい値Ｔｈが１に近いほど、各ノード装置１において管理されるキーの数が均等に近くなるが、その反面、仮想ノード数が増え、結果としてキー検索用ノードテーブル２２のサイズが増加してしまう。そのため、実験等の結果に基づいて、キー検索用ノードテーブル２２のサイズが大きくなりすぎない範囲で管理しきい値Ｔｈを決定する。また、最大仮想ノード数Ｖｍａｘが大きいほど、キー検索用ノードテーブル２２のサイズが大きくなる可能性がある。そのため、最大仮想ノード数Ｖｍａｘは、記憶部１５として使用されるメモリの容量に基づいて決定される。 As described above, in the present embodiment, the process of adding a virtual node to the node device 1 with the smallest key management range length is repeated. Thereby, the imbalance of the management range length can be corrected efficiently, and the number of virtual nodes can be determined so that the management range length of the key of each node device 1 becomes approximately the same.
Note that the closer the management threshold Th is to 1, the more the number of keys managed in each node device 1 becomes closer, but on the other hand, the number of virtual nodes increases, resulting in the size of the key search node table 22 as a result. Will increase. Therefore, the management threshold value Th is determined in a range in which the size of the key search node table 22 does not become too large based on the results of experiments and the like. Further, the larger the maximum virtual node number Vmax, the larger the size of the key search node table 22 may be. Therefore, the maximum virtual node number Vmax is determined based on the capacity of the memory used as the storage unit 15.

本実施形態では、ステップＳ２５において、最大管理範囲長Ｌｍａｘ／最小管理範囲長Ｌｍｉｎと管理範囲のしきい値Ｔｈとを比較し、処理が終了したかを判定しているが、これに限定されるものではない。例えば、最大管理範囲長Ｌｍａｘと所定のしきい値を比較してもよく、最小管理範囲長Ｌｍｉｎと所定のしきい値を比較してもよい。あるいは、キー検索用ノードテーブル２２に登録されている全てのノード装置１及び仮想ノードの管理範囲長の標準偏差や分散と所定のしきい値を比較してもよい。 In the present embodiment, in step S25, the maximum management range length Lmax / minimum management range length Lmin is compared with the threshold value Th of the management range, and it is determined whether the processing has been completed. It is not a thing. For example, the maximum management range length Lmax may be compared with a predetermined threshold value, or the minimum management range length Lmin may be compared with a predetermined threshold value. Alternatively, the standard deviation or variance of the management range lengths of all the node devices 1 and virtual nodes registered in the key search node table 22 may be compared with a predetermined threshold value.

[１．５．３ノードＩＤ生成部の動作]
次に、本実施形態によるノードＩＤ生成部１３のノードＩＤ生成処理を説明する。
ノードテーブル管理部１１または仮想ノード登録部１２からノードアドレス及び仮想ノード数を受信すると、ノードＩＤ生成部１３は、ノードアドレスと仮想ノード数とを結合したデータにハッシュ関数を用いてノードＩＤを生成する。仮想ノード数は０からはじまる整数である。本実施形態では、ハッシュ関数としてＳＨＡ−１（Secure Hash Algorithm 1）を用いる。従って、ノードＩＤ生成部１３は、ノードＩＤとして１６０ビットの値を生成する。例えば、ノードアドレスが「192.168.100.10:3939」であり、仮想ノード数が「3」である場合、ノードＩＤ生成部１３は、ノードアドレスと仮想ノード数を結合したデータ「192.168.100.10:3939_3」を生成する。ＳＨＡ-１のハッシュ関数をHash()とすると、ノードＩＤ生成部１３は、Hash(“192.168.100.10:3939_3”)によりノードＩＤを算出する。ノードＩＤ生成部１３は、ノードアドレス及び仮想ノード数の出力元であるノードテーブル管理部１１または仮想ノード登録部１２に、算出したノードＩＤを返送する。 [1.5.3 Operation of node ID generator]
Next, node ID generation processing of the node ID generation unit 13 according to the present embodiment will be described.
Upon receipt of the node address and the number of virtual nodes from the node table management unit 11 or the virtual node registration unit 12, the node ID generation unit 13 generates a node ID by using a hash function for data obtained by combining the node address and the number of virtual nodes. To do. The number of virtual nodes is an integer starting from 0. In the present embodiment, SHA-1 (Secure Hash Algorithm 1) is used as a hash function. Therefore, the node ID generation unit 13 generates a 160-bit value as the node ID. For example, when the node address is “192.168.100.10:3939” and the number of virtual nodes is “3”, the node ID generation unit 13 uses the data “192.168.100.10:3939_3” that combines the node address and the number of virtual nodes. Is generated. If the hash function of SHA-1 is Hash (), the node ID generation unit 13 calculates the node ID using Hash (“192.168.100.10:3939_3”). The node ID generation unit 13 returns the calculated node ID to the node table management unit 11 or the virtual node registration unit 12 that is the output source of the node address and the number of virtual nodes.

[１．５．４管理範囲長演算部の動作]
次に、本実施形態による管理範囲長演算部１４の管理範囲長算出処理を説明する。管理範囲長演算部１４は、管理範囲長算出処理において、オーバーレイネットワークに参加している各ノードのハッシュ空間上での管理範囲長を計算する。 [1.5.4 Operation of control range length calculation unit]
Next, the management range length calculation process of the management range length calculation unit 14 according to the present embodiment will be described. The management range length calculation unit 14 calculates the management range length on the hash space of each node participating in the overlay network in the management range length calculation process.

図８は、管理範囲長演算部１４の管理範囲長算出処理を説明するための図である。上述したように、本実施形態では１６０ビット長のノードＩＤを利用しているが、説明を簡単にするため、同図においては、ノードＩＤを８ビット長として説明する。ノードＩＤが８ビット長であるため、ノードＩＤは０以上かつ２５５以下の範囲の整数値となる。 FIG. 8 is a diagram for explaining the management range length calculation processing of the management range length calculation unit 14. As described above, a node ID having a 160-bit length is used in the present embodiment. However, in order to simplify the description, the node ID is described as having an 8-bit length in FIG. Since the node ID is 8 bits long, the node ID is an integer value in the range of 0 to 255.

図８（ａ）は、オーバーレイネットワークを示す図であり、図８（ｂ）は、図８（ａ）に示すオーバーレイネットワークが形成されているときのキー検索用ノードテーブル２２を示す。図８（ａ）及び図８（ｂ）に示す例においては、３つのノード装置１がオーバーレイネットワークに参加している状態を示しており、これらの３つのノード装置１を大文字のＡ，Ｂ，Ｃを用いてそれぞれノードＡ、ノードＢ、ノードＥとする。また、ノードＡ、ノードＢ、ノードＥそれぞれの仮想ノードを、小文字のａ、ｂ、ｃを用いてノードａ、ノードｂ、ノードｅとする。
以下、ノードＩＤ＝ｉのノードｘをノードｘ[ｉ]と記載する。 FIG. 8A shows the overlay network, and FIG. 8B shows the key search node table 22 when the overlay network shown in FIG. 8A is formed. 8A and 8B show a state in which three node devices 1 are participating in the overlay network, and these three node devices 1 are represented by capital letters A, B, Let C be node A, node B, and node E, respectively. In addition, the virtual nodes of the node A, the node B, and the node E are defined as a node a, a node b, and a node e by using lowercase letters a, b, and c.
Hereinafter, node x with node ID = i is referred to as node x [i].

図８（ｃ）は、各ノード装置１の管理範囲長を示す図である。各ノードのキーの管理範囲は、時計回りに反対方向の隣接ノードであるpredecessorとの距離である。例えば、ノードＥ[９０]のpredecessorは、ノードｂ[７０]である。従って、ノードＥ単独のキー管理範囲は「７１〜９０」であり、管理範囲長は「２０」となる。また、ノードＥには、ノードｅ[６０]、ノードｅ[１９０]の２つの仮想ノードがある。ノードｅ[１９０]のpredecessorは、ノードａ[１６０]であるため、キー管理範囲は「１６１〜１９０」であり、管理範囲長は「３０」となる。また、ノードｅ[６０]のpredecessorは、ノードａ[１０]であるため、キー管理範囲は「１１〜６０」であり、管理範囲長は「５０」となる。よって、ノードＥ全体の管理範囲長は、２０＋３０＋５０＝１００である。 FIG. 8C is a diagram showing the management range length of each node device 1. The key management range of each node is a distance from a predecessor that is an adjacent node in the opposite direction in the clockwise direction. For example, the predecessor of the node E [90] is the node b [70]. Accordingly, the key management range of the node E alone is “71 to 90”, and the management range length is “20”. In addition, node E has two virtual nodes, node e [60] and node e [190]. Since the predecessor of the node e [190] is the node a [160], the key management range is “161 to 190”, and the management range length is “30”. Further, since the predecessor of the node e [60] is the node a [10], the key management range is “11 to 60”, and the management range length is “50”. Therefore, the management range length of the entire node E is 20 + 30 + 50 = 100.

同様にして、ノードＡ全体の管理範囲長は、ノードＡ[１０]単独のキー管理範囲「２４１〜２５５、０〜１０」の管理範囲長「２６」と、ノードａ[１６０]単独のキー管理範囲「１４１〜１６０」の管理範囲長「２０」を合計した「４６」となる。
また、ノードＢ全体の管理範囲長は、ノードＢ[１４０]単独のキー管理範囲「９１〜１４０」の管理範囲長「５０」と、ノードｂ[２４０]単独のキー管理範囲「１９１〜２４０」の管理範囲長「５０」と、ノードｂ[７０]単独のキー管理範囲「６１〜７０」の管理範囲長「１０」を合計した「１１０」である。 Similarly, the management range length of the entire node A is the management range length “26” of the key management range “241 to 255, 0 to 10” of the node A [10] alone and the key management range of the node a [160] alone. The total of the management range length “20” in the range “141 to 160” is “46”.
Further, the management range length of the entire node B is the management range length “50” of the key management range “91 to 140” of the node B [140] alone and the key management range “191 to 240” of the node b [240] alone. And the management range length “10” of the key management range “61 to 70” of the node b [70] alone is “110”.

このように、各ノード装置１の合計の管理範囲長は、当該ノード装置１のノード単独の管理範囲長と、当該ノード装置１について生成された各仮想ノード単独の管理範囲長の合計となる。結果として、管理範囲長演算部１４は、最大管理範囲長ＬｍａｘとしてノードＢの管理範囲長「１１０」を、最小管理範囲長ＬｍｉｎとしてノードＡの管理範囲長「４６」を出力する。 Thus, the total management range length of each node device 1 is the sum of the management range length of the node alone of the node device 1 and the management range length of each virtual node generated for the node device 1. As a result, the management range length calculation unit 14 outputs the management range length “110” of the node B as the maximum management range length Lmax and the management range length “46” of the node A as the minimum management range length Lmin.

上記の動作を行うため、管理範囲長演算部１４は、キー検索用ノードテーブル２２を参照し、全ノード装置１の管理範囲長を計算する。具体的には、以下のように動作する。
管理範囲長演算部１４は、キー検索用ノードテーブル２２に設定されているノードＩＤに順に注目していき、注目しているノードＩＤと、注目しているノードＩＤよりも小さい隣接するノードＩＤとの差から、注目しているノードＩＤ単独の管理範囲長を求める。 In order to perform the above operation, the management range length calculation unit 14 refers to the key search node table 22 and calculates the management range length of all the node devices 1. Specifically, it operates as follows.
The management range length computing unit 14 pays attention to the node IDs set in the key search node table 22 in order, and the node ID of interest and the adjacent node ID smaller than the node ID of interest From the difference, the management range length of the node ID of interest is obtained.

キー検索用ノードテーブル２２に注目しているノードＩＤよりも小さい隣接するノードＩＤが設定されていない場合、その注目しているノードＩＤは、オーバーレイネットワークに参加している中で最も小さいノードＩＤである。従って、predecessorのノードＩＤは、キー検索用ノードテーブル２２に登録されている中でも最も大きいノードＩＤである。管理範囲長演算部１４は、注目しているノードＩＤ（例えば、図８（ｂ）に示すノードＩＤ＝１０）に、ノードＩＤが取りうる最大の値（ノードＩＤが8ビット長である場合「２５５」）を加算した後、predecessorのノードＩＤ（図８（ｂ）に示すノードＩＤ＝２４０）を減算することによって管理範囲長を算出する。 If an adjacent node ID smaller than the node ID of interest is not set in the key search node table 22, the node ID of interest is the smallest node ID participating in the overlay network. is there. Therefore, the node ID of predecessor is the largest node ID among those registered in the key search node table 22. The management range length calculation unit 14 adds the maximum value that the node ID can take to the node ID of interest (for example, node ID = 10 shown in FIG. 8B) (when the node ID is 8 bits long). 255 ") is added, and the management range length is calculated by subtracting the node ID of the predecessor (node ID = 240 shown in FIG. 8B).

管理範囲長演算部１４は、キー検索用ノードテーブル２２に登録されている各ノードＩＤについて管理範囲長を算出した後、キー検索用ノードテーブル２２に同じノードアドレスが対応付けられているノードＩＤについて算出した管理範囲長を合計する。これによって、各ノード装置１の管理範囲長が算出される。管理範囲長演算部１４は、算出した管理範囲長のうち最大である最大管理範囲長Ｌｍａｘ、最小である最小管理範囲長Ｌｍｉｎを選択する。管理範囲長演算部１４は、最小管理範囲長Ｌｍｉｎに対応したノードアドレスと対応付けてキー検索用ノードテーブル２２に設定されている仮想ノード数の中から、最も大きい仮想ノード数を読み出す。管理範囲長演算部１４は、最大管理範囲長Ｌｍａｘ、最小管理範囲長Ｌｍｉｎ、最小管理範囲長ノード装置のノードアドレス、読み出した仮想ノード数を仮想ノード登録部１２に返送する。 The management range length calculation unit 14 calculates the management range length for each node ID registered in the key search node table 22, and then calculates the node ID associated with the same node address in the key search node table 22. Total the calculated management range lengths. Thereby, the management range length of each node device 1 is calculated. The management range length calculator 14 selects the maximum maximum management range length Lmax and the minimum minimum management range length Lmin among the calculated management range lengths. The management range length calculation unit 14 reads the largest number of virtual nodes from the number of virtual nodes set in the key search node table 22 in association with the node address corresponding to the minimum management range length Lmin. The management range length calculation unit 14 returns the maximum management range length Lmax, the minimum management range length Lmin, the node address of the minimum management range length node device, and the number of read virtual nodes to the virtual node registration unit 12.

[１．５．５キーテーブル処理部の動作]
続いて、キーテーブル処理部の動作例について説明する。
いずれかのノード装置１がデータ記憶部３０に新たなデータを記憶すると、キーテーブル処理部１６は、この新たなデータのデータ名からＳＨＡ−１ハッシュ関数によりデータＩＤを生成する。キーテーブル処理部１６は、データ記憶部３０に保存されているキー検索用ノードテーブル２２から、生成したデータＩＤに時計回りに近いノードＩＤを検索し、そのノードＩＤに対応したノードアドレスを取得する。キーテーブル処理部１６は、取得したノードアドレスを宛先として、データＩＤと自ノード装置１のノードアドレスを設定したキー登録要求を送信する。キー登録要求を受信したノード装置１において、負荷分散装置１０のキーテーブル処理部１６は、キー登録要求内のデータＩＤとノードアドレスの組をデータ記憶部３０に記憶しているキーテーブル２３に追加する。 [1.5.5 Operation of key table processing unit]
Subsequently, an operation example of the key table processing unit will be described.
When any one of the node devices 1 stores new data in the data storage unit 30, the key table processing unit 16 generates a data ID from the data name of the new data using a SHA-1 hash function. The key table processing unit 16 searches for a node ID close to the generated data ID in the clockwise direction from the key search node table 22 stored in the data storage unit 30, and acquires a node address corresponding to the node ID. . The key table processing unit 16 transmits a key registration request in which the acquired node address is set as the destination and the data ID and the node address of the own node device 1 are set. In the node device 1 that has received the key registration request, the key table processing unit 16 of the load distribution device 10 adds the combination of the data ID and the node address in the key registration request to the key table 23 stored in the data storage unit 30. To do.

いずれかのノード装置１がデータにアクセスする場合、負荷分散装置１０のキーテーブル処理部１６は、そのデータのデータ名からＩＤを生成する。キーテーブル処理部１６は、データ記憶部３０に保存されているキー検索用ノードテーブル２２から、生成したデータＩＤに時計回りに近いノードＩＤを検索し、そのノードＩＤに対応したノードアドレスを取得する。キーテーブル処理部１６は、取得したノードアドレスを宛先として、データＩＤをキーとして設定した問合せを送信する。問合せを受信したノード装置１において、負荷分散装置１０のキーテーブル処理部１６は、データ記憶部３０に記憶されているキーテーブル２３から、問合せに設定されているキーに対応した値を読み出して返送する。この値は、データを記憶しているノード装置１のノードアドレスを示す。問合せ送信元のノード装置１は、受信した「値」により示されるノードアドレス宛先として、データを記憶しているノード装置１にアクセスし、アクセス先のノード装置１が備えるデータ記憶部３０に記憶されている操作対象のデータの読み出し、書替え、削除などを行なう。 When any one of the node devices 1 accesses data, the key table processing unit 16 of the load distribution device 10 generates an ID from the data name of the data. The key table processing unit 16 searches for a node ID close to the generated data ID in the clockwise direction from the key search node table 22 stored in the data storage unit 30, and acquires a node address corresponding to the node ID. . The key table processing unit 16 transmits an inquiry in which the acquired node address is set as the destination and the data ID is set as the key. In the node device 1 that has received the inquiry, the key table processing unit 16 of the load distribution device 10 reads out a value corresponding to the key set in the inquiry from the key table 23 stored in the data storage unit 30 and returns it. To do. This value indicates the node address of the node device 1 storing the data. The node device 1 as the inquiry transmission source accesses the node device 1 storing the data as the node address destination indicated by the received “value”, and is stored in the data storage unit 30 provided in the node device 1 as the access destination. Reading, rewriting, deleting, etc. of the operation target data.

上記においては、キーがデータを特定するデータＩＤであり、値がデータを記憶しているノード装置１のノードアドレスである場合の例を説明したが、ＤＨＴを用いたKey-Value型データ管理システムの既存の技術を用い、他のキーとデータの組を分散して管理することもできる。 In the above description, an example in which the key is a data ID for identifying data and the value is the node address of the node device 1 storing the data has been described. However, a key-value type data management system using DHT has been described. Other key and data sets can be distributed and managed using existing technology.

[１．６性能]
図９は、本実施形態による負荷分散装置１０を使った実験結果を示す。図９（ａ）は、データ管理システムに参加しているノード装置１の数と、全ノード装置１の管理範囲長の標準偏差との関連を示し、図９（ｂ）は、データ管理システムに参加しているノード装置１の数と、キー検索用ノードテーブルのサイズとの関係を示す。
実験では、管理範囲しきい値Ｔｈ=１．５、最大仮想ノード数Ｖｍａｘ=１００００とし、ノード装置１の数を３〜１０００の範囲とした。なお、比較として、全ノード装置それぞれに０、１０、１００個の固定数の仮想ノードを割り当てる方式を用いた場合における全ノード装置の管理範囲長の標準偏差と、キー検索用ノードテーブルのサイズを示している。 [1.6 Performance]
FIG. 9 shows experimental results using the load distribution apparatus 10 according to the present embodiment. FIG. 9A shows the relationship between the number of node devices 1 participating in the data management system and the standard deviation of the management range length of all node devices 1, and FIG. The relationship between the number of participating node devices 1 and the size of the key search node table is shown.
In the experiment, the management range threshold value Th = 1.5, the maximum number of virtual nodes Vmax = 10000, and the number of node devices 1 were in the range of 3 to 1000. As a comparison, the standard deviation of the management range length of all the node devices and the size of the key search node table when using a method in which 0, 10, 100 virtual nodes are assigned to all the node devices are used. Show.

図９（ａ）に示す管理範囲長の標準偏差の結果から、本実施形態では、データ管理システムに参加している各ノード装置１の負荷分散を行なう場合に、１００個の固定数の仮想ノードを割り当てる方式と同等の性能が得られることがわかる。一方、図９（ｂ）に示すの結果から、本実施形態の負荷分散装置１０を用いたときのキー検索用ノードテーブル２２のサイズは、各ノード装置に１０個の固定数の仮想ノードを割り当てる方式において用いられるキー検索用ノードテーブルのサイズと同程度であることがわかる。
これらの実験結果から、本実施形態によれば、各ノード装置に固定数１００の仮想ノードを割り当てるの場合と同程度の性能を、１０分の１のサイズのキー検索用ノードテーブルによりで実現できることがわかる。 From the result of the standard deviation of the management range length shown in FIG. 9A, in the present embodiment, when performing load distribution of each node device 1 participating in the data management system, 100 fixed number of virtual nodes It can be seen that the performance equivalent to the method of assigning is obtained. On the other hand, from the result shown in FIG. 9B, the size of the key search node table 22 when the load distribution apparatus 10 of this embodiment is used is that 10 fixed virtual nodes are allocated to each node apparatus. It can be seen that the size is the same as the size of the key search node table used in the system.
From these experimental results, according to the present embodiment, it is possible to realize a performance equivalent to the case of allocating a fixed number of 100 virtual nodes to each node device by using a key search node table of 1/10 size. I understand.

[２．第２の実施形態]
上述した第１の実施形態において、管理範囲長演算部１４は、キー検索用ノードテーブル２２に登録されている全てのノードの管理範囲長を求めた後、各ノード装置１が管理しているキーの管理範囲長を算出しているが、算出方法はこれに限定されるものではない。本実施形態では、各ノードが管理しているキーの管理範囲長を高速に算出する。
以下では、第１の実施形態との差分を説明する。 [2. Second Embodiment]
In the first embodiment described above, the management range length calculation unit 14 obtains the management range lengths of all the nodes registered in the key search node table 22, and then manages the keys managed by each node device 1. However, the calculation method is not limited to this. In this embodiment, the key management range length managed by each node is calculated at high speed.
Hereinafter, differences from the first embodiment will be described.

図１０は、本実施形態による管理範囲長演算部１４の処理を説明するための図である。
図１０（ａ）は、図７のステップＳ２６の処理によって、図８（ａ）に示すオーバーレイネットワークにノードＡの仮想ノードであるノードａ[２３０]が追加された場合の例を示している。また、図１０（ｂ）は、ノードａ[２３０]の追加後の各ノード装置１の管理範囲長を示している。 FIG. 10 is a diagram for explaining the processing of the management range length calculation unit 14 according to the present embodiment.
FIG. 10A illustrates an example in which the node a [230], which is the virtual node of the node A, is added to the overlay network illustrated in FIG. 8A by the process of step S26 of FIG. FIG. 10B shows the management range length of each node device 1 after the addition of the node a [230].

図１０（ａ）に示すノードａ[２３０]の追加により、ノードＡの管理範囲長には、追加されたノードａ[２３０]とそのpredecessorのノードｅ[１９０]との距離である「４０」が追加される。一方、追加されたノードａ[２３０]のsuccessor（時計回り方向の隣接ノード）のノードｂ[２４０]の管理範囲長は「４０」だけ短くなる。この例では、追加されたノードａ[２３０]のsuccessorであるノードｂ[２４０]は、ノードＢの仮想ノードであるので、ノードＢの管理範囲長を「４０」だけ短くする。
結果として、図１０（ｂ）に示すように、管理範囲長の演算は、ノードＡの管理範囲長に「４０」を加算し、ノードＢの管理範囲長から「４０」を減算するだけで求めることが可能である。 As a result of the addition of the node a [230] shown in FIG. 10A, the management range length of the node A is “40” which is the distance between the added node a [230] and the node e [190] of the predecessor. Is added. On the other hand, the management range length of the node b [240] of the successor (the adjacent node in the clockwise direction) of the added node a [230] is shortened by “40”. In this example, since the node b [240], which is the successor of the added node a [230], is a virtual node of the node B, the management range length of the node B is shortened by “40”.
As a result, as shown in FIG. 10B, the calculation of the management range length is obtained simply by adding “40” to the management range length of node A and subtracting “40” from the management range length of node B. It is possible.

以上の動作を行うため、管理範囲長演算部１４は、図７のステップＳ２４において各ノードアドレスについて算出した管理範囲長を記憶部１５に書き込む。そして、ステップＳ２６において仮想ノードが追加され、次のループでステップＳ２４を実行する際、仮想ノード登録部１２からステップＳ２６において追加された仮想ノードのノードＩＤを受信する。追加されたノードＩＤを追加ノードＩＤと記載する。 In order to perform the above operation, the management range length calculation unit 14 writes the management range length calculated for each node address in step S24 of FIG. Then, a virtual node is added in step S26, and when executing step S24 in the next loop, the node ID of the virtual node added in step S26 is received from the virtual node registration unit 12. The added node ID is described as an additional node ID.

管理範囲長演算部１４は、キー検索用ノードテーブル２２から追加ノードＩＤよりも小さい隣接するノードＩＤとの差から、追加された仮想ノードの管理範囲長を求めるとともに、追加ノードＩＤに対応したノードアドレスを読み出す。管理範囲長演算部１４は、読み出したノードアドレスに対応して記憶部１５に記憶されている管理範囲長を、追加された仮想ノードの管理範囲長を加算した値に更新する。 The management range length calculation unit 14 obtains the management range length of the added virtual node from the difference from the adjacent node ID smaller than the additional node ID from the key search node table 22, and the node corresponding to the additional node ID Read the address. The management range length calculation unit 14 updates the management range length stored in the storage unit 15 corresponding to the read node address to a value obtained by adding the management range length of the added virtual node.

さらに、管理範囲長演算部１４は、キー検索用ノードテーブル２２から追加ノードＩＤよりも大きい隣接するノードＩＤを特定し、特定したノードＩＤに対応したノードアドレスを読み出す。管理範囲長演算部１４は、読み出したノードアドレスに対応して記憶部１５に記憶されている管理範囲長を、追加された仮想ノードの管理範囲長を減算した値に更新する。 Further, the management range length calculation unit 14 specifies an adjacent node ID larger than the additional node ID from the key search node table 22, and reads a node address corresponding to the specified node ID. The management range length calculation unit 14 updates the management range length stored in the storage unit 15 corresponding to the read node address to a value obtained by subtracting the management range length of the added virtual node.

上記処理の後、管理範囲長演算部１４は、仮想ノード追加後の各ノード装置１のアドレスに対応した管理範囲長を記憶部１５から読み出す。 After the above processing, the management range length calculation unit 14 reads the management range length corresponding to the address of each node device 1 after the addition of the virtual node from the storage unit 15.

[３．第３の実施形態]
第３の実施形態には、各ノード装置１のノード性能が異なる場合に適用する。つまり、ＤＨＴに参加する各ノード装置１の性能が同一ではない場合、それらのノード性能に応じて各負荷分散装置１０が管理及び保管する（キー、値）のデータ量を制御する。以下、第１の実施形態及び第２の実施形態との差分を説明する。 [3. Third Embodiment]
The third embodiment is applied when the node performance of each node device 1 is different. That is, when the performance of the node devices 1 participating in the DHT is not the same, the data amount managed (stored) (key, value) by each load distribution device 10 is controlled according to the node performance. Hereinafter, the difference between the first embodiment and the second embodiment will be described.

[３．１データ構成]
図１１は、本実施形態のノードテーブル２１のデータ構成例を示す図である。同図に示すノードテーブル２１が、図３に示すノードテーブル２１と異なる点は、ノードＩＤ及びノードアドレスに対応付けて、ノード性能値が設定される点である。ノード性能値は、ノードの性能を示す指標についての定量的な値であり、例えば、ＣＰＵ（central processing unit）の性能を示す値や、ノードが保管できる総ディスク容量などがある。 [3.1 Data structure]
FIG. 11 is a diagram illustrating a data configuration example of the node table 21 according to the present embodiment. The node table 21 shown in the figure is different from the node table 21 shown in FIG. 3 in that node performance values are set in association with node IDs and node addresses. The node performance value is a quantitative value for an index indicating the performance of the node, and includes, for example, a value indicating the performance of a CPU (central processing unit) and the total disk capacity that can be stored in the node.

図１２は、本実施形態のキー検索用ノードテーブル２２のデータ構成例を示す図である。同図に示すキー検索用ノードテーブル２２が、図４に示すキー検索用ノードテーブル２２と異なる点は、ノードＩＤ、ノードアドレス、及び、仮想ノード数に対応付けて、ノード性能値が設定される点である。 FIG. 12 is a diagram illustrating a data configuration example of the key search node table 22 according to the present embodiment. The key search node table 22 shown in the figure is different from the key search node table 22 shown in FIG. 4 in that a node performance value is set in association with the node ID, the node address, and the number of virtual nodes. Is a point.

[３．２負荷分散装置の動作]
次に、負荷分散装置１０の動作について説明する。 [3.2 Operation of load balancer]
Next, the operation of the load distribution apparatus 10 will be described.

[３．２．１ノードテーブル管理部の動作]
図６のステップＳ１１において、ノードテーブル管理部１１が受信するノードの参加の通知には、通知種別「参加」、参加したノード装置１のノードアドレス、及び、参加したノード装置１の性能値が含まれる。ステップＳ１３において、ノードテーブル管理部１１は、記憶部１５に記憶されているノードテーブル２１に、ステップＳ１１において管理範囲長演算部１４から受信したノードＩＤと、参加の通知に含まれるノードアドレス及びノード性能値とを対応付けたデータを新たに追加する。また、ステップＳ１４において、ノードテーブル管理部１１は、記憶部１５に記憶されているノードテーブル２１から、ステップＳ１１においてノードＩＤ生成部１３から受信したノードＩＤと、当該ノードＩＤに対応付けられたノードアドレス及び性能値の組のデータを削除する。 [3.2.1 Operation of node table management unit]
In step S11 of FIG. 6, the node participation notification received by the node table management unit 11 includes the notification type “participation”, the node address of the participating node device 1, and the performance value of the participating node device 1. It is. In step S13, the node table management unit 11 adds to the node table 21 stored in the storage unit 15 the node ID received from the management range length calculation unit 14 in step S11, and the node address and node included in the notification of participation. Data that is associated with the performance value is newly added. In step S14, the node table management unit 11 also receives the node ID received from the node ID generation unit 13 in step S11 from the node table 21 stored in the storage unit 15, and the node associated with the node ID. Delete the data of the address / performance value pair.

[３．２．２仮想ノード登録部の動作]
図７のステップＳ２１において、仮想ノード登録部１２が、ノードテーブル２１の設定内容をキー検索用ノードテーブル２２にコピーする際、ノードＩＤ、ノードアドレス及び性能値の組がそのままコピーされる。また、ステップＳ２６において、仮想ノード登録部１２は、仮想ノードのノードＩＤ、最小管理範囲長ノード装置のノードアドレス及び性能値に、追加対象の仮想ノード数を対応付けたデータを、キー検索用ノードテーブル２２に追加する。 [3.2.2 Operation of virtual node registration unit]
In step S21 of FIG. 7, when the virtual node registration unit 12 copies the setting contents of the node table 21 to the key search node table 22, the set of the node ID, the node address, and the performance value is copied as it is. Further, in step S26, the virtual node registration unit 12 uses the key search node to store data in which the number of virtual nodes to be added is associated with the node ID of the virtual node, the node address of the minimum management range length node device, and the performance value. Add to table 22.

[３．２．３管理範囲長演算部の動作]
本実施形態による管理範囲長演算部１４は、上述した第１の実施形態、または、第２の実施形態のように算出した各ノード装置１の管理範囲長に、当該ノード装置１の性能値に応じて重み付けを行う。 [3.2.3 Operation of control range length calculation unit]
The management range length calculation unit 14 according to the present embodiment sets the management range length of each node device 1 calculated as in the first embodiment or the second embodiment described above to the performance value of the node device 1. Weighting is performed accordingly.

管理範囲長演算部１４は、第１の実施形態または第２の実施形態と同様に、各ノード装置１のノードアドレス毎に管理範囲長を算出する。管理範囲長演算部１４は、各ノードアドレスに対応付けてノードテーブル２１またはキー検索用ノードテーブル２２に設定されている性能値を読み出すと、ノードアドレス毎に算出した管理範囲長を、そのノードアドレスについて読み出した性能値によって除算する。管理範囲長演算部１４は、除算した結果の管理範囲長の中で最大の管理範囲長を最大管理範囲長Ｌｍａｘ、最小の管理範囲長を最小管理範囲長Ｌｍｉｎとして選択する。管理範囲長演算部１４は、最小管理範囲長Ｌｍｉｎに対応したノードアドレスと対応付けてキー検索用ノードテーブル２２に設定されている仮想ノード数の中から、最も大きい仮想ノード数を読み出す。管理範囲長演算部１４は、最大管理範囲長Ｌｍａｘ、最小管理範囲長Ｌｍｉｎ、最小管理範囲長ノード装置のノードアドレス、読み出した仮想ノード数をノードテーブル管理部１１に返送する。 The management range length calculation unit 14 calculates the management range length for each node address of each node device 1 as in the first embodiment or the second embodiment. When the management range length calculation unit 14 reads the performance value set in the node table 21 or the key search node table 22 in association with each node address, the management range length calculation unit 14 calculates the management range length calculated for each node address. Divide by the read performance value. The management range length computing unit 14 selects the maximum management range length as the maximum management range length Lmax and the minimum management range length as the minimum management range length Lmin among the management range lengths as a result of the division. The management range length calculation unit 14 reads the largest number of virtual nodes from the number of virtual nodes set in the key search node table 22 in association with the node address corresponding to the minimum management range length Lmin. The management range length calculation unit 14 returns the maximum management range length Lmax, the minimum management range length Lmin, the node address of the minimum management range length node device, and the number of read virtual nodes to the node table management unit 11.

図１３は、ノード性能を考慮した場合の管理範囲長演算を説明するための図である。オーバーレイネットワークは、図８（ａ）と同様に構成されており、ノードＡ、ノードＢ、ノードＣについて第１の実施形態と同様の処理により算出された管理範囲長はそれぞれ「４６」、「１１０」、「１００」である。また、ノードＡ、ノードＢ、ノードＣのノード性能値はそれぞれ「１」、「２」、「３」である。この場合、ノードＡの管理範囲長は、ノードＡの管理範囲長「４６」を、ノードＡのノード性能値「１」によって除算した値「４６」となる。また、ノードＢの管理範囲長は、ノードＢの管理範囲長「１１０」をノードＢのノード性能値「２」によって除算した値「５５」となる。また、ノードＥの管理範囲長は、ノードＥの管理範囲長「１００」をノードＥのノード性能値「３」によって除算した値「３３」となる。従って、最大管理範囲長Ｌｍａｘは、ノードＢの管理範囲長「５５」となり、最小管理範囲長Ｌｍｉｎは、ノードＥの最小管理範囲長ノード装置の管理範囲長となる。
この結果、性能が高いノードＥの仮想ノードを増加させることになる。 FIG. 13 is a diagram for explaining the management range length calculation in consideration of node performance. The overlay network is configured in the same manner as in FIG. 8A, and the management range lengths calculated by the same processing as in the first embodiment for the nodes A, B, and C are “46” and “110”, respectively. ”,“ 100 ”. The node performance values of the node A, the node B, and the node C are “1”, “2”, and “3”, respectively. In this case, the management range length of the node A is a value “46” obtained by dividing the management range length “46” of the node A by the node performance value “1” of the node A. The management range length of the node B is a value “55” obtained by dividing the management range length “110” of the node B by the node performance value “2” of the node B. The management range length of the node E is a value “33” obtained by dividing the management range length “100” of the node E by the node performance value “3” of the node E. Therefore, the maximum management range length Lmax is the management range length “55” of the node B, and the minimum management range length Lmin is the management range length of the minimum management range length node device of the node E.
As a result, the number of virtual nodes of node E with high performance is increased.

[４．本発明の実施形態の効果]
以上説明したように、本発明の実施形態によれば、ＤＨＴを用いたKey-Value型データ管理システムを構成するノード装置に実装される負荷分散装置は、システムを構成しているノード装置の参加や離脱が発生した際、各ノード装置の管理範囲長が同等に近くなるまで、管理範囲長が最も狭いノード装置へ仮想ノードの割り当てを行う処理を繰り返し、ノード装置のアドレスと当該ノード装置に対して生成された仮想ノードのノードＩＤとからなるキー検索用ノードテーブルを生成する。よって、従来技術と比較して各ノード装置のキーの管理範囲長の不均衡を効率的に是正し、仮想ノードの数を抑えることが可能となるため、キー検索用ノードテーブルの増加が少ない。
また、各ノード装置を均等に負荷分散したり、各ノード装置のノード性能に応じて負荷分散したりすることが可能となる。 [4. Effects of the embodiment of the present invention]
As described above, according to the embodiment of the present invention, the load balancer installed in the node device constituting the key-value type data management system using DHT is the participation of the node device constituting the system. Or when a node is disconnected, the process of assigning a virtual node to the node device with the narrowest management range length is repeated until the management range length of each node device is nearly equal. A key search node table including the node ID of the generated virtual node is generated. Therefore, compared with the prior art, it is possible to efficiently correct the imbalance of the key management range length of each node device and to reduce the number of virtual nodes, and therefore the increase in the key search node table is small.
In addition, it becomes possible to distribute the load evenly among the node devices, or to distribute the load according to the node performance of each node device.

［５．その他］
なお、上述した実施形態では、各負荷分散装置１０においてノードテーブル２１及びキー検索用ノードテーブル２２を生成しているが、一部の負荷分散装置１０においてノードテーブル２１及びキー検索用ノードテーブル２２を生成し、他の負荷分散装置１０に配信してもよい。 [5. Others]
In the above-described embodiment, the node table 21 and the key search node table 22 are generated in each load balancer 10. However, in some load balancers 10, the node table 21 and the key search node table 22 are stored. It may be generated and distributed to other load balancers 10.

上述した負荷分散装置１０を実装するノード装置１は、内部にコンピュータシステムを有している。そして、負荷分散装置１０の動作の過程は、プログラムの形式でコンピュータ読み取り可能な記録媒体に記憶されており、このプログラムをコンピュータシステムが読み出して実行することによって、上記処理が行われる。ここでいうコンピュータシステムとは、ＣＰＵ及び各種メモリやＯＳ、周辺機器等のハードウェアを含むものである。 The node device 1 in which the load distribution device 10 described above is mounted has a computer system therein. The process of the operation of the load balancer 10 is stored in a computer-readable recording medium in the form of a program, and the above processing is performed by the computer system reading and executing this program. The computer system here includes a CPU, various memories, an OS, and hardware such as peripheral devices.

また、「コンピュータシステム」は、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。
また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶部のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバーやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含むものとする。また上記プログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであっても良い。 Further, the “computer system” includes a homepage providing environment (or display environment) if a WWW system is used.
The “computer-readable recording medium” refers to a portable medium such as a flexible disk, a magneto-optical disk, a ROM, and a CD-ROM, and a storage unit such as a hard disk built in the computer system. Furthermore, the “computer-readable recording medium” dynamically holds a program for a short time like a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line. In this case, a volatile memory in a computer system serving as a server or a client in that case is also used to hold a program for a certain period of time. The program may be a program for realizing a part of the functions described above, and may be a program capable of realizing the functions described above in combination with a program already recorded in a computer system.

以上、この発明の実施形態について図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 The embodiment of the present invention has been described in detail with reference to the drawings. However, the specific configuration is not limited to this embodiment, and includes designs and the like that do not depart from the gist of the present invention.

１…ノード装置
１０…負荷分散装置
１１…ノードテーブル管理部
１２…仮想ノード登録部
１２１…メモリ
１３…ノードＩＤ生成部
１４…管理範囲長演算部
１５…記憶部
１６…キーテーブル処理部
２１…ノードテーブル
２２…キー検索用ノードテーブル
２３…キーテーブル
３０…記憶部
５…通信ネットワーク DESCRIPTION OF SYMBOLS 1 ... Node apparatus 10 ... Load distribution apparatus 11 ... Node table management part 12 ... Virtual node registration part 121 ... Memory 13 ... Node ID generation part 14 ... Management range length calculating part 15 ... Storage part 16 ... Key table processing part 21 ... Node Table 22 ... Key search node table 23 ... Key table 30 ... Storage unit 5 ... Communication network

Claims

Storing a node table including node specifying information for specifying a node device constituting the data management system; and the node specifying information and a node ID generated for the node device specified by the node specifying information; A storage unit for storing a key search node table indicating the association of
A node table management unit for writing node specifying information of a node device participating in the data management system to the node table, and deleting node specifying information of a node device leaving the data management system from the node table;
A management range for calculating a management range length of a key to be managed by each node device based on the node ID set in the node table for key search and the node specifying information associated with the node ID A length calculator,
The node table for key search that sets the correspondence between the node specifying information in the node table and the node ID generated for the node device specified by the node specifying information is written in the storage unit and generated. The node device selected based on the management range length until it is determined that the difference in management range length of each node device calculated by the management range length calculation unit based on the key search node table matches a predetermined condition Virtual node registration that repeats the process of adding a new node ID to the key search node and updating the key identification node table by associating the node identification information of the node device with the newly generated node ID And
A key table processing unit that acquires node identification information of the node device that manages a value corresponding to the key based on a node ID in the key search node table and a specified key;
A load balancer comprising:

The management range length calculation unit weights the management range length of the node device according to a value indicating the performance of the node device,
The virtual node registration unit determines whether a difference in the management range length weighted according to a value indicating the performance of the node device meets a predetermined condition;
The load distribution apparatus according to claim 1.

The management range length calculation unit, when an association between node identification information of the node device and the newly generated node ID is added to the key search node table by the virtual node registration unit, The node ID adjacent to the node ID added to the node table is specified to calculate the management range length of the key, and the management range length of the node device for which the node ID has been newly generated is calculated. The node management information is updated to the added value, and node identification information corresponding to the node ID adjacent to the node ID of the virtual node is read from the key search node table, and the key management of the node device identified by the read node identification information Update the range length to a value obtained by subtracting the calculated management range length.
The load distribution apparatus according to claim 1 or 2, wherein

The load distribution apparatus according to claim 1, wherein the node specifying information is an address of the node device.

A computer used as a load balancer
Storing a node table including node specifying information for specifying a node device constituting the data management system; and the node specifying information and a node ID generated for the node device specified by the node specifying information; A storage unit for storing a key search node table indicating the association of
A node table management unit for writing node specifying information of a node device participating in the data management system to the node table, and deleting node specifying information of a node device leaving the data management system from the node table;
A management range for calculating a management range length of a key to be managed by each node device based on the node ID set in the node table for key search and the node specifying information associated with the node ID Long operation part,
The node table for key search that sets the correspondence between the node specifying information in the node table and the node ID generated for the node device specified by the node specifying information is written in the storage unit and generated. The node device selected based on the management range length until it is determined that the difference in management range length of each node device calculated by the management range length calculation unit based on the key search node table matches a predetermined condition Virtual node registration that repeats the process of adding a new node ID to the key search node and updating the key identification node table by associating the node identification information of the node device with the newly generated node ID Part,
A key table processing unit that acquires node identification information of the node device that manages a value corresponding to the key based on a node ID in the key search node table and a specified key;
A program characterized by functioning as