JP6353395B2

JP6353395B2 - Distribution request distribution method, node, and program

Info

Publication number: JP6353395B2
Application number: JP2015078854A
Authority: JP
Inventors: 后宏水谷; 井上　武; 武井上; 暢間野; 修明石
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2015-04-08
Filing date: 2015-04-08
Publication date: 2018-07-04
Anticipated expiration: 2035-04-08
Also published as: JP2016201604A

Description

本発明は、複数のノード（ＰＣ）間で構成される分散ハッシュテーブルおいて、膨大なデータを分散管理し配信する際に特定のノードに配信要求が偏らないように配信要求を分散させる配信要求分散方法、ノード、及びプログラムに関する。 The present invention relates to a distribution request that distributes distribution requests so that distribution requests are not biased to specific nodes when a large amount of data is distributed and managed in a distributed hash table composed of a plurality of nodes (PCs). The present invention relates to a distribution method, a node, and a program.

分散ハッシュテーブルとは、ひとつの論理的なＩＤ空間を複数のノードで分担し管理する技術であり、ＫｅｙＶａｌｕｅＳｔｏｒｅ等の分散データベースにて利用されている。分散ハッシュテーブルにデータを配置する場合、当該データの名前をキーとし、当該キーを担当するノードに対して当該データを配置する。データを分散ハッシュテーブルから取得する場合、取得したいデータの名前をキーとし、分散ハッシュテーブル上から当該キーを担当するノードを探し、データを取得する。しかしながら、特定のデータもしくはデータ群に配信要求が集中する場合、当該データ（群）を管理するノードの配信負荷が高くなるという問題が存在している。 The distributed hash table is a technique for sharing and managing one logical ID space by a plurality of nodes, and is used in a distributed database such as Key Value Store. When data is arranged in the distributed hash table, the name of the data is used as a key, and the data is arranged for a node in charge of the key. When acquiring data from the distributed hash table, the name of the data to be acquired is used as a key, the node in charge of the key is searched from the distributed hash table, and the data is acquired. However, when distribution requests are concentrated on specific data or a data group, there is a problem that a distribution load of a node that manages the data (group) increases.

本問題に対して、既存手法１（例えば、非特許文献１を参照。）は、特定のデータ群に配信要求が集中する場合、配信要求が少ないノードを当該データ群のＩＤ空間の一部を担当するように、当該ノードを移動させる。図１では、ノードＢ，Ｃに配信要求が集中している分散ハッシュテーブルを既存手法１を用いて配信要求を平滑化している。具体的には、配信要求数が少ないノードＤが配信要求数の多いＢの担当範囲の一部：［１５，２０）を請け負うことで、配信要求数の平滑化を行う。この時、Ｄは自身の担当範囲を隣接するノードＣに対して受け渡す。既存手法２（例えば、非特許文献２を参照。）では、各ノードが配信要求数の上限を超えた場合、自身の担当範囲を隣接するノードに対して譲渡する。図２では、ノードＢ，Ｃが担当範囲を狭め、Ａ，Ｄは担当範囲を広めることで、配信要求数の平滑化を行っている。 To solve this problem, the existing method 1 (for example, refer to Non-Patent Document 1), when distribution requests are concentrated on a specific data group, a node having a small distribution request is assigned a part of the ID space of the data group. Move the node to take charge. In FIG. 1, a distributed hash table in which distribution requests are concentrated on nodes B and C is smoothed using the existing method 1. Specifically, the node D with a small number of distribution requests undertakes a part of the range of responsibility of B with a large number of distribution requests: [15, 20], thereby smoothing the number of distribution requests. At this time, D passes its assigned range to the adjacent node C. In the existing method 2 (for example, see Non-Patent Document 2), when each node exceeds the upper limit of the number of distribution requests, its own assigned range is transferred to an adjacent node. In FIG. 2, nodes B and C narrow the assigned range, and A and D widen the assigned range, thereby smoothing the number of distribution requests.

Ａ．Ｒ．Ｂｈａｒａｍｂｅ，Ｍ．Ａｇｒａｗａｌ，ａｎｄＳ．Ｓｅｓｈａｎ， “Ｍｅｒｃｕｒｙ：ｓｕｐｐｏｒｔｉｎｇｓｃａｌａｂｌｅｍｕｌｔｉ−ａｔｔｒｉｂｕｔｅｒａｎｇｅｑｕｅｒｉｅｓ”，ｐｐ．３５３−３６６，２００４．A. R. Bharambe, M.M. Agrawal, and S.A. Seshan, “Mercury: supporting scalable multi-attribute range queries”, pp. 197 353-366, 2004. Ｉ．Ｋｏｎｓｔａｎｔｉｎｏｕ，Ｄ．Ｔｓｏｕｍａｋｏｓ，ａｎｄＮ．Ｋｏｚｉｒｉｓ， “Ｆａｓｔａｎｄｃｏｓｔ−ｅｆｆｅｃｔｉｖｅｏｎｌｉｎｅｌｏａｄ−ｂａｌａｎｃｉｎｇｉｎｄｉｓｔｒｉｂｕｔｅｄｒａｎｇｅ−ｑｕｅｒｉａｂｌｅｓｙｓｔｅｍｓ”，ＩＥＥＥＴｒａｎｓａｃｔｉｏｎｓｏｎＰａｒａｌｌｅｌａｎｄＤｉｓｔｒｉｂｕｔｅｄＳｙｓｔｅｍｓ，ｖｏｌ．２２，ｎｏ．８，ｐｐ．１３５０−１３６４，２０１１．I. Konstantinou, D.H. Tsusumakos, and N.M. Koziris, “Fast and cost-effective online load-balancing in distributed systems”, IEEE Transactions on Parallels and Distillates. 22, no. 8, pp. 1350-1364, 2011. Ｍ．ＨｅｆｅｅｄａａｎｄＯ．Ｓａｌｅｈ， “Ｔｒａｆｆｉｃｍｏｄｅｌｉｎｇａｎｄｐｒｏｐｏｒｔｉｏｎａｌｐａｒｔｉａｌｃａｃｈｉｎｇｆｏｒｐｅｅｒ−ｔｏ−ｐｅｅｒｓｙｓｔｅｍｓ”，ＩＥＥＥ／ＡＣＭＴｒａｎｓａｃｔｉｏｎｓｏｎＮｅｔｗｏｒｋｉｎｇ，ｖｏｌ．１６，ｎｏ．６，ｐｐ．１４４７−１４６０，２００８．M.M. Hefeeda and O. Saleh, “Traffic modeling and proportional partial caching for peer-to-peer systems”, IEEE / ACM Transactions on Networking, vol. 16, no. 6, pp. 1447-1460, 2008.

しかし、既存手法１，２は逐次的かつ非同期で行うため、配信要求数が平滑化されるまでは何度もノードの担当範囲が切り替わり、ノード間でのデータ移動が多発することになる。つまり、既存手法１，２には、ノードの担当範囲の切替わりに伴うノード間でのデータ移動が多発するという課題があった。 However, since the existing methods 1 and 2 are performed sequentially and asynchronously, the assigned range of the node is switched many times until the number of distribution requests is smoothed, and data movement between the nodes frequently occurs. In other words, the existing methods 1 and 2 have a problem that data movement frequently occurs between nodes due to switching of the assigned range of nodes.

そこで、本発明は、上記課題を解決すべく、頻繁なデータの移動を抑制しつつ、各ノード間のデータ配信数を平準化することができる配信要求分散方法、ノード、及びプログラムを提供することを目的とする。 Accordingly, the present invention provides a distribution request distribution method, a node, and a program capable of leveling the number of data distribution between nodes while suppressing frequent data movement in order to solve the above problems. With the goal.

そこで、本発明は、ある時間単位で各ノードに対する配信要求数を計測し、当該結果の元、要求数が多いノードが管理するデータを当該ノードと隣接する複数のノードで分散して管理することとした。 Therefore, the present invention measures the number of distribution requests for each node in a certain time unit, and distributes and manages the data managed by the node having a large number of requests based on the result at a plurality of nodes adjacent to the node. It was.

具体的には、本発明に係る配信要求分散方法は、共通の大きさの担当範囲Ｈにあるデータｘをそれぞれ保持する複数のノードで構成される分散ハッシュテーブルにおいて、担当範囲Ｈを変化させてデータの配信要求を各ノードに分散させる配信要求分散方法であって、
１つのノードが一定の時間に他のノードにされる配信要求の数を調査するデータ配信数調査ステップと、
前記１つのノードが配信要求を受信したときに、
前記データ配信数調査ステップで取得した前記配信要求の数に基づいて、データｘが配信要求される確率密度分布である配信要求関数ｆ（ｘ）を決定し、データｘを保持するノード数の確率密度分布である包含関数ｇ（ｘ、Ｈ）とｆ（ｘ）との二乗誤差が所望のデータｘの範囲において最小となるように担当範囲Ｈを決定する担当範囲決定ステップと、
を行うことを特徴とする。 Specifically, the distribution request distribution method according to the present invention changes the assigned range H in a distributed hash table composed of a plurality of nodes each holding data x in the assigned range H of a common size. A distribution request distribution method for distributing data distribution requests to each node,
A data distribution number investigation step in which one node investigates the number of distribution requests made to other nodes at a certain time;
When the one node receives a distribution request,
Based on the number of distribution requests acquired in the data distribution number checking step, a distribution request function f (x), which is a probability density distribution in which data x is requested to be distributed, is determined, and the probability of the number of nodes holding the data x A charge range determination step for determining the charge range H so that the square error between the inclusion functions g (x, H) and f (x), which are density distributions, is minimized in the range of the desired data x;
It is characterized by performing.

また、本発明に係るノードは、分散ハッシュテーブルを構成し、共通の大きさの担当範囲Ｈにあるデータｘを保持するノードであって、
一定の時間に他のノードにされる配信要求の数を調査するデータ配信数調査モジュールと、
配信要求を受信したときに、
前記データ配信数調査モジュールが取得した前記配信要求の数に基づいて、データｘが配信要求される確率密度分布である配信要求関数ｆ（ｘ）を決定し、データｘを保持するノード数の確率密度分布である包含関数ｇ（ｘ、Ｈ）とｆ（ｘ）との二乗誤差が所望のデータｘの範囲において最小となるように担当範囲Ｈを決定する担当範囲決定モジュールと、
を備える。 In addition, the node according to the present invention is a node that constitutes a distributed hash table and holds data x in the charge range H of a common size,
A data distribution number survey module that investigates the number of distribution requests made to other nodes at a certain time,
When a delivery request is received,
Based on the number of distribution requests acquired by the data distribution number survey module, a distribution request function f (x), which is a probability density distribution in which data x is requested to be distributed, is determined, and the probability of the number of nodes holding the data x A responsible range determination module for determining the responsible range H such that the square error between the inclusion functions g (x, H) and f (x), which are density distributions, is minimized in the range of the desired data x;
Is provided.

本発明は、各ノードの配信要求数から負荷が最も平滑化できるような各ノードの担当範囲を一意に決定する。具体的には、特定の時間間隔にて配信要求数が多いノードのデータは隣接するノード間で共有することにする（図３を参照。）。これにより、配信要求数が平滑化されるまで、担当範囲が切りかわることによる頻繁なデータ転送を抑制することができる。 The present invention uniquely determines the assigned range of each node from which the load can be most smoothed from the number of distribution requests of each node. Specifically, data of a node having a large number of distribution requests at a specific time interval is shared between adjacent nodes (see FIG. 3). As a result, frequent data transfer due to switching of the assigned range can be suppressed until the number of distribution requests is smoothed.

さらに、本発明に係るプログラムは、コンピュータに前記配信要求分散方法を実行させるためのプログラムである。 Furthermore, the program according to the present invention is a program for causing a computer to execute the distribution request distribution method.

本発明に係る配信要求分散方法は、コンピュータとプログラムによっても実現でき、プログラムを記録媒体に記録することも、ネットワークを通して提供することも可能である。 The distribution request distribution method according to the present invention can be realized by a computer and a program, and can be recorded on a recording medium or provided through a network.

本発明は、頻繁なデータの移動を抑制しつつ、各ノード間のデータ配信数を平準化することができる配信要求分散方法、ノード、及びプログラムを提供することができる。 The present invention can provide a distribution request distribution method, a node, and a program capable of leveling the number of data distribution between nodes while suppressing frequent data movement.

従来の配信要求分散方法を説明する図である。（ａ）は担当範囲変更前であり、（ｂ）は担当範囲変更後である。It is a figure explaining the conventional delivery request distribution method. (A) is before the assigned range change, and (b) is after the assigned range change. 従来の配信要求分散方法を説明する図である。（ａ）は担当範囲変更前であり、（ｂ）は担当範囲変更後である。It is a figure explaining the conventional delivery request distribution method. (A) is before the assigned range change, and (b) is after the assigned range change. 本発明に係る配信要求分散方法を説明する図である。It is a figure explaining the distribution request distribution method concerning the present invention. 本発明に係るノードを説明する図である。It is a figure explaining the node which concerns on this invention. 本発明に係るノードの動作を説明する図である。It is a figure explaining operation | movement of the node which concerns on this invention. 本発明に係る配信要求分散方法を説明する図である。It is a figure explaining the distribution request distribution method concerning the present invention. 本発明に係る配信要求分散方法の実験結果を説明する図である。It is a figure explaining the experimental result of the distribution request distribution method which concerns on this invention. 本発明に係る配信要求分散方法の実験結果を説明する図である。It is a figure explaining the experimental result of the distribution request distribution method which concerns on this invention. 本発明に係る配信要求分散方法と従来の配信要求分散方法とを比較するシミュレーション結果である。It is the simulation result which compares the delivery request distribution method which concerns on this invention, and the conventional delivery request distribution method. 本発明に係る配信要求分散方法と従来の配信要求分散方法とを比較するシミュレーション結果である。It is the simulation result which compares the delivery request distribution method which concerns on this invention, and the conventional delivery request distribution method.

添付の図面を参照して本願発明の実施形態を説明する。以下に説明する実施形態は本願発明の実施の例であり、本願発明は、以下の実施形態に制限されるものではない。なお、本明細書及び図面において符号が同じ構成要素は、相互に同一のものを示すものとする。また、以下の説明で配信要求数と配信数とは同数であるとする。 Embodiments of the present invention will be described with reference to the accompanying drawings. Embodiment described below is an example of implementation of this invention, and this invention is not restrict | limited to the following embodiment. In the present specification and drawings, the same reference numerals denote the same components. In the following description, it is assumed that the number of distribution requests and the number of distributions are the same.

図４は、本実施形態のノード１００を説明する図である。
ノード１００は、分散ハッシュテーブルを構成し、共通の大きさの担当範囲Ｈにあるデータｘを保持するノードであって、
一定の時間に他のノードにされる配信要求の数を調査するデータ配信数調査モジュール３０と、
配信要求を受信したときに、
データ配信数調査モジュール３０が取得した前記配信要求の数に基づいて、データｘが配信要求される確率密度分布である配信要求関数ｆ（ｘ）を決定し、データｘを保持するノード数の確率密度分布である包含関数ｇ（ｘ、Ｈ）とｆ（ｘ）との二乗誤差が所望のデータｘの範囲において最小となるように担当範囲Ｈを決定する担当範囲決定モジュール４０と、
を備える。
ノード１００は、さらに、データベース１０と経路制御モジュール２０を備える。 FIG. 4 is a diagram illustrating the node 100 according to the present embodiment.
The node 100 is a node that constitutes a distributed hash table and holds data x in a common range of charge H,
A data distribution number investigation module 30 for investigating the number of distribution requests made to other nodes at a certain time;
When a delivery request is received,
Based on the number of distribution requests acquired by the data distribution number survey module 30, a distribution request function f (x) that is a probability density distribution in which the data x is requested to be distributed is determined, and the probability of the number of nodes that hold the data x An assigned range determination module 40 for determining the assigned range H such that the square error between the inclusion functions g (x, H) and f (x), which are density distributions, is minimized in the range of the desired data x;
Is provided.
The node 100 further includes a database 10 and a path control module 20.

図５は、これらのモジュールの動作例である。
経路制御モジュール２０、データベース１０、およびデータ配信数調査モジュール３０は既存手法と同様の機能である。経路制御モジュール２０は、一般的な構造化オーバレイが持つべきルーティングのモジュールとなっており、ネットワーク上からデータ等を探索／配信する機能を持つ。 FIG. 5 shows an operation example of these modules.
The route control module 20, the database 10, and the data distribution number survey module 30 have the same functions as the existing method. The route control module 20 is a routing module that a general structured overlay should have, and has a function of searching / distributing data and the like from the network.

データベース１０は、ノードの担当範囲、経路情報（他のノードのＩＰアドレスとＩＤ）、他のノードの配信要求数等を保存している。データベース１０も一般的な構造化オーバレイの必要機能となっている。 The database 10 stores node assignment ranges, route information (IP addresses and IDs of other nodes), distribution request counts of other nodes, and the like. The database 10 is also a necessary function for general structured overlay.

データ配信数調査モジュール３０は、既存手法１で提案されている手法を用いて、ネットワーク上の他のノードの配信数を一定時間間隔で調査し（ステップＳ０１）、データベース１０に書き込む（ステップＳ０３）。また、データ配信数調査モジュール３０は、経路制御モジュール２０に対してされた自身への配信要求を受信し（ステップＳ０２）、データベース１０に書き込む（ステップＳ０３）。データベース１０は、配信要求が更新されたことを担当範囲決定モジュール４０に通知する（ステップＳ０４）。担当範囲決定モジュール４０は、他のノード及び自身への配信要求数の調査結果を用いて、自身の担当範囲を決定し（ステップＳ０５）、データベース１０が保持する担当範囲を更新する（ステップＳ０６）。 The data distribution number checking module 30 uses the method proposed in the existing method 1 to check the number of distributions of other nodes on the network at regular time intervals (step S01) and write it in the database 10 (step S03). . Further, the data distribution number investigation module 30 receives the distribution request to itself made to the path control module 20 (step S02) and writes it in the database 10 (step S03). The database 10 notifies the assigned range determination module 40 that the distribution request has been updated (step S04). The assigned range determination module 40 determines the assigned range by using the investigation result of the number of distribution requests to other nodes and itself (Step S05), and updates the assigned range held by the database 10 (Step S06). .

続いて、担当範囲を決定する手法を説明するが、既存手法と異なる手法は、担当範囲決定モジュール４０でなされるため、以下では担当範囲決定モジュール４０に関する説明を行う。 Subsequently, a method for determining the assigned range will be described. Since a method different from the existing method is performed by the assigned range determining module 40, the assigned range determining module 40 will be described below.

各ノードはＩＤを持ち、ノードｉ（＞０）のＩＤをＸ_ｉとし、ＩＤ空間上、Ｘ_ｉより大きくもっともＩＤの距離が近いノードをノードｉのＳｕｃｃｅｓｓｏｒと呼び、当該ノードのＩＤをｓｕｃ_ｉとする。この時、ＩＤ空間上のノードｉの担当範囲は［Ｘ_ｉ，ｓｕｃ_ｉ）とする。 Each node has an ID, the ID of node i (> 0) is X _i , the node that is larger than X _{i and} has the closest ID distance in the ID space is called the successor of node i, and the ID of the node is suc _i And At this time, the assigned range of the node i in the ID space is [X _i , suc _i ).

本実施形態では、配信要求数が高いノードに対して、多くのノードの担当範囲を広げ、担当範囲を重複させることで、配信数の増加による負荷を分散させる。広げられた担当範囲をＨとすると、ノードの担当範囲は、［Ｘ_ｉ，Ｈ＋Ｘ_ｉ）となる（ただし、ｓｕｃ_ｉ≦Ｈ＋Ｘ_ｉ）。ここで、担当範囲Ｈは全てのノードで同じ値である。 In the present embodiment, the load due to the increase in the number of distributions is distributed by expanding the assigned ranges of many nodes and overlapping the assigned ranges with respect to nodes with a high number of distribution requests. When the expanded assigned range is H, the assigned range of the node is [X _i , H + X _i ) (where suc _i ≦ H + X _i ). Here, the assigned range H is the same value for all nodes.

担当範囲Ｈは、各データの配信数の確率密度分布と各データがどれだけのノードの担当範囲に含まれているかを示す確率密度分布との距離を最小化するように決定する。これは、配信要求数が高いノードほど多くのノードに対してデータを多く複製する際の最適な複製数を決定することと同義である。本最小化問題の概要を図６に示す。 The assigned range H is determined so as to minimize the distance between the probability density distribution of the number of distributions of each data and the probability density distribution indicating how many nodes each data is included in. This is synonymous with determining the optimum number of copies when replicating more data to more nodes as the number of distribution requests increases. An outline of the minimization problem is shown in FIG.

ここで、データ配信数調査モジュール３０が、既存手法１を用い、他のノードの配信数を一定間隔で調査した結果、Ｌ個のデータの配信数を取得できたとする。データｘの配信要求数をＬ_ｘとすると、各ノードの配信数の確率密度分布ｆ（ｘ）は、

と定義できる。 Here, it is assumed that, as a result of the data distribution number investigation module 30 using the existing method 1 and examining the number of distributions of other nodes at regular intervals, the number of distributions of L data can be acquired. If the number of distribution requests for data x is L _x , the probability density distribution f (x) of the number of distributions of each node is

Can be defined.

また、データｘがどれだけのノードの担当範囲に含まれているかを示す確率密度分布ｇ（ｘ，Ｈ）は、

と定義する。Ｎは全ノード数である。 Further, a probability density distribution g (x, H) indicating how many nodes the data x is included in is assigned as follows:

It is defined as N is the total number of nodes.

ここで、

は、ノードｊの担当範囲にデータｘが含まれていれば１となり、そうでなければ０となるカーネル関数となる関数である。本実施形態では

と定義する。 here,

Is a function that is a kernel function that is 1 if the data x is included in the range in charge of the node j, and 0 otherwise. In this embodiment

It is defined as

そして、

で表現される両確率密度分布の二乗誤差を最小にするＨを求める。但し、Ｑはデータ配信数調査モジュールにて得られた各データの配信数の集合（具体的には、「所定の時間内に配信されたｘ＝ｊのデータ数」であり、「Ｌ個のデータの配信要求数」）である。 And

H that minimizes the square error of both probability density distributions expressed by However, Q is a set of distribution numbers of each data obtained by the data distribution number survey module (specifically, “the number of data of x = j distributed within a predetermined time”), The number of data delivery requests ").

Ｈの値域はＩＤ空間の大きさに依存し、一般的に２^１６０程度の大きさとなる。そのため、線形探索にてＨを計算することは難しいため、当該二乗誤差の関数を凸関数と見立て、Ｈを二分探索にて求め、ノードｉは自身の担当範囲を［Ｘ_ｉ，Ｈ＋Ｘ_ｉ）と設定し、データベースの担当範囲を更新する。 Range of H depends on the size of the ID space, and generally of the order of 2 ¹⁶⁰ size. Therefore, since it is difficult to calculate H by linear search, the function of the square error is regarded as a convex function, H is obtained by binary search, and node i has its own assigned range as [X _i , H + X _i ). Set and update the database responsibility.

複数のノードの担当範囲にまたがって管理されるデータに対する配信要求は、当該ノード群内からランダムに選ばれたノードが受信し、要求元に配信する。 A distribution request for data managed across a range of responsibility of a plurality of nodes is received by a node randomly selected from the node group and distributed to the request source.

（効果）
既存手法１，２との比較実験にて、本実施形態の効果を定量化する。実験パラメータは図９の通りである。なお、各ノードに対するデータ数は一様であり、各ノードのＩＤと各データに対する配信要求数は表２で示す各Ｍａｎｄｅｌｂｒｏｔ−Ｚｉｐｆｄｉｓｔｒｉｂｕｔｉｏｎ（非特許文献３）の値（ｑ，α）に依存するものとする。これらのパラメータを用いて、各ノードは、自身の配信数を１０，０００になるよう、各手法を用いて配信数を平滑化する。 (effect)
The effects of the present embodiment are quantified by comparison experiments with the existing methods 1 and 2. Experimental parameters are as shown in FIG. The number of data for each node is uniform, and the ID of each node and the number of distribution requests for each data depend on the value (q, α) of each Mandelbrot-Zipf distribution (Non-Patent Document 3) shown in Table 2. Shall. Using these parameters, each node smoothes the number of distributions using each method so that its number of distributions becomes 10,000.

図７に各手法を用いて配信要求数の平滑化を行った後のデータの最大配信数を示す。本実施形態における最大配信数は、他の手法よりも低い値になっていることがわかる。図８にデータの総転送数を示す。既存手法２では最大配信数を平滑化するために本実施形態の８倍程度のデータを転送しなければならない。また既存手法１では、本実施形態の３倍程度のデータ転送を行わなければならない。以上により、本実施形態は少ないデータ転送（複製配置）によって、データ配信数を平滑化できることが分かる。 FIG. 7 shows the maximum number of data distributions after smoothing the number of distribution requests using each method. It can be seen that the maximum number of distributions in this embodiment is lower than other methods. FIG. 8 shows the total number of data transfers. In the existing method 2, in order to smooth the maximum number of distributions, it is necessary to transfer about eight times the data of this embodiment. In the existing method 1, data transfer about three times that of this embodiment must be performed. From the above, it can be seen that the present embodiment can smooth the number of data distributions with a small amount of data transfer (replication arrangement).

１０：データベース
２０：経路制御モジュール
３０：データ配信数調査モジュール
４０：担当範囲決定モジュール
１００：ノード 10: Database 20: Path control module 30: Data distribution number investigation module 40: Responsible range determination module 100: Node

Claims

In a distributed hash table composed of a plurality of nodes each holding data x in a shared range H with a common size, a distribution request distribution method that distributes data distribution requests to each node by changing the range H There,
A data distribution number investigation step in which one node investigates the number of distribution requests made to other nodes at a certain time;
When the one node receives a distribution request,
Based on the number of distribution requests acquired in the data distribution number checking step, a distribution request function f (x), which is a probability density distribution in which data x is requested to be distributed, is determined, and the probability of the number of nodes holding the data x A charge range determination step for determining the charge range H so that the square error between the inclusion functions g (x, H) and f (x), which are density distributions, is minimized in the range of the desired data x;
A distribution request distribution method characterized by:

A node that constitutes a distributed hash table and holds data x in the charge range H of a common size,
A data distribution number survey module that investigates the number of distribution requests made to other nodes at a certain time,
When a delivery request is received,
Based on the number of distribution requests acquired by the data distribution number survey module, a distribution request function f (x), which is a probability density distribution in which data x is requested to be distributed, is determined, and the probability of the number of nodes holding the data x A responsible range determination module for determining the responsible range H such that the square error between the inclusion functions g (x, H) and f (x), which are density distributions, is minimized in the range of the desired data x;
A node characterized by comprising:

A program for causing a computer to execute the distribution request distribution method according to claim 1.