JP2012238084A

JP2012238084A - Data load distribution arrangement system and data load distribution arrangement method

Info

Publication number: JP2012238084A
Application number: JP2011105225A
Authority: JP
Inventors: Hikotoshi Nakazato; 彦俊中里; Manabu Nishio; 学西尾; Masafumi Shimizu; 雅史清水
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2011-05-10
Filing date: 2011-05-10
Publication date: 2012-12-06
Anticipated expiration: 2031-05-10
Also published as: JP5540269B2

Abstract

PROBLEM TO BE SOLVED: To improve throughput of an entire system by realizing load distribution, while inhibiting a number of arranged virtual servers.SOLUTION: In a data load distribution arrangement system 1 comprising a virtual server arrangement device 10 which is communicably connected to a plurality of DB servers 20, the virtual server arrangement device 10 calculates an absolute value of a difference between a load and a server threshold for each DB server 20 and generates aggregation of servers Swhose loads are equal to or less than the server threshold and aggregation of servers Swhose loads are more than the server threshold. Then the virtual server arrangement device 10 generates data information for server in charge 100 by arranging the virtual servers of the servers Sagainst an excess of an area of which the servers Sare in charge on a hash space, sequentially from the DB server 20 with the largest absolute value among the servers S. Each DB server 20 exchanges data stored by itself with another DB server 20 on the basis of the data information for server in charge 100 generated by the virtual server arrangement device 10.

Description

本発明は、大量のデータを分散して格納する分散データベースの技術分野において、データを格納する各サーバ間の負荷分散性を維持するための仮想サーバを用いたデータ負荷分散配置システムおよびデータ負荷分散配置方法に関する。 The present invention relates to a distributed database system for storing a large amount of data in a distributed manner, and a data load distribution arrangement system and a data load distribution using a virtual server for maintaining load distribution among servers storing data It relates to the arrangement method.

データを複数のサーバ間に分散して格納する技術として、データとサーバとをハッシュ関数にかけてリング状のハッシュ空間に配置し、ハッシュ空間上のデータとサーバの位置関係から、各サーバが担当すべきデータを決定するコンシステントハッシュ法がある（非特許文献１参照）。図１９（ａ）に示すように、リング状のハッシュ空間にデータとサーバとが配置され、あるサーバ（例えば、「Ｓ_１」）が配置された位置から、その１つ前に配置されたサーバ「Ｓ_５」の配置位置の前までの領域（担当領域）に配置されたデータを、時計回りに、そのサーバ「Ｓ_１」が格納する。このコンシステントハッシュ法によるデータの分散格納方法では、サーバ数の増減に対して格納先のサーバが変更となるデータが限られるため、拡張性に富むという特徴がある。 As a technology for storing data distributed among multiple servers, the data and server should be assigned to a ring-shaped hash space by applying a hash function, and each server should be responsible for the positional relationship between the data on the hash space and the server. There is a consistent hash method for determining data (see Non-Patent Document 1). As shown in FIG. 19A, data and a server are arranged in a ring-shaped hash space, and a server arranged immediately before a certain server (for example, “S ₁ ”) is arranged. The server “S ₁ ” stores the data arranged in the area (area in charge) before the arrangement position of “S ₅ ” clockwise. This distributed storage method of data by the consistent hash method has a feature that it is highly extensible because data for which the storage destination server is changed is limited as the number of servers increases or decreases.

しかし、このコンシステントハッシュ法を用いたデータ配置方法では、ハッシュ空間におけるサーバとデータのハッシュ値の配置が一様にならず、各サーバの担当するデータ数に偏りが生じやすい。この問題を解決するため、一つのサーバから複数のハッシュ値を算出することにより、ハッシュ空間上に仮想サーバを配置する手法がある（非特許文献２参照）。例えば、図１９（ｂ）に示すように、ハッシュ空間上にサーバ「Ｓ_３」の仮想サーバ「ＶＳ_３」を、例えば、サーバ「Ｓ_１」の担当領域にハッシュ値に基づき配置する。そして、この仮想サーバ「ＶＳ_３」を設けることで、サーバ「Ｓ_１」の担当領域の一部のデータを、仮想サーバ「ＶＳ_３」の基のサーバ「Ｓ_３」に格納させることで、サーバ「Ｓ_１」の負荷を減らし、負荷分散性を向上させる。
この仮想サーバを用いた方法によれば、各サーバの仮想サーバをハッシュ空間上に増やすことで、担当領域を細かく分割し各サーバが格納するデータ数を、より均一にすることができる。 However, in the data arrangement method using the consistent hash method, the arrangement of the hash values of the server and the data in the hash space is not uniform, and the number of data handled by each server tends to be biased. In order to solve this problem, there is a method of arranging a virtual server on a hash space by calculating a plurality of hash values from one server (see Non-Patent Document 2). For example, as shown in FIG. 19B, the virtual server “VS ₃ ” of the server “S ₃ ” is arranged on the hash space based on the hash value, for example, in the assigned area of the server “S ₁ ”. Then, by providing the virtual server "VS _3", a part of the data area of responsibility of the server "S _1", by storing in the server "S _3" groups of virtual servers "VS _3", the server The load of “S ₁ ” is reduced to improve load distribution.
According to this method using virtual servers, the number of data stored in each server can be made more uniform by increasing the number of virtual servers of each server on the hash space and finely dividing the assigned area.

D.Karger,E.Lehman, T.Leighton, R.Panigraphy, M.Levine, and D.Lewin, “Consistent hashing and random trees,” Proc. Twenty-Ninth Annual ACM Symposium on Theory of Computing, pp.654-663, Elpaso, Texas, 1997D. Karger, E. Lehman, T. Leighton, R. Panigraphy, M. Levine, and D. Lewin, “Consistent hashing and random trees,” Proc. Twenty-Ninth Annual ACM Symposium on Theory of Computing, pp.654- 663, Elpaso, Texas, 1997 Rajesh Swaminathan,“Web Caching with Consistet Hashing,”University of Waterloo, pp.11-13, San Francisco, CA, 2009, ［online］、［平成23年4月22日検索］、インターネット＜http://rajesh.rapidtech.ca/publications/work_reports/report5.pdf＞Rajesh Swaminathan, “Web Caching with Consistet Hashing,” University of Waterloo, pp.11-13, San Francisco, CA, 2009, [online], [searched April 22, 2011], Internet <http: // rajesh .rapidtech.ca / publications / work_reports / report5.pdf>

しかしながら、従来のコンシステントハッシュ法による仮想サーバの配置方法では、サーバにハッシュ関数を繰り返し適用するなどして複数のハッシュ値を算出し、仮想サーバをハッシュ空間に配置する。従って、適用するハッシュ関数に応じて負荷分散効果が異なると同時に、分散効果を高めるためには大量の仮想サーバをハッシュ空間上に配置する必要がある。その結果、仮想サーバ保持のための記述量が増加し、クライアントからデータの問い合わせを受けた際に、そのデータを格納している担当サーバの検索に要する時間が増加してしまう。 However, in the conventional virtual server arrangement method using the consistent hash method, a plurality of hash values are calculated by repeatedly applying a hash function to the server, and the virtual server is arranged in the hash space. Accordingly, the load distribution effect varies depending on the hash function to be applied, and at the same time, in order to increase the distribution effect, it is necessary to arrange a large number of virtual servers on the hash space. As a result, the amount of description for holding a virtual server increases, and when a data inquiry is received from a client, the time required to search for a responsible server storing the data increases.

また、従来の仮想サーバを用いた負荷分散方法では、各サーバに設ける仮想サーバ数は同一に設定されることが多く、各サーバの性能を考慮したものではない。さらに、システム稼動後、時間経過に伴う担当データ数の増減や各データへの実アクセス数に応じて変動するサーバの実負荷の変化に対応できず、負荷分散効果を継続して維持するには問題があった。 Also, in the conventional load balancing method using virtual servers, the number of virtual servers provided in each server is often set to be the same, and the performance of each server is not considered. Furthermore, after the system is running, it is impossible to respond to changes in the actual load on the server that fluctuate depending on the increase or decrease in the number of data in charge over time or the actual number of accesses to each data, and to maintain the load balancing effect continuously There was a problem.

このような背景に鑑みて本発明がなされたのであり、本発明は、仮想サーバの配置数を抑えた上で、サーバ間の負荷分散を実現し、システム全体としてスループットを向上させることができる、仮想サーバを用いたデータ負荷分散配置システムおよびデータ負荷分散配置方法を提供することを課題とする。 The present invention has been made in view of such a background, and the present invention can achieve load distribution between servers while suppressing the number of virtual servers arranged, and improve the throughput of the entire system. It is an object of the present invention to provide a data load distribution and arrangement system and a data load distribution and arrangement method using a virtual server.

前記した課題を解決するため、請求項１に記載の発明は、相互に通信可能に接続される複数のＤＢサーバと、前記複数のＤＢサーバそれぞれと通信可能に接続される仮想サーバ配置装置とを備えるデータ負荷分散配置システムであって、前記ＤＢサーバが、（１）クライアントからの検索対象となるデータ、およびそのデータそれぞれに対応付けられるデータキー、（２）前記ＤＢサーバ自身のサーバ性能、並びに、（３）前記データそれぞれの保存先の前記ＤＢサーバを示すサーバ担当データ情報が保存される記憶部と、前記データキーおよび前記サーバ性能を、前記仮想サーバ配置装置に送信するデータ管理部と、前記仮想サーバ配置装置から、新たな前記サーバ担当データ情報を受信し、前記保存していたサーバ担当データ情報と比較することにより、自身の前記ＤＢサーバの前記記憶部に保存していないデータおよびそのデータの保存先となる他のＤＢサーバを抽出し、前記抽出した他のＤＢサーバにデータ交換要求を送信することにより、前記自身が保存していないデータを取得し、前記保存していたサーバ担当データ情報を前記新たなサーバ担当データ情報に更新するデータ交換部と、を備え、前記仮想サーバ配置装置が、前記データ負荷分散配置システム全体に求められる処理性能と、前記ＤＢサーバそれぞれに設定する負荷の閾値を示す負荷閾値定数が保存される記憶部と、前記ＤＢサーバそれぞれから取得した、前記サーバ性能と、前記負荷閾値定数とを用いて、前記ＤＢサーバそれぞれの負荷の閾値であるサーバ閾値を計算し、前記データ負荷分散配置システム全体に求められる処理性能と、前記ＤＢサーバそれぞれから取得した前記データキーの数の合計であるシステム全体のデータ数とに基づき、前記データへのアクセス回数が均等であると仮定して、前記ＤＢサーバそれぞれの負荷を計算し、前記ＤＢサーバごとに、前記負荷と前記サーバ閾値との差の絶対値を計算し、前記負荷が前記サーバ閾値以下のサーバＳ⁻の集合と、前記負荷が前記サーバ閾値を超えるサーバＳ^＋の集合とを生成するサーバ負荷計算部と、前記ＤＢサーバおよび前記データそれぞれについて、所定のハッシュ関数を用いてハッシュ値を算出することにより、ハッシュ空間上に、前記データの保存を担当する前記ＤＢサーバの担当領域を設定し、前記サーバＳ^＋の集合うち、前記負荷と前記サーバ閾値との差の絶対値が大きいＤＢサーバから順に、前記ハッシュ空間上の前記サーバＳ^＋の担当領域において、担当できるデータ数が自身のＤＢサーバのサーバ閾値を超える超過分に対して、前記サーバＳ⁻の集合のうち、前記絶対値が大きいＤＢサーバから順に仮想サーバを配置し、前記超過分のデータの前記ハッシュ空間上での担当領域を前記仮想サーバに変更し、前記変更したハッシュ空間上での担当領域に基づき、前記ＤＢサーバそれぞれが担当するデータの保存先を示す前記新たなサーバ担当データ情報を生成し、前記生成した新たなサーバ担当データ情報を、前記ＤＢサーバそれぞれに送信する配置処理部と、を備えることを特徴とするデータ負荷分散配置システムとした。 In order to solve the above-described problem, the invention according to claim 1 includes a plurality of DB servers that are communicably connected to each other, and a virtual server arrangement device that is communicably connected to each of the plurality of DB servers. A data load distribution arrangement system comprising: (1) data to be searched from a client and a data key associated with each of the data; (2) server performance of the DB server itself; (3) a storage unit storing server charge data information indicating the DB server that stores each of the data; a data management unit that transmits the data key and the server performance to the virtual server placement device; Receives the new server charge data information from the virtual server placement device and compares it with the stored server charge data information To extract data not stored in the storage unit of the DB server and another DB server as a storage destination of the data, and transmit a data exchange request to the extracted other DB server. A data exchanging unit that obtains data that is not stored by itself and updates the stored server charge data information to the new server charge data information. The processing performance required for the entire data load distribution and arrangement system, a storage unit storing a load threshold constant indicating a load threshold set for each of the DB servers, the server performance acquired from each of the DB servers, A server threshold value that is a load threshold value for each of the DB servers is calculated using a load threshold constant, and the data load distribution and arrangement system Assuming that the number of accesses to the data is equal based on the overall required processing performance and the number of data in the entire system, which is the total number of the data keys acquired from each of the DB servers, The load of each server is calculated, the absolute value of the difference between the load and the server threshold is calculated for each DB server, the set of servers S ⁻ whose load is equal to or less than the server threshold, and the load is the server A server load calculation unit that generates a set of servers S ⁺ exceeding a threshold value, and for each of the DB server and the data, by calculating a hash value using a predetermined hash function, An area in charge of the DB server in charge of storage is set, and the absolute value of the difference between the load and the server threshold value in the set of the servers S ⁺ is In descending order DB server, at the server S ⁺ in charge area on the hash space for excess of charge can number data exceeds the server threshold own DB server, the server S ^- of the set of the A virtual server is arranged in order from a DB server having a larger absolute value, and a responsible area on the hash space of the excess data is changed to the virtual server, based on the changed responsible area on the hash space, An arrangement processing unit that generates the new server charge data information indicating a storage location of data handled by each DB server, and transmits the generated new server charge data information to each DB server. A featured data load distribution and arrangement system was adopted.

また、請求項３に記載の発明は、相互に通信可能に接続される複数のＤＢサーバと、前記複数のＤＢサーバそれぞれと通信可能に接続される仮想サーバ配置装置とを備えるデータ負荷分散配置システムのデータ負荷分散配置方法であって、前記ＤＢサーバが、（１）クライアントからの検索対象となるデータ、およびそのデータそれぞれに対応付けられるデータキー、（２）前記ＤＢサーバ自身のサーバ性能、並びに、（３）前記データそれぞれの保存先の前記ＤＢサーバを示すサーバ担当データ情報が保存されている記憶部を備えており、前記データキーおよび前記サーバ性能を、前記仮想サーバ配置装置に送信するステップを実行し、前記仮想サーバ配置装置が、前記データ負荷分散配置システム全体に求められる処理性能と、前記ＤＢサーバそれぞれに設定する負荷の閾値を示す負荷閾値定数が保存されている記憶部を備えており、前記ＤＢサーバそれぞれから取得した、前記サーバ性能と、前記負荷閾値定数とを用いて、前記ＤＢサーバそれぞれの負荷の閾値であるサーバ閾値を計算し、前記データ負荷分散配置システム全体に求められる処理性能と、前記ＤＢサーバそれぞれから取得した前記データキーの数の合計であるシステム全体のデータ数とに基づき、前記データへのアクセス回数が均等であると仮定して、前記ＤＢサーバそれぞれの負荷を計算し、前記ＤＢサーバごとに、前記負荷と前記サーバ閾値との差の絶対値を計算し、前記負荷が前記サーバ閾値以下のサーバＳ⁻の集合と、前記負荷が前記サーバ閾値を超えるサーバＳ^＋の集合とを生成するステップと、前記ＤＢサーバおよび前記データそれぞれについて、所定のハッシュ関数を用いてハッシュ値を算出することにより、ハッシュ空間上に、前記データの保存を担当する前記ＤＢサーバの担当領域を設定し、前記サーバＳ^＋の集合うち、前記負荷と前記サーバ閾値との差の絶対値が大きいＤＢサーバから順に、前記ハッシュ空間上の前記サーバＳ^＋の担当領域において、担当できるデータ数が自身のＤＢサーバのサーバ閾値を超える超過分に対して、前記サーバＳ⁻の集合のうち、前記絶対値が大きいＤＢサーバから順に仮想サーバを配置し、前記超過分のデータの前記ハッシュ空間上での担当領域を前記仮想サーバに変更し、前記変更したハッシュ空間上での担当領域に基づき、前記ＤＢサーバそれぞれが担当するデータの保存先を示す新たなサーバ担当データ情報を生成し、前記生成した新たなサーバ担当データ情報を、前記ＤＢサーバそれぞれに送信するステップと、を実行し、前記ＤＢサーバが、前記仮想サーバ配置装置から、前記新たなサーバ担当データ情報を受信し、前記保存していたサーバ担当データ情報と比較することにより、自身の前記ＤＢサーバの前記記憶部に保存していないデータおよびそのデータの保存先となる他のＤＢサーバを抽出し、前記抽出した他のＤＢサーバにデータ交換要求を送信することにより、前記自身が保存していないデータを取得し、前記保存していたサーバ担当データ情報を前記新たなサーバ担当データ情報に更新するステップを実行することを特徴とするデータ負荷分散配置方法とした。 According to a third aspect of the present invention, there is provided a data load distribution arrangement system comprising a plurality of DB servers that are communicably connected to each other and a virtual server arrangement apparatus that is communicably connected to each of the plurality of DB servers. In this data load distribution and arrangement method, the DB server is (1) data to be searched from a client and a data key associated with each of the data, (2) server performance of the DB server itself, and (3) a step of transmitting the data key and the server performance to the virtual server placement device, comprising a storage unit in which server-in-charge data information indicating the DB server of the storage destination of each of the data is stored; And the virtual server placement apparatus performs processing performance required for the entire data load distribution placement system, and the DB A storage unit storing a load threshold constant indicating a load threshold to be set for each server, and using the server performance and the load threshold constant acquired from each DB server, the DB A server threshold that is a load threshold of each server is calculated, the processing performance required for the entire data load distribution and arrangement system, and the total number of data that is the total number of the data keys acquired from each of the DB servers, Based on the above, assuming that the number of accesses to the data is equal, calculate the load of each DB server, and for each DB server, calculate the absolute value of the difference between the load and the server threshold, Generating a set of servers S ⁻ where the load is less than or equal to the server threshold and a set of servers S ⁺ where the load exceeds the server threshold; For each of the DB server and the data, by calculating a hash value using a predetermined hash function, an area in charge of the DB server responsible for storing the data is set on the hash space, and the server S ⁺ From the DB server in which the absolute value of the difference between the load and the server threshold is large, the number of data that can be handled in the server S ^{+ in} the hash space is the server threshold of the own DB server. relative excess of greater than, the server S ^- of the set of the virtual server absolute value from large DB server in order to place, the coverage area in said hash space data of the excess to the virtual server And a new storage location indicating the data storage location of each of the DB servers based on the changed area in the hash space. Generating new server charge data information, and transmitting the generated new server charge data information to each of the DB servers, wherein the DB server receives the new server from the virtual server placement device. By receiving the charge data information and comparing it with the stored server charge data information, the data not saved in the storage unit of the DB server and the other DB server that is the save destination of the data can be obtained. By extracting and sending a data exchange request to the other extracted DB server, the data not stored by itself is acquired, and the stored server charge data information is used as the new server charge data information. The data load distribution and placement method is characterized by executing the updating step.

このようにすることで、本実施形態に係るデータ負荷分散配置システムおよびデータ負荷分散配置方法によれば、各ＤＢ（DataBase）サーバのサーバ性能を考慮した上でサーバ閾値を計算し、ハッシュ空間上で、負荷がサーバ閾値を超えるサーバＳ^＋のＤＢサーバの負荷の超過分の領域を、負荷がサーバ閾値以下のサーバＳ⁻のＤＢサーバが、仮想サーバを設けることにより、その領域のデータを、サーバＳ⁻のＤＢサーバが担当する。よって、ＤＢサーバ間の負荷分散を実現し、さらに、各サーバの担当領域に配置する仮想サーバの数が同一に設定される従来技術に比べ、仮想サーバの数を減らすことができる。また、仮想サーバを大量に配置する必要がなくなるため、仮想サーバ保持のための記述量の増加を抑え、クライアントからデータの問い合わせを受けた際に、そのデータを格納している担当サーバの検索に要する時間の増加を抑えることができる。よって、システム全体としてのスループットを向上させることができる。 In this way, according to the data load distribution and arrangement system and the data load distribution and arrangement method according to this embodiment, the server threshold value is calculated in consideration of the server performance of each DB (DataBase) server, in, load a region of excess load of the server S ⁺ DB server exceeding server threshold, load server S follows the server threshold ^- the DB server, by providing the virtual server, the data of the area, server S ^- DB server is in charge of. Therefore, load distribution among DB servers can be realized, and the number of virtual servers can be reduced as compared with the conventional technology in which the number of virtual servers arranged in the assigned area of each server is set to be the same. Also, since there is no need to place a large number of virtual servers, the increase in the amount of description for holding virtual servers is suppressed, and when a data inquiry is received from a client, the server in charge that stores the data is searched. An increase in time required can be suppressed. Therefore, the throughput of the entire system can be improved.

請求項２に記載の発明は、前記ＤＢサーバが、自身が保存する前記データの増減と、前記クライアントからの前記データに対するアクセス数とを監視し、前記データそれぞれへの前記アクセス数を示すアクセス回数情報を生成するアクセス監視部を、さらに備え、前記データ管理部が、所定間隔ごとに、前記記憶部に保存されている前記データの前記データキーと、前記アクセス回数情報とを、前記仮想サーバ配置装置に送信し、前記仮想サーバ配置装置が、仮想サーバ再配置計算部を、さらに備えており、前記サーバ負荷計算部が、前記所定間隔ごとに前記ＤＢサーバそれぞれから取得した前記アクセス回数情報に含まれる、前記データキーの数であるデータ数と、前記データへのアクセス数とを用いて、前記所定間隔ごとの前記ＤＢサーバそれぞれの前記負荷を計算することにより、前記サーバＳ⁻の集合と、前記サーバＳ^＋の集合とを生成し、前記仮想サーバ再配置計算部が、前記サーバＳ⁻の集合のうち、前記ハッシュ空間上に前記仮想サーバが配置されている前記ＤＢサーバについて、前記絶対値が大きいＤＢサーバから順に、前記仮想サーバの担当領域を前記変更前に担当していた前記ＤＢサーバの担当領域に戻したときに、当該ＤＢサーバの負荷が当該ＤＢサーバのサーバ閾値を超えない場合に、前記仮想サーバの担当領域を前記ハッシュ空間上から取り除き、前記ＤＢサーバそれぞれの前記負荷を再計算し、前記ＤＢサーバごとに、前記負荷と前記サーバ閾値との差の絶対値を計算し直すことにより、前記サーバＳ⁻の集合と、前記サーバＳ^＋の集合とを再度生成して、前記配置処理部に引き渡すこと、を特徴とする請求項１に記載のデータ負荷分散配置システムとした。 In the invention according to claim 2, the DB server monitors the increase / decrease of the data stored by itself and the number of accesses to the data from the client, and indicates the number of accesses to each of the data. An access monitoring unit that generates information, and the data management unit is configured to allocate the data key of the data stored in the storage unit and the access count information to the virtual server arrangement at predetermined intervals. The virtual server placement device further includes a virtual server relocation calculation unit, and the server load calculation unit is included in the access count information acquired from each of the DB servers at each predetermined interval. Using the number of data that is the number of the data keys and the number of accesses to the data, By calculating each of the load, the server S ^- a set of said generates a set of servers S ^+, the virtual server rearrangement calculation unit, the server S ^- of the set of the hash space Regarding the DB server on which the virtual server is arranged, when the area in charge of the virtual server is returned to the area in charge of the DB server that was in charge before the change in order from the DB server having the largest absolute value When the load of the DB server does not exceed the server threshold of the DB server, the responsible area of the virtual server is removed from the hash space, the load of each DB server is recalculated, to, by re-calculating the absolute value of the difference between the load and the server threshold value, the server S ^- life and collection of, and collection of the server S ⁺ again The data load distribution and placement system according to claim 1, wherein the data is distributed to the placement processing unit.

また、請求項４に記載の発明は、前記ＤＢサーバが、自身が保存する前記データの増減と、前記クライアントからの前記データに対するアクセス数とを監視し、前記データそれぞれへの前記アクセス数を示すアクセス回数情報を生成するステップと、所定間隔ごとに、前記記憶部に保存されている前記データの前記データキーと、前記アクセス回数情報とを、前記仮想サーバ配置装置に送信するステップとを、さらに実行し、前記仮想サーバ配置装置が、前記所定間隔ごとに前記ＤＢサーバそれぞれから取得した前記アクセス回数情報に含まれる、前記データキーの数であるデータ数と、前記データへのアクセス数とを用いて、前記所定間隔ごとの前記ＤＢサーバそれぞれの前記負荷を計算することにより、前記サーバＳ⁻の集合と、前記サーバＳ^＋の集合とを生成するステップと、前記サーバＳ⁻の集合のうち、前記ハッシュ空間上に前記仮想サーバが配置されている前記ＤＢサーバについて、前記絶対値が大きいＤＢサーバから順に、前記仮想サーバの担当領域を前記変更前に担当していた前記ＤＢサーバの担当領域に戻したときに、当該ＤＢサーバの負荷が当該ＤＢサーバのサーバ閾値を超えない場合に、前記仮想サーバの担当領域を前記ハッシュ空間上から取り除き、前記ＤＢサーバそれぞれの前記負荷を再計算し、前記ＤＢサーバごとに、前記負荷と前記サーバ閾値との差の絶対値を計算し直すことにより、前記サーバＳ⁻の集合と、前記サーバＳ^＋の集合とを再度生成するステップとを、実行すること、を特徴とする請求項３に記載のデータ負荷分散配置方法とした。 According to a fourth aspect of the present invention, the DB server monitors the increase / decrease of the data stored by itself and the number of accesses to the data from the client, and indicates the number of accesses to each of the data. Generating the access count information; and transmitting the data key of the data stored in the storage unit and the access count information to the virtual server placement device at predetermined intervals; And the virtual server placement device uses the number of data that is the number of the data keys and the number of accesses to the data included in the access count information acquired from each of the DB servers at the predetermined intervals. Then, by calculating the load of each of the DB servers at each predetermined interval, the set of servers S ^{− and} the server And generating a set of over server S ^+, the server S ^- of a set of, for the DB server where the virtual server is located on the hash space, in order from the large absolute value DB server, If the load of the DB server does not exceed the server threshold of the DB server when the charge area of the virtual server is returned to the DB server responsible area that was in charge before the change, the virtual server charge By removing the area from the hash space, recalculating the load of each DB server, and recalculating the absolute value of the difference between the load and the server threshold for each DB server, the server S ⁻ The data load distribution and placement method according to claim 3, wherein the step of regenerating the set of and the set of the server S ⁺ is executed. .

このようにすることで、本実施形態に係るデータ負荷分散配置システムおよびデータ負荷分散配置方法によれば、所定間隔ごとに、負荷がサーバ閾値以下のサーバＳ⁻に含まれることになったＤＢサーバのハッシュ空間上での担当領域に、仮想サーバが配置されている場合に、その仮想サーバの担当領域を、元のＤＢサーバの担当領域に戻すことができる。そして、負荷とサーバ閾値との差の絶対値を計算し直すことにより、負荷がサーバ閾値以下のサーバＳ⁻の集合と、負荷がサーバ閾値を超えるサーバＳ^＋の集合とを生成して、再度、ハッシュ空間上での仮想サーバの配置を計算する。これにより、各ＤＢサーバの負荷を均一化させるとともに、ハッシュ空間上に配置する仮想サーバの数を減らすことができる。 By doing in this way, according to the data load distribution and arrangement system and the data load distribution and arrangement method according to the present embodiment, the DB server whose load is included in the server S ⁻ having a server threshold value or less at every predetermined interval. When the virtual server is arranged in the assigned area in the hash space, the assigned area of the virtual server can be returned to the assigned area of the original DB server. Then, by recalculating the absolute value of the difference between the load and the server threshold, a set of servers S ⁻ whose load is less than or equal to the server threshold and a set of servers S ⁺ whose load exceeds the server threshold are generated again. Calculate the placement of the virtual server on the hash space. As a result, the load on each DB server can be made uniform, and the number of virtual servers arranged in the hash space can be reduced.

本発明によれば、仮想サーバの配置数を抑えた上で、サーバ間の負荷分散を実現し、システム全体としてスループットを向上させる、仮想サーバを用いたデータ負荷分散配置システムおよびデータ負荷分散配置方法を提供することができる。 According to the present invention, a data load distribution and arrangement system using a virtual server and a data load distribution and arrangement method for realizing load distribution among servers and improving throughput as a whole system while suppressing the number of arrangement of virtual servers. Can be provided.

本実施形態に係るデータ負荷分散配置システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the data load distribution arrangement system which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置による、ハッシュ空間上での仮想サーバの初期配置の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the initial arrangement | positioning of the virtual server on the hash space by the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置による、ハッシュ空間上での仮想サーバの初期配置の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the initial arrangement | positioning of the virtual server on the hash space by the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置による、ハッシュ空間上での仮想サーバの初期配置の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the initial arrangement | positioning of the virtual server on the hash space by the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置による、ハッシュ空間上での仮想サーバの初期配置の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the initial arrangement | positioning of the virtual server on the hash space by the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係るサーバ担当データ情報のデータ構成の一例を示す図である。It is a figure which shows an example of a data structure of the server charge data information which concerns on this embodiment. 本実施形態に係るアクセス回数情報のデータ構成の一例を示す図である。It is a figure which shows an example of a data structure of the access frequency information which concerns on this embodiment. 本実施形態に係るデータ負荷分散配置処理（仮想サーバの初期配置）の全体の処理の流れを示すシーケンス図である。It is a sequence diagram which shows the flow of the whole process of the data load distribution arrangement | positioning process (initial arrangement | positioning of a virtual server) which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置の仮想サーバ配置処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the virtual server arrangement | positioning process of the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置の仮想サーバ配置処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the virtual server arrangement | positioning process of the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係るデータ負荷分散配置処理（仮想サーバの再配置）の全体の処理の流れを示すシーケンス図である。It is a sequence diagram which shows the flow of the whole process of the data load distribution arrangement | positioning process (virtual server rearrangement) concerning this embodiment. 本実施形態に係る仮想サーバ配置装置による仮想サーバ再配置計算の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the virtual server rearrangement calculation by the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置による仮想サーバ再配置計算の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the virtual server rearrangement calculation by the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置による仮想サーバ再配置計算の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the virtual server rearrangement calculation by the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置による仮想サーバ再配置計算の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the virtual server rearrangement calculation by the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置による仮想サーバ再配置計算の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the virtual server rearrangement calculation by the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置による仮想サーバ再配置計算の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the virtual server rearrangement calculation by the virtual server arrangement | positioning apparatus which concerns on this embodiment. 本実施形態に係る仮想サーバ配置装置の仮想サーバ再配置計算の処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a process of the virtual server rearrangement calculation of the virtual server arrangement | positioning apparatus which concerns on this embodiment. 従来技術におけるコンシステントハッシュ法による仮想サーバの配置方法を説明するための図である。It is a figure for demonstrating the arrangement | positioning method of the virtual server by the consistent hash method in a prior art.

次に、本発明を実施するための形態（以下、「実施形態」という）について、適宜図面を参照しながら詳細に説明する。 Next, modes for carrying out the present invention (hereinafter referred to as “embodiments”) will be described in detail with reference to the drawings as appropriate.

本実施形態に係る仮想サーバを用いたデータ負荷分散配置システム１およびデータ負荷分散配置方法では、（１）各ＤＢサーバ２０のサーバ性能を考慮した仮想サーバの初期配置によるデータ負荷分散配置処理と、（２）システム稼動後の時間経過に伴うデータ数やデータへのアクセス回数の増減に対応した動的な仮想サーバの再配置と、を実現する。まず、本実施形態に係るデータ負荷分散配置システム１の概要について説明する。 In the data load distribution and arrangement system 1 and data load distribution and arrangement method using virtual servers according to the present embodiment, (1) data load distribution and arrangement processing by initial arrangement of virtual servers in consideration of server performance of each DB server 20; (2) Realization of dynamic virtual server relocation corresponding to increase / decrease in the number of data and the number of accesses to data with the passage of time after system operation. First, an outline of the data load distribution arrangement system 1 according to the present embodiment will be described.

図１は、本実施形態に係るデータ負荷分散配置システム１の構成を示すブロック図である。
図１に示すように、本実施形態に係るデータ負荷分散配置システム１は、各データを保存する複数のＤＢサーバ２０と、ハッシュ空間上での仮想サーバの配置による各データの保存先となるＤＢサーバ２０の決定や、そのデータの保存先の変更等を管理する仮想サーバ配置装置１０とが、通信可能に接続されて構成される。この仮想サーバ配置装置１０と各ＤＢサーバ２０それぞれとが通信可能に接続されるとともに、各ＤＢサーバ２０同士も、通信可能に接続される。 FIG. 1 is a block diagram showing a configuration of a data load distribution and arrangement system 1 according to the present embodiment.
As shown in FIG. 1, the data load distribution and arrangement system 1 according to the present embodiment includes a plurality of DB servers 20 that store each data and a DB that is a storage destination of each data by arrangement of virtual servers on a hash space. The server 20 is configured to be communicably connected to the virtual server placement apparatus 10 that manages the determination of the server 20 and the change of the storage destination of the data. The virtual server placement device 10 and each DB server 20 are connected to be communicable, and the DB servers 20 are also connected to be communicable.

また、仮想サーバ配置装置１０は、管理者サーバ５と通信可能に接続されており、仮想サーバをハッシュ空間上に配置するために必要となる、設定情報（後記参照）を管理者サーバ５から受信する。そして、仮想サーバ配置装置１０は、仮想サーバの配置処理の結果、ＤＢサーバ２０が不足する場合には、リソース不足通知を管理者サーバ５へ送信する。 The virtual server placement apparatus 10 is communicably connected to the administrator server 5 and receives setting information (see below) from the administrator server 5 necessary for placing the virtual server on the hash space. To do. The virtual server placement apparatus 10 transmits a resource shortage notification to the administrator server 5 when the DB server 20 is short as a result of the virtual server placement processing.

ＤＢサーバ２０は、クライアント６とも通信可能に接続されている。ＤＢサーバ２０は、クライアント６からのデータ要求を受け付けると、そのデータを保存するＤＢサーバ２０を、後記する記憶部２４内のサーバ担当データ情報１００（図６参照）を参照して検索し、その検索結果を担当サーバ通知として、クライアント６に送信する。そして、クライアント６は、その担当サーバ通知に示されるデータの保存先のＤＢサーバ２０に対して、データ取得要求を送信し、そのデータを取得する。 The DB server 20 is also communicably connected to the client 6. Upon receiving a data request from the client 6, the DB server 20 searches for the DB server 20 that stores the data with reference to server charge data information 100 (see FIG. 6) in the storage unit 24 described later. The search result is transmitted to the client 6 as a server notification in charge. Then, the client 6 transmits a data acquisition request to the DB server 20 that stores the data indicated in the notification of the server in charge, and acquires the data.

次に、本実施形態に係るデータ負荷分散配置システム１の仮想サーバの初期配置の概要について説明する。その後に、データ負荷分散配置システム１の具体的な構成と、データ負荷分散配置方法の具体的な処理の流れについて説明する。 Next, an outline of the initial arrangement of the virtual server of the data load distribution arrangement system 1 according to the present embodiment will be described. After that, a specific configuration of the data load distribution and arrangement system 1 and a specific processing flow of the data load distribution and arrangement method will be described.

（仮想サーバの初期配置の概要）
まず、本実施形態に係るデータ負荷分散配置システム１が仮想サーバの初期配置を行う際の前提となる条件および処理の概要について説明する。 (Outline of initial placement of virtual servers)
First, a description will be given of conditions and an outline of processing that are preconditions when the data load distribution and placement system 1 according to the present embodiment performs initial placement of virtual servers.

本実施形態に係るデータ負荷分散配置システム１では、初期状態として、Ｎ台のＤＢサーバ（物理サーバ）２０にＨ個のデータが保存されているものとする。この場合に、初期状態でのシステム性能要件は、以下の（Ａ）（Ｂ）ようになる。 In the data load distribution and arrangement system 1 according to the present embodiment, it is assumed that H pieces of data are stored in N DB servers (physical servers) 20 as an initial state. In this case, the system performance requirements in the initial state are as follows (A) and (B).

（Ａ）各サーバＳ_ｍにかかる負荷Ｗ_ｍは、そのサーバ性能Ｔ_ｍ以下である。 (A) The load W _m applied to each server S _m is equal to or less than the server performance T _m .

（Ｂ）各サーバＳ_ｍにかかる負荷Ｗ_ｍの合計が、システム全体に求める処理性能Ｒ以下である。 (B) The total load W _m applied to each server S _m is equal to or less than the processing performance R required for the entire system.

この（Ａ）および（Ｂ）のシステム性能条件を満たす限り、クライアント６は、正常なアクセス応答時間内でシステム内のデータへのアクセスが可能である。 As long as the system performance conditions (A) and (B) are satisfied, the client 6 can access data in the system within a normal access response time.

本実施形態に係る仮想サーバの初期配置では、システム全体に対して１秒当たりのシステム最大性能であるＲ回のアクセスがあり、各データへのアクセス回数は均等であるとして、１つのデータ当たりＲ／Ｈ回のアクセスがあると仮定する。 In the initial arrangement of the virtual server according to the present embodiment, there are R accesses that are the system maximum performance per second for the entire system, and it is assumed that the number of accesses to each data is equal, and R per data. Assume that there are / H accesses.

そして、システム性能条件（Ａ）および（Ｂ）を満たしながら、さらに次のシステム性能条件（Ｃ）を満たすように、仮想サーバを配置する。 Then, the virtual server is arranged so as to satisfy the following system performance condition (C) while satisfying the system performance conditions (A) and (B).

（Ｃ）各サーバＳ_ｍにかかる負荷Ｗ_ｍは、そのサーバ閾値α_ｍ以下である。 (C) The load W _m applied to each server S _m is the server threshold value α _m or less.

なお、このサーバ閾値α_ｍは、以下の（式１）により計算される。 The server threshold value α _m is calculated by the following (Equation 1).

ここで、ｃは、負荷閾値定数であり、例えば、サーバ性能Ｔ_ｍの７割をサーバ閾値α_ｍとして設定する場合には、システムの管理者等により「0.7」が、負荷閾値定数ｃとして設定される。
また、負荷Ｗ_ｍは、以下の（式２）により計算される。 Here, c is the load threshold constant setting, for example, 70% of the server performance T _m in the case of setting as a server threshold value alpha _m is the "0.7" by the administrator of the system, as the load threshold constant c Is done.
Further, the load W _m is calculated by the following (Equation 2).

ここで、Ｄ_ｍは、サーバＳ_ｍに保存されるデータ数であり、αＳ_ｍ(i)は、サーバＳ_ｍが保存するデータｄＳ_ｍ(i)へのアクセス回数である。 Here, D _m is the number of data stored in the server S _m , and αS _m (i) is the number of accesses to the data dS _m (i) stored in the server S _m .

本実施形態に係る仮想サーバ配置装置１０は、ハッシュ空間における各サーバＳ_ｍ（ＤＢサーバ２０）の担当領域に、仮想サーバを配置し、サーバＳ_ｍ（物理サーバ）の担当領域の一部を仮想サーバに担当させる。これにより、そのサーバＳ_ｍ（物理サーバ）の担当データ（ＤＢサーバ２０が保存するデータ）の一部を、仮想サーバの基となる他の物理サーバ（ＤＢサーバ２０）に変更すること、つまり、他のＤＢサーバ２０にそのデータを移して保存させることが可能となる。そして、仮想サーバ配置装置１０は、この仮想サーバを用いて、ハッシュ空間における負荷の大きい物理サーバ（ＤＢサーバ２０）の担当領域内に、負荷の小さい物理サーバ（ＤＢサーバ２０）の仮想サーバを割り当てることで、システム全体の負荷分散を行う。なお、本実施形態において、ＤＢサーバ２０をハッシュ空間に配置した場合に、ＤＢサーバ２０を、仮想サーバに対応させる意味において物理サーバということがある。 The virtual server placement apparatus 10 according to the present embodiment places a virtual server in the area in charge of each server S _m (DB server 20) in the hash space, and virtualizes a part of the area in charge of the server S _m (physical server). Let the server take charge. Thereby, a part of the data in charge of the server S _m (physical server) (data stored by the DB server 20) is changed to another physical server (DB server 20) that is the basis of the virtual server, that is, The data can be moved and stored in another DB server 20. Then, the virtual server placement apparatus 10 uses this virtual server to allocate the virtual server of the physical server (DB server 20) with a low load in the area in charge of the physical server (DB server 20) with a high load in the hash space. By doing so, load distribution of the entire system is performed. In the present embodiment, when the DB server 20 is arranged in a hash space, the DB server 20 may be referred to as a physical server in the sense of corresponding to a virtual server.

仮想サーバ配置装置１０は、各ＤＢサーバ２０からサーバ性能Ｔ_ｍを取得し、管理者サーバ５から負荷閾値定数ｃを取得するなどして、（式１）を用いて、サーバ閾値α_ｍを計算する。そして、仮想サーバ配置装置１０は、負荷Ｗ_ｍがかかっている物理サーバＳ_ｍに対し、Ｏ_ｍを以下のように定義する。 The virtual server placement apparatus 10 calculates the server threshold value α _m using (Equation 1) by acquiring the server performance T _m from each DB server 20 and the load threshold constant c from the administrator server 5. To do. Then, the virtual server placement apparatus 10 defines O _m for the physical server S _{m on} which the load W _m is applied as follows.

ここで、Ｏ_ｍは、物理サーバＳ_ｍの負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値である。 Here, _{O m} is the absolute value of the difference between the load _{W m} and the server threshold value alpha _m of physical server _{S m.}

そして、仮想サーバ配置装置１０は、物理サーバの集合Ｓを、以下のようにＳ⁻，Ｓ^＋の２つの集合に分割する。 Then, the virtual server placement apparatus 10 divides the set S of physical servers into two sets S ⁻ and S ⁺ as follows.

ここで、負荷がサーバ閾値以下の物理サーバの集合を、Ｓ⁻とする。そして、Ｓ⁻に含まれる物理サーバＳ_ｍをＯ_ｍの大きい順に並べたものを以下のように記述する。 Here, the load is a set of the following physical server servers threshold, S ^- to. Then, S ^- described as follows an ordered physical servers S _m included in the descending order of O _m.

一方、負荷がサーバ閾値を超える物理サーバの集合を、Ｓ^＋とする。そして、Ｓ^＋に含まれる物理サーバＳ_ｍをＯ_ｍの大きい順に並べたものを以下のように記述する。 On the other hand, a set of physical servers whose load exceeds the server threshold is defined as S ⁺ . Then, the physical servers S _m included in S ⁺ arranged in order of increasing O _m are described as follows.

また、負荷がサーバ閾値以下のサーバＳ⁻の絶対値Ｏ⁻ _kj、負荷がサーバ閾値を超えるサーバＳ^＋の絶対値Ｏ^＋ _liを、以下のように定義する。 Further, the absolute value O ⁻ _kj of the server S ⁻ whose load is equal to or less than the server threshold and the absolute value O ⁺ _li of the server S ⁺ whose load exceeds the server threshold are defined as follows.

次に、仮想サーバ配置装置１０による、ハッシュ空間上での仮想サーバの初期配置の概要について説明する。図２〜図５は、本実施形態に係る仮想サーバ配置装置１０による、ハッシュ空間上での仮想サーバの初期配置の概要を説明するための図である。 Next, an outline of the initial placement of the virtual server on the hash space by the virtual server placement device 10 will be described. 2-5 is a figure for demonstrating the outline | summary of the initial arrangement | positioning of the virtual server on the hash space by the virtual server arrangement | positioning apparatus 10 which concerns on this embodiment.

ここでは、図２（ｂ）に示すように、仮想サーバ配置装置１０により、負荷がサーバ閾値以下のサーバＳ⁻が、Ｓ_３，Ｓ_４，Ｓ_５のように、負荷とサーバ閾値との差の絶対値Ｏ⁻の大きい順に並べられ、負荷がサーバ閾値を超えるサーバＳ^＋が、Ｓ_１，Ｓ_２のように、負荷とサーバ閾値との差の絶対値Ｏ^＋の大きい順に並べられているものとする。そして、この各サーバＳ_１〜Ｓ_５とその担当データとが、仮想サーバ配置装置１０により、図２（ａ）に示すように、ハッシュ空間上に配置されるとする。 Here, as shown in FIG. 2B, the virtual server placement apparatus 10 causes the server S ⁻ whose load is equal to or less than the server threshold to be the difference between the load and the server threshold, as in S ₃ , S ₄ , S _5. Are arranged in descending order of the absolute value O ⁻ , and servers S ⁺ whose loads exceed the server threshold are arranged in descending order of the absolute value O ⁺ of the difference between the load and the server threshold as S ₁ and S ₂ . Shall. Then, it is assumed that the servers S _{1 to} S ₅ and the data in charge thereof are arranged on the hash space by the virtual server arrangement device 10 as shown in FIG.

図２（ａ）に示すような状態から、仮想サーバ配置装置１０は、負荷とサーバ閾値との差の絶対値が大きい順に、負荷がサーバ閾値を超えるサーバＳ^＋の負荷の超過分の領域に、負荷がサーバ閾値以下のサーバＳ⁻の仮想サーバを配置することで、負荷がサーバ閾値を超えるサーバＳ^＋の負荷の超過分のデータの担当を、負荷がサーバ閾値以下のサーバＳ⁻に変更する。 From the state shown in FIG. 2A, the virtual server placement apparatus 10 increases the load of the server S ^{+ in which} the load exceeds the server threshold in the descending order of the absolute value of the difference between the load and the server threshold. By placing a virtual server of the server S ⁻ whose load is below the server threshold, the responsibility for the excess data of the server S ⁺ whose load exceeds the server threshold is changed to the server S ⁻ whose load is below the server threshold To do.

具体的には、図３に示すように、まず、負荷がサーバ閾値を超えるサーバＳ^＋のうち、絶対値Ｏ^＋が一番大きいＳ_１のサーバの負荷の超過分の領域に、負荷がサーバ閾値以下のサーバＳ⁻のうち、絶対値Ｏ⁻が一番大きいサーバＳ_３の仮想サーバＶＳ_３を配置し、その領域のデータをサーバＳ_３が担当する。 Specifically, as shown in FIG. 3, first, the load of the servers S ⁺ exceeding server threshold, the area of the excess of the absolute value O ⁺ server load largest S _1, the load server The virtual server VS ₃ of the server S ₃ having the largest absolute value O ⁻ among the servers S ⁻ below the threshold is arranged, and the server S ₃ takes charge of the data in the area.

次に、図４に示すように、サーバＳ_１の残りの負荷の超過分の領域に、負荷がサーバ閾値以下の次のサーバであるサーバＳ_４の仮想サーバＶＳ_４を配置し、その領域のデータをサーバＳ_４が担当する。これによって、サーバＳ_１の負荷の超過分が解消される。続いて、負荷がサーバ閾値を超える次のサーバであるサーバＳ_２の負荷の超過分の領域に、負荷がサーバ閾値以下のサーバＳ_４の残りの余剰分を、仮想サーバＶＳ_４として配置し、その領域のデータをサーバＳ_４が担当する。 Next, as shown in FIG. 4, the virtual server VS ₄ of the server S ₄ , which is the next server whose load is equal to or less than the server threshold, is arranged in the area where the remaining load of the server S ₁ is exceeded. the data server _{S 4} is in charge. Thus, excess of load on the server S ₁ is being eliminated. Subsequently, the remaining surplus portion of the server S ₄ whose load is equal to or less than the server threshold is arranged as a virtual server VS _{4 in} the region where the load of the server S ₂ which is the next server whose load exceeds the server threshold value. the data of the area server S ₄ is in charge.

続いて、図５に示すように、サーバＳ_２の残りの負荷の超過分の領域に、負荷がサーバ閾値以下の次のサーバであるサーバＳ_５の仮想サーバＶＳ_５を配置し、その領域のデータをサーバＳ_５が担当する。これによって、サーバＳ_２の負荷の超過分が解消される。なお、仮想サーバ配置装置１０は、この仮想サーバの配置処理の結果、負荷がサーバ閾値以下のサーバＳ⁻が不足する場合には、リソース不足通知を管理者サーバ５へ送信する。 Subsequently, as shown in FIG. 5, the virtual server VS ₅ of the server S ₅ , which is the next server whose load is equal to or less than the server threshold, is arranged in the region where the remaining load of the server S ₂ is exceeded. the data server _{S 5} in charge. Thus, excess of load on the server S ₂ is eliminated. Note that the virtual server placement apparatus 10 transmits a resource shortage notification to the administrator server 5 when the server S ⁻ whose load is equal to or less than the server threshold is insufficient as a result of the virtual server placement processing.

本実施形態に係る仮想サーバ配置装置１０は、このように、ハッシュ空間上で、負荷がサーバ閾値を超えるサーバＳ^＋の負荷の超過分の領域を、負荷がサーバ閾値以下のサーバＳ⁻が、仮想サーバを設けることにより、その領域のデータを、負荷がサーバ閾値以下のサーバＳ⁻のサーバが担当する。よって、サーバ間の負荷分散を実現し、仮想サーバの数を減らすことができる。 As described above, the virtual server arrangement device 10 according to the present embodiment has an excess load area of the server S ⁺ whose load exceeds the server threshold on the hash space, and the server S ⁻ whose load is equal to or less than the server threshold. by providing the virtual server, the data of the region, the load is less than the server threshold value server S ^- server is responsible for. Therefore, load distribution among servers can be realized and the number of virtual servers can be reduced.

（データ負荷分散配置システムの構成）
次に、本実施形態に係るデータ負荷分散配置システム１を構成するＤＢサーバ２０および仮想サーバ配置装置１０の構成例について詳細に説明する。 (Data load distribution system configuration)
Next, a configuration example of the DB server 20 and the virtual server arrangement device 10 configuring the data load distribution arrangement system 1 according to the present embodiment will be described in detail.

＜ＤＢサーバの構成＞
まず、本実施形態に係るＤＢサーバ２０の構成例について、図１を参照して説明する。
ＤＢサーバ２０は、データを自身が備える記憶手段（後記する記憶部２４）に保存しておき、クライアント６からのデータ取得要求に応じて、その要求のあったデータをクライアント６へ送信する。また、ＤＢサーバ２０は、仮想サーバ配置装置１０から、各データの保存先となるＤＢサーバ２０を示すサーバ担当データ情報１００（詳細は後記する図６参照）を受信すると、そのサーバ担当データ情報１００を参照して、自身が担当すべきデータを記憶する他のＤＢサーバ２０から、そのデータを取得し保存する処理を行う。
このＤＢサーバ２０は、図１に示すように、処理部２１と、通信部２２と、メモリ部２３と、記憶部２４とを備える。 <DB server configuration>
First, a configuration example of the DB server 20 according to the present embodiment will be described with reference to FIG.
The DB server 20 stores the data in a storage means (storage unit 24 described later) provided in the DB server 20 and transmits the requested data to the client 6 in response to a data acquisition request from the client 6. Further, when the DB server 20 receives from the virtual server placement apparatus 10 the server charge data information 100 (see FIG. 6 to be described later in detail) indicating the DB server 20 that is the storage destination of each data, the server charge data information 100 Referring to FIG. 4, a process of acquiring and storing the data from the other DB server 20 storing the data to be handled by itself is performed.
As illustrated in FIG. 1, the DB server 20 includes a processing unit 21, a communication unit 22, a memory unit 23, and a storage unit 24.

通信部２２は、クライアント６、自身以外の他のＤＢサーバ２０および仮想サーバ配置装置１０との間の通信を司る、通信インタフェースにより構成される。 The communication unit 22 includes a communication interface that controls communication between the client 6, the DB server 20 other than itself, and the virtual server placement apparatus 10.

処理部２１は、ＤＢサーバ２０全体の制御を司り、情報受信部２１１と、データ管理部２１２と、アクセス監視部２１４と、データ交換部２１５と、情報送信部２１６とを含んで構成される。なお、この処理部２１は、例えば、ＤＢサーバ２０の記憶部２４に格納されたプログラムをＣＰＵ（Central Processing Unit）が、メモリ部２３であるＲＡＭ（Random Access Memory）に展開し実行することで実現される。 The processing unit 21 controls the entire DB server 20 and includes an information receiving unit 211, a data management unit 212, an access monitoring unit 214, a data exchange unit 215, and an information transmission unit 216. The processing unit 21 is realized by, for example, a CPU (Central Processing Unit) developing and executing a program stored in the storage unit 24 of the DB server 20 in a RAM (Random Access Memory) that is the memory unit 23. Is done.

情報受信部２１１は、通信部２２を介して、クライアント６からのデータ要求や、仮想サーバ配置装置１０からのサーバ担当データ情報１００（図６参照）、他のＤＢサーバ２０から送信されたデータ等を取得する。 The information receiving unit 211 receives a data request from the client 6, server data information 100 (see FIG. 6) from the virtual server placement apparatus 10, data transmitted from another DB server 20, etc. via the communication unit 22. To get.

データ管理部２１２は、記憶部２４に記憶されたデータの管理全般を司る。具体的には、データ管理部２１２は、クライアント６からのデータ要求を、情報受信部２１１を介して受け付けると、記憶部２４内のサーバ担当データ情報１００を参照し、そのデータを記憶するＤＢサーバ２０を検索する。そして、その検索結果を、担当サーバ通知として、情報送信部２１６を介して、クライアント６に送信する。 The data management unit 212 manages the overall management of data stored in the storage unit 24. Specifically, when the data management unit 212 receives a data request from the client 6 via the information reception unit 211, the data management unit 212 refers to the server charge data information 100 in the storage unit 24 and stores the data. 20 is searched. Then, the search result is transmitted to the client 6 via the information transmission unit 216 as a server notification in charge.

図６は、本実施形態に係るサーバ担当データ情報１００のデータ構成の一例を示す図である。
図６に示すように、サーバ担当データ情報１００は、各ＤＢサーバ（サーバＳ_ｍ）２０ごとに、そのサーバＳ_ｍが保存する担当データが記憶される。例えば、サーバＳ_１は、「ｄ１〜ｄ３，ｄ６」のデータを保存することを示している。なお、この担当データには、各データのデータキーが「ｄ１」「ｄ２」…、等として記憶される。なお、このサーバ担当データ情報１００には、仮想サーバ配置装置１０による仮想サーバの初期配置の処理の初期値として、現時点においてＤＢサーバ２０ごとに保存される各データの情報が記憶されているものとする。 FIG. 6 is a diagram illustrating an example of a data configuration of the server charge data information 100 according to the present embodiment.
As shown in FIG. 6, the server handled data information 100, the DB server for each (server _S m) 20, charge data that the server _{S m} is saved is stored. For example, the server S ₁ indicates that data “d1 to d3, d6” is stored. Note that the data key of each data is stored as “d1,” “d2,”. The server charge data information 100 stores information on each data stored for each DB server 20 at the present time as an initial value of the virtual server initial placement processing by the virtual server placement device 10. To do.

また、データ管理部２１２は、図１に示すように、ハッシュ値計算部２１３を備えている。そして、データ管理部２１２は、通信部２２や、不図示の入力部を介して、新たなデータを取得すると、そのデータのデータキーに対して、ハッシュ値計算部２１３が、所定のハッシュ関数によりハッシュ値を計算する。データ管理部２１２は、そのデータのデータキーと、そのハッシュ値と、そのデータ（データ値）とを対応付けて、記憶部２４内の後記するデータ格納部２００に記憶する。 The data management unit 212 includes a hash value calculation unit 213 as shown in FIG. When the data management unit 212 acquires new data via the communication unit 22 or an input unit (not shown), the hash value calculation unit 213 performs a predetermined hash function on the data key of the data. Calculate the hash value. The data management unit 212 associates the data key of the data, the hash value, and the data (data value), and stores them in the data storage unit 200 described later in the storage unit 24.

そして、データ管理部２１２は、クライアント６から自身が保存するデータに対するデータ取得要求を受け付けると、そのデータ取得要求に付されたデータキーについて、ハッシュ値計算部２１３がハッシュ値を計算する。次に、データ管理部２１２は、そのデータキーのハッシュ値を用いて、データ格納部２００に記憶されたデータ（データ値）を検索し、検索したデータを、情報送信部２１６を介して、クライアント６へ送信する。 When the data management unit 212 receives a data acquisition request for data stored by itself from the client 6, the hash value calculation unit 213 calculates a hash value for the data key attached to the data acquisition request. Next, the data management unit 212 searches for the data (data value) stored in the data storage unit 200 using the hash value of the data key, and the searched data is sent to the client via the information transmission unit 216. 6 to send.

また、データ管理部２１２は、仮想サーバ配置装置１０による仮想サーバの初期配置の処理に際して、記憶部２４内のデータ格納部２００を参照し、自身が保存するすべてのデータのデータキーと、予め記憶部２４等に保存されている自身のサーバ性能Ｔ_ｍとを、自身のＤＢサーバ２０に固有な番号と共に、情報送信部２１６を介して仮想サーバ配置装置１０に送信する。なお、自身のＤＢサーバ２０に固有な番号とは、例えば、そのＤＢサーバ２０のＩＰアドレス等であるが、これに限定されず、他のＤＢサーバ２０等と識別可能な番号であればよい。 In addition, the data management unit 212 refers to the data storage unit 200 in the storage unit 24 when the virtual server placement apparatus 10 performs the initial placement processing of the virtual server, and stores in advance the data keys of all the data stored by itself. The server performance T _m stored in the unit 24 and the like is transmitted to the virtual server placement apparatus 10 via the information transmitting unit 216 together with a number unique to the own DB server 20. The number unique to the own DB server 20 is, for example, the IP address of the DB server 20, but is not limited thereto, and may be any number that can be distinguished from other DB servers 20.

また、データ管理部２１２は、システム稼働後、所定間隔Ｚごとに、自身がその時点で保存するすべてのデータのデータキーと、アクセス監視部２１４が生成したアクセス回数情報３００（後記する図７参照）とを、自身のＤＢサーバ２０に固有な番号と共に、情報送信部２１６を介して仮想サーバ配置装置１０に送信する。
さらに、データ管理部２１２は、仮想サーバ配置装置１０から、新たなサーバ担当データ情報１００を取得すると、記憶部２４内のサーバ担当データ情報１００を更新する処理を行う。 Further, the data management unit 212, after the system operation, at every predetermined interval Z, the data key of all data stored at that time and the access count information 300 generated by the access monitoring unit 214 (see FIG. 7 described later). ) Together with a number unique to its own DB server 20, to the virtual server placement apparatus 10 via the information transmission unit 216.
Furthermore, when the data management unit 212 obtains new server handling data information 100 from the virtual server placement apparatus 10, the data management unit 212 performs a process of updating the server handling data information 100 in the storage unit 24.

アクセス監視部２１４は、データ管理部２１２を監視し、クライアント６からのデータ取得要求を受け付けるごとに、データ格納部２００に記憶された各データへのアクセス回数をカウントし、その結果を、アクセス回数情報３００として記憶する。 The access monitoring unit 214 monitors the data management unit 212 and counts the number of accesses to each data stored in the data storage unit 200 every time a data acquisition request from the client 6 is received. Store as information 300.

図７は、本実施形態に係るアクセス回数情報３００のデータ構成の一例を示す図である。
図７に示すように、アクセス回数情報３００には、各ＤＢサーバ２０自身が記憶するデータごとに、クライアント６からアクセスが何回あったかを示す情報であり、各データのデータキーと、そのデータのアクセス回数が記憶される。なお、このアクセス回数情報３００は、データ管理部２１２により、仮想サーバ配置装置１０へ所定間隔Ｚごとに送信される。 FIG. 7 is a diagram illustrating an example of a data configuration of the access count information 300 according to the present embodiment.
As shown in FIG. 7, the access count information 300 is information indicating the number of accesses from the client 6 for each data stored in each DB server 20 itself, the data key of each data, and the data The number of accesses is stored. The access count information 300 is transmitted by the data management unit 212 to the virtual server placement apparatus 10 at predetermined intervals Z.

データ交換部２１５は、データ管理部２１２が仮想サーバ配置装置１０から新たなサーバ担当データ情報１００を受信したことを契機として、その新たなサーバ担当データ情報１００と記憶部２４に保存していたサーバ担当データ情報１００と比較することにより、自身が保存していないデータおよびそのデータの保存先となる他のＤＢサーバ２０を抽出する。そして、データ交換部２１５は、その抽出した自身が担当すべきデータを記憶している他のＤＢサーバ２０に対して、データ交換要求を送信することにより、そのデータの送信を依頼し、担当すべきデータを取得する。続いて、データ交換部２１５は、記憶部２４に保存していたサーバ担当データ情報１００を、データ管理部２１２を介して、受信した新たなサーバ担当データ情報１００に更新させる。 The data exchange unit 215 receives the server management data information 100 from the virtual server placement apparatus 10 by the data management unit 212 as a trigger, and the server stored in the new server management data information 100 and the storage unit 24. By comparing with the responsible data information 100, data that is not stored by itself and another DB server 20 that is the storage destination of the data are extracted. Then, the data exchange unit 215 requests the transmission of the data by sending a data exchange request to the other DB server 20 storing the data to be handled by the extracted data 215, and is in charge of it. Get power data. Subsequently, the data exchange unit 215 updates the server charge data information 100 stored in the storage unit 24 to the received new server charge data information 100 via the data management unit 212.

情報送信部２１６は、通信部２２を介して、クライアント６への担当サーバ通知や、仮想サーバ配置装置１０への自身が保存するデータに関する情報、他のＤＢサーバ２０へのデータ交換要求等を送信する。 The information transmission unit 216 transmits, via the communication unit 22, notification of the server in charge to the client 6, information regarding data stored by itself to the virtual server placement apparatus 10, data exchange requests to other DB servers 20, and the like. To do.

次に、記憶部２４は、ハードディスクやフラッシュメモリ等の記憶装置からなり、サーバ担当データ情報１００、データ格納部２００、アクセス回数情報３００を記憶している。 Next, the storage unit 24 includes a storage device such as a hard disk or a flash memory, and stores server-in-charge data information 100, a data storage unit 200, and access count information 300.

メモリ部２３は、ＲＡＭ等の一次記憶装置からなり、処理部２１によるデータ処理に必要な情報を一次的に記憶している。 The memory unit 23 includes a primary storage device such as a RAM, and temporarily stores information necessary for data processing by the processing unit 21.

＜仮想サーバ配置装置の構成＞
次に、本実施形態に係る仮想サーバ配置装置１０の構成について、図１を参照して説明する。
仮想サーバ配置装置１０は、データを複数のＤＢサーバ２０に分散して保存させるために、データ（データキー）とＤＢサーバ（固有の番号）２０とをハッシュ関数にかけてリング状のハッシュ空間に配置する。そして、仮想サーバ配置装置１０は、各ＤＢサーバ２０のサーバ性能Ｔ_ｍや、時間経過に伴うサーバ負荷の変動を考慮した上で、仮想サーバをハッシュ空間上に配置する。仮想サーバ配置装置１０は、仮想サーバの配置により決定した各ＤＢサーバ２０が保存すべきデータの情報（サーバ担当データ情報１００）を、各ＤＢサーバ２０それぞれに送信する。 <Configuration of virtual server placement device>
Next, the configuration of the virtual server placement apparatus 10 according to the present embodiment will be described with reference to FIG.
The virtual server arrangement device 10 arranges data (data key) and DB server (unique number) 20 in a ring-shaped hash space by applying a hash function in order to distribute and store the data in a plurality of DB servers 20. . Then, the virtual server placement device 10 and server performance the T _m of each of the DB server 20, in consideration of variations in the server load over time, placing the virtual server on the hash space. The virtual server placement apparatus 10 transmits information on data to be stored by each DB server 20 determined by the placement of the virtual server (server-in-charge data information 100) to each DB server 20.

この仮想サーバ配置装置１０は、図１に示すように、制御部１１と、通信部１２と、メモリ部１３と、記憶部１４とを備える。 As illustrated in FIG. 1, the virtual server arrangement device 10 includes a control unit 11, a communication unit 12, a memory unit 13, and a storage unit 14.

通信部１２は、各ＤＢサーバ２０および管理者サーバ５との間の通信を司る、通信インタフェースにより構成される。 The communication unit 12 includes a communication interface that manages communication between each DB server 20 and the administrator server 5.

制御部１１は、仮想サーバ配置装置１０全体の制御を司り、情報受信部１１１と、サーバ負荷計算部１１２と、配置処理部１１３と、仮想サーバ再配置計算部１１５と、情報送信部１１６とを含んで構成される。なお、この制御部１１は、例えば、仮想サーバ配置装置１０の記憶部１４に格納されたプログラムをＣＰＵが、メモリ部１３であるＲＡＭに展開し実行することで実現される。 The control unit 11 controls the entire virtual server placement apparatus 10, and includes an information reception unit 111, a server load calculation unit 112, a placement processing unit 113, a virtual server relocation calculation unit 115, and an information transmission unit 116. Consists of including. The control unit 11 is realized, for example, when the CPU stores the program stored in the storage unit 14 of the virtual server placement apparatus 10 in the RAM that is the memory unit 13 and executes the program.

情報受信部１１１は、通信部１２を介して、ＤＢサーバ２０それぞれから、各ＤＢサーバ２０に記憶されたデータ等に関する情報を受信したり、管理者サーバ５から、仮想サーバの配置に必要となる設定情報等を取得する。 The information receiving unit 111 receives information related to data stored in each DB server 20 from each DB server 20 via the communication unit 12, or is necessary for placement of the virtual server from the administrator server 5. Get setting information.

サーバ負荷計算部１１２は、各ＤＢサーバ２０のサーバ閾値α_ｍおよび負荷Ｗ_ｍを計算する。
具体的には、サーバ負荷計算部１１２は、仮想サーバの初期配置の処理において、管理者サーバ５等から、システム全体に求める処理性能Ｒおよび負荷閾値定数ｃを取得し、ＤＢサーバ２０から各ＤＢサーバ２０のサーバ性能Ｔ_ｍおよびデータキーを取得する。そして、サーバ負荷計算部１１２は、各ＤＢサーバ２０のサーバ性能Ｔ_ｍと負荷閾値定数ｃにより、（式１）を用いて、各ＤＢサーバ２０のサーバ閾値α_ｍを計算する。また、サーバ負荷計算部１１２は、各ＤＢサーバ２０から取得したデータキーの数を合計し、システム全体のデータ数Ｈを計算する。そして、各データへのアクセス回数が均等であると仮定し、１つのデータへのアクセス回数をＲ／Ｈ回として、（式２）を用いて、各ＤＢサーバ２０の負荷Ｗ_ｍを計算する。 The server load calculation unit 112 calculates the server threshold value α _m and the load W _m of each DB server 20.
Specifically, the server load calculation unit 112 acquires the processing performance R and the load threshold constant c required for the entire system from the administrator server 5 or the like in the process of initial placement of the virtual server, and each DB from the DB server 20. The server performance T _m and the data key of the server 20 are acquired. Then, the server load calculation unit 112 calculates the server threshold value α _m of each DB server 20 using (Equation 1) based on the server performance T _m of each DB server 20 and the load threshold constant c. Further, the server load calculation unit 112 sums the number of data keys acquired from each DB server 20 and calculates the number of data H of the entire system. Then, assuming that the number of accesses to each data is equal, the number of accesses to one data is R / H times, and the load W _m of each DB server 20 is calculated using (Equation 2).

そして、サーバ負荷計算部１１２は、計算したＤＢサーバ２０ごとの負荷Ｗ_ｍとサーバ閾値α_ｍとを比較し、負荷がサーバ閾値以下のサーバＳ⁻の集合と、負荷がサーバ閾値を超えるサーバＳ^＋の集合とを生成して、その集合ごとに、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順にサーバを整列する処理を行う。 Then, the server load calculation unit 112 compares the calculated load W _m for each DB server 20 with the server threshold value α _m, and the set of servers S ⁻ whose load is equal to or less than the server threshold value and the server S whose load exceeds the server threshold value. A set of ⁺ is generated, and for each set, the servers are arranged in descending order of the absolute value of the difference between the load W _m and the server threshold value α _m .

また、サーバ負荷計算部１１２は、仮想サーバの再配置の処理において、所定間隔Ｚごとに各ＤＢサーバ２０が記憶するデータのデータキーと、そのデータに対するアクセス回数を記録したアクセス回数情報３００を、各ＤＢサーバ２０から取得する。そして、（式２）を用いて、各ＤＢサーバ２０の負荷Ｗ_ｍを計算する。
また、サーバ負荷計算部１１２は、計算した所定間隔Ｚ経過後のＤＢサーバ２０ごとの負荷Ｗ_ｍとサーバ閾値α_ｍとを比較し、負荷がサーバ閾値以下のサーバＳ⁻の集合と、負荷がサーバ閾値を超えるサーバＳ^＋の集合とを生成して、その集合ごとに、所定間隔Ｚ経過時点における、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順にサーバを整列する処理を行う。 Further, the server load calculation unit 112, in the virtual server rearrangement process, obtains the data key of the data stored in each DB server 20 for each predetermined interval Z and the access count information 300 that records the access count for the data. Obtained from each DB server 20. Then, using the equation (2), calculates the load _{W m} of the DB server 20.
Further, the server load calculation unit 112 compares the calculated load W _m for each DB server 20 after the elapse of the predetermined interval Z with the server threshold value α _m, and the set of servers S ⁻ whose load is equal to or less than the server threshold value and the load A process of generating a set of servers S ⁺ exceeding the server threshold and arranging the servers in the descending order of the absolute value of the difference between the load W _m and the server threshold α _m when the predetermined interval Z elapses. Do.

次に、配置処理部１１３は、ハッシュ値計算部１１４を備えており、コンシステントハッシュ法（時計回り方式）を用いて、各ＤＢサーバ（固有の番号）２０およびデータ（データキー）のハッシュ値を計算し、ハッシュ空間上に各ＤＢサーバ２０およびデータを配置する。
そして、配置処理部１１３は、負荷がサーバ閾値を超えるサーバＳ^＋のうち、絶対値Ｏ^＋が大きいＤＢサーバ２０から順に、ハッシュ空間上の担当領域の超過分に対して、負荷がサーバ閾値以下のサーバＳ⁻のうち、絶対値Ｏ⁻が大きいＤＢサーバ２０から順に仮想サーバを配置することで、負荷がサーバ閾値を超えるサーバＳ^＋の超過分を解消する処理（仮想サーバ配置処理）を行う。なお、この仮想サーバ配置処理の詳細については、図９および図１０を用いて説明する。 Next, the arrangement processing unit 113 includes a hash value calculation unit 114, and uses a consistent hash method (clockwise method) to hash values of each DB server (unique number) 20 and data (data key). And each DB server 20 and data are arranged on the hash space.
The placement processing unit 113 then loads the server S ⁺ with the load exceeding the server threshold, in order from the DB server 20 with the largest absolute value O ⁺ , the load is less than the server threshold with respect to the excess of the assigned area in the hash space. Among the servers S ⁻ , the virtual server is arranged in order from the DB server 20 having the largest absolute value O ^−, thereby performing a process (virtual server arrangement process) for eliminating the excess of the server S ⁺ whose load exceeds the server threshold. . Details of the virtual server arrangement processing will be described with reference to FIGS. 9 and 10.

配置処理部１１３は、負荷がサーバ閾値を超えるサーバＳ^＋のすべての超過分について、仮想サーバを配置することにより決定した、各ＤＢサーバ２０が担当するデータについての新たなサーバ担当データ情報１００を生成する。そして、配置処理部１１３は、情報送信部１１６を介して、各ＤＢサーバ２０それぞれに、新たなサーバ担当データ情報１００を送信する。 The placement processing unit 113 obtains new server charge data information 100 for data handled by each DB server 20 determined by placing virtual servers for all excesses of the server S ⁺ whose load exceeds the server threshold. Generate. Then, the arrangement processing unit 113 transmits the new server charge data information 100 to each DB server 20 via the information transmission unit 116.

また、配置処理部１１３は、仮想サーバを配置しても、負荷がサーバ閾値を超えるサーバＳ^＋の超過分を解消できない場合には、情報送信部１１６を介して、管理者サーバ５に、ＤＢサーバ２０のリソースが足りないことを示すリソース不足通知を送信する。 Further, if the placement processing unit 113 cannot solve the excess of the server S ⁺ whose load exceeds the server threshold even if the virtual server is placed, the placement processing unit 113 sends the DB to the administrator server 5 via the information transmission unit 116. A resource shortage notification indicating that the resources of the server 20 are insufficient is transmitted.

次に、仮想サーバ再配置計算部１１５は、仮想サーバの再配置の処理において、サーバ負荷計算部１１２が、所定間隔Ｚ経過時点における、負荷がサーバ閾値以下のサーバＳ⁻と、負荷がサーバ閾値を超えるサーバＳ^＋とについて、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順にサーバを整列する処理が行ったことを契機として、所定間隔Ｚ経過時点において、負荷がサーバ閾値以下のサーバＳ⁻に、ハッシュ空間上で仮想サーバが配置されていた場合に、その仮想サーバを取り除く処理（仮想サーバ再配置計算）を行う。つまり、元々そのＤＢサーバ２０自身が担当すべきハッシュ空間の担当領域を、そのＤＢサーバ２０の負荷Ｗ_ｍがサーバ閾値α_ｍより大きいため、他のＤＢサーバ２０が仮想サーバを配置することにより、データの保存先を変更してもらっていたものを、時間の経過により、サーバの負荷が変化し、負荷に余裕が生じたため、仮想サーバが担当していたデータ分を、元々のＤＢサーバ２０の担当領域に戻す処理を行う。なお、この仮想サーバ再配置計算の詳細は、後記する図１２〜図１８を参照して説明する。 Next, in the virtual server relocation processing, the virtual server relocation calculation unit 115 determines that the server load calculation unit 112 has a server S ⁻ whose load is equal to or less than the server threshold when the predetermined interval Z has elapsed, and the load is the server threshold. For the server S ⁺ exceeding the load, the load is less than or equal to the server threshold at the time when the predetermined interval Z has elapsed, as a result of performing the process of arranging the servers in descending order of the absolute value of the difference between the load W _m and the server threshold α _m. server S ^- to, if the virtual server has been arranged on the hash space, performs processing (virtual server relocation calculation) to remove the virtual server. In other words, since the load W _m of the DB server 20 is larger than the server threshold value α _m , the area of the hash space that the DB server 20 should originally be responsible for is allocated to another DB server 20 by placing a virtual server. As the data storage destination was changed, the load on the server changed over time, and there was a margin in the load, so the data that was handled by the virtual server was handled by the original DB server 20 Perform processing to return to the area. Details of the virtual server relocation calculation will be described with reference to FIGS.

情報送信部１１６は、通信部１２を介して、各ＤＢサーバ２０それぞれに、配置処理部１１３が生成したサーバ担当データ情報１００を送信する。また、情報送信部１１６は、管理者サーバ５に対し、リソース不足通知を送信する。 The information transmission unit 116 transmits the server handling data information 100 generated by the arrangement processing unit 113 to each DB server 20 via the communication unit 12. Further, the information transmission unit 116 transmits a resource shortage notification to the administrator server 5.

次に、記憶部１４は、ハードディスクやフラッシュメモリ等の記憶装置からなり、配置処理部１１３が生成したサーバ担当データ情報１００を記憶している。 Next, the storage unit 14 includes a storage device such as a hard disk or a flash memory, and stores server-in-charge data information 100 generated by the arrangement processing unit 113.

メモリ部１３は、ＲＡＭ等の一次記憶装置からなり、制御部１１によるデータ処理に必要な情報を一次的に記憶している。 The memory unit 13 includes a primary storage device such as a RAM, and temporarily stores information necessary for data processing by the control unit 11.

次に、本実施形態に係るデータ負荷分散配置システム１が行う、（１）各ＤＢサーバ２０のサーバ性能を考慮した仮想サーバの初期配置によるデータ負荷分散配置処理と、（２）システム稼動後の時間経過に伴うデータ数やデータアクセス回数の増減に対応した動的な仮想サーバの再配置と、について具体的に説明する。 Next, the data load distribution arrangement system 1 according to this embodiment performs (1) data load distribution arrangement processing by initial arrangement of virtual servers in consideration of server performance of each DB server 20, and (2) after system operation The dynamic virtual server relocation corresponding to the increase / decrease in the number of data and the number of data accesses over time will be specifically described.

（データ負荷分散配置処理―仮想サーバの初期配置）
まず、図８を参照して、データ負荷分散配置システム１が行う、仮想サーバの初期配置における全体の処理の流れについて説明する。図８は、本実施形態に係るデータ負荷分散配置処理（仮想サーバの初期配置）の全体の処理の流れを示すシーケンス図である。 (Data load distribution placement processing-virtual server initial placement)
First, the overall processing flow in the initial placement of the virtual server performed by the data load distribution and placement system 1 will be described with reference to FIG. FIG. 8 is a sequence diagram showing an overall processing flow of the data load distribution arrangement processing (initial arrangement of virtual servers) according to the present embodiment.

まず、管理者サーバ５から、仮想サーバ配置装置１０が、システム全体に求める処理性能Ｒと、各サーバのサーバ閾値を決定するための負荷閾値定数ｃとを、設定情報として取得する（ステップＳ１１）。なお、この処理は、予め、仮想サーバ配置装置１０の記憶部１４に、設定情報として記憶するようにしてもよい。 First, the processing performance R required for the entire system and the load threshold constant c for determining the server threshold value of each server are acquired as setting information from the administrator server 5 (step S11). . This process may be stored in advance as setting information in the storage unit 14 of the virtual server placement apparatus 10.

次に、各ＤＢサーバ２０は、自身が保存しているデータのデータキー、および自サーバのサーバ性能Ｔ_ｍを、自身に固有な番号とともに、仮想サーバ配置装置１０に送信する（ステップＳ１２）。 Then, the DB server 20, the data key of the data itself is stored, and a server performance T _m of a local server with a unique number to it, and transmits to the virtual server placement device 10 (step S12).

続いて、仮想サーバ配置装置１０のサーバ負荷計算部１１２は、サーバ負荷計算処理を行う（ステップＳ１３）。
具体的には、サーバ負荷計算部１１２は、管理者サーバ５から受信した設定情報に含まれる負荷閾値定数ｃと、各ＤＢサーバ２０から受信したサーバ性能Ｔ_ｍとに基づき、（式１）を用いて、各サーバのサーバ閾値α_ｍを計算する。そして、サーバ負荷計算部１１２は、管理者サーバ５から受信したシステム全体に求める処理性能Ｒと、各サーバから受信したデータキーの数の合計から得られるデータ数Ｈとに基づき、（式２）を用いて、ＤＢサーバ２０ごとの負荷Ｗ_ｍを計算する。なお、この仮想サーバの初期配置では、システム全体に対して１秒当たりシステム最大性能であるＲ回のアクセスがあり、各データへのアクセス回数は均等であるＲ／Ｈ回と想定する。
そして、サーバ負荷計算部１１２は、計算したＤＢサーバ２０ごとの負荷Ｗ_ｍとサーバ閾値α_ｍとを比較し、負荷がサーバ閾値以下のサーバＳ⁻の集合と、負荷がサーバ閾値を超えるサーバＳ^＋の集合とを生成して、その集合ごとに、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順にサーバを整列する処理を行う。 Subsequently, the server load calculation unit 112 of the virtual server arrangement device 10 performs a server load calculation process (step S13).
Specifically, the server load calculation unit 112 calculates (Equation 1) based on the load threshold constant c included in the setting information received from the administrator server 5 and the server performance T _m received from each DB server 20. And calculate the server threshold α _m for each server. Then, the server load calculation unit 112 is based on the processing performance R required for the entire system received from the administrator server 5 and the number of data H obtained from the total number of data keys received from each server (Formula 2). Is used to calculate the load W _m for each DB server 20. In this initial placement of the virtual server, it is assumed that there are R accesses that are the maximum system performance per second for the entire system, and the number of accesses to each data is equal R / H times.
Then, the server load calculation unit 112 compares the calculated load W _m for each DB server 20 with the server threshold value α _m, and the set of servers S ⁻ whose load is equal to or less than the server threshold value and the server S whose load exceeds the server threshold value. A set of ⁺ is generated, and for each set, the servers are arranged in descending order of the absolute value of the difference between the load W _m and the server threshold value α _m .

そして、仮想サーバ配置装置１０の配置処理部１１３が、ハッシュ空間上での仮想サーバの配置処理（「仮想サーバ配置処理」）を行うことにより、サーバ担当データ情報１００を生成する（ステップＳ１４）。ここで、配置処理部１１３は、この仮想サーバ配置処理において、リソース不足と判定した場合には、サーバ担当データ情報１００を生成せずに、そのリソース不足通知を、管理者サーバ５に送信し（ステップＳ１５）、物理サーバ（ＤＢサーバ２０）の追加を促して（ステップＳ１６）、処理を終える。 Then, the placement processing unit 113 of the virtual server placement device 10 performs the placement processing of the virtual server on the hash space (“virtual server placement processing”), thereby generating the server handling data information 100 (step S14). Here, if the placement processing unit 113 determines that the resource is insufficient in this virtual server placement processing, the placement processing unit 113 transmits the resource shortage notification to the administrator server 5 without generating the server charge data information 100 ( Step S15) prompts the addition of a physical server (DB server 20) (Step S16), and ends the process.

一方、仮想サーバ配置装置１０が、サーバ担当データ情報１００を生成した場合には、そのサーバ担当データ情報１００を、各ＤＢサーバ２０それぞれに送信する（ステップＳ１７）。 On the other hand, when the virtual server placement apparatus 10 generates the server charge data information 100, the server charge data information 100 is transmitted to each DB server 20 (step S17).

そして、各ＤＢサーバ２０同士で担当データの交換処理を行う（ステップＳ１８）。具体的には、各ＤＢサーバ２０のデータ交換部２１５は、仮想サーバ配置装置１０から受信した新たなサーバ担当データ情報１００と、記憶部２４に保存していたサーバ担当データ情報１００とを比較し、自身のＤＢサーバ２０が、追加して記憶すべきデータを、そのデータを保存するＤＢサーバ２０に問い合わせる。そして、問い合わせを受けたＤＢサーバ２０は、そのデータを問い合わせ元のＤＢサーバ２０に送信し、その送信確認後に自身のサーバからそのデータを削除する処理を行う。また、データ交換部２１５は、記憶部２４に保存していたサーバ担当データ情報１００を、データ管理部２１２を介して、受信した新たなサーバ担当データ情報１００に更新させる。 Then, each DB server 20 exchanges responsible data (step S18). Specifically, the data exchange unit 215 of each DB server 20 compares the new server charge data information 100 received from the virtual server placement apparatus 10 with the server charge data information 100 stored in the storage unit 24. The own DB server 20 inquires the DB server 20 that stores the data to be additionally stored. Then, the DB server 20 that has received the inquiry transmits the data to the inquiring DB server 20, and performs processing to delete the data from its own server after confirming the transmission. In addition, the data exchange unit 215 updates the server charge data information 100 stored in the storage unit 24 to the received new server charge data information 100 via the data management unit 212.

＜仮想サーバ配置処理＞
次に、図９および図１０を参照して、仮想サーバ配置装置１０（配置処理部１１３）による仮想サーバ配置処理（図８のステップＳ１４）について詳細に説明する。図９および図１０は、本実施形態に係る仮想サーバ配置装置１０の配置処理部１１３による仮想サーバ配置処理の流れを示すフローチャートである（適宜図１参照）。 <Virtual server placement processing>
Next, with reference to FIG. 9 and FIG. 10, the virtual server placement process (step S14 in FIG. 8) by the virtual server placement apparatus 10 (placement processing unit 113) will be described in detail. 9 and 10 are flowcharts showing the flow of the virtual server placement processing by the placement processing unit 113 of the virtual server placement device 10 according to the present embodiment (see FIG. 1 as appropriate).

なお、この配置処理部１１３による仮想サーバ配置処理は、図８のステップＳ１３において、仮想サーバ配置装置１０のサーバ負荷計算部１１２により、各ＤＢサーバ２０の負荷Ｗ_ｍが計算され、負荷がサーバ閾値以下のサーバＳ⁻と、負荷がサーバ閾値を超えるサーバＳ^＋とについて、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順に、サーバがすでに整列されているものとして説明する。
そして、配置処理部１１３は、図２〜図５において説明したように、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順に、負荷がサーバ閾値を超えるサーバＳ^＋の負荷の超過分の領域に、負荷がサーバ閾値以下のサーバＳ⁻の仮想サーバを配置することで、負荷がサーバ閾値を超えるサーバＳ^＋の負荷の超過分の担当データを、負荷がサーバ閾値以下のサーバＳ⁻に変更する処理を行う。以下、具体的に説明する。 Note that in the virtual server placement processing by the placement processing unit 113, the load W _m of each DB server 20 is calculated by the server load calculation unit 112 of the virtual server placement device 10 in step S13 of FIG. the following server S ^- a, the server S ⁺ capital the load exceeds the server threshold value, in order the larger the absolute value of the difference between the load W _m and the server threshold value alpha _m, be described as the server is already aligned.
Then, as described with reference to FIGS. 2 to 5, the arrangement processing unit 113 exceeds the load of the server S ⁺ whose load exceeds the server threshold in descending order of the absolute value of the difference between the load W _m and the server threshold α _m. By placing the virtual server of the server S ⁻ whose load is less than the server threshold in the area of the minute, the responsible data for the excess load of the server S ⁺ whose load exceeds the server threshold, and the server S whose load is less than the server threshold ^- carry out the process of changing to. This will be specifically described below.

まず、配置処理部１１３は、負荷がサーバ閾値を超えるサーバＳ^＋、負荷がサーバ閾値以下のサーバＳ⁻それぞれについて、初期値として、絶対値（Ｏ^＋ _li,Ｏ⁻ _kj）が大きい順に１番目のサーバを選択し（「ｉ←１」「ｊ←１」）、初期値のFlagとして「True」を設定する（ステップＳ１０１）。なお、ここで、「ｉ」は、（式５）で示したように、０＜ｉ≦ｑを満たし、「ｊ」は、（式４）で示したように、０＜ｊ≦ｐを満たす値である。また、このFlagは、以後のステップＳ１１３，Ｓ１１７，Ｓ１１９（図１０参照）で設定されるものであり、負荷がサーバ閾値を超えるサーバＳ^＋の負荷の超過分を解消するために、負荷がサーバ閾値以下のサーバＳ⁻が仮想サーバの割当てを行った場合に、さらに、まだ仮想サーバを割り当てる負荷の余裕分があるか否かを示すFlagである。そして、次に割り当てる負荷の余裕分がない場合に「True」、次に割り当てる負荷の余裕分がある場合に「false」のFlagが設定される。 First, the placement processing unit 113 is the first in descending order of the absolute value (O ⁺ _li , O ⁻ _kj ) as an initial value for each of the server S ⁺ whose load exceeds the server threshold and the server S ⁻ whose load is equal to or less than the server threshold. Are selected (“i ← 1” and “j ← 1”), and “True” is set as the initial value Flag (step S101). Here, “i” satisfies 0 <i ≦ q as shown in (Expression 5), and “j” satisfies 0 <j ≦ p as shown in (Expression 4). Value. In addition, this Flag is set in subsequent steps S113, S117, and S119 (see FIG. 10). In order to eliminate the excess load of the server S ⁺ whose load exceeds the server threshold, subthreshold server S ^- if the performed assignment of virtual servers, further a Flag still indicates whether there is a margin in the load allocating a virtual server. Then, “True” is set when there is no margin for the next load to be allocated, and “false” is set when there is a margin for the next load to be allocated.

次に、配置処理部１１３は、負荷がサーバ閾値を超えるサーバＳ^＋ _liのうち、まだ処理していないサーバがあるか否かを判定する（ｉ＜ｑ＋１）（ステップＳ１０２）。ここで、まだ処理していない、負荷がサーバ閾値を超えるサーバＳ^＋ _liがない、つまり、すべての負荷がサーバ閾値を超えるサーバＳ^＋ _liの処理を終えている場合には（ステップＳ１０２→Ｎｏ）、処理を終了する。一方、まだ、処理していない、負荷がサーバ閾値を超えるサーバＳ^＋ _liがある場合には（ステップＳ１０２→Ｙｅｓ）、次のステップＳ１０３へ進む。 Next, the arrangement processing unit 113 determines whether there is a server that has not yet been processed among the servers S ⁺ _li whose load exceeds the server threshold (i <q + 1) (step S102). Here, not yet processed, the load is no server S ⁺ _li exceeding server threshold, that is, when all the load is finished with the server S ⁺ _li exceeding the server threshold value (step S102 → No ), The process is terminated. On the other hand, if there is a server S ⁺ _li that has not yet been processed and the load exceeds the server threshold (step S102 → Yes), the process proceeds to the next step S103.

ステップＳ１０３において、配置処理部１１３は、負荷がサーバ閾値を超えるサーバＳ^＋ _liが、どのくらいサーバ閾値を超えているかの絶対値（Ｏ^＋ _li：超過分）を計算する（Ｏ^＋ _li ← W_li−α_li）。 In step S103, the arrangement processing unit 113 calculates the absolute value (O ⁺ _li : excess) of how much the server S ⁺ _li whose load exceeds the server threshold exceeds the server threshold (O ⁺ _li ← W _li −α _li ).

次に、配置処理部１１３は、その負荷がサーバ閾値を超えるサーバＳ^＋ _liが、そのサーバのサーバ閾値α_li以下でそのサーバが担当可能な最大個数のデータのハッシュ空間上での値ｘを計算する（ステップＳ１０４）。このｘは、負荷がサーバ閾値を超えるサーバＳ^＋ _liの担当領域内のデータのハッシュ値をその負荷がサーバ閾値を超えるサーバＳ^＋ _liのハッシュ値に近い順に並べたときに、負荷がサーバ閾値α_ｍ以下で最大となるｈ番目のデータのハッシュ値となる（x ← max{ x ;Σ _h=1,…,x b_li (h)≦ α_li }）。ここで、b_li (h)は、サーバＳ^＋ _liの担当領域のうちのデータのハッシュ値をサーバＳ^＋ _liのハッシュ値に近い順に並べたときに、ｈ番目のデータにかかる負荷を示す。なお、この処理は、後記するステップＳ１０９において、仮想サーバの配置位置を決定する際に利用される。 Next, the placement processing unit 113 _calculates the value x on the hash space of the maximum number of data that can be handled by the server S ⁺ _li whose load exceeds the server threshold value and is below the server threshold value α _li of the server. Calculate (step S104). This x indicates that when the hash values of the data in the area in charge of the server S ⁺ _li whose load exceeds the server threshold are arranged in the order close to the hash value of the server S ⁺ _li whose load exceeds the server threshold, This is the hash value of the h-th data that is the maximum at α _m or less (x ← max {x; Σh _{= 1,..., x} b _li (h) ≦ α _li }). Here, b _li (h), when arranged a hash value of data in the coverage area of the server S ⁺ _li sequentially closer to the hash value of the server S ⁺ _li, showing the load on the h-th data. This process is used when determining the placement position of the virtual server in step S109 described later.

続いて、配置処理部１１３は、負荷がサーバ閾値以下のサーバＳ⁻ _kjがあるか否かを判定する（ｊ＜ｐ＋１）（ステップＳ１０５）。ここで、負荷がサーバ閾値以下のサーバＳ⁻ _kjがない場合には（ステップＳ１０５→Ｎｏ）、ステップＳ１２２（図１０参照）に進み、管理者サーバ５に、物理サーバ（ＤＢサーバ２０）の追加依頼を送信し、処理を終える。一方、負荷がサーバ閾値以下のサーバＳ⁻ _kjがある場合には（ステップＳ１０５→Ｙｅｓ）、次のステップＳ１０６へ進む。 Subsequently, the placement processing unit 113 determines whether there is a server S ⁻ _kj whose load is equal to or less than the server threshold (j <p + 1) (step S105). Here, when there is no server S ^- _kj whose load is equal to or less than the server threshold (step S105 → No), the process proceeds to step S122 (see FIG. 10), and a physical server (DB server 20) is added to the administrator server 5. Send the request and finish the process. On the other hand, if there is a server S ^- _kj whose load is equal to or less than the server threshold (step S105 → Yes), the process proceeds to the next step S106.

ステップＳ１０６において、配置処理部１１３は、Flagが「true」か否かを判定する。つまり、直前の負荷解消のために割り当てた、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の余裕分が使えるか否かを判定する。具体的には、負荷を解消するために、自身の負荷の余裕分をすべて使って仮想サーバの割り当てを行ったか（Flag=true）、仮想サーバの割り当てを行ってもなお負荷の余裕分が残っているか（Flag=false）を判定する。なお、負荷がサーバ閾値を超えるサーバＳ^＋ _liの絶対値Ｏ^＋ _liの大きい順で１番目のサーバ（ｉ＝１）については、ステップＳ１０１で初期値のFlagとして「True」が設定されている。 In step S <b> 106, the arrangement processing unit 113 determines whether or not Flag is “true”. That is, it is determined whether or not the load margin of the server S ^- _kj , which is assigned to eliminate the previous load and the load is equal to or less than the server threshold, can be used. Specifically, in order to eliminate the load, the virtual server was allocated using all of its own load margin (Flag = true), or the load margin still remained after the virtual server allocation. (Flag = false) is determined. For the first server (i = 1) in the descending order of the absolute value O ⁺ _li of the server S ⁺ _li whose load exceeds the server threshold, “True” is set as the initial value Flag in step S101. .

ここで、Flag=trueの場合は（ステップＳ１０６→Ｙｅｓ）、次のステップＳ１０７において、配置処理部１１３が、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の絶対値を計算し（Ｏ⁻ _kj← α_kj−W_kj）、ステップＳ１０９へ進む。一方、Flag=falseの場合は（ステップＳ１０６→Ｎｏ）、次のステップＳ１０８において、直前に負荷を割り当てた、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の余裕分「Rest」を取得し（Ｏ⁻ _kj← Rest）、ステップＳ１０９へ進む。なお、この負荷の余裕分「Rest」は、後記するステップＳ１１６（図１０参照）において、計算されるものである。 Here, if Flag = true (step S106 → Yes), in the next step S107, the placement processing unit 113 calculates the absolute value of the load of the server S ^- _kj whose load is equal to or less than the server threshold (O ^- _kj). ← α _kj −W _kj ), the process proceeds to step S109. On the other hand, when Flag = false (step S106 → No), in the next step S108, the load “Rest” of the load of the server S ⁻ _kj whose load is assigned immediately before and whose load is equal to or less than the server threshold is acquired ( O ⁻ _kj ← Rest), the process proceeds to step S109. Note that this load margin “Rest” is calculated in step S116 (see FIG. 10) described later.

そして、配置処理部１１３は、ステップＳ１０９において、サーバＳ^＋ _liの担当領域のｘ＋１の位置に、サーバＳ⁻ _kjの仮想サーバを割り当てる。この仮想サーバの配置位置は、ステップＳ１０４において計算した、負荷がサーバ閾値を超えるサーバが、サーバ閾値以下でそのサーバが担当可能な最大個数のデータのハッシュ空間上での値ｘの次のハッシュ値であるｘ＋１に設定される。 In step S109, the arrangement processing unit 113 assigns the virtual server of the server S ⁻ _kj to the position of x + 1 in the area in charge of the server S ⁺ _li . The placement position of this virtual server is the hash value next to the value x in the hash space of the maximum number of data that can be handled by the server whose load exceeds the server threshold value and is below the server threshold value calculated in step S104. Is set to x + 1.

図１０に進み、ステップＳ１１０において、配置処理部１１３は、ステップＳ１０９（図９参照）で決定した配置位置に、仮想サーバを配置した場合に、負荷がサーバ閾値を超えるサーバＳ^＋ _liの負荷が解消されるか否かを判定する。具体的には、配置処理部１１３は、負荷がサーバ閾値を超えるサーバＳ^＋ _liの負荷の絶対値（Ｏ^＋ _li：超過分）と、仮想サーバを設定する負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の絶対値（Ｏ⁻ _kj：余裕分）とを比較し、負荷がサーバ閾値を超えるサーバＳ^＋ _liの負荷の絶対値Ｏ^＋ _liの方が大きいか否かを判定する（Ｏ^＋ _li＞Ｏ⁻ _kj）。 Proceeding to FIG. 10, in step S110, when the placement processing unit 113 places a virtual server at the placement position determined in step S109 (see FIG. 9), the load of the server S ⁺ _li whose load exceeds the server threshold is _detected . It is determined whether or not it is resolved. Specifically, the arrangement processing unit 113, the absolute value of the load of the server S ⁺ _li the load exceeds the server threshold: and (O ⁺ _li excess), the load of setting the virtual server is less server threshold server S ^- the absolute value of the load _kj ^-: compare (O _kj margin) and the load determines whether towards the absolute value O ⁺ _li of the load of the server S ⁺ _li exceeding the server threshold value is large (O ⁺ _li > O ^- _kj ).

ここで、負荷がサーバ閾値を超えるサーバＳ^＋ _liの負荷の絶対値Ｏ^＋ _liの方が大きい場合（ステップＳ１１０→Ｙｅｓ）、つまり、仮想サーバを割り当てても、まだ、負荷がサーバ閾値を超えている場合（超過分が残っている場合）には、次のステップＳ１１１へ進む。 Here, when the absolute value O ⁺ _li of the load of the server S ⁺ _li whose load exceeds the server threshold is larger (step S110 → Yes), that is, even if a virtual server is allocated, the load still exceeds the server threshold If there is an excess (if there is an excess), the process proceeds to the next step S111.

ステップＳ１１１において、配置処理部１１３は、ステップＳ１０９（図９参照）で設定した、ハッシュ空間上の仮想サーバの配置位置から、その仮想サーバの担当可能な最大個数のデータのハッシュ空間上での値ｘを計算する（x ← max{ x ;Σ _h=1,…,x b_li (h)≦ α_li }）。なお、このハッシュ空間上での値ｘは、絶対値Ｏ⁻ _kjの大きさ順が次の負荷がサーバ閾値以下のサーバＳ⁻ _kjの仮想サーバを、ハッシュ空間上に割り当てる際（再び、ステップＳ１０９の処理を行う際）に、その仮想サーバの配置位置を決定するときに利用される。 In step S111, the placement processing unit 113 sets the value on the hash space of the maximum number of data that can be handled by the virtual server from the placement position of the virtual server on the hash space set in step S109 (see FIG. 9). x is calculated (x ← max {x; Σh _{= 1,...,} xb _li (h) ≦ α _li }). The value x on the hash space is assigned to the virtual server of the server S ^- _kj whose absolute load O ^- _kj in the order of magnitude is the server load below the server threshold value on the hash space (again, step S109). This is used when determining the placement position of the virtual server.

次に、配置処理部１１３は、仮想サーバを割り当てた後の負荷がサーバ閾値を超えるサーバＳ^＋ _liについて、サーバ閾値を超える負荷の絶対値Ｏ^＋ _liを計算する（Ｏ^＋ _li ← Ｏ^＋ _li−Ｏ⁻ _kj）（ステップＳ１１２）。 Next, the placement processing unit 113 calculates the absolute value O ⁺ _li of the load exceeding the server threshold for the server S ⁺ _li whose load after the allocation of the virtual server exceeds the server threshold (O ⁺ _li ← O ⁺ _li -O ^- _kj ) (step S112).

そして、配置処理部１１３は、負荷解消のために、負荷がサーバ閾値以下のサーバＳ⁻ _kjの余裕部のすべてを割り当てたことを示す、Flagを「true」に設定する（ステップＳ１１３）。 Then, the placement processing unit 113 sets Flag to “true” indicating that all of the surplus portions of the server S ^- _kj whose load is equal to or less than the server threshold value has been allocated in order to eliminate the load (step S113).

続いて、配置処理部１１３は、サーバ負荷計算部１１２が計算し整列した後の、負荷がサーバ閾値以下のサーバＳ⁻ _kjのうち、次に絶対値Ｏ⁻ _kjが大きいサーバを選択し（ｊ←ｊ＋１）（ステップＳ１１４）、ステップＳ１０５（図９参照）へ戻る。 Subsequently, the placement processing section 113, after alignment calculated by the server load calculation unit 112, the load server S follows the server threshold value ^- of _kj, then the absolute value O ^- _kj select the server is large (j ← j + 1) (step S114), the process returns to step S105 (see FIG. 9).

一方、ステップＳ１１０において、配置処理部１１３は、負荷がサーバ閾値を超えるサーバＳ^＋ _liの負荷の絶対値（Ｏ^＋ _li：超過分）と、仮想サーバを設定する負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の絶対値（Ｏ⁻ _kj：余裕分）とを比較し、負荷がサーバ閾値を超えるサーバＳ^＋ _liの負荷の絶対値Ｏ^＋ _liの方が大きい場合でないとき（ステップＳ１１０→Ｎｏ）、つまり、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の余裕の絶対値Ｏ⁻ _kjが、負荷がサーバ閾値を超えるサーバＳ^＋ _liの負荷の絶対値Ｏ^＋ _liと同じか、それを超える場合には、ステップＳ１１５へ進む。 On the other hand, in step S110, the placement processing unit 113 determines the absolute value (O ⁺ _li : excess) of the server S ⁺ _li whose load exceeds the server threshold and the server S whose load for setting the virtual server is equal to or less than the server threshold. ^- the absolute value of the load of _kj ^-: compare (O _kj margin) and, when the load is not the case is larger absolute value O ⁺ _li of the load of the server S ⁺ _li exceeding the server threshold value (step S110 → no ), i.e., the load is less than the server threshold value server S ^- absolute value O of the load _kj margin ^- _kj is, the load is equal to or absolute value O ⁺ _li of the load of the server S ⁺ _li exceeding server threshold, it When exceeding, it progresses to step S115.

そして、ステップＳ１１５において、配置処理部１１３は、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の絶対値（Ｏ⁻ _kj：余裕分）が、負荷がサーバ閾値を超えるサーバＳ^＋の負荷の絶対値（Ｏ^＋ _li：超過分）を超えるか否かを判定する（Ｏ^＋ _li ＜Ｏ⁻ _kj）。そして、負荷がサーバ閾値以下のサーバＳ⁻の負荷の絶対値（Ｏ⁻ _kj：余裕分）が、負荷がサーバ閾値を超えるサーバＳ^＋ _liの負荷の絶対値（Ｏ^＋ _li：超過分）を超える場合には（ステップＳ１１５→Ｙｅｓ）、つまり、仮想サーバの割当てによって、負荷がサーバ閾値を超えるサーバＳ^＋ _liの超過分が解消され、さらに、仮想サーバを割り当てたサーバＳ⁻ _kjにまだ余裕分が残っている場合には、次のステップＳ１１６へ進む。 Then, in step S115, the placement processing section 113, the load server threshold following servers S ^- absolute value of the load of _{_kj} (O ^- _kj: margin) is loaded absolute server S ⁺ load exceeding the server threshold value It is determined whether or not the value (O ⁺ _li : excess) is exceeded (O ⁺ _li <O ⁻ _kj ). Then, the absolute value (O ⁻ _kj : margin) of the load of the server S ⁻ whose load is equal to or less than the server threshold is the absolute value (O ⁺ _li : excess) of the load of the server S ⁺ _li whose load exceeds the server threshold. In the case of exceeding (step S115 → Yes), that is, the excess of the server S ⁺ _li whose load exceeds the server threshold is eliminated by the allocation of the virtual server, and further, the server S ^- _kj to which the virtual server is allocated still has a margin. If the minute remains, the process proceeds to the next step S116.

次に、配置処理部１１３は、仮想サーバを割り当てた負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の余裕分（Rest）を計算する（Rest ← Ｏ⁻ _kj − Ｏ^＋ _li）（ステップＳ１１６）。そして、配置処理部１１３は、その負荷の余裕分（Rest）を記憶する。 Next, the placement processing unit 113 calculates a load margin (Rest) of the server S ⁻ _kj whose load assigned to the virtual server is equal to or less than the server threshold (Rest ← O ⁻ _kj −O ⁺ _li ) (step S116). . Then, the placement processing unit 113 stores the load margin (Rest).

次に、配置処理部１１３は、負荷解消のために、負荷がサーバ閾値以下のサーバＳ⁻ _kjに、まだ余裕分があること（次に割り当てる負荷の余裕分があること）を示す、Flagを「false」に設定する（ステップＳ１１７）。 Next, in order to eliminate the load, the placement processing unit 113 sets a Flag indicating that the server S ^- _kj whose load is equal to or less than the server threshold still has a margin (there is a margin for the next load to be allocated). “False” is set (step S117).

そして、配置処理部１１３は、サーバ負荷計算部１１２が計算し整列した後の、負荷がサーバ閾値を超えるサーバＳ^＋ _liのうち、次に絶対値Ｏ^＋ _liの大きいサーバを選択し（ｉ←ｉ＋１）（ステップＳ１１８）、次のステップＳ１２１へ進む。 Then, the arrangement processing unit 113 selects the server having the next largest absolute value O ⁺ _{li from} the servers S ⁺ _li whose loads exceed the server threshold after the server load calculation unit 112 calculates and arranges (i ← i + 1) (step S118), the process proceeds to the next step S121.

一方、ステップＳ１１５において、配置処理部１１３は、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の絶対値（Ｏ⁻ _kj：余裕分）が、負荷がサーバ閾値を超えるサーバＳ^＋ _liの負荷の絶対値（Ｏ^＋ _li：超過分）と同じ場合は（ステップＳ１１５→Ｎｏ）、次のステップＳ１１９へ進む。そして、配置処理部１１３は、負荷解消のために、負荷がサーバ閾値以下のサーバＳ⁻ _kjの余裕分のすべてを割り当てたことを示す、Flagを「true」に設定する（ステップＳ１１９）。 On the other hand, in step S115, the placement processing unit 113 determines the load of the server S ⁺ _li whose load exceeds the server threshold when the absolute value (O ⁻ _kj : margin) of the load of the server S ⁻ _kj whose load is equal to or less than the server threshold. If it is the same as the absolute value (O ⁺ _li : excess) (step S115 → No), the process proceeds to the next step S119. Then, the placement processing unit 113 sets Flag to “true” indicating that all the margins of the server S ⁻ _kj whose load is equal to or less than the server threshold are allocated to eliminate the load (step S119).

続いて、配置処理部１１３は、サーバ負荷計算部１１２が計算し整列した後の、負荷がサーバ閾値を超えるサーバＳ^＋ _li、負荷がサーバ閾値以下のサーバＳ⁻ _kjのうち、次に絶対値（Ｏ^＋ _li,Ｏ⁻ _kj）の大きいサーバをそれぞれ選択し（ｉ←ｉ＋１，ｊ←ｊ＋１）（ステップＳ１２０）、次のステップＳ１２１へ進む。 Subsequently, after the server load calculation unit 112 calculates and arranges the arrangement processing unit 113, the absolute value of the server S ⁺ _li whose load exceeds the server threshold and the server S ⁻ _kj whose load is equal to or less than the server threshold is next. Servers with large (O ⁺ _li , O ⁻ _kj ) are selected (i ← i + 1, j ← j + 1) (step S120), and the process proceeds to the next step S121.

そして、ステップＳ１１８およびステップＳ１２０の次に、ステップＳ１２１において、配置処理部１１３は、まだ余裕分の残っている、負荷がサーバ閾値以下のサーバＳ⁻ _kjがあるか否かを判定する（ｊ＝ｐ＋１）。そして、まだ余裕分が残っている、負荷がサーバ閾値以下のサーバＳ⁻ _kjがある場合には（ステップＳ１２１→Ｙｅｓ）、ステップＳ１０２（図９参照）へ戻り処理を続ける。一方、配置処理部１１３は、負荷がサーバ閾値以下のサーバＳ⁻ _kjで、余裕分が残っているサーバがない場合には（ステップＳ１２１→Ｎｏ）、ステップＳ１２２へ進み、管理者サーバ５に、物理サーバ（ＤＢサーバ２０）の追加依頼を送信し、処理を終える。
なお、配置処理部１１３は、ハッシュ空間上での仮想サーバの割当てを完了することにより、新たなサーバ担当データ情報１００を生成し、各ＤＢサーバ２０それぞれに送信する。 Then, after step S118 and step S120, in step S121, the placement processing unit 113 determines whether there is a server S ^- _kj whose load is still less than or equal to the server threshold (j = p + 1). If there is a server S ^- _kj whose load is still less than the server threshold (step S121 → Yes), the process returns to step S102 (see FIG. 9) to continue the process. On the other hand, if the load is the server S ^- _kj whose load is equal to or less than the server threshold and there is no remaining server (step S121 → No), the arrangement processing unit 113 proceeds to step S122, and the administrator server 5 A request for addition of the physical server (DB server 20) is transmitted, and the process ends.
The placement processing unit 113 completes the allocation of the virtual server on the hash space, thereby generating new server handling data information 100 and transmitting it to each DB server 20.

このようにすることで、本実施形態に係るデータ負荷分散配置システム１およびデータ負荷分散配置方法による仮想サーバの初期配置によって、負荷分散を実現し、さらに、各サーバの担当領域に配置する仮想サーバの数が同一に設定される従来技術に比べ、仮想サーバの数を減らすことができる。また、仮想サーバを大量に配置する必要がなくなるため、仮想サーバ保持のための記述量の増加を抑え、クライアントからデータの問い合わせを受けた際に、そのデータを格納している担当サーバの検索に要する時間の増加を抑えることができる。よって、システム全体としてのスループットを向上させることができる。 By doing in this way, load distribution is realized by the initial arrangement of the virtual server by the data load distribution arrangement system 1 and the data load distribution arrangement method according to the present embodiment, and the virtual server arranged in the assigned area of each server The number of virtual servers can be reduced compared to the prior art in which the number of servers is set to be the same. Also, since there is no need to place a large number of virtual servers, the increase in the amount of description for holding virtual servers is suppressed, and when a data inquiry is received from a client, the server in charge that stores the data is searched. An increase in time required can be suppressed. Therefore, the throughput of the entire system can be improved.

（データ負荷分散配置処理―仮想サーバの再配置）
次に、図１１を参照して、データ負荷分散配置システム１が行う、所定間隔Ｚごとの仮想サーバの再配置処理の全体の処理の流れについて説明する。図１１は、本実施形態に係るデータ負荷分散配置処理（仮想サーバの再配置）の全体の処理の流れを示すシーケンス図である。なお、図８に示した仮想サーバの初期配置の全体の流れと同一の処理については、図１１においても、同一の名称と符号を付し、説明を省略する。 (Data load distribution placement processing-virtual server relocation)
Next, the overall processing flow of the virtual server relocation processing for each predetermined interval Z performed by the data load distribution and placement system 1 will be described with reference to FIG. FIG. 11 is a sequence diagram showing the overall processing flow of the data load distribution arrangement processing (virtual server relocation) according to the present embodiment. In addition, about the process same as the whole flow of the initial arrangement | positioning of the virtual server shown in FIG. 8, the same name and code | symbol are attached | subjected also in FIG. 11, and description is abbreviate | omitted.

まず、管理者サーバ５から、仮想サーバ配置装置１０が、システム全体に求める処理性能Ｒと、各サーバの負荷のサーバ閾値を決定するための負荷閾値定数ｃとを、設定情報として取得する（ステップＳ１１）。なお、この処理は、所定間隔Ｚごとに行われてもよいが、システム全体に求める処理性能Ｒおよび負荷閾値定数ｃに変更がなければ、このステップＳ１１を行わないようにしてもよい。また、予め、仮想サーバ配置装置１０の記憶部１４に、設定情報（Ｒ，ｃ）を記憶させておき利用するようにしてもよい。 First, from the administrator server 5, the virtual server placement apparatus 10 acquires, as setting information, the processing performance R required for the entire system and the load threshold constant c for determining the server threshold of the load of each server (step). S11). This process may be performed at every predetermined interval Z. However, if the processing performance R and load threshold constant c required for the entire system are not changed, step S11 may not be performed. In addition, the setting information (R, c) may be stored in advance in the storage unit 14 of the virtual server placement apparatus 10 for use.

そして、各ＤＢサーバ２０それぞれは、所定間隔Ｚごとに、自身のＤＢサーバ２０に保存しているデータのデータキーおよびそのデータへのアクセス回数の情報であるアクセス回数情報３００（図７参照）を、自身に固有な番号とともに、仮想サーバ配置装置１０へ送信する（ステップＳ２１）。 Each DB server 20 receives, for each predetermined interval Z, a data key of data stored in its own DB server 20 and access count information 300 (see FIG. 7) which is information on the number of accesses to the data. , Together with a number unique to itself, is transmitted to the virtual server placement apparatus 10 (step S21).

次に、仮想サーバ配置装置１０のサーバ負荷計算部１１２は、サーバ負荷計算処理を行う（ステップＳ２２）。
具体的は、サーバ負荷計算部１１２は、図８の仮想サーバの初期配置におけるステップＳ１２で受信している、各ＤＢサーバ２０のサーバ性能Ｔ_ｍと、自身の記憶部１４に記憶する負荷閾値定数ｃまたはステップＳ１１で取得した負荷閾値定数ｃに基づき、（式１）を用いて、各サーバのサーバ閾値α_ｍを設定する。また、サーバ負荷計算部１１２は、ステップＳ２１で各ＤＢサーバ２０から受信した、データキーおよびアクセス回数を含むアクセス回数情報３００（図７参照）に基づき、各ＤＢサーバの負荷Ｗ_ｍを計算する。そして、サーバ負荷計算部１１２は、計算したＤＢサーバ２０ごとの負荷Ｗ_ｍとサーバ閾値α_ｍとを比較し、負荷がサーバ閾値以下のサーバＳ⁻の集合と、負荷がサーバ閾値を超えるサーバＳ^＋の集合とを生成して、その集合ごとに、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順にサーバを整列する処理を行う。 Next, the server load calculation unit 112 of the virtual server arrangement device 10 performs server load calculation processing (step S22).
Specifically, the server load calculation unit 112 receives the server performance T _m of each DB server 20 received in step S12 in the initial arrangement of the virtual server in FIG. The server threshold value α _m of each server is set using (Equation 1) based on c or the load threshold constant c acquired in step S11. The server load calculation unit 112, received from each DB server 20 in step S21, based on the access count information 300 containing the data key and the access count (see FIG. 7) calculates the load _{W m} of the DB servers. Then, the server load calculation unit 112 compares the calculated load W _m for each DB server 20 with the server threshold value α _m, and the set of servers S ⁻ whose load is equal to or less than the server threshold value and the server S whose load exceeds the server threshold value. A set of ⁺ is generated, and for each set, the servers are arranged in descending order of the absolute value of the difference between the load W _m and the server threshold value α _m .

続いて、仮想サーバ配置装置１０の仮想サーバ再配置計算部１１５は、負荷がサーバ閾値以下となったＤＢサーバ２０について、仮想サーバを取り除く処理（仮想サーバ再配置計算）を行う（ステップＳ２３）。そして、仮想サーバ再配置計算部１１５は、仮想サーバを取り除いた後の各サーバの負荷に基づき、負荷がサーバ閾値以下のサーバＳ⁻と、負荷がサーバ閾値を超えるサーバＳ^＋とについて、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順にサーバを整列する。そして、その情報を、配置処理部１１３に引き渡す。なお、この仮想サーバ再配置計算は、後記する図１２〜図１８において詳細に説明する。 Subsequently, the virtual server rearrangement calculation unit 115 of the virtual server arrangement apparatus 10 performs a process of removing the virtual server (virtual server rearrangement calculation) for the DB server 20 whose load is equal to or less than the server threshold (step S23). Then, based on the load of each server after the virtual server is removed, the virtual server relocation calculation unit 115 calculates the load W for the server S ⁻ whose load is equal to or less than the server threshold and the server S ⁺ whose load exceeds the server threshold. _The servers are arranged in descending order of the absolute value of the difference between _m and the server threshold α _m . Then, the information is transferred to the arrangement processing unit 113. This virtual server rearrangement calculation will be described in detail with reference to FIGS.

そして、仮想サーバ配置装置１０の配置処理部１１３は、仮想サーバ再配置計算部１１５が計算し整列させた各サーバの負荷に関する情報に基づき、再度、仮想サーバ配置処理（図９および図１０参照）を実行する（ステップＳ１４）。そして、以下は、図８のステップＳ１５〜Ｓ１８と同様の処理を行って、各ＤＢサーバ２０同士が担当データの交換処理を行う。 Then, the placement processing unit 113 of the virtual server placement apparatus 10 again performs the virtual server placement processing (see FIGS. 9 and 10) based on the information regarding the load of each server calculated and aligned by the virtual server relocation calculation unit 115. Is executed (step S14). Then, the following processes similar to those in steps S15 to S18 in FIG. 8 are performed, and the respective DB servers 20 exchange data in charge.

＜仮想サーバ再配置計算＞
次に、図１２〜図１７を参照して、図１１のステップＳ２３における、仮想サーバ配置装置１０（仮想サーバ再配置計算部１１５）による仮想サーバ再配置計算の概要を説明する。そして、図１８を用いて、仮想サーバ再配置計算部１１５による仮想サーバ再配置計算の流れを、さらに詳細にフローチャートを用いて説明する（適宜図１参照）。 <Virtual server relocation calculation>
Next, an overview of the virtual server relocation calculation by the virtual server allocation device 10 (virtual server relocation calculation unit 115) in step S23 of FIG. 11 will be described with reference to FIGS. Then, the flow of the virtual server rearrangement calculation performed by the virtual server rearrangement calculation unit 115 will be described in more detail using a flowchart (see FIG. 1 as appropriate) with reference to FIG.

なお、この仮想サーバ再配置計算部１１５による仮想サーバ再配置計算は、例えば、図１１のステップＳ２２において、サーバ負荷計算部１１２が、所定間隔Ｚごとに各ＤＢサーバ２０の負荷Ｗ_ｍを算出し、負荷がサーバ閾値以下のサーバＳ⁻と、負荷がサーバ閾値を超えるサーバＳ^＋とについて、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順にサーバを整列させたことを契機として実行される。 The virtual server relocation calculation by the virtual server relocation calculation unit 115 is performed by, for example, the server load calculation unit 112 calculating the load W _m of each DB server 20 at every predetermined interval Z in step S22 of FIG. When the server S ⁻ whose load is equal to or less than the server threshold and the server S ⁺ whose load exceeds the server threshold, the servers are arranged in descending order of the absolute value of the difference between the load W _m and the server threshold α _m. Executed.

図１２〜図１７は、本実施形態に係る仮想サーバ配置装置１０（仮想サーバ再配置計算部１１５）による仮想サーバ再配置計算の概要を説明するための図である。
ここでは、仮想サーバ配置装置１０による仮想サーバの初期配置により、図５（ａ）に示すようにハッシュ空間上で仮想サーバが配置され（図１２（ａ）参照）、システム稼働後の所定間隔Ｚ後に、各サーバＳ_ｍの負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が、図１２（ｂ）のようになったものとして説明する。例えば、サーバＳ_１は、初期配置では、仮想サーバ（ＶＳ_３，ＶＳ_４）が配置されることにより、負荷がサーバ閾値を超えない設定で、処理を開始したが、時間の経過とともに、図１２（ｂ）に示すように、負荷がサーバ閾値以下のサーバＳ⁻となっている。つまりサーバＳ_１の負荷が減少している。 12 to 17 are diagrams for explaining the outline of the virtual server relocation calculation performed by the virtual server arrangement device 10 (virtual server relocation calculation unit 115) according to the present embodiment.
Here, by the initial placement of the virtual server by the virtual server placement device 10, the virtual server is placed on the hash space as shown in FIG. 5A (see FIG. 12A), and the predetermined interval Z after the system is operated. later, the absolute value of the difference between the load W _m and the server threshold value alpha _m of each server S _m is described as that looks like Figure 12 (b). For example, in the initial placement, the server S ₁ starts processing with a setting in which the load does not exceed the server threshold by placing virtual servers (VS ₃ , VS ₄ ). As shown in (b), the load is the server S ⁻ whose load is equal to or less than the server threshold. This means that the load of the server S ₁ is decreased.

仮想サーバ再配置計算部１１５は、負荷がサーバ閾値以下のサーバＳ⁻のうち、ハッシュ空間上で仮想サーバが配置されているものを取り除く処理を行う。なお、その際、仮想サーバ再配置計算部１１５は、仮想サーバを取り除くと、その負荷Ｗ_ｍがサーバ閾値α_ｍを超えてしまう場合には、負荷がサーバ閾値以下のサーバＳ⁻のその仮想サーバを取り除く処理を行わないものとする。 The virtual server rearrangement calculation unit 115 performs a process of removing a server S ⁻ whose load is equal to or less than the server threshold value, in which the virtual server is arranged in the hash space. At that time, if the virtual server rearrangement calculation unit 115 removes the virtual server and the load W _m exceeds the server threshold value α _m , the virtual server of the server S ⁻ whose load is equal to or less than the server threshold value. It is assumed that the process of removing is not performed.

まず、仮想サーバ再配置計算部１１５は、図１２に示す状態から、負荷がサーバ閾値以下のサーバＳ⁻のうち、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順で、かつ、ハッシュ空間上で仮想サーバが配置されているサーバを選択する。ここでは、Ｓ_１のサーバを選択する。そして、ハッシュ空間上のサーバＳ_１に近い順に、１番目に配置された仮想サーバＶＳ_３を取り除く（図１３（ａ）参照）。この状態での各サーバＳ_ｍの負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値を図１３（ｂ）に示す。サーバＳ_１は、仮想サーバＶＳ_３が担当していたデータの分だけ、負荷の余裕分が減っている。これに対し、サーバＳ_３は、仮想サーバＶＳ_３の担当していたデータ分だけ、負荷の余裕分が増えている。 First, the virtual server rearrangement calculation unit 115 starts from the state shown in FIG. 12 in order of increasing absolute value of the difference between the load W _m and the server threshold value α _m among the servers S ⁻ whose loads are equal to or less than the server threshold value, and The server on which the virtual server is arranged on the hash space is selected. Here, you select the server S _1. Then, the virtual server VS ₃ arranged first is removed in the order close to the server S ₁ on the hash space (see FIG. 13A). FIG. 13B shows the absolute value of the difference between the load W _m of each server S _m and the server threshold value α _{m in} this state. The server S ₁ has a reduced load margin by the amount of data handled by the virtual server VS ₃ . On the other hand, the load margin of the server S ₃ is increased by the amount of data handled by the virtual server VS ₃ .

次に、仮想サーバ再配置計算部１１５は、ハッシュ空間上のサーバＳ_１に近い順に、２番目に配置された仮想サーバＶＳ_４を取り除く（図１４（ａ）参照）。この状態での各サーバＳ_ｍの負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値を図１４（ｂ）に示す。サーバＳ_１は、仮想サーバＶＳ_４が担当していたデータ分だけ、負荷の余裕分が減っている。これに対し、サーバＳ_４は、仮想サーバＶＳ_４の担当していたデータ分だけ、負荷の超過分が減っている。 Next, the virtual server rearrangement calculation unit 115 removes the second virtual server VS ₄ that is arranged in order from the server S ₁ in the hash space (see FIG. 14A). FIG. 14B shows the absolute value of the difference between the load W _m of each server S _m and the server threshold value α _{m in} this state. The server S ₁ has a reduced load margin corresponding to the data handled by the virtual server VS ₄ . On the other hand, in the server S ₄ , the excess load is reduced by the amount of data handled by the virtual server VS ₄ .

これで、サーバＳ_１の仮想サーバをすべて取り除いたので、図１２（ｂ）において、負荷がサーバ閾値以下のサーバＳ⁻で、仮想サーバが配置されている次のサーバであるサーバＳ_２を選択する。なお、サーバＳ_３は、仮想サーバが配置されていないので、選択せずに、サーバＳ_２を選択する。そして、ハッシュ空間上のサーバＳ_２に近い順に、１番目に配置された仮想サーバＶＳ_４を取り除く（図１５（ａ）参照）。この状態での各サーバＳ_ｍの負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値を図１５（ｂ）に示す。サーバＳ_２は、仮想サーバＶＳ_４が担当していたデータ分だけ、負荷の余裕分が減っている。これに対し、サーバＳ_４は、仮想サーバＶＳ_４が担当していたデータ分だけ、負荷の超過分が減っている。 Since all the virtual servers of the server S ₁ are now removed, in FIG. 12B, the server S ⁻ whose load is equal to or less than the server threshold and the server S ₂ that is the next server on which the virtual server is arranged is selected. To do. The server S _3, since the virtual server is not arranged, without selecting, selects the server S _2. Then, the virtual server VS ₄ arranged first is removed in the order close to the server S ₂ on the hash space (see FIG. 15A). The absolute value of the difference between the load W _m of each server S _m and the server threshold value α _{m in} this state is shown in FIG. The server S ₂ has a reduced load margin corresponding to the data handled by the virtual server VS ₄ . On the other hand, in the server S ₄ , the excess load is reduced by the amount of data handled by the virtual server VS ₄ .

次に、仮想サーバ再配置計算部１１５は、ハッシュ空間上のサーバＳ_２に近い順に、２番目に配置された仮想サーバＶＳ_５を取り除く処理を行おうとするが、仮想サーバＶＳ_５の担当データ分の負荷が、サーバＳ_２の余裕分より大きいため、仮想サーバＶＳ_５を取り除くと、サーバＳ_２の負荷がサーバ閾値を超えてしまうため、取り除く処理を行わない（図１６参照）。 Next, the virtual server relocation calculation section 115, order close to the server S ₂ on the hash space, but an attempt is made to processing for removing the virtual servers VS ₅ disposed in the second, charge data content of the virtual servers VS ₅ Since the load of the server S ₂ is larger than the margin of the server S ₂ , if the virtual server VS ₅ is removed, the load of the server S ₂ exceeds the server threshold value, so the removal process is not performed (see FIG. 16).

そして、仮想サーバ再配置計算部１１５は、負荷がサーバ閾値以下のサーバＳ⁻のすべてのサーバの処理を終えると、仮想サーバを取り除いても負荷がサーバ閾値を超えない範囲で、その仮想サーバを取り除いた状態の各サーバＳ_ｍの負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値を示す図１６（ｂ）のようになる。仮想サーバ再配置計算部１１５は、負荷がサーバ閾値以下のサーバＳ⁻と、負荷がサーバ閾値を超えるサーバＳ^＋とについて、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順に整列し直し（図１７参照）、仮想サーバ再配置計算を終え、その情報を配置処理部１１３に渡す。そして、配置処理部１１３が、図１１のステップＳ１４の仮想サーバ配置処理（図９および図１０参照）を続ける。 Then, the virtual server relocation calculation section 115, the load is less than the server threshold value server S ^- Upon completion of the processing of all the servers, to the extent that the load be removed virtual servers does not exceed the server threshold value, the virtual server FIG. 16B shows the absolute value of the difference between the load W _m of each server S _m and the server threshold value α _{m in the} removed state. The virtual server rearrangement calculation unit 115 sorts the server S ⁻ whose load is less than or equal to the server threshold and the server S ⁺ whose load exceeds the server threshold in descending order of the absolute value of the difference between the load W _m and the server threshold α _m. Then, the virtual server rearrangement calculation is finished, and the information is passed to the arrangement processing unit 113. Then, the placement processing unit 113 continues the virtual server placement processing (see FIGS. 9 and 10) in step S14 of FIG.

次に、図１８を用いて、仮想サーバ再配置計算部１１５による仮想サーバ再配置計算の流れを、さらに詳細にフローチャートを用いて説明する（適宜図１参照）。 Next, the flow of the virtual server relocation calculation performed by the virtual server relocation calculation unit 115 will be described in more detail using a flowchart (see FIG. 1 as appropriate) with reference to FIG.

まず、仮想サーバ再配置計算部１１５は、初期値として、負荷がサーバ閾値以下のサーバＳ⁻ _kjについて、絶対値Ｏ⁻ _kjが大きい順に１番目のサーバを選択し（ｊ←１）、その選択した負荷がサーバ閾値以下のサーバＳ⁻ _kjに、ハッシュ空間上で近い順で一番目の仮想サーバを初期値として選択する（ｌ←１）（ステップＳ２０１）。 First, the virtual server relocation calculating unit 115 as an initial value, the load is less server threshold of the server S ^- for _kj, absolute value O ^- _kj select the first server to the descending order (j ← 1), the selection The first virtual server is selected as an initial value in the order close to the server S ^- _kj whose load is equal to or less than the server threshold in the hash space (l ← 1) (step S201).

次に、仮想サーバ再配置計算部１１５は、まだ処理していない、負荷がサーバ閾値以下のサーバＳ⁻ _kjがあるか否かを判定する（ｊ＜ｐ＋１）（ステップＳ２０２）。ここで、まだ処理していない、負荷がサーバ閾値以下のサーバＳ⁻ _kjがない、つまり、すべての負荷がサーバ閾値以下のサーバＳ⁻ _kjの処理を終えている場合には（ステップＳ２０２→Ｎｏ）、ステップＳ２１０へ進む。一方、まだ処理していない、負荷がサーバ閾値以下のサーバＳ⁻ _kjがある場合には（ステップＳ２０２→Ｙｅｓ）、次のステップＳ２０３へ進む。 Next, the virtual server relocation calculation unit 115 determines whether there is a server S ^- _kj that has not been processed yet and whose load is equal to or less than the server threshold (j <p + 1) (step S202). Here, in the case where there is no server S ^- _kj whose load is not more than the server threshold, that is, the processing of the server S ^- _kj whose load is not more than the server threshold is finished (step S202 → No) ), Go to step S210. On the other hand, when there is a server S ^- _kj whose load is not more than the server threshold value (step S202 → Yes), the process proceeds to the next step S203.

ステップＳ２０３において、仮想サーバ再配置計算部１１５は、負荷がサーバ閾値以下のサーバＳ⁻ _kjのハッシュ空間上での担当領域に、仮想サーバが配置されているか否かを判定する（ｙ_kj ＞０）。ここでｙ_kjは、負荷がサーバ閾値以下のサーバＳ⁻ _kjに配置された仮想サーバ数を示す。そして、負荷がサーバ閾値以下のサーバＳ⁻ _kjに仮想サーバが配置されていない場合には（ステップＳ２０３→Ｎｏ）、ステップＳ２０９へ進む。一方、仮想サーバが配置されている場合には（ステップＳ２０３→Ｙｅｓ）、次のステップＳ２０４へ進む。 In step S203, the virtual server relocation calculation unit 115 determines whether or not a virtual server is allocated in the assigned area in the hash space of the server S ^- _kj whose load is equal to or less than the server threshold (y _kj > 0). ). Here, y _kj indicates the number of virtual servers arranged in the server S ^- _kj whose load is equal to or less than the server threshold. If no virtual server is placed on the server S ^- _kj whose load is equal to or less than the server threshold (step S203 → No), the process proceeds to step S209. On the other hand, when the virtual server is arranged (step S203 → Yes), the process proceeds to the next step S204.

続いて、仮想サーバ再配置計算部１１５は、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の絶対値（Ｏ⁻ _kj：余裕分）が０を超えているか否かを判定する（Ｏ⁻ _kj ＞０）（ステップＳ２０４）。つまり、負荷がサーバ閾値以下のサーバＳ⁻ _kjが、仮想サーバを取り除いた後でも、まだ、その余裕分が残っているか否かを判定する。なお、このステップＳ２０４を最初に実行する際は、ステップＳ２０２で、負荷がサーバ閾値以下のサーバが１つでも存在すれば、当然に、余裕分が残っていることになる。ここで、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の絶対値（Ｏ⁻ _kj：余裕分）が０以下、つまり余裕分が残っていない場合は（ステップＳ２０４→Ｎｏ）、ステップＳ２０９へ進む。一方、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の絶対値（Ｏ⁻ _kj：余裕分）が０より大きい、つまり余裕分が残っている場合には（ステップＳ２０４→Ｙｅｓ）、次のステップＳ２０５へ進む。 Subsequently, the virtual server relocation calculation unit 115 determines whether or not the absolute value (O ⁻ _kj : margin) of the load of the server S ⁻ _kj whose load is equal to or less than the server threshold exceeds 0 (O ⁻ _kj). > 0) (step S204). That is, it is determined whether or not the server S ^- _kj whose load is equal to or less than the server threshold still has a margin after the virtual server is removed. When step S204 is executed for the first time, if at least one server having a load equal to or lower than the server threshold exists in step S202, a margin is naturally left. Here, when the absolute value (O ^- _kj : margin) of the load of the server S ^- _kj whose load is not more than the server threshold is 0 or less, that is, when there is no margin (step S204 → No), the process proceeds to step S209. . On the other hand, when the absolute value (O ^- _kj : margin) of the load of the server S ^- _kj whose load is equal to or less than the server threshold is greater than 0, that is, when the margin remains (step S204 → Yes), the next step The process proceeds to S205.

次に、仮想サーバ再配置計算部１１５は、負荷がサーバ閾値以下のサーバＳ⁻ _kjの担当領域に、まだ未処理の仮想サーバが配置されているか否かを判定する（ｌ＜ｙ_kj＋１）（ステップＳ２０５）。つまり、取り除くことができる可能性のある仮想サーバが残っているか否かを判定する。ここで、未処理の仮想サーバがハッシュ空間上に配置されていない場合には（ステップＳ２０５→Ｎｏ）、ステップＳ２０９へ進む。一方、未処理の仮想サーバがそのサーバのハッシュ空間上の担当領域に残っている場合には（ステップＳ２０５→Ｙｅｓ）、次のステップＳ２０６へ進む。 Next, the virtual server relocation calculation unit 115 determines whether or not an unprocessed virtual server is still allocated in the area in charge of the server S ⁻ _kj whose load is equal to or less than the server threshold (l <y _kj +1). (Step S205). That is, it is determined whether or not there is a virtual server that can be removed. Here, when an unprocessed virtual server is not arranged in the hash space (step S205 → No), the process proceeds to step S209. On the other hand, if an unprocessed virtual server remains in the assigned area in the hash space of the server (step S205 → Yes), the process proceeds to the next step S206.

ステップＳ２０６において、仮想サーバ再配置計算部１１５は、仮想サーバの担当する負荷（Ｚ_kj(l)）が、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の余裕分の絶対値（Ｏ⁻ _kj：余裕分）より小さいか否かを判定する（Ｚ_kj(l) ＜Ｏ⁻ _kj）。つまり、仮想サーバを取り除いても、その仮想サーバの基のサーバの負荷が、サーバ閾値を超えないか否かを判定する。ここで、仮想サーバの担当する負荷（Ｚ_kj(l)）が、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の絶対値（Ｏ⁻ _kj：余裕分）以上の場合は（ステップＳ２０６→Ｎｏ）、ステップＳ２０９へ進む。一方、仮想サーバの担当する負荷（Ｚ_kj(l)）が、負荷がサーバ閾値以下のサーバＳ⁻ _kjの負荷の絶対値（Ｏ⁻ _kj：余裕分）より小さい場合には（ステップＳ２０６→Ｙｅｓ）、次のステップＳ２０７へ進む。 In step S206, the virtual server rearrangement calculation unit 115 determines that the load (Z _kj (l)) in charge of the virtual server is an absolute value (O ⁻ _kj ) of the load margin of the server S ⁻ _kj whose load is equal to or less than the server threshold. : It is determined whether it is smaller than the margin) (Z _kj (l) <O ⁻ _kj ). That is, even if the virtual server is removed, it is determined whether or not the load on the base server of the virtual server does not exceed the server threshold. Here, when the load (Z _kj (l)) in charge of the virtual server is equal to or greater than the absolute value (O ⁻ _kj : margin) of the load of the server S ⁻ _kj whose load is equal to or less than the server threshold (step S206 → No). ), Go to step S209. On the other hand, when the load handled by the virtual server (Z _kj (l)) is smaller than the absolute value (O ⁻ _kj : margin) of the load of the server S ⁻ _kj whose load is equal to or less than the server threshold (step S206 → Yes). ), And proceeds to the next Step S207.

ステップＳ２０７において、仮想サーバ再配置計算部１１５は、仮想サーバをハッシュ空間上から取り除く。そして、仮想サーバ再配置計算部１１５は、その領域を担当する元のサーバに負荷を戻す処理を行う。具体的には、仮想サーバ（ｖ_k(l)）を取り除いた場合の負荷を以下の式により計算する。 In step S207, the virtual server relocation calculation unit 115 removes the virtual server from the hash space. Then, the virtual server rearrangement calculation unit 115 performs processing for returning the load to the original server in charge of the area. Specifically, the load when the virtual server (v _k (l)) is removed is calculated by the following equation.

ここで、Wv_kj(l)は、仮想サーバを取り除いた元のサーバの負荷を示す。 Here, Wv _kj (l) indicates the load of the original server from which the virtual server is removed.

そして、仮想サーバ再配置計算部１１５は、負荷がサーバ閾値以下のサーバＳ⁻ _kjの担当領域の、次の仮想サーバを選択し（ｌ←ｌ＋１）（ステップＳ２０８）、ステップＳ２０４へ戻り、処理を続ける。 Then, the virtual server rearrangement calculation unit 115 selects the next virtual server in the area in charge of the server S ^- _kj whose load is equal to or less than the server threshold (l ← l + 1) (step S208), returns to step S204, and performs processing. to continue.

一方、ステップＳ２０３→Ｎｏ、ステップＳ２０４→Ｎｏ、ステップＳ２０５→Ｎｏ、ステップＳ２０６→Ｎｏの場合には、ステップＳ２０９において、仮想サーバ再配置計算部１１５は、サーバ負荷計算部１１２が計算し整列した後の、負荷がサーバ閾値以下のサーバＳ⁻ _kjのうち、次に絶対値が大きいサーバを選択し（ｊ←ｊ＋１）、ステップＳ２０２へ戻る。 On the other hand, in the case of step S203 → No, step S204 → No, step S205 → No, step S206 → No, in step S209, the virtual server relocation calculation unit 115 calculates and arranges the server load calculation unit 112. Of the servers S ⁻ _kj whose load is equal to or less than the server threshold, the server having the next largest absolute value is selected (j ← j + 1), and the process returns to step S202.

そして、仮想サーバ再配置計算部１１５は、ステップＳ２０２において、負荷がサーバ閾値以下のサーバＳ⁻ _kjすべての処理を終えている場合には（ステップＳ２０２→Ｎｏ）、ステップＳ２０７において再計算された各サーバの負荷Ｗ_ｍに基づき、負荷がサーバ閾値以下のサーバＳ⁻と、負荷がサーバ閾値を超えるサーバＳ^＋とについて、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値が大きい順に整列する（ステップＳ２１０）。そして、その情報を、配置処理部１１３に渡す。配置処理部１１３では、図９および図１０で示した仮想サーバ配置処理を実行する。 Then, in step S202, the virtual server rearrangement calculation unit 115, when the processing of all the servers S ^- _kj whose loads are equal to or less than the server threshold has been completed (step S202 → No), each of the recalculated values in step S207. Based on the server load W _m , the server S ⁻ whose load is equal to or less than the server threshold and the server S ⁺ whose load exceeds the server threshold are arranged in descending order of the absolute value of the difference between the load W _m and the server threshold α _m. (Step S210). Then, the information is passed to the arrangement processing unit 113. The placement processing unit 113 executes the virtual server placement processing shown in FIGS. 9 and 10.

このようにすることで、本実施形態に係るデータ負荷分散配置システム１およびデータ負荷分散配置方法による仮想サーバの再配置によれば、所定間隔ごとの担当データ数の増減や各データへの実アクセス数に応じた、各ＤＢサーバ２０の負荷を計算する。そして、負荷がサーバ閾値以下のサーバＳ⁻に含まれることになったＤＢサーバ２０のハッシュ空間上での担当領域に、仮想サーバが配置されている場合に、その仮想サーバの担当領域を、元のＤＢサーバの担当領域に戻すことができる。そして、負荷Ｗ_ｍとサーバ閾値α_ｍとの差の絶対値を計算し直すことにより、負荷がサーバ閾値以下のサーバＳ⁻の集合と、負荷がサーバ閾値を超えるサーバＳ^＋の集合とを生成して、再度、ハッシュ空間上での仮想サーバの配置を計算する。これにより、各ＤＢサーバ２０の負荷を均一化させるとともに、ハッシュ空間上に配置する仮想サーバの数を減らすことができる。 In this way, according to the virtual server rearrangement by the data load distribution and arrangement system 1 and the data load distribution and arrangement method according to the present embodiment, the number of assigned data increases or decreases at each predetermined interval and the actual access to each data The load on each DB server 20 is calculated according to the number. When a virtual server is arranged in the area in charge on the hash space of the DB server 20 whose load is included in the server S ⁻ having a server threshold value or less, the area in charge of the virtual server is It is possible to return to the area in charge of the DB server. Then, by recalculating the absolute value of the difference between the load W _m and the server threshold value α _m , a set of servers S ⁻ whose load is equal to or less than the server threshold value and a set of servers S ⁺ whose load exceeds the server threshold value are generated. Then, the arrangement of the virtual server on the hash space is calculated again. As a result, the load on each DB server 20 can be made uniform and the number of virtual servers arranged in the hash space can be reduced.

なお、本実施形態において仮想サーバ配置装置１０は、コンシステントハッシュ法の領域分割方式として、時計回り方式を用いて、ハッシュ空間上に仮想サーバを配置するものとして説明した。しかし、本発明はこれに限定されず、反時計回り方式や、各データがリング上で最も隣接するサーバにデータを格納する最近値方式等を用いることもできる。 In the present embodiment, the virtual server placement apparatus 10 has been described as placing a virtual server on a hash space using a clockwise method as a region partitioning method of the consistent hash method. However, the present invention is not limited to this, and a counterclockwise method, a nearest value method in which each data is stored in a server closest to the ring, or the like can also be used.

１データ負荷分散配置システム
５管理者サーバ
６クライアント
１０仮想サーバ配置装置
１１制御部
１２，２２通信部
１３，２３メモリ部
１４，２４記憶部
２０ＤＢサーバ
２１処理部
１００サーバ担当データ情報
１１１，２１１情報受信部
１１２サーバ負荷計算部
１１３配置処理部
１１４，２１３ハッシュ値計算部
１１５仮想サーバ再配置計算部
１１６，２１６情報送信部
２００データ格納部
２１２データ管理部
２１４アクセス監視部
２１５データ交換部
３００アクセス回数情報 DESCRIPTION OF SYMBOLS 1 Data load distribution arrangement system 5 Administrator server 6 Client 10 Virtual server arrangement apparatus 11 Control part 12,22 Communication part 13,23 Memory part 14,24 Storage part 20 DB server 21 Processing part 100 Server charge data information 111,211 information Reception unit 112 Server load calculation unit 113 Allocation processing unit 114, 213 Hash value calculation unit 115 Virtual server relocation calculation unit 116, 216 Information transmission unit 200 Data storage unit 212 Data management unit 214 Access monitoring unit 215 Data exchange unit 300 Number of accesses information

Claims

A data load distribution arrangement system comprising a plurality of DB servers that are communicably connected to each other, and a virtual server arrangement apparatus that is communicably connected to each of the plurality of DB servers,
The DB server
(1) Data to be searched from the client, data key associated with each of the data, (2) server performance of the DB server itself, and (3) the DB server of the storage destination of the data A storage unit for storing server data data to be shown;
A data management unit for transmitting the data key and the server performance to the virtual server placement device;
Data received from the virtual server placement device is received by the server, and compared with the stored server charge data information. The other DB server that is the data storage destination is extracted, and a data exchange request is sent to the extracted other DB server, so that the data that is not stored by itself is acquired, and the stored server manager A data exchanging unit that updates data information to the new server charge data information,
The virtual server placement device is:
A storage unit that stores processing performance required for the entire data load distribution and arrangement system, and a load threshold constant indicating a threshold of load set for each of the DB servers;
The server performance obtained from each of the DB servers and the load threshold constant are used to calculate a server threshold that is a load threshold of each of the DB servers, and the processing performance required for the entire data load distribution arrangement system And the total number of data keys acquired from each DB server, and the load on each DB server is calculated on the assumption that the number of accesses to the data is equal. Then, for each DB server, the absolute value of the difference between the load and the server threshold is calculated, the set of servers S ⁻ whose load is equal to or less than the server threshold, and the server S ⁺ whose load exceeds the server threshold A server load calculation unit for generating a set of
For each of the DB server and the data, by calculating a hash value using a predetermined hash function, an area in charge of the DB server responsible for storing the data is set on the hash space, and the server S ⁺ From the DB server in which the absolute value of the difference between the load and the server threshold is large, the number of data that can be handled in the server S ^{+ in} the hash space is the server threshold of the own DB server. relative excess of greater than, the server S ^- of the set of the virtual server absolute value from large DB server in order to place, the coverage area in said hash space data of the excess to the virtual server Before the storage location of the data that each DB server is in charge of based on the changed area in the changed hash space Generates new server contact data information, a new server contact data information the generating, by and a placement processing section that transmits to each of the DB server,
A data load distribution arrangement system characterized by

The DB server
An access monitoring unit that monitors increase / decrease of the data stored by itself and the number of accesses to the data from the client, and generates access count information indicating the number of accesses to each of the data;
The data management unit transmits the data key of the data stored in the storage unit and the access count information to the virtual server placement device at predetermined intervals,
The virtual server placement device is:
A virtual server relocation calculation unit;
The server load calculation unit uses the number of data, which is the number of data keys, and the number of accesses to the data, included in the access count information acquired from each DB server at each predetermined interval, By calculating the load on each of the DB servers at predetermined intervals, a set of the servers S ^{− and} a set of the servers S ⁺ are generated,
The virtual server relocation calculator, the server S ^- of a set of, for the DB server where the virtual server is located on the hash space, the absolute value of a large DB servers in the order, the virtual server If the load of the DB server does not exceed the server threshold of the DB server when the charge area is returned to the charge area of the DB server that was assigned before the change, the hash value of the charge area of the virtual server is the hash removed from the space, the recalculates the DB server each of the load, for each of the DB server, by recalculating the absolute value of the difference between the load and the server threshold value, the server S ^- a set of, Re-generating the set of servers S ⁺ and delivering them to the placement processing unit;
The data load distribution and arrangement system according to claim 1.

A data load distribution arrangement method of a data load distribution arrangement system, comprising: a plurality of DB servers that are communicably connected to each other; and a virtual server arrangement apparatus that is communicably connected to each of the plurality of DB servers,
The DB server
(1) Data to be searched from the client, data key associated with each of the data, (2) server performance of the DB server itself, and (3) the DB server of the storage destination of the data A storage unit for storing server data data to be shown,
Executing the step of transmitting the data key and the server performance to the virtual server placement device;
The virtual server placement device is:
A processing unit required for the entire data load distribution and arrangement system, and a storage unit storing a load threshold constant indicating a load threshold set for each of the DB servers,
The server performance obtained from each of the DB servers and the load threshold constant are used to calculate a server threshold that is a load threshold of each of the DB servers, and the processing performance required for the entire data load distribution arrangement system And the total number of data keys acquired from each DB server, and the load on each DB server is calculated on the assumption that the number of accesses to the data is equal. Then, for each DB server, the absolute value of the difference between the load and the server threshold is calculated, the set of servers S ⁻ whose load is equal to or less than the server threshold, and the server S ⁺ whose load exceeds the server threshold Generating a set of
For each of the DB server and the data, by calculating a hash value using a predetermined hash function, an area in charge of the DB server responsible for storing the data is set on the hash space, and the server S ⁺ From the DB server in which the absolute value of the difference between the load and the server threshold is large, the number of data that can be handled in the server S ^{+ in} the hash space is the server threshold of the own DB server. relative excess of greater than, the server S ^- of the set of the virtual server absolute value from large DB server in order to place, the coverage area in said hash space data of the excess to the virtual server And a new storage location indicating the data storage location of each of the DB servers based on the changed area in the hash space. Generate a server contact data information, a new server contact data information the generating, running, and transmitting to each of the DB server,
The DB server
By receiving the new server charge data information from the virtual server placement device and comparing it with the stored server charge data information, the data not stored in the storage unit of the own DB server and the data The other DB server that is the data storage destination is extracted, and a data exchange request is sent to the extracted other DB server, so that the data that is not stored by itself is acquired, and the stored server manager Performing a step of updating data information to the new server data information;
A data load distribution arrangement method characterized by the above.

The DB server
Monitoring the increase / decrease in the data stored by itself and the number of accesses to the data from the client, and generating access count information indicating the number of accesses to each of the data;
Further executing the step of transmitting the data key of the data stored in the storage unit and the access count information to the virtual server placement device at predetermined intervals,
The virtual server placement device is:
The DB server at each predetermined interval using the number of data that is the number of the data keys and the number of accesses to the data included in the access count information acquired from each DB server at each predetermined interval Generating the set of servers S ^{− and} the set of servers S ⁺ by calculating each of the loads;
The server S ^- of a set of, for the DB server where the virtual server is located on the hash space, in order from the large absolute value DB server, responsible for coverage area of the virtual server before the change If the load on the DB server does not exceed the server threshold value of the DB server when returning to the assigned area of the DB server, the virtual server is removed from the hash space, and each DB server And recalculate the absolute value of the difference between the load and the server threshold value for each DB server, thereby re-establishing the set of the servers S ^{− and} the set of the servers S ⁺ Performing the generating step;
The data load distribution and arrangement method according to claim 3.