JP2013182553A

JP2013182553A - Management device and program

Info

Publication number: JP2013182553A
Application number: JP2012047725A
Authority: JP
Inventors: Eriko Iwasa; 絵里子岩佐; Satoru Kondo; 悟近藤; Michio Irie; 道生入江; Masashi Kaneko; 雅志金子; Takeshi Fukumoto; 健福元
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2012-03-05
Filing date: 2012-03-05
Publication date: 2013-09-12
Anticipated expiration: 2032-03-05
Also published as: JP5745445B2

Abstract

PROBLEM TO BE SOLVED: To reduce the processing load generated during data replication even when a server is added to a cluster that performs distributed processing. SOLUTION: When adding a server 5 to a cluster 100, a management device 4 in a distributed processing system 1000 selects a server 5 that physically exists in the area corresponding to the position in the ID space determined for the server 5 to be added. This reduces the processing load generated during data replication and enables high-speed processing.

Description

The present invention relates to a management device that manages a plurality of servers constituting a cluster that performs data processing cooperatively.

In recent years, with the rise of cloud computing, there has been a demand to process and hold large amounts of data efficiently. In response, distributed processing technology has developed, which achieves efficient processing by operating a plurality of servers cooperatively.

When performing distributed processing, the data to be processed (managed) must be distributed among the servers constituting the cluster (hereinafter also referred to as “cluster members” or “members”). To raise the processing capacity of the cluster as a whole, it is desirable that the number of data items (amount of data) handled by each cluster member be evened out.

A typical data distribution method assigns each data item to the cluster member whose number equals the remainder obtained by dividing the value of a hash function applied to the key of the data item (hereinafter referred to as “hash(key)”) by the number N of cluster members, that is, “hash(key) mod N”. This presupposes that the numbers “0” to “N-1” have been assigned to the cluster members in advance. With this method, adding a cluster member changes the value of N, so the cluster member in charge changes for much of the data, and that data must be relocated.
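The relocation cost of the “hash(key) mod N” method can be illustrated with a minimal sketch. The hash function (SHA-256 truncated to 64 bits), the key names, and the member counts 4 and 5 are assumptions chosen for illustration and do not come from the publication. For a uniform hash, a key keeps its member when N goes from 4 to 5 only if hash(key) mod 20 is less than 4, so roughly four out of five keys move.

```python
import hashlib


def stable_hash(key: str) -> int:
    """Deterministic hash (Python's built-in hash() is salted per process)."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def member_for(key: str, n_members: int) -> int:
    """Return the member number (0 .. N-1) in charge of the key."""
    return stable_hash(key) % n_members


def moved_fraction(keys, n_before: int, n_after: int) -> float:
    """Fraction of keys whose member in charge changes when N changes."""
    moved = sum(
        1 for k in keys if member_for(k, n_before) != member_for(k, n_after)
    )
    return moved / len(keys)


# Hypothetical keys; any large set of distinct keys behaves similarly.
keys = [f"key-{i}" for i in range(10_000)]
fraction = moved_fraction(keys, 4, 5)
print(f"{fraction:.0%} of keys change member when N goes from 4 to 5")
```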

A distribution method using consistent hashing (see Non-Patent Document 1) limits the number of data items whose cluster member in charge changes when a cluster member is added to about 1/N. Consistent hashing is used in Amazon Dynamo (see Non-Patent Document 2) and elsewhere.

In the data distribution method using consistent hashing, an ID (IDentifier) is assigned to both cluster members and data, and the first cluster member encountered when tracing the ID space clockwise from the ID of a data item is placed in charge of that data item.

In addition, when a large amount of data is managed by a distributed processing system with a cluster configuration, data redundancy must be achieved by holding copies of the data so that processing can continue on other cluster members even if a failure occurs in one cluster member. The same applies to a distributed processing system that uses a data management method based on consistent hashing.

As shown in FIG. 7, in consistent hashing, IDs are assigned to both cluster members (members 1 to 4) and data (data A to D, indicated by black circles (●)), and the first cluster member encountered when tracing the ID space clockwise from the ID of a data item is determined to be in charge of that data item. The cluster member immediately to the right of the member in charge (the next one clockwise) is then put in charge of the replica.

For example, in FIG. 7, member 1, the first member encountered when tracing the ID space clockwise from data A, is in charge of data A, and member 2, the right-hand neighbor of member 1 in the ID space, is in charge of its replica. Determining the cluster members in charge of the original data and the replica in this way has the advantage that, even if a cluster member leaves, the cluster member holding the replica can take over as the new member in charge of the data. When multiple replicas are kept, the cluster member one further to the right can be put in charge of the second replica.
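The conventional scheme described above (the member in charge is the clockwise successor of the data ID, and the replica goes to the next member clockwise) can be sketched as follows. This is an illustrative sketch only: the 32-bit ID space, the SHA-256-based ID assignment, the `Ring` class, and the member and data names are assumptions, not details taken from the publication.

```python
import bisect
import hashlib

ID_SPACE = 2 ** 32  # assumed size of the circular ID space


def to_id(name: str) -> int:
    """Map a member name or data key to an ID on the ring (assumed scheme)."""
    digest = hashlib.sha256(name.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % ID_SPACE


class Ring:
    def __init__(self, members):
        # Sorted list of (ID, member name); order corresponds to clockwise.
        self.entries = sorted((to_id(m), m) for m in members)

    def owners(self, key: str, copies: int = 2):
        """Return the member in charge, followed by the replica holder(s)."""
        ids = [member_id for member_id, _ in self.entries]
        # First member at or after the data ID; wraps around via the modulo.
        start = bisect.bisect_left(ids, to_id(key))
        count = min(copies, len(self.entries))
        return [
            self.entries[(start + k) % len(self.entries)][1]
            for k in range(count)
        ]

    def remove(self, member: str) -> None:
        """Remove a member that has left the cluster."""
        self.entries = [(i, m) for i, m in self.entries if m != member]


ring = Ring(["member1", "member2", "member3", "member4"])
primary, replica = ring.owners("data-A")
ring.remove(primary)                      # the member in charge leaves
new_primary = ring.owners("data-A")[0]    # the former replica holder takes over
print(primary, replica, new_primary)
```

After the member in charge is removed, the clockwise successor of the data ID is the member that already held the replica, which is the takeover property the text describes.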

Non-Patent Document 1: David Karger et al., “Consistent Hashing and Random Trees: Distributed Caching Protocols for Relieving Hot Spots on the World Wide Web”, [online], 1997, ACM, [retrieved February 20, 2012], Internet <URL: http://www.akamai.com/dl/technical_publications/ConsistenHashingandRandomTreesDistributedCachingprotocolsforrelievingHotSpotsontheworldwideweb.pdf>
Non-Patent Document 2: Giuseppe DeCandia et al., “Dynamo: Amazon’s Highly Available Key-value Store”, [online], 2007, ACM, [retrieved February 20, 2012], Internet <URL: http://www.allthingsdistributed.com/files/amazon-dynamo-sosp2007.pdf>

As described above, the data management method based on consistent hashing places the replica on the right-hand neighbor of the cluster member in charge of the data. Here, as shown in FIG. 8, assume that the members constituting the cluster are selected from K (here, five) geographically separated data center areas (areas under the jurisdiction of the data centers that manage the servers).

In this case, servers are normally placed at random in the ID space of consistent hashing without regard to their physical locations, so cluster members that are adjacent in the ID space may be physically far apart. The network cost of creating a replica on the right-hand neighbor in the ID space then becomes very large, which may cause processing delays. Note that the physical distance between cluster members represents the NW (network) cost of communication between servers; it can be determined by considering not only the literal physical distance but also, for example, the topology and the NW bandwidth.

The present invention has been made in view of these circumstances. Its object is to reduce the processing load generated during data replication by having the management device in a distributed processing system select servers in consideration of their physical locations when determining to which of a plurality of servers a request from a client machine is to be distributed.

To solve the above problem, the present invention is a management device that determines to which of a plurality of servers a request from a client machine is to be distributed in a distributed processing system in which a plurality of data items to be managed, and a plurality of servers that manage the data and constitute a cluster, are allocated to a circular ID space, and in which each server manages the data located in the ID space between itself and the next server in a predetermined direction and stores a copy of the data located between that next server and the server after it in the predetermined direction. The management device comprises a storage unit and a processing unit. The storage unit stores ID space management information for managing the servers in charge of managing the data in the ID space; ID/region allocation information for managing, on the premise that the ID space is divided into as many parts as there are predetermined physical regions, the correspondence between the regions and the divided ID space; and server management information for managing, for each region, the correspondence with the servers of the cluster and the standby servers that physically exist in that region. When a standby server is to be added to the cluster, the processing unit refers to the ID/region allocation information to identify the region corresponding to the insertion position in the ID space of the server to be added, refers to the server management information to select a server that physically exists in the identified region, and updates the ID space management information using the selected server and the insertion position.

According to this, when a standby server is added to the cluster, selecting a server that physically exists in the region corresponding to the insertion position in the ID space reduces the processing load generated during data replication.

The present invention is also a management device that determines to which of a plurality of servers a request from a client machine is to be distributed in a distributed processing system in which a plurality of data items to be managed, and a plurality of servers that manage the data and constitute a cluster, are allocated to a circular ID space, and in which each server manages the data located in the ID space between itself and the next server in a predetermined direction and stores a copy of the data located between that next server and the server after it in the predetermined direction. The management device comprises a storage unit and a processing unit. The storage unit stores ID space management information for managing the servers in charge of managing the data in the ID space; ID/region allocation information for managing, on the premise that the ID space is divided into as many parts as there are predetermined physical regions, the correspondence between the regions and the divided ID space; and server management information for managing, for each region, the correspondence with the servers of the cluster and the standby servers that physically exist in that region. When a standby server is to be added to the cluster, the processing unit refers to the ID space management information to identify the widest gap between servers in the ID space, refers to the ID/region allocation information to identify the region corresponding to the center position of that gap in the ID space, refers to the server management information to select a server that physically exists in the identified region, and updates the ID space management information using the selected server and the center position.

According to this, when a standby server is added to the plurality of servers, selecting a server that physically exists in the region corresponding to the center of the widest gap between servers in the ID space reduces the processing load generated during data replication and also evens out the processing load across the servers. In this specification and the claims, the “center position” includes not only the exact center but also positions near the center.

Further, the present invention is characterized in that the region is a data center area under the jurisdiction of a data center that manages the servers.

According to this, adopting data center areas as the regions makes the cluster and the management device easier to operate.

Further, the present invention is a program for causing a computer to function as the management device.

According to this, a computer on which such a program is installed can be made to function as the management device.

According to the present invention, the management device in a distributed processing system selects servers in consideration of their physical locations when determining to which of a plurality of servers a request from a client machine is to be distributed, which reduces the processing load generated during data replication.

FIG. 1 is an explanatory diagram of the outline of the present invention.
FIG. 2 is a configuration diagram of the management device and related components of this embodiment.
FIG. 3 is a diagram showing the ID space management information of this embodiment.
FIG. 4 is a diagram showing the ID/region allocation information of this embodiment.
FIG. 5 is a diagram showing the server management information of this embodiment.
FIG. 6 is a flowchart showing the flow of processing by the management device of this embodiment.
FIG. 7 is an explanatory diagram of conventional consistent hashing.
FIG. 8 is an explanatory diagram of the physical locations of cluster members in conventional consistent hashing.

Hereinafter, a mode for carrying out the present invention (hereinafter referred to as the “embodiment”) will be described with reference to the drawings (including, where appropriate, drawings other than the one mentioned). To aid understanding, an outline of the embodiment is first described with reference to FIG. 1, and the embodiment is then described in detail.

(Outline of this embodiment)
As shown in FIG. 1, the ID space is divided in advance according to the number of data center areas (here, five). When a server is to be added to the cluster, the insertion destination (synonymous with “insertion position”; an ID) of the new cluster member in the ID space is determined ((1) in FIG. 1).

After the insertion destination of the new cluster member in the ID space has been determined ((1) in FIG. 1), the data center area corresponding to that insertion destination is identified, a server that physically exists in that data center area is selected ((2) in FIG. 1), and the server is incorporated into the cluster as a member. This algorithm makes it possible to select servers in consideration of their physical locations, which reduces the processing load generated during data replication and evens out the processing load across the servers.

(Embodiment)
Next, the embodiment will be described. As shown in FIG. 2, the distributed processing system 1000 of this embodiment includes a load balancer 3, a management device 4, and a plurality of servers 5 constituting a cluster 100. The load balancer 3 is connected to a plurality of client machines 1 via a network 2 such as the Internet.

The main overall operation is as follows. The load balancer 3 receives a data processing request from a client machine 1 via the network 2. Based on the server allocation table for the ID space of the data (the ID space management information), the load balancer 3 distributes the request to one of the plurality of servers 5 that perform data processing. The server 5 to which the request is distributed processes it. The management device 4 manages the ID space management information.

Next, the configurations of the load balancer 3 and the management device 4 will be described.
The load balancer 3 includes a storage unit 31, a processing unit 32, and a communication unit 33.
The storage unit 31 is a means for storing information and is composed of memory such as RAM (Random Access Memory) and ROM (Read Only Memory), an HDD (Hard Disk Drive), and the like. The storage unit 31 stores the ID space management information 411 received from the management device 4 as ID space management information 311. The storage unit 31 also stores the operating program of the processing unit 32 and the like (not shown).

The processing unit 32 is a means for performing arithmetic processing based on the information stored in the storage unit 31 and is implemented by, for example, a CPU (Central Processing Unit).

The communication unit 33 is a communication interface used for communication with external devices.
The load balancer 3 may additionally include an input unit through which its user enters information, a display unit that displays information, and the like.

The management device 4 is a computer device that determines, based on consistent hashing, to which of the plurality of servers 5 a request from a client machine 1 is to be distributed. As described above, this consistent hashing presupposes that a plurality of data items to be managed, and a plurality of servers 5 (cluster members) that manage the data and constitute the cluster 100, are allocated to a circular ID space, and that each server 5 manages (is in charge of) the data located in the ID space between itself and the next server 5 clockwise (in the predetermined direction) and stores a copy of the data located between that next server 5 and the server 5 after it clockwise.

The management device 4 includes a storage unit 41, a processing unit 42, an input unit 43, a display unit 44, and a communication unit 45.
The storage unit 41 is a means for storing information and is composed of memory such as RAM and ROM, an HDD, and the like. The storage unit 41 stores ID space management information 411, ID/region allocation information 412, and server management information 413. The storage unit 41 also stores the operating program of the processing unit 42 and the like (not shown).

The ID space management information 411 is information for managing the server 5 in charge of each data item, using the ID calculated by applying a predetermined hash value conversion to the data to be managed (described later with reference to FIG. 3).

The ID/region allocation information 412 is information for managing, on the premise that the ID space is divided into as many parts as there are predetermined physical regions, the correspondence between the regions and the divided ID space (described later with reference to FIG. 4).

The server management information 413 is information for managing, for each region, the correspondence with the plurality of servers 5 that physically exist in that region (described later with reference to FIG. 5).

The processing unit 42 is a means for performing arithmetic processing based on the information stored in the storage unit 41 and is implemented by, for example, a CPU. When a new (standby) server 5 is to be added to the cluster 100, the processing unit 42 first determines the insertion position in the ID space for the new server 5. Next, it refers to the ID/region allocation information 412 to identify the region corresponding to the insertion position. It then refers to the server management information 413 to select a server 5 that physically exists in the identified region. Finally, it updates the ID space management information 411 using the selected server and the insertion position. Details are given later in the description of FIG. 6.

When adding a new server 5 to the cluster 100, the processing unit 42 may, as one example, proceed as follows. It first refers to the ID space management information 411 to identify the widest gap between servers 5 in the ID space. Next, it refers to the ID/region allocation information 412 to identify the region corresponding to the center position of that gap in the ID space. It then refers to the server management information 413 to select a server 5 that physically exists in the identified region. Finally, it updates the ID space management information 411 using the selected server and the center position. Details are given later in the description of FIG. 6.

The input unit 43 is a means through which the user of the management device 4 enters information and is implemented by, for example, a keyboard and a mouse.
The display unit 44 is a means for displaying information and is implemented by, for example, an LCD (Liquid Crystal Display).
The communication unit 45 is a communication interface used for communication with external devices.

Next, the ID space management information 411 will be described. As shown in FIG. 3, the ID space management information 411 consists of an ID column and a server column, and its rows are sorted by ID value.
The ID is an ID in the ID space and is stored in order to identify the range of data that the server 5 is in charge of managing.
The server is the identifier of a server 5 constituting the cluster 100.

For example, the ID value “0056” in the first row of FIG. 3 indicates that “Server A” is in charge of data belonging to the range of identifiers “0000” to “0056”. The ID value “0172” in the second row indicates that “Server B” is in charge of data belonging to the range of identifiers from “0057”, obtained by adding 1 to the ID value of the preceding row, to “0172”.
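A lookup against a table of this shape can be sketched with a binary search over the sorted ID column. Only the first two rows (“0056” for Server A, “0172” for Server B) come from the text; the third row, the integer representation of the IDs, and the wrap-around to the first row for IDs beyond the last row are assumptions made for illustration.

```python
import bisect

# Rows sorted by ID, as in FIG. 3. The first two rows come from the text;
# the third row is a hypothetical value added for illustration.
ID_SPACE_TABLE = [
    (56, "Server A"),    # in charge of IDs 0000 to 0056
    (172, "Server B"),   # in charge of IDs 0057 to 0172
    (310, "Server C"),   # hypothetical row
]


def server_in_charge(data_id: int, table=ID_SPACE_TABLE) -> str:
    """Return the server whose row covers the given data ID."""
    ids = [row_id for row_id, _ in table]
    # Index of the first row whose ID is greater than or equal to data_id.
    index = bisect.bisect_left(ids, data_id)
    # Assumption: IDs beyond the last row wrap around to the first row,
    # since the ID space is circular.
    return table[index % len(table)][1]


print(server_in_charge(56))   # Server A
print(server_in_charge(57))   # Server B
```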

Next, the ID/region allocation information 412 will be described. As shown in FIG. 4, the ID/region allocation information 412 consists of a region ID column and an ID column.
The region ID is the identifier of one of the predetermined physical regions.
The ID is the range of IDs in the ID space that corresponds to the region identified by the region ID in that row.
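Identifying the region for a given insertion position amounts to a range lookup in this table. The sketch below is illustrative only: the contents of FIG. 4 are not reproduced in the text, so the five region identifiers, the assumed ID space of 0000 to 9999, and the equal-width ranges are all hypothetical.

```python
# Hypothetical contents standing in for FIG. 4: five regions dividing an
# assumed ID space of 0000 to 9999 into equal ranges (inclusive bounds).
REGION_TABLE = [
    ("region-1", 0, 1999),
    ("region-2", 2000, 3999),
    ("region-3", 4000, 5999),
    ("region-4", 6000, 7999),
    ("region-5", 8000, 9999),
]


def region_for(insert_id: int, table=REGION_TABLE) -> str:
    """Return the region ID whose ID range contains the insertion position."""
    for region_id, low, high in table:
        if low <= insert_id <= high:
            return region_id
    raise ValueError(f"ID {insert_id} is outside the ID space")


print(region_for(2500))  # region-2
```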

Next, the server management information 413 will be described. As shown in FIG. 5, the server management information 413 consists of a region ID column and a server column.
The region ID is the identifier of one of the predetermined physical regions.
The server is the identifier of a server 5 that physically exists in the region identified by the corresponding region ID. Some of these servers 5 are not yet in use as members of the cluster 100.
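Selecting a standby server in a given region can be sketched as a filter over this table. The table contents, the server names, and the rule for recognizing a standby server (listed for the region but not currently a cluster member) are assumptions for illustration; the text does not say how a standby server is chosen among several candidates or what happens when none is available, so this sketch simply takes the first candidate and returns `None` otherwise.

```python
# Hypothetical contents standing in for FIG. 5: region ID -> servers that
# physically exist in that region, including servers not yet in the cluster.
SERVER_TABLE = {
    "region-1": ["Server A", "Server F"],
    "region-2": ["Server B", "Server G"],
}


def select_standby(region_id, active_servers, table=SERVER_TABLE):
    """Return a server in the region that is not yet a cluster member.

    Assumption: a server counts as standby when it appears in the server
    management information but not among the current cluster members.
    """
    for server in table.get(region_id, []):
        if server not in active_servers:
            return server
    return None  # no standby server available (behavior assumed)


active = {"Server A", "Server B"}
print(select_standby("region-1", active))  # Server F
```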

Next, the processing performed by the management device 4 will be described.
As shown in FIG. 6, in step S1, the processing unit 42 of the management device 4 determines whether there is a cluster configuration change request. If Yes, the process proceeds to step S2; if No, it returns to step S1. The cluster configuration change request may be, for example, an input to that effect by the user of the management device 4 or the receipt of a request to that effect from another device.

In step S2, the processing unit 42 determines whether the cluster configuration change request is for the addition of a cluster member. If Yes, the process proceeds to step S3; if No, it proceeds to step S7.

In step S3, the processing unit 42 determines the insertion destination (ID) in the ID space of the cluster member to be added.

The insertion position may be specified by a maintenance operator through the input unit 43, or the processing unit 42 may calculate it by any method. The position chosen in the ID space may be arbitrary; what is essential is that a server 5 physically existing in the data center area (region) corresponding to the chosen position be selected. This makes it possible to select servers in consideration of the physical location of the server 5 and reduces the processing load generated during data replication.

For example, the processing unit 42 refers to the ID space management information 411 in the storage unit 41, searches for (identifies) the place in the ID space where the distance between cluster members (the distance in the ID space) is greatest, and determines the center position between those cluster members as the insertion destination of the ID of the new cluster member. Such a position is chosen because, in a distributed processing system with a cluster configuration, it is desirable for the load to be as even as possible across cluster members, and, statistically, when the number of data items is far larger than the number of cluster members, the number of data items handled by each cluster member is roughly proportional to the distance between cluster members in the ID space.
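One way to compute the center of the widest gap is sketched below. The function name, the integer ID representation, and the ID space size of 1000 used in the example are assumptions for illustration; the gap that wraps around the end of the circular ID space is included, and integer division is used because the text allows positions near the exact center.

```python
def widest_gap_midpoint(member_ids, id_space_size: int) -> int:
    """Return the ID at the center of the widest gap between members.

    member_ids are the IDs of the current cluster members on a circular
    ID space of the given size (both hypothetical parameters).
    """
    ids = sorted(set(member_ids))
    if not ids:
        raise ValueError("at least one cluster member is required")
    best_start, best_width = ids[0], 0
    # Pair each member with its clockwise neighbor, including the pair
    # that wraps around from the last member back to the first.
    for prev, nxt in zip(ids, ids[1:] + ids[:1]):
        # A width of 0 only occurs with a single member, whose gap is
        # the whole ID space.
        width = (nxt - prev) % id_space_size or id_space_size
        if width > best_width:
            best_start, best_width = prev, width
    return (best_start + best_width // 2) % id_space_size


# Members at 56, 172 and 310 on an assumed ID space of size 1000:
# the gaps are 116, 138 and 746 (wrapping from 310 back to 56).
print(widest_gap_midpoint([56, 172, 310], 1000))  # 683
```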

Subsequently, in step S4, the processing unit 42 refers to the ID/region allocation information 412 in the storage unit 41 and identifies the data center area (region ID) responsible for the determined ID.

Subsequently, in step S5, the processing unit 42 refers to the server management information 413 in the storage unit 41 and selects a server 5 that physically exists in that data center area (that is, a server corresponding to the region ID identified in step S4).

In step S6, the processing unit 42 updates the ID space management information 411 and ends the processing. Specifically, when steps S3 to S5 have been performed, a new row is inserted into the ID space management information 411, and the corresponding ID and the identifier of the server 5 to be added are written into it. When, for example, some cluster members are removed because of a failure or the like without steps S3 to S5 being performed, the corresponding rows are deleted from the ID space management information 411.
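Steps S4 to S6 can be sketched as follows. The table layouts here are assumptions made for illustration only: the ID/region allocation information 412 is modeled as a sorted list of (start ID, region ID) pairs, the server management information 413 as a dictionary from region ID to server records with a status field, and the ID space management information 411 as a list of (ID, server identifier) rows. All function and field names are hypothetical.

```python
from bisect import bisect_right


def region_for_id(allocation, node_id):
    """Step S4: find the region whose ID range contains node_id.

    Assumed layout of information 412: (start_id, region_id) pairs sorted
    by start_id, each covering the IDs from its start up to the next
    entry's start. An ID below the first start wraps to the last region.
    """
    starts = [start for start, _ in allocation]
    return allocation[bisect_right(starts, node_id) - 1][1]


def select_standby_server(server_info, region_id):
    """Step S5: pick a standby server physically located in the region."""
    for server in server_info.get(region_id, []):
        if server["status"] == "standby":
            return server
    return None


def add_member(id_space_info, allocation, server_info, insert_id):
    """Steps S4 to S6 for adding a member at insert_id."""
    region_id = region_for_id(allocation, insert_id)
    server = select_standby_server(server_info, region_id)
    if server is None:
        raise LookupError(f"no standby server in region {region_id}")
    # Step S6: insert a new row holding the ID and the server identifier.
    id_space_info.append((insert_id, server["name"]))
    id_space_info.sort()
    # Assumption (not stated in the text): mark the server as in use so
    # that it is not selected again.
    server["status"] = "active"
    return server["name"]


def remove_member(id_space_info, server_name):
    """Step S6 for removal: delete the rows of the removed member."""
    id_space_info[:] = [row for row in id_space_info if row[1] != server_name]
```

With regions starting at IDs 0, 400 and 700, an insertion position of 600 falls in the second region, so a standby server from that region is written into the table at ID 600.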

Thereafter, the load balancer 3 receives the latest ID space management information 411 from the management apparatus 4 and stores it in the storage unit 31 as the ID space management information 311. Based on the ID space management information 311, the load balancer 3 then uses consistent hashing to distribute each request received from a client machine 1 to one of the plurality of servers 5.
The plurality of servers 5 likewise receive the latest ID space management information 411 from the management apparatus 4, store it in a storage unit (not shown), and replicate the data as described above on the basis of consistent hashing.
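The distribution of requests by consistent hashing can be sketched as follows. This is a hedged illustration, not the embodiment itself: it assumes that a data item is handled by the first server encountered when walking the ring in the direction of increasing IDs, that the copy is held by the server after that one, and that keys are mapped to IDs with SHA-256. The ring size, the hash function and all names are hypothetical.

```python
import hashlib
from bisect import bisect_left

ID_SPACE_SIZE = 1000  # hypothetical size of the circular ID space


def key_to_id(key, id_space_size=ID_SPACE_SIZE):
    """Map a data key to a position in the ID space (assumed hash: SHA-256)."""
    digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % id_space_size


def route(id_space_info, data_id):
    """Return (server in charge, server holding the copy) for data_id.

    id_space_info is an assumed layout of the ID space management
    information: a list of (id, server identifier) rows.
    """
    members = sorted(id_space_info)
    ids = [member_id for member_id, _ in members]
    # First member at or after data_id, wrapping to the start of the ring.
    idx = bisect_left(ids, data_id) % len(members)
    owner = members[idx][1]
    replica = members[(idx + 1) % len(members)][1]
    return owner, replica
```

With servers at IDs 100, 300 and 900, data at ID 250 is routed to the server at 300 and copied to the server at 900, while data at ID 950 wraps around to the server at 100.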

In this way, according to the management apparatus 4 of the present embodiment, when a server 5 is added to the cluster 100, a server 5 that physically exists in the region corresponding to the position in the ID space determined for the added server 5 is selected. This reduces the processing load (network cost) generated during data replication and enables high-speed processing. The approach is particularly effective when data is updated or replicated frequently, or when the data to be replicated is large.

In addition, when the position in the ID space is determined for the server 5 to be added, selecting the midpoint of the widest gap between servers 5 in the ID space both reduces the processing load generated during data replication and averages the load across the servers 5 more evenly.
Furthermore, adopting the data center area as the region makes the cluster 100 and the management apparatus 4 easier to operate.

This concludes the description of the present embodiment, but aspects of the present invention are not limited to it.
For example, the management apparatus 4 and the load balancer 3 may be configured to coexist on the same hardware.
Although the present embodiment assumes consistent hashing, another method may be assumed instead.

Alternatively, the load balancer 3 may be omitted, and each client machine 1 may hold the ID space management information 411 received from the management apparatus 4 and directly access one of the plurality of servers 5 via the network 2.

Although the description has taken the data center area as the unit of a region, another unit may be adopted, such as a subdivision of a data center area or a prefecture.
The present invention can also be embodied as a program for causing a computer to function as the management apparatus 4.
In addition, the specific configuration may be changed as appropriate without departing from the gist of the present invention.

DESCRIPTION OF SYMBOLS
1 Client machine
2 Network
3 Load balancer
4 Management apparatus
5 Server
31 Storage unit
32 Processing unit
33 Communication unit
41 Storage unit
42 Processing unit
43 Input unit
44 Display unit
45 Communication unit
100 Cluster
311 ID space management information
411 ID space management information
412 ID/region allocation information
413 Server management information
1000 Distributed processing system

Claims

1. A management apparatus for a distributed processing system in which a plurality of data items to be managed, and a plurality of servers that manage the data and constitute a cluster, are allocated in a circular ID (IDentifier) space, and in which each of the servers manages the data located between itself and the next server in a predetermined direction around the ID space and stores a copy of the data located between that next server and the server after it in the predetermined direction, the management apparatus determining to which of the plurality of servers a request from a client machine is distributed, the management apparatus comprising:
a storage unit that stores ID space management information for managing the servers in charge of managing the data in the ID space; ID/region allocation information for managing, with the ID space divided according to a predetermined number of physical regions, the association between each region and the corresponding division of the ID space; and server management information for managing, for each region, the association between the region and the plurality of servers and standby servers physically present in the region; and
a processing unit that, when a standby server is added to the cluster, refers to the ID/region allocation information on the basis of the insertion position in the ID space for the server to be added and identifies the region corresponding to the insertion position, refers to the server management information and selects a server that physically exists in the identified region, and updates the ID space management information using the selected server and the insertion position.

2. A management apparatus for a distributed processing system in which a plurality of data items to be managed, and a plurality of servers that manage the data and constitute a cluster, are allocated in a circular ID (IDentifier) space, and in which each of the servers manages the data located between itself and the next server in a predetermined direction around the ID space and stores a copy of the data located between that next server and the server after it in the predetermined direction, the management apparatus determining to which of the plurality of servers a request from a client machine is distributed, the management apparatus comprising:
a storage unit that stores ID space management information for managing the servers in charge of managing the data in the ID space; ID/region allocation information for managing, with the ID space divided according to a predetermined number of physical regions, the association between each region and the corresponding division of the ID space; and server management information for managing, for each region, the association between the region and the plurality of servers and standby servers physically present in the region; and
a processing unit that, when a standby server is added to the cluster, refers to the ID space management information and identifies the widest gap between servers in the ID space, refers to the ID/region allocation information and identifies the region corresponding to the midpoint of that gap in the ID space, refers to the server management information and selects a server that physically exists in the identified region, and updates the ID space management information using the selected server and the position of the midpoint.

3. The management apparatus according to claim 1, wherein the region is a data center area managed by a data center that manages the servers.

4. A program for causing a computer to function as the management apparatus according to any one of claims 1 to 3.