JP2010182017A

JP2010182017A - Distributed computer system, manager succession method and manager succession program

Info

Publication number: JP2010182017A
Application number: JP2009023835A
Authority: JP
Inventors: Yoshifumi Kokado; 能史小角; Chie Konuma; 千絵小沼
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2009-02-04
Filing date: 2009-02-04
Publication date: 2010-08-19

Abstract

<P>PROBLEM TO BE SOLVED: To select an active-system management program suitable for a situation in which the status of a node or a network dynamically changes even in such a situation with a distributed computer system comprising a plurality of nodes. <P>SOLUTION: When failure occurs in an active-system manager 3, each node 1 constituting this distributed computer system selects a candidate of the active-system manager 3 from a plurality of standby-system managers 3, and transmits a selection result to the selected standby-system manager 3. Each standby-system manager 3 receives the selection result, decides whether or not itself becomes the active-system manager 3 based on the received selection result, and changes into the active-system manager 3 according to a decision result thereof. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、複数のノードで分散処理を行う分散計算機システムに係り、詳しくは、複数のノード（計算機）を管理するマネージャ（管理プログラム）に障害が発生した際、マネージャの引き継ぎを行う分散計算機システム、マネージャ引き継ぎ方法及びマネージャ引き継ぎプログラムに関する。 The present invention relates to a distributed computer system that performs distributed processing at a plurality of nodes, and more specifically, a distributed computer system that takes over a manager when a failure occurs in a manager (management program) that manages the plurality of nodes (computers). The present invention relates to a manager takeover method and a manager takeover program.

処理速度を向上させるために、行うべき処理を複数のノードに分散して実行させる分散計算機システムが知られている。この種の分散計算機システムでは、各ノードの負荷などを監視して、各ノードに適切な処理を割り当てたり、一のノードに障害が発生した場合に、そのノードで実行している処理を他のノードに代行させるなど、システム構成やシステム状態の変化に柔軟に対応することができる（例えば、特許文献１参照）。 In order to improve processing speed, a distributed computer system is known in which processing to be performed is distributed to a plurality of nodes and executed. In this type of distributed computer system, the load on each node is monitored, and appropriate processing is assigned to each node. If a failure occurs on one node, the processing executed on that node is It is possible to flexibly cope with changes in the system configuration and system state, for example, by delegating to a node (see, for example, Patent Document 1).

上記のような分散計算機システムにおいては、各ノードに適切な処理を割り当てるために、システム上にマネージャが１つ存在している。マネージャや、マネージャが存在するノードに障害が発生した場合は、マネージャが不在となり、システム全体が動作不能に陥るので、マネージャの処理を引き継ぐための待機系マネージャ（待機系マネージャに対し、現在管理を行っているマネージャを現用系マネージャと呼ぶ）を配置することがある。また、より高い信頼性を確保するために、待機系マネージャを複数用意する場合もある。その際には、現用系マネージャとなる順番を静的に予め決めておく方法が多く採られる。 In the distributed computer system as described above, one manager exists on the system in order to assign an appropriate process to each node. If a failure occurs in the manager or the node on which the manager exists, the manager will be absent and the entire system will become inoperable. Therefore, the standby manager for taking over the manager processing (currently managing the standby manager) (The manager in charge is called the working manager). In order to ensure higher reliability, a plurality of standby managers may be prepared. In that case, a method of statically predetermining the order of becoming active managers is often employed.

特開２０００−２０７３７４号公報JP 2000-207374 A

高い信頼性を確保するために、待機系マネージャを複数用意し、静的な順位に従って、現用系となるマネージャを決定する場合、以下に示すような問題点がある。 In order to ensure high reliability, when a plurality of standby managers are prepared and a manager to be an active manager is determined according to a static order, there are the following problems.

第１の問題点は、待機系マネージャが配置されているノード上に、その他の処理プログラムも動的に配置され、ノードの負荷が時々刻々と変化するような場合、必ずしも適切なマネージャが選ばれるとは限らない点である。
その理由は、現用系マネージャが不在となった時点で、負荷が高く重要な処理プログラムが動いているノードに配置されている待機系マネージャが、静的順位に従って現用系マネージャに選ばれる可能性があり、そうなった場合、そのノード上のマネージャが現用系マネージャとして作動し始めると、以前から動いていた処理プログラムが、充分な性能を出せない可能性があるからである。 The first problem is that other processing programs are also dynamically allocated on the node where the standby manager is allocated, and an appropriate manager is always selected when the load on the node changes from moment to moment. That is not necessarily the case.
The reason is that when the active manager is absent, the standby manager placed on the node where the important processing program with high load is running may be selected as the active manager according to the static order. In this case, if the manager on the node starts to operate as the active manager, the processing program that has been operating may not be able to produce sufficient performance.

第２の問題点は、モバイルネットワークやアドホックネットワーク環境においては、必ずしも適切なマネージャが選ばれるとは限らない点である。
その理由は、静的順位に従って現用系マネージャとなるよう選択された待機系マネージャが、他のノードからネットワーク的に遠い位置に存在していたり、他のノードとの間のネットワーク性能が悪かったりする場合があるからである。 The second problem is that an appropriate manager is not always selected in a mobile network or an ad hoc network environment.
The reason is that the standby manager selected to become the active manager according to the static order exists in a position far from the other nodes in the network, or the network performance with the other nodes is poor. Because there are cases.

本発明は、上記の事情にかんがみなされたものであり、複数のノードを管理するマネージャに障害が発生した際、マネージャの引き継ぎを行うにあたり、各ノードやネットワークの状態が動的に変化する状況であっても、状況に適した待機系マネージャを現用系マネージャとして選択し、処理プログラムの性能低下やノード間の通信効率の低下を回避することができる分散計算機システム、マネージャ引き継ぎ方法及びマネージャ引き継ぎプログラムの提供を目的とする。 The present invention has been considered in view of the above circumstances. When a manager that manages a plurality of nodes fails, the state of each node or network changes dynamically when the manager takes over. Even if there is a standby computer manager that is suitable for the situation, it can be selected as the active manager, so that the performance of the processing program and the communication efficiency between nodes can be avoided. For the purpose of provision.

上記目的を達成するため本発明の分散計算機システムは、分散計算機システムを構成する複数のノードには、これらの各ノードに処理を割り当てる現用系マネージャが備えられた一のノードと、前記現用系マネージャに障害が発生したとき、現用系マネージャとして動作可能な待機系マネージャが備えられた複数のノードと、が含まれ、前記各ノードは、前記現用系マネージャに障害が発生したとき、複数の前記待機系マネージャの中から現用系マネージャの候補を選択するマネージャ選択手段と、前記選択結果を選択した待機系マネージャに送信する選択結果送信手段と、を備え、前記各待機系マネージャは、前記選択結果を受信する選択結果受信手段と、受信した前記選択結果にもとづいて、自身が現用系マネージャになるか否かを判断する選択結果判断手段と、前記判断結果に応じて、現用系マネージャに変化する状態変更手段と、を備える構成としてある。 In order to achieve the above object, a distributed computer system according to the present invention comprises a plurality of nodes constituting a distributed computer system, one node provided with an active manager that assigns processing to each of these nodes, and the active manager And a plurality of nodes provided with standby managers operable as active managers when a failure occurs, wherein each node has a plurality of standbys when a failure occurs in the active manager Manager selection means for selecting a candidate for the active system manager from among the system managers, and selection result transmission means for transmitting the selection result to the selected standby system manager, wherein each standby system manager displays the selection result. Based on the selection result receiving means to be received and the received selection result, it is determined whether or not it is an active manager. A selection result determining means that, depending on the determination result, a structure comprising a state changing unit that changes the working system manager, a.

また、上記目的を達成するため本発明のマネージャ引き継ぎ方法は、分散計算機システムを構成する複数のノードには、これらの各ノードに処理を割り当てる現用系マネージャが備えられた一のノードと、前記現用系マネージャに障害が発生したとき、現用系マネージャとして動作可能な待機系マネージャが備えられた複数のノードと、が含まれ、前記各ノードは、前記現用系マネージャに障害が発生したとき、複数の前記待機系マネージャの中から現用系マネージャの候補を選択し、前記選択結果を選択した待機系マネージャに送信し、前記各待機系マネージャは、前記選択結果を受信し、受信した前記選択結果にもとづいて、自身が現用系マネージャになるか否かを判断し、前記判断結果に応じて、現用系マネージャに変化する方法としてある。 In order to achieve the above object, the manager takeover method of the present invention provides a plurality of nodes constituting a distributed computer system, wherein one node provided with an active manager that assigns processing to each of these nodes, and the active And a plurality of nodes provided with standby managers operable as active managers when a failure occurs in the system manager, and each node includes a plurality of nodes when a failure occurs in the active manager. A candidate for the active manager is selected from the standby managers, and the selection result is transmitted to the selected standby manager. Each standby manager receives the selection result and is based on the received selection result. And determine whether or not to become the active manager, and change to the active manager according to the determination result. A.

また、上記目的を達成するため本発明のマネージャ引き継ぎプログラムは、分散計算機システムを構成する複数のノードには、これらの各ノードに処理を割り当てる現用系マネージャが備えられた一のノードと、前記現用系マネージャに障害が発生したとき、現用系マネージャとして動作可能な待機系マネージャが備えられた複数のノードと、が含まれ、前記各ノード用のマネージャ引き継ぎプログラムは、前記現用系マネージャに障害が発生したとき、複数の前記待機系マネージャの中から現用系マネージャの候補を選択する処理と、前記選択結果を選択した待機系マネージャに送信する処理と、をコンピュータに実行させ、前記各待機系マネージャ用のマネージャ引き継ぎプログラムは、前記選択結果を受信する処理と、受信した前記選択結果にもとづいて、自身が現用系マネージャになるか否かを判断する処理と、前記判断結果に応じて、現用系マネージャに変化する処理と、をコンピュータに実行させる構成としてある。 In order to achieve the above object, the manager takeover program of the present invention provides a plurality of nodes constituting a distributed computer system, one node provided with an active manager for assigning processing to each of these nodes, and the active And a plurality of nodes with standby managers that can operate as active managers when a failure occurs in the active manager, and the manager takeover program for each node generates a fault in the active manager A process for selecting an active manager candidate from the plurality of standby managers and a process for transmitting the selection result to the selected standby manager, and for each standby manager. The manager takeover program for receiving the selection result and the received selection result. Based on the results, the treatment to which it is determined whether becomes active system manager, according to the determination result, a process of changing the current system manager, as configured to be executed by a computer.

本発明によれば、ノードの負荷が時々刻々と変化するような場合でも、その時点の状況に適した待機系マネージャを現用系マネージャとして選択できるとともに、モバイルネットワークやアドホックネットワーク環境においても、その時点の状況に適した待機系マネージャを現用系マネージャとして選択することができる。 According to the present invention, even when the load of a node changes from moment to moment, a standby manager suitable for the situation at that time can be selected as an active manager, and even in a mobile network or an ad hoc network environment, A standby manager suitable for the situation can be selected as the active manager.

本発明の実施形態に係る分散計算機システムの構成を示すブロック図である。It is a block diagram which shows the structure of the distributed computer system which concerns on embodiment of this invention. 本発明の実施形態に係る分散計算機システムにおいて、マネージャ選択装置の動作を示すフローチャートである。5 is a flowchart showing an operation of a manager selection device in the distributed computer system according to the embodiment of the present invention. 本発明の実施形態に係る分散計算機システムにおいて、マネージャの動作を示すフローチャートである。5 is a flowchart showing an operation of a manager in the distributed computer system according to the embodiment of the present invention. 本発明の実施形態に係る分散計算機システムの具体的な構成を示すブロック図である。It is a block diagram which shows the specific structure of the distributed computer system which concerns on embodiment of this invention. 本発明の実施形態に係る分散計算機システムにおいて、マネージャ選択装置の具体的な動作を示すフローチャートである。5 is a flowchart showing a specific operation of the manager selection device in the distributed computer system according to the embodiment of the present invention. 本発明の実施形態に係る分散計算機システムにおいて、マネージャの具体的な動作を示すフローチャートである。5 is a flowchart showing a specific operation of a manager in the distributed computer system according to the embodiment of the present invention. 本発明の実施形態に係る分散計算機システムの具体的なシステム構成を示すブロック図である。1 is a block diagram showing a specific system configuration of a distributed computer system according to an embodiment of the present invention. 本発明の実施形態に係る分散計算機システムにおいて、ノード情報記録部に記録される情報の具体例を示す説明図である。It is explanatory drawing which shows the specific example of the information recorded on a node information recording part in the distributed computer system which concerns on embodiment of this invention. 本発明の実施形態に係る分散計算機システムにおいて、選択結果記録部に記録される情報の具体例を示す説明図である。It is explanatory drawing which shows the specific example of the information recorded on the selection result recording part in the distributed computer system which concerns on embodiment of this invention.

以下、本発明の実施形態について、図面を参照して説明する。
［本発明の実施形態に係る分散計算機システムの概要］
まず、本発明の実施形態に係る分散計算機システムの概要について、図１〜図３を参照して説明する。
図１は、本発明の実施形態に係る分散計算機システムの構成を示すブロック図である。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.
[Outline of Distributed Computer System According to Embodiment of the Present Invention]
First, an overview of a distributed computer system according to an embodiment of the present invention will be described with reference to FIGS.
FIG. 1 is a block diagram showing a configuration of a distributed computer system according to an embodiment of the present invention.

この図に示すように、本発明の実施形態に係る分散計算機システムは、複数のノード１から構成されており、各ノード１には、マネージャ選択装置２と、必要に応じてマネージャ３が配置されている。
ここで、分散計算機システムを構成する複数のノード１には、これらの各ノード１に処理を割り当てる現用系マネージャ３が備えられた一のノード１と、現用系マネージャ３に障害が発生したとき、現用系マネージャ３として動作可能な待機系マネージャ３が備えられた複数のノード１と、マネージャ３を備えない一又は複数のノード１が含まれる。
なお、図１は、待機系マネージャ３が備えられた２つのノード１の構成を示している。 As shown in this figure, the distributed computer system according to the embodiment of the present invention is composed of a plurality of nodes 1, and each node 1 is provided with a manager selection device 2 and a manager 3 as necessary. ing.
Here, when a failure occurs in one node 1 provided with an active manager 3 that assigns processing to each of these nodes 1 and the active manager 3 in a plurality of nodes 1 constituting the distributed computer system, A plurality of nodes 1 provided with the standby manager 3 operable as the active manager 3 and one or a plurality of nodes 1 not including the manager 3 are included.
FIG. 1 shows a configuration of two nodes 1 provided with a standby manager 3.

マネージャ選択装置２は、各ノード用のマネージャ引き継ぎプログラムによってコンピュータ（計算機）上に構成される装置であって、マネージャ選択手段２１と、選択結果送信手段２２とを含む。
マネージャ選択手段２１は、現用系マネージャ３に障害が発生したとき、複数の待機系マネージャ３の中から現用系マネージャ３の候補を選択する。マネージャ選択手段２１によるマネージャ選択条件や選択手順は、各ノード１間で異なっていてもよい。
選択結果送信手段２２は、マネージャ選択手段２１が選択した結果を、選択された待機系マネージャ３に送信する。 The manager selection device 2 is a device configured on a computer (computer) by a manager takeover program for each node, and includes a manager selection means 21 and a selection result transmission means 22.
The manager selection means 21 selects a candidate for the active manager 3 from the plurality of standby managers 3 when a failure occurs in the active manager 3. Manager selection conditions and selection procedures by the manager selection means 21 may be different among the nodes 1.
The selection result transmission unit 22 transmits the result selected by the manager selection unit 21 to the selected standby manager 3.

マネージャ３は、管理プログラム及び待機系マネージャ用のマネージャ引き継ぎプログラムによってコンピュータ上に構成される装置であって、選択結果受信手段３１と、選択結果記録部３２と、選択結果判断手段３３と、状態変更手段３４を含む。
選択結果受信手段３１は、マネージャ選択装置２の選択結果送信手段２２から送信された選択結果を受信し、選択結果記録部３２に記録する。
選択結果記録部３２は、選択結果受信手段３１により受信された選択結果を記録する。
選択結果判断手段３３は、各マネージャ選択装置２からの選択結果の送信が済んだと判断された時点で、選択結果記録部３２に記録されている選択結果を読み出し、それに基づいて、自身が現用系マネージャ３になるか否かを判断する。選択結果判断手段３３の判断基準は、すべてのノード１で同一である必要がある。
状態変更手段３４は、選択結果判断手段３３が現用系マネージャになると判断した場合、現用系マネージャ３としての動作を開始させる。 The manager 3 is an apparatus configured on the computer by a management program and a manager takeover program for the standby manager, and includes a selection result receiving unit 31, a selection result recording unit 32, a selection result determining unit 33, and a state change Means 34 are included.
The selection result receiving unit 31 receives the selection result transmitted from the selection result transmitting unit 22 of the manager selection device 2 and records it in the selection result recording unit 32.
The selection result recording unit 32 records the selection result received by the selection result receiving unit 31.
The selection result determination means 33 reads the selection result recorded in the selection result recording unit 32 when it is determined that the transmission of the selection result from each manager selection device 2 has been completed. It is determined whether or not to become a system manager 3. The judgment criteria of the selection result judgment means 33 need to be the same for all nodes 1.
The state changing unit 34 starts the operation as the active manager 3 when the selection result determining unit 33 determines that it becomes the active manager.

つぎに、本発明の実施形態に係る分散計算機システムの動作について、図１〜図３を参照して説明する。
図２は、本発明の実施形態に係る分散計算機システムにおいて、マネージャ選択装置の動作を示すフローチャート、図３は、本発明の実施形態に係る分散計算機システムにおいて、マネージャの動作を示すフローチャートである。 Next, the operation of the distributed computer system according to the embodiment of the present invention will be described with reference to FIGS.
FIG. 2 is a flowchart showing the operation of the manager selection device in the distributed computer system according to the embodiment of the present invention. FIG. 3 is a flowchart showing the operation of the manager in the distributed computer system according to the embodiment of the present invention.

図２に示すように、各ノード１のマネージャ選択装置２は、現用系マネージャ３の不在を検知したタイミングで、あらかじめ決められた判断基準に従って、適切な待機系マネージャ３を選択し（ステップＡ１：マネージャ選択手段２１）、その選択結果を選択した待機系マネージャ３に送信する（ステップＡ２：選択結果送信手段２２）。 As shown in FIG. 2, the manager selection device 2 of each node 1 selects an appropriate standby manager 3 according to a predetermined criterion at the timing when the absence of the active manager 3 is detected (step A1: The manager selection means 21) transmits the selection result to the selected standby manager 3 (step A2: selection result transmission means 22).

一方、各ノード１の待機系マネージャ３は、図３に示すように、マネージャ選択装置２からの送信を待ち（ステップＢ１、Ｂ２）、送信があれば選択結果を受信し、選択結果記録部３２に記録する（ステップＢ３：選択結果受信手段３１）。
つぎに、全てのマネージャ選択装置２からの選択結果の送信が済んだか否かを判断し（ステップＢ４）、まだ済んでないと判断された場合は、ステップＢ１に戻り、他のマネージャ選択装置２からの送信を待つ。
ステップＢ４でマネージャ選択装置２からの送信が済んだと判断された場合、或いは、ステップＢ１及びＢ２で予め決められた待ち時間が過ぎたと判断された場合は、選択結果記録部３２から選択結果を読み出し（ステップＢ５）、その内容から自分が現用系になるべきか否かを判断する（ステップＢ６：選択結果判断手段３３）。
待機系マネージャ３は、現用系になるべきと判断した場合は、現用系マネージャとなって動作し（ステップＢ７：状態変更手段３４）、現用系になるべきでないと判断した場合は待機系としての動作を継続する（ステップＢ８）。 On the other hand, as shown in FIG. 3, the standby manager 3 of each node 1 waits for transmission from the manager selection device 2 (steps B1 and B2), and if there is transmission, receives the selection result and selects the selection result recording unit 32. (Step B3: Selection result receiving means 31).
Next, it is determined whether or not the transmission of the selection results from all the manager selection devices 2 has been completed (step B4). If it is determined that the selection results have not yet been transmitted, the process returns to step B1 and the other manager selection devices 2 Wait for transmission.
If it is determined in step B4 that the transmission from the manager selection device 2 has been completed, or if it is determined that the predetermined waiting time has passed in steps B1 and B2, the selection result is recorded from the selection result recording unit 32. Reading (step B5), it is determined from the contents whether or not it should become the active system (step B6: selection result determination means 33).
If the standby manager 3 determines that it should become the active system, it operates as the active manager (step B7: state change means 34), and if it determines that it should not become the active system, The operation is continued (step B8).

以上のように構成された本実施形態の分散計算機システムによれば、分散計算機システムを構成する複数のノード１には、これらの各ノード１に処理を割り当てる現用系マネージャ３が備えられた一のノード１と、現用系マネージャ３に障害が発生したとき、現用系マネージャ３として動作可能な待機系マネージャ３が備えられた複数のノード１と、が含まれ、各ノード１は、現用系マネージャ３に障害が発生したとき、複数の待機系マネージャ３の中から現用系マネージャ３の候補を選択し、選択結果を選択した待機系マネージャ３に送信し、各待機系マネージャ３は、選択結果を受信し、受信した選択結果にもとづいて、自身が現用系マネージャ３になるか否かを判断し、この判断結果に応じて、現用系マネージャ３に変化するので、静的に現用系マネージャ３になる順番を決定しておくシステムに比べて、その時点の状況に適した現用系マネージャ３を選択することができる。 According to the distributed computer system of the present embodiment configured as described above, the plurality of nodes 1 constituting the distributed computer system are provided with the active manager 3 that assigns processing to each of these nodes 1. A node 1 and a plurality of nodes 1 provided with a standby manager 3 that can operate as the active manager 3 when a failure occurs in the active manager 3. When a failure occurs, a candidate for the active manager 3 is selected from the plurality of standby managers 3 and the selection result is transmitted to the selected standby manager 3, and each standby manager 3 receives the selection result. Then, based on the received selection result, it is determined whether or not it becomes the active manager 3 and changes to the active manager 3 according to this determination result. Compared to the system to keep determines the order to be active system manager 3, it is possible to select the active system manager 3 which is suitable for conditions at that time.

［本発明の実施形態に係る分散計算機システムの詳細］
つぎに、本発明の実施形態に係る分散計算機システムの具体的な構成及び動作について、図４〜図９を参照して説明する。
図４は、本発明の実施形態に係る分散計算機システムの具体的な構成を示すブロック図である。 [Details of Distributed Computer System According to Embodiment of the Present Invention]
Next, a specific configuration and operation of the distributed computer system according to the embodiment of the present invention will be described with reference to FIGS.
FIG. 4 is a block diagram showing a specific configuration of the distributed computer system according to the embodiment of the present invention.

この図に示す分散計算機システムは、図１に示した分散計算機システムに比べ、待機系マネージャ３内にノード情報送信手段３５及び調停手段３６が加わり、また、マネージャ選択装置２内にノード情報受信手段２３及びノード情報記録部２４が加わっている。 Compared with the distributed computer system shown in FIG. 1, the distributed computer system shown in this figure includes node information transmission means 35 and arbitration means 36 in the standby manager 3, and node information reception means in the manager selection device 2. 23 and a node information recording unit 24 are added.

ノード情報送信手段３５は、マネージャ選択手段２１が待機系マネージャ３を選択するための判断材料となるノード情報を各ノード１のノード情報受信手段２３に送信し、ノード情報受信手段２３は、受信したノード情報をノード情報記録部２４に記録する（図４の破線矢印参照）。
調停手段３６は、待機系マネージャ３の総数に比して、選択結果送信手段２２から送信された得票数が所定の比率を下回り、現用系マネージャ３となるかどうかを判断できない場合に、現用系マネージャ３に変化させる一の待機系マネージャ３を決定するための手段であり、例えば、他の待機系マネージャ３の調停手段３６に依頼して得票数の情報を送信してもらい、自身より多くの得票数を得た待機系マネージャ３が存在するかどうかを判断する。 The node information transmitting unit 35 transmits node information that is used as a judgment material for the manager selecting unit 21 to select the standby manager 3, to the node information receiving unit 23 of each node 1, and the node information receiving unit 23 receives the received node information. Node information is recorded in the node information recording unit 24 (see broken line arrows in FIG. 4).
The arbitration unit 36 determines that the number of votes transmitted from the selection result transmission unit 22 is less than a predetermined ratio compared to the total number of standby system managers 3 and cannot determine whether or not to become the active system manager 3. It is a means for determining one standby manager 3 to be changed to the manager 3, for example, asks the arbitration means 36 of another standby manager 3 to send information on the number of votes, and more It is determined whether or not there is a standby manager 3 that has obtained the number of votes.

つぎに、本発明の実施形態に係る分散計算機システムの具体的な動作について、図４〜図９を参照して説明する。
図５は、本発明の実施形態に係る分散計算機システムにおいて、マネージャ選択装置の具体的な動作を示すフローチャート、図６は、本発明の実施形態に係る分散計算機システムにおいて、マネージャの具体的な動作を示すフローチャート、図７は、本発明の実施形態に係る分散計算機システムの具体的なシステム構成を示すブロック図、図８は、本発明の実施形態に係る分散計算機システムにおいて、ノード情報記録部に記録される情報の具体例を示す説明図、図９は、本発明の実施形態に係る分散計算機システムにおいて、選択結果記録部に記録される情報の具体例を示す説明図である。
なお、図６は、本実施形態におけるマネージャ３の動作のうち、図３のステップＢ６に該当する部分を具体的に示したフローチャートである。 Next, specific operations of the distributed computer system according to the embodiment of the present invention will be described with reference to FIGS.
FIG. 5 is a flowchart showing a specific operation of the manager selection device in the distributed computer system according to the embodiment of the present invention. FIG. 6 is a specific operation of the manager in the distributed computer system according to the embodiment of the present invention. FIG. 7 is a block diagram showing a specific system configuration of the distributed computer system according to the embodiment of the present invention, and FIG. 8 is a node information recording unit in the distributed computer system according to the embodiment of the present invention. FIG. 9 is an explanatory diagram showing a specific example of information recorded in the selection result recording unit in the distributed computer system according to the embodiment of the present invention.
FIG. 6 is a flowchart specifically showing a portion corresponding to step B6 in FIG. 3 in the operation of the manager 3 in the present embodiment.

図７に示す分散計算機システムは、４つのノードＸ１１〜Ｘ１４を持っている。各ノードＸ１１〜Ｘ１４には、図４に示したマネージャ選択装置２とマネージャ３とが配置されており、ノードＸ１１に配置されたマネージャ３が現用系として動作しているとする。ノードＸ１１の現用系マネージャ３は、ハートビートパケットを各ノードＸ１２〜Ｘ１４に定期的に送信している。これにより、各ノードＸ１２〜Ｘ１４は、ノードＸ１１のマネージャ３が現用系マネージャ３として正常に稼動していることを認知することができる。 The distributed computer system shown in FIG. 7 has four nodes X11 to X14. In each of the nodes X11 to X14, the manager selection device 2 and the manager 3 shown in FIG. 4 are arranged, and it is assumed that the manager 3 arranged in the node X11 operates as an active system. The active manager 3 of the node X11 periodically transmits heartbeat packets to the nodes X12 to X14. Thereby, each of the nodes X12 to X14 can recognize that the manager 3 of the node X11 is operating normally as the active manager 3.

ノードＸ１２〜Ｘ１４の各待機系マネージャ３のノード情報送信手段３５は、例えば５秒ごとに、定期的にノード情報として空きメモリ容量と空きディスク容量の情報を各ノードＸ１２〜Ｘ１４のノード情報受信手段２３に送信している。
各マネージャ選択装置２のノード情報受信手段２３は、各待機系マネージャ３からノード情報として受信した空きメモリ容量と空きディスク容量を、図８に示すような形式でノード情報記録部２４に記録する。
ノードＸ１１のノードに障害が発生して、一定期間ハートビートパケットが受信されなくなると、各ノードＸ１２〜Ｘ１４のマネージャ選択装置２は、現用系マネージャ３の不在を検知する。現用系マネージャ３の不在が検知されると、各マネージャ選択装置２のマネージャ選択手段２１は、ノード情報記録部２４に記録されたノード情報を読み出す（図５のステップＡ０）。 The node information transmission means 35 of each standby manager 3 of the nodes X12 to X14 periodically sends the information on the free memory capacity and the free disk capacity as node information every 5 seconds, for example, as the node information receiving means of each node X12 to X14. 23.
The node information receiving means 23 of each manager selecting device 2 records the free memory capacity and free disk capacity received as node information from each standby manager 3 in the node information recording unit 24 in the format shown in FIG.
When a failure occurs in the node X11 and the heartbeat packet is not received for a certain period, the manager selection device 2 of each of the nodes X12 to X14 detects the absence of the active manager 3. When the absence of the active manager 3 is detected, the manager selection means 21 of each manager selection device 2 reads the node information recorded in the node information recording unit 24 (step A0 in FIG. 5).

ノードＸ１２、Ｘ１３のマネージャ選択手段２１は、ステップＡ１のマネージャ選択の手順として、例えば、空きメモリ容量の大きさをもって現用系マネージャ３を選択する手順を実装しているとする。その結果、ノードＸ１２、Ｘ１３のマネージャ選択手段２１は、ノードＸ１２の待機系マネージャ３を選択し、その選択結果を選択結果送信手段２２からノードＸ１２の待機系マネージャ３に送信する（ステップＡ２）。
ノードＸ１４のマネージャ選択手段２１は、ステップＡ１のマネージャ選択の手順として、例えば、空きディスク容量の大きさをもって現用系マネージャ３を選択する手順を実装しているとする。その結果、ノードＸ１４のマネージャ選択手段２１は、ノードＸ１４の待機系マネージャ３を選択し、その選択結果を、選択結果送信手段２２からノードＸ１４の待機系マネージャ３に送信する（ステップＡ２）。 It is assumed that the manager selection means 21 of the nodes X12 and X13 implements, for example, a procedure for selecting the active manager 3 with a free memory capacity as the manager selection procedure in step A1. As a result, the manager selection means 21 of the nodes X12 and X13 selects the standby manager 3 of the node X12, and transmits the selection result from the selection result transmission means 22 to the standby manager 3 of the node X12 (step A2).
Assume that the manager selection means 21 of the node X14 implements, for example, a procedure for selecting the active manager 3 with a free disk capacity as the manager selection procedure in step A1. As a result, the manager selection unit 21 of the node X14 selects the standby manager 3 of the node X14, and transmits the selection result from the selection result transmission unit 22 to the standby manager 3 of the node X14 (step A2).

各待機系マネージャ３の選択結果受信手段３１は、自身が受信した選択結果の数（得票数と呼ぶ）を選択結果記録部３２に記録する。各ノードＸ１２〜Ｘ１４の待機系マネージャ３に記録される得票数は図９のようになる。 The selection result receiving means 31 of each standby manager 3 records the number of selection results received by itself (referred to as the number of votes) in the selection result recording unit 32. The number of votes recorded in the standby manager 3 of each of the nodes X12 to X14 is as shown in FIG.

ノードＸ１２の待機系マネージャ３の選択結果判断手段３３は、選択結果記録部３２から「２」という情報を読み出し（ステップＢ５）、マネージャ選択装置３の総数「３」に対して過半数の数の得票を得たと判断して（ステップＢ６１）、現用系マネージャ３として動作を開始する（ステップＢ７）。 The selection result judging means 33 of the standby manager 3 of the node X12 reads information “2” from the selection result recording unit 32 (step B5), and obtains a majority of votes with respect to the total number “3” of the manager selection devices 3. (Step B61), the operation is started as the active manager 3 (step B7).

ノードＸ１３の待機系マネージャ３の選択結果判断手段３３は、選択結果記録部３２から「０」という情報を読み出す（ステップＢ５）。これにより、自身より得票数の多い待機系マネージャ３が存在すると判断し（ステップＢ６２）、待機系マネージャ３としての動作を継続する（ステップＢ８）。 The selection result determination unit 33 of the standby manager 3 of the node X13 reads information “0” from the selection result recording unit 32 (step B5). As a result, it is determined that there is a standby manager 3 with more votes than itself (step B62), and the operation as the standby manager 3 is continued (step B8).

ノードＸ１４の待機系マネージャ３の選択結果判断手段３３は、選択結果記録部３２から「１」という情報を読み出す（ステップＢ５）。この情報だけでは、自身より得票数の多い待機系マネージャ３が存在するか否かを判断できないので、調停手段３６が、ノードＸ１２、Ｘ１３の待機系マネージャ３に得票数を教えてくれるよう依頼する（ステップＢ６３）。その結果、ノードＸ１２が自身より得票数が多いことを判断し（ステップＢ６４）、待機系マネージャ３としての動作を継続する（ステップＢ８）。 The selection result judging means 33 of the standby manager 3 of the node X14 reads information “1” from the selection result recording unit 32 (step B5). With this information alone, it cannot be determined whether or not there is a standby manager 3 with more votes than itself, so the arbitrating means 36 requests the standby manager 3 of the nodes X12 and X13 to tell the number of votes. (Step B63). As a result, it is determined that the node X12 has more votes than itself (step B64), and the operation as the standby manager 3 is continued (step B8).

以上のように構成された本実施形態の分散計算機システムによれば、各待機系マネージャ３は、当該待機系マネージャ３が備えられたノード１の情報を各ノード１に送信し、各ノード１は、ノード情報を受信し、受信したノード情報をノード情報記録部２４に記録し、マネージャ選択手段２１は、ノード情報記録部２４に記録されたノード情報にもとづいて、複数の待機系マネージャ３の中から現用系マネージャ３の候補を選択するので、待機系マネージャ３が備えられた各ノード１の情報にもとづいて、適切な現用系マネージャの候補を選択することができる。 According to the distributed computer system of the present embodiment configured as described above, each standby manager 3 transmits information of the node 1 provided with the standby manager 3 to each node 1, and each node 1 The node information is received, the received node information is recorded in the node information recording unit 24, and the manager selection unit 21 determines whether the plurality of standby managers 3 are based on the node information recorded in the node information recording unit 24. Therefore, the candidate for the active manager 3 can be selected based on the information of each node 1 in which the standby manager 3 is provided.

また、各待機系マネージャ３は、受信した選択結果の数が待機系マネージャ３の総数に比して所定の比率を下回る場合に、現用系マネージャ３に変化させる一の待機系マネージャ３を決定するための調停手段３６を備えるので、いかなる状況においても、一の待機系マネージャ３を確実に選出し、現用系マネージャ３が不在の状態や、現用系マネージャ３が複数存在する状態の発生を防止することができる。 Each standby manager 3 determines one standby manager 3 to be changed to the active manager 3 when the number of received selection results is below a predetermined ratio compared to the total number of standby managers 3. Therefore, in any situation, one standby system manager 3 is reliably selected to prevent the occurrence of a state where the active system manager 3 is absent or a state where a plurality of active system managers 3 exist. be able to.

また、本実施形態の調停手段３６は、受信した選択結果の数が待機系マネージャ３の総数に比して所定の比率を下回る場合に、他の待機系マネージャ３に対し、他の待機系マネージャ３が受信した選択結果の数を問い合わせ、自身よりも受信した選択結果の数が多い待機系マネージャ３が存在しないとき、自身が現用系マネージャ３になると判断するので、待機系マネージャ３間のやり取りだけで、一の待機系マネージャ３を確実に選出することができる。 In addition, the arbitrating unit 36 of the present embodiment, when the number of received selection results is less than a predetermined ratio compared to the total number of standby managers 3, the other standby managers 3 3 inquires about the number of selection results received, and when there is no standby manager 3 having more selection results received than itself, it is determined that the manager 3 becomes the active manager 3, so the exchange between the standby managers 3 Only one standby manager 3 can be selected with certainty.

［第二実施形態］
つぎに、本発明の第二実施形態に係る分散計算機システムについて説明する。
第二実施形態の分散計算機システムは、調停手段３６が、受信した前記選択結果の数が待機系マネージャ３の総数に比して所定の比率を下回る場合に、各ノード１に現用系マネージャ３の候補を再度選択させ、その選択結果を選択した待機系マネージャ３に送信させる点が前記実施形態と相違している。 [Second Embodiment]
Next, a distributed computer system according to the second embodiment of the present invention will be described.
In the distributed computer system according to the second embodiment, when the number of the selection results received by the arbitrating unit 36 is lower than a predetermined ratio compared to the total number of standby managers 3, each node 1 has an active manager 3. The point that the candidates are selected again and the selection result is transmitted to the selected standby manager 3 is different from the above embodiment.

例えば、前記実施形態では、選択結果判断手段３３において、最も得票数の多い待機系マネージャ３が現用系となるべきと判断していたが、過半数の得票を得たもののみが現用系になるというルールを規定し、その際、過半数を得る待機系マネージャ３が存在しなかった場合は、調停手段３６において、各マネージャ選択装置２に再投票を依頼することとする。そして、過半数の得票を得た待機系マネージャ３を現用系マネージャ３に変化させる。 For example, in the above-described embodiment, the selection result determination unit 33 determines that the standby system manager 3 with the largest number of votes should be the active system, but only those who have obtained a majority of the votes will be the active system. When a rule is defined and there is no standby system manager 3 that obtains a majority, the arbitration means 36 requests each manager selection device 2 to vote again. Then, the standby manager 3 that has obtained a majority vote is changed to the active manager 3.

以上のように構成された第二実施形態の分散計算機システムによれば、調停手段３６が、受信した前記選択結果の数が待機系マネージャ３の総数に比して所定の比率を下回る場合に、各ノード１に現用系マネージャ３の候補を再度選択させ、その選択結果を選択した待機系マネージャ３に送信させるので、状況に適した待機系マネージャ３を厳選することができる。 According to the distributed computer system of the second embodiment configured as described above, when the arbitration unit 36 receives the number of selection results below a predetermined ratio as compared to the total number of standby managers 3, Since each node 1 is made to select a candidate for the active manager 3 again and the selection result is transmitted to the selected standby manager 3, the standby manager 3 suitable for the situation can be selected carefully.

［第三実施形態］
つぎに、本発明の第三実施形態に係る分散計算機システムについて説明する。
第三実施形態の分散計算機システムは、選択結果送信手段２２が、前記選択結果をすべての待機系マネージャ３に送信し、選択結果判断手段３３が、受信した前記選択結果を集計し、自身の選択された数が最大である場合に、自身が現用系マネージャ３になると判断する点が前記実施形態と相違している。 [Third embodiment]
Next, a distributed computer system according to the third embodiment of the present invention will be described.
In the distributed computer system according to the third embodiment, the selection result transmitting unit 22 transmits the selection result to all the standby managers 3, and the selection result determining unit 33 totals the received selection results and selects itself. This is different from the above-described embodiment in that it determines that it becomes the active manager 3 when the number obtained is the maximum.

このように構成された第三実施形態の分散計算機システムによれば、マネージャ選択装置２の選択結果送信手段２２が、選択した待機系マネージャ３にのみ選択結果を送信するのではなく、すべての待機系マネージャ３に選択結果を送信し、待機系マネージャ３の選択結果判断手段３３が、受信した選択結果を集計し、自身の選択された数が最大である場合に、自身が現用系マネージャ３になると判断するので、ネットワークトラヒックは増えるものの、自身の得票数が待機系マネージャ３の総数に比して所定の比率を下回る場合であっても、現用系マネージャ３になるべきか否かを判断することができ、その結果、調停手段３６により他の待機系マネージャ３に得票数情報を問い合わせるステップ（ステップＢ６４）を省くことができる。 According to the distributed computer system of the third embodiment configured as described above, the selection result transmission unit 22 of the manager selection device 2 does not transmit the selection result only to the selected standby manager 3, but all the standbys. The selection result is transmitted to the system manager 3, and the selection result determination means 33 of the standby system manager 3 totals the received selection results. Therefore, although the network traffic increases, it is determined whether or not it should become the active manager 3 even when its own number of votes falls below a predetermined ratio compared to the total number of standby managers 3. As a result, it is possible to omit the step (step B64) of inquiring other standby managers 3 for the number-of-voting information by the arbitrating means 36.

［第四実施形態］
つぎに、本発明の第四実施形態に係る分散計算機システムについて説明する。
第四実施形態の分散計算機システムは、マネージャ選択手段２１が、各待機系マネージャ３の負荷、各待機系マネージャ３までのネットワークのホップ数、及び／又は、各待機系マネージャ３までのネットワークのトラヒック量に応じて、現用系マネージャ３の候補を選択する点が前記実施形態と相違している。 [Fourth embodiment]
Next, a distributed computer system according to the fourth embodiment of the present invention will be described.
In the distributed computer system of the fourth embodiment, the manager selection means 21 is configured such that the load on each standby manager 3, the number of network hops to each standby manager 3, and / or the network traffic to each standby manager 3. The point which selects the candidate of the active system manager 3 according to quantity differs from the said embodiment.

例えば、前記実施形態では、ノード情報送信手段３４が、ノード情報として空きメモリ容量と空きディスク容量とを送信していたが、空きメモリ容量や空きディスク容量以外のノード情報として、ＣＰＵの使用率や送信時刻を送信する。送信時刻をノード情報として送信すると、マネージャ選択装置２のノード情報受信手段２３は、ノード情報の受信時刻との差分からノード情報が到達するのにかかった時間をノード情報記録部２４に記録することができるので、マネージャ選択手段２１において待機系マネージャ３を選択するための情報として使用することができる。 For example, in the embodiment, the node information transmission unit 34 transmits the free memory capacity and the free disk capacity as the node information. However, as the node information other than the free memory capacity and the free disk capacity, the CPU usage rate and Send the transmission time. When the transmission time is transmitted as the node information, the node information receiving unit 23 of the manager selection device 2 records the time taken for the node information to reach from the difference from the reception time of the node information in the node information recording unit 24. Therefore, it can be used as information for selecting the standby manager 3 in the manager selection means 21.

また、前記実施形態では、ノード情報記録部２４に記録する情報として、ノード情報送信手段３５により送信されたノード情報をそのまま記録していたが、その他に、送信パケットから情報を取得し、記録することができる。送信パケットから取得可能な情報としては、例えば、ノード１の間のホップ数やネットワークの回線の太さなどがある。このようにすると、マネージャ選択手段２１は、各待機系マネージャ３の負荷、各待機系マネージャ３までのネットワークのホップ数、各待機系マネージャ３までのネットワークのトラヒック量などに応じて、現用系マネージャの候補を選択することが可能になる。 In the above embodiment, the node information transmitted by the node information transmission unit 35 is recorded as it is as the information to be recorded in the node information recording unit 24. In addition, the information is acquired from the transmission packet and recorded. be able to. Examples of information that can be acquired from the transmission packet include the number of hops between the nodes 1 and the network line thickness. In this way, the manager selection means 21 selects the active manager according to the load of each standby manager 3, the number of network hops to each standby manager 3, the amount of network traffic to each standby manager 3, and the like. Can be selected.

以上のように構成された第四実施形態の分散計算機システムは、マネージャ選択手段２１が、各待機系マネージャ３の負荷、各待機系マネージャ３までのネットワークのホップ数、及び／又は、各待機系マネージャ３までのネットワークのトラヒック量に応じて、現用系マネージャ３の候補を選択するので、各ノード１やネットワークの状態が動的に変化する状況であっても、状況に適した待機系マネージャ３を現用系マネージャ３として選択することができる。 In the distributed computer system of the fourth embodiment configured as described above, the manager selection means 21 is configured so that the load on each standby system manager 3, the number of network hops to each standby system manager 3, and / or each standby system. Since the candidate of the active manager 3 is selected according to the traffic volume of the network up to the manager 3, even if the status of each node 1 or the network changes dynamically, the standby manager 3 suitable for the situation Can be selected as the working manager 3.

以上説明したように、本発明に係る実施形態によれば、以下に示すような効果が得られる。
第１の効果は、ノードの負荷が時々刻々と変化するような場合でも、その時点の状況に適した待機系マネージャを現用系マネージャとして選択できることにある。
その理由は、待機系マネージャが配置された各ノードの負荷情報に従って、各マネージャ選択手段が適切と判断する待機系マネージャを選択すれば、その選択結果により、その時点で最も負荷の軽いノードに配置された待機系マネージャが現用系マネージャとして選択されるためである。 As described above, according to the embodiment of the present invention, the following effects can be obtained.
The first effect is that even when the load on the node changes from moment to moment, a standby manager suitable for the situation at that time can be selected as the active manager.
The reason for this is that if a standby manager that each manager selection means determines to be appropriate is selected according to the load information of each node where the standby manager is placed, it will be placed on the lightest node at that time according to the selection result. This is because the selected standby manager is selected as the active manager.

第２の効果は、モバイルネットワークやアドホックネットワーク環境においても、その時点の状況に適した待機系マネージャを現用系マネージャとして選択できることにある。
その理由は、待機系マネージャが配置された各ノードまでの距離やネットワーク性能に従って、各マネージャ選択手段が適切と判断する待機系マネージャを選択すれば、その選択結果により、その時点で最もネットワーク効率のよいノードに配置された待機系マネージャが現用系マネージャとして選択されるためである。 The second effect is that, even in a mobile network or an ad hoc network environment, a standby manager suitable for the situation at that time can be selected as an active manager.
The reason is that if the standby manager that each manager selection unit determines to be appropriate is selected according to the distance to each node where the standby manager is placed and the network performance, the most efficient network efficiency at that time can be obtained. This is because the standby manager placed in a good node is selected as the active manager.

以上、本発明の分散計算機システムについて、実施形態を示して説明したが、本発明に係る分散計算機システムは、上述した実施形態にのみ限定されるものではなく、特許請求の範囲内で種々の変更が可能であることは言うまでもない。
例えば、前記実施形態では、図７に示すように、各ノード１にマネージャ３を配置していたが、信頼性とノード１の負荷とのバランスを考慮し、一部のノード１にのみマネージャ３を配置するシステム構成であってもよい。 The distributed computer system of the present invention has been described with reference to the embodiment. However, the distributed computer system according to the present invention is not limited to the above-described embodiment, and various modifications can be made within the scope of the claims. It goes without saying that is possible.
For example, in the embodiment, as shown in FIG. 7, the manager 3 is arranged in each node 1. However, in consideration of the balance between the reliability and the load of the node 1, the manager 3 is provided only in some nodes 1. May be a system configuration.

また、前記実施形態では、ノード情報送信手段３４がノード情報を送信するタイミングとして、５秒毎としていたが、必ずしも５秒である必要はなく、情報の正確さとネットワークやノードの負荷とのバランスで任意の時間を決めてよい。また、ノード情報は必ずしも定期的に送る必要はなく、現用系マネージャ３の不在を検知した時点でノード情報の送信を開始する方法を採ることも可能である。その際には、図５のステップＡ０でマネージャ選択手段２１がノード情報を読み出す際に、すべてのノード情報の受信が完了するのを待つ手順が必要となる。 In the above embodiment, the node information transmission unit 34 transmits the node information every 5 seconds. However, it is not necessarily 5 seconds. The balance between the accuracy of the information and the load of the network and the node is not necessarily required. You may decide any time. Further, it is not always necessary to send the node information regularly, and it is also possible to adopt a method of starting transmission of the node information when the absence of the active manager 3 is detected. In that case, when the manager selection means 21 reads node information in step A0 of FIG. 5, a procedure for waiting for completion of reception of all node information is required.

また、前記実施形態では、ノード情報記録部２４にノード情報を記録する方法として、最新のノード情報のみを記録していたが、過去一定期間、或いは一定個数の情報を履歴として記録してもよい。
また、前記実施形態では、マネージャ選択手段２１が１つの待機系マネージャ３のみを選択していたが、必ずしも１つである必要はなく、複数選択するようにしてもよい。この場合には、選択した待機系マネージャ３にすべて同じ得票数を与えるのではなく、適切度を表す点数をつけるようにしてもよい。 In the embodiment, only the latest node information is recorded as a method for recording the node information in the node information recording unit 24. However, a certain number of pieces of information or a certain number of information may be recorded as a history. .
In the above-described embodiment, the manager selection unit 21 selects only one standby manager 3, but it is not always necessary to select one, and a plurality may be selected. In this case, instead of giving the same number of votes to all the selected standby managers 3, a score representing the appropriateness may be given.

また、前記実施形態では、マネージャ選択手段２１における選択方法として、１つの情報のみで決定していたが、１つの情報に限る必要はなく、各情報に重み付けして足し合わせるなど、複数の情報を組み合わせて判断することも可能である。また、過去の履歴情報も併せて記録する場合には、過去の一定期間、或いは一定個数の情報の平均値を用いて判断するようにしてもよい。また、過去の履歴情報において変動が少ないものを選択する方法もある。 In the embodiment, the selection method in the manager selection means 21 is determined by only one information. However, the selection is not limited to one information, and a plurality of information such as weighting and adding each information is added. It is also possible to judge in combination. When past history information is also recorded, it may be determined using a past fixed period or an average value of a certain number of pieces of information. There is also a method of selecting past history information that has little variation.

また、前記実施形態では、マネージャ選択装置２からの得票数において、同数１位の待機系マネージャ３が複数存在した場合の解決方法について示していないが、解決方法としては、調停手段３６において、静的に決められた順位表を予め保持しておき、それに従う方法や、ＩＰアドレスの若い順など、一意に機械的に決まる方法を用いることができる。 In the above-described embodiment, a solution when there are a plurality of standby managers 3 of the same number in the number of votes obtained from the manager selection device 2 is not shown. It is possible to use a method in which a uniquely determined ranking table is stored in advance, and a method that uniquely determines the table, such as a method according to the table or a sequence in which IP addresses are younger.

本発明は、複数のノードで分散処理を行う分散計算機システムで利用することができる。特に、複数のノードを管理するマネージャに障害が発生した際、マネージャの引き継ぎを自動的に行うことにより、高い信頼性を確保することが求められる分散計算機システムに好適に用いることができる。 The present invention can be used in a distributed computer system that performs distributed processing with a plurality of nodes. In particular, when a failure occurs in a manager that manages a plurality of nodes, it can be suitably used in a distributed computer system that is required to ensure high reliability by automatically taking over the manager.

１ノード
２マネージャ選択装置
３マネージャ（待機系マネージャ、現用系マネージャ）
２１マネージャ選択手段
２２選択結果送信手段
２３ノード情報受信手段
２４ノード情報記録部
３１選択結果受信手段
３２選択結果記録部
３３選択結果判断手段
３４状態変更手段
３５ノード情報送信手段
３６調停手段 1 Node 2 Manager selection device 3 Manager (standby manager, active manager)
21 Manager selection unit 22 Selection result transmission unit 23 Node information reception unit 24 Node information recording unit 31 Selection result reception unit 32 Selection result recording unit 33 Selection result determination unit 34 Status change unit 35 Node information transmission unit 36 Arbitration unit

Claims

The multiple nodes that make up the distributed computer system
One node with an active manager that assigns processing to each of these nodes;
A plurality of nodes provided with standby managers operable as active managers when a failure occurs in the active managers, and
Each of the nodes
Manager selection means for selecting a candidate for the active manager from the plurality of standby managers when a failure occurs in the active manager;
Selection result transmission means for transmitting the selection result to the selected standby manager,
Each of the standby managers
Selection result receiving means for receiving the selection result;
A selection result judging means for judging whether or not itself becomes an active manager based on the received selection result;
A distributed computer system, comprising: state change means that changes to an active manager according to the determination result.

Each of the standby managers
Node information transmission means for transmitting information of the node provided with the standby manager to each of the nodes;
Each of the nodes
Node information receiving means for receiving the node information;
A node information recording unit for recording the received node information,
The manager selecting means includes
The distributed computer system according to claim 1, wherein a candidate for the active manager is selected from the plurality of standby managers based on the node information recorded in the node information recording unit.

Each of the standby managers
Arrangement means for determining one standby manager to be changed to an active manager when the number of received selection results is below a predetermined ratio compared to the total number of standby managers is provided. The distributed computer system according to claim 1 or 2.

The mediation means is
When the number of the selection results received is lower than a predetermined ratio compared to the total number of standby managers, the other standby managers are inquired about the number of selection results received by the other standby managers. The distributed computer system according to claim 3, wherein when there is no standby manager that receives a larger number of selection results than the standby manager, the distributed computer system is determined to be the active manager.

The mediation means is
When the number of received selection results falls below a predetermined ratio compared to the total number of standby managers, the nodes are selected again as candidates for the active manager, and the selection results are transmitted to the selected standby manager. The distributed computer system according to claim 3, wherein:

The selection result transmission means includes:
Send the selection result to all the standby managers;
The selection result judging means includes
3. The distributed computer system according to claim 1, wherein the received selection results are totaled, and when the selected number is the maximum, it is determined that the selected result becomes an active manager. 4.

The manager selecting means includes
A candidate for the active manager is selected according to the load of each standby manager, the number of network hops to each standby manager, and / or the amount of network traffic to each standby manager. The distributed computer system according to claim 1.

The multiple nodes that make up the distributed computer system
One node with an active manager that assigns processing to each of these nodes;
A plurality of nodes provided with standby managers operable as active managers when a failure occurs in the active managers, and
Each of the nodes
When a failure occurs in the active manager, a candidate for the active manager is selected from the plurality of standby managers,
Send the selection result to the selected standby manager,
Each of the standby managers
Receiving the selection result;
Based on the received selection result, it is determined whether or not it becomes an active manager,
A manager takeover method, wherein the manager is changed to an active manager according to the determination result.

Each of the standby managers
Send information of the node provided with the standby manager to each of the nodes,
Each of the nodes
Receiving the node information;
Record the received node information in a node information recording unit,
The manager takeover method according to claim 8, wherein a candidate for the active manager is selected from the plurality of standby managers based on the node information recorded in the node information recording unit.

The multiple nodes that make up the distributed computer system
One node with an active manager that assigns processing to each of these nodes;
A plurality of nodes provided with standby managers operable as active managers when a failure occurs in the active managers, and
The manager takeover program for each node is:
When a failure occurs in the active manager, a process of selecting a candidate for the active manager from the plurality of standby managers;
Processing to send the selection result to the selected standby manager, and causing the computer to execute,
The manager takeover program for each standby manager is:
Receiving the selection result;
Based on the received selection result, a process for determining whether or not it becomes an active manager;
A manager takeover program that causes a computer to execute processing that changes to an active manager in accordance with the determination result.

The manager takeover program for each standby manager is:
Causing the computer to execute node information transmission processing for transmitting information on the node provided with the standby manager to each of the nodes;
The manager takeover program for each node is:
Processing to receive the node information;
A process of recording the received node information in a node information recording unit;
The computer is configured to execute a process of selecting an active manager candidate from the plurality of standby managers based on the node information recorded in the node information recording unit. Manager takeover program.