JP4435142B2

JP4435142B2 - Cluster system and synchronization method

Info

Publication number: JP4435142B2
Application number: JP2006325033A
Authority: JP
Inventors: 雅田中; 哲也飯沼; 茂夫大道; 真也安藤; 卓也熊谷
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2006-11-30
Filing date: 2006-11-30
Publication date: 2010-03-17
Anticipated expiration: 2026-11-30
Also published as: JP2008140079A

Description

本発明は、稼動系計算機で障害が発生した場合に、当該稼動系計算機で開始されているサービスを待機系計算機が引き継ぐクラスタシステムに係り、特に稼動系計算機及び待機系計算機の各々がデータ記憶手段を有するクラスタシステムにおいて、当該データ記憶手段のデータを同期化するのに好適なクラスタシステム及び同期化方法に関する。 The present invention relates to a cluster system in which a standby computer takes over a service started by an active computer when a failure occurs in the active computer. The present invention relates to a cluster system and a synchronization method suitable for synchronizing data stored in the data storage means.

従来から、システムの可用性を高める計算機システムとして、同一のサービスを提供可能な複数の計算機を含むクラスタシステムが知られている。この種のクラスタシステムでは、現在サービスを提供している計算機に障害が発生した場合、他の正常な計算機が当該サービスを開始する。このサービスの引き継ぎにより、サービスが使用不能になる時間を最小限にすることができる。 2. Description of the Related Art Conventionally, a cluster system including a plurality of computers capable of providing the same service is known as a computer system that increases system availability. In this type of cluster system, when a failure occurs in a computer that currently provides a service, another normal computer starts the service. By taking over this service, the time during which the service becomes unavailable can be minimized.

障害が発生した計算機から他の計算機がサービスを引き継ぐためには、各計算機がサービスを続行する上で必要となるデータを共有する必要がある。このデータ共有方式の１つとして、クラスタシステムに含まれる全ての計算機（ノード）が同じデータを個々に保持する分散共有方式が知られている。 In order for another computer to take over the service from the computer in which the failure has occurred, it is necessary for each computer to share data necessary for continuing the service. As one of the data sharing methods, a distributed sharing method is known in which all computers (nodes) included in a cluster system individually hold the same data.

分散共有方式を適用するクラスタシステムでは、あるノードにおいてデータの更新が発生した際には、当該ノードからクラスタシステム内の他のノードにネットワークを介して更新情報が送信される。この更新情報により、クラスタシステム内の他のノードにおいても上記あるノードでのデータ更新と同一のデータ更新が行われる。これにより、システム内の全てのノードが保持するデータの一致化（同期化）が図られる。このような、システム内の全てのノードが保持するデータを同期化させる仕組みは、例えば特許文献１に記載されている。
特開２０００−２７６４２０（段落００２６） In a cluster system to which the distributed sharing method is applied, when data update occurs in a certain node, update information is transmitted from the node to other nodes in the cluster system via the network. With this update information, the same data update as the data update at a certain node is performed at other nodes in the cluster system. Thereby, matching (synchronization) of data held by all the nodes in the system is achieved. Such a mechanism for synchronizing the data held by all the nodes in the system is described in, for example, Patent Document 1.
JP 2000-276420 (paragraph 0026)

クラスタシステムでは、サービスを提供しているノード（計算機）の障害に備えて待機しているノード（待機系ノード）が存在する。待機系ノードが障害の発生により停止すると、その停止期間に他のノードで発生するデータ更新は当該待機系ノードに反映されない。このため、待機系ノードは、障害から復帰した場合に、他のノードからデータを送信してもらうことにより自身が保持するデータを他のノードのデータに一致化させる同期化処理を行う必要がある。この同期化処理が完了するまでの間は、たとえサービスを提供しているノード（稼動系ノード）に障害が発生しても、待機系ノードは当該待機系ノードが保持しているデータを利用できず当該サービスを引き継ぐ状態には至らない。 In a cluster system, there is a node (standby node) that is waiting for a failure of a node (computer) that provides a service. When the standby node stops due to the occurrence of a failure, data updates that occur in other nodes during the stop period are not reflected on the standby node. For this reason, when the standby node recovers from the failure, it is necessary to perform a synchronization process for matching the data held by the standby node with the data of the other node by having the data transmitted from the other node. . Until this synchronization process is completed, even if a failure occurs in the node providing the service (active node), the standby node can use the data held by the standby node. Therefore, the service will not be taken over.

本発明は上記事情を考慮してなされたものでその目的は、待機系計算機の障害からの復帰時に行われる同期化処理が完了する前でも、当該待機系計算機が稼動系計算機からサービスを引き継ぐことが可能となるクラスタシステム及び同期化処理方法を提供することにある。 The present invention has been made in consideration of the above circumstances, and its purpose is that the standby computer takes over the service from the active computer even before the synchronization processing performed at the time of recovery from the failure of the standby computer is completed. It is an object of the present invention to provide a cluster system and a synchronization processing method.

本発明の１つの観点によれば、第１及び第２の計算機のいずれか１つが所定のサービスの提供に関して稼動系計算機として機能すると共に、前記第１及び第２の計算機の残りが待機系計算機として機能し、前記稼動系計算機で障害が発生した場合に、当該稼動系計算機で開始されているサービスを前記待機系計算機が引き継ぐクラスタシステムが提供される。このクラスタシステムの第１及び第２の計算機の各々は、データを記憶するデータ記憶手段と、前記稼動系計算機として機能している状態でデータ更新が発生した前記データ記憶手段の領域の情報を、前記待機系計算機でのデータ更新に未反映の領域を表す更新反映待ち情報として記憶する更新反映待ち情報記憶手段と、前記発生したデータ更新を前記待機系計算機に反映させるための更新要求情報を当該待機系計算機に送信する更新要求情報送信手段と、前記待機系計算機として機能している状態で前記稼動系計算機で発生したデータ更新が当該待機系計算機に反映されていない領域の情報を未更新領域情報として記憶する未更新領域情報記憶手段と、前記待機系計算機として機能している状態で前記稼動系計算機の前記更新要求情報送信手段から送信された更新要求情報を受信して、当該稼動系計算機で発生したデータ更新を当該更新要求情報に基づいて前記待機系計算機の前記データ記憶手段に反映させて、当該データ更新が反映された領域の更新領域情報を当該稼動系計算機に通知すると共に、当該データ更新が反映された領域に対応する未更新領域情報が前記待機系計算機の前記未更新領域情報記憶手段に記憶されている場合には当該未更新領域情報を削除する更新要求情報受信手段と、前記更新要求情報に対して前記待機系計算機の前記更新要求情報受信手段から前記稼動系計算機に更新領域情報が通知された場合、当該更新領域情報の示す領域に対応する更新反映待ち情報を当該稼動系計算機の前記更新反映待ち情報記憶手段から削除する更新反映待ち手段と、前記待機系計算機として機能している状態で障害が発生して当該待機系計算機が一旦動作を停止し、その後障害から復旧した場合に、前記稼動系計算機に対して同期化処理の開始を要求して、当該稼動系計算機の前記更新反映待ち情報記憶手段に記憶されている更新反映待ち情報を当該稼動系計算機から取得して、当該更新反映待ち情報を未更新領域情報として前記待機系計算機の前記未更新領域情報記憶手段に格納する組み込み要求手段と、前記稼動系計算機として機能している状態で前記待機系計算機の前記組み込み要求手段から同期化処理の開始が要求された場合、当該稼動系計算機の前記更新反映待ち情報記憶手段に記憶されている更新反映待ち情報を当該組み込み要求手段に送信すると共に、当該更新反映待ち情報によって表される領域のデータを当該稼動系計算機の前記データ記憶手段から読み出し、当該データにより前記待機系計算機の前記データ記憶手段の対応する領域のデータを更新させるための同期化処理に必要な更新要求情報を当該待機系計算機の前記更新要求情報受信手段に送信する組込要求受理手段とを具備する。 According to one aspect of the present invention, one of the first and second computers functions as an active computer with respect to providing a predetermined service, and the rest of the first and second computers are standby computers. When a failure occurs in the active computer, a cluster system is provided in which the standby computer takes over the service started by the active computer. Each of the first and second computers of the cluster system includes data storage means for storing data, and information on the area of the data storage means in which data update has occurred while functioning as the active computer. Update reflection wait information storage means for storing as update reflection wait information representing an area not reflected in data update in the standby computer, and update request information for reflecting the generated data update in the standby computer Update request information transmitting means for transmitting to the standby computer, and information on an area in which data update generated in the active computer is not reflected in the standby computer in a state of functioning as the standby computer is an unupdated area Unupdated area information storage means for storing information, and transmission of the update request information of the active computer in a state of functioning as the standby computer The update request information transmitted from the stage is received, the data update generated in the active computer is reflected in the data storage means of the standby computer based on the update request information, and the data update is reflected The updated area information of the updated area is notified to the active computer, and the unupdated area information corresponding to the area in which the data update is reflected is stored in the unupdated area information storage unit of the standby computer When the update area information is notified from the update request information receiving means for deleting the unupdated area information to the active computer from the update request information receiving means of the standby computer with respect to the update request information, Update reflection waiting means for deleting update reflection waiting information corresponding to the area indicated by the update area information from the update reflection waiting information storage means of the active computer; and When a failure occurs when the computer is functioning as a computer, the standby computer temporarily stops operating, and then recovers from the failure, requesting the active computer to start synchronization processing The update reflection wait information stored in the update reflection wait information storage means of the active computer is acquired from the active computer, and the update reflection wait information is used as unupdated area information to determine whether the standby computer When the start of synchronization processing is requested from the built-in request unit stored in the update area information storage unit and the built-in request unit of the standby computer in a state of functioning as the active system computer, The update reflection waiting information stored in the update reflection waiting information storage means is transmitted to the incorporation request means, and the area represented by the update reflection waiting information Is read from the data storage unit of the active computer, and update request information necessary for synchronization processing for updating the data in the corresponding area of the data storage unit of the standby computer is updated with the data. Embedded request receiving means for transmitting to the update request information receiving means of the system computer.

本発明によれば、待機系計算機での同期化処理の開始に際して、稼動系計算機から取得される更新反映待ち領域の情報（更新反映待ち情報）が未更新領域情報として未更新領域情報記憶手段に記憶される。このため待機系計算機は、同期化処理が完了する前に稼動系計算機で障害が発生した場合でも、未更新領域情報記憶手段に記憶されている未更新領域情報に基づいて未更新領域を確認できるため、当該稼動系計算機で開始されていたサービスを速やかに引き継ぐことが可能となる。 According to the present invention, at the start of the synchronization processing in the standby computer, the update reflection waiting area information (update reflection waiting information) acquired from the active computer is stored in the unupdated area information storage means as unupdated area information. Remembered. Therefore, the standby computer can check the unupdated area based on the non-updated area information stored in the non-updated area information storage unit even if a failure occurs in the active system before the synchronization processing is completed. Therefore, it is possible to quickly take over the service that has been started on the active computer.

以下、本発明の実施の形態につき図面を参照して説明する。
［第１の実施形態］
図１は本発明の第１の実施形態に係るクラスタシステムの構成を示すブロック図である。このクラスタシステムは、例えば２台の計算機（ノード）１０-1及び１０-2から構成されるものとする。なお、クラスタシステムが３台以上の計算機（ノード）から構成されていても構わない。ノード１０-1及び１０-2は、ネットワークのような通信路１１によって相互接続されている。 Embodiments of the present invention will be described below with reference to the drawings.
[First Embodiment]
FIG. 1 is a block diagram showing a configuration of a cluster system according to the first embodiment of the present invention. This cluster system is composed of, for example, two computers (nodes) 10-1 and 10-2. Note that the cluster system may be composed of three or more computers (nodes). The nodes 10-1 and 10-2 are interconnected by a communication path 11 such as a network.

図１の状態では、あるサービスの提供に関して、ノード１０-1が稼動系として機能し、ノード１０-2が待機系として機能しているものとする。この状態において、別のサービスに関して、ノード１０-2が稼動系として機能し、ノード１０-1が待機系として機能することもある。但し、以下の説明では、便宜的にノード１０-1が稼動系ノードであり、ノード１０-2が待機系ノードであるとする。なお図１では、サービスの提供を受けるクライアント端末は省略されている。 In the state of FIG. 1, it is assumed that the node 10-1 functions as an active system and the node 10-2 functions as a standby system for providing a certain service. In this state, for another service, the node 10-2 may function as an active system, and the node 10-1 may function as a standby system. However, in the following description, for the sake of convenience, it is assumed that the node 10-1 is an active node and the node 10-2 is a standby node. In FIG. 1, a client terminal that receives service provision is omitted.

ノード１０-1及び１０-2は、それぞれ、クラスタ制御部１００-1及び１００-2を有する。クラスタ制御部１００-1及び１００-2は、ノード１０-1及び１０-2の一方（１つ）が稼動系となって、他方（残り）が待機系となるように、通信路１１を介して互いに合意をとりながら動作する。クラスタ制御部１００-1及び１００-2は、障害検知部１０１-1及び１０１-2を有する。障害検知部１０１-1及び１０１-2は、通信路１１を介して互いに通信を行うことにより、それぞれノード１０-2及び１０-1の障害を検知する。 The nodes 10-1 and 10-2 have cluster control units 100-1 and 100-2, respectively. The cluster control units 100-1 and 100-2 are connected via the communication path 11 so that one (one) of the nodes 10-1 and 10-2 is an active system and the other (remaining) is a standby system. Work with mutual agreement. The cluster control units 100-1 and 100-2 include failure detection units 101-1 and 101-2. The failure detection units 101-1 and 101-2 detect the failure of the nodes 10-2 and 10-1 by communicating with each other via the communication path 11, respectively.

ノード１０-1及び１０-2は、それぞれ、データ記憶部１０２-1及び１０２-2と、更新反映待ち情報記憶部１０３-1及び１０３-2と、未更新領域情報記憶部１０４-1及び１０４-2とを有する。本実施形態において、データ記憶部１０２-1及び１０２-2は、それぞれハードディスクドライブのようなディスク記憶装置上に確保される。一方、更新反映待ち情報記憶部１０３-1及び１０３-2と、未更新領域情報記憶部１０４-1及び１０４-2とは、それぞれ書き換え可能な不揮発性メモリ内に確保される。なお、更新反映待ち情報記憶部１０３-1及び１０３-2と、未更新領域情報記憶部１０４-1及び１０４-2とが、それぞれディスク記憶装置上に確保されても構わない。 The nodes 10-1 and 10-2 include data storage units 102-1 and 102-2, update reflection waiting information storage units 103-1 and 103-2, and unupdated area information storage units 104-1 and 104, respectively. -2. In the present embodiment, the data storage units 102-1 and 102-2 are each secured on a disk storage device such as a hard disk drive. On the other hand, the update reflection waiting information storage units 103-1 and 103-2 and the non-updated area information storage units 104-1 and 104-2 are secured in rewritable nonvolatile memories, respectively. The update reflection waiting information storage units 103-1 and 103-2 and the unupdated area information storage units 104-1 and 104-2 may be secured on the disk storage device, respectively.

データ記憶部１０２-1及び１０２-2は、稼動系が提供するサービス１０５からの要求によって更新／参照されるデータを記憶する。データ記憶部１０２-1及び１０２-2の記憶領域は、複数の一定サイズの領域（ブロック、部分領域）に分割して管理される。各領域には、連続する番号であるアドレスが割り当てられる。図１では、作図の都合上、データ記憶部１０２-1及び１０２-2の記憶領域が９つの領域に分割して管理される例が示されている。この９つの領域には、図１に示されるように、それぞれアドレス１乃至９が割り当てられているものとする。 The data storage units 102-1 and 102-2 store data that is updated / referenced by a request from the service 105 provided by the active system. The storage areas of the data storage units 102-1 and 102-2 are managed by being divided into a plurality of areas (blocks, partial areas) of a certain size. Each area is assigned an address which is a continuous number. FIG. 1 shows an example in which the storage areas of the data storage units 102-1 and 102-2 are divided into nine areas and managed for convenience of drawing. Assume that addresses 9 to 9 are assigned to these nine areas, as shown in FIG.

ノード１０-1が稼動系である図１の例では、当該ノード１０-1でサービス１０５が開始されている。サービス１０５は、データの変更（更新）が必要な場合には更新要求情報１０６を発行し、データの参照が必要な場合には参照要求情報１０７を発行する。 In the example of FIG. 1 in which the node 10-1 is an active system, the service 105 is started on the node 10-1. The service 105 issues update request information 106 when data change (update) is necessary, and issues reference request information 107 when data reference is necessary.

図２は更新要求情報１０６及び参照要求情報１０７のデータ構造例を示す。ここでは更新要求情報１０６は、更新領域情報と更新データとから構成される。更新領域情報は、データ記憶部１０２-1内の更新の対象となる領域範囲を指定する。本実施形態において更新領域情報は、更新の対象となる領域範囲の先頭領域のアドレス（開始アドレス）及び最終領域のアドレス（終了アドレス）から構成される。なお、更新領域情報が、開始アドレス及び領域数から構成されていても構わない。一方、参照要求情報１０７は、参照領域情報から構成される。参照領域情報は、データ記憶部１０２-1内の参照の対象となる領域範囲を指定する。本実施形態において参照領域情報は、参照の対象となる領域範囲の開始アドレス及び終了アドレスから構成される。参照領域情報が、開始アドレス及び領域数から構成されていても構わない。 FIG. 2 shows an example of the data structure of the update request information 106 and the reference request information 107. Here, the update request information 106 includes update area information and update data. The update area information specifies an area range to be updated in the data storage unit 102-1. In the present embodiment, the update area information is composed of the address (start address) of the start area and the address (end address) of the end area of the area range to be updated. Note that the update area information may be composed of a start address and the number of areas. On the other hand, the reference request information 107 includes reference area information. The reference area information specifies an area range to be referred to in the data storage unit 102-1. In the present embodiment, the reference area information includes a start address and an end address of an area range to be referred to. The reference area information may be composed of a start address and the number of areas.

更新反映待ち情報記憶部１０３-1及び１０３-2は、キュー構造の更新反映待ち情報（更新反映待ち情報キュー）を記憶する。更新反映待ち情報キューは、更新要求情報中の更新領域情報の待ち行列である。ここでは、ノード１０-1または１０-2が稼動系として機能している状態で、当該ノード１０-1または１０-2で更新要求情報に基づいてデータ記憶部１０２-1または１０２-2の領域に対するデータ更新が発生した場合に、その領域の情報（更新領域情報）が、待機系ノードでのデータ更新に未反映の領域を表す更新反映待ち情報として、更新反映待ち情報記憶部１０３-1及び１０３-2内の更新反映待ち情報キューにつながれる（追加される）。未更新領域情報記憶部１０４-1及び１０４-2は、未更新の領域を示す情報（未更新領域情報）を記憶する。 The update reflection wait information storage units 103-1 and 103-2 store update reflection wait information (update reflection wait information queue) having a queue structure. The update reflection wait information queue is a queue of update area information in the update request information. Here, in a state where the node 10-1 or 10-2 functions as an active system, the area of the data storage unit 102-1 or 102-2 based on the update request information in the node 10-1 or 10-2. When a data update occurs for the update update wait information storage unit 103-1 and the update area information (update area information) as update reflection wait information indicating an area not yet reflected in the data update in the standby node. It is connected (added) to the update reflection waiting information queue in 103-2. The unupdated area information storage units 104-1 and 104-2 store information indicating unupdated areas (unupdated area information).

ノード１０-1は、データ要求監視部１０８を有する。データ要求監視部１０８はノード１０-1が稼動系の場合に、サービス１０５によって発行される更新要求情報１０６及び参照要求情報１０７を監視する。データ要求監視部１０８は、更新要求情報１０６を検知したときは、当該更新要求情報１０６に従ってデータ記憶部１０２-1内の該当する領域のデータを更新し、当該更新要求情報１０６を後述する更新要求情報送信部１０９-1に渡す。なお、図１では省略されているが、ノード１０-2には、データ要求監視部１０８に相当するデータ要求監視部が設けられている。但し、ノード１０-2のデータ要求監視部は、当該ノード１０-2が待機系の状態では動作しない。 The node 10-1 has a data request monitoring unit 108. The data request monitoring unit 108 monitors the update request information 106 and the reference request information 107 issued by the service 105 when the node 10-1 is active. When the data request monitoring unit 108 detects the update request information 106, the data request monitoring unit 108 updates the data in the corresponding area in the data storage unit 102-1 according to the update request information 106, and updates the update request information 106, which will be described later. The information is sent to the information transmission unit 109-1. Although omitted in FIG. 1, a data request monitoring unit corresponding to the data request monitoring unit 108 is provided in the node 10-2. However, the data request monitoring unit of the node 10-2 does not operate when the node 10-2 is in a standby state.

ノード１０-1及び１０-2は、それぞれ、更新要求情報送信部１０９-1及び１０９-2と、更新要求情報受信部１１０-1及び１１０-2と、更新反映待ち部１１１-1及び１１１-2と、組込要求部１１２-1及び１１２-2と、組込要求受理部１１３-1及び１１３-2とを有する。 The nodes 10-1 and 10-2 respectively include update request information transmitting units 109-1 and 109-2, update request information receiving units 110-1 and 110-2, and update reflection waiting units 111-1 and 111-. 2, incorporation request units 112-1 and 112-2, and incorporation request reception units 113-1 and 113-2.

ノード１０-1の更新要求情報送信部１０９-1は、データ要求監視部１０８から更新要求情報１０６を渡された場合、当該更新要求情報１０６に含まれている更新領域情報を更新反映待ち情報記憶部１０３-1に追加する。更新要求情報送信部１０９-1は、ノード１０-1以外の更新要求情報受信部（ここでは、ノード１０-2の更新要求情報受信部１１０-2）に、更新要求情報１０６を送信する。 When receiving the update request information 106 from the data request monitoring unit 108, the update request information transmitting unit 109-1 of the node 10-1 stores the update area information included in the update request information 106 in the update reflection waiting information storage. Added to the section 103-1. The update request information transmitting unit 109-1 transmits the update request information 106 to an update request information receiving unit other than the node 10-1 (here, the update request information receiving unit 110-2 of the node 10-2).

更新要求情報送信部１０９-1は、更新反映待ち情報記憶部１０３-1に格納されている更新反映待ち情報が組込要求受理部１１３-1によってノード１０-2の組込要求部１１２-2に送信された場合、当該更新反映待ち情報の示す領域のデータ（更新済みデータ）をデータ記憶部１０２-1から取得する。更新要求情報送信部１０９-1は、更新反映待ち情報（更新領域情報）と取得されたデータとを含む更新要求情報を生成して、当該更新要求情報をノード１０-2の更新要求情報受信部１１０-2に送信する。 The update request information transmission unit 109-1 receives the update reflection wait information stored in the update reflection wait information storage unit 103-1 from the incorporation request reception unit 113-1 and includes the integration request unit 112-2 of the node 10-2. In this case, the data (updated data) in the area indicated by the update reflection waiting information is acquired from the data storage unit 102-1. The update request information transmitting unit 109-1 generates update request information including update reflection waiting information (update area information) and the acquired data, and the update request information is received by the update request information receiving unit of the node 10-2 To 110-2.

ノード１０-2の更新要求情報送信部１０９-2も、当該ノード１０-2が稼動系ノードの場合に、上述の更新要求情報送信部１０９-1と同様の動作を行う。 The update request information transmitting unit 109-2 of the node 10-2 performs the same operation as the above-described update request information transmitting unit 109-1 when the node 10-2 is an active node.

ノード１０-2の更新要求情報受信部１１０-2は、ノード１０-1の更新要求情報送信部１０９-1から更新要求情報が送信された場合に当該更新要求情報を受信して、データ記憶部１０２-1に格納されているデータを当該更新要求情報に従って更新する。更新要求情報受信部１１０-2は、受信された更新要求情報に含まれている更新領域情報と同一の未更新領域情報が未更新領域情報記憶部１０４-2に格納されている場合に、当該未更新領域情報を未更新領域情報記憶部１０４-2から削除する。更新要求情報受信部１１０-2は更新要求情報に従うデータ更新の完了後に、当該更新要求情報に含まれている更新領域情報をノード１０-1の更新反映待ち部１１１-1に通知する。ノード１０-1の更新要求情報受信部１１０-1も、当該ノード１０-1が待機系ノードであって、且つノード１０-2の更新要求情報送信部１０９-2から更新要求情報が送信された場合に、上述の更新要求情報受信部１１０-2と同様の動作を行う。 The update request information receiving unit 110-2 of the node 10-2 receives the update request information when the update request information is transmitted from the update request information transmitting unit 109-1 of the node 10-1, and the data storage unit The data stored in 102-1 is updated according to the update request information. The update request information receiving unit 110-2, when the non-updated area information identical to the update area information included in the received update request information is stored in the unupdated area information storage unit 104-2, Unupdated area information is deleted from the unupdated area information storage unit 104-2. After completing the data update according to the update request information, the update request information receiving unit 110-2 notifies the update reflection waiting unit 111-1 of the node 10-1 of the update area information included in the update request information. The update request information receiving unit 110-1 of the node 10-1 also receives the update request information from the update request information transmitting unit 109-2 of the node 10-2 when the node 10-1 is a standby node. In this case, the same operation as the update request information receiving unit 110-2 described above is performed.

ノード１０-2の組込要求部１１２-2は、ノード１０-2が待機系として動作している状態で当該ノード１０-2（つまり待機系ノード）に障害が発生して当該ノード１０-2が停止し、その後障害から復旧して再度動作を開始して同期化処理が開始される際に動作する。組込要求部１１２-2は、ノード１０-2が障害から復旧した待機系ノードである場合に、稼動系ノード１０-1の組込要求受理部１１３-1に同期化処理の開始を要求する。組込要求部１１２-2は、ノード１０-1の組込要求受理部１１３-1から同期化処理の開始要求に対する応答として更新反映待ち情報を受け取った場合に、当該更新反映待ち情報を未更新領域情報記憶部１０４-2に反映する。ノード１０-1の組込要求部１１２-1も、当該ノード１０-1が待機系ノードであって、且つ当該ノード１０-1に障害が発生して当該ノード１０-1が停止し、その後障害から復旧して再度動作を開始して同期化処理が開始される際に、上述の組込要求部１１２-2と同様の動作を行う。 The integration request unit 112-2 of the node 10-2 causes a failure in the node 10-2 (that is, the standby node) while the node 10-2 is operating as a standby system, and the node 10-2 Stops, then recovers from the failure and starts again to start the synchronization process. When the node 10-2 is a standby node that has recovered from a failure, the integration request unit 112-2 requests the integration request reception unit 113-1 of the active node 10-1 to start the synchronization process. . When the update request wait information is received as a response to the synchronization processing start request from the request to receive synchronization 113-1 of the node 10-1, the update request wait information 112-2 is not updated. This is reflected in the area information storage unit 104-2. The integration request unit 112-1 of the node 10-1 also has a failure in the node 10-1 when the node 10-1 is a standby node, and the node 10-1 is stopped. When the synchronization processing is started after the operation is resumed and the operation is started again, the same operation as the above-described incorporation request unit 112-2 is performed.

ノード１０-1の組込要求受理部１１３-1は、ノード１０-2の組込要求部１１２-2から同期化処理の開始が要求された場合、更新反映待ち情報記憶部１０３-1に格納されている更新反映待ち情報を当該組込要求部１１２-2に送信する。このとき更新要求情報送信部１０９-1は、更新反映待ち情報記憶部１０３-1に格納されている更新反映待ち情報を先頭から順に取り出し、更新反映待ち情報の示す領域のデータ（更新済みデータ）をデータ記憶部１０２-1から取得する。更新要求情報送信部１０９-1は、更新反映待ち情報（更新領域情報）と取得されたデータとを含む更新要求情報を生成して、当該更新要求情報を更新要求情報受信部１１０-2に送信する。更新要求情報送信部１０９-1は、たとえノード１０-1のサービス１０５が停止しても、更新反映待ち情報記憶部１０３-1に情報が格納されている限り動作する。ノード１０-2の組込要求受理部１１３-2も、当該ノード１０-2が稼動系ノードであって、且つノード１０-1の組込要求部１１２-1から同期化処理の開始が要求された場合に、上述の組込要求受理部１１３-1と同様の動作を行う。この点は、ノード１０-2の更新要求情報送信部１０９-2についても同様である。 The integration request reception unit 113-1 of the node 10-1 stores in the update reflection waiting information storage unit 103-1 when the synchronization request is started from the integration request unit 112-2 of the node 10-2. The updated reflection waiting information is transmitted to the incorporation request unit 112-2. At this time, the update request information transmission unit 109-1 sequentially extracts the update reflection wait information stored in the update reflection wait information storage unit 103-1 from the head, and data in the area indicated by the update reflection wait information (updated data) Is acquired from the data storage unit 102-1. The update request information transmitting unit 109-1 generates update request information including update reflection waiting information (update region information) and the acquired data, and transmits the update request information to the update request information receiving unit 110-2. To do. Even if the service 105 of the node 10-1 is stopped, the update request information transmitting unit 109-1 operates as long as information is stored in the update reflection waiting information storage unit 103-1. The incorporation request receiving unit 113-2 of the node 10-2 is also requested to start the synchronization process from the integration request unit 112-1 of the node 10-1 when the node 10-2 is an active node. In the case, the same operation as the above-described built-in request receiving unit 113-1 is performed. The same applies to the update request information transmission unit 109-2 of the node 10-2.

次に、図１のクラスタシステムにおける動作について、図３のフローチャートを参照して説明する。
まず、サービス１０５の実行に関して、ノード１０-1が稼動系ノードとして機能し、ノード１０-2が待機系ノードとして機能しているものとする。つまりノード１０-1において、サービス１０５が開始されているものとする。 Next, the operation of the cluster system of FIG. 1 will be described with reference to the flowchart of FIG.
First, regarding execution of the service 105, it is assumed that the node 10-1 functions as an active node and the node 10-2 functions as a standby node. In other words, it is assumed that the service 105 is started in the node 10-1.

ノード１０-1のデータ要求監視部１０８は、サービス１０５がデータ記憶部１０２-1に格納されているデータを更新または参照するために当該サービス１０５によって発行される要求情報を監視する（ステップＳ１）。データ要求監視部１０８は、サービス１０５によって要求情報が発行されたことを検知した場合、当該要求情報の種類を判定する（ステップＳ２）。即ちデータ要求監視部１０８は、検知された要求情報が更新要求情報１０６及び参照要求情報１０７のいずれであるかを判定する。 The data request monitoring unit 108 of the node 10-1 monitors request information issued by the service 105 so that the service 105 updates or references the data stored in the data storage unit 102-1 (step S1). . When the data request monitoring unit 108 detects that the request information has been issued by the service 105, the data request monitoring unit 108 determines the type of the request information (step S2). That is, the data request monitoring unit 108 determines whether the detected request information is update request information 106 or reference request information 107.

もし、更新要求情報１０６である場合、データ要求監視部１０８は当該更新要求情報１０６に従って、データ記憶部１０２-1内の該当する領域のデータを更新する（ステップＳ３）。次にデータ要求監視部１０８は、データ更新に用いられた更新要求情報１０６を更新要求情報送信部１０９-1に送出する（ステップＳ４）。この場合、データ要求監視部１０８はステップＳ１に戻って、サービス１０５によって発行される要求情報を監視する。 If it is the update request information 106, the data request monitoring unit 108 updates the data in the corresponding area in the data storage unit 102-1 according to the update request information 106 (step S3). Next, the data request monitoring unit 108 sends the update request information 106 used for data update to the update request information transmitting unit 109-1 (step S4). In this case, the data request monitoring unit 108 returns to step S1 and monitors request information issued by the service 105.

更新要求情報送信部１０９-1は、データ要求監視部１０８によって送出された更新要求情報１０６を受け取ると（ステップＳ１１）、更新反映待ち情報記憶部１０３-1に当該更新要求情報１０６中の更新領域情報を更新反映待ち情報として追加する（ステップＳ１２）。ここでは、更新要求情報１０６中の更新領域情報は、更新反映待ち情報記憶部１０３-1に格納されている更新反映待ち情報キューの最後尾につながれる。更新要求情報送信部１０９-1は、受け取った更新要求情報１０６を当該更新要求情報送信部１０９-1を有するノード以外（つまり他のノード）の更新要求情報受信部に通信路１１を介して送信する（ステップＳ１３）。ここでは、更新要求情報１０６はノード１０-2の更新要求情報受信部１１０-2に送信される。更新要求情報送信部１０９-1は、ステップＳ１３を実行すると、データ要求監視部１０８からの新たな更新要求情報１０６を待つ。 When the update request information transmission unit 109-1 receives the update request information 106 sent from the data request monitoring unit 108 (step S11), the update request information transmission unit 109-1 stores the update area in the update request information 106 in the update reflection waiting information storage unit 103-1. Information is added as update reflection waiting information (step S12). Here, the update area information in the update request information 106 is linked to the tail of the update reflection waiting information queue stored in the update reflection waiting information storage unit 103-1. The update request information transmitting unit 109-1 transmits the received update request information 106 to the update request information receiving unit other than the node having the update request information transmitting unit 109-1 (that is, another node) via the communication path 11. (Step S13). Here, the update request information 106 is transmitted to the update request information receiving unit 110-2 of the node 10-2. When executing step S13, the update request information transmitting unit 109-1 waits for new update request information 106 from the data request monitoring unit 108.

更新要求情報受信部１１０-2は、更新要求情報送信部１０９-1から更新要求情報１０６を受け取ると（ステップＳ２１）、当該更新要求情報１０６に従ってデータ記憶部１０２-2内の該当する領域のデータを更新する（ステップＳ２２）。次に更新要求情報受信部１１０-2は、データ更新に用いられた更新要求情報１０６に含まれている更新領域情報と同一の領域（アドレス）を示す未更新領域情報が未更新領域情報記憶部１０４-2に格納されているかを判定する（ステップＳ２３）。つまり更新要求情報受信部１１０-2は、ステップＳ２２でデータ更新されたデータ記憶部１０２-1内の領域が、未更新領域情報記憶部１０４-2に格納されている未更新領域情報によってデータ未更新の領域として示されているかを判定する。 When receiving the update request information 106 from the update request information transmitting unit 109-1 (step S21), the update request information receiving unit 110-2 receives the data in the corresponding area in the data storage unit 102-2 according to the update request information 106. Is updated (step S22). Next, the update request information receiving unit 110-2 receives unupdated area information indicating the same area (address) as the update area information included in the update request information 106 used for data update. It is determined whether it is stored in 104-2 (step S23). That is, the update request information receiving unit 110-2 uses the unupdated area information stored in the non-updated area information storage unit 104-2 to store the data in the data storage unit 102-1 that has been updated in step S22. It is determined whether it is indicated as an update area.

もし、データ更新された領域がデータ未更新の領域であると未更新領域情報によって示されている場合、更新要求情報受信部１１０-2は当該未更新領域情報を未更新領域情報記憶部１０４-2から削除する（ステップＳ２４）。そして更新要求情報受信部１１０-2は、ノード１０-1の更新反映待ち部１１１-1に通信路１１を介して更新領域情報を返すことにより更新完了を通知する（ステップＳ２５）。これに対し、データ更新された領域に関する未更新領域情報が未更新領域情報記憶部１０４-2に格納されていない場合（ステップＳ２３）、更新要求情報受信部１１０-2はステップＳ２４をスキップして、更新反映待ち部１１１-1に更新完了を通知する（ステップＳ２５）。この場合、更新要求情報受信部１１０-2はステップＳ２１に戻って、ノード１０-1の更新要求情報送信部１０９-1から新たな更新要求情報１０６が送信されるのを待つ。 If the data updated area is indicated by the unupdated area information that the data has not been updated, the update request information receiving unit 110-2 stores the unupdated area information in the unupdated area information storage unit 104-. Delete from 2 (step S24). Then, the update request information receiving unit 110-2 notifies the update completion by returning the update area information to the update reflection waiting unit 111-1 of the node 10-1 via the communication path 11 (step S25). On the other hand, when the non-updated area information regarding the data-updated area is not stored in the non-updated area information storage unit 104-2 (step S23), the update request information receiving unit 110-2 skips step S24. Then, the update reflection waiting unit 111-1 is notified of the update completion (step S25). In this case, the update request information receiving unit 110-2 returns to step S21 and waits for new update request information 106 to be transmitted from the update request information transmitting unit 109-1 of the node 10-1.

更新反映待ち部１１１-1は、ノード１０-2の更新要求情報受信部１１０-2から更新完了の通知を受け取ると（ステップＳ３１）、当該更新要求情報受信部１１０-2によってデータ更新された領域と同一の領域を指定する更新領域情報を更新反映待ち情報記憶部１０３-1から削除する（ステップＳ３２）。そして、更新反映待ち部１１１-1はステップＳ３１に戻って、更新要求情報受信部１１０-2からの新たな更新完了の通知を待つ。 When the update reflection waiting unit 111-1 receives an update completion notification from the update request information receiving unit 110-2 of the node 10-2 (step S31), the data updated by the update request information receiving unit 110-2 The update area information designating the same area as is deleted from the update reflection waiting information storage unit 103-1 (step S32). Then, the update reflection waiting unit 111-1 returns to step S31 and waits for a new update completion notification from the update request information receiving unit 110-2.

一方、ステップＳ１で検知された要求情報が参照要求情報１０７である場合（ステップＳ２）、データ要求監視部１０８は当該参照要求情報１０７の指定する参照領域とアドレスが同一の領域を示す未更新領域情報が未更新領域情報記憶部１０４-1に格納されているかを判定する（ステップＳ５）。もし、参照領域と同一の領域を示す未更新領域情報が未更新領域情報記憶部１０４-1に格納されている場合、データ要求監視部１０８は参照領域が未更新領域であると判定する。この場合、データ要求監視部１０８は、参照領域と同一の領域を示す未更新領域情報が未更新領域情報記憶部１０４-1から削除されるのを待つ（ステップＳ６）。やがて、参照領域とアドレスが同一の領域を示す未更新領域情報が未更新領域情報記憶部１０４-1から削除されたものとする。すると、データ要求監視部１０８は参照要求情報１０７によって指定されるデータ記憶部１０２-1の参照領域が未更新領域でなくなったものとして、当該参照領域からデータを読み出して、当該データをサービス１０５に返す（ステップＳ７）。そして、データ要求監視部１０８はステップＳ１に戻って、サービス１０５によって発行される要求情報を監視する。 On the other hand, when the request information detected in step S1 is the reference request information 107 (step S2), the data request monitoring unit 108 indicates an unupdated area indicating an area having the same address as the reference area specified by the reference request information 107. It is determined whether the information is stored in the unupdated area information storage unit 104-1 (step S5). If the unupdated area information indicating the same area as the reference area is stored in the unupdated area information storage unit 104-1, the data request monitoring unit 108 determines that the reference area is an unupdated area. In this case, the data request monitoring unit 108 waits for the non-updated region information indicating the same region as the reference region to be deleted from the non-updated region information storage unit 104-1 (step S6). Eventually, it is assumed that the unupdated area information indicating the area having the same address as the reference area is deleted from the unupdated area information storage unit 104-1. Then, the data request monitoring unit 108 reads the data from the reference area, assuming that the reference area of the data storage unit 102-1 designated by the reference request information 107 is no longer an unupdated area, and sends the data to the service 105. Return (step S7). Then, the data request monitoring unit 108 returns to step S1 and monitors request information issued by the service 105.

このような状態で、サービス１０５の提供に関して待機系ノードとして動作しているノード１０-2に障害が発生して当該ノード１０-2が停止し、その後当該ノード１０-2が障害から復旧して再度動作を開始したものとする。この場合、図１のクラスタシステムでは同期化処理が行われる。この同期化処理について、図４の動作説明図を参照して説明する。 In this state, a failure occurs in the node 10-2 operating as a standby node for providing the service 105, the node 10-2 stops, and then the node 10-2 recovers from the failure. It is assumed that the operation has started again. In this case, synchronization processing is performed in the cluster system of FIG. This synchronization processing will be described with reference to the operation explanatory diagram of FIG.

まず、ノード１０-2の組込要求部１１２-2が動作を開始して、ノード１０-1の組込要求受理部１１３-1に対して同期化処理の開始を要求する（ステップＳ４１）。このとき、ノード１０-2のデータ記憶部１０２-2においてアドレス１乃至９が割り当てられている９つの領域（アドレスが１乃至９の９つの領域）にいずれもデータＡが格納されているものとする。 First, the incorporation request unit 112-2 of the node 10-2 starts operation, and requests the incorporation request reception unit 113-1 of the node 10-1 to start the synchronization process (step S41). At this time, the data A is stored in all nine areas (the nine areas having addresses 1 to 9) to which the addresses 1 to 9 are assigned in the data storage unit 102-2 of the node 10-2. To do.

一方、ノード１０-1のデータ記憶部１０２-1におけるアドレスが１乃至９の９つの領域領域のうち、アドレスが２，３，４，５及び８の領域のデータは、ノード１０-2が停止していた期間にそれぞれＢ，Ｂ，Ｂ，Ｃ及びＤに更新されているものとする。つまり、データ記憶部１０２-1においてアドレスが２，３，４，５及び８の領域のデータの更新が、データ記憶部１０２-2の同じアドレスの領域（同一領域）に反映されていないものとする。この場合、ノード１０-1の更新反映待ち情報記憶部１０３-1には、アドレス２，３，４，５及び８を示す更新反映待ち情報が格納されている。 On the other hand, among the nine area areas with addresses 1 to 9 in the data storage unit 102-1 of the node 10-1, the data in the areas with addresses 2, 3, 4, 5, and 8 is stopped by the node 10-2. It is assumed that they are updated to B, B, B, C, and D, respectively, during the period that was being performed. In other words, the update of the data in the areas of addresses 2, 3, 4, 5, and 8 in the data storage unit 102-1 is not reflected in the same address area (the same area) of the data storage unit 102-2. To do. In this case, update reflection waiting information indicating addresses 2, 3, 4, 5 and 8 is stored in the update reflection waiting information storage unit 103-1 of the node 10-1.

さて、ノード１０-1の組込要求受理部１１３-1はノード１０-2の組込要求部１１２-2から同期化処理の開始が要求されると（ステップＳ４１）、更新反映待ち情報記憶部１０３-1に格納されている更新反映待ち情報を、当該同期化処理の開始要求に対する応答として当該組込要求部１１２-2に送信する（ステップＳ４２）。この更新反映待ち情報は、ノード１０-1の更新要求情報送信部１０９-1からノード１０-2の更新要求情報受信部１１０-2に送信された更新要求情報の示す更新要求に対して、当該更新要求情報受信部１１０-2から更新完了が通知されていないことを示す。つまり、更新反映待ち情報は、ノード１０-2においてデータ更新が完了していない領域の情報であって、ノード１０-1でのデータ更新をノード１０-2に反映する必要がある領域の情報を示す。 When the integration request reception unit 113-1 of the node 10-1 is requested to start the synchronization process from the integration request unit 112-2 of the node 10-2 (step S41), the update reflection waiting information storage unit The update reflection waiting information stored in 103-1 is transmitted to the integration request unit 112-2 as a response to the synchronization processing start request (step S42). This update reflection waiting information is in response to the update request indicated by the update request information transmitted from the update request information transmitting unit 109-1 of the node 10-1 to the update request information receiving unit 110-2 of the node 10-2. The update request information receiving unit 110-2 indicates that update completion has not been notified. In other words, the update reflection waiting information is information on an area where the data update is not completed in the node 10-2, and information on an area where the data update in the node 10-1 needs to be reflected in the node 10-2. Show.

組込要求部１１２-2は、組込要求受理部１１３-1から更新反映待ち情報を受け取ると、当該更新反映待ち情報を未更新領域情報記憶部１０４-2に反映する（ステップＳ４３）。即ち組込要求部１１２-2は、組込要求受理部１１３-1から受け取った更新反映待ち情報を未更新領域情報として未更新領域情報記憶部１０４-2に追加する。ここでは、アドレス２，３，４，５及び８を示す更新反映待ち情報が未更新領域情報として未更新領域情報記憶部１０４-2に格納される。 When receiving the update reflection waiting information from the integration request receiving unit 113-1, the incorporation request unit 112-2 reflects the update reflection waiting information in the unupdated area information storage unit 104-2 (step S43). That is, the incorporation request unit 112-2 adds the update reflection waiting information received from the incorporation request reception unit 113-1 to the unupdated area information storage unit 104-2 as unupdated area information. Here, the update reflection waiting information indicating the addresses 2, 3, 4, 5, and 8 is stored in the unupdated area information storage unit 104-2 as unupdated area information.

この状態で、ノード１０-1に障害が発生し、当該ノード１０-1で開始されていたサービス１０５をノード１０-2が引き継ぐ必要があるものとする。この場合、従来であれば、同期化処理が完了するまでノード１０-2によるサービス１０５の引き継ぎが待たされる。その理由は、同期化処理が完了するまでは、どの領域のデータが未更新であるか確認する手段がないためである。 In this state, it is assumed that a failure occurs in the node 10-1 and the node 10-2 needs to take over the service 105 started in the node 10-1. In this case, conventionally, the takeover of the service 105 by the node 10-2 is waited until the synchronization processing is completed. The reason is that there is no means for confirming which area data is not updated until the synchronization processing is completed.

これに対して本実施形態では、ノード１０-2（の組込要求部１１２-2）は障害からの復帰時にノード１０-1から更新反映待ち情報を取得して、当該更新反映待ち情報を未更新領域情報記憶部１０４-2に未更新領域情報として反映している。このため、ノード１０-2は図４のフローチャートからも明らかなように、未更新領域情報記憶部１０４-2に格納されている未更新領域情報の示す未更新領域（ここではアドレス２，３，４，５及び８の領域）を参照する処理は行えないものの、その他の領域を参照する処理と全ての領域のデータを更新する処理は行える。よってノード１０-2は、サービス１０５を引き継ぐことができる。未更新領域を参照する処理は、図３のフローチャートに示されるように、未更新領域情報記憶部１０４-2から当該領域を示す未更新領域情報が削除されるまで、つまり当該領域が未更新領域でなくなるまで待たされる（ステップＳ６）
さて、ノード１０-1の更新要求情報送信部１０９-1は、更新反映待ち情報記憶部１０３-1に格納されている更新反映待ち情報が組込要求受理部１１３-1によってノード１０-2の組込要求部１１２-2に送信された場合にも動作する。この場合、更新要求情報送信部１０９-1は、更新反映待ち情報記憶部１０３-1に格納されている更新反映待ち情報を先頭から順に取り出し、当該更新反映待ち情報の示す領域のデータ（更新済みデータ）をデータ記憶部１０２-1から取得する。更新要求情報送信部１０９-1は、更新反映待ち情報（更新領域情報）と取得されたデータとを含む更新要求情報を生成して、当該更新要求情報を更新要求情報受信部１１０-2に送信する。更新要求情報送信部１０９-1は、たとえノード１０-1のサービス１０５が停止しても、更新反映待ち情報記憶部１０３-1に情報が格納されている限り動作する。 On the other hand, in this embodiment, the node 10-2 (embedding request unit 112-2) acquires the update reflection waiting information from the node 10-1 at the time of recovery from the failure, and the update reflection waiting information is not yet received. The updated area information storage unit 104-2 reflects it as unupdated area information. Therefore, as is apparent from the flowchart of FIG. 4, the node 10-2 has an unupdated area (in this case, addresses 2, 3 and 3) indicated by the unupdated area information stored in the unupdated area information storage unit 104-2. Although the process of referring to the areas (4, 5 and 8) cannot be performed, the process of referring to other areas and the process of updating the data in all areas can be performed. Therefore, the node 10-2 can take over the service 105. As shown in the flowchart of FIG. 3, the process of referring to the unupdated area is performed until the unupdated area information indicating the area is deleted from the unupdated area information storage unit 104-2, that is, the area is an unupdated area. It waits until it stops (step S6).
Now, the update request information transmitting unit 109-1 of the node 10-1 receives the update reflection waiting information stored in the update reflection waiting information storage unit 103-1 by the built-in request receiving unit 113-1 of the node 10-2. It also operates when it is transmitted to the incorporation request unit 112-2. In this case, the update request information transmission unit 109-1 sequentially extracts the update reflection wait information stored in the update reflection wait information storage unit 103-1 from the top, and the data in the area indicated by the update reflection wait information (updated) Data) is acquired from the data storage unit 102-1. The update request information transmitting unit 109-1 generates update request information including update reflection waiting information (update region information) and the acquired data, and transmits the update request information to the update request information receiving unit 110-2. To do. Even if the service 105 of the node 10-1 is stopped, the update request information transmitting unit 109-1 operates as long as information is stored in the update reflection waiting information storage unit 103-1.

図４の例では、まず、アドレス２の領域の領域情報（更新領域情報）と当該アドレス２の領域のデータＢとを含む更新要求情報が、ノード１０-1の更新要求情報送信部１０９-1からノード１０-2の更新要求情報受信部１１０-2に送信される（ステップＳ４４）。この場合、ノード１０-2では、データ記憶部１０２-2のアドレス２の領域のデータＡがデータＢに更新される（ステップＳ４５）。すると、ノード１０-2の未更新領域情報記憶部１０４-2から、アドレス２の領域を示す未更新領域情報が削除される（ステップＳ４６）。この段階で、ノード１０-2は、アドレス３，４，５及び８の領域を参照する処理は行えないものの、その他の領域を参照する処理と全ての領域のデータを更新する処理を行うことができる。この参照する処理が行えない領域は、未更新領域情報記憶部１０４-2の内容から判定される。 In the example of FIG. 4, first, update request information including area information (update area information) of the area of address 2 and data B of the area of address 2 is updated request information transmission section 109-1 of node 10-1. To the update request information receiving unit 110-2 of the node 10-2 (step S44). In this case, in the node 10-2, the data A in the area of the address 2 in the data storage unit 102-2 is updated to the data B (step S45). Then, the unupdated area information indicating the area of address 2 is deleted from the unupdated area information storage unit 104-2 of the node 10-2 (step S46). At this stage, the node 10-2 cannot perform the process of referring to the areas of the addresses 3, 4, 5, and 8, but can perform the process of referring to the other areas and the process of updating the data of all the areas. it can. The area that cannot be referred to is determined from the contents of the unupdated area information storage unit 104-2.

次に、アドレス３の領域の領域情報と当該アドレス３の領域のデータＢとを含む更新要求情報が、ノード１０-1の更新要求情報送信部１０９-1からノード１０-2の更新要求情報受信部１１０-2に送信される（ステップＳ４７）。この場合、ノード１０-2では、データ記憶部１０２-2のアドレス３の領域のデータＡがデータＢに更新される（ステップＳ４８）。すると、ノード１０-2の未更新領域情報記憶部１０４-2から、アドレス３の領域を示す未更新領域情報が削除される（ステップＳ４９）。この段階で、ノード１０-2は、アドレス４，５及び８の領域を参照する処理は行えないものの、その他の領域を参照する処理と全ての領域のデータを更新する処理を行うことができる。 Next, the update request information including the area information of the area of address 3 and the data B of the area of address 3 is received from the update request information transmitting unit 109-1 of the node 10-1 and the update request information of the node 10-2. Is transmitted to the unit 110-2 (step S47). In this case, in the node 10-2, the data A in the area of the address 3 in the data storage unit 102-2 is updated to the data B (step S48). Then, the unupdated area information indicating the area of address 3 is deleted from the unupdated area information storage unit 104-2 of the node 10-2 (step S49). At this stage, the node 10-2 cannot perform the process of referring to the areas of the addresses 4, 5, and 8, but can perform the process of referring to the other areas and the process of updating the data of all the areas.

次に、アドレス４の領域の領域情報と当該アドレス３の領域のデータＢとを含む更新要求情報が、ノード１０-1の更新要求情報送信部１０９-1からノード１０-2の更新要求情報受信部１１０-2に送信される（ステップＳ５０）。この場合、ノード１０-2では、データ記憶部１０２-2のアドレス４の領域のデータＡがデータＢに更新される（ステップＳ５１）。すると、ノード１０-2の未更新領域情報記憶部１０４-2から、アドレス４の領域を示す未更新領域情報が削除される（ステップＳ５２）。この段階で、ノード１０-2は、アドレス５及び８の領域を参照する処理は行えないものの、その他の領域を参照する処理と全ての領域のデータを更新する処理を行うことができる。 Next, the update request information including the area information of the area of address 4 and the data B of the area of address 3 is received from the update request information transmitting unit 109-1 of the node 10-1 Is transmitted to the unit 110-2 (step S50). In this case, in the node 10-2, the data A in the area of the address 4 in the data storage unit 102-2 is updated to the data B (step S51). Then, the unupdated area information indicating the area of address 4 is deleted from the unupdated area information storage unit 104-2 of the node 10-2 (step S52). At this stage, the node 10-2 cannot perform the process of referring to the areas of the addresses 5 and 8, but can perform the process of referring to the other areas and the process of updating the data of all the areas.

次に、アドレス５の領域の領域情報と当該アドレス５の領域のデータＣとを含む更新要求情報が、ノード１０-1の更新要求情報送信部１０９-1からノード１０-2の更新要求情報受信部１１０-2に送信される（ステップＳ５３）。この場合、ノード１０-2では、データ記憶部１０２-2のアドレス５の領域のデータＡがデータＣに更新される（ステップＳ５４）。すると、ノード１０-2の未更新領域情報記憶部１０４-2から、アドレス５の領域を示す未更新領域情報が削除される（ステップＳ５５）。この段階で、ノード１０-2は、アドレス８の領域を参照する処理は行えないものの、その他の領域を参照する処理と全ての領域のデータを更新する処理を行うことができる。 Next, the update request information including the area information of the area of the address 5 and the data C of the area of the address 5 is received from the update request information transmitting unit 109-1 of the node 10-1 Is transmitted to the unit 110-2 (step S53). In this case, in the node 10-2, the data A in the area of the address 5 in the data storage unit 102-2 is updated to the data C (step S54). Then, the unupdated area information indicating the area of address 5 is deleted from the unupdated area information storage unit 104-2 of the node 10-2 (step S55). At this stage, the node 10-2 cannot perform the process of referring to the area of the address 8, but can perform the process of referring to the other area and the process of updating the data of all areas.

次に、アドレス８の領域の領域情報と当該アドレス８の領域のデータＤとを含む更新要求情報が、ノード１０-1の更新要求情報送信部１０９-1からノード１０-2の更新要求情報受信部１１０-2に送信される（ステップＳ５６）。この場合、ノード１０-2では、データ記憶部１０２-2のアドレス８の領域のデータＡがデータＤに更新される（ステップＳ５７）。すると、ノード１０-2の未更新領域情報記憶部１０４-2から、アドレス８の領域を示す未更新領域情報が削除される（ステップＳ５８）。この段階で、ノード１０-2は、全ての領域について、参照する処理とデータを更新する処理を行うことができる。 Next, the update request information including the area information of the area of address 8 and the data D of the area of address 8 is received from the update request information transmitting unit 109-1 of the node 10-1 and the update request information of the node 10-2. Is transmitted to the unit 110-2 (step S56). In this case, in the node 10-2, the data A in the area of the address 8 in the data storage unit 102-2 is updated to the data D (step S57). Then, the unupdated area information indicating the area of the address 8 is deleted from the unupdated area information storage unit 104-2 of the node 10-2 (Step S58). At this stage, the node 10-2 can perform a process of referring to and a process of updating data for all areas.

このように本実施形態においては、同期化処理の開始に際してノード１０-2からノード１０-1に与えられる同期化処理開始要求に応じて、ノード１０-1からノード１０-2に更新反映待ち領域の情報（更新反映待ち情報）を未更新領域情報として送信し、その領域に対する参照要求をノード１０-2にてブロックすることで、ノード１０-2での同期化処理が完了していない状態でノード１０-1に障害が発生しても、ノード１０-2は速やかにサービス１０５を引き継ぐことができる。このように、本実施形態のクラスタシステムにおいては、従来技術に比較してより早くサービスを引き継げる状態となるため、可用性が高くなる。 As described above, in the present embodiment, the update reflection waiting area is transferred from the node 10-1 to the node 10-2 in response to the synchronization process start request given from the node 10-2 to the node 10-1 at the start of the synchronization process. Information (waiting for update reflection) is transmitted as unupdated area information, and the reference request for that area is blocked at the node 10-2, so that the synchronization processing at the node 10-2 has not been completed. Even if a failure occurs in the node 10-1, the node 10-2 can quickly take over the service 105. As described above, in the cluster system according to the present embodiment, the service can be taken over earlier than in the conventional technique, so that the availability is increased.

［第２の実施形態］
図５は本発明の第２の実施形態に係るクラスタシステムの構成を示すブロック図である。図５において、図１と同様の要素には同一参照符号を付してある。図５に示すクラスタシステムの特徴は、図１のノード１０-1及び１０-2に代えて、当該ノード１０-1及び１０-2の構成に、それぞれデータ要求部２０１-1及び２０１-2と、データ要求受理部２０２-1及び２０２-2とが追加されたノード２０-1及び２０-2が用いられることにある。 [Second Embodiment]
FIG. 5 is a block diagram showing the configuration of the cluster system according to the second embodiment of the present invention. In FIG. 5, the same elements as those in FIG. 5 is different from the nodes 10-1 and 10-2 in FIG. 1 in that the configuration of the nodes 10-1 and 10-2 includes data request units 201-1 and 201-2, respectively. The nodes 20-1 and 20-2 to which the data request receiving units 202-1 and 202-2 are added are used.

次に、図５のクラスタシステムの動作について、図１のクラスタシステムと相違する点を中心に、図６のフローチャートを参照して説明する。
まず、ノード２０-1が、ノード２０-2で開始されていたサービス１０５を引き継いで稼動系となったものとする。ここでは、ノード２０-1が障害から回復した際に実行される同期化処理が完了しない状態で、当該ノード２０-1がサービス１０５を引き継いだものとする。また、ノード２０-1の同期化処理の開始に際して、当該ノード２０-1からノード２０-2に同期化処理開始要求が与えられることにより、ノード２０-2からノード２０-1に更新反映待ち情報が未更新領域情報として送信され、その領域に対する参照要求がノード２０-1にてブロックされるものとする。これにより、ノード２０-1での同期化処理が完了していない状態でノード２０-2に障害が発生しても、ノード２０-1はサービス１０５を速やかに引き継ぐことができる。この点は、ノード２０-1とノード２０-2との関係（稼動系と待機系との関係）が上記第１の実施形態と逆となった点を除けば、当該第１の実施形態と同様である。 Next, the operation of the cluster system of FIG. 5 will be described with reference to the flowchart of FIG. 6, focusing on the differences from the cluster system of FIG.
First, it is assumed that the node 20-1 takes over the service 105 started by the node 20-2 and becomes an active system. Here, it is assumed that the node 20-1 has taken over the service 105 in a state where the synchronization process executed when the node 20-1 recovers from the failure is not completed. Further, when the synchronization processing of the node 20-1 is started, the node 20-1 sends a synchronization processing start request to the node 20-2, so that the update reflection waiting information is sent from the node 20-2 to the node 20-1. Is transmitted as unupdated area information, and a reference request for the area is blocked by the node 20-1. Thereby, even if a failure occurs in the node 20-2 in a state where the synchronization processing in the node 20-1 is not completed, the node 20-1 can take over the service 105 promptly. This is the same as the first embodiment except that the relationship between the node 20-1 and the node 20-2 (the relationship between the active system and the standby system) is the reverse of the first embodiment. It is the same.

さて、ノード２０-1の同期化処理が完了しない状態で、当該ノード２０-1がサービス１０５を引き継いだ本実施形態では、上記第１の実施形態とは逆に、ノード２０-1の未更新領域情報記憶部１０４-1には未更新領域情報が格納され、ノード２０-2の更新反映待ち情報記憶部１０３-2には更新反映待ち情報が格納されている。このような状態でサービス１０５が開始されているノード２０-1（つまり新たに稼動系となったノード２０-1）において、図５において矢印５１で示されるように、サービス１０５によって参照要求情報１０７が発行されたものとする。 In the present embodiment in which the synchronization process of the node 20-1 is not completed and the node 20-1 takes over the service 105, the node 20-1 is not updated, contrary to the first embodiment. The area information storage unit 104-1 stores unupdated area information, and the update reflection wait information storage unit 103-2 of the node 20-2 stores update reflection wait information. In the node 20-1 where the service 105 has been started in this state (that is, the node 20-1 that has newly become active), as indicated by the arrow 51 in FIG. Shall be issued.

データ要求監視部１０８は、サービス１０５によって発行される参照要求情報１０７を検知したときは（ステップＳ４１，Ｓ４２）、未更新領域情報記憶部１０４-1を参照する。もし、参照要求情報１０７中の参照領域情報の指定する参照領域とアドレスが同一の領域を示す未更新領域情報が未更新領域情報記憶部１０４-1に格納されているならば（ステップＳ４５）、データ要求監視部１０８は、参照領域が未更新領域である（つまり参照領域に格納されているデータが未更新データである）と判定する。ここまでの動作は、前記第１の実施形態のステップＳ１乃至Ｓ５と同様である。 When the data request monitoring unit 108 detects the reference request information 107 issued by the service 105 (steps S41 and S42), the data request monitoring unit 108 refers to the unupdated area information storage unit 104-1. If unupdated area information indicating an area having the same address as the reference area specified by the reference area information in the reference request information 107 is stored in the unupdated area information storage unit 104-1 (step S45), The data request monitoring unit 108 determines that the reference area is an unupdated area (that is, data stored in the reference area is unupdated data). The operations so far are the same as steps S1 to S5 of the first embodiment.

データ要求監視部１０８は、参照領域が未更新領域であると判定した場合、当該参照領域を指定する参照領域情報を図５において矢印５２で示されるようにデータ要求部２０１-1に渡す（ステップＳ４６）。そしてデータ要求監視部１０８は、参照領域と同一の領域を示す未更新領域情報が未更新領域情報記憶部１０４-1から削除されるのを待つ（ステップＳ４７）。 If the data request monitoring unit 108 determines that the reference region is an unupdated region, the data request monitoring unit 108 passes the reference region information designating the reference region to the data requesting unit 201-1 as indicated by the arrow 52 in FIG. S46). Then, the data request monitoring unit 108 waits for the non-updated area information indicating the same area as the reference area to be deleted from the non-updated area information storage unit 104-1 (step S47).

一方、データ要求監視部１０８から参照領域情報を渡されたデータ要求部２０１-1は、当該参照領域情報を図５において矢印５３で示されるようにノード２０-2のデータ要求受理部２０２-2に送信して、当該参照領域情報の指定する領域のデータ（更新データ）を要求する（ステップＳ５０）。 On the other hand, the data request unit 201-1 to which the reference area information is passed from the data request monitoring unit 108 indicates the reference area information as indicated by an arrow 53 in FIG. And request data (update data) of the area specified by the reference area information (step S50).

データ要求受理部２０２-2は、データ要求部２０１-1から参照領域情報を受け取ると、当該参照領域情報によって指定されるデータ記憶部１０２-2の参照領域から図５において矢印５４で示されるようにデータを読み出す（ステップＳ６１）。このデータは、参照要求情報１０７中の参照領域情報によって指定されるデータ記憶部１０２-1の参照領域のデータ（未更新データ）に対応する更新データである。 When the data request receiving unit 202-2 receives the reference area information from the data requesting unit 201-1, the data request receiving unit 202-2 is indicated by an arrow 54 in FIG. 5 from the reference area of the data storage unit 102-2 specified by the reference area information. The data is read out (step S61). This data is update data corresponding to the reference area data (unupdated data) in the data storage unit 102-1 specified by the reference area information in the reference request information 107.

データ要求受理部２０２-2は、参照領域情報によって指定されるデータ記憶部１０２-2の参照領域から読み出されたデータと当該参照領域情報に一致する更新領域情報とを含む更新要求情報を生成して、当該更新要求情報を図５において矢印５５で示されるようにノード２０-1の更新要求情報受信部１１０-1に送信する（ステップＳ６２）。このデータは、上記第１の実施形態であれば同期化処理でノード２０-1のデータ記憶部１０２-1に反映されるべきデータである。 The data request receiving unit 202-2 generates update request information including data read from the reference region of the data storage unit 102-2 designated by the reference region information and update region information that matches the reference region information. Then, the update request information is transmitted to the update request information receiving unit 110-1 of the node 20-1 as indicated by the arrow 55 in FIG. 5 (step S62). This data is data to be reflected in the data storage unit 102-1 of the node 20-1 in the synchronization process in the first embodiment.

更新要求情報受信部１１０-1は、データ要求受理部２０２-2からの更新要求情報を受け取ると、当該更新要求情報に従って、図５において矢印５６で示されるようにデータ記憶部１０２-1内の該当する領域のデータを更新する（ステップＳ７１）。これにより、サービス１０５からの参照要求情報１０７中の参照領域情報（つまりデータ要求受理部２０２-2からの更新要求情報中の更新領域情報）によって指定されるデータ記憶部１０２-1のデータ（未更新データ）は、当該領域情報によって指定されるデータ記憶部１０２-2のデータ（更新データ）に一致化される。そこで更新要求情報受信部１１０-1は、ステップＳ７１でデータが更新された領域と同一の領域を指定する未更新領域情報を、図５において矢印５７で示されるように未更新領域情報記憶部１０４-1から削除する（ステップＳ７２）。 When the update request information receiving unit 110-1 receives the update request information from the data request receiving unit 202-2, the update request information receiving unit 110-1 stores the update request information in the data storage unit 102-1 as indicated by an arrow 56 in FIG. The data in the corresponding area is updated (step S71). As a result, the data (not yet stored) in the data storage unit 102-1 designated by the reference region information in the reference request information 107 from the service 105 (that is, the update region information in the update request information from the data request receiving unit 202-2). (Update data) is matched with data (update data) in the data storage unit 102-2 designated by the area information. Therefore, the update request information receiving unit 110-1 uses the non-updated region information storage unit 104 as shown by an arrow 57 in FIG. 5 to indicate the non-updated region information specifying the same region as the region where the data has been updated in step S71. -1 is deleted (step S72).

データ要求監視部１０８は、ステップＳ４７の待ち合わせ中に、参照要求情報１０７中の参照要求情報で指定される領域と同一の領域を指定する未更新領域情報が未更新領域情報記憶部１０４-1から削除されると、当該領域のデータに関してノード２０-1（のデータ記憶部１０２-1）とノード２０-2（のデータ記憶部１０２-2）との間で一致化されていると判定する。この場合、データ要求監視部１０８は、参照要求情報１０７中の参照要求情報で指定されるデータ記憶部１０２-1の領域からデータを読み出して、図５において矢印５８で示されるように当該データをサービス１０５に返す（ステップＳ４８）。 The data request monitoring unit 108 receives from the unupdated region information storage unit 104-1 unupdated region information that specifies the same region as the region specified by the reference request information in the reference request information 107 during the waiting of step S47. When deleted, it is determined that the data in the area is matched between the node 20-1 (the data storage unit 102-1) and the node 20-2 (the data storage unit 102-2). In this case, the data request monitoring unit 108 reads the data from the area of the data storage unit 102-1 designated by the reference request information in the reference request information 107, and stores the data as indicated by an arrow 58 in FIG. It returns to the service 105 (step S48).

一方、更新要求情報受信部１１０-1は、データ要求受理部２０２-2からの更新要求情報に従う更新完了の通知として、当該更新要求情報中の更新領域情報を図５において矢印５９で示されるようにノード２０-2の更新反映待ち部１１１-2に返す（ステップＳ７３）。更新反映待ち部１１１-2は、更新要求情報受信部１１０-1から更新完了の通知として更新領域情報を受け取ると、当該更新領域情報と同一の領域を指定する更新反映待ち情報を図５において矢印６０で示されるように更新反映待ち情報記憶部１０３-2から削除する（ステップＳ８０）。 On the other hand, the update request information receiving unit 110-1 indicates the update area information in the update request information as indicated by an arrow 59 in FIG. 5 as a notification of the update completion in accordance with the update request information from the data request receiving unit 202-2. To the update reflection waiting section 111-2 of the node 20-2 (step S73). When the update reflection waiting section 111-2 receives the update area information as an update completion notification from the update request information receiving section 110-1, the update reflection waiting information for designating the same area as the update area information is indicated by an arrow in FIG. As indicated by 60, it is deleted from the update reflection waiting information storage unit 103-2 (step S80).

このように本実施形態においては、同期化処理が完了しない状態でサービス１０５を引き継いだノード２０-1において、未更新領域に対する参照要求が発生した場合に、当該ノード２０-1のデータ要求部２０１-1からノード２０-2のデータ要求受理部２０２-2に当該未更新領域のデータ更新が要求される。するとノード２０-1の更新要求情報受信部１１０-1は、ノード２０-2のデータ記憶部１０２-2の対応する領域のデータ（更新データ）をデータ要求受理部２０２-2から受け取って、そのデータで当該未更新領域のデータを更新する。つまり本実施形態においては、未更新領域に対する参照要求が発生した場合、その参照要求をトリガとして当該未更新領域のデータが更新される。これにより、データ要求監視部１０８が参照要求を検知してから当該参照要求で指定される領域のデータを読み出せるようになるまで待ち合わせる時間を短縮することができる。 As described above, in this embodiment, when a reference request for an unupdated area occurs in the node 20-1 that has taken over the service 105 in a state where the synchronization processing is not completed, the data request unit 201 of the node 20-1 -1 requests the data request receiving unit 202-2 of the node 20-2 to update the data in the unupdated area. Then, the update request information receiving unit 110-1 of the node 20-1 receives the data (update data) of the corresponding area of the data storage unit 102-2 of the node 20-2 from the data request receiving unit 202-2, and Data in the unupdated area is updated with data. That is, in this embodiment, when a reference request for an unupdated area occurs, the data in the unupdated area is updated using the reference request as a trigger. As a result, it is possible to shorten the time for which the data request monitoring unit 108 waits until it can read the data in the area specified by the reference request after detecting the reference request.

［第３の実施形態］
図７は本発明の第３の実施形態に係るクラスタシステムの構成を示すブロック図である。図７において、図５と同様の要素には同一参照符号を付してある。図７に示すクラスタシステムの特徴は、図５のノード２０-1及び２０-2に代えて、３台のノード３０-1，３０-2及び３０-3が用いられることにある。ノード３０-1，３０-2及び３０-3の基本的な構成は、図５のノード２０-1及び２０-2のそれと同様である。このため、図７のクラスタシステムにおいても前記第１及び第２の実施形態と同様の動作が行われる。 [Third Embodiment]
FIG. 7 is a block diagram showing the configuration of the cluster system according to the third embodiment of the present invention. In FIG. 7, the same elements as those in FIG. The feature of the cluster system shown in FIG. 7 is that three nodes 30-1, 30-2 and 30-3 are used instead of the nodes 20-1 and 20-2 of FIG. The basic configuration of the nodes 30-1, 30-2 and 30-3 is the same as that of the nodes 20-1 and 20-2 in FIG. Therefore, the same operation as in the first and second embodiments is also performed in the cluster system of FIG.

さて、図７の例では、ノード３０-1でサービス１０５が開始されている。つまり図７の例では、サービス１０５に関して、ノード３０-1が稼動系として機能し、ノード３０-2及び３０-3が待機系として機能する。 In the example of FIG. 7, the service 105 is started at the node 30-1. That is, in the example of FIG. 7, for the service 105, the node 30-1 functions as an active system, and the nodes 30-2 and 30-3 function as standby systems.

ノード３０-1がノード２０-1と異なるのは、更新反映待ち情報記憶部１０３-1に加えて、当該更新反映待ち情報記憶部１０３-1と同様の更新反映待ち情報記憶部２０３を有する点にある。更新反映待ち情報記憶部１０３-1及び２０３は、書き換え可能な不揮発性メモリ内に確保される。更新反映待ち情報記憶部１０３-1及び２０３は、ノード３０-1のデータ記憶部１０２-1でのデータ更新がそれぞれノード３０-2及び３０-3のデータ記憶部に反映されていない領域の情報（更新領域情報）を更新反映待ち情報として格納する。ノード３０-2及び３０-3においても、更新反映待ち情報記憶部１０３-1及び２０３に相当する、それぞれ２つの更新反映待ち情報記憶部が書き換え可能な不揮発性メモリ内に確保される。但し、図７では、ノード３０-2内の更新反映待ち情報記憶部１０３-2以外の更新反映待ち情報記憶部については省略されている。なお、更新反映待ち情報記憶部１０３-1内に、ノード３０-1のデータ記憶部１０２-1でのデータ更新がそれぞれノード３０-2及び３０-3のデータ記憶部に反映されていない領域の情報を更新反映待ち情報として格納する領域が確保されても構わない。この場合、更新反映待ち情報記憶部２０３は不要となる。 The node 30-1 is different from the node 20-1 in that, in addition to the update reflection wait information storage unit 103-1, the node 30-1 has an update reflection wait information storage unit 203 similar to the update reflection wait information storage unit 103-1. It is in. The update reflection waiting information storage units 103-1 and 203 are secured in a rewritable nonvolatile memory. The update reflection waiting information storage units 103-1 and 203 are information on areas in which data updates in the data storage unit 102-1 of the node 30-1 are not reflected in the data storage units of the nodes 30-2 and 30-3, respectively. (Update area information) is stored as update reflection wait information. Also in the nodes 30-2 and 30-3, two update reflection waiting information storage units corresponding to the update reflection waiting information storage units 103-1 and 203, respectively, are secured in a rewritable nonvolatile memory. However, in FIG. 7, the update reflection waiting information storage unit other than the update reflection waiting information storage unit 103-2 in the node 30-2 is omitted. In the update reflection waiting information storage unit 103-1, the data update in the data storage unit 102-1 of the node 30-1 is not reflected in the data storage units of the nodes 30-2 and 30-3, respectively. An area for storing information as update reflection waiting information may be secured. In this case, the update reflection waiting information storage unit 203 is not necessary.

図７のクラスタシステムにおいて、ノード３０-1で障害が発生して、ノード３０-1のクラスタ制御部１００-1に含まれている障害検知部１０１-1、ノード３０-2のクラスタ制御部１００-1に含まれている障害検知部１０１-2またはノード３０-2のクラスタ制御部に含まれている障害検知部（図示せず）で当該ノード３０-1の障害が検出されたものとする。この場合、ノード３０-1で開始されていたサービス１０５が、ノード３０-2または３０-3に引き継がれる。 In the cluster system of FIG. 7, when a failure occurs in the node 30-1, the failure detection unit 101-1 included in the cluster control unit 100-1 of the node 30-1 and the cluster control unit 100 of the node 30-2. -1 includes a failure detection unit 101-2 or a failure detection unit (not shown) included in the cluster control unit of the node 30-2, and the failure of the node 30-1 is detected. . In this case, the service 105 started at the node 30-1 is taken over by the node 30-2 or 30-3.

本実施形態においてノード３０-2のクラスタ制御部１００-2及びノード３０-3のクラスタ制御部は、サービス１０５が開始されていたノード３０-1の更新反映待ち情報記憶部１０３-1及び更新反映待ち情報記憶部２０３にそれぞれ格納されている更新反映待ち情報を比較することにより、ノード３０-2及び３０-3のいずれが当該サービス１０５を引き継ぐかを決定する。具体的には、ノード３０-2のクラスタ制御部１００-2及びノード３０-3のクラスタ制御部は、更新反映待ち情報記憶部１０３-1及び更新反映待ち情報記憶部２０３にそれぞれ格納されている更新反映待ち情報の示す更新反映待ちの総領域量が少ない方のノードがサービス１０５を引き継ぐように合意をとる。 In this embodiment, the cluster control unit 100-2 of the node 30-2 and the cluster control unit of the node 30-3 are the update reflection waiting information storage unit 103-1 and the update reflection of the node 30-1 where the service 105 has been started. By comparing the update reflection waiting information stored in the waiting information storage unit 203, it is determined which of the nodes 30-2 and 30-3 takes over the service 105. Specifically, the cluster control unit 100-2 of the node 30-2 and the cluster control unit of the node 30-3 are stored in the update reflection waiting information storage unit 103-1 and the update reflection waiting information storage unit 203, respectively. An agreement is made so that the node having the smaller total area of the update reflection wait indicated by the update reflection wait information takes over the service 105.

これにより、サービス１０５を引き継ぐノードが同期化処理を完了していないとしても、更新反映待ちの総領域量が少ないことから、同期化処理が完了して全ての領域に対する参照が可能となるまでの時間が短くて済む。また、更新反映待ちの総領域量が少ない方のノードがサービス１０５を引き継ぐことにより、未更新領域に対する参照要求が発生する確率を低くすることができる。また、たとえ未更新領域に対する参照要求が発生しても、第２の実施形態と同様に、未更新領域のデータに対する更新要求を行うことで、参照要求で指定される領域のデータを読み出せるようになるまで待ち合わせる時間を短縮することもできる。 As a result, even if the node that takes over the service 105 has not completed the synchronization process, the total amount of area waiting for update reflection is small, and therefore, until the synchronization process is completed and all areas can be referenced. Time is short. In addition, since the node with the smaller total area waiting for update reflection takes over the service 105, the probability of a reference request for an unupdated area can be reduced. Further, even if a reference request for an unupdated area occurs, the data in the area specified by the reference request can be read by making an update request for the data in the unupdated area, as in the second embodiment. You can also reduce the time to wait until

［第３の実施形態の変形例］
次に、本発明の第３の実施形態の変形例について説明する。本変形例で適用されるクラスタシステムの構成は第３の実施形態と同様であるため、図７を援用する。本変形例の特徴は、ノード３０-1でサービス１０５が開始されている状態で、ノード３０-2で同期化処理が開始される場合を想定すると、ノード３０-1の組込要求受理部１１３-1によって更新反映待ち情報がノード３０-2に送信された場合に、ノード３０-1の更新要求情報送信部１０９-1による更新要求情報の送信が行われない点にある。 [Modification of Third Embodiment]
Next, a modification of the third embodiment of the present invention will be described. Since the configuration of the cluster system applied in this modification is the same as that of the third embodiment, FIG. 7 is used. The feature of this modification is that assuming that the synchronization process is started in the node 30-2 while the service 105 is started in the node 30-1, the built-in request receiving unit 113 of the node 30-1 is assumed. The update request information is not transmitted by the update request information transmitting unit 109-1 of the node 30-1 when the update reflection waiting information is transmitted to the node 30-2.

以下、本変形例の同期化処理について説明する。まず、ノード３０-1でサービス１０５が開始されている状態で、ノード３０-2で同期化処理が開始されるものとする。この場合、ノード３０-2の組込要求部１１２-2からノード３０-1の組込要求受理部１１３-1に対して同期化処理の開始が要求される。すると組込要求受理部１１３-1は、更新反映待ち情報記憶部１０３-1に格納されている更新反映待ち情報を組込要求部１１２-2に送信する。組込要求部１１２-2は、組込要求受理部１１３-1から更新反映待ち情報を受け取ると、当該更新反映待ち情報を未更新領域情報として未更新領域情報記憶部１０４-2に追加する。ここまでの動作は、第１の実施形態における同期化処理の場合（ステップＳ４１〜Ｓ４３）と同様である。 Hereinafter, the synchronization processing of this modification will be described. First, it is assumed that the synchronization process is started in the node 30-2 while the service 105 is started in the node 30-1. In this case, the integration request unit 112-2 of the node 30-2 requests the integration request reception unit 113-1 of the node 30-1 to start the synchronization process. Then, the incorporation request receiving unit 113-1 transmits the update reflection waiting information stored in the update reflection waiting information storage unit 103-1 to the incorporation request unit 112-2. When receiving the update reflection waiting information from the integration request receiving unit 113-1, the incorporation request unit 112-2 adds the update reflection waiting information as unupdated region information to the unupdated region information storage unit 104-2. The operations so far are the same as those in the case of the synchronization processing in the first embodiment (steps S41 to S43).

組込要求部１１２-2は、更新反映待ち情報を未更新領域情報として未更新領域情報記憶部１０４-2に追加すると、当該未更新領域情報記憶部１０４-2に格納されている未更新領域情報の示す領域のデータを、データ要求部２０１-2によりノード３０-3のデータ要求受理部（図示せず）から取得させる。即ちデータ要求部２０１-2は、未更新領域情報記憶部１０４-2に格納されている未更新領域情報を参照領域情報としてノード３０-3のデータ要求受理部に送信することにより、当該参照領域情報（未更新領域情報）の指定する領域のデータを要求する。 When the update request waiting information 112-2 is added to the unupdated area information storage unit 104-2 as update update area information, the incorporation request unit 112-2 stores the unupdated area stored in the unupdated area information storage unit 104-2. Data in the area indicated by the information is acquired from the data request receiving unit (not shown) of the node 30-3 by the data requesting unit 201-2. That is, the data request unit 201-2 transmits the non-updated area information stored in the non-updated area information storage unit 104-2 as reference area information to the data request receiving unit of the node 30-3, so that the reference area Requests data in the area specified by the information (unupdated area information).

すると、ノード３０-3のデータ要求受理部は、前記第２の実施形態において、ノード１０-2のデータ要求受理部２０２-2がノード１０-1のデータ要求部２０１-1から参照領域情報を受け取った場合と同様の動作を行う。即ちノード３０-3のデータ要求受理部は、データ要求部２０１-1からの参照領域情報（未更新領域情報）の指定するデータ記憶部の領域からデータを読み出して、そのデータと当該参照領域情報とを含む更新要求情報を生成し、当該更新要求情報をノード３０-2の更新要求情報受信部１１０-2に送信する。 Then, in the second embodiment, the data request receiving unit 202-2 of the node 30-3 receives the reference area information from the data requesting unit 201-1 of the node 10-1 by the data request receiving unit 202-2 of the node 10-2. The same operation as when received is performed. That is, the data request receiving unit of the node 30-3 reads the data from the area of the data storage unit designated by the reference area information (unupdated area information) from the data requesting unit 201-1, and the data and the reference area information. Is generated, and the update request information is transmitted to the update request information receiving unit 110-2 of the node 30-2.

更新要求情報受信部１１０-2は、ノード３０-3のデータ要求受理部から更新要求情報を受け取ると、当該更新要求情報に従って、データ記憶部１０２-2内の該当する領域のデータを更新する。このように本変形例においては、ノード３０-2の組込要求部１１２-2からノード３０-1の組込要求受理部１１３-1に対する同期化処理の開始要求に応じて当該組込要求受理部１１３-1から返される更新反映待ち情報によって指定される領域のデータを、待機系として機能しているノード３０-3のデータ要求受理部から取得して、当該データでデータ記憶部１０２-2の該当する領域の未更新データを更新することができる。ここで、ノード３０-1の更新要求情報送信部１０９-1による更新要求情報の生成及び送信が行われない点に注意されたい。つまり本変形例においては、サービス１０５が開始されているノード３０-1に負担をかけることなく、同期化処理を行うことができる。 When receiving the update request information from the data request receiving unit of the node 30-3, the update request information receiving unit 110-2 updates the data in the corresponding area in the data storage unit 102-2 in accordance with the update request information. As described above, in the present modification, the incorporation request acceptance is received in response to the synchronization processing start request from the incorporation request unit 112-2 of the node 30-2 to the incorporation request acceptance unit 113-1 of the node 30-1. The data of the area specified by the update reflection waiting information returned from the unit 113-1 is acquired from the data request receiving unit of the node 30-3 functioning as the standby system, and the data storage unit 102-2 is used as the data. The unupdated data in the corresponding area can be updated. Here, it should be noted that update request information is not generated and transmitted by the update request information transmitting unit 109-1 of the node 30-1. That is, in this modification, the synchronization process can be performed without imposing a burden on the node 30-1 where the service 105 is started.

さてノード３０-2の更新要求情報受信部１１０-2は、ノード３０-3のデータ要求受理部から送信された更新要求情報に従ってデータ記憶部１０２-2内の該当する領域のデータ（未更新データ）を更新すると、前記第１の実施形態におけるステップＳ２３〜Ｓ２５と同様の処理を行い、ノード３０-1の更新反映待ち部１１１-1に更新領域情報を返すことにより更新完了を通知する。これにより更新領域情報と同一の領域を指定する更新反映待ち情報が、更新反映待ち部１１１-1によって更新反映待ち情報記憶部１０３-1から削除される。 Now, the update request information receiving unit 110-2 of the node 30-2 receives the data (unupdated data) in the corresponding area in the data storage unit 102-2 according to the update request information transmitted from the data request receiving unit of the node 30-3. ) Is updated, the same processing as steps S23 to S25 in the first embodiment is performed, and the update completion information is notified by returning the update area information to the update reflection waiting unit 111-1 of the node 30-1. As a result, the update reflection waiting information specifying the same area as the update area information is deleted from the update reflection waiting information storage unit 103-1 by the update reflection waiting unit 111-1.

なお、本発明は、上記各実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記各実施形態に開示されている複数の構成要素の適宜な組み合せにより種々の発明を形成できる。例えば、各実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。 Note that the present invention is not limited to the above-described embodiments as they are, and can be embodied by modifying constituent elements without departing from the scope of the invention in the implementation stage. Various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the above embodiments. For example, some components may be deleted from all the components shown in each embodiment.

本発明の第１の実施形態に係るクラスタシステムの構成を示すブロック図。1 is a block diagram showing a configuration of a cluster system according to a first embodiment of the present invention. 図１に示される更新要求情報及び参照要求情報のデータ構造例を示す図。The figure which shows the data structure example of the update request information and reference request information which are shown by FIG. 同第１の実施形態におけるクラスタシステムの各部の処理手順を示すフローチャート。6 is a flowchart showing a processing procedure of each unit of the cluster system in the first embodiment. 同第１の実施形態における同期化処理の動作を説明するための図。The figure for demonstrating the operation | movement of the synchronization process in the said 1st Embodiment. 本発明の第２の実施形態に係るクラスタシステムの構成を示すブロック図。The block diagram which shows the structure of the cluster system which concerns on the 2nd Embodiment of this invention. 同第１の実施形態におけるクラスタシステムの各部の処理手順を示すフローチャート。6 is a flowchart showing a processing procedure of each unit of the cluster system in the first embodiment. 本発明の第３の実施形態に係るクラスタシステムの構成を示すブロック図。The block diagram which shows the structure of the cluster system which concerns on the 3rd Embodiment of this invention.

Explanation of symbols

１０-1，１０-2，２０-1，２０-2，３０-1，３０-2，３０-3…ノード（計算機）、１００-1，１００-2…クラスタ制御部、１０２-1，１０２-2…データ記憶部、１０３-1，１０３-2，２０３…更新反映待ち情報記憶部、１０４-1，１０４-2…未更新領域情報記憶部、１０５…サービス、１０６…更新要求情報、１０７…参照要求情報、１０８…データ要求監視部、１０９-1，１０９-2…更新要求情報送信部、１１０-1，１１０-2…更新要求情報受信部、１１１-1，１１１-2…更新反映待ち部、１１２-1，１１２-2…組込要求部、１１３-1，１１３-2…組込要求受理部、２０１-1，２０１-2…データ要求部、２０２-1，２０２-2…データ要求受理部。 10-1, 10-2, 20-1, 20-2, 30-1, 30-2, 30-3 ... nodes (computers), 100-1, 100-2 ... cluster control units, 102-1, 102 -2 ... Data storage unit, 103-1, 103-2, 203 ... Update reflection waiting information storage unit, 104-1, 104-2 ... Unupdated area information storage unit, 105 ... Service, 106 ... Update request information, 107 Reference request information 108 Data request monitoring unit 109-1, 109-2 Update request information transmission unit 110-1, 110-2 Update request information reception unit 111-1, 111-2 Update reflection Waiting unit, 112-1, 112-2 ... Incorporation request unit, 113-1, 113-2 ... Incorporation request receiving unit, 201-1, 201-2 ... Data requesting unit, 202-1, 202-2 ... Data request acceptance department.

Claims

One of the first and second computers functions as an active computer with respect to provision of a predetermined service, and the rest of the first and second computers function as standby computers, causing a failure in the active computer. In the cluster system in which the standby computer takes over the service started by the active computer,
Each of the first and second computers is
Data storage means for storing data;
Update reflection storing information of the area of the data storage means in which data update has occurred while functioning as the active computer as update reflection wait information representing an area not yet reflected in the data update in the standby computer Waiting information storage means;
Update request information transmitting means for transmitting update request information for reflecting the generated data update to the standby computer to the standby computer;
Unupdated area information storage means for storing, as unupdated area information, information of an area in which data update generated in the active system computer in a state of functioning as the standby system computer is not reflected in the standby system computer;
Based on the update request information, the update request information transmitted from the update request information transmitting means of the active computer in a state of functioning as the standby computer is received, and the data update generated in the active computer is based on the update request information. To update the data storage means of the standby computer and notify the active computer of the update region information of the region in which the data update is reflected, and not update corresponding to the region in which the data update is reflected An update request information receiving means for deleting the unupdated area information when the area information is stored in the unupdated area information storage means of the standby computer;
If the update request information update area information to the operating system computer from the update request information receiving means of the standby computer for transmission to said standby computer is notified, corresponding to the region indicated by the update area information Update reflection waiting means for deleting the update reflection waiting information to be deleted from the update reflection waiting information storage means of the active computer;
When a failure occurs in the state of functioning as the standby computer and the standby computer temporarily stops operating and then recovers from the failure, it requests the active computer to start synchronization processing. The update reflection wait information stored in the update reflection wait information storage means of the active computer is acquired from the active computer, and the update reflection wait information is set as unupdated area information in the standby computer. Built-in request means for storing in the unupdated area information storage means,
When the start of synchronization processing is requested from the incorporation request unit of the standby computer while functioning as the active computer, the update stored in the update reflection waiting information storage unit of the active computer The reflection waiting information is transmitted to the incorporation request means of the standby computer, and the data of the area represented by the update reflection waiting information is read from the data storage means of the active computer, and the standby computer is read by the data. Embedded request receiving means for transmitting update request information necessary for synchronization processing for updating data in the corresponding area of the data storage means to the update request information receiving means of the standby computer ,
When the update request information receiving unit receives the update request information necessary for the synchronization processing transmitted from the built-in request receiving unit of the active computer while functioning as the standby computer, Based on the update request information, the data update generated in the active computer is reflected in the data storage means of the standby computer, and the update area information of the area in which the data update is reflected is notified to the active computer. In addition, when the unupdated area information corresponding to the area in which the data update is reflected is stored in the unupdated area information storage unit of the standby computer, the unupdated area information is deleted. Cluster system.

Each of the first and second computers is
When the service is started on the computer, a reference request issued by the service is detected, and whether the area designated by the detected reference request is an unupdated area is not updated on the computer. Based on the non-updated area information stored in the area information storage means, and if it is an unupdated area, the reference request until the corresponding non-updated area information is deleted from the non-updated area information storage means 2. The cluster system computer according to claim 1, further comprising a data request monitoring unit for waiting for execution of the cluster system.

Each of the first and second computers is
Specification of a reference request detected by the data request monitoring unit of the computer when the service is taken over in a state where unupdated region information indicating an unupdated region is stored in the unupdated region of the computer If the area to be updated is an unupdated area, reference area information representing the area is used to request the computer in which the service has been started and the service has been stopped until the service is taken over. A data requesting means for transmitting the request to the computer where the service is stopped,
When the reference area information is transmitted from the data request unit of the computer in which the service is started in the state where the service is stopped, the data of the area specified by the reference area information is read from the data storage unit, A data request for transmitting update request information for updating the data in the corresponding area of the data storage means of the computer on which the service has been started to the update request information receiving means of the computer on which the service is started. The cluster system according to claim 2, further comprising: a receiving unit.

And further comprising at least one third computer having the same configuration as the first and second computers,
Each of the first and second computers and the at least one third computer operate in agreement with each other such that any one computer becomes an active system and the remaining computers become standby systems. Cluster control means for
The update reflection waiting information storage means included in each of the first and second computers and the at least one third computer is such that the computer functions as the active computer and each remaining computer is the standby computer. If the function is functioning as an update, the information of the area of the data storage means in which the data update has occurred in the active system computer is updated for each standby system computer to indicate the area that has not been reflected in the data update in the standby system computer. Used to store as reflection waiting information,
The cluster control means of the first and second computers and the at least one third computer are stored in the update reflection waiting information storage means of the active computer when a failure occurs in the active computer. Comparing the total area waiting for update reflection indicated by the update reflection waiting information for each standby computer, and making an agreement so that the computer having the smallest total area waiting for update reflection takes over the service. The cluster system according to claim 1.

When a failure occurs in the active computer, the data storage means of each of the active computer and the standby computer applied to the cluster system in which the standby computer takes over the service started by the active computer In the synchronization method for synchronizing data,
When data update to the data storage unit of the active system computer occurs in the active system computer, update information indicating an area that has not been reflected in the data update in the standby system computer, information on the area in which the data update has occurred Storing in the update reflection waiting information storage means of the active computer as waiting information;
Sending an update request information for reflecting the generated data update to the standby computer to the standby computer from the operating system computer,
Based on the transmitted update request information from the operating system computer to the standby computer, the steps of the data update generated in the active system computer said standby computer is reflected in the data storage means of the standby computer,
A step of deleting the non-updated area information by the standby computer when the non-updated area information corresponding to the area where the data update is reflected is stored in the non-updated area information storage means of the standby computer; ,
Notifying the active computer from the standby computer of update region information of the region in which the data update is reflected;
Deleting the update reflection wait information corresponding to the area indicated by the update area information notified to the active computer from the update reflection wait information storage unit of the active computer;
A step of requesting the active computer to start synchronization processing from the standby computer when a failure occurs in the standby computer and the standby computer temporarily stops operating and then recovers from the failure. When,
When the start of the synchronization process is requested, the update reflection waiting information stored in the update reflection waiting information storage means of the active computer is transmitted from the active computer to the standby computer;
The standby computer stores the transmitted update reflection waiting information as unupdated region information in the unupdated region information storage unit of the standby computer;
The active computer reads the data in the area represented by the update reflection waiting information transmitted from the active computer to the standby computer from the data storage means of the active computer, and the data of the standby computer is determined by the data. Transmitting update request information necessary for the synchronization process for updating data in the corresponding area of the data storage means from the active computer to the standby computer ;
Based on the update request information necessary for the synchronization process transmitted from the active computer to the standby computer, the standby computer stores the data update generated by the active computer in the data storage of the standby computer. Steps reflected in the means;
If the non-updated area information corresponding to the area where the data update is reflected based on the update request information necessary for the synchronization processing is stored in the non-updated area information storage unit of the standby computer, the standby A synchronization method comprising: deleting a non-updated area information from the system computer .