JP2010198442A

JP2010198442A - Distributed system with failover function and failover method in the same

Info

Publication number: JP2010198442A
Application number: JP2009043912A
Authority: JP
Inventors: Masa Tanaka; 雅田中
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2009-02-26
Filing date: 2009-02-26
Publication date: 2010-09-09
Anticipated expiration: 2029-02-26
Also published as: JP4818379B2

Abstract

PROBLEM TO BE SOLVED: To ensure real-time property by detecting a computer in deadlock state and failing over to a standby computer. SOLUTION: In each of n active computers 100-i among m computers 100-i connected via a network, a consistency part 262 of an ordered multicast part 2 performs ordered-multicasting of input data by establishing consistency of the input data among (n-t) or more computers. A system configuration change detection part 63 detects a computer in deadlock state at a fixed level or higher in comparison with the computer based on the operation condition of the ordered-multicasting by the consistency part 262, and establishes consistency on the detection among (n-t) or more computers including the deadlock computer. When consistency is established on the detection of the deadlock computer, a system configuration setting part 64 suspends the deadlock computer and activates the standby computer. COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、ネットワークで接続されたｍ台（ｍは５より大きい整数）のコンピュータのうちｎ台（ｎは４＜ｎ＜ｍを満たす整数）のコンピュータを同期的に動作させ、（ｎ−ｔ）台（ｔは３ｔ＜ｎを満たす整数）以上での多重化を保証する分散システムに係り、特に当該ｎ台のコンピュータのいずれかの動作が停滞している場合に、当該停滞しているコンピュータから残りの（ｍ−ｎ）台のコンピュータのうちの１台にフェイルオーバするフェイルオーバ機能を持つ分散システムおよび同システムにおけるフェイルオーバ方法に関する。 The present invention synchronously operates n computers (m is an integer satisfying 4 <n <m) among m computers (m is an integer greater than 5) connected by a network, and (n−t ) Related to a distributed system that guarantees multiplexing over t (t is an integer satisfying 3t <n), and particularly when any of the n computers is stagnant, the stagnating computer The present invention relates to a distributed system having a failover function for failing over to one of the remaining (mn) computers and a failover method in the system.

近年、コンピュータ技術やネットワーク技術の向上は目覚ましく、これに伴って、業務のコンピュータ化が広く行われている。また、その業務の内容によっては、故障などによる中断が許されないものも多く、最近では、複数のコンピュータをネットワークで結合した分散システムを構築することが一般的になりつつある。そして、この分散システムの運用手法の１つに、整列マルチキャストを用いた決定性のプログラムの実行の多重化が存在する。 In recent years, computer technology and network technology have been remarkably improved, and along with this, computerization of business has been widely performed. In addition, depending on the contents of the business, there are many things that cannot be interrupted due to a failure or the like, and recently, it is becoming common to construct a distributed system in which a plurality of computers are connected via a network. One of the distributed system operation techniques is multiplexing of execution of a deterministic program using ordered multicast.

整列マルチキャストは、分散システムへの入力をすべてのコンピュータに配送する仕組みであり、データの到着順序がすべてのコンピュータで同じであることを保証するものである。 The ordered multicast is a mechanism for delivering the input to the distributed system to all computers, and guarantees that the arrival order of data is the same on all computers.

分散システムにおける多重化の手法として、例えば特許文献１は、ｎ台（ｎは４以上の整数）のコンピュータ（つまり４台以上のコンピュータ）がネットワークで接続された分散システムにおいて、スプリットブレインを原理的に発生させず、タイムアウトによる故障発生時の処理の中断も発生させることがない手法を開示している。 As a multiplexing method in a distributed system, for example, Patent Document 1 is based on the principle of split brain in a distributed system in which n computers (n is an integer of 4 or more) (that is, 4 or more computers) are connected by a network. And a method that does not cause interruption of processing when a failure occurs due to a timeout.

この特許文献１が開示する従来の多重化手法（先行技術）においては、ｎ台（ｎは４以上の整数）のコンピュータから構成される分散システムを例にとると、当該ｎ台のコンピュータを同期的に動作させ、当該ｎ台のコンピュータのうちの（ｎ−ｆ）台（ｆは、最大許容障害数と呼ばれる、３ｆ＜ｎを満たす最大の整数）以上での多重化を保証するために、当該ｎ台のコンピュータの各々は、入力候補収集手段と入力候補選定制御手段（第１の入力候補選定制御手段）とを具備する。入力候補収集手段は、ｎ台のコンピュータそれぞれが次に処理する候補として選択した入力データをネットワークを介して収集する。入力候補選定制御手段は、入力候補収集手段により収集された入力データが（ｎ−ｆ）個以上存在する場合に、その中に同一内容の入力データが（ｎ−ｆ）個以上あるか否かを判定し、（ｎ−ｆ）個以上あったときに、その入力データを次に処理する対象として確定する。これにより入力データが整列マルチキャストされる。このように、収集された（ｎ−ｆ）個以上の入力データの中に同一内容の入力データが（ｎ−ｆ）個以上あることを入力候補選定制御手段が判定することは、（ｎ−ｆ）台以上のコンピュータで入力データの合意をとることに他ならない。つまり入力候補選定制御手段は合意手段として機能することを意味する。 In the conventional multiplexing method (prior art) disclosed in Patent Document 1, when a distributed system composed of n computers (n is an integer of 4 or more) is taken as an example, the n computers are synchronized. In order to guarantee multiplexing in (n−f) computers (where f is the maximum integer satisfying 3f <n), of the n computers, Each of the n computers includes input candidate collection means and input candidate selection control means (first input candidate selection control means). The input candidate collecting means collects input data selected as candidates to be processed next by each of the n computers via the network. When there are (n−f) or more input data collected by the input candidate collecting unit, the input candidate selection control unit determines whether or not there are (n−f) or more input data having the same contents. When there are (n−f) or more, the input data is determined as a target to be processed next. As a result, the input data is arranged and multicast. In this way, the input candidate selection control means determines that there are (n−f) or more input data having the same content among the (n−f) or more collected input data. f) It is nothing but an agreement on input data with more than one computer. That is, the input candidate selection control means functions as an agreement means.

例えば、特許文献２では、最大許容障害数ｆに相当する変数としてｔが用いられる。この場合、（ｎ−ｔ）台以上のコンピュータで入力データの合意をとることにより、ｎ台のコンピュータのうちの最大ｔ台の故障（いわゆるビザンチン型の故障）が許容される。このようなアルゴリズムは、ｔ−耐故障ビザンチン合意アルゴリズム、ｔ−耐故障ビザンチン将軍アルゴリズム、或いは単にｔ−耐故障アルゴリズムと呼ばれる。以下の説明では、上述のアルゴリズムをｔ−耐故障アルゴリズムと称し、特許文献１が開示する先行技術に関しても、煩雑さを防ぐために最大許容障害数として「ｔ」を用いることにする。また以下の説明では、上述のｎ台のコンピュータを、多重化を構成するコンピュータまたは同期的に動作させられるべきコンピュータと称することもある。 For example, in Patent Document 2, t is used as a variable corresponding to the maximum allowable failure number f. In this case, the agreement of input data is obtained by (n−t) or more computers, and t failures (so-called Byzantine failure) among the n computers are allowed. Such an algorithm is called a t-fault tolerant Byzantine consensus algorithm, a t-fault tolerant Byzantine general algorithm, or simply a t-fault tolerant algorithm. In the following description, the above-mentioned algorithm is referred to as a t-fault tolerance algorithm, and “t” is used as the maximum allowable failure number in order to prevent complexity even with respect to the prior art disclosed in Patent Document 1. Further, in the following description, the above-mentioned n computers may be referred to as computers constituting the multiplexing or computers to be operated synchronously.

ｔ−耐故障アルゴリズムでは、ｔは、３ｔ＜ｎを満たす０より大きい整数であればよく、必ずしも特許文献１に記載されているように、３ｔ＜ｎを満たす最大の整数である必要はない。 In the t-fault tolerance algorithm, t may be an integer larger than 0 that satisfies 3t <n, and does not necessarily need to be the maximum integer that satisfies 3t <n as described in Patent Document 1.

特許第３６５５２６３号公報Japanese Patent No. 3655263 特開平６-８３６１１号公報JP-A-6-83611

特許文献１に記載の先行技術によれば、ｎ台（ｎは４以上の整数）のコンピュータから構成される分散システム、つまり多重化を構成するコンピュータがｎ台の分散システムでは、ｔ台までのコンピュータの故障が許容される。 According to the prior art described in Patent Document 1, in a distributed system composed of n computers (n is an integer of 4 or more), that is, in a distributed system with n computers constituting multiplexing, up to t computers Computer failure is acceptable.

ここで、ｎ台のコンピュータの中に定常的に遅延している（つまり動作が停滞している）コンピュータが存在する場合を想定する。この場合、遅延しているコンピュータの台数がｔ台までであれば、残りの（ｎ−ｔ）台のコンピュータで合意をとることができるため、リアルタイム性を確保できる。 Here, it is assumed that there are computers that are constantly delayed (that is, whose operations are stagnant) among the n computers. In this case, if the number of delayed computers is t, the remaining (nt) computers can be agreed, so that real-time performance can be ensured.

次に、ｔ台のコンピュータが故障して、動作しているコンピュータが（ｎ−ｔ）台となった状態を想定する。このような状態では、動作している（ｎ−ｔ）台のコンピュータの中に遅延しているコンピュータがあると、その遅延しているコンピュータを含めて合意をとる必要がある。このような場合、分散システムにおける処理速度が、遅延しているコンピュータの処理速度になってしまう。 Next, it is assumed that t computers have failed and there are (nt) computers operating. In such a state, if there is a delayed computer among the (nt) computers in operation, it is necessary to obtain an agreement including the delayed computer. In such a case, the processing speed in the distributed system becomes the processing speed of the computer that is delayed.

つまり先行技術においては、ｔ台までのコンピュータの故障が許容されるものの、ｔ台のコンピュータが故障して、（ｎ−ｔ）台のコンピュータの中に遅延しているコンピュータがあると、分散システム全体の処理速度の低下を招く。 That is, in the prior art, a failure of up to t computers is allowed, but if t computers fail and there are (nt) computers that are delayed, the distributed system The overall processing speed is lowered.

本発明は上記事情を考慮してなされたものでその目的は、多重化を構成するコンピュータのうちで動作が停滞しているコンピュータを検出して待機状態にあるコンピュータにフェイルオーバすることで、リアルタイム性を確保できるフェイルオーバ機能を持つ分散システムおよび同システムにおけるフェイルオーバ方法を提供することにある。 The present invention has been made in consideration of the above circumstances, and its purpose is to detect a computer that is stagnant among computers constituting a multiplex and to fail over to a computer in a standby state, thereby realizing real-time performance. It is an object to provide a distributed system having a failover function capable of ensuring the above and a failover method in the system.

本発明の１つの観点によれば、ネットワークで接続されたｍ台（ｍは５以上の整数）のコンピュータのうちｎ台（ｎは４≦ｎ＜ｍを満たす整数）のコンピュータを稼動状態に設定して、ｔ−耐故障アルゴリズムの適用により同期的に動作させる、フェイルオーバ機能を持つ分散システムが提供される。この分散システムを構成する前記ｍ台のコンピュータの各々は、前記ｍ台のコンピュータが、それぞれ同期的に動作させられるべき稼動状態にあるか、或いは待機状態にあるかを示すシステム構成情報を格納するシステム構成記憶手段と、前記システム構成記憶手段に格納されているシステム構成情報によって稼動状態にあることが示されているｎ台のコンピュータのうちの（ｎ−ｔ）台（ｔは３ｔ＜ｎを満たす０より大きい整数）以上での多重化を保証するために、当該（ｎ−ｔ）台以上のコンピュータで入力データの合意をとることで当該入力データを整列マルチキャストする合意手段と、前記ｎ台のコンピュータのうち、自身と比較して予め定められたレベル以上停滞しているコンピュータを、前記合意手段による整列マルチキャストの実行状況に基づいて検出し、前記停滞しているコンピュータを検出した場合、前記停滞しているコンピュータの検出に関し、前記ｎ台のコンピュータのうちの自身を含む（ｎ−ｔ）台以上のコンピュータで合意をとることで、前記停滞しているコンピュータの検出を確認する検出手段と、前記停滞しているコンピュータの検出の確認に応じて、確認された前記停滞しているコンピュータを待機状態とし、待機状態にあるコンピュータを稼動状態とするように、前記システム構成記憶手段に格納されているシステム構成情報情報を更新することによって、前記停滞しているコンピュータから前記待機状態にあるコンピュータへフェイルオーバするシステム構成設定手段とを具備する。 According to one aspect of the present invention, n (n is an integer satisfying 4 ≦ n <m) computers among m computers (m is an integer of 5 or more) connected via a network are set to an operating state. Thus, a distributed system having a failover function that operates synchronously by application of the t-fault tolerance algorithm is provided. Each of the m computers constituting the distributed system stores system configuration information indicating whether the m computers are in an operating state to be operated synchronously or in a standby state. The system configuration storage means and (nt) computers (n is 3t <n) among the n computers indicated to be in the operating state by the system configuration information stored in the system configuration storage means. In order to guarantee multiplexing at an integer greater than or equal to 0), an agreement means for arranging and multicasting the input data by agreeing the input data with the (nt) or more computers, and the n units Computers that are stagnant more than a predetermined level compared to itself, are arranged multicast by the agreement means When detecting based on the execution status and detecting the stagnant computer, the detection of the stagnant computer is performed by (nt) or more computers including itself among the n computers. According to the agreement, the detection means for confirming the detection of the stagnant computer and the confirmed stagnant computer are put into a standby state in accordance with the confirmation of the detection of the stagnant computer, and the standby System configuration for failing over from the stagnated computer to the computer in the standby state by updating the system configuration information information stored in the system configuration storage means so that the computer in the state is in the operating state Setting means.

本発明によれば、停滞しているコンピュータを検出した場合、この停滞しているコンピュータの検出に関し、停滞しているコンピュータを検出したコンピュータを含む（ｎ−ｔ）台以上のコンピュータで合意をとることで、停滞しているコンピュータを正しく検出できたことを確認することができる。また本発明によれば、この確認に応じて、確認された停滞しているコンピュータを待機状態とし、待機状態にあるコンピュータを稼動状態とするシステム変更を行うことで、つまり停滞しているコンピュータから待機状態にあるコンピュータにフェイルオーバすることで、アルタイム性を確保することができる。 According to the present invention, when a stagnant computer is detected, an agreement is reached between (nt) or more computers including the computer that has detected the stagnant computer regarding the detection of the stagnant computer. Thus, it can be confirmed that the stagnant computer has been correctly detected. Further, according to the present invention, in response to this confirmation, by making a system change in which the confirmed stagnated computer is set in a standby state and the computer in a standby state is in an operating state, that is, from a stagnant computer By failing over to a computer in a standby state, it is possible to ensure real-time performance.

本発明の一実施形態に係る分散システムの構成を示すブロック図。1 is a block diagram showing a configuration of a distributed system according to an embodiment of the present invention. 同実施形態の分散システムを構成するコンピュータの機能構成を示すブロック図。2 is an exemplary block diagram showing a functional configuration of a computer constituting the distributed system of the embodiment. FIG. 同実施形態の入力受付キュー部に積まれる入力パケットデータのデータ構造例を示す図。The figure which shows the example of a data structure of the input packet data piled up in the input reception queue part of the embodiment. 同実施形態の分散システムを構成するコンピュータの整列マルチキャストプロトコルデータ送受信部間で送受信される整列マルチキャストプロトコルデータのレイアウトを示す図。The figure which shows the layout of the alignment multicast protocol data transmitted / received between the alignment multicast protocol data transmission / reception parts of the computer which comprises the distributed system of the embodiment. 同実施形態におけるシステム構成記憶部のデータ構造例を示す図。The figure which shows the example of a data structure of the system configuration | structure storage part in the embodiment. 同実施形態における最大確定入力順序番号履歴記憶部のデータ構造例を示す図。The figure which shows the example of a data structure of the largest definite input sequence number history storage part in the embodiment. 同実施形態の分散システムを構成するコンピュータのシステム構成変更合意プロトコルデータ送受信部間で送受信されるシステム構成変更合意プロトコルデータのレイアウトを示す図。The figure which shows the layout of the system configuration change agreement protocol data transmitted / received between the system configuration change agreement protocol data transmission / reception parts of the computer which comprises the distributed system of the embodiment. 同実施形態において各コンピュータが実行する整列マルチキャストの１回の配送を行う基本的な部分の動作手順を示す第１のフローチャート。The 1st flowchart which shows the operation | movement procedure of the basic part which performs one delivery of the ordered multicast which each computer performs in the same embodiment. 同実施形態において各コンピュータが実行する整列マルチキャストの１回の配送を行う基本的な部分の動作手順を示す第２のフローチャート。The 2nd flowchart which shows the operation | movement procedure of the basic part which performs one delivery of the ordered multicast which each computer performs in the same embodiment. 同実施形態において各コンピュータが実行する、多重化実行の遅延を解消するための動作手順を示す第１のフローチャート。The 1st flowchart which shows the operation | movement procedure for eliminating the delay of multiplexing execution which each computer performs in the same embodiment. 同実施形態において各コンピュータが実行する、多重化実行の遅延を解消するための動作手順を示す第２のフローチャート。The 2nd flowchart which shows the operation | movement procedure for eliminating the delay of multiplexing execution which each computer performs in the same embodiment. 同実施形態において各コンピュータが実行する、多重化実行の遅延を解消するための動作手順を示す第３のフローチャート。The 3rd flowchart which shows the operation | movement procedure for eliminating the delay of multiplexing execution which each computer performs in the same embodiment. 同実施形態において各コンピュータが実行する、多重化実行の遅延を解消するための動作手順を示す第４のフローチャート。FIG. 10 is a fourth flowchart showing an operation procedure for eliminating a delay in multiplexing execution, which is executed by each computer in the embodiment. 同実施形態において各コンピュータが実行する、停滞しているコンピュータを探すための処理の手順を示すフローチャート。The flowchart which shows the procedure of the process for searching for the computer which has stagnated which each computer performs in the embodiment. 同実施形態において各コンピュータが実行する、報告種類のシステム構成変更合意プロトコルデータを受け取った場合の処理の手順を示すフローチャート。6 is an exemplary flowchart illustrating a processing procedure executed when each computer receives report-type system configuration change agreement protocol data in the embodiment. 同実施形態において各コンピュータが実行する、同意種類または非同意種類のシステム構成変更合意プロトコルデータを受け取った場合の処理の手順を示すフローチャート。6 is an exemplary flowchart illustrating a procedure of processing performed when each computer receives agreement type or non-agreement type system configuration change agreement protocol data executed by each computer in the embodiment. 同実施形態において各コンピュータが実行する、確定されたシステム構成変更要求パケットを受け取った場合の処理の手順を示すフローチャート。6 is an exemplary flowchart illustrating a processing procedure executed by each computer in the embodiment when a confirmed system configuration change request packet is received. 同実施形態において各コンピュータが実行する、システム構成通知プロトコルデータを受け取った場合の処理の手順を示すフローチャート。6 is an exemplary flowchart illustrating a processing procedure executed by each computer in the embodiment when system configuration notification protocol data is received.

以下、本発明の実施の形態につき図面を参照して説明する。
図１は本発明の一実施形態に係る分散システムの構成を示すブロック図である。図１において、分散システム１０は、例えば５台のコンピュータ１００-1（＃１）〜１００-5（＃５）から構成されている。コンピュータ１００-1〜１００-5のうち、ｔ−耐故障アルゴリズムによる多重化を保証するのに必要な台数以上のコンピュータ、例えば４台のコンピュータ１００-1〜１００-4は稼動状態にあるものとする。この稼動状態にあるコンピュータ１００-1〜１００-4は多重化を構成する。コンピュータ１００-1〜１００-5のうち、残りのコンピュータ１００-5は待機状態にある。 Embodiments of the present invention will be described below with reference to the drawings.
FIG. 1 is a block diagram showing a configuration of a distributed system according to an embodiment of the present invention. In FIG. 1, the distributed system 10 includes, for example, five computers 100-1 (# 1) to 100-5 (# 5). Of the computers 100-1 to 100-5, it is assumed that more than the number of computers necessary for guaranteeing multiplexing by the t-fault tolerance algorithm, for example, four computers 100-1 to 100-4 are in operation. . The computers 100-1 to 100-4 in this operating state constitute multiplexing. Of the computers 100-1 to 100-5, the remaining computer 100-5 is in a standby state.

ここで、分散システム１０を構成するコンピュータの数をｍで表し、分散システム１０において現在多重化されている（稼動状態にある）コンピュータの数をｎで表すものとする。ｎは上記特許文献１と同様に４以上の整数であり、ｍはｎより大きい整数である。つまり、ｎは、４≦ｎ＜ｍを満たす整数である。この場合、ｔを３ｔ＜ｎを満たす０より大きい整数とすると、ｔ−耐故障アルゴリズムにより、ｎ台のコンピュータのうちの最大ｔ台の故障が許容される。図１の分散システムの例では、ｎは４であり、ｍは５である。ｎが４の場合、ｔは１となり、１台のコンピュータの故障が許容される。ｔが１の場合、ｔ台のコンピュータの故障を許容するのに必要なｎ、つまりｔ−耐故障アルゴリズムによる多重化を保証するのに必要なコンピュータの台数の最小値は４である。図１の分散システムの例は、この場合に相当する。 Here, the number of computers constituting the distributed system 10 is represented by m, and the number of computers currently multiplexed in the distributed system 10 (in an operating state) is represented by n. n is an integer of 4 or more as in Patent Document 1, and m is an integer larger than n. That is, n is an integer that satisfies 4 ≦ n <m. In this case, assuming that t is an integer greater than 0 that satisfies 3t <n, the t-fault tolerance algorithm allows a maximum of t failures of n computers. In the example of the distributed system of FIG. 1, n is 4 and m is 5. When n is 4, t is 1, and one computer is allowed to fail. When t is 1, the minimum value of n necessary to tolerate the failure of t computers, that is, the minimum number of computers required to guarantee multiplexing by the t-fault tolerance algorithm is 4. The example of the distributed system in FIG. 1 corresponds to this case.

コンピュータ１００-1〜１００-5は、ネットワークＡを介してクライアント装置２００と接続されている。コンピュータ１００-1〜１００-5は、クライアント装置２００以外のクライアント装置（図示せず）ともネットワークＡを介して接続されているものとする。本実施形態においてネットワークＡはパブリックネットワーク（外部ネットワーク）である。コンピュータ１００-1〜１００-5間は、ネットワークＢを介して接続されている。本実施形態においてネットワークＢはプライベートネットワーク（内部ネットワーク）である。 The computers 100-1 to 100-5 are connected to the client device 200 via the network A. The computers 100-1 to 100-5 are also connected to a client device (not shown) other than the client device 200 via the network A. In the present embodiment, the network A is a public network (external network). The computers 100-1 to 100-5 are connected via a network B. In the present embodiment, the network B is a private network (internal network).

稼動状態にあるコンピュータ１００-1〜１００-4、つまり分散システム１０において多重化を構成しているコンピュータ１００-1〜１００-4は、前記特許文献１に記載された分散システムにおけるコンピュータと同様に、ネットワークＡを介してクライアント装置２００から受け取った入力パケット（入力）を他のコンピュータと同じ順序で処理していく。なお、クライアント装置２００からの入力パケットは、コンピュータ１００-1〜１００-4のうちのいずれかのコンピュータに入力される。 The computers 100-1 to 100-4 in operation, that is, the computers 100-1 to 100-4 constituting multiplexing in the distributed system 10, are the same as the computers in the distributed system described in Patent Document 1. The input packet (input) received from the client device 200 via the network A is processed in the same order as other computers. An input packet from the client device 200 is input to any one of the computers 100-1 to 100-4.

コンピュータ１００-1〜１００-5は、それぞれ同一のアプリケーションプログラム３（図２参照）を有している。コンピュータ１００-1〜１００-5のうち、多重化を構成するコンピュータ１００-1〜１００-4は、同一の初期状態から始まる。その後、クライアント装置２００から分散システムに入力されるデータは、必ず整列マルチキャストを通して、コンピュータ１００-1〜１００-4に同一順序で配送される。これにより、コンピュータ１００-1〜１００-4においてそれぞれのアプリケーションプログラム３が実行される。 Each of the computers 100-1 to 100-5 has the same application program 3 (see FIG. 2). Among the computers 100-1 to 100-5, the computers 100-1 to 100-4 constituting the multiplexing start from the same initial state. Thereafter, data input from the client device 200 to the distributed system is always delivered to the computers 100-1 to 100-4 in the same order through ordered multicast. Thereby, each application program 3 is executed in the computers 100-1 to 100-4.

コンピュータ１００-1〜１００-4がそれぞれ有するアプリケーションプログラム３への入力データ列は、整列マルチキャストにより同一順序となっている。このため、前記特許文献１に記載されているような決定性のプログラムの特徴により、コンピュータ１００-1〜１００-4の状態が同一に保たれ、出力データ列もすべて同じとなる。つまり、プログラムの実行が多重化される。 The input data string to the application program 3 that each of the computers 100-1 to 100-4 has has the same order by the ordered multicast. For this reason, the states of the computers 100-1 to 100-4 are kept the same and the output data strings are all the same due to the characteristics of the deterministic program as described in Patent Document 1. That is, program execution is multiplexed.

図２は、図１に示されるコンピュータ１００-i（ｉ＝１，２，…-5）の構成を示すブロック図である。図２において、クライアント装置２００からネットワークＡを介してコンピュータ１００-iに送信されて、当該コンピュータ１００-iの入力受付キュー部（以下、受付キューと称する）１で受け付けられた入力パケットは、当該受付キュー１に受付順に積まれる。受付キュー１に積まれた入力パケットは、整列マルチキャスト部２（内の入力パケット確定判定部２６に含まれている合意部２６２）によってアプリケーションプログラム３または後述するシステム構成管理部６に配送される。なお、受付キュー１に積まれる入力パケットには、クライアント装置２００からの入力パケットの他に、システム構成管理部６のシステム構成変更検出部６３から送られるシステム構成変更を要求する特定の入力パケット（システム構成変更要求パケット）がある。 FIG. 2 is a block diagram showing a configuration of the computer 100-i (i = 1, 2,... -5) shown in FIG. In FIG. 2, an input packet transmitted from the client apparatus 200 to the computer 100-i via the network A and received by the input reception queue unit (hereinafter referred to as reception queue) 1 of the computer 100-i is Stacked in the reception queue 1 in the order of reception. The input packet loaded in the reception queue 1 is delivered to the application program 3 or the system configuration management unit 6 described later by the ordered multicast unit 2 (the agreement unit 262 included in the input packet determination unit 26). In addition to the input packet from the client device 200, the input packet loaded in the reception queue 1 includes a specific input packet requesting a system configuration change sent from the system configuration change detection unit 63 of the system configuration management unit 6 ( System configuration change request packet).

アプリケーションプログラム３は、配送された入力パケットを受けて、プログラム状態管理部４に保存されている状態に従って当該入力パケットを処理し、出力パケットを生成する。生成された出力パケットは、出力フィルタ部５で選別されてから、ネットワークＡを介してクライアント装置２００に返却される（出力）。 The application program 3 receives the delivered input packet, processes the input packet according to the state stored in the program state management unit 4, and generates an output packet. The generated output packet is selected by the output filter unit 5 and then returned to the client device 200 via the network A (output).

次に、コンピュータ１００-iの整列マルチキャスト部２の構成について説明する。整列マルチキャスト部２は、前記特許文献１に記載された整列マルチキャスト部と同様に、入力順序番号記憶部２１、入力パケットジャーナル記憶部２２、整列マルチキャストプロトコルデータ送受信部２３、ステップ番号記憶部２４、候補パケット記憶部２５、入力パケット確定判定部２６、最大確定入力順序番号記憶部２７、遅延記憶部２８およびスキップ判定部２９の周知の構成を含んでいる。 Next, the configuration of the ordered multicast unit 2 of the computer 100-i will be described. Similar to the ordered multicast unit described in Patent Document 1, the ordered multicast unit 2 includes an input sequence number storage unit 21, an input packet journal storage unit 22, an ordered multicast protocol data transmission / reception unit 23, a step number storage unit 24, a candidate This includes well-known configurations of a packet storage unit 25, an input packet determination determination unit 26, a maximum determination input sequence number storage unit 27, a delay storage unit 28, and a skip determination unit 29.

入力順序番号記憶部２１は、整列マルチキャストによってコンピュータ１００-iへ次に配送される入力パケットの順序番号（つまり整列マルチキャストにシリアルに付される最新の順序番号）を格納する。入力パケットジャーナル記憶部２２は、整列マルチキャストによってコンピュータ１００-iへの配送が確定した入力パケットの列を最近のものから一定の量だけ格納する。 The input sequence number storage unit 21 stores the sequence number of the next input packet to be delivered to the computer 100-i by the ordered multicast (that is, the latest sequence number assigned serially to the ordered multicast). The input packet journal storage unit 22 stores a certain amount of the input packet sequence that is confirmed to be delivered to the computer 100-i by the ordered multicast from the latest one.

整列マルチキャストプロトコルデータ送受信部２３は、システム構成管理部６の後述するシステム構成記憶部６１を参照して、他の稼動状態にあるコンピュータの整列マルチキャストプロトコルデータ送受信部２３とネットワークＢを介してプロトコルデータ（整列マルチキャストプロトコルデータ）を授受する。 The ordered multicast protocol data transmission / reception unit 23 refers to a system configuration storage unit 61 (to be described later) of the system configuration management unit 6, and transmits protocol data via the ordered multicast protocol data transmission / reception unit 23 of the computer in another operating state and the network B. Exchange (aligned multicast protocol data).

本実施形態では、クライアント装置２００とコンピュータ１００-iとの間のデータの授受と、コンピュータ１００-i相互間のデータの授受とで、使用するネットワークが切り替えられる。これによりネットワーク負荷が軽減される。しかし、クライアント装置２００とコンピュータ１００-iとの間のデータの授受と、コンピュータ１００-i相互間のデータの授受とが、例えばネットワークＡを介して行われる構成であっても構わない。またネットワークＡが必ずしもパブリックネットワークである必要はない。 In this embodiment, the network to be used is switched between data exchange between the client apparatus 200 and the computer 100-i and data exchange between the computers 100-i. This reduces the network load. However, the configuration may be such that data exchange between the client device 200 and the computer 100-i and data exchange between the computers 100-i are performed via the network A, for example. Further, the network A is not necessarily a public network.

ステップ番号記憶部２４、候補パケット記憶部２５および入力パケット確定判定部２６は、整列マルチキャストによってコンピュータ１００-iへ次に配送される入力パケットを決定し、またシステム構成を変更するアルゴリズムで用いられる。 The step number storage unit 24, the candidate packet storage unit 25, and the input packet decision determination unit 26 are used in an algorithm that determines the next input packet to be delivered to the computer 100-i by ordered multicast and changes the system configuration.

ステップ番号記憶部２４は、プロトコルのステップを示すステップ番号を格納する。候補パケット記憶部２５は、そのステップにおける各コンピュータの「入力候補」となる入力パケットを計ｎ個格納する。 The step number storage unit 24 stores a step number indicating a protocol step. The candidate packet storage unit 25 stores a total of n input packets that are “input candidates” for each computer in that step.

入力パケット確定判定部２６は、候補パケット記憶部２５の情報から入力パケットの確定の判定および次ステップの「入力候補」の決定を行う。入力パケット確定判定部２６はさらに、前記特許文献１に記載された入力パケット確定判定部と異なり、入力パケットをアプリケーションプログラム３およびシステム構成管理部６（内のシステム構成設定部６４）のいずれに渡すかを決定する。この決定のために、受付キュー１に積まれる入力パケットには、処理種別を示す情報（処理種別情報）が付加される。入力パケット確定判定部２６は、入力候補収集部２６１および合意部２６２を含む。 The input packet determination determination unit 26 determines the determination of the input packet from the information in the candidate packet storage unit 25 and determines the “input candidate” in the next step. Unlike the input packet determination determination unit described in Patent Document 1, the input packet determination determination unit 26 passes the input packet to either the application program 3 or the system configuration management unit 6 (the system configuration setting unit 64). To decide. For this determination, information indicating the processing type (processing type information) is added to the input packet loaded in the reception queue 1. The input packet confirmation determination unit 26 includes an input candidate collection unit 261 and an agreement unit 262.

図３は受付キュー１に積まれるデータ（入力パケットデータ）のデータ構造例を示す。図３に示されるように、入力パケットデータは、処理種別および入力パケットの各フィールドを含む。入力パケットフィールドには入力パケットが格納（設定）され、処理種別フィールドには処理種別情報が格納（設定）される。 FIG. 3 shows an example of the data structure of data (input packet data) accumulated in the reception queue 1. As shown in FIG. 3, the input packet data includes fields of processing type and input packet. An input packet is stored (set) in the input packet field, and process type information is stored (set) in the process type field.

本実施形態において処理種別情報は、入力パケットフィールドに格納されている入力パケットをアプリケーションプログラム３またはシステム構成管理部６（内のシステム構成設定部６４）のいずれに渡すかを入力パケット確定判定部２６（内の合意部２６２）が決定するための処理種別を示す。そのため、処理種別情報の示す処理種別は、（１）アプリケーションと（２）構成とに分けられる。処理種別が「アプリケーション」の場合、入力パケットが外部のクライアント装置２０００から入力されたものであることをも示し、処理種別が「構成」の場合、入力パケットが分散システム１０を構成するいずれかのコンピュータのシステム構成管理部６（内のシステム構成変更検出部６３）から当該いずれかのコンピュータの受付キュー１に入力されたものであることをも示す。 In the present embodiment, the processing type information indicates whether the input packet stored in the input packet field is to be passed to the application program 3 or the system configuration management unit 6 (internal system configuration setting unit 64). The processing type for the (consensus unit 262) to determine is shown. Therefore, the process type indicated by the process type information is divided into (1) application and (2) configuration. When the processing type is “application”, it also indicates that the input packet is input from the external client device 2000. When the processing type is “configuration”, the input packet constitutes one of the distributed system 10 It also indicates that the information is input from the computer system configuration management unit 6 (internal system configuration change detection unit 63) to the reception queue 1 of any one of the computers.

再び図２を参照すると、最大確定入力順序番号記憶部２７は、多重化を構成している（稼動状態にある）他のコンピュータも含め、配送が確定したことがわかっている最大の入力順序番号を格納する。遅延記憶部２８は、多重化を構成している他の（ｎ−１）台（ｎ＝４）のコンピュータよりも遅延しているかどうかを示す（ｎ−１）個の遅延フラグ（ｎ＝４の本実施形態では、３個のフラグ）を格納する。スキップ判定部２９は、遅延記憶部２８の情報からスキップ動作の必要性を判定およびスキップ動作を実行する。 Referring to FIG. 2 again, the maximum confirmed input sequence number storage unit 27 includes the largest input sequence number that is known to have been confirmed to be delivered, including other computers that are configured in multiplexing (in an operating state). Is stored. The delay storage unit 28 has (n−1) delay flags (n = 4) indicating whether or not the delay storage is delayed from the other (n−1) computers (n = 4) constituting the multiplexing. In this embodiment, three flags) are stored. The skip determination unit 29 determines the necessity of the skip operation from the information in the delay storage unit 28 and executes the skip operation.

以降の説明では、入力順序番号記憶部２１に格納された入力順序番号を該当入力順序番号と呼び、ステップ番号記憶部２４に格納されたステップ番号を該当ステップ番号と呼ぶ。コンピュータ１００-iの整列マルチキャスト部２に含まれている候補パケット記憶部２５に格納されているｎ個の「入力候補」のうち、当該コンピュータ１００-i自身（自コンピュータ）に対応する「入力候補」を自候補と呼び、当該自候補以外の「入力候補」を他候補と呼ぶ。 In the following description, the input sequence number stored in the input sequence number storage unit 21 is referred to as a corresponding input sequence number, and the step number stored in the step number storage unit 24 is referred to as a corresponding step number. Of the n “input candidates” stored in the candidate packet storage unit 25 included in the ordered multicast unit 2 of the computer 100-i, the “input candidates” corresponding to the computer 100-i itself (own computer) Is called a self-candidate, and “input candidates” other than the self-candidate are called other candidates.

次に、整列マルチキャストプロトコルデータ送受信部２３によって送受信される整列マルチキャストプロトコルデータについて説明する。
図４は、整列マルチキャストプロトコルデータのレイアウトを示す図である。図４に示されるように、整列マルチキャストプロトコルデータ送受信部２３によって送受信される整列マルチキャストプロトコルデータは、種類、送信者、入力順序番号、ステップ番号（整列マルチキャストステップ番号）、最大確定入力順序番号、処理種別および入力パケットの各フィールドを含む。図４に示される整列マルチキャストプロトコルデータが前記特許文献１に記載されているプロトコルデータと相違するのは、前述の処理種別フィールドが追加されている点にある。 Next, the ordered multicast protocol data transmitted / received by the ordered multicast protocol data transmitting / receiving unit 23 will be described.
FIG. 4 is a diagram showing a layout of the ordered multicast protocol data. As shown in FIG. 4, the ordered multicast protocol data transmitted / received by the ordered multicast protocol data transmitting / receiving unit 23 includes type, sender, input order number, step number (ordered multicast step number), maximum confirmed input order number, processing. Includes fields for type and input packet. The aligned multicast protocol data shown in FIG. 4 is different from the protocol data described in Patent Document 1 in that the processing type field is added.

整列マルチキャストプロトコルデータは先頭の種類フィールドによって、次の３つに使い分けられる。
（１）候補種類
入力順序番号フィールド、ステップ番号フィールド、入力パケットフィールドには、それぞれ、送信者（送信側コンピュータ）の送信時における該当入力順序番号、該当ステップ番号、自候補が格納される。 The ordered multicast protocol data is used in the following three types depending on the type field at the top.
(1) Candidate type The input order number field, the step number field, and the input packet field respectively store a corresponding input order number, a corresponding step number, and a self-candidate at the time of transmission by the sender (transmission side computer).

（２）確定種類
その入力順序番号（入力順序番号フィールドに格納されている入力順序番号）に対応する入力パケットが、送信者の送信時における入力パケットジャーナル記憶部２２にあることを示し、入力パケットフィールドには、その入力パケットが格納される。この場合、ステップ番号フィールドは使用されない。 (2) Determined type Indicates that the input packet corresponding to the input sequence number (the input sequence number stored in the input sequence number field) is in the input packet journal storage unit 22 at the time of transmission by the sender. The input packet is stored in the field. In this case, the step number field is not used.

（３）遅延種類
その入力順序番号に対応する入力パケットが、送信者の送信時における入力パケットジャーナル記憶部２２にないことを示す。この場合、ステップ番号フィールドおよび入力パケットフィールドは使用されない。 (3) Delay type Indicates that there is no input packet corresponding to the input sequence number in the input packet journal storage unit 22 at the time of transmission by the sender. In this case, the step number field and the input packet field are not used.

いずれの種類の整列マルチキャストプロトコルデータにおいても、最大確定入力順序番号フィールドには、送信者（送信側コンピュータ）からの整列マルチキャストプロトコルデータ送信時における該当最大確定入力順序番号が格納される。また、整列マルチキャストプロトコルデータの受信側コンピュータにおける該当最大確定入力順序番号は、当該受信側コンピュータで確定された入力パケットの順序番号と、当該受信側コンピュータで受信された整列マルチキャストプロトコルデータ中の最大確定入力順序番号とのうち、最も大きいものに更新される。 In any type of ordered multicast protocol data, the maximum determined input sequence number field stores the corresponding maximum determined input sequence number at the time of transmitting the ordered multicast protocol data from the sender (sender computer). In addition, the corresponding maximum confirmed input sequence number in the receiving computer of the ordered multicast protocol data is the sequence number of the input packet determined in the receiving computer and the maximum determined in the ordered multicast protocol data received in the receiving computer. The input sequence number is updated to the largest one.

本実施形態では、分散システム１０内のコンピュータ１００-iは、前記特許文献１に記載されたコンピュータと異なり、多重化を構成するコンピュータを、当該システム１０の状況に応じて動的に変更するための新規の構成を含む。即ちコンピュータ１００-iは、図２に示されるように、システム構成管理部６をさらに有する。システム構成管理部６は、分散システム１０の構成を管理・決定する。この分散システム１０の構成の決定は、多重化を構成するコンピュータ間の合意に基づいて行われる。 In the present embodiment, unlike the computer described in Patent Document 1, the computer 100-i in the distributed system 10 dynamically changes the computer constituting the multiplexing according to the situation of the system 10. Includes new configurations. That is, the computer 100-i further includes a system configuration management unit 6 as shown in FIG. The system configuration management unit 6 manages and determines the configuration of the distributed system 10. The determination of the configuration of the distributed system 10 is performed based on an agreement between the computers constituting the multiplexing.

システム構成管理部６は、システム構成記憶部６１、最大確定入力順序番号履歴記憶部６２、システム構成変更検出部６３、システム構成設定部６４、システム構成変更合意プロトコルデータ送受信部６５、およびシステム構成通知プロトコルデータ送受信部６６から構成される。 The system configuration management unit 6 includes a system configuration storage unit 61, a maximum confirmed input sequence number history storage unit 62, a system configuration change detection unit 63, a system configuration change setting unit 64, a system configuration change agreement protocol data transmission / reception unit 65, and a system configuration notification The protocol data transmission / reception unit 66 is configured.

システム構成記憶部６１は、分散システム１０の構成に関するシステム構成情報を格納する。システム構成情報は、分散システム１０を構成するコンピュータと当該コンピュータの状態とを示す情報を含む。コンピュータの状態は、当該コンピュータが稼動している稼動状態（つまり多重化を構成している状態）、或いは待機している待機状態（つまり多重化を構成していない状態）のいずれかである。システム構成情報はさらに、分散システム１０を構成するコンピュータの優先度を示す情報（優先度情報）を含む。この優先度は、稼動状態にあったコンピュータの停滞の検出に応じてフェイルオーバを実施する際に優先的に使用するコンピュータを決定するのに用いられる。 The system configuration storage unit 61 stores system configuration information related to the configuration of the distributed system 10. The system configuration information includes information indicating the computers constituting the distributed system 10 and the state of the computers. The state of the computer is either an operating state in which the computer is operating (that is, a state that configures multiplexing) or a standby state that is waiting (that is, a state that does not configure multiplexing). The system configuration information further includes information (priority information) indicating the priority of the computers constituting the distributed system 10. This priority is used to determine a computer to be preferentially used when a failover is performed in response to detection of a stagnation of a computer in an operating state.

図５は、システム構成記憶部６１のデータ構造例を示す。図５の例では、分散システム１０を構成するコンピュータを示す情報として、コンピュータ名が用いられている。本実施形態において、コンピュータ１００-1〜１００-5のコンピュータ名がコンピュータ＃１〜＃５であるものとすると、図５に示すシステム構成記憶部６１は、分散システム１０がコンピュータ１００-1〜１００-5から構成されていることを示す。 FIG. 5 shows an exemplary data structure of the system configuration storage unit 61. In the example of FIG. 5, a computer name is used as information indicating the computers constituting the distributed system 10. In this embodiment, if the computer names of the computers 100-1 to 100-5 are computers # 1 to # 5, the system configuration storage unit 61 shown in FIG. -5.

また図５に示すシステム構成記憶部６１は、コンピュータ１００-1〜１００-4が稼動状態にあり、コンピュータ１００-5が待機状態にあることを示す。つまり図５に示すシステム構成記憶部６１は、図１の状態の分散システム１０の構成を示している。 The system configuration storage unit 61 shown in FIG. 5 indicates that the computers 100-1 to 100-4 are in an operating state and the computer 100-5 is in a standby state. That is, the system configuration storage unit 61 shown in FIG. 5 shows the configuration of the distributed system 10 in the state shown in FIG.

図５に示すシステム構成記憶部６１はさらに、コンピュータ１００-1，１００-2，１００-3，１００-4，１００-5の優先度が、それぞれ１，２，３，４，５であることを示す。本実施形態では、優先度の値が小さいほど高優先度となる。したがって図５の例では、コンピュータの優先度は、コンピュータ１００-1が最も高く、以下コンピュータ１００-2，１００-3，１００-4，１００-5の順となる。但し、ｎが４、ｍが５である本実施形態では、待機状態にあるコンピュータはコンピュータ１００-5のみであることから、フェイルオーバを実施する際には、当該コンピュータ１００-5が優先度に無関係に使用される。もし、ｍが５よりも大きく、したがって待機状態にあるコンピュータがコンピュータ１００-5を含めて複数存在するならば、フェイルオーバを実施する際には、この複数の待機状態にあるコンピュータのうち最も優先度が高いコンピュータが使用される。 In the system configuration storage unit 61 shown in FIG. 5, the priorities of the computers 100-1, 100-2, 100-3, 100-4, and 100-5 are 1, 2, 3, 4, and 5 respectively. Indicates. In the present embodiment, the lower the priority value, the higher the priority. Therefore, in the example of FIG. 5, the computer 100-1 has the highest priority, and the computers 100-2, 100-3, 100-4, and 100-5 are in the following order. However, in the present embodiment in which n is 4 and m is 5, only the computer 100-5 is in the standby state, and therefore when performing failover, the computer 100-5 is irrelevant to the priority. Used for. If m is larger than 5, and therefore there are a plurality of computers in the standby state including the computer 100-5, the highest priority among the plurality of computers in the standby state when performing failover. A high computer is used.

最大確定入力順序番号履歴記憶部６２は、多重化を構成している（稼動状態にある）各コンピュータの、配送が確定したことが分かっている最大の入力順序番号（最大確定入力順序番号）の履歴を格納する。より詳細には、最大確定入力順序番号履歴記憶部６２は、自コンピュータの最大確定入力順序番号記憶部２７に格納されている最大確定入力順序番号が変更される毎に、その時刻を示すタイムスタンプに加えて、多重化を構成している各コンピュータ（図１の例ではコンピュータ１００-1〜１００-4）の、配送が確定したことがわかっている最大の入力順序番号を、時系列順に格納するのに用いられる。 The maximum confirmed input sequence number history storage unit 62 stores the maximum input sequence number (maximum confirmed input sequence number) for which it is known that the delivery has been confirmed for each computer constituting the multiplexing (in operation state). Store history. More specifically, the maximum confirmed input sequence number history storage unit 62 is a time stamp indicating the time each time the maximum confirmed input sequence number stored in the maximum confirmed input sequence number storage unit 27 of the own computer is changed. In addition, the maximum input sequence number of each computer that constitutes multiplexing (computers 100-1 to 100-4 in the example of FIG. 1) whose delivery has been confirmed is stored in chronological order. Used to do.

図６は最大確定入力順序番号履歴記憶部６２のデータ構造例を示す。図６の例では、ある時点以降のコンピュータ１００-4（＃４）における最大確定入力順序番号が、他のコンピュータ１００-1（＃１）〜１００-3（＃３）よりも小さく、遅延していることが示されている。 FIG. 6 shows an example of the data structure of the maximum confirmed input sequence number history storage unit 62. In the example of FIG. 6, the maximum confirmed input sequence number in the computer 100-4 (# 4) after a certain time is smaller than that of the other computers 100-1 (# 1) to 100-3 (# 3) and is delayed. It is shown that.

システム構成変更検出部６３は、最大確定入力順序番号履歴記憶部６２に格納されている最大確定入力順序番号履歴に基づいて停滞しているコンピュータを検出することにより、システム構成を変更すべきことを検出（決定）する。システム構成変更検出部６３はまた、停滞しているコンピュータを検出した場合、当該検出された停滞しているコンピュータを、他の稼動状態にあるすべてのコンピュータに対してシステム構成変更合意プロトコルデータ送受信部６５を介して報告する。この報告には、報告種類のシステム構成変更合意プロトコルデータが用いられる。 The system configuration change detection unit 63 detects that the computer configuration is stagnant based on the maximum confirmed input sequence number history stored in the maximum confirmed input sequence number history storage unit 62, thereby changing the system configuration. Detect (determine). When the system configuration change detection unit 63 detects a stagnation computer, the system configuration change agreement protocol data transmission / reception unit transmits the detected stagnation computer to all computers in other operating states. Report via 65. For this report, a report type system configuration change agreement protocol data is used.

システム構成変更検出部６３はまた、上述の報告に対して、他の稼動状態にあるコンピュータのうちの（ｎ−ｔ−１）台以上（ｎ＝４、ｔ＝１の本実施形態では２台以上）から、同意する旨がシステム構成変更合意プロトコルデータ送受信部６５を介して返された場合、停滞しているコンピュータの検出に関し、自コンピュータを含めて（ｎ−ｔ）台以上で合意がとれたと判断する。 In response to the above-described report, the system configuration change detection unit 63 also includes (n−t−1) or more (n = 4, t = 1 in this embodiment) among the computers in other operating states. From the above, if an agreement is returned via the system configuration change agreement protocol data transmission / reception unit 65, an agreement can be reached on (nt) or more units including the own computer regarding the detection of the stagnant computer. Judge that

システム構成変更検出部６３はまた、他のコンピュータから停滞しているコンピュータが報告された場合、最大確定入力順序番号履歴記憶部６２に格納されている最大確定入力順序番号履歴に基づいて、その報告に同意するかを判定する。システム構成変更検出部６３は、この判定結果（つまり、同意するか或いは非同意とするか）を、停滞しているコンピュータを報告したコンピュータに通知する。この通知には、同意種類または非同意種類のシステム構成変更合意プロトコルデータが用いられる。 The system configuration change detection unit 63 also reports the report based on the maximum confirmed input sequence number history stored in the maximum confirmed input sequence number history storage unit 62 when a stagnated computer is reported from another computer. Determine whether you agree with. The system configuration change detection unit 63 notifies this determination result (that is, whether to agree or disagree) to the computer that reported the stagnating computer. This notification uses agreement type or non-agreement type system configuration change agreement protocol data.

システム構成変更検出部６３はまた、停滞しているコンピュータの検出に関し、自コンピュータを含めて（ｎ−ｔ）台以上で合意がとれたことにより、停滞しているコンピュータの検出を確認する。システム構成変更検出部６３はさらに、停滞しているコンピュータの検出の確認に応じて、確認された停滞しているコンピュータを待機状態とし、待機状態にあるコンピュータを稼動状態とするように分散システムの構成の変更を要求するためのシステム構成変更要求パケットを、入力パケットとして受付キュー１に渡す。 The system configuration change detection unit 63 also confirms the detection of a stagnant computer when an agreement has been reached on (nt) or more units including its own computer regarding the detection of the stagnant computer. Further, the system configuration change detection unit 63 sets the confirmed stagnated computer in a standby state and sets the computer in the standby state in an operating state in response to confirmation of detection of the stagnating computer. A system configuration change request packet for requesting a configuration change is passed to the reception queue 1 as an input packet.

システム構成設定部６４は、システム構成変更要求パケットについて合意部２６２によって合意された結果、つまりシステム構成変更要求パケットが合意部２６２によって確定された結果、当該システム構成変更要求パケットが合意部２６２から渡された場合、当該システム構成変更要求パケットで要求された分散システムの構成の変更を行う。 The system configuration setting unit 64 sends the system configuration change request packet from the agreement unit 262 as a result of the agreement of the system configuration change request packet by the agreement unit 262, that is, as a result of the system configuration change request packet being confirmed by the agreement unit 262. If so, the configuration of the distributed system requested by the system configuration change request packet is changed.

システム構成変更合意プロトコルデータ送受信部６５は、停滞しているコンピュータがシステム構成変更検出部６３によって検出された際に、他のコンピュータと合意を形成するためのシステム構成変更合意プロトコルデータの通信を行う。 The system configuration change agreement protocol data transmission / reception unit 65 communicates system configuration change agreement protocol data for forming an agreement with another computer when a stagnant computer is detected by the system configuration change detection unit 63. .

図７は、システム構成変更合意プロトコルデータ送受信部６５によって送受信されるシステム構成変更合意プロトコルデータのレイアウトを示す図である。図７に示されるように、システム構成変更合意プロトコルデータは、種類、送信者、提案者、提案時タイムスタンプ、および停滞確認対象コンピュータ名（停滞している疑いをかけられているコンピュータ名）の各フィールドを含んでいる。停滞確認対象コンピュータ名フィールドは、整列マルチキャストプロトコルデータの入力パケットフィールドに相当する。つまり本実施形態では、停滞確認対象コンピュータ名フィールドの内容は、入力パケットとして扱われる。 FIG. 7 is a diagram showing a layout of system configuration change agreement protocol data transmitted / received by the system configuration change agreement protocol data transmission / reception unit 65. As shown in FIG. 7, the system configuration change agreement protocol data includes the type, sender, proposer, proposal time stamp, and stagnation confirmation target computer name (the name of the computer suspected of stagnation). Contains each field. The stagnation confirmation target computer name field corresponds to the input packet field of the ordered multicast protocol data. That is, in this embodiment, the contents of the stagnation confirmation target computer name field are treated as an input packet.

システム構成変更合意プロトコルデータは、先頭の種類フィールドによって次の３つに使い分けられる。
（１）報告種類
提案者フィールドで示される提案者（コンピュータ）が、定常的に停滞している他のコンピュータを検出したことを示す。送信者フィールド、および提案時タイムスタンプフィールドには、それぞれ、報告種類を示す種類フィールドを含むシステム構成変更合意プロトコルデータ（つまり報告種類のシステム構成変更合意プロトコルデータ）の送信者（送信側コンピュータ）を示すコンピュータ名、および停滞しているコンピュータを検出した際のタイムスタンプ（提案時のタイムスタンプ）が格納される。停滞確認対象コンピュータ名フィールドには、停滞の検出に関して確認が必要なコンピュータ（つまり停滞確認の対象となるコンピュータ）のコンピュータ名が格納される。送信者フィールドの示す送信者は、提案者フィールドの示す提案者に一致する。 The system configuration change agreement protocol data is divided into the following three types according to the type field at the top.
(1) Report type This indicates that the proposer (computer) indicated in the proposer field has detected another computer that is constantly stagnating. In the sender field and the proposal time stamp field, the sender (sender computer) of the system configuration change agreement protocol data (that is, the report type system configuration change agreement protocol data) including the type field indicating the report type, respectively. The computer name to be shown and the time stamp when the stagnating computer is detected (time stamp at the time of proposal) are stored. In the stagnation confirmation target computer name field, the computer name of the computer that needs to be confirmed regarding the detection of stagnation (that is, the computer subject to stagnation confirmation) is stored. The sender indicated by the sender field matches the proposer indicated by the proposer field.

（２）同意種類
報告種類のシステム構成変更合意プロトコルデータを受信したコンピュータが、同等な停滞を検出したことを示す。送信者フィールドには、同意種類を示す種類フィールドを含むシステム構成変更合意プロトコルデータ（つまり同意種類のシステム構成変更合意プロトコルデータ）の送信者を示すコンピュータ名が格納される。提案者フィールド、提案時タイムスタンプフィールド、および停滞確認対象コンピュータ名フィールドには、それぞれ報告種類のシステム構成変更合意プロトコルデータの対応するフィールドの内容のコピーが用いられる。 (2) Consent type Indicates that the computer that received the report type system configuration change agreement protocol data has detected an equivalent stagnation. The sender field stores the computer name indicating the sender of the system configuration change agreement protocol data (that is, the agreement type system configuration change agreement protocol data) including the type field indicating the agreement type. For the proposer field, the proposal time stamp field, and the stagnation confirmation target computer name field, copies of the contents of the corresponding fields of the report type system configuration change agreement protocol data are used.

（３）非同意種類
報告種類のシステム構成変更合意プロトコルデータを受信したコンピュータが、同等な停滞を検出していないことを示す。送信者フィールドには、非同意種類を示す種類フィールドを含むシステム構成変更合意プロトコルデータ（つまり非同意種類のシステム構成変更合意プロトコルデータ）の送信者を示すコンピュータ名が格納される。提案者フィールド、提案時タイムスタンプフィールド、および停滞確認対象コンピュータ名フィールドには、それぞれ報告種類のシステム構成変更合意プロトコルデータの対応するフィールドの内容のコピーが用いられる。 (3) Disagreement type Indicates that the computer that received the report type system configuration change agreement protocol data has not detected an equivalent stagnation. The sender field stores the computer name indicating the sender of the system configuration change agreement protocol data including the type field indicating the non-agreement type (that is, the system configuration change agreement protocol data of the non-agreement type). For the proposer field, the proposal time stamp field, and the stagnation confirmation target computer name field, copies of the contents of the corresponding fields of the report type system configuration change agreement protocol data are used.

システム構成通知プロトコルデータ送受信部６６は、システム構成を変更すべきことがシステム構成変更検出部６３によって検出されたタイミングで、待機状態にあったコンピュータを稼動状態にするためのシステム構成通知プロトコルデータを送受信する。システム構成通知プロトコルデータは、システム構成記憶部６１に格納されているデータ（システム構成情報）それ自体（つまりシステム構成記憶部６１のダンプ情報）を含む。 The system configuration notification protocol data transmission / reception unit 66 receives system configuration notification protocol data for putting a computer in a standby state into an operating state at a timing when the system configuration change detection unit 63 detects that the system configuration should be changed. Send and receive. The system configuration notification protocol data includes data (system configuration information) stored in the system configuration storage unit 61 itself (that is, dump information of the system configuration storage unit 61).

次に、分散システム１０内で稼動状態にあるコンピュータの動作原理について具体的に説明する。
（初期化処理）
まず、コンピュータ１００-1〜１００-5のシステム構成記憶部６１は、分散システム１０の初期状態を示す情報を格納する。図５は、このときのシステム構成記憶部６１のデータ構造例を示しているものとする。ここでは、コンピュータ１００-1〜１００-4が稼動状態にあり、コンピュータ１００-5が待機状態にある。 Next, the operation principle of the computer in the operating state in the distributed system 10 will be specifically described.
(Initialization process)
First, the system configuration storage unit 61 of the computers 100-1 to 100-5 stores information indicating the initial state of the distributed system 10. FIG. 5 shows an example of the data structure of the system configuration storage unit 61 at this time. Here, the computers 100-1 to 100-4 are in an operating state, and the computer 100-5 is in a standby state.

稼動状態にあるコンピュータ１００-j（ｊ＝１〜４）の入力順序番号記憶部２１は、初期状態の入力順序番号、つまり初期入力順序番号（例えば１）を格納する。コンピュータ１００-jの入力パケットジャーナル記憶部２２および候補パケット記憶部２５は空の状態にある。 The input sequence number storage unit 21 of the computer 100-j (j = 1 to 4) in the operating state stores the input sequence number in the initial state, that is, the initial input sequence number (for example, 1). The input packet journal storage unit 22 and the candidate packet storage unit 25 of the computer 100-j are empty.

コンピュータ１００-jのステップ番号記憶部２４は初期ステップ番号（例えば１）を記憶し、最大確定入力順序番号記憶部２７は初期入力順序番号を格納する。コンピュータ１００-jの遅延記憶部２８の（ｎ−１）個のフラグ（ここでは３個のフラグ）はすべてリセットされており、最大確定入力順序番号履歴記憶部６２は現在のタイムスタンプとコンピュータ１００-1〜１００-4の各々の初期入力順序番号を記憶する。 The step number storage unit 24 of the computer 100-j stores an initial step number (for example, 1), and the maximum determined input sequence number storage unit 27 stores the initial input sequence number. All the (n-1) flags (here, three flags) in the delay storage unit 28 of the computer 100-j are reset, and the maximum confirmed input sequence number history storage unit 62 stores the current time stamp and the computer 100. The initial input sequence number for each of -1 to 100-4 is stored.

コンピュータ１００-jの整列マルチキャスト部２が実行する整列マルチキャストによって各コンピュータへ配送される入力パケットを決定する処理は、アルゴリズム１〜９により実現される。 The process of determining the input packet delivered to each computer by the ordered multicast executed by the ordered multicast unit 2 of the computer 100-j is realized by algorithms 1-9.

まず、アルゴリズム１〜４について、図８および図９を参照して説明する。図８および図９は、整列マルチキャストの１回の配送を行う基本的な部分の動作手順を示すフローチャートである。図８のフローチャート（第１のフローチャート）のステップＡ１はアルゴリズム１〜３全体を表し、残りのステップＡ２〜Ａ１３はアルゴリズム４を表す。図９のフローチャート（第２のフローチャート）のステップＢ１〜Ｂ４はアルゴリズム１を、ステップＢ５〜Ｂ８はアルゴリズム２を、そしてステップＢ９，Ｂ１０はアルゴリズム３を、それぞれ示す。 First, algorithms 1 to 4 will be described with reference to FIGS. 8 and 9. FIG. 8 and FIG. 9 are flowcharts showing an operation procedure of a basic part that performs one delivery of ordered multicast. Step A1 in the flowchart of FIG. 8 (first flowchart) represents the entire algorithms 1 to 3, and the remaining steps A2 to A13 represent the algorithm 4. In the flowchart of FIG. 9 (second flowchart), steps B1 to B4 indicate algorithm 1, steps B5 to B8 indicate algorithm 2, and steps B9 and B10 indicate algorithm 3.

（アルゴリズム１）
整列マルチキャスト部２内の入力パケット確定判定部２６に含まれる入力候補収集部２６１は、コンピュータ１００-1〜１００-4（ｎ＝４）がそれぞれ次に処理する候補（入力候補）として選択した入力パケット（入力データ）を収集するための候補一覧作成処理（図８のステップＡ１）の一部（図９のステップＢ１〜Ｂ４）を次のように実行する。 (Algorithm 1)
The input candidate collection unit 261 included in the input packet confirmation determination unit 26 in the ordered multicast unit 2 is an input selected as a candidate (input candidate) to be processed next by each of the computers 100-1 to 100-4 (n = 4). A part (steps B1 to B4 in FIG. 9) of the candidate list creation process (step A1 in FIG. 8) for collecting packets (input data) is executed as follows.

入力候補収集部２６１は、ステップ番号記憶部２４に格納されている該当ステップ番号が初期値であるときは（図９のステップＢ１のＹＥＳ）、受付キュー１に入力パケットが存在するかを判定する（図９のステップＢ２）。 When the corresponding step number stored in the step number storage unit 24 is an initial value (YES in step B1 in FIG. 9), the input candidate collection unit 261 determines whether an input packet exists in the reception queue 1 (Step B2 in FIG. 9).

もし、入力パケットが存在するならば（図９のステップＢ２のＹＥＳ）、入力候補収集部２６１はステップ番号記憶部２４に格納されている該当ステップ番号を次に進める（図９のステップＢ３）。そして入力候補収集部２６１は、入力パケットを自候補として候補パケット記憶部２５に格納し、かつ、この自候補が入力パケットフィールドに設定された候補種類のプロトコルデータ（整列マルチキャストプロトコルデータ）を整列マルチキャストプロトコルデータ送受信部２３によりネットワークＢを介して他のすべてのコンピュータ（つまりコンピュータ１００-1〜１００-4のうち、コンピュータ１００-jを除くすべてのコンピュータ）に送信させる（図９のステップＢ４）。このステップＢ４において、入力候補収集部２６１は、候補パケット記憶部２５内のすべての他候補を空にする。 If there is an input packet (YES in step B2 in FIG. 9), the input candidate collection unit 261 advances the corresponding step number stored in the step number storage unit 24 (step B3 in FIG. 9). Then, the input candidate collection unit 261 stores the input packet as a self-candidate in the candidate packet storage unit 25, and sorts the candidate type protocol data (aligned multicast protocol data) in which the self-candidate is set in the input packet field. The protocol data transmitting / receiving unit 23 transmits the data to all other computers (that is, all computers except the computer 100-j among the computers 100-1 to 100-4) via the network B (step B4 in FIG. 9). In step B4, the input candidate collection unit 261 empties all other candidates in the candidate packet storage unit 25.

（アルゴリズム２）
一方、該当ステップ番号が初期値でないか（図９のステップＢ１のＮＯ）、または受付キュー１に入力パケットがないとき（図９のステップＢ２のＮＯ）、入力候補収集部２６１は、入力順序番号記憶部２１に格納されている該当入力順序番号に一致する入力順序番号（が設定された入力順序番号フィールド）を持つ候補種類の整列マルチキャストプロトコルデータが整列マルチキャストプロトコルデータ送受信部２３によって受信されているかを判定する（図９のステップＢ５）。もし、受信されているならば（図９のステップＢ５のＹＥＳ）、入力候補収集部２６１は、受信されている整列マルチキャストプロトコルデータ（受信整列マルチキャストプロトコルデータ）内の（ステップ番号フィールドに設定されている）ステップ番号は該当ステップ番号よりも大きいかを判定する（図９のステップＢ６）。 (Algorithm 2)
On the other hand, when the corresponding step number is not the initial value (NO in step B1 in FIG. 9) or there is no input packet in the reception queue 1 (NO in step B2 in FIG. 9), the input candidate collection unit 261 receives the input sequence number. Whether the ordered multicast protocol data transmitting / receiving unit 23 receives candidate type of ordered multicast protocol data having an input order number (input order number field in which the input order number matches the corresponding input order number stored in the storage unit 21). Is determined (step B5 in FIG. 9). If it is received (YES in step B5 in FIG. 9), the input candidate collection unit 261 is set in the (step number field) in the received ordered multicast protocol data (received ordered multicast protocol data). It is determined whether the step number is larger than the corresponding step number (step B6 in FIG. 9).

もし、受信整列マルチキャストプロトコルデータ内のステップ番号が該当ステップ番号よりも大きいならば（図９のステップＢ６のＹＥＳ）、入力候補収集部２６１は、ステップ番号記憶部２４に格納されている該当ステップ番号を受信整列マルチキャストプロトコルデータ内のステップ番号に更新する（図９のステップＢ７）。 If the step number in the reception aligned multicast protocol data is larger than the corresponding step number (YES in step B6 in FIG. 9), the input candidate collection unit 261 stores the corresponding step number stored in the step number storage unit 24. Is updated to the step number in the reception aligned multicast protocol data (step B7 in FIG. 9).

次に入力候補収集部２６１は、受信整列マルチキャストプロトコルデータ内の（入力パケットフィールドに設定されている）入力パケットを自候補として候補パケット記憶部２５に格納し、かつ、この自候補が入力パケットフィールドに設定された（つまり受信整列マルチキャストプロトコルデータ内の入力パケットフィールドのコピーを含む）候補種類の整列マルチキャストプロトコルデータを整列マルチキャストプロトコルデータ送受信部２３によりネットワークＢを介して他のすべてのコンピュータに送信させる（図９のステップＢ８）。このステップＢ８において入力候補収集部２６１は、受信整列マルチキャストプロトコルデータ内の入力パケット（ここでは自候補とされた入力パケット）を、当該受信整列マルチキャストプロトコルデータの（送信者フィールドの示す）送信者に対応する他候補として、候補パケット記憶部２５に格納する。つまりステップＢ８では、受信整列マルチキャストプロトコルデータ中の入力パケットが、自候補として設定されるとともに、当該受信整列マルチキャストプロトコルデータの送信者に対応する他候補としても設定される。このとき入力候補収集部２６１は、候補パケット記憶部２５内の、受信整列マルチキャストプロトコルデータの送信者に対応する他候補以外の他候補をすべて破棄する（空にする）。 Next, the input candidate collection unit 261 stores the input packet (set in the input packet field) in the received aligned multicast protocol data as a self-candidate in the candidate packet storage unit 25, and the self-candidate is stored in the input packet field. Aligned multicast protocol data of a candidate type set to (that is, including a copy of the input packet field in the received aligned multicast protocol data) is transmitted to all other computers via the network B by the aligned multicast protocol data transmitting / receiving unit 23 (Step B8 in FIG. 9). In this step B8, the input candidate collection unit 261 sends the input packet in the received aligned multicast protocol data (in this case, the input packet determined as its own candidate) to the sender (indicated by the sender field) of the received aligned multicast protocol data. The candidate packet storage unit 25 stores the corresponding other candidate. That is, in step B8, the input packet in the received aligned multicast protocol data is set as its own candidate and also set as another candidate corresponding to the sender of the received aligned multicast protocol data. At this time, the input candidate collection unit 261 discards (empties) all other candidates in the candidate packet storage unit 25 other than the other candidates corresponding to the sender of the received aligned multicast protocol data.

（アルゴリズム３）
これに対し、受信整列マルチキャストプロトコルデータ内のステップ番号と該当ステップ番号とが等しいならば（図９のステップＢ６のＮＯ，ステップＢ９のＹＥＳ）、入力候補収集部２６１は、受信整列マルチキャストプロトコルデータ内の入力パケットを、当該受信整列マルチキャストプロトコルデータの送信者に対応する他候補として候補パケット記憶部２５に格納する（図９のステップＢ１０）。 (Algorithm 3)
On the other hand, if the step number in the reception aligned multicast protocol data is equal to the corresponding step number (NO in step B6 in FIG. 9, YES in step B9), the input candidate collection unit 261 includes the received alignment multicast protocol data. Are stored in the candidate packet storage unit 25 as other candidates corresponding to the sender of the received aligned multicast protocol data (step B10 in FIG. 9).

入力候補収集部２６１は、ステップＢ６またはステップＢ１０を実行すると、候補パケット記憶部２５に格納された候補数が（ｎ−ｔ）個以上（ｎ＝４、ｔ＝１の例では３個以上）になったかを判定する（図９のステップＢ１１）。 When the input candidate collection unit 261 executes Step B6 or Step B10, the number of candidates stored in the candidate packet storage unit 25 is (nt) or more (in the example of n = 4, t = 1, 3 or more). (Step B11 in FIG. 9).

入力候補収集部２６１は、候補数が（ｎ−ｔ）個以上になっていないならば（図９のステップＢ１１のＮＯ）、ステップＢ１からの処理を再び実行する。これに対し、候補数が（ｎ−ｔ）個以上になっているならば（図９のステップＢ１１のＹＥＳ）、入力候補収集部２６１は候補一覧作成処理を終了する。なお、ステップＢ５またはステップＢ９の判定がＮＯの場合にも、ステップＢ１からの処理が再び実行される。 If the number of candidates does not reach (nt) or more (NO in step B11 in FIG. 9), the input candidate collection unit 261 executes the processing from step B1 again. On the other hand, if the number of candidates is (nt) or more (YES in step B11 in FIG. 9), the input candidate collection unit 261 ends the candidate list creation process. Even when the determination in step B5 or step B9 is NO, the processing from step B1 is executed again.

（アルゴリズム４）
候補一覧作成処理（図８のステップＡ１）が終了すると、即ち候補パケット記憶部２５に格納された候補（入力候補）数（空でない候補数）が（ｎ−ｔ）個以上になると、入力パケット確定判定部２６内の合意部２６２は第１の入力候補選定制御手段として機能して、当該候補パケット記憶部２５に（ｎ−ｔ）個以上の同一の候補が存在するか、つまり（ｎ−ｔ）台以上のコンピュータで合意がとられた（合意が形成された）候補が存在するかを判定する（図８のステップＡ２）。 (Algorithm 4)
When the candidate list creation process (step A1 in FIG. 8) is completed, that is, when the number of candidates (input candidates) (number of non-empty candidates) stored in the candidate packet storage unit 25 becomes (nt) or more, the input packet The agreement unit 262 in the decision determination unit 26 functions as a first input candidate selection control unit, and whether or not (nt) or more identical candidates exist in the candidate packet storage unit 25, that is, (n− t) It is determined whether or not there is a candidate for which an agreement has been reached (an agreement has been formed) on more than one computer (step A2 in FIG. 8).

もし、（ｎ−ｔ）個以上の同一（同一内容）の候補が存在するならば（図８のステップＡ２のＹＥＳ）、合意部２６２は、その候補を該当入力順序番号における入力パケットとして確定する（図８のステップＡ３）。このステップＡ３において合意部２６２は、最大確定入力順序番号記憶部２７に格納されている該当最大確定入力順序番号を該当入力順序番号に更新する。また、ステップＡ３において合意部２６２は、確定された入力パケットが受付キュー１に存在するならば、当該入力パケットを受付キュー１から削除する。 If there are (nt) or more (same content) candidates (YES in step A2 in FIG. 8), the agreement unit 262 determines the candidates as input packets in the corresponding input sequence number. (Step A3 in FIG. 8). In step A 3, the agreement unit 262 updates the corresponding maximum confirmed input sequence number stored in the maximum confirmed input sequence number storage unit 27 to the corresponding input sequence number. In step A 3, if the confirmed input packet exists in the reception queue 1, the agreement unit 262 deletes the input packet from the reception queue 1.

このように、該当入力順序番号における入力パケットが確定すると、つまり該当入力順序番号における入力パケットに関して合意がとられて、整列マルチキャストが確定すると、合意部２６２は図８のステップＡ４に進む。このステップＡ４において合意部２６２は、候補出力先切り替え手段として機能して、確定された入力パケットに付されている処理種別情報の示す処理種別が「構成」であるか否（「アプリケーション」である）か、つまり確定された入力パケットがシステム構成変更を要求するシステム構成変更要求パケットであるかを判定する。システム構成変更要求パケットの詳細については、後述するアルゴリズム１２の処理の中で説明する。 As described above, when the input packet at the corresponding input sequence number is confirmed, that is, when the input packet at the corresponding input sequence number is agreed and the ordered multicast is confirmed, the agreement unit 262 proceeds to step A4 in FIG. In step A4, the agreement unit 262 functions as a candidate output destination switching unit, and whether or not the processing type indicated by the processing type information attached to the determined input packet is “configuration” (“application”). In other words, it is determined whether the determined input packet is a system configuration change request packet for requesting a system configuration change. Details of the system configuration change request packet will be described in the processing of the algorithm 12 described later.

もし、処理種別が「構成」であるならば、つまり確定された入力パケットがシステム構成変更要求パケットであるならば（図８のステップＡ４のＹＥＳ）、合意部２６２は、確定された入力パケットをシステム構成管理部６内のシステム構成設定部６４に渡すとともに、当該入力パケットを入力パケットジャーナル記憶部２２に格納する（図８のステップＡ５）。 If the processing type is “configuration”, that is, if the confirmed input packet is a system configuration change request packet (YES in step A4 in FIG. 8), the agreement unit 262 determines the confirmed input packet as The input packet is transferred to the system configuration setting unit 64 in the system configuration management unit 6 and the input packet is stored in the input packet journal storage unit 22 (step A5 in FIG. 8).

システム構成設定部６４は、合意部２６２から確定された入力パケットを受け取ると、当該パケットがコンピュータ１００-j自身（自コンピュータ）を待機状態に変更することを要求するシステム構成変更要求パケットであるかを判定する（図８のステップＡ６）。もし、自コンピュータを待機状態に変更することを要求するシステム構成変更要求パケットである場合（図８のステップＡ６のＹＥＳ）、システム構成設定部６４は上述の初期化処理を実施して自コンピュータを待機状態にする（図８のステップＡ７）。待機状態となったコンピュータのシステム構成管理部６は、稼動状態にあるコンピュータから、待機状態から稼動状態への変更を通知するシステム構成通知プロトコルデータが送られるのを待つ。一方、待機状態となったコンピュータの整列マルチキャスト部２は、例えば省電力化のために動作を停止する。 When the system configuration setting unit 64 receives the input packet determined from the agreement unit 262, is the system configuration change request packet requesting that the packet change the computer 100-j itself (own computer) to a standby state? Is determined (step A6 in FIG. 8). If it is a system configuration change request packet for requesting that the own computer be changed to a standby state (YES in step A6 in FIG. 8), the system configuration setting unit 64 performs the initialization process described above to A standby state is set (step A7 in FIG. 8). The system configuration management unit 6 of the computer in the standby state waits for the system configuration notification protocol data for notifying the change from the standby state to the operating state from the computer in the operating state. On the other hand, the aligned multicast unit 2 of the computer that has entered the standby state stops its operation to save power, for example.

これに対し、確定された入力パケットが、他のコンピュータを待機状態に変更することを要求するシステム構成変更要求パケットである場合（図８のステップＡ６のＮＯ）、システム構成設定部６４は、システム構成記憶部６１に格納されているシステム構成情報を、変更後のシステム構成を示すように更新するとともに、その更新後のシステム構成情報を含むシステム構成通知プロトコルデータ（つまり変更後のシステム構成を通知するためのシステム構成通知プロトコルデータ）をシステム構成通知プロトコルデータ送受信部６６により、分散システムを構成する他のすべてのコンピュータに送信させる（図８のステップＡ８）。この場合、合意部２６２は、次工程へ移行すべく、入力順序番号記憶部２１に格納されている該当入力順序番号を次に進め（１インクリメントし）、ステップ番号記憶部２４に格納されている該当ステップ番号を初期化する（図８のステップＡ９）。このステップＡ９において合意部２６２は、候補パケット記憶部２５に格納されているすべての候補を破棄し（空にし）、遅延記憶部２８に格納されている（ｎ−１）個の遅延フラグをすべてリセットする。 On the other hand, when the determined input packet is a system configuration change request packet requesting that another computer be changed to a standby state (NO in step A6 in FIG. 8), the system configuration setting unit 64 The system configuration information stored in the configuration storage unit 61 is updated to indicate the system configuration after the change, and system configuration notification protocol data including the updated system configuration information (that is, the system configuration after the change is notified) System configuration notification protocol data) is transmitted to all other computers constituting the distributed system by the system configuration notification protocol data transmitting / receiving unit 66 (step A8 in FIG. 8). In this case, the agreement unit 262 advances the corresponding input sequence number stored in the input sequence number storage unit 21 to the next step (increment by 1), and is stored in the step number storage unit 24. The corresponding step number is initialized (step A9 in FIG. 8). In step A9, the agreement unit 262 discards (empties) all candidates stored in the candidate packet storage unit 25, and sets all (n−1) delay flags stored in the delay storage unit 28. Reset.

一方、処理種別が「構成」でないならば、つまり確定された入力パケットがシステム構成変更要求パケットでない場合（図８のステップＡ４のＮＯ）、合意部２６２は、当該入力パケットをアプリケーションプログラム３に配送するとともに、当該入力パケットを入力パケットジャーナル記憶部２２に格納する（図８のステップＡ１０）。ここでは、確定された入力パケットは、現在入力パケットジャーナル記憶部２２に格納されている入力パケットの列の後ろに位置するように格納される。なお、入力パケットジャーナル記憶部２２に格納されている入力パケットの列が一定の量に達しているならば、先頭の入力パケット（つまり最も古い入力パケット）が破棄される。その後、合意部２６２は、次工程へ移行すべく、上記ステップＡ９を実行する。 On the other hand, if the processing type is not “configuration”, that is, if the determined input packet is not a system configuration change request packet (NO in step A4 in FIG. 8), the agreement unit 262 delivers the input packet to the application program 3 At the same time, the input packet is stored in the input packet journal storage unit 22 (step A10 in FIG. 8). Here, the determined input packet is stored so as to be positioned behind the column of input packets currently stored in the input packet journal storage unit 22. If the input packet sequence stored in the input packet journal storage unit 22 reaches a certain amount, the first input packet (that is, the oldest input packet) is discarded. Thereafter, the agreement unit 262 executes Step A9 to shift to the next process.

一方、（ｎ−ｔ）個以上の同一の候補が存在しなかった場合（図８のステップＡ２のＮＯ）、合意部２６２は第２の入力候補選定制御手段として機能して、今度は、候補パケット記憶部２５に過半数以上の同一の候補が存在するかを判定する（図８のステップＡ１１）。もし、過半数以上の同一の候補が存在するならば（図８のステップＡ１１のＹＥＳ）、合意部２６２は、その候補を選択して自候補として候補パケット記憶部２５に格納し（つまり、その候補を候補パケット記憶部２５内の自候補とし）、かつ、この自候補が入力パケットフィールドに設定された候補種類のプロトコルデータ（整列マルチキャストプロトコルデータ）を整列マルチキャストプロトコルデータ送受信部２３によりネットワークＢを介して他のすべてのコンピュータに送信させる（図８のステップＡ１２）。このステップＡ１２において合意部２６２は、候補パケット記憶部２５に格納されているすべての他候補を破棄する。 On the other hand, when there are not (nt) or more identical candidates (NO in step A2 in FIG. 8), the agreement unit 262 functions as a second input candidate selection control unit, and this time, candidates It is determined whether or not a majority of the same candidates exist in the packet storage unit 25 (step A11 in FIG. 8). If more than half of the same candidates exist (YES in step A11 in FIG. 8), the agreement unit 262 selects the candidate and stores it as a candidate in the candidate packet storage unit 25 (that is, the candidate). And the candidate type protocol data (aligned multicast protocol data) in which the self candidate is set in the input packet field via the network B by the aligned multicast protocol data transmitting / receiving unit 23. To all other computers (step A12 in FIG. 8). In step A12, the agreement unit 262 discards all other candidates stored in the candidate packet storage unit 25.

これに対し、過半数以上の同一の候補が存在しないならば（図８のステップＡ１１のＮＯ）、合意部２６２は第３の入力候補選定制御手段として機能して、候補パケット記憶部２５に格納されている入力候補の中からランダムに候補（入力パケット）を自候補として選択し、かつ、この自候補が入力パケットフィールドに設定された候補種類のプロトコルデータ（整列マルチキャストプロトコルデータ）を整列マルチキャストプロトコルデータ送受信部２３によりネットワークＢを介して他のすべてのコンピュータに送信させる（図８のステップＡ１３）。このステップＡ１３において合意部２６２は、候補パケット記憶部２５に格納されているすべての他候補を破棄する。 On the other hand, if more than a majority of the same candidates do not exist (NO in step A11 in FIG. 8), the agreement unit 262 functions as a third input candidate selection control unit and is stored in the candidate packet storage unit 25. A candidate (input packet) is randomly selected from among the input candidates, and the protocol data of the candidate type (aligned multicast protocol data) in which the self candidate is set in the input packet field is aligned multicast protocol data The transmission / reception unit 23 transmits the data to all other computers via the network B (step A13 in FIG. 8). In step A13, the agreement unit 262 discards all other candidates stored in the candidate packet storage unit 25.

ステップＡ１２（第２の入力候補選定制御手段としての動作）またはステップＡ１３（第３の入力候補選定制御手段としての動作）が終了すると、ステップＡ１からの処理が再び実行される。一方、ステップＡ７またはステップＡ９が終了すると、整列マルチキャストの１回の配送処理が終了する。 When step A12 (operation as the second input candidate selection control means) or step A13 (operation as the third input candidate selection control means) is completed, the processing from step A1 is executed again. On the other hand, when step A7 or step A9 ends, one delivery process of the ordered multicast ends.

以上の手順で、多重化を構成する各コンピュータ１００-j、つまりコンピュータ１００-1〜１００-4は、（ｎ−ｔ）台（ここでは３台）以上のコンピュータでの入力パケットの一致を確認しながら処理を進めていく。 With the above procedure, each computer 100-j constituting the multiplexing, that is, the computers 100-1 to 100-4, confirms the coincidence of input packets in (nt) computers (three computers in this case). While proceeding.

次に、アルゴリズム５〜９について、図１０乃至図１３を参照して説明する。図１０乃至図１３は、多重化実行の遅延を解消するための処理の手順を示すフローチャートである。図１０のフローチャート（第１のフローチャート）のステップＣ１，Ｃ２はアルゴリズム５を表し、ステップＣ３はアルゴリズム７を表す。図１１のフローチャート（第２のフローチャート）はアルゴリズム６を、図１２のフローチャート（第３のフローチャート）はアルゴリズム８を、そして図１３のフローチャート（第１のフローチャート）はアルゴリズム９を、それぞれ示す。 Next, algorithms 5 to 9 will be described with reference to FIGS. 10 to 13. FIG. 10 to FIG. 13 are flowcharts showing the procedure of processing for eliminating the delay of multiplexing execution. Steps C1 and C2 in the flowchart of FIG. 10 (first flowchart) represent algorithm 5, and step C3 represents algorithm 7. The flowchart of FIG. 11 (second flowchart) shows the algorithm 6, the flowchart of FIG. 12 (third flowchart) shows the algorithm 8, and the flowchart of FIG. 13 (first flowchart) shows the algorithm 9.

（アルゴリズム５）
整列マルチキャスト部２内の合意部２６２は、該当入力順序番号より小さい入力順序番号を持つ候補種類の整列マルチキャストプロトコルデータが整列マルチキャストプロトコルデータ送受信部２３によって受信された場合に、その入力順序番号に対応する入力パケットが入力パケットジャーナル記憶部２２に存在するかを判定する（図１０のステップＣ１）。 (Algorithm 5)
The agreement unit 262 in the ordered multicast unit 2 responds to the input sequence number when the ordered multicast protocol data transmitting / receiving unit 23 receives candidate type of ordered multicast protocol data having an input sequence number smaller than the corresponding input sequence number. It is determined whether or not an input packet to be present exists in the input packet journal storage unit 22 (step C1 in FIG. 10).

もし、該当入力順序番号より小さい入力順序番号に対応する入力パケットが入力パケットジャーナル記憶部２２に存在するならば（図１０のステップＣ１のＹＥＳ）、つまり短い多重化実行の遅延のために、該当入力順序番号より小さい入力順序番号に対応する入力パケットが入力パケットジャーナル記憶部２２に残されているならば、合意部２６２は、その入力パケットが入力パケットフィールドに設定された確定種類の整列マルチキャストプロトコルデータを整列マルチキャストプロトコルデータ送受信部２３によりネットワークＢを介して受信整列マルチキャストプロトコルデータの送信者に返送させる（図１０のステップＣ２）。 If an input packet corresponding to an input sequence number smaller than the corresponding input sequence number exists in the input packet journal storage unit 22 (YES in step C1 in FIG. 10), that is, because of a short multiplexing execution delay, If an input packet corresponding to an input sequence number smaller than the input sequence number is left in the input packet journal storage unit 22, the agreement unit 262 determines the type of ordered multicast protocol in which the input packet is set in the input packet field. The ordered multicast protocol data transmitting / receiving unit 23 returns the data to the sender of the received ordered multicast protocol data via the network B (step C2 in FIG. 10).

（アルゴリズム６）
また、合意部２６２は、該当入力順序番号に一致する入力順序番号を持つ確定種類の整列マルチキャストプロトコルデータが整列マルチキャストプロトコルデータ送受信部２３によって受信された場合には、その受信整列マルチキャストプロトコルデータ内の入力パケットを入力パケットとして確定する（図１１のステップＤ１）。このステップＤ１において合意部２６２は、確定された入力パケットが受付キュー１に存在するならば、当該入力パケットを受付キュー１から削除する。またステップＤ１では、受信整列マルチキャストプロトコルデータ内の最大確定入力順序番号が、最大確定入力順序番号記憶部２７に格納されている該当最大確定入力順序番号よりも大きいならば、該当最大確定入力順序番号が受信整列マルチキャストプロトコルデータ内の最大確定入力順序番号に更新される。 (Algorithm 6)
In addition, when the ordered multicast protocol data having a definite type having an input sequence number that matches the input sequence number is received by the ordered multicast protocol data transmitting / receiving unit 23, the agreement unit 262 The input packet is determined as the input packet (step D1 in FIG. 11). In step D 1, if the confirmed input packet exists in the reception queue 1, the agreement unit 262 deletes the input packet from the reception queue 1. In step D1, if the maximum confirmed input sequence number in the received aligned multicast protocol data is greater than the corresponding maximum confirmed input sequence number stored in the maximum confirmed input sequence number storage unit 27, the corresponding maximum confirmed input sequence number. Is updated to the maximum definitive input sequence number in the receive aligned multicast protocol data.

ステップＤ１の後、前述のアルゴリズム４におけるステップＡ４乃至Ａ９に相当する処理Ｄ２乃至Ｄ７が次のように実行される。まず合意部２６２は、確定された入力パケットに付されている処理種別情報の示す処理種別が「構成」であるか否（「アプリケーション」である）か、つまり確定された入力パケットがシステム構成変更を要求するシステム構成変更要求パケットであるかを判定する（図１１のステップＤ２）。 After step D1, processes D2 to D7 corresponding to steps A4 to A9 in algorithm 4 described above are executed as follows. First, the agreement unit 262 determines whether the processing type indicated by the processing type information attached to the confirmed input packet is “configuration” (“application”), that is, the confirmed input packet is a system configuration change. Is determined as a system configuration change request packet (step D2 in FIG. 11).

もし、処理種別が「構成」であり、したがって確定された入力パケットがシステム構成変更要求パケットであるならば（図１１のステップＤ２のＹＥＳ）、合意部２６２は、確定された入力パケットをシステム構成管理部６内のシステム構成設定部６４に渡すとともに、当該入力パケットを入力パケットジャーナル記憶部２２に格納する（図１１のステップＤ３）。 If the processing type is “configuration”, and therefore the confirmed input packet is a system configuration change request packet (YES in step D2 in FIG. 11), the agreement unit 262 converts the confirmed input packet into the system configuration. While passing to the system configuration setting unit 64 in the management unit 6, the input packet is stored in the input packet journal storage unit 22 (step D3 in FIG. 11).

システム構成設定部６４は、合意部２６２から確定された入力パケットを受け取ると、当該パケットが自コンピュータを待機状態に変更することを要求するシステム構成変更要求パケットであるかを判定する（図１１のステップＤ４）。もし、自コンピュータを待機状態に変更することを要求するシステム構成変更要求パケットである場合（図１１のステップＤ４のＹＥＳ）、システム構成設定部６４は初期化処理を実施して自コンピュータを待機状態にする（図１１のステップＤ５）。 When the system configuration setting unit 64 receives the input packet determined from the agreement unit 262, the system configuration setting unit 64 determines whether the packet is a system configuration change request packet for requesting the computer to be changed to a standby state (FIG. 11). Step D4). If it is a system configuration change request packet requesting that the own computer be changed to a standby state (YES in step D4 in FIG. 11), the system configuration setting unit 64 performs initialization processing and puts the own computer in a standby state. (Step D5 in FIG. 11).

これに対し、確定された入力パケットが、他のコンピュータを待機状態に変更することを要求するシステム構成変更要求パケットである場合（図１１のステップＤ４のＮＯ）、システム構成設定部６４は、システム構成記憶部６１に格納されているシステム構成情報を、変更後のシステム構成を示すように更新するとともに、その更新後のシステム構成情報を含むシステム構成通知プロトコルデータ（つまり変更後のシステム構成を通知するためのシステム構成通知プロトコルデータ）をシステム構成通知プロトコルデータ送受信部６６により、分散システム１０を構成する他のすべてのコンピュータに送信させる（図１１のステップＤ６）。この場合、合意部２６２は、次工程へ移行すべく、入力順序番号記憶部２１に格納されている該当入力順序番号を次に進め（１インクリメントし）、ステップ番号記憶部２４に格納されている該当ステップ番号を初期化する（図１１のステップＤ７）。このステップＤ７において合意部２６２は、候補パケット記憶部２５に格納されているすべての候補を破棄し、遅延記憶部２８に格納されている（ｎ−１）個の遅延フラグをすべてリセットする。 On the other hand, when the determined input packet is a system configuration change request packet that requests that another computer be changed to a standby state (NO in step D4 in FIG. 11), the system configuration setting unit 64 The system configuration information stored in the configuration storage unit 61 is updated to indicate the system configuration after the change, and system configuration notification protocol data including the updated system configuration information (that is, the system configuration after the change is notified) System configuration notification protocol data) is transmitted to all other computers constituting the distributed system 10 by the system configuration notification protocol data transmitter / receiver 66 (step D6 in FIG. 11). In this case, the agreement unit 262 advances the corresponding input sequence number stored in the input sequence number storage unit 21 to the next step (increment by 1), and is stored in the step number storage unit 24. The corresponding step number is initialized (step D7 in FIG. 11). In step D7, the agreement unit 262 discards all candidates stored in the candidate packet storage unit 25 and resets all (n−1) delay flags stored in the delay storage unit 28.

一方、処理種別が「構成」でないならば、つまり確定された入力パケットがシステム構成変更を要求する入力パケットでない場合（図１１のステップＤ２のＮＯ）、合意部２６２は、当該入力パケットをアプリケーションプログラム３に配送するとともに、当該入力パケットを入力パケットジャーナル記憶部２２に格納する（図１１のステップＤ８）。その後、合意部２６２は、次工程へ移行すべく、上記ステップＤ７を実行する。 On the other hand, if the processing type is not “configuration”, that is, if the determined input packet is not an input packet for requesting a system configuration change (NO in step D2 in FIG. 11), the agreement unit 262 determines that the input packet is an application program. 3 and the input packet is stored in the input packet journal storage unit 22 (step D8 in FIG. 11). Thereafter, the agreement unit 262 executes Step D7 to shift to the next process.

（アルゴリズム７）
一方、該当入力順序番号より小さい入力順序番号を持つ候補種類の整列マルチキャストプロトコルデータが受信された場合で、該当入力順序番号より小さい入力順序番号に対応する入力パケットが入力パケットジャーナル記憶部２２に存在しないならば（図１０のステップＣ１のＮＯ）、合意部２６２はステップＣ３に進む。つまり長い多重化実行の遅延のために、該当入力順序番号より小さい入力順序番号に対応する入力パケットが入力パケットジャーナル記憶部２２から既に捨てられている場合には（図１０のステップＣ１のＮＯ）、合意部２６２は、遅延種類の整列マルチキャストプロトコルデータを整列マルチキャストプロトコルデータ送受信部２３によりネットワークＢを介して受信整列マルチキャストプロトコルデータの送信者に返送させる（図１０のステップＣ３）。 (Algorithm 7)
On the other hand, when the candidate type of ordered multicast protocol data having an input sequence number smaller than the corresponding input sequence number is received, an input packet corresponding to the input sequence number smaller than the corresponding input sequence number exists in the input packet journal storage unit 22 If not (NO in step C1 in FIG. 10), the agreement unit 262 proceeds to step C3. That is, when an input packet corresponding to an input sequence number smaller than the corresponding input sequence number has already been discarded from the input packet journal storage unit 22 due to a long multiplexing execution delay (NO in step C1 in FIG. 10). The agreement unit 262 causes the arranged multicast protocol data transmission / reception unit 23 to return the delay type arranged multicast protocol data to the sender of the received arranged multicast protocol data via the network B (step C3 in FIG. 10).

（アルゴリズム８）
一方、整列マルチキャスト部２内のスキップ判定部２９は、該当入力順序番号に一致する入力順序番号を持つ遅延種類の整列マルチキャストプロトコルデータが整列マルチキャストプロトコルデータ送受信部２３によって受信された場合に、遅延記憶部２８に格納されている（ｎ−１）個の遅延フラグのうち、当該プロトコルデータの送信者に対応する遅延フラグをセットする（図１２のステップＥ１）。 (Algorithm 8)
On the other hand, the skip determining unit 29 in the ordered multicast unit 2 stores the delay when the ordered multicast protocol data transmitting / receiving unit 23 receives a delay type of ordered multicast protocol data having an input sequence number that matches the corresponding input sequence number. Among the (n−1) delay flags stored in the unit 28, a delay flag corresponding to the sender of the protocol data is set (step E1 in FIG. 12).

（アルゴリズム９）
スキップ判定部２９は、遅延記憶部２８に格納されている（ｎ−１）個の遅延フラグのうちのセットされた遅延フラグの数と、候補パケット記憶部２５に格納されている入力候補の数との和が、（ｎ−ｔ）個以上に達したかどうかを監視する（図１３のステップＦ１）。もし、（ｎ−ｔ）個以上に達しているならば（図１３のステップＦ１のＹＥＳ）、スキップ判定部２９は、候補パケット記憶部２５に格納されている入力候補の数が（ｎ−ｔ）個未満かを判定する（図１３のステップＦ２）。もし、（ｎ−ｔ）個未満であるならば（図１３のステップＦ２のＹＥＳ）、スキップ判定部２９はスキップ動作を行う（図１３のステップＦ３）。即ちスキップ判定部２９は、入力順序番号記憶部２１に格納されている該当入力順序番号を最大確定入力順序番号記憶部２７に格納されている該当最大確定入力順序番号に更新し、ステップ番号記憶部２４に格納されている該当ステップ番号を初期ステップ番号にする。またスキップ判定部２９は、候補パケット記憶部２５を空にし、遅延記憶部２８のすべての遅延フラグをリセットした上で、システム構成管理部６およびプログラム状態管理部４にスキップを通知する。 (Algorithm 9)
The skip determination unit 29 includes the number of set delay flags among the (n−1) delay flags stored in the delay storage unit 28 and the number of input candidates stored in the candidate packet storage unit 25. It is monitored whether the sum of and has reached (nt) or more (step F1 in FIG. 13). If the number has reached (n−t) or more (YES in step F1 in FIG. 13), the skip determination unit 29 determines that the number of input candidates stored in the candidate packet storage unit 25 is (nt). It is determined whether the number is less than the number (step F2 in FIG. 13). If the number is less than (nt) (YES in step F2 in FIG. 13), the skip determination unit 29 performs a skip operation (step F3 in FIG. 13). That is, the skip determination unit 29 updates the corresponding input sequence number stored in the input sequence number storage unit 21 to the corresponding maximum determined input sequence number stored in the maximum determined input sequence number storage unit 27, and the step number storage unit The corresponding step number stored in 24 is set as the initial step number. The skip determination unit 29 empties the candidate packet storage unit 25, resets all delay flags in the delay storage unit 28, and notifies the system configuration management unit 6 and the program state management unit 4 of the skip.

システム構成管理部６のシステム構成設定部６４は、スキップ判定部２９からスキップが通知されると、他のコンピュータのシステム構成記憶部６１に格納されているシステム構成情報を自コンピュータのシステム構成記憶部６１にコピーする（図１３のステップＦ４）。システム構成設定部６４は、コピーされたシステム構成情報に基づき、自コンピュータを待機状態に変更する必要があるかを判定する（図１３のステップＦ５）。 When the skip is notified from the skip determination unit 29, the system configuration setting unit 64 of the system configuration management unit 6 uses the system configuration information stored in the system configuration storage unit 61 of another computer as the system configuration storage unit of its own computer. 61 is copied (step F4 in FIG. 13). Based on the copied system configuration information, the system configuration setting unit 64 determines whether it is necessary to change the own computer to a standby state (step F5 in FIG. 13).

もし、自コンピュータを待機状態に変更する必要があるならば（図１３のステップＦ５のＹＥＳ）、システム構成設定部６４は上述の初期化処理を実施して自コンピュータを待機状態にする（図１３のステップＦ６）。これに対し、自コンピュータを待機状態に変更する必要がないならば（図１３のステップＦ５のＮＯ）、システム構成設定部６４は何もせずに、処理を終了する。 If it is necessary to change the own computer to the standby state (YES in step F5 in FIG. 13), the system configuration setting unit 64 performs the initialization process described above to place the own computer in the standby state (FIG. 13). Step F6). On the other hand, if it is not necessary to change the own computer to the standby state (NO in step F5 in FIG. 13), the system configuration setting unit 64 does nothing and ends the process.

一方、プログラム状態管理部４は、スキップ判定部２９からスキップが通知されると、該当入力順序番号の直前の状態を他のコンピュータのプログラム状態管理部４からコピーする（図１３のステップＦ７）。このために、プログラム状態管理部４は、各入力順序番号の直前の状態を最近のものから一定の量だけ保持している。 On the other hand, when the skip is notified from the skip determination unit 29, the program state management unit 4 copies the state immediately before the corresponding input sequence number from the program state management unit 4 of another computer (step F7 in FIG. 13). For this reason, the program state management unit 4 holds the state immediately before each input sequence number by a certain amount from the latest one.

なお、図１３のフローチャートでは、システム構成設定部６４による処理の後にプログラム状態管理部４による処理（ステップＦ７）が実行されている。しかし、この処理の順序が逆であっても、或いは両方の処理が並行して実行されても構わない。 In the flowchart of FIG. 13, the process by the program state management unit 4 (step F7) is executed after the process by the system configuration setting unit 64. However, the order of the processes may be reversed, or both processes may be executed in parallel.

次に、システム構成管理部６が実行するシステム構成変更のための処理について説明する。この処理は、アルゴリズム１０〜１４により実現される。
（アルゴリズム１０）
まず、アルゴリズム１０について、図１４を参照して説明する。図１４は、システム構成管理部６のシステム構成変更検出部６３によって実行される、停滞しているコンピュータを探すための処理の手順を示すフローチャートである。 Next, processing for system configuration change executed by the system configuration management unit 6 will be described. This processing is realized by algorithms 10 to 14.
(Algorithm 10)
First, the algorithm 10 will be described with reference to FIG. FIG. 14 is a flowchart showing a procedure of processing for searching for a stagnant computer executed by the system configuration change detection unit 63 of the system configuration management unit 6.

システム構成変更検出部６３は、一定時間間隔Δｔ１毎に以下の処理を実行する（図１４のステップＧ１）。
まずシステム構成変更検出部６３は、最大確定入力順序番号履歴記憶部６２を参照し、当該記憶部６２に格納されている最大確定入力順序番号履歴のうち、直近の時間Δｔ２（Δｔ２＞Δｔ１）の間に追加された最大確定入力順序番号の情報を取得する（図１４のステップＧ２）。この直近の時間Δｔ２の間に追加された最大確定入力順序番号の情報は、タイムスタンプに基づいて特定される。 The system configuration change detection unit 63 performs the following processing at regular time intervals Δt1 (step G1 in FIG. 14).
First, the system configuration change detection unit 63 refers to the maximum confirmed input sequence number history storage unit 62, and among the maximum confirmed input sequence number history stored in the storage unit 62, the latest time Δt2 (Δt2> Δt1). Information on the maximum confirmed input order number added in between is acquired (step G2 in FIG. 14). The information on the maximum confirmed input sequence number added during the latest time Δt2 is specified based on the time stamp.

次にシステム構成変更検出部６３は、稼動状態にある他のコンピュータについて、自コンピュータの最大確定入力順序番号と比べてΔｉ（Δｉは１より大きい整数）以上最大確定入力順序番号が下回る（つまり自コンピュータよりもΔｉに相当する時間以上遅延している）という条件を満たす回数が一定回数以上発生しているかを、取得された最大確定入力順序番号の情報に基づいて判定する（図１４のステップＧ３）。システム構成変更検出部６３は、このステップＧ３の判定を、稼動状態にある他のすべてのコンピュータについて繰り返す（図１４のステップＧ４）。これによりシステム構成変更検出部６３は、上記条件を満たす回数が一定回数以上の稼動状態にあるコンピュータを探す。 Next, the system configuration change detection unit 63 determines that other computers in the operating state have a maximum confirmed input sequence number that is less than or equal to Δi (Δi is an integer greater than 1) than the maximum confirmed input sequence number of the own computer (that is, the own computer Whether or not the number of times satisfying the condition that the computer is delayed by a time corresponding to Δi from the computer) is more than a certain number is determined based on the acquired information on the maximum confirmed input sequence number (step G3 in FIG. 14). ). The system configuration change detection unit 63 repeats the determination in step G3 for all other computers in the operating state (step G4 in FIG. 14). As a result, the system configuration change detection unit 63 searches for a computer in an operating state in which the number of times satisfying the above condition is equal to or greater than a certain number.

もし、このようなコンピュータを探したならば（図１４のステップＧ３のＹＥＳ）、システム構成変更検出部６３は、当該コンピュータは停滞（定常的に停滞）していると認定する。そこでシステム構成変更検出部６３は、例えば、停滞していると認定されたコンピュータを除く、他の稼動状態にあるコンピュータに対して、当該認定されたコンピュータを停滞確認の対象とする報告種類のシステム構成変更合意プロトコルデータを、システム構成変更合意プロトコルデータ送受信部６５により送信させる（図１４のステップＧ５）。 If such a computer is found (YES in step G3 in FIG. 14), the system configuration change detection unit 63 determines that the computer is stagnant (steadily stagnating). Therefore, the system configuration change detection unit 63, for example, a report type system that targets the certified computer as a stagnation confirmation target for computers in other operating states except for the computer that is certified to be stagnant. The configuration change agreement protocol data is transmitted by the system configuration change agreement protocol data transmission / reception unit 65 (step G5 in FIG. 14).

（アルゴリズム１１）
次に、アルゴリズム１１について、図１５を参照して説明する。図１５は、システム構成変更検出部６３によって実行される、他のコンピュータから報告種類のシステム構成変更合意プロトコルデータを受け取った場合の処理の手順を示すフローチャートである。 (Algorithm 11)
Next, the algorithm 11 will be described with reference to FIG. FIG. 15 is a flowchart illustrating a processing procedure executed by the system configuration change detection unit 63 when report type system configuration change agreement protocol data is received from another computer.

今、システム構成変更検出部６３が、他のコンピュータのシステム構成変更合意プロトコルデータ送受信部６５から送信された報告種類のシステム構成変更合意プロトコルデータを、自コンピュータのシステム構成変更合意プロトコルデータ送受信部６５を介して受け取ったものとする。するとシステム構成変更検出部６３は、最大確定入力順序番号履歴記憶部６２を参照し、当該記憶部６２に格納されている最大確定入力順序番号履歴のうち、直近の時間Δｔ２の間に追加された最大確定入力順序番号の情報を取得する（図１５のステップＨ１）。 Now, the system configuration change detection unit 63 uses the report type system configuration change agreement protocol data transmission / reception unit 65 of another computer as the system configuration change agreement protocol data transmission / reception unit 65 of its own computer. It is assumed that it was received via Then, the system configuration change detection unit 63 refers to the maximum confirmed input sequence number history storage unit 62 and is added during the latest time Δt2 in the maximum confirmed input sequence number history stored in the storage unit 62. Information on the maximum confirmed input order number is acquired (step H1 in FIG. 15).

次にシステム構成変更検出部６３は、受け取ったシステム構成変更合意プロトコルデータで報告された停滞確認の対象となるコンピュータについて、自コンピュータの最大確定入力順序番号と比べてΔｉ以上下回るという条件を満たす回数が一定回数以上発生しているかを、取得された最大確定入力順序番号の情報に基づいて判定する（図１５のステップＨ２）。つまりシステム構成変更検出部６３は、報告された停滞確認の対象となるコンピュータが停滞しているかを検出する。 Next, the system configuration change detecting unit 63 satisfies the condition that the stagnation confirmation target computer reported in the received system configuration change agreement protocol data satisfies a condition that the computer is less than Δi compared to the maximum confirmed input sequence number of the own computer. Is determined based on the acquired information on the maximum confirmed input sequence number (step H2 in FIG. 15). That is, the system configuration change detection unit 63 detects whether or not the reported stagnation confirmation target computer is stagnant.

もし、ステップＨ２の判定がＹＥＳであるならば、つまり報告された停滞確認の対象となるコンピュータの停滞が検出されたならば、システム構成変更検出部６３は、報告種類のシステム構成変更合意プロトコルデータの送信元のコンピュータに対して、同意種類のシステム構成変更合意プロトコルデータを、システム構成変更合意プロトコルデータ送受信部６５により返送させる（図１５のステップＨ３）。これに対し、ステップＨ２の判定がＮＯであるならば、つまり報告された停滞確認の対象となるコンピュータの停滞が検出されなかったならば、システム構成変更検出部６３は、報告種類のシステム構成変更合意プロトコルデータの送信元のコンピュータに対して、非同意種類のシステム構成変更合意プロトコルデータを、システム構成変更合意プロトコルデータ送受信部６５により返送させる（図１５のステップＨ４）。 If the determination in step H2 is YES, that is, if a stagnation of the computer subject to the reported stagnation confirmation is detected, the system configuration change detection unit 63 reports the system configuration change agreement protocol data of the report type. The system configuration change agreement protocol data transmission / reception unit 65 returns the consent type system configuration change agreement protocol data to the transmission source computer (step H3 in FIG. 15). On the other hand, if the determination in step H2 is NO, that is, if no stagnation of the computer subject to the reported stagnation confirmation is detected, the system configuration change detection unit 63 changes the report type system configuration change. The system configuration change agreement protocol data transmission / reception unit 65 of the non-agreement type is returned to the computer that has transmitted the agreement protocol data by the system configuration change agreement protocol data transmission / reception unit 65 (step H4 in FIG. 15).

（アルゴリズム１２）
次に、アルゴリズム１２について、図１６を参照して説明する。図１６は、システム構成変更検出部６３によって実行される、報告種類のシステム構成変更合意プロトコルデータに対する応答として、他のコンピュータから同意種類または非同意種類のシステム構成変更合意プロトコルデータを受け取った場合の処理の手順を示すフローチャートである。 (Algorithm 12)
Next, the algorithm 12 will be described with reference to FIG. FIG. 16 shows a case where consent type or non-agreement type system configuration change agreement protocol data is received from another computer as a response to the report type system configuration change agreement protocol data executed by the system configuration change detection unit 63. It is a flowchart which shows the procedure of a process.

今、システム構成変更検出部６３が、他のコンピュータのシステム構成変更合意プロトコルデータ送受信部６５から返された同意種類または非同意種類のシステム構成変更合意プロトコルデータを、自コンピュータのシステム構成変更合意プロトコルデータ送受信部６５を介して受け取ったものとする。するとシステム構成変更検出部６３は、受け取った同意種類または非同意種類のシステム構成変更合意プロトコルデータの提案者フィールドおよび提案時タイムスタンプフィールドを参照する（図１６のステップＩ１）。 Now, the system configuration change detection unit 63 uses the system configuration change agreement protocol data of the own computer as the agreement type or non-agreement type system configuration change agreement protocol data returned from the system configuration change agreement protocol data transmission / reception unit 65 of the other computer. It is assumed that the data is received via the data transmission / reception unit 65. Then, the system configuration change detection unit 63 refers to the proposer field and the proposal time stamp field of the received system configuration change agreement protocol data of the consent type or the non-agreement type (step I1 in FIG. 16).

次にシステム構成変更検出部６３は、今回受け取ったシステム構成変更プロトコルデータを含めて、提案者および提案時タイムスタンプがいずれも報告種類のシステム構成変更合意プロトコルデータと同一の同意種類または非同意種類のシステム構成変更合意プロトコルデータを（ｎ−ｔ−１）個以上受け取ったかを判定する（図１６のステップＩ２）。 Next, the system configuration change detection unit 63 includes the system configuration change protocol data received this time, and both the proposer and the proposal time stamp are the same agreement type or non-agreement type as the report type system configuration change agreement protocol data. It is determined whether (nt-1) or more system configuration change agreement protocol data has been received (step I2 in FIG. 16).

もし、このようなシステム構成変更合意プロトコルデータを（ｎ−ｔ−１）個以上受け取ったならば（図１６のステップＩ２のＹＥＳ）、システム構成変更検出部６３は、そのうちの（ｎ−ｔ−１）個以上が同意種類のシステム構成変更合意プロトコルデータであるかを判定する（図１６のステップＩ３）。つまり、ステップＩ３においてシステム構成変更検出部６３は、提案者および提案時タイムスタンプがいずれも報告種類のシステム構成変更合意プロトコルデータと同一の同意種類のシステム構成変更合意プロトコルデータを（ｎ−ｔ−１）個以上受け取ったかを判定する。この判定は、（ｎ−ｔ）台以上の稼動状態にあるコンピュータの間で、停滞が検出されたコンピュータについて合意がとられているかを判定することと等価である。 If (nt-1) or more of such system configuration change agreement protocol data is received (YES in step I2 in FIG. 16), the system configuration change detection unit 63 (nt-t-) 1) It is determined whether or not the number of pieces of data is agreement type system configuration change agreement protocol data (step I3 in FIG. 16). That is, in step I3, the system configuration change detection unit 63 obtains (nt−t−) the system configuration change agreement protocol data of the same agreement type as the report type system configuration change agreement protocol data. 1) It is determined whether or not it has been received. This determination is equivalent to determining whether or not an agreement has been reached for computers in which stagnation has been detected among computers in the operating state of (nt) or more.

ステップＩ３の判定がＹＥＳの場合、つまり停滞が検出されたコンピュータについて（ｎ−ｔ）台以上の稼動状態にあるコンピュータの間で合意がとられていると判定された場合、システム構成変更検出部６３は、ステップＩ４に進む。このステップＩ４において、システム構成変更検出部６３はシステム構成記憶部６１を参照して、待機状態にあるコンピュータが存在するかを判定する。即ちシステム構成変更検出部６３は、停滞が検出されたコンピュータからフェイルオーバ可能なコンピュータが存在するかを判定する（図１６のステップＩ４）。 If the determination in step I3 is YES, that is, if it is determined that an agreement has been reached between computers in the operating state of (nt) or more computers with stagnation detected, the system configuration change detection unit 63 proceeds to step I4. In step I4, the system configuration change detection unit 63 refers to the system configuration storage unit 61 and determines whether there is a computer in a standby state. That is, the system configuration change detection unit 63 determines whether there is a computer that can fail over from the computer in which the stagnation is detected (step I4 in FIG. 16).

ステップＩ４の判定がＹＥＳの場合、システム構成変更検出部６３は、システム構成変更要求パケットを生成し、当該システム構成変更要求パケットを入力パケットとして受付キュー１に積む（図１６のステップＩ５）。このシステム構成変更要求パケット（入力パケット）には、「構成」を示す処理種別情報が付されている。受付キュー１に積まれたシステム構成変更要求パケットは、入力パケットとして前述のアルゴリズム４で処理される。 If the determination in step I4 is YES, the system configuration change detection unit 63 generates a system configuration change request packet and places the system configuration change request packet in the reception queue 1 as an input packet (step I5 in FIG. 16). The system configuration change request packet (input packet) is attached with processing type information indicating “configuration”. The system configuration change request packet loaded in the reception queue 1 is processed by the algorithm 4 described above as an input packet.

システム構成変更要求パケットは、システム構成情報を含む。システム構成変更検出部６３は、このシステム構成情報を、現在自コンピュータのシステム構成記憶部６１に格納されているシステム構成情報のコピーを利用して生成する。即ちシステム構成変更検出部６３は、システム構成情報のコピーのうち、停滞が検出されたことの合意がとられたコンピュータの状態を示す情報を待機状態を示すように更新するとともに、当該コンピュータの優先度を他の待機状態にあるコンピュータよりも低くなるように更新し、さらに、最も優先度の高い待機状態にあるコンピュータの状態を示す情報を稼動状態を示すように更新する。この更新されたシステム構成情報のコピーが、システム構成変更要求パケットに用いられる。したがってシステム構成変更要求パケットは、停滞が検出されたことの合意がとられたコンピュータを待機状態とすることを要求するとともに、当該コンピュータの優先度を他の待機状態にあるコンピュータよりも低く設定し、かつ最も優先度の高い待機状態にあるコンピュータを稼動状態とすることを要求する。つまりシステム構成変更要求パケットは、停滞が検出されたことの合意がとられたコンピュータから最も優先度の高い待機状態にあるコンピュータにフェイルオーバすることを要求するパケットである。
一方、ステップＩ２，Ｉ３またはＩ４の判定がＮＯの場合、システム構成変更検出部６３は処理を終了する。 The system configuration change request packet includes system configuration information. The system configuration change detection unit 63 generates the system configuration information using a copy of the system configuration information currently stored in the system configuration storage unit 61 of the own computer. That is, the system configuration change detection unit 63 updates the information indicating the state of the computer that has been agreed that the stagnation is detected among the copies of the system configuration information so as to indicate the standby state, and prioritizes the computer. The information is updated so as to indicate the operating state, and the information indicating the state of the computer in the standby state with the highest priority is updated. The updated copy of the system configuration information is used for the system configuration change request packet. Therefore, the system configuration change request packet requests that the computer that has been agreed that the stagnation has been detected be set in the standby state, and sets the priority of the computer to be lower than those in other standby computers. The computer in the standby state with the highest priority is requested to be in the operating state. That is, the system configuration change request packet is a packet for requesting failover from the computer that has been agreed that the stagnation has been detected to the computer in the standby state with the highest priority.
On the other hand, if the determination in step I2, I3, or I4 is NO, the system configuration change detection unit 63 ends the process.

（アルゴリズム１３）
次に、アルゴリズム１３について、図１７を参照して説明する。図１７は、システム構成設定部６４によって実行される、整列マルチキャスト部２の合意部２６２から確定されたシステム構成変更要求パケットを受け取った場合の処理の手順を示すフローチャートである。 (Algorithm 13)
Next, the algorithm 13 will be described with reference to FIG. FIG. 17 is a flowchart showing a processing procedure executed by the system configuration setting unit 64 when a system configuration change request packet determined from the agreement unit 262 of the ordered multicast unit 2 is received.

システム構成設定部６４は、合意部２６２から確定されたシステム構成変更要求パケットを受け取ると、以下の処理を分散システム１０を構成するすべてのコンピュータについて実行する（図１７のステップＪ１）。 When the system configuration setting unit 64 receives the confirmed system configuration change request packet from the agreement unit 262, the system configuration setting unit 64 executes the following processing for all the computers constituting the distributed system 10 (step J1 in FIG. 17).

まずシステム構成設定部６４は、合意部２６２から渡されたシステム構成変更要求パケットに含まれているシステム構成情報（以下、新たなシステム構成情報と称する）を現在システム構成記憶部６１に格納されているシステム構成情報（以下、現システム構成情報と称する）と比較する（図１７のステップＪ２）。この比較の結果に基づき、システム構成設定部６４は、現在対象としているコンピュータについて、待機状態から稼動状態への変更が要求されているかを判定する（図１７のステップＪ３）。なお、稼動状態から待機状態への変更が要求されている場合、アルゴリズム４（具体的にはステップＡ４以降）で処理される。 First, the system configuration setting unit 64 stores system configuration information (hereinafter referred to as new system configuration information) included in the system configuration change request packet passed from the agreement unit 262 in the current system configuration storage unit 61. The system configuration information (hereinafter referred to as current system configuration information) is compared (step J2 in FIG. 17). Based on the result of this comparison, the system configuration setting unit 64 determines whether a change from the standby state to the operating state is requested for the current target computer (step J3 in FIG. 17). Note that when a change from the operating state to the standby state is requested, processing is performed by algorithm 4 (specifically, step A4 and subsequent steps).

もし、現在対象としているコンピュータについて、待機状態から稼動状態への変更が要求されているならば（図１７のステップＪ３のＹＥＳ）、システム構成設定部６４は、自コンピュータのシステム構成通知プロトコルデータ送受信部６６から現在対象としているコンピュータ（つまり待機状態から稼動状態への変更が要求されたコンピュータ）のシステム構成通知プロトコルデータ送受信部６６に対して、システム構成通知プロトコルデータにより新たなシステム構成情報を送信させる（図１７のステップＪ４）。このシステム構成通知プロトコルデータ（新たなシステム構成情報）の送信により、送信先のコンピュータに対して、待機状態から稼動状態への変更が通知される。 If the current target computer is requested to change from the standby state to the operating state (YES in step J3 in FIG. 17), the system configuration setting unit 64 transmits / receives the system configuration notification protocol data of its own computer. New system configuration information is transmitted from the unit 66 to the system configuration notification protocol data transmission / reception unit 66 of the current target computer (that is, the computer requested to change from the standby state to the operating state) using the system configuration notification protocol data. (Step J4 in FIG. 17). The transmission of the system configuration notification protocol data (new system configuration information) notifies the transmission destination computer of the change from the standby state to the operating state.

またシステム構成設定部６４は、自コンピュータ内の各記憶部に格納されている、稼動状態への変更が要求されたコンピュータに関する情報を初期化する（図１７のステップＪ５）。ここでは、遅延記憶部２８内の（ｎ−１）個の遅延フラグうちの、稼動状態への変更が要求されたコンピュータに新たに対応付けられる遅延フラグ、および、最大確定入力順序番号履歴記憶部６２内の最大確定入力順序番号履歴のうちの、稼動状態への変更が要求されたコンピュータに新たに対応付けられる最大確定入力順序番号履歴が初期化される。また、ステップＪ５では、システム構成記憶部６１に格納されている現システム構成情報が新たなシステム構成情報に更新される。 Further, the system configuration setting unit 64 initializes information related to the computer requested to be changed to the operating state, which is stored in each storage unit in the own computer (step J5 in FIG. 17). Here, among the (n−1) delay flags in the delay storage unit 28, a delay flag newly associated with the computer requested to change to the operating state, and the maximum confirmed input sequence number history storage unit Of the maximum confirmed input sequence number histories in 62, the maximum confirmed input sequence number history newly associated with the computer requested to change to the operating state is initialized. In step J5, the current system configuration information stored in the system configuration storage unit 61 is updated to new system configuration information.

（アルゴリズム１４）
次に、アルゴリズム１４について、図１８を参照して説明する。図１８は、システム構成設定部６４によって実行される、システム構成通知プロトコルデータ送受信部６６からシステム構成通知プロトコルデータを受け取った場合の処理の手順を示すフローチャートである。 (Algorithm 14)
Next, the algorithm 14 will be described with reference to FIG. FIG. 18 is a flowchart showing a processing procedure executed by the system configuration setting unit 64 when system configuration notification protocol data is received from the system configuration notification protocol data transmission / reception unit 66.

今、自コンピュータのシステム構成通知プロトコルデータ送受信部６６が他のコンピュータのシステム構成通知プロトコルデータ送受信部６６から送信されたシステム構成通知プロトコルデータを受信したものとする。システム構成通知プロトコルデータ送受信部６６は、自身が受信したシステム構成通知プロトコルデータを自コンピュータ内のシステム構成設定部６４に渡す。 Assume that the system configuration notification protocol data transmission / reception unit 66 of the own computer has received the system configuration notification protocol data transmitted from the system configuration notification protocol data transmission / reception unit 66 of another computer. The system configuration notification protocol data transmission / reception unit 66 passes the system configuration notification protocol data received by itself to the system configuration setting unit 64 in its own computer.

システム構成設定部６４は、システム構成通知プロトコルデータ送受信部６６からシステム構成通知プロトコルデータを受け取ると、自コンピュータが待機状態にあるかを判定する（図１８のステップＫ１）。もし、自コンピュータが待機状態にあるならば（図１８のステップＫ１のＹＥＳ）、システム構成設定部６４は受け取ったシステム構成通知プロトコルデータが、自コンピュータを稼動状態に変更することを示すシステム構成情報を含んでいるかを判定する（図１８のステップＫ２）
もし、システム構成通知プロトコルデータが自コンピュータを稼動状態に変更することを示しているならば（図１８のステップＫ２のＹＥＳ）、システム構成設定部６４は、システム構成記憶部６１に格納されているシステム構成情報を、当該プロトコルデータに含まれているシステム構成情報に更新する（図１８のステップＫ３）。そしてシステム構成設定部６４は、初期化処理を行い、整列マルチキャスト部２を動作可能状態にする（図１８のステップＫ４）。 Upon receiving the system configuration notification protocol data from the system configuration notification protocol data transmission / reception unit 66, the system configuration setting unit 64 determines whether the computer is in a standby state (step K1 in FIG. 18). If the own computer is in a standby state (YES in step K1 in FIG. 18), the system configuration setting unit 64 indicates that the received system configuration notification protocol data indicates that the own computer is changed to an operating state. (Step K2 in FIG. 18)
If the system configuration notification protocol data indicates that the own computer is to be changed to the operating state (YES in step K2 in FIG. 18), the system configuration setting unit 64 is stored in the system configuration storage unit 61. The system configuration information is updated to the system configuration information included in the protocol data (step K3 in FIG. 18). Then, the system configuration setting unit 64 performs initialization processing, and puts the ordered multicast unit 2 into an operable state (step K4 in FIG. 18).

一方、ステップＫ１またはＫ２の判定がＮＯの場合、システム構成設定部６４は処理を終了する。例えば、システム構成変更要求パケットによって稼動状態から待機状態に変更することが要求されているコンピュータのシステム構成変更検出部６３では、ステップＫ１の判定がＮＯとなる。この場合、システム構成変更検出部６３は、システム構成記憶部６１に格納されているシステム構成情報を新たなシステム構成情報に更新することなく処理を終了する。このようにしても、待機状態に変更することが要求されているコンピュータは、他の稼動状態にあるコンピュータから無視されるため何ら問題とならない。なお、稼動状態から待機状態に変更することが要求されているコンピュータにおいてシステム構成情報の更新が行われても構わない。 On the other hand, if the determination in step K1 or K2 is NO, the system configuration setting unit 64 ends the process. For example, in the system configuration change detection unit 63 of the computer that is requested to change from the operating state to the standby state by the system configuration change request packet, the determination in step K1 is NO. In this case, the system configuration change detection unit 63 ends the process without updating the system configuration information stored in the system configuration storage unit 61 to new system configuration information. Even if it does in this way, since the computer requested | required to change to a standby state is disregarded from the computer in another operating state, it will not pose any problem. The system configuration information may be updated in a computer that is requested to change from the operating state to the standby state.

なお、以上に述べたアルゴリズム１〜１４の順序は、必ずしもこの順序で実行されるというものではない。つまり、これらは、その動作条件が成立すれば独立して実行されるものである。 The order of the algorithms 1 to 14 described above is not necessarily executed in this order. That is, these are executed independently if the operating condition is satisfied.

本実施形態において、システム構成管理部６内のシステム構成変更検出部６３は、最大確定入力順序番号履歴記憶部６２に格納されている情報のうち、直近の一定時間（Δｔ２）における各コンピュータの整列マルチキャスト部２の処理状況を反映した情報に基づき、自コンピュータよりも最大確定入力順序番号がΔｉ（Δｉは１より大きい整数）以上下回ったコンピュータ（さらに詳細には、Δｉ以上下回った回数が一定回数以上となるコンピュータ）を、停滞しているコンピュータとして検出している（アルゴリズム１０）。Δｉを１より大きい整数としている理由は、ｔ−耐故障アルゴリズムを適用した分散システムでは、当該アルゴリズムが（ｎ−ｔ）台のコンピュータの合意があれば次の処理を実行できることによる。つまり、ｔ−耐故障アルゴリズムを適用すると、原理的に、分散システム内のあるコンピュータにとって、ｔ台のコンピュータは入力順序番号が１遅れている状態に見えてしまうためである。よって、Δｉを１より大きい整数とすることにより、停滞しているコンピュータの誤検出を防ぐことができる。 In the present embodiment, the system configuration change detection unit 63 in the system configuration management unit 6 arranges the computers in the latest fixed time (Δt2) among the information stored in the maximum confirmed input sequence number history storage unit 62. Based on information reflecting the processing status of the multicast unit 2, a computer whose maximum confirmed input sequence number is lower than its own computer by Δi (Δi is an integer greater than 1) or more (more specifically, a certain number of times that the number is less than Δi) The above computer is detected as a stagnant computer (algorithm 10). The reason why Δi is an integer larger than 1 is that in the distributed system to which the t-fault tolerance algorithm is applied, the algorithm can execute the following processing if there is an agreement of (nt) computers. That is, when the t-fault tolerance algorithm is applied, in principle, for a certain computer in the distributed system, t computers appear to be in a state where the input sequence number is delayed by one. Therefore, by making Δi an integer greater than 1, erroneous detection of a stagnant computer can be prevented.

また本実施形態では、停滞しているコンピュータを検出するための時間範囲Δｔ２を設定することで、一時的に停滞しているコンピュータが停滞しているコンピュータとして検出される不具合を防止できる。 Further, in this embodiment, by setting the time range Δt2 for detecting a stagnant computer, it is possible to prevent a problem that a temporarily stagnating computer is detected as a stagnating computer.

また本実施形態では、停滞しているコンピュータの検出について、（ｎ−ｔ）台以上のコンピュータで合意がとれたことを、フェイルオーバのきっかけとしている（アルゴリズム１２）。これにより、コンピュータ間で通信ができないという理由だけで、各々のコンピュータが通信相手のコンピュータを停滞していると判定して、スプリットブレインが発生するのを防ぐことができる。 In the present embodiment, the detection of a stagnant computer is triggered by a failover (algorithm 12) that an agreement is reached between (nt) or more computers. Accordingly, it is possible to prevent the occurrence of split brain by determining that each computer is stagnating with the communication partner computer only because the computers cannot communicate with each other.

また本実施形態では、システム構成変更要求パケットを整列マルチキャストの入力パケットとしている（アルゴリズム１２）。これにより、停滞が確認されたために待機状態に変更されたコンピュータが復帰してきた際に、変更される前の処理は実行し、変更された後の処理は実行しないようにできる。 In this embodiment, the system configuration change request packet is used as an input packet for ordered multicast (algorithm 12). As a result, when the computer that has been changed to the standby state because of the stagnation is restored, the process before the change is executed, and the process after the change is not executed.

なお、本発明は、上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。例えば、上記実施形態では、稼動状態に設定されるコンピュータ（多重化を構成するコンピュータ）の数ｎが４で、ｔが１である。しかし、ｎは４以上の整数であればよく、つまり稼動状態に設定されるコンピュータは４台以上であればよく、ｔは３ｔ＜ｎを満たす０より大きい整数であればよい。この場合、ｔ−耐故障アルゴリズムにより、ｎ台のコンピュータのうちの最大ｔ台の故障を許容することができる。 Note that the present invention is not limited to the above-described embodiment as it is, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. For example, in the above-described embodiment, the number n of computers set in the operating state (computers constituting the multiplexing) is 4, and t is 1. However, n may be an integer equal to or greater than 4, that is, four or more computers may be set in the operating state, and t may be an integer greater than 0 that satisfies 3t <n. In this case, the t-fault tolerance algorithm can tolerate a maximum of t faults among the n computers.

特に、｛ｎ，ｔ｝を、上記条件を満たし、かつ（ｎ−ｔ）の値が最小値となるように設定するならば、ｎ台のコンピュータのうちの最大ｔ台の故障を許容するのに、最も少ない台数のコンピュータで合意をとることができる。ここで、ｔの条件３ｔ＜ｎを変形すると、ｔはｔ＜ｎ／３を満たす０より大きい整数であるといえる。このことから、（ｎ−ｔ）の値は、ｎに対して（ｎ−ｔ）＞２ｎ／３のような関係がある。したがって、｛ｎ，ｔ｝を、上記条件を満たし、かつ（ｎ−ｔ）の値が２ｎ／３を超える最小値となるように設定するならば、つまり｛ｎ，ｔ｝を、いわゆる過２／３の合意をとるように設定するならば、最も少ない台数のコンピュータで合意をとることができる。（ｎ−ｔ）の値が２ｎ／３を超える最小値となる｛ｎ，ｔ｝は、｛４，１｝，｛７，２｝，｛１０，３｝，｛１３，４｝…である。 In particular, if {n, t} is set so that the above condition is satisfied and the value of (n−t) is the minimum value, failure of a maximum of t computers out of n computers is allowed. In addition, an agreement can be reached with the smallest number of computers. Here, if the condition 3t <n of t is modified, it can be said that t is an integer larger than 0 that satisfies t <n / 3. From this, the value of (nt) has a relationship such that (nt)> 2n / 3 with respect to n. Therefore, if {n, t} is set so that the above condition is satisfied and the value of (nt) exceeds 2n / 3, that is, {n, t} If it is set so that an agreement of / 3 is obtained, an agreement can be reached with the smallest number of computers. {N, t} at which the value of (nt) exceeds 2n / 3 is {4, 1}, {7, 2}, {10, 3}, {13, 4}. .

また上記実施形態では、システム構成変更検出部６３が停滞しているコンピュータを検出するのに、最大確定入力順序番号履歴記憶部６２に直近の時間Δｔ２の間に追加された最大確定入力順序番号の情報の数（つまり標本数）を特に考慮していない。しかし、この標本数が一定数以下、つまり時間Δｔ２における最大確定入力順序番号の変更が一定回数以下である場合、停滞しているコンピュータの検出を抑止するようにしても良い。このように、停滞しているコンピュータの検出に標本数が少ない場合を排除することで、誤検出を防止することができる。 In the above embodiment, the system configuration change detection unit 63 detects the stagnant computer, and the maximum confirmed input sequence number added during the most recent time Δt2 to the maximum confirmed input sequence number history storage unit 62. No particular consideration is given to the number of information (ie the number of samples). However, when the number of samples is equal to or less than a certain number, that is, when the change of the maximum confirmed input sequence number at time Δt2 is equal to or less than a certain number of times, detection of a stagnant computer may be suppressed. In this way, erroneous detection can be prevented by eliminating the case where the number of samples is small for detection of a stagnant computer.

また、Δｔ２の間に最大確定入力順序番号履歴記憶部６２に追加された最大確定入力順序番号の情報を用いる代わりに、一定数の最大確定入力順序番号の情報を用いようにしてもよい。具体的には、自コンピュータについての変更された最大確定入力順序番号の情報の数（つまり自コンピュータについての標本数）を一定数として、これらの最大確定入力順序番号と対応する他のコンピュータについての最大確定入力順序番号とを比較して、最大確定入力順序番号が自コンピュータよりもΔｉ以上下回る数（つまり割合）が一定数以上となるコンピュータを、停滞しているコンピュータとして検出するようにしてもよい。このように、標本数を一定数とすることで、誤検出を防止することができる。 Further, instead of using the information on the maximum confirmed input sequence number added to the maximum confirmed input sequence number history storage unit 62 during Δt2, information on a certain number of maximum confirmed input sequence numbers may be used. Specifically, the number of information of the changed maximum confirmed input sequence number for the own computer (that is, the number of samples for the own computer) is set as a fixed number, and the other computers corresponding to these maximum confirmed input sequence numbers are determined. Comparing with the maximum confirmed input sequence number, a computer in which the number (that is, the ratio) of which the maximum confirmed input sequence number is lower than the own computer by Δi or more is a certain number or more is detected as a stagnant computer. Good. In this way, erroneous detection can be prevented by setting the number of samples to a fixed number.

また、時間Δｔ２の間に最大確定入力順序番号履歴記憶部６２に追加された各コンピュータの最大確定入力順序番号の情報に基づき、自コンピュータと比べてΔｉ以上最大確定入力順序番号が下回り、その番号の差分が時間の経過に応じて拡大しているコンピュータを、停滞しているコンピュータとして検出するようにしてもよい。このようにすると、停滞しているコンピュータを高精度で検出できる。 Further, based on the information of the maximum determined input sequence number of each computer added to the maximum determined input sequence number history storage unit 62 during the time Δt2, the maximum determined input sequence number is lower than the own computer by Δi or more, and the number A computer in which the difference between them increases with the passage of time may be detected as a stagnant computer. In this way, a stagnant computer can be detected with high accuracy.

また、上記実施形態に開示されている複数の構成要素の適宜な組み合わせにより種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。 In addition, various inventions can be formed by appropriately combining a plurality of components disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment.

１…入力受付キュー部（受付キュー）、２…整列マルチキャスト部、３…アプリケーションプログラム、４…プログラム状態管理部、５…出力フィルタ部、６…システム構成管理部、１０…分散システム、２１…入力順序番号記憶部、２２…入力パケットジャーナル記憶部、２３…整列マルチキャストプロトコルデータ送受信部、２４…ステップ番号記憶部、２５…候補パケット記憶部、２６…入力パケット確定判定部、２７…最大確定入力順序番号記憶部、２８…遅延記憶部、２９…スキップ判定部、６１…システム構成記憶部、６２…最大確定入力順序番号履歴記憶部、６３…システム構成変更検出部、６４…システム構成設定部、６５…システム構成変更合意プロトコルデータ送受信部、６６…システム構成通知プロトコルデータ送受信部、１００-1〜１００-5，１００-i…コンピュータ、２００…クライアント装置、２６１…入力候補収集部、２６２…合意部。 DESCRIPTION OF SYMBOLS 1 ... Input reception queue part (reception queue), 2 ... Alignment multicast part, 3 ... Application program, 4 ... Program state management part, 5 ... Output filter part, 6 ... System configuration management part, 10 ... Distributed system, 21 ... Input Sequence number storage unit, 22 ... input packet journal storage unit, 23 ... aligned multicast protocol data transmission / reception unit, 24 ... step number storage unit, 25 ... candidate packet storage unit, 26 ... input packet decision determination unit, 27 ... maximum decision input order Number storage unit, 28 ... delay storage unit, 29 ... skip determination unit, 61 ... system configuration storage unit, 62 ... maximum confirmed input sequence number history storage unit, 63 ... system configuration change detection unit, 64 ... system configuration setting unit, 65 ... System configuration change agreement protocol data transmission / reception unit, 66 ... System configuration notification protocol data transmission / reception Part, 100-1~100-5,100-i ... computer, 200 ... client device, 261 ... input candidate collection unit, 262 ... agreement portion.

Claims

By setting n computers (n is an integer satisfying 4 ≦ n <m) out of m computers (m is an integer of 5 or more) connected by a network, and applying the t-fault tolerance algorithm A distributed system with failover function that operates synchronously,
Each of the m computers is
System configuration storage means for storing system configuration information indicating whether each of the m computers is in an operating state to be operated synchronously or in a standby state;
(N−t) computers (n is an integer larger than 0 satisfying 3t <n) among n computers indicated to be in an operating state by the system configuration information stored in the system configuration storage unit. In order to guarantee the multiplexing described above, an agreement means for arranging and multicasting the input data by agreeing the input data with the (nt) computers or more,
Among the n computers, a computer stagnating more than a predetermined level compared to itself is detected based on the execution status of the ordered multicast by the agreement means, and the stagnating computer is detected. In this case, regarding the detection of the stagnant computer, the detection of the stagnant computer is confirmed by agreeing with (nt) or more computers including itself among the n computers. Detection means;
In response to the confirmation of the detection of the stagnant computer, the confirmed stagnant computer is set in the standby state, and the computer in the standby state is stored in the system configuration storage means so as to be in the operating state. And a system configuration setting unit that performs failover from the stagnating computer to the computer in the standby state by updating the system configuration information information.

When the detection unit confirms the stagnated computer, the detection unit changes the configuration of the distributed system so that the confirmed stagnant computer is set in a standby state and the computer in the standby state is set in an operating state. Providing the system configuration change request data for requesting the agreement means as the input data to align and multicast the system configuration change request data;
The system configuration setting unit performs a change to the configuration of the distributed system indicated by the system configuration change request data when the system configuration change request data is subjected to ordered multicast by the agreement unit. A distributed system having a failover function according to Item 1.

The agreement means manages the input data to be agreed by a sequence number serially attached to the input data,
Each of the m computers stores, for each of the n computers, a sequence number history for storing a history of the maximum sequence numbers attached to input data agreed with other computers in operation. A storage means;
The detection means detects the stagnated computer based on a history of maximum sequence numbers for each of the n computers stored in the sequence number history storage means. Item 3. A distributed system having a failover function according to item 1 or 2.

4. The distributed system having a failover function according to claim 3, wherein the predetermined level is a difference between the sequence numbers greater than 1.

When the detection means detects the stagnated computer, the report type protocol data for reporting the detected stagnant computer to all computers in other operating states, other operating System configuration change agreement protocol that transmits agreement type protocol data indicating that the report type protocol data transmitted from the computer in the state agrees, and non-agreement type protocol data indicating that the report is not agreed System configuration change agreement protocol data transmission / reception means for receiving the report type protocol data, the agreement type protocol data and the non-agreement type protocol data transmitted from a computer in another operating state Further comprising means,
The detection means detects the stagnated computer, and the system configuration change agreement protocol data transmission / reception means transmits the report type protocol data to another computer in the operating state, and the report type protocol data. On the other hand, when (n-t-1) or more computers return the protocol data of the consent type, the (n-t) or more computers including itself are related to the detection of the stagnant computer. 5. A distributed system having a failover function according to any one of claims 1 to 4, wherein an agreement is reached.

By setting n computers (n is an integer satisfying 4 ≦ n <m) out of m computers (m is an integer of 5 or more) connected by a network, and applying the t-fault tolerance algorithm In a distributed system that operates synchronously, when the operation of any one of the n computers is stagnant, a failover method for failing over from the stagnating computer to a computer in a standby state Because
Each of the m computers includes system configuration storage means for storing system configuration information indicating whether the m computers are in an operation state to be operated synchronously or in a standby state, respectively. An agreement means, a detection means, and a system configuration setting means,
The failover method is:
(N−t) of the n computers in which the agreement means of each of the m computers is shown to be in operation by the system configuration information stored in the system configuration storage means. (T is an integer greater than 0 satisfying 3t <n) In order to guarantee multiplexing at or above, the input data is agreed by the (n−t) or more computers, and the input data is arranged and multicast. Steps,
The detecting means detecting, among the n computers, a computer that has stagnated more than a predetermined level in comparison with itself based on an execution status of the ordered multicast by the agreement means;
When the stagnant computer is detected, the detection means agrees on (nt) or more computers including itself among the n computers with respect to detection of the stagnant computer. Confirming the detection of the stagnant computer;
In response to the confirmation of the detection of the stagnant computer, the confirmed stagnant computer is set in the standby state, and the computer in the standby state is stored in the system configuration storage means so as to be in the operating state. And failover to the computer in the standby state by updating the system configuration information information.