JP2005202779A

JP2005202779A - Synchronous processing method for distributed computer system

Info

Publication number: JP2005202779A
Application number: JP2004009570A
Authority: JP
Inventors: Kinji Mori; 欣司森; Hiroyuki Yashiro; 裕之矢代
Original assignee: IHI Aerospace Co Ltd; Mori Kinji
Current assignee: IHI Aerospace Co Ltd; Mori Kinji
Priority date: 2004-01-16
Filing date: 2004-01-16
Publication date: 2005-07-28
Anticipated expiration: 2024-01-16
Also published as: JP4753344B2

Abstract

<P>PROBLEM TO BE SOLVED: To solve the problems wherein conventional synchronous processing techniques are unsuitable for an application that must not be stopped and are difficult to use in synchronizing nodes of a distributed computer system. <P>SOLUTION: For synchronization of nodes in a distributed computer system, each node starts a synchronous timer ST corresponding to a period of synchronous processing and, upon the end thereof, broadcasts synchronization messages SM on a network. After the start of an authentication window Wq, upon exchange of a plurality of synchronization messages SM, which meets synchronization conditions, the start timing of the synchronous timer ST is autonomously adjusted so as to match common timing obtained from the exchange timing of the synchronization messages SM, and the synchronous timer ST is started at the start timing. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、複数のコンピュータをネットワークで相互に接続した分散コンピュータシステムの同期処理方法に関し、とくに、リアルタイム性やフォールトトレラント性が要求される分散コンピュータシステムにおいてコンピュータノード間の同期化を行うのに用いる同期処理方法に関するものである。 The present invention relates to a synchronization processing method for a distributed computer system in which a plurality of computers are connected to each other via a network. In particular, the present invention is used to perform synchronization between computer nodes in a distributed computer system that requires real-time performance and fault tolerance. The present invention relates to a synchronous processing method.

一般に、パーソナルコンピュータ、ワークステーション、組み込みコンピュータ、搭載コンピュータ及び制御用コンピュータなどのコンピュータシステムにおいては、単一のコンピュータを用いる代わりに、複数のコンピュータを何らかの情報伝達媒体から成るネットワークで相互に接続した分散コンピュータシステムを用いることがある。この分散コンピュータシステムは、単一のコンピュータでは演算能力が不足する場合、単一のコンピュータでは信頼性が不足する場合、コンピュータシステムで制御する対象が物理的に広範囲に存在する場合、コンピュータシステムに対する要求の変化が頻繁である場合などに用いるのに好適である。 In general, in a computer system such as a personal computer, a workstation, an embedded computer, an on-board computer, and a control computer, instead of using a single computer, a plurality of computers are connected to each other via a network composed of some kind of information transmission medium. A computer system may be used. This distributed computer system requires a computer system when a single computer lacks computing power, when a single computer lacks reliability, or when there is a physically wide range of objects to be controlled by the computer system. It is suitable for use in the case where the change of is frequent.

ところで、分散コンピュータシステム上で動作するアプリケーションには、何らかの実時間制約を受けて動作するものがある。このようなアプリケーションは、リアルタイムアプリケーションと呼ばれており、例えば、入力対象であるセンサ等からデータを周期的に取得して必要な演算を行い、出力対象であるアクチュエータ等に対して周期的に結果の出力を行うものがある。そして、このようなリアルタイムアプリケーションを分散コンピュータシステム上で動作させる場合には、その分散コンピュータシステムを構成する各コンピュータノード（以下、「ノード」とする）を所定の時間範囲内で同期させることが必要になる。 By the way, some applications that operate on a distributed computer system operate under some real-time constraints. Such an application is called a real-time application. For example, data is periodically acquired from a sensor that is an input target, necessary calculations are performed, and results are periodically output to an actuator that is an output target. Some output When such a real-time application is operated on a distributed computer system, it is necessary to synchronize each computer node (hereinafter referred to as “node”) constituting the distributed computer system within a predetermined time range. become.

より具体的に説明すると、図１は分散コンピュータシステムの一例を示す図であって、図示の分散コンピュータシステムは、個々のセンサＳａ，Ｓｂに接続した複数のセンサノードＮｓａ，Ｎｓｂと、複数の制御演算ノードＮｃ１，Ｎｃ２，Ｎｃ３と、多数決回路Ｖｏを介してアクチュエータＡに接続した複数の出力ノードＮａ１，Ｎａ２，Ｎａ３を備えると共に、ネットワークＮＷにより各ノードＮｓ，Ｎｃ，Ｎａを互いに接続している。 More specifically, FIG. 1 is a diagram illustrating an example of a distributed computer system. The illustrated distributed computer system includes a plurality of sensor nodes Nsa and Nsb connected to individual sensors Sa and Sb, and a plurality of controls. The operation nodes Nc1, Nc2, Nc3 and a plurality of output nodes Na1, Na2, Na3 connected to the actuator A via the majority circuit Vo are provided, and the nodes Ns, Nc, Na are connected to each other by a network NW.

図示の分散コンピュータシステムにおいて、センサノードＮｓａ，Ｎｓｂがデータを得るためにセンサＳａ，Ｓｂに対してリクエスト信号を送るケースを考えると、各センサＳａ，Ｓｂから同時刻にセンシングされたデータを得る必要がある場合、各センサノードＮｓａ，Ｎｓｂは、所定の範囲内の時間差で夫々のセンサＳａ，Ｓｂにリクエスト信号を送らねばならないので、共通のタイミングを有する必要がある。また、アクチュエータＡに制御信号を送信するケースを考えると、図示の場合には、信頼性確保のために冗長系を成している出力ノードＮａ１，Ｎａ２，Ｎａ３の各出力信号を多数決回路Ｖｏにより一つの信号にする。この際、各出力ノードＮａ１，Ｎａ２，Ｎａ３は、多数決回路Ｖｏに対する夫々の出力信号が所定の時間範囲内に収まっていなければならないので、共通のタイミングを有する必要がある。 In the illustrated distributed computer system, when sensor nodes Nsa and Nsb send request signals to sensors Sa and Sb in order to obtain data, it is necessary to obtain data sensed at the same time from the sensors Sa and Sb. If there is, the sensor nodes Nsa and Nsb must send a request signal to each of the sensors Sa and Sb with a time difference within a predetermined range, and thus need to have a common timing. Considering the case where a control signal is transmitted to the actuator A, in the case shown in the figure, each output signal of the output nodes Na1, Na2, and Na3 forming a redundant system is ensured by the majority circuit Vo to ensure reliability. Make one signal. At this time, each of the output nodes Na1, Na2, and Na3 must have a common timing because each output signal for the majority circuit Vo must be within a predetermined time range.

このように、リアルタイムアプリケーションでは、分散コンピュータシステムを構成するノードを所定の時間内に同期させることが要求されている。そして、一般的に各ノードは自らの発振器を備え、これらの発振器が時間経過とともにドリフトするため、ノード間において適切な周期でタイミング調整すなわち同期処理を行う必要がある。 As described above, the real-time application is required to synchronize the nodes constituting the distributed computer system within a predetermined time. In general, each node has its own oscillator, and these oscillators drift with time. Therefore, it is necessary to perform timing adjustment, that is, synchronization processing at an appropriate cycle between nodes.

ところで、ひと口にリアルタイムアプリケーションといっても、それが要求するリアルタイム性や信頼性は多様である。つまり、ノード間のタイミングのばらつきの許容度が大きいものもあれば小さいものもあるし、制御を行う周期や入力から出力までの応答時間についても、数ｍｓｅｃレベルから数秒レベルまで多岐にわたる。また、高信頼性を要するものとそうでないものもあり、さらには、別々のリアルタイム性要求や信頼性要求をもつ複数のアプリケーションがシステム中に同時に混在するケースもあれば、時間とともに要求レベルが変化するケースもある。 By the way, even if it is a real-time application, there are various real-time properties and reliability required. In other words, there are cases where the tolerance of timing variation between nodes is large and small, and the period for performing control and the response time from input to output range from several milliseconds to several seconds. Some require high reliability and some do not. Furthermore, there are cases where multiple applications with different real-time requirements and reliability requirements coexist in the system at the same time, and the request level changes over time. There is a case to do.

このため、アプリケーション毎の要求に応じて専用の同期処理方法を構築するのでは、非常にコストがかかるため、低コストでの同期化を実現するには、多様なアプリケーションに柔軟に対応できる汎用性が高い同期処理方法でなければならない。そのような同期処理方法は、信頼度要求レベルやリアルタイム要求レベルに依らずに、あらゆるケースに適用し得るものでなければならず、例えば下記の（Ａ）〜（Ｃ）の要求を満足させる必要がある。 For this reason, it is very expensive to build a dedicated synchronization processing method according to the requirements of each application. To achieve low-cost synchronization, versatility that can flexibly support various applications. There must be a high synchronous processing method. Such a synchronous processing method must be applicable to all cases regardless of the reliability requirement level or the real-time requirement level, and for example, needs to satisfy the following requirements (A) to (C): There is.

（Ａ）分散コンピュータシステム内の一式以上のノードが故障によって一時的に同期から離脱した場合でも、残りのノードにおいては、既存の同期状態が無停止で成立し続けなければならない。また、一式以上のノードが故障から復帰する場合、一式以上のノードをシステムに追加する場合、あるいはシステムの一部のノードがテスト状態にある場合でも、残りのノードにおいては、既存の同期状態が無停止で成立し続けなければならない。さらに、アプリケーションの種類や使用する部品の故障率によって使用されるノード数は多様であり、且つそれらが時間とともに変化し得るため、同期系を構成するノード数が固定される方法であってはならない。 (A) Even if one or more nodes in the distributed computer system are temporarily out of synchronization due to a failure, the existing synchronization state must continue to be established without stopping in the remaining nodes. In addition, even if one or more nodes recover from a failure, add one or more nodes to the system, or even if some of the nodes in the system are in the test state, It must continue to be established without stopping. Furthermore, since the number of nodes used varies depending on the type of application and the failure rate of the components used, and they can change with time, it should not be a method in which the number of nodes constituting the synchronous system is fixed. .

（Ｂ）同期している複数のノード間の同期精度に対する要求が動的に変化する場合、同期系に対する信頼性要求（誤同期確率要求）が動的に変化する場合、又は同期精度や信頼性要求のレベルが異なる複数のアプリケーションがシステム中に混在する場合でも、システムとしてアプリケーションの処理及び出力を停止せずに対応できなければならない。 (B) When the request for synchronization accuracy between multiple synchronized nodes dynamically changes, when the reliability requirement for the synchronization system (mis-synchronization probability request) changes dynamically, or the synchronization accuracy and reliability Even when a plurality of applications having different request levels coexist in the system, the system must be able to cope without stopping the processing and output of the application.

（Ｃ）低コスト化を実現するため、既存の汎用的なハードウエアを用いて構築できなければならない。 (C) In order to realize cost reduction, it must be possible to construct using existing general-purpose hardware.

ここで、コンピュータシステムにおいて、耐故障性（フォールトトレランス性）を実現する同期処理技術としては、例えば、複数のステーションのオペレーティングシステム時刻クロックを同期化するに際し、タイムキーパステーションからのクロックメッセージを受信した時の各ステーションのネットワークインターフェイスの自由走行カウンタのカウントをラッチし、タイムキーパステーションはそのオペレーティングシステム時刻クロック及び自由走行カウンタからそのクロックメッセージの受信時刻を計算し、このタイムキーパ受信時刻を他のステーションへブロードキャストし、その他のステーションはそのオペレーティングシステム時刻とその自由走行カウンタのカウントからそれ自身の受信時刻を計算し、その受信時刻とタイムキーパ受信時刻との差によりオペレーティングシステム時刻を補正するものがあった。 Here, in the computer system, as a synchronization processing technique for realizing fault tolerance (fault tolerance), for example, when a clock message from a timekeeper station is received when synchronizing the operating system time clocks of a plurality of stations. The time keeper station calculates the reception time of the clock message from its operating system time clock and free running counter, and broadcasts this time keeper reception time to other stations. The other station then calculates its own reception time from its operating system time and the count of its free running counter. It was something to correct the operating system time by the difference between the keeper reception time.

また、複数の冗長系からなるフォールトトレラントシステムにおいて、複数の冗長系毎に独立したクロック発振器及び全冗長系に共通したクロック発振器を備え、共通したクロック発振器からの共通クロックを全冗長系の同期に使用するようにしたものがあった。
特開平６−５２０７６号公報特開平３−２６６０１１号公報 In a fault-tolerant system consisting of multiple redundant systems, each redundant system has an independent clock oscillator and a clock oscillator that is common to all redundant systems. There was something to be used.
JP-A-6-52076 Japanese Patent Laid-Open No. 3-266011

上記したような従来の同期処理方法において、タイムキーパステーションがシステム全体に共通タイミングを供給する方法では、タイムキーパステーションが故障すると、次のタイムキーパが選出されるまでの間は同期処理が一時停止するため、停止が許容されないアプリケーションには用いることができないほか、故障したステーションが不正にタイムキーパになって、同期系を乱す恐れがあった。また、全冗長系に共通したクロック発振器を用いた方法では、一般の分散コンピュータシステムにおけるノード間の同期には用いることができなかった。 In the conventional synchronization processing method as described above, when the time keeper station supplies common timing to the entire system, if the time keeper station fails, the synchronization processing is temporarily stopped until the next time keeper is selected. For this reason, it cannot be used for an application where stoppage is not permitted, and the failed station may become a timekeeper illegally and disturb the synchronization system. Further, the method using a clock oscillator common to all redundant systems cannot be used for synchronization between nodes in a general distributed computer system.

本発明の分散コンピュータシステムの同期処理方法は、請求項１として、複数のコンピュータをネットワークで相互に接続した分散コンピュータシステムにおいてコンピュータノード間の同期化を行うに際し、各ノードは、同期処理を行う周期に相当する時間の同期タイマを開始し、同期タイマの終了とともにネットワーク上に同期メッセージをブロードキャストして全ての他ノードに同期メッセージを送信する一方で、同期メッセージを送受信した際には、所定時間の認証用ウインドウを開始し、認証用ウインドウ中に既定の複数の同期メッセージを送受信した場合には、同期条件成立として同期メッセージの送受信タイミングから得られる共通のタイミングに一致するように自らの同期タイマの開始時期を自律的に調整してその開始時期で同期タイマを開始し、同期条件成立の際に自ノードの同期タイマが終了していない場合には、同期メッセージをネットワーク上にブロードキャストせずに同期タイマを再び開始する構成としており、上記構成をもって従来の課題を解決するための手段としている。 According to a distributed processing method of a distributed computer system of the present invention, as claimed in claim 1, when performing synchronization between computer nodes in a distributed computer system in which a plurality of computers are connected to each other via a network, each node performs a synchronization process. When the synchronization timer is transmitted and received, the synchronization message is transmitted to all the other nodes by broadcasting the synchronization message on the network when the synchronization timer expires. When an authentication window is started and a plurality of predetermined synchronization messages are transmitted and received during the authentication window, the synchronization timer is set so that the synchronization timing is met and the common timing obtained from the transmission / reception timing of the synchronization message is matched. Autonomous adjustment of start time and start time When the synchronization timer is started and the synchronization timer of the own node has not expired when the synchronization condition is satisfied, the synchronization timer is started again without broadcasting the synchronization message on the network. As a means to solve the problem.

また、本発明の分散コンピュータシステムの同期処理方法は、請求項２として、既存の同期系にノードを追加する場合、ノードがリセット後又は電源ＯＮ後である場合、及びノードに故障が発生した場合等において、そのノードが非同期状態になった際に、該当するノードは、所定時間の非同期タイマを開始し、非同期タイマの終了とともにネットワーク上に同期メッセージをブロードキャストして全ての他ノードに同期メッセージを送信する一方で、同期メッセージを送受信した際には、所定時間の認証用ウインドウを開始し、認証用ウインドウ中に既定の複数の同期メッセージを送受信した場合には、同期条件成立として同期タイマを開始し、同期メッセージを送受信したにもかかわらず認証用ウインドウ中に同期条件が成立しない場合には、非同期タイマを再び開始することを特徴とし、請求項３として、必要に応じて同期メッセージの内部にタイムスタンプを記載し、複数のノード間において、同期タイマの開始時期の一致化と同時にタイムスタンプの一致化を行うことを特徴とし、請求項４として、アプリケーションの要求の動的変化や環境条件の変化に応じて、各ノードが自律的に同期タイマ、非同期タイマ及び認証用ウインドウの時間を変更することを特徴とし、請求項５として、必要に応じて同期メッセージの内部に、同期メッセージを送信するノードの、ネットワーク上で重複の無いノード識別番号を記載し、同期メッセージを受信するノードが、認証用ウインドウ内にて既定時間内に同一ノードから複数受信した同期メッセージを、認証用ウインドウ内に受信した同期メッセージ数としてカウントしないことを特徴としている。 Further, according to the synchronous processing method of the distributed computer system of the present invention, as claimed in claim 2, when adding a node to an existing synchronous system, when the node is after resetting or after power-on, and when a failure occurs in the node When the node becomes asynchronous, the corresponding node starts an asynchronous timer for a predetermined time, and when the asynchronous timer ends, broadcasts a synchronous message on the network and sends the synchronous message to all other nodes. On the other hand, when a synchronization message is transmitted / received, an authentication window for a predetermined time is started, and when a plurality of predetermined synchronization messages are transmitted / received in the authentication window, a synchronization timer is started as a synchronization condition is satisfied. If the synchronization condition is not satisfied in the authentication window even though the synchronization message is sent and received Is characterized in that the asynchronous timer is restarted, and according to claim 3, a time stamp is written in the synchronization message as necessary, and the time of synchronization timer start timing coincides between a plurality of nodes. According to claim 4, each node autonomously sets the time of a synchronous timer, an asynchronous timer, and an authentication window in response to a dynamic change of an application request or a change of environmental conditions. According to a fifth aspect of the present invention, a node that receives a synchronization message is described in the synchronization message as needed, wherein a node identification number of a node that transmits the synchronization message is described in the synchronization message without duplication on the network. In the authentication window, multiple synchronization messages received from the same node within the predetermined time are received in the authentication window. It is characterized in that not count as the synchronization number of messages.

本発明の請求項１に係る分散コンピュータシステムの同期処理方法によれば、単一故障点となりうるマスターノードを必要とせずに複数ノード間の同期化が可能であるため、耐故障性（フォールトトレランス性）が高いものとなり、また、自ノードの同期タイマが満期になる前に同期条件が成立した場合には、ネットワーク上への同期メッセージの送信を自律的に抑制するため、ネットワーク負荷の上昇を抑えて、同期に参加できるノード数の制限をなくすことができる。さらに、複数のノードに同時に故障が発生しても、成立している同期系に影響を与えることなく無停止で同期を継続させることが可能であり、さらには、標準的な構成のコンピュータノードとブロードキャストが可能なネットワークを用いる以外には特別なハードウエアを必要としないため、同期を必要とするあらゆるタイプのアプリケーションに適用可能であって、低コスト化も実現し得るものとなる。 According to the synchronous processing method of the distributed computer system according to claim 1 of the present invention, since synchronization between a plurality of nodes is possible without requiring a master node that can be a single failure point, fault tolerance (fault tolerance) ), And if the synchronization condition is met before the synchronization timer of the local node expires, the transmission of the synchronization message on the network is autonomously suppressed, so the increase in network load is suppressed. Thus, the limit on the number of nodes that can participate in synchronization can be removed. Furthermore, even if multiple nodes fail at the same time, it is possible to continue synchronization without interruption without affecting the established synchronization system. Since no special hardware is required other than using a network capable of broadcasting, the present invention can be applied to any type of application that requires synchronization, and the cost can be reduced.

本発明の請求項２に係る分散コンピュータシステムの同期処理方法によれば、請求項１と同様の効果を得ることができるうえに、複数のノードが既に成立している同期系へ参加したり復帰したりする際に、成立している同期系には影響を与えずに全体システムとして無停止で同期を継続させることが可能であり、また、システム全体の起動時においても、ノード単体の起動時や復帰時とを区別する特別なハードウエアの仕組みを必要とせず、標準的な構成のみで同期化が可能である。 According to the synchronous processing method of the distributed computer system according to claim 2 of the present invention, the same effect as that of claim 1 can be obtained, and a plurality of nodes can participate in or return to a synchronous system that has already been established. It is possible to continue synchronization without stopping the entire system without affecting the established synchronization system, and even when starting the entire system, It is possible to synchronize with only a standard configuration without requiring a special hardware mechanism for distinguishing from the time of recovery and recovery.

本発明の請求項３に係る分散コンピュータシステムの同期処理方法によれば、請求項１及び２と同様の効果を得ることができるうえに、複数のノード間の同期化のみならず、タイムスタンプの一致化を行うため、同期処理を行う周期よりも長い処理周期を持つアプリケーションに対しても有効である。 According to the synchronous processing method of the distributed computer system according to claim 3 of the present invention, the same effects as in claims 1 and 2 can be obtained, and not only the synchronization among a plurality of nodes but also the time stamp Since matching is performed, it is also effective for an application having a processing cycle longer than the cycle for performing the synchronization processing.

本発明の請求項４に係る分散コンピュータシステムの同期処理方法によれば、請求項１〜３と同様の効果を得ることができるうえに、アプリケーションのリアルタイム性要求や信頼性要求が動的に変化する場合や、異なるレベルのリアルタイム要求や信頼性要求を有する複数のアプリケーションがシステム中に混在している場合であっても、同期化を実現することができる。 According to the synchronous processing method of the distributed computer system according to claim 4 of the present invention, the same effects as in claims 1 to 3 can be obtained, and the real-time requirement and reliability requirement of the application change dynamically. In this case, synchronization can be realized even when a plurality of applications having different levels of real-time requirements and reliability requirements are mixed in the system.

本発明の請求項５に係る分散コンピュータシステムの同期処理方法によれば、請求項１〜４と同様の効果を得ることができるうえに、故障したノードが複数の同期メッセージを送信してしまうことによる障害を排除することができる。 According to the distributed computer system synchronization processing method of the fifth aspect of the present invention, the same effects as in the first to fourth aspects can be obtained, and a failed node can transmit a plurality of synchronous messages. The failure due to can be eliminated.

本発明の分散コンピュータシステムの同期処理方法に用いるハードウエアは、以下に示すノードとネットワークである。なお、ネットワークは、一般的に組織全体の状態を表わすものであるが、ここではノード間を接続する情報伝達媒体を示す。 The hardware used for the synchronous processing method of the distributed computer system of the present invention is the following nodes and network. Note that the network generally represents the state of the entire organization, but here shows an information transmission medium for connecting nodes.

図２に示すノード１は、プロセッサ２、メモリ３、クロック４、タイマ５、ネットワーク接続インターフェース６及びローカルＩ／Ｏインターフェース７を備えると共に、故障対策のためのハードウエアデバイスとして、必要に応じてウオッチドッグタイマ（ＷＤＴ）８、過電流検出／遮断回路９、及びメモリのビット反転を検出・修正するためのＥＤＡＣ回路１０を備えている。そして、分散コンピュータシステムは、複数のノード１をネットワークＮＷで相互に接続することにより構成してある。 The node 1 shown in FIG. 2 includes a processor 2, a memory 3, a clock 4, a timer 5, a network connection interface 6, and a local I / O interface 7, and as a hardware device for failure countermeasures, a watch is used as necessary. A dog timer (WDT) 8, an overcurrent detection / shut-off circuit 9, and an EDAC circuit 10 for detecting and correcting bit inversion of the memory are provided. The distributed computer system is configured by connecting a plurality of nodes 1 to each other via a network NW.

ネットワーク接続インターフェース６は、ネットワークＮＷを介して複数のノード１同士を相互に接続するために用いるもので、必要に応じて二重系や三重系の冗長構成を採用することができる。ローカルＩ／Ｏインターフェース７の先に接続するハードウエアデバイスは、アプリケーションによって異なり、パーソナルコンピュータやワークステーション等のマン−マシンインターフェースを有するコンピュータである場合には、モニタ、キーボード及びマウスなどであり、組み込みコンピュータや搭載コンピュータである場合には、センサやアクチュエータなどである。 The network connection interface 6 is used to connect a plurality of nodes 1 to each other via the network NW, and a double or triple redundant configuration can be adopted as necessary. The hardware device connected to the tip of the local I / O interface 7 differs depending on the application, and in the case of a computer having a man-machine interface such as a personal computer or a workstation, it is a monitor, a keyboard, a mouse, etc. In the case of a computer or an on-board computer, it is a sensor or an actuator.

ここで、当該同期処理方法では、信頼度の高い特別なノードが無い場合でも同期処理を実現することから、分散コンピュータシステム中にタイムキーパノード等の特別なノード（マスタノード）が存在せず、いずれのノード１にも故障が起こり得ることを前提としている。また、当該同期処理方法では、高い無停止及び拡張性を実現することから、各ノードは、自律した存在であって、ネットワーク接続インターフェース６又はローカルＩ／Ｏインターフェース７からデータを受信した際には、その内容を吟味して自律的判断により適切な処理を起動し、自らのタイマ管理を行い、適切と判断したタイミングでネットワーク接続インターフェース６又はローカルＩ／Ｏインターフェース７からメッセージを送信するものとなっている。 Here, in the synchronous processing method, since the synchronous processing is realized even when there is no special node with high reliability, there is no special node (master node) such as a timekeeper node in the distributed computer system, It is assumed that any node 1 can fail. In addition, since the synchronization processing method achieves high non-stop and expandability, each node is an autonomous entity and receives data from the network connection interface 6 or the local I / O interface 7. The content is examined and an appropriate process is started by autonomous judgment, the timer is managed by itself, and a message is transmitted from the network connection interface 6 or the local I / O interface 7 at a timing judged to be appropriate. ing.

さらに、ノード間の同期を行うために、各ノード１が相互に複数種の専用インターフェースを備えていると、ノード間の結合が密になって同期を行うノード数が固定される。したがって、分散コンピュータシステムが柔軟なノード数の構成を採用し得るようにするために、ノード間の接続はネットワークＮＷのみとしている。なお、ネットワークＮＷは、同期処理にのみ用いるもの又は同期処理とアプリケーションの処理を併用するもののいずれでもよい。また、ネットワークＮＷは、信頼性確保のために図１に示す如く冗長構成にしてもよい。さらに、ネットワークＮＷは、有線又は無線、あるいはその両方によって構成しても良い。 Furthermore, in order to synchronize between nodes, if each node 1 is provided with a plurality of types of dedicated interfaces, the coupling between the nodes becomes dense and the number of nodes that perform synchronization is fixed. Therefore, in order to enable the distributed computer system to adopt a configuration having a flexible number of nodes, only the network NW is connected between the nodes. The network NW may be either one used only for synchronization processing or one used in combination with synchronization processing and application processing. Further, the network NW may have a redundant configuration as shown in FIG. 1 in order to ensure reliability. Furthermore, the network NW may be configured by wire, wireless, or both.

このように、本発明の分散コンピュータシステムの同期処理方法に用いるハードウエアは、標準的な構成のノード１と、ブロードキャストが可能なネットワークＮＷのみであって、それ以外に特別なハードウエアを必要としない。これにより、以下に述べる当該同期処理方法は、同期を必要とするあらゆるアプリケーションに適用可能となる。 As described above, the hardware used in the synchronous processing method of the distributed computer system of the present invention is only the node 1 having a standard configuration and the network NW capable of broadcasting, and other special hardware is required. do not do. Accordingly, the synchronization processing method described below can be applied to any application that requires synchronization.

図３はノード１の状態遷移図であり、図４は同期成立条件としてＮ＝２の場合におけるノード１の状態遷移図である。図３及び図４において、当該同期処理方法を実施する各ノード１は、同期状態（Ｓ）、同期遷移状態（Ｔ）、非同期状態（Ａ）及び初期状態（Ｉ）のいずれかの状態にある。 FIG. 3 is a state transition diagram of the node 1, and FIG. 4 is a state transition diagram of the node 1 when N = 2 as a synchronization establishment condition. 3 and 4, each node 1 that implements the synchronous processing method is in one of a synchronous state (S), a synchronous transition state (T), an asynchronous state (A), and an initial state (I). .

同期状態（Ｓ）は、自ノード１の同期タイマＳＴが動作し、少なくとも一式以上の他ノードと同期している状態である。同期遷移状態（Ｔ）は、自ノード１の同期タイマＳＴも非同期タイマＡＴも動作せず、同期状態から非同期状態へ移る中間の状態、又は非同期状態から同期状態に移る中間の状態である。非同期状態（Ａ）は、自ノード１がどの他ノードとも同期しておらず、同期するために非同期タイマＡＴを動作させている状態である。そして、初期状態（Ｔ）は、自ノード１が電源ＯＮ又はリセット後の初期化時や、故障発生直後などのようにまだ同期処理を開始していない状態である。 The synchronization state (S) is a state in which the synchronization timer ST of the own node 1 operates and is synchronized with at least one set of other nodes. The synchronous transition state (T) is an intermediate state in which the synchronous timer ST and the asynchronous timer AT of the node 1 do not operate and the state shifts from the synchronous state to the asynchronous state, or the intermediate state shifts from the asynchronous state to the synchronous state. The asynchronous state (A) is a state in which the own node 1 is not synchronized with any other node, and the asynchronous timer AT is operated for synchronization. The initial state (T) is a state in which the synchronization process has not yet started, such as when the node 1 is initialized after power-on or reset, or immediately after a failure occurs.

ここで、図３及び図４において、同期タイマＳＴは、０を初期値として時間の経過とともにカウント値が増すタイマであり、その最大カウント値に相当する時間は全ノードで共通の時間Ｔｓ（秒）に設定してあり、同期条件成立時に動作して次の同期処理を行うタイミングを各ノードが認識するために用いる。ただし、同期タイマＳＴの最大カウント値に相当する時間Ｔｓは要求に応じて可変である。また、非同期タイマＡＴは、０を初期値として時間の経過とともにカウント値が増すタイマであり、その最大カウント値に相当する時間は全ノードで共通の時間Ｔａ（秒）に設定してあり、非同期時に他ノードとの同期を行うために用いる。ただし、非同期タイマＡＴの最大カウント値に相当する時間Ｔａは要求に応じて可変である。 Here, in FIGS. 3 and 4, the synchronization timer ST is a timer whose count value increases with the passage of time with 0 as an initial value, and the time corresponding to the maximum count value is a time Ts (seconds) common to all nodes. ), And is used for each node to recognize the timing at which the synchronization process is performed and the next synchronization process is performed. However, the time Ts corresponding to the maximum count value of the synchronization timer ST is variable as required. The asynchronous timer AT is a timer whose count value increases with the passage of time with 0 as an initial value, and the time corresponding to the maximum count value is set to a common time Ta (seconds) in all nodes. Sometimes used to synchronize with other nodes. However, the time Ta corresponding to the maximum count value of the asynchronous timer AT is variable as required.

図３及び図４において、符号Ｗｑは認証用ウインドウであって、この認証用ウインドウＷｑは、０を初期値として時間の経過とともにカウント値が増すタイマであり、その最大カウント値に相当する時間は全ノードで共通の時間Ｔｗｑ（秒）に設定してあり、受信した同期メッセージＳＭの受信タイミングが認証用ウインドウＷｑ内又は外であるかによって当該同期メッセージＳＭの有効性を判断するために用いる。ただし、認証用ウインドウＷｑの最大カウント値に相当する時間Ｔｗｑは要求に応じて可変である。符号ＳＭは同期メッセージであって、この同期メッセージＳＭは、必要に応じてそのデータ部にタイムスタンプ（シーケンスナンバー）が記載され、また、必要に応じてそのデータ内部に、同期メッセージＳＭを送信するノードの、ネットワークＮＷ上で重複の無いノード識別番号が記載される。さらに、符号Ｎは、同期条件成立に必要な同期メッセージＳＭの数であり、符号ｎｒｔは、認証用ウインドウＷｑ中に送受信した同期メッセージＳＭの数である。この同期メッセージＳＭの数ｎｒｔは、同期条件成立時においてタイムスタンプの一致を含める場合には、認証用ウインドウＷｑ中に送受信した同期メッセージＳＭのうちタイムスタンプが一致した同期メッセージＳＭの数となる。なお、上記の同期メッセージ数ｎｒｔは、故障ノードが同期メッセージＳＭを複数誤送信する影響が憂慮される場合には、これを排除するため、認証用ウインドウＷｑ内の既定時間内に同一ノードから複数の同期メッセージＳＭを受信してもこれをカウントしない。 In FIGS. 3 and 4, symbol Wq is an authentication window, and this authentication window Wq is a timer whose count value increases with the passage of time with 0 as an initial value, and the time corresponding to the maximum count value is A common time Twq (seconds) is set in all the nodes, and is used to determine the validity of the synchronization message SM depending on whether the reception timing of the received synchronization message SM is within or outside the authentication window Wq. However, the time Twq corresponding to the maximum count value of the authentication window Wq is variable as required. The code SM is a synchronization message, and this synchronization message SM has a time stamp (sequence number) written in its data part as necessary, and transmits the synchronization message SM inside the data as necessary. A node identification number that is not duplicated on the network NW of the node is described. Further, the symbol N is the number of synchronization messages SM necessary for establishing the synchronization condition, and the symbol nrt is the number of synchronization messages SM transmitted / received during the authentication window Wq. The number nrt of the synchronization messages SM is the number of synchronization messages SM having the same time stamp among the synchronization messages SM transmitted / received in the authentication window Wq when the time stamp match is included when the synchronization condition is satisfied. It should be noted that the number nrt of the synchronization messages may be set from the same node within a predetermined time within the authentication window Wq in order to eliminate the influence of the erroneous transmission of the synchronization message SM by the failed node. This is not counted even if the synchronization message SM is received.

当該同期処理方法は、自律タイミング同期処理方法とも呼べるもので、その基本処理において、各ノードは、同期処理を行う周期に相当する時間Ｔｓの同期タイマＳＴを開始し、同期タイマＳＴの終了とともにネットワークＮＷ上に同期メッセージＳＭをブロードキャストして全ての他ノードに同期メッセージＳＭを送信する。その一方で、各ノードは、認証用ウインドウＷｑが既に開始されていない場合において、他ノードからの同期メッセージＳＭを受信した際に、又は自ら同期メッセージＳＭを送信した際に、所定時間Ｔｗｑの認証用ウインドウＷｑを開始し、認証用ウインドウＷｑ中に既定の複数（図４においてはＮ＝２）の同期メッセージＳＭを送受信した場合には、同期条件成立として同期メッセージＳＭの送受信タイミングから得られる共通のタイミングに一致するように自らの同期タイマＳＴの開始時期を自律的に調整して、その開始時期で同期タイマＳＴを開始する。 The synchronization processing method can also be called an autonomous timing synchronization processing method. In the basic processing, each node starts a synchronization timer ST at a time Ts corresponding to a period for performing the synchronization processing, and when the synchronization timer ST ends, the network The synchronization message SM is broadcast on the NW, and the synchronization message SM is transmitted to all other nodes. On the other hand, when the authentication window Wq has not been started, each node authenticates the predetermined time Twq when receiving the synchronization message SM from another node or when transmitting the synchronization message SM itself. When the synchronization window SM is started and a plurality of predetermined synchronization messages SM (N = 2 in FIG. 4) are transmitted / received in the authentication window Wq, the common condition obtained from the transmission / reception timing of the synchronization message SM as the synchronization condition is established. The start time of its own synchronization timer ST is autonomously adjusted so as to coincide with this timing, and the synchronization timer ST is started at that start time.

これに対して、同期条件成立の際に自ノードの同期タイマＳＴが終了していない場合には、同期メッセージＳＭをネットワークＮＷ上にブロードキャストせず、同期タイマＳＴを再び開始する。このようにして、同期したい全てのノードが同様の処理を行うことにより同期が成立することとなる。アプリケーション処理を複数のノード間で同期させたい各ノードは、各々の同期タイマＳＴの値が予め定められた値になったときに当該処理を起動することにより、その処理が複数ノード間で同期して実施されることになる。 On the other hand, if the synchronization timer ST of the own node is not expired when the synchronization condition is satisfied, the synchronization timer ST is started again without broadcasting the synchronization message SM on the network NW. In this way, synchronization is established when all the nodes that want to synchronize perform the same processing. Each node that wishes to synchronize application processing between a plurality of nodes starts the processing when the value of each synchronization timer ST reaches a predetermined value, so that the processing is synchronized between the plurality of nodes. Will be implemented.

また、当該同期処理方法では、既存の同期系にノードを追加する場合、ノードがリセット後又は電源ＯＮ後である場合、及びノードに故障が発生した場合等において、そのノードが非同期状態になった際に、該当するノードは、所定時間Ｔａの非同期タイマＡＴを開始し、非同期タイマＡＴの終了とともにネットワークＮＷ上に同期メッセージＳＭをブロードキャストして全ての他ノードに同期メッセージＳＭを送信する。その一方で、各ノードは、認証用ウインドウＷｑが既に開始されていない場合において、他ノードからの同期メッセージＳＭを受信した際に、又は自ら同期メッセージＳＭを送信した際に、所定時間Ｔｗｑの認証用ウインドウＷｑを開始し、認証用ウインドウＷｑ中に既定の複数の同期メッセージＳＭを送受信した場合には、同期条件成立として同期タイマＳＴを開始する。その後の同期タイマＳＴの動作は先述の通りである。 In addition, in the synchronous processing method, when adding a node to an existing synchronous system, when the node is reset or after power is turned on, or when a failure occurs in the node, the node becomes asynchronous. At this time, the corresponding node starts the asynchronous timer AT for a predetermined time Ta, broadcasts the synchronous message SM on the network NW when the asynchronous timer AT ends, and transmits the synchronous message SM to all other nodes. On the other hand, when the authentication window Wq has not been started, each node authenticates the predetermined time Twq when receiving the synchronization message SM from another node or when transmitting the synchronization message SM itself. When the synchronization window ST is started and a plurality of predetermined synchronization messages SM are transmitted / received in the authentication window Wq, the synchronization timer ST is started as the synchronization condition is satisfied. The subsequent operation of the synchronization timer ST is as described above.

これに対して、同期メッセージＳＭを送受信したにもかかわらず認証用ウインドウＷｑ中で同期条件が成立しない場合には、各ノードは、非同期タイマＡＴを再び開始し、上記の処理を繰り返し行う。このようにして、最終的には同期が成立することとなる。 On the other hand, when the synchronization condition is not satisfied in the authentication window Wq despite the transmission / reception of the synchronization message SM, each node starts the asynchronous timer AT again and repeats the above processing. In this way, synchronization is finally established.

さらに、当該同期処理方法では、必要に応じて同期メッセージＳＭの内部にタイムスタンプを記載し、複数のノード間において、同期タイマＳＴの開始時期の一致化と同時にタイムスタンプの一致化を行うことにより、同期処理を行う周期よりも長い処理周期を持つアプリケーションに対しても有効となる。 Further, in the synchronization processing method, a time stamp is described in the synchronization message SM as necessary, and the time stamps are matched at the same time as the synchronization timer ST start times are matched between a plurality of nodes. This is also effective for an application having a processing cycle longer than the cycle for performing the synchronization processing.

さらに、当該同期処理方法では、アプリケーションの要求の動的変化や環境条件の変化に応じて、各ノードが予め定めたロジックに従って自律的に同期タイマＳＴ、非同期タイマＡＴ及び認証用ウインドウＷｑの時間Ｔｓ，Ｔａ，Ｔｗｑを変更すれば、アプリケーションのリアルタイム性要求や信頼性要求が動的に変化する場合や、異なるレベルのリアルタイム要求や信頼性要求を有する複数のアプリケーションがシステム中に混在している場合でも、同期化を実現することができる。なお、異なる要求レベルをもつ複数のアプリケーションがシステム中に混在する場合には、各アプリケーション毎に個別の同期タイマＳＴ、非同期タイマＡＴ、認証用ウインドウＷｑを設定してもよい。 Further, according to the synchronous processing method, each node autonomously synchronizes the timer T, the asynchronous timer AT, and the time Ts of the authentication window Wq in accordance with a predetermined logic in accordance with a dynamic change in an application request or a change in environmental conditions. , Ta, and Twq are changed, the application's real-time requirements and reliability requirements change dynamically, or multiple applications with different levels of real-time requirements and reliability requirements are mixed in the system But synchronization can be achieved. When a plurality of applications having different request levels are mixed in the system, an individual synchronous timer ST, asynchronous timer AT, and authentication window Wq may be set for each application.

さらに、当該同期処理方法では、必要に応じて同期メッセージＳＭの内部に、同期メッセージＳＭを送信するノードの、ネットワークＮＷ上で重複の無いノード識別番号を記載し、同期メッセージＳＭを受信するノードが、認証用ウインドウＷｑ内にて既定時間内に同一ノードから複数受信した同期メッセージＳＭを、認証用ウインドウＷｑ内に受信した同期メッセージ数ｎｒｔとしてカウントしないことにより、故障したノードが複数の同期メッセージＳＭを送信してしまうことによる障害を排除することができる。 Further, in the synchronization processing method, a node identification number having no duplication on the network NW of the node that transmits the synchronization message SM is described in the synchronization message SM as necessary, and the node that receives the synchronization message SM By not counting the synchronization messages SM received from the same node within the predetermined time in the authentication window Wq as the number of synchronization messages nrt received in the authentication window Wq, the failed node can receive a plurality of synchronization messages SM. Can be eliminated.

上記の同期タイマＳＴ、非同期タイマＡＴ及び認証用ウインドウＷｑの実装手段としては、ハードウエアによる方法及びソフトウエアによる方法のいずれを用いてもよい。各タイマＳＴ，ＡＴは、その動作を終了すると、割り込み機能等を用いて終了になったことをプロセッサに通知する。非同期タイマＡＴの時間Ｔａは、同期タイマＳＴの時間Ｔｓに比較して長くても、短くても、同じでもよい。 As a means for mounting the synchronous timer ST, the asynchronous timer AT, and the authentication window Wq, either a hardware method or a software method may be used. When the timers ST and AT end their operations, they notify the processor that they have ended using an interrupt function or the like. The time Ta of the asynchronous timer AT may be longer, shorter or the same as the time Ts of the synchronous timer ST.

認証用ウインドウＷｑ中に送受信した複数の同期メッセージＳＭの送受信タイミングから同期タイマＳＴの開始時期を決定するにはいくつかの方法がある（図３及び図４における『ＳＴカウント値設定』の部分）。例えば、同期が成立する条件であるＮ個の同期メッセージＳＭの送受信に対して、ｍ（１≦ｍ≦Ｎ）番目の同期メッセージＳＭを送受信したタイミングを同期タイマＳＴの開始時期に設定することができる。また、同期処理の周期（Ｔｓ）の同期処理毎のばらつきを補正するため、各ノードがｍ番目の同期メッセージＳＭを送受信したタイミングから所定のオフセット時間ｋ（秒）を経過した後、同期タイマＳＴを開始するようにしてもよい。さらに、ノード間のばらつきを補正するために、上記のオフセット時間ｋ（秒）は、ノード毎に異なる値や、負の値としてもよい。別の同期タイマＳＴの開始時期を決定する方法の一例としては、認証用ウインドウＷｑ中に送受信した全ての又は一部の同期メッセージＳＭの送受信タイミングの平均により決定する方法がある。 There are several methods for determining the start time of the synchronization timer ST from the transmission / reception timings of a plurality of synchronization messages SM transmitted / received during the authentication window Wq (the “ST count value setting” portion in FIGS. 3 and 4). . For example, the timing at which the m (1 ≦ m ≦ N) -th synchronization message SM is transmitted / received for the transmission / reception of N synchronization messages SM, which is a condition for establishing synchronization, may be set as the start time of the synchronization timer ST. it can. Further, in order to correct variations in the synchronization processing cycle (Ts) for each synchronization processing, after a predetermined offset time k (seconds) elapses from the timing at which each node transmits and receives the m-th synchronization message SM, the synchronization timer ST May be started. Furthermore, in order to correct the variation between nodes, the offset time k (seconds) may be different for each node or a negative value. As an example of a method of determining the start time of another synchronization timer ST, there is a method of determining by the average of transmission / reception timings of all or a part of the synchronization messages SM transmitted / received during the authentication window Wq.

認証用ウインドウＷｑの時間Ｔｗｑに対する条件としては、ノード間及び単一ノード内の内部処理時間のばらつきをσｔｓ、クロックドリフト時間のばらつきをσｃｄ、ネットワークアクセス時間のばらつきをσｂａとすると、これらのばらつきは当該分散コンピュータシステムを用いるうえで必然的に生じるものであるから、認証用ウインドウＷｑの時間Ｔｗｑは、これらのばらつきを許容する充分な長さになっている必要があり、少なくともσｔｓ＋σｃｄ＋σｂａ＜Ｔｗｑを満足するものとする。 As conditions for the time Twq of the authentication window Wq, assuming that the variation in internal processing time between nodes and within a single node is σts, the variation in clock drift time is σcd, and the variation in network access time is σba, these variations are The time Twq of the authentication window Wq needs to be long enough to allow these variations, and at least satisfies σts + σcd + σba <Twq, since it is inevitably generated when using the distributed computer system. It shall be.

同期メッセージＳＭの内部にタイムスタンプを記載し、複数のノード間において、同期化の際にタイムスタンプの一致化を行うことにより、同期処理周期（Ｔｓ）よりも長い周期で処理を行うアプリケーションにおいて、タイムスタンプを用いてノード間のアプリケーション処理の同期が可能となる。この際、タイムスタンプの設定方法の一例としては、ノードの起動時には０から開始し、同期処理毎に一定値を加算する方法がある。追加や故障からの復帰等のように既存の同期系に途中から組み込まれる場合は、既存の同期系で合意しているタイムスタンプを採用し、そこから加算すればよい。 In an application that performs processing in a cycle longer than the synchronization processing cycle (Ts) by describing a time stamp in the synchronization message SM and performing time stamp matching during synchronization between a plurality of nodes, Application processing can be synchronized between nodes using a time stamp. At this time, as an example of the time stamp setting method, there is a method of starting from 0 when the node is activated and adding a constant value for each synchronization process. When it is incorporated into the existing synchronous system from the middle, such as addition or recovery from failure, a time stamp agreed on by the existing synchronous system may be adopted and added from there.

また、ノードに用いるクロックのドリフトレートを最大ρ（秒／秒）とし、アプリケーションの要求するノード間の同期精度をΠ（秒）とし、同期処理直後のノード間の同期ばらつき最大値をΠｉ（秒）とするとき、同期処理周期（同期タイマＳＴの設定時間）Ｔｓ（秒）は、Ｔｓ＜（Π−Πｉ）／２ρを満足するものとする。本発明では、アプリケーションの要求する同期精度Πが時間とともに変化するとき、又は異なる同期精度要求の複数のアプリケーションがシステム中に混在するとき、これらに合わせて各ノードが予め定めたロジック（計算式）に従って同期処理周期Ｔｓを長くしたり短くしたりすることが可能である。 The maximum drift rate of the clock used for the node is ρ (seconds / second), the synchronization accuracy between the nodes required by the application is Π (seconds), and the maximum synchronization variation between the nodes immediately after the synchronization processing is Πi (seconds). ), The synchronization processing period (set time of the synchronization timer ST) Ts (seconds) satisfies Ts <(Π−Πi) / 2ρ. In the present invention, when the synchronization accuracy required by an application changes with time, or when a plurality of applications having different synchronization accuracy requirements coexist in the system, each node determines a logic (calculation formula) determined in advance. Accordingly, the synchronization processing cycle Ts can be lengthened or shortened.

さらに、認証用ウインドウＷｑの時間Ｔｗｑ（秒）は、同期条件成立に必要な同期メッセージＳＭの数Ｎと関係がある。つまり、誤ったノード同士が同期し、他の正常なノードが誤ったタイミングで同期する誤同期の現象を回避するためには、同期メッセージＳＭの数Ｎを大きくすればよいが、この数Ｎを大きくするとそれに応じて認証用ウインドウＷｑの時間Ｔｗｑを長く設定する必要がある。 Further, the time Twq (second) of the authentication window Wq is related to the number N of synchronization messages SM necessary for establishing the synchronization condition. That is, in order to avoid the phenomenon of false synchronization in which wrong nodes are synchronized and other normal nodes are synchronized at wrong timing, the number N of synchronization messages SM may be increased. If it is increased, the time Twq of the authentication window Wq needs to be set longer accordingly.

当該同期処理方法では、誤同期を回避する要求（同期信頼性要求）が時間とともに変化するとき、又は異なるレベルの同期信頼性要求をもつ複数のアプリケーションがシステム中に混在するときに、それに合わせて認証用ウインドウＷｑの時間Ｔｗｑを変化させることができる。一般に、同期タイマＳＴの時間Ｔｓを長く、又は認証用ウインドウＷｑの時間Ｔｗｑを短くすることにより、同期処理に費やす時間を相対的に短くし、アプリケーション処理に費やすことのできる時間を相対的に長くし得るという利点がある。同期精度要求や同期信頼性要求の時間変化の各ノードへの通知は、例えばネットワーク接続インターフェースやローカルＩ／Ｏインターフェースを通して行うことができる。 In the synchronization processing method, when a request for avoiding mis-synchronization (synchronous reliability request) changes with time, or when a plurality of applications having different levels of synchronous reliability requirements coexist in the system, it is adjusted accordingly. The time Twq of the authentication window Wq can be changed. Generally, by shortening the time Ts of the synchronization timer ST or shortening the time Twq of the authentication window Wq, the time spent for the synchronization process is relatively shortened, and the time that can be spent for the application process is relatively long. There is an advantage that can be. The notification of the time change of the synchronization accuracy request or the synchronization reliability request to each node can be performed, for example, through a network connection interface or a local I / O interface.

次に、図５〜図１０に基づいて、上記したアルゴリズムに従って各ノードが処理を行った場合の具体的な動作について説明する。各図は、ある一つのノードの処理過程を示すもので、左から右に向けて時間の経過を表わしている。また、以下の例では、同期成立の条件としてＮ＝２の場合を示し、さらに、認証用ウインドウＷｑ中に二番目の同期メッセージＳＭを受信した時点を次の同期タイマＳＴの開始時期としている。 Next, specific operations when each node performs processing according to the algorithm described above will be described with reference to FIGS. Each figure shows the process of a certain node, and represents the passage of time from left to right. The following example shows a case where N = 2 as a condition for establishing synchronization, and the time when the second synchronization message SM is received in the authentication window Wq is set as the start time of the next synchronization timer ST.

図５は、同期状態の更新と非同期要素の排除を説明する図である。各ノードは、同期処理を行う周期に相当する時間Ｔｓの同期タイマＳＴを開始し、この同期タイマＳＴが終了すると、図５中（ａ）に示すように、ネットワークＮＷ上に同期メッセージＳＭをブロードキャストして全ての他ノードに同期メッセージＳＭを送信する。また、認証用ウインドウＷｑの非動作時に、ネットワークＮＷから他ノードの同期メッセージＳＭを受信すると、図５中（ｂ）に示すように、認証用ウインドウＷｑを開始する。そして、認証用ウインドウＷｑ中に二つ目の同期メッセージＳＭを送受信した際、図５中（ｃ）に示すように、同期条件成立として新たに同期タイマＳＴを開始する。 FIG. 5 is a diagram for explaining synchronization state update and removal of asynchronous elements. Each node starts a synchronization timer ST at a time Ts corresponding to the period for performing the synchronization process, and when this synchronization timer ST ends, broadcasts a synchronization message SM on the network NW as shown in FIG. Then, the synchronization message SM is transmitted to all other nodes. Further, when the synchronization message SM of another node is received from the network NW when the authentication window Wq is not operating, the authentication window Wq is started as shown in FIG. 5 (b). When the second synchronization message SM is transmitted / received during the authentication window Wq, the synchronization timer ST is newly started as the synchronization condition is satisfied, as shown in FIG. 5C.

また、同期条件が成立している状態において、他ノードからの同期メッセージＳＭを受信し、これにより開始した認証用ウインドウＷｑ中に二つ目の同期メッセージＳＭを受信しない場合は、図５中（ｄ）に示すように、一つ目の同期メッセージＳＭが故障発生等による非同期ノードからのものであると判断してこれを無視し、同期タイマＳＴの開始を行わない。これにより、多くのケースにおいては、非同期の同期メッセージＳＭを各ノードにおいて排除する。偶然、不正なタイミングの同期メッセージＳＭに同期する充分な数の別の非同期なノードがあった場合には、この不正なタイミングで同期条件が成立することになる。ただし、この場合においても、全てのノードが今度はこれを正として追随するため、複数の異なる同期系が存在し続けることはなく、一つの同期系に収束する。また、以上の様な不正なタイミングでの同期の可能性を低くするためには、同期が成立する条件であるＮを大きくすればよい。 When the synchronization message SM is received from another node in the state where the synchronization condition is satisfied, and the second synchronization message SM is not received in the authentication window Wq started by this, FIG. As shown in d), it is determined that the first synchronization message SM is from an asynchronous node due to the occurrence of a failure or the like, and is ignored and the synchronization timer ST is not started. This in many cases eliminates asynchronous synchronous messages SM at each node. If there is a sufficient number of other asynchronous nodes that synchronize with the synchronization message SM at the incorrect timing, the synchronization condition is satisfied at the incorrect timing. However, even in this case, since all the nodes follow this as a positive state, a plurality of different synchronous systems do not continue to converge and converge to one synchronous system. Further, in order to reduce the possibility of synchronization at the incorrect timing as described above, N, which is a condition for establishing synchronization, may be increased.

図６は、同期状態の更新を説明する図である。ここでは、他ノードの同期メッセージＳＭを受信する前に自ノードの同期タイマＳＴが終了し、図６中（ａ）に示すように、同期メッセージＳＭを送信すると共に、図６中（ｂ）に示すように、認証用ウインドウＷｑを開始して一度同期遷移状態に移った後、認証用ウインドウＷｑ中に他ノードの同期メッセージＳＭを少なくとも一つ受信することにより、図６（ｃ）に示すように、同期条件が成立した様子を示している。 FIG. 6 is a diagram for explaining the update of the synchronization state. Here, before receiving the synchronization message SM of the other node, the synchronization timer ST of the own node expires, and as shown in FIG. 6A, the synchronization message SM is transmitted, and in FIG. As shown in FIG. 6C, after starting the authentication window Wq and moving to the synchronous transition state once, at least one synchronization message SM of another node is received in the authentication window Wq. Shows how the synchronization condition is satisfied.

図７は、同期メッセージＳＭの自律的送信の抑制を説明する図である。ここでは、各ノードは、自らの同期タイマＳＴが終了する前に、他ノードからの同期メッセージＳＭを受信して認証用ウインドウＷｑを開始し、この認証用ウインドウＷｑ中に同期条件が成立した場合（同期条件成立時に同期タイマＳＴが終了していない場合）において、同期メッセージＳＭをネットワークＮＷにブロードキャストしない様子を示している。各ノードがこのような動作を行うことで、同期を行うノード数が多くても同期メッセージＳＭの増加によってネットワークＮＷの負荷が増加することがなく、同期を成立させることができる。したがって、同期を行うノード数の制限をなくすことができる。 FIG. 7 is a diagram for explaining suppression of autonomous transmission of the synchronization message SM. Here, each node receives a synchronization message SM from another node before starting its own synchronization timer ST, starts an authentication window Wq, and a synchronization condition is satisfied in this authentication window Wq. In the case where the synchronization timer ST has not expired when the synchronization condition is satisfied, the synchronization message SM is not broadcast to the network NW. By performing such an operation, each node can establish synchronization without increasing the load on the network NW due to an increase in the synchronization message SM even if the number of nodes performing synchronization is large. Therefore, the limitation on the number of nodes performing synchronization can be eliminated.

図８は、既存の同期系へのノードの追加／故障ノードの復帰時の動作を説明する図である。より具体的には、既存の同期系に新たに複数のノードを追加する場合、又はあるノードに故障が発生し、その修復のためにノードが一度同期系を離脱（オフライン）した後に復帰する場合の動作を示している。復帰の際に想定している故障の中には一過性の故障も含まれており、この場合はＥＤＡＣ回路によるメモリ内容の修正、ノードの電源ＯＦＦ／再電源ＯＮ（パワーサイクル）、及びプロセッサのリセット等によって修復が行われる。 FIG. 8 is a diagram for explaining an operation when adding a node to an existing synchronous system / returning a failed node. More specifically, when a plurality of nodes are newly added to the existing synchronous system, or when a failure occurs in a certain node and the node once leaves the synchronous system (offline) to repair it, and then returns. Shows the operation. The failure assumed at the time of recovery includes a transient failure. In this case, the memory contents are corrected by the EDAC circuit, the node is powered off / repowered (power cycle), and the processor Restoration is performed by resetting or the like.

ノードを既存の同期系に追加する場合、又は故障が発生したノードに対して上記の修復が成された後（パワーサイクル又はリセットの場合は初期化処理が終了した後）、ノードは非同期状態となって非同期タイマＡＴを開始する。このとき、追加／復帰ノード以外は同期状態にあり、同期系を構成しているので、非同期状態の経過時間が同期タイマＳＴの時間Ｔｓに達する前に、認証用ウインドウＷｑ中に他ノードの同期メッセージＳＭを二つ以上受信することとなる。すなわち、非同期タイマＡＴの開始から時間Ｔｓ以内に追加／復帰ノードが同期系に組み込まれる。 When a node is added to an existing synchronous system, or after the above repair has been performed on a failed node (after initialization in the case of a power cycle or reset), the node Asynchronous timer AT starts. At this time, since the nodes other than the addition / return node are in a synchronous state and constitute a synchronous system, before the elapsed time of the asynchronous state reaches the time Ts of the synchronous timer ST, the synchronization of the other node is performed in the authentication window Wq. Two or more messages SM are received. That is, the add / return node is incorporated into the synchronous system within the time Ts from the start of the asynchronous timer AT.

非同期タイマＡＴの時間Ｔａが同期タイマＳＴの時間Ｔｓよりも短い場合には、非同期タイマＡＴの時間Ｔａ以内に他ノードの同期メッセージＳＭを受信しないと、非同期タイマＡＴが終了する。このとき、このノードは、同期メッセージＳＭをネットワークＮＷにブロードキャストする。しかし、このタイミングで同期メッセージＳＭを送信するのは追加／復帰ノードのみなので、同時に追加／復帰するノード数が同期成立条件であるＮ式に満たない場合には、既に同期している他ノードからは無視され、既存の同期系が乱されることはない。 If the time Ta of the asynchronous timer AT is shorter than the time Ts of the synchronous timer ST, the asynchronous timer AT ends if the synchronous message SM of another node is not received within the time Ta of the asynchronous timer AT. At this time, this node broadcasts the synchronization message SM to the network NW. However, since the synchronization message SM is transmitted only at the add / return node at this timing, if the number of nodes to be added / return at the same time does not satisfy the formula N, which is the condition for establishing synchronization, the other nodes already synchronized Is ignored and the existing synchronous system is not disturbed.

以上のように、追加／復帰ノードは、動作中の正常ノードの動作に何ら影響を与えないので、コンピュータシステムとしては無停止で追加／復帰ノードを既存の同期系に組み込むことが可能となる。これは当該同期処理方法の無停止性及び拡張性が高いことを意味する。なお、故障等により、同期しているノード数が同期成立の条件であるＮ式未満となった場合は、後述するシステム起動時の動作に相当する。 As described above, since the add / return node does not affect the operation of the normal node that is operating, the add / return node can be incorporated into the existing synchronous system without stopping as a computer system. This means that the non-stop property and expandability of the synchronous processing method are high. Note that when the number of synchronized nodes is less than the N formula, which is a condition for establishing synchronization, due to a failure or the like, this corresponds to an operation at the time of system startup described later.

図９は、コンピュータシステム中にノードが一式である場合のシステム初期時の動作を説明する参考図である。ノードは、電源がＯＮされて初期化処理が終了した後、復帰時と同様に非同期状態となって非同期タイマＡＴを開始する。そして、非同期タイマＡＴが終了した際、先述のアルゴリズムに従って同期メッセージＳＭをネットワークＮＷにブロードキャストし、認証用ウインドウＷｑを開始して同期遷移状態となる。ところが、システム中に他のノードが存在しないので、認証用ウインドウＷｑ中に他ノードからの同期メッセージＳＭを受信することはなく、これにより同期条件が成立せず、再び非同期タイマＡＴを開始して非同期状態に戻る。このように、システム中にノードが一式しかない場合には、以上の動作が繰り返し行われる。 FIG. 9 is a reference diagram for explaining the operation at the initial stage of the system when the computer system has a set of nodes. After the power is turned on and the initialization process is completed, the node enters an asynchronous state and starts the asynchronous timer AT similarly to the return. When the asynchronous timer AT expires, the synchronous message SM is broadcast to the network NW according to the algorithm described above, and the authentication window Wq is started to enter a synchronous transition state. However, since there is no other node in the system, the synchronization message SM from the other node is not received during the authentication window Wq, so that the synchronization condition is not satisfied and the asynchronous timer AT is started again. Return to the asynchronous state. Thus, when there is only one set of nodes in the system, the above operation is repeated.

図１０は、システム起動時の動作を説明する図であって、一例として、一番目に電源がＯＮされたノード（上側）と、二番目に電源がＯＮされたノード（下側）の二つの動作を示している。 FIG. 10 is a diagram for explaining the operation at the time of starting the system. As an example, there are two nodes: the first node that is powered on (upper side) and the second node that is powered on (lower side). The operation is shown.

コンピュータシステム中に同期系を構成すべき複数のノードが存在するとき、全体システムの起動時には全てのノードの電源がほぼ同時にＯＮされ、およそ同時間に初期化を終了すると考えられる。各ノードは、初期化終了後、復帰時と同様に非同期タイマＡＴを開始し、最初に非同期タイマＡＴが終了した一番目のノードが、ネットワークＮＷに同期メッセージＳＭをブロードキャストする。このとき、二番目にＯＮされたノードは、非同期タイマＡＴが終了する前に、一番目にＯＮされたノードからの同期メッセージＳＭを受信している。 When there are a plurality of nodes that should form a synchronous system in the computer system, it is considered that the power supplies of all the nodes are turned on almost simultaneously when the entire system is started, and the initialization is completed approximately at the same time. After initialization, each node starts an asynchronous timer AT in the same way as when returning, and the first node that first ends the asynchronous timer AT broadcasts the synchronous message SM to the network NW. At this time, the second ON node receives the synchronous message SM from the first ON node before the asynchronous timer AT expires.

非同期状態において同期メッセージＳＭを受信した場合、認証用ウインドウＷｑを開始して同期遷移状態となるが、図１０中（ａ）に示すように、認証用ウインドウＷｑ中に送受信した同期メッセージＳＭの数がＮに満たないために同期条件が成立しない場合には、再び非同期タイマＡＴを開始して非同期状態に戻る。一番目にＯＮされたノードにおいても、やはり同期条件が成立しないので、再び非同期タイマＡＴを開始する。そして、一番目にＯＮされたノードの非同期タイマＡＴが次に終了する時期に二番目にＯＮされたノードの非同期タイマＡＴも終了するため、この時点でいずれのノードからも同期メッセージＳＭが送信され、図１０中（ｂ）に示すように同期条件が成立する。 When the synchronous message SM is received in the asynchronous state, the authentication window Wq is started to enter the synchronous transition state. As shown in FIG. 10A, the number of synchronous messages SM transmitted / received in the authentication window Wq. If the synchronization condition is not satisfied because N is less than N, the asynchronous timer AT is started again to return to the asynchronous state. Even in the first turned-on node, the synchronization condition is not satisfied, so the asynchronous timer AT is started again. Since the asynchronous timer AT of the node that is turned on secondly ends when the asynchronous timer AT of the node that is turned on first ends, the synchronous message SM is transmitted from any node at this time. As shown in FIG. 10B, the synchronization condition is satisfied.

同期条件成立までの時間は、最初のＮ式の非同期タイマＡＴの開始時間差で決まる。すなわち、最初のＮノードの初期化終了後の非同期タイマＡＴの開始時間差をＴｇ（秒）とすると、Ｎ式の同期条件成立までの時間Ｔｓｙｎ（秒）は、Ｔｇ＝０のときにＴｓｙｎ＝Ｔａ（非同期タイマ長）となり、０＜Ｔｇ＜ＴａのときにＴｓｙｎ＝２Ｔａとなり、Ｔａ＜Ｔｇ＜２ＴａのときにＴｓｙｎ＝３Ｔａ（以下同様）となる。 The time until the synchronization condition is satisfied is determined by the start time difference of the first N-type asynchronous timer AT. That is, assuming that the start time difference of the asynchronous timer AT after the initialization of the first N node is Tg (seconds), the time Tsyn (seconds) until the N-type synchronization condition is satisfied is Tsyn = Ta when Tg = 0. (Asynchronous timer length), Tsyn = 2Ta when 0 <Tg <Ta, and Tsyn = 3Ta (hereinafter the same) when Ta <Tg <2Ta.

また、この同期条件成立までに非同期タイマＡＴを開始したノードは、全て上記時間で同様に同期する。さらに、ＯＮされるタイミングが遅く、同期条件成立以降に非同期タイマＡＴを開始したノードの同期条件成立時間Ｔｓｙｎは復帰と同じ扱いになり、非同期タイマＡＴの開始後、同期タイマＳＴの時間Ｔｓ以内に同期条件成立となる。 Further, all the nodes that have started the asynchronous timer AT before the establishment of the synchronization condition are similarly synchronized at the above time. Further, the synchronization condition establishment time Tsyn of the node that is turned on late and started the asynchronous timer AT after the establishment of the synchronization condition is treated the same as the return, and within the time Ts of the synchronization timer ST after the start of the asynchronous timer AT. The synchronization condition is satisfied.

以上により、非同期タイマＡＴの時間Ｔａを同期タイマＳＴの時間Ｔｓよりも短くすることで、システム起動時の同期条件成立時間Ｔｓｙｎをノード追加／復帰時の同期組込み時間Ｔｒｓ（最大Ｔｓ）よりも短くすることができる。ただし、非同期タイマＡＴの時間Ｔａをあまり短くすると、追加／復帰時に再同期が成立するまでの間にブロードキャストされる同期メッセージＳＭが多くなる。このように、システム起動時においても、特別なハードウエアを使用することなく、追加／復帰時と同じロジックを用いて同期する必要のあるノード数にかかわりなく同期を成立させることできる。 As described above, by setting the time Ta of the asynchronous timer AT to be shorter than the time Ts of the synchronous timer ST, the synchronization condition establishment time Tsyn at the time of system startup is shorter than the synchronous incorporation time Trs (maximum Ts) at the time of node addition / return. can do. However, if the time Ta of the asynchronous timer AT is made too short, the number of synchronization messages SM broadcast until resynchronization is established at the time of addition / return increases. In this way, even when the system is started, synchronization can be established without using special hardware, regardless of the number of nodes that need to be synchronized using the same logic as when adding / returning.

分散コンピュータシステムの一例を説明する図である。1 is a diagram illustrating an example of a distributed computer system. コンピュータノードの構成を説明する図である。It is a figure explaining the structure of a computer node. ノードの状態遷移図である。It is a state transition diagram of a node. 同期成立条件としてＮ＝２の場合におけるノードの状態遷移図である。It is a state transition diagram of a node when N = 2 as a synchronization establishment condition. 同期状態の更新と非同期要素の排除を説明する図である。It is a figure explaining the update of a synchronous state, and exclusion of an asynchronous element. 同期状態の更新を説明する図である。It is a figure explaining the update of a synchronous state. 同期メッセージの自律的送信の抑制を説明する図である。It is a figure explaining suppression of the autonomous transmission of a synchronous message. 既存の同期系へのノードの追加／故障ノードの復帰時の動作を説明する図である。It is a figure explaining the operation | movement at the time of the addition of the node to the existing synchronous system / return of a failure node. コンピュータシステム中にノードが一式である場合のシステム初期時の動作を説明する参考図である。It is a reference figure explaining operation | movement at the time of the system initial stage in case a node is a set in a computer system. システム起動時の動作を説明する図である。It is a figure explaining operation | movement at the time of system starting.

Explanation of symbols

１ノード
ＡＴ非同期タイマ
ＮＷネットワーク
ＳＭ同期メッセージ
ＳＴ同期タイマ
Ｗｑ認証用ウインドウ 1 node AT asynchronous timer NW network SM synchronous message ST synchronous timer Wq Authentication window

Claims

When synchronizing computer nodes in a distributed computer system in which a plurality of computers are connected to each other via a network, each node starts a synchronization timer for a time corresponding to the period for performing the synchronization processing, and the network is terminated with the end of the synchronization timer. While broadcasting a synchronization message and sending a synchronization message to all other nodes, when a synchronization message is transmitted / received, an authentication window for a predetermined time is started, and a plurality of predetermined synchronizations are performed in the authentication window. When a message is sent / received, the start time of its own synchronization timer is autonomously adjusted so that it matches the common timing obtained from the transmission / reception timing of the synchronization message as the synchronization condition is satisfied, and the synchronization timer is started at that start time. And the synchronization timer of the local node expires when the synchronization condition is met If not, the synchronization processing method of distributed computer system, characterized in that to start again a synchronization timer without broadcast synchronization messages over the network.

When adding a node to an existing synchronous system, when the node is reset or after power is turned on, or when a failure occurs in the node, the corresponding node becomes Asynchronous timer is started for a predetermined time, and when the asynchronous timer ends, a synchronous message is broadcast on the network and sent to all other nodes. When the window is started and a plurality of predetermined synchronization messages are sent and received during the authentication window, the synchronization timer is started as the synchronization condition is established, and the synchronization condition is not found in the authentication window even though the synchronization message is sent and received. 2. The distributed controller according to claim 1, wherein if it does not hold, the asynchronous timer is restarted. Synchronization processing method of Yutashisutemu.

The time stamp is described inside the synchronization message as necessary, and the time stamps are matched simultaneously with the synchronization timer start timings among a plurality of nodes. Method for synchronous processing of distributed computer systems.

The node according to any one of claims 1 to 3, wherein each node autonomously changes the time of the synchronous timer, the asynchronous timer, and the authentication window in response to a dynamic change in an application request or a change in environmental conditions. The synchronous processing method of the distributed computer system of description.

If necessary, describe the node identification number of the node that sends the synchronization message in the synchronization message so that there is no duplication on the network, and the node that receives the synchronization message is the same within the authentication window within the predetermined time. 5. The synchronization processing method for a distributed computer system according to claim 1, wherein a plurality of synchronization messages received from the node are not counted as the number of synchronization messages received in the authentication window.