JPH08185330A

JPH08185330A - Method for switching redundant computer system

Info

Publication number: JPH08185330A
Application number: JP6326666A
Authority: JP
Inventors: Manabu Tsukada; 学塚田; Masahide Yamashita; 正秀山下; Hiroyuki Yamashita; 博之山下
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1994-12-28
Filing date: 1994-12-28
Publication date: 1996-07-16

Abstract

PURPOSE: To provide a redundant computer system switching method capable of more economically attaining system reliability equivalent to or more than conventional one. CONSTITUTION: Primary stand-by computers 102, 112, 122, 132 fixedly allocated to current computers 101, 111, 121, 131 to execute high speed backup are connected to a transmission line 160 and secondary stand-by computers 141, 142 for improving the reliability of the system are also connected to the transmission line 160 so as to be shared by all the current computers 102, 112, 122, 132 and the primary stand-by computers 101, 111, 121, 131. In order to secure the reliability of the stand-by computers, all the computers are periodically driven as current computers by a centralized computer managing device 150 at the time of normal operation so as to quickly detect abnormality, and when abnormality is generated in the current computers, the primary stand-by computers 102, 112, 122, 132 are switched to current computers to execute high speed backup and the shared secondary stand-by computers 141, 142 are allocated to the new primary stand-by computers.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、複数の現用コンピュー
タと、この現用コンピュータの予備に当たる１次予備コ
ンピュータと、僅かな共用２次予備コンピュータと、コ
ンピュータ間の切り替え制御を集中的に行う集中コンピ
ュータ管理装置とで構成される冗長コンピュータシステ
ムの切り替え方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a plurality of active computers, a primary spare computer which is a spare of the active computers, a small shared secondary spare computer, and a centralized computer which centrally controls switching between computers. The present invention relates to a method of switching a redundant computer system configured with a management device.

【０００２】[0002]

【従来の技術】従来は、コンピュータ異常時のバックア
ップ方法として、以下の方法が採られていた。（１）ｎ重化構成：現用コンピュータ対応に固定的に予
備コンピュータが割り当てられており、現用コンピュー
タに障害が発生した場合、前記予備コンピュータが処理
を引き継ぐ方法である。また、更に高信頼度が要求され
る場合には上記予備コンピュータを増設するなどして、
冗長度を高める方法である。一方、前記予備コンピュー
タの正常性確認は、定期保守時等に保守試験プログラム
を実行することにより行っていた。（２）ｋｏｕｔｏｆｎ構成：ＬＣＭＰのネットワ
ーク分散システムにおいて、ｋ台のコンピュータが必要
な場合、ｎ台のコンピュータを用意し、コンピュータが
異常時に他コンピュータが処理をバックアップする方法
である。（３）ＤＩＰＳの高速予備切り替え方式（ＮＴＴ研究実
用化報告第３６巻第８号Ｐ１０２１〜１０５７記載）：
２台の現用コンピュータと１台の待機系コンピュータで
チェックポイントデータファイルを共有し、予備系コン
ピュータは２台の現用コンピュータとプログラムとデー
タをロードシェアシステムとして動作可能にしておき、
現用コンピュータ異常時に、これらコンピュータを集中
管理するシステム制御処理装置の制御により、予備コン
ピュータにチェックポイントデータファイルを利用しバ
ックアップさせる方法である。2. Description of the Related Art Conventionally, the following method has been adopted as a backup method when a computer malfunctions. (1) n-redundant configuration: a method in which a spare computer is fixedly assigned for the active computer and the spare computer takes over the processing when the active computer fails. Also, if higher reliability is required, add the above spare computer,
This is a method of increasing redundancy. On the other hand, the normality of the spare computer has been confirmed by executing a maintenance test program at the time of regular maintenance. (2) k out of n configuration: In a network distributed system of LCMP, when k computers are required, n computers are prepared, and when a computer is abnormal, another computer backs up processing. (3) High-speed preliminary switching method of DIPS (described in NTT Research Practical Report Vol. 36, No. 8, P1021 to 1057):
The checkpoint data file is shared between the two active computers and one standby computer, and the standby computer keeps the two active computers and the programs and data operable as a load sharing system.
This is a method of backing up a checkpoint data file to a spare computer under the control of a system control processing unit that centrally manages these computers when the current computer is abnormal.

【０００３】[0003]

【発明が解決しようとする課題】上記従来技術において
は、以下の問題があった。（１）ｎ重化構成では、現用コンピュータ対応に固定的
に予備コンピュータが割り付けられており、現用コンピ
ュータの状態変化に同期して予備コンピュータの状態を
更新することにより、現用コンピュータ異常時に高速バ
ックアップを実現することが可能である。しかし、２重
冗長構成以上の信頼度を確保するためには、予備（以下
１次予備と呼ぶ）に更に予備（以下２次予備と呼ぶ）を
追加する必要があり、システム全体での予備コンピュー
タのコストが約（ｎ−１）／ｎを占め不経済であった。
また、特にｎ−１が予備コンピュータの場合には予備コ
ンピュータの経年劣化などによる異常のため既に故障し
ている場合には、現用コンピュータ異常時に多重障害が
発生することによりシステムダウンになる問題もある。（２）ｋ台の現用コンピュータと（ｎ−ｋ）台の予備コ
ンピュータから構成されるｋｏｕｔｏｆｎ構成シ
ステムおよびＤＩＰＳの高速予備切り替え方式は、予備
コンピュータは、現用コンピュータ異常時にこれをバッ
クアップする共用予備コンピュータの状態を、故障にな
ったコンピュータの故障になる直前の状態にする必要が
あるが、共用しているため故障以前の状態にするまでに
時間がかかる問題があった。また、共用予備コンピュー
タは現用コンピュータ異常時に使用されるため、共用予
備コンピュータが経年劣化などにより故障している場合
には、現用コンピュータが故障するケースでは多重障害
になり切り替えに時間がかかるとともに信頼性が著しく
低下する問題があった。本発明の目的は、現用コンピュ
ータに固定的に割り当てられた、高速バックアップを実
現する１次予備コンピュータを伝送路に接続し、更にシ
ステムの高信頼化を図るための２次予備コンピュータも
伝送路に接続し、この２次予備コンピュータを全現用及
び前記１次予備コンピュータで共用し、かつ、予備コン
ピュータの信頼性確保のため、正常時に、周期的に全て
のコンピュータを現用コンピュータとして動作させ、異
常を迅速に検出可能とし、また、現用コンピュータ異常
時に、１次予備コンピュータを現用に切り替えて高速バ
ックアップするとともに、共用２次予備コンピュータを
この新たな１次予備コンピュータに割り当て、経済的
に、従来と同等以上のシステム信頼性を実現する冗長コ
ンピュータシステム切り替え方法を提供することにあ
る。The above-mentioned conventional techniques have the following problems. (1) In the n-duplex configuration, the spare computer is fixedly allocated for the active computer, and the status of the spare computer is updated in synchronization with the status change of the active computer, so that high-speed backup can be performed when the active computer fails. It can be realized. However, in order to secure the reliability of the double redundant configuration or more, it is necessary to add a spare (hereinafter referred to as a secondary spare) to the spare (hereinafter referred to as a primary spare). The cost was about (n-1) / n, which was uneconomical.
Further, particularly when n-1 is a spare computer, if the spare computer has already failed due to an abnormality due to aged deterioration or the like, there is a problem that a system failure occurs due to multiple failures when the working computer is abnormal. . (2) The k out of n configuration system composed of k active computers and (n−k) standby computers and the high-speed standby switching method of DIPS are shared by the backup computer when the active computer fails. It is necessary to set the state of the spare computer to the state immediately before the failure of the failed computer, but since it is shared, there is a problem that it takes time to bring it to the state before the failure. Also, since the shared spare computer is used when the active computer is abnormal, if the shared spare computer has failed due to aging, etc., multiple failures will occur in the case where the active computer fails, and switching will take time and reliability will be high. However, there was a problem in that An object of the present invention is to connect a primary spare computer, which is fixedly assigned to an active computer and realizes high-speed backup, to a transmission line, and also a secondary spare computer for achieving high system reliability as a transmission line. Connected, this secondary spare computer is shared by all working and primary spare computers, and in order to ensure reliability of the spare computer, at normal times, all computers are made to operate periodically as an active computer and abnormalities are detected. It can be detected quickly, and when the working computer is abnormal, the primary spare computer is switched to the working one for high-speed backup, and the shared secondary spare computer is assigned to this new primary spare computer. A redundant computer system switching method for realizing the above system reliability is provided. In the door.

【０００４】[0004]

【課題を解決するための手段】上記目的を達成するた
め、全コンピュータの動作モード（現用、１次予備、２次
予備、異常）、ノードアドレスを管理する状態管理テー
ブル（図２の２３１）、全コンピュータを対象とする動作モード変更タイミン
グを通知するタイマ（図２の２４１）を有し、図３〜図
５に示すフローに従ってコンピュータ間の切り替え制御
を行う切り替え制御処理部（図２の２４０）、図６、図７に示すフローに従って、前記状態管理テー
ブル読み出し／更新を行うコンピュータ管理部（図２の
２３０）、図９に示す処理フローに従って、前記切り替え制御処
理部から通知された異常コンピュータのノードアドレス
情報をネットワーク管理者に通知したり、異常コンピュ
ータを修復した保守者からの修復完了通知によりネット
ワーク管理者がコンピュータ状態を更新する等のために
使用する端末とのインタフェース制御を行う管理者イン
タフェース制御部（図２の２５０）、の要素〜を追
加したコンピュータを集中コンピュータ管理装置（図１
の１５０）として設置し、ａ．コンピュータのノードアドレス及びそのコンピュー
タと現用或いは１次予備の関係にあるコンピュータのノ
ードアドレス格納用のアドレステーブル（図１０の４３
１）、ｂ．図１１〜図１５に示すフローに従ってコンピュータ
間の切り替え処理を行う切り替え処理部（図１０の４５
０）、ｃ．第１タイマ（図１０の４４１）と第２タイマ（図１
０の４４２）を有し、図１８に示すフローに従って現用
コンピュータの異常を監視する監視処理部（図１０の４
４０）、ｄ．図１６に示すフローに従って前記監視処理部と前記
切り替え処理部から、及び集中コンピュータ管理装置か
らの指示により、前記アドレステーブルを更新したり読
み出したりするアドレス管理部（図１０の４３０）、ｅ．図１７に示す動作フローに従い動作するホットスタ
ンバイ処理部（図１０の４６０）、の要素ａ〜ｅを、現用、１次予備及び２次予備のコンピ
ュータに追加し、これら全てのコンピュータを伝送路で
接続する。In order to achieve the above object, an operation mode of all computers (active, primary spare, secondary spare, abnormal), a state management table for managing node addresses (231 in FIG. 2), A switching control processing unit (240 in FIG. 2) having a timer (241 in FIG. 2) for notifying the operation mode change timing for all computers and performing switching control between computers according to the flows shown in FIGS. , A computer management unit (230 in FIG. 2) for reading / updating the status management table according to the flows shown in FIGS. 6 and 7, and an abnormal computer notified from the switching control processing unit according to the processing flow shown in FIG. 9. The node address information is notified to the network administrator, or the repair completion notification is sent from the maintenance person who repaired the abnormal computer. A computer to which an element (1) of an administrator interface control unit (250 in FIG. 2) for performing interface control with a terminal used by the network administrator for updating the computer state, etc. is added to the central computer management device (FIG. 1).
150), and a. An address table for storing the node address of the computer and the node address of the computer which is in a current or primary spare relationship with the computer (43 in FIG. 10).
1), b. A switching processing unit (45 in FIG. 10) that performs switching processing between computers according to the flows shown in FIGS.
0), c. The first timer (441 in FIG. 10) and the second timer (FIG. 1)
No. 442 of FIG. 0) and monitors the abnormality of the active computer according to the flow shown in FIG. 18 (4 in FIG. 10).
40), d. An address management unit (430 in FIG. 10) that updates or reads the address table in accordance with instructions from the monitoring processing unit and the switching processing unit and the centralized computer management device according to the flow shown in FIG. 16, e. Elements a to e of the hot standby processing unit (460 in FIG. 10) that operates according to the operation flow shown in FIG. 17 are added to the working primary backup computer and secondary backup computer, and all of these computers are connected in the transmission path. Connecting.

【０００５】[0005]

【作用】本発明においては、上記手段を追加することに
より、集中コンピュータ管理装置が、全コンピュータを
管理し、これらコンピュータ間で一定時間毎に順次、１
次予備コンピュータから２次予備コンピュータへ、現用
から１次予備コンピュータに保持情報をそっくり送信し
て動作モードを変更させ、全コンピュータを現用コンピ
ュータとして動作させる。また、１次予備コンピュータ
が現用コンピュータの異常を検出すると、その１次予備
コンピュータは直ちに現用コンピュータになり高速バッ
クアップし、集中コンピュータ管理装置に検出したコン
ピュータの異常通知を通知する。コンピュータの異常を
通知された集中コンピュータ管理装置は、周期的動作モ
ード変更制御を中断し、任意の２次予備コンピュータ
を、新たに現用になり異常通知したコンピュータの１次
予備として割り当て、その新たに現用になったコンピュ
ータから保持情報をそっくり新たに割り当てた１次予備
コンピュータに転送させて１次予備コンピュータにする
ことにより、経済的な構成で迅速な検出および切り替え
が可能となる。In the present invention, by adding the above means, the centralized computer management device manages all the computers, and these computers are sequentially operated at regular time intervals.
The holding information is transmitted from the next spare computer to the second spare computer, and the operation mode is changed so that all the computers operate as the current computer. When the primary spare computer detects an abnormality in the active computer, the primary spare computer immediately becomes the active computer for high-speed backup and notifies the central computer management device of the detected abnormality of the computer. The centralized computer management device notified of the abnormality of the computer interrupts the periodical operation mode change control, assigns an arbitrary secondary spare computer as the primary spare of the computer which is newly in use and notified of the abnormality, and newly allocates it. By transferring the retained information from the currently used computer to the newly assigned primary spare computer to become the primary spare computer, rapid detection and switching can be performed with an economical configuration.

【０００６】[0006]

【実施例】以下、本発明の一実施例を図面により説明す
る。図１は、本発明の一実施例におけるコンピュータシ
ステムの構成図で、複数のコンピュータ、集中コンピュ
ータ管理装置１５０及び伝送路１６０から構成される。
１０１、１１１、１２１、１３１は現用コンピュータ
で、１０２、１１２、１２２、１３２は１次予備コンピ
ュータであり、それぞれ１０１と１０２、１１１と１１
２、１２１と１２２、１３１と１３２が現用コンピュー
タと１次予備コンピュータとの対応関係を示す。１４１
と１４２は共用の２次予備コンピュータである。なお、
点線部分は各コンピュータ内の追加部分を示す。An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram of a computer system according to an embodiment of the present invention, which comprises a plurality of computers, a central computer management device 150 and a transmission line 160.
101, 111, 121 and 131 are active computers, and 102, 112, 122 and 132 are primary spare computers, 101 and 102, 111 and 11 respectively.
Reference numerals 2, 121 and 122, 131 and 132 indicate the correspondence between the active computer and the primary spare computer. 141
And 142 are shared secondary spare computers. In addition,
Dotted lines indicate additional parts within each computer.

【０００７】図２は、本発明の一実施例における集中コ
ンピュータ管理装置の構成例である。集中コンピュータ
管理装置は通信制御処理部２１０、メッセージ処理部２
２０、切り替え制御処理部２４０、タイマ２４１、コン
ピュータ管理部２３０、状態管理テーブル２３１及び管
理者インタフェース制御部２５０から構成される。タイ
マ２４１は、予め設定された時間間隔で動作モード切り
替え指示を切り替え制御処理部２４０に通知する。な
お、１５１は追加部分を示す。FIG. 2 shows an example of the configuration of the centralized computer management apparatus in one embodiment of the present invention. The central computer management device includes a communication control processing unit 210 and a message processing unit 2.
20, a switching control processing unit 240, a timer 241, a computer management unit 230, a state management table 231, and an administrator interface control unit 250. The timer 241 notifies the switching control processing unit 240 of operation mode switching instructions at preset time intervals. In addition, 151 shows an additional part.

【０００８】ここで、切り替え制御処理部２４０の処理
フローを図３〜図５に示す。本実施例では、現用コンピ
ュータの異常を検出した１次予備コンピュータから予備
要求メッセージを受信すると（ステップ３０１１）、そ
の１次予備コンピュータへ異常通知・予備要求メッセー
ジ受信完了の応答メッセージを転送する（ステップ３０
１２）。あるいは、１次予備コンピュータの異常を検出
した現用コンピュータから予備要求メッセージを受信す
ると（ステップ３０１３）、その現用コンピュータへ異
常通知・予備要求メッセージ受信完了の応答メッセージ
を送信する（ステップ３０１４）。この後、周期的切り
替えタイマを停止して周期的切り替えを中断する（ステ
ップ３０１５）。次に、受信メッセージから異常が検出
された現用コンピュータのノードアドレスとその一次予
備コンピュータのノードアドレスを抽出し（ステップ３
０１６）、管理者インタフェース制御部へ異常コンピュ
ータのノードアドレス通知し（ステップ３０１７）、さ
らにコンピュータ管理部にその異常コンピュータのノー
ドアドレスと、新たに１次予備候補となる２次予備コン
ピュータの割当て要求を通知する（ステップ３０１
８）。一方、コンピュータ管理部から１次予備候補とな
る２次予備コンピュータのノードアドレスを受信すると
（ステップ３０２１）、その２次予備コンピュータに、
新たに現用となったコンピュータのノードアドレス通知
メッセージを転送する（ステップ３０２２）。Here, a processing flow of the switching control processing unit 240 is shown in FIGS. In this embodiment, when a preliminary request message is received from the primary spare computer that has detected an abnormality in the active computer (step 3011), a response message indicating completion of reception of the abnormality notification / preliminary request message is transferred to the primary spare computer (step 3011). Thirty
12). Alternatively, when a preliminary request message is received from the active computer that has detected an abnormality in the primary spare computer (step 3013), a response message indicating the completion of the abnormality notification / preliminary request message reception is transmitted to the active computer (step 3014). After that, the periodic switching timer is stopped to interrupt the periodic switching (step 3015). Next, the node address of the working computer in which the abnormality is detected and the node address of its primary spare computer are extracted from the received message (step 3
016), the node address of the abnormal computer is notified to the administrator interface control unit (step 3017), and the node address of the abnormal computer and a request for allocating a secondary spare computer that is newly a primary spare candidate are sent to the computer management unit. Notify (step 301)
8). On the other hand, when the node address of the secondary spare computer that is a primary spare candidate is received from the computer management unit (step 3021), the secondary spare computer
The node address notification message of the newly used computer is transferred (step 3022).

【０００９】また、ノードアドレス通知メッセージを転
送された２次予備コンピュータから、ノードアドレス設
定完了メッセージを受信した場合は（ステップ３０３
１，３０３２のＹＥＳ）、新たに現用となったコンピュ
ータに、２次予備コンピュータのノードアドレスを含む
役割交代メッセージを転送する（ステップ３０３３）。
あるいは（ステップ３０３２のＮＯ）、管理者インタフ
ェース制御部に異常を通知し（ステップ３０３４）、コ
ンピュータ管理部に新たな１次予備候補の２次予備コン
ピュータの割当て要求を通知する（ステップ３０３
５）。一方、役割交代メッセージを転送された現用コン
ピュータから、役割交代処理完了報告メッセージを受信
した場合は（ステップ３０４１、３０４２のＹＥＳ）、
コンピュータ管理部に、状態管理テーブル中の、役割交
代が完了した２次予備コンピュータの動作モードを１次
予備に更新する要求を送信し（ステップ３０４３）、周
期的切り替えのためのタイマ設定を行う（ステップ３０
４４）。また、役割交代処理完了報告メッセージを受信
した場合は（ステップ３０４１、３０４２のＮＯ）、管
理者インタフェース制御部に異常を通知し（ステップ３
０３４）、コンピュータ管理部に新たな１次予備候補の
２次予備コンピュータの割当て要求を通知する（ステッ
プ３０３５）。その周期的切り替えタイマの割り込みが
発生すると（ステップ３０５１）、コンピュータ管理部
へ切り替え元と切り替え先のノードアドレスを要求する
（ステップ３０５２）。そして、コンピュータ管理部か
ら切り替え元と切り替え先のノードアドレスを受信する
と（ステップ３０６１）、切り替え先に切り替え元のノ
ードアドレス通知メッセージを転送する（ステップ３０
６２）。When a node address setting completion message is received from the secondary spare computer to which the node address notification message has been transferred (step 303)
(YES at 1, 3032), the role change message including the node address of the secondary spare computer is transferred to the newly active computer (step 3033).
Alternatively (NO in step 3032), the administrator interface control unit is notified of the abnormality (step 3034), and the computer management unit is notified of the allocation request for the secondary spare computer of the new primary spare candidate (step 303).
5). On the other hand, when the role change processing completion report message is received from the active computer to which the role change message is transferred (YES in steps 3041 and 3042),
A request for updating the operation mode of the secondary spare computer whose role has been changed in the state management table to the primary spare in the state management table is transmitted to the computer management unit (step 3043), and the timer is set for periodic switching (step 3043). Step 30
44). When the role change process completion report message is received (NO in steps 3041 and 3042), the administrator interface control unit is notified of the abnormality (step 3).
034) and notifies the computer management unit of a request for allocation of a new secondary spare computer as a primary spare candidate (step 3035). When the interruption of the periodic switching timer occurs (step 3051), the node address of the switching source and the switching destination is requested to the computer management unit (step 3052). When the node addresses of the switching source and the switching destination are received from the computer management unit (step 3061), the node address notification message of the switching source is transferred to the switching destination (step 30).
62).

【００１０】また、コンピュータ管理部２３０の処理フ
ローを図６、図７に示す。本実施例では、切り替え制御
処理部から、異常コンピュータのノードアドレスと２次
予備コンピュータ検索依頼の通知を受信すると（ステッ
プ５０８１）、その異常コンピュータに関する動作モー
ドを異常に更新し、２次予備コンピュータのノードアド
レスを検索して（ステップ５０８２）、切り替え制御処
理部に通知する（ステップ５０８３）。一方、切り替え
制御処理部から、切り替え元および切り替え先コンピュ
ータのノードアドレスの要求を受信すると（ステップ５
０９１）、切り替え元と切り替え先の関係を、１次予備
と２次予備、現用と１次予備として順次状態管理テーブ
ルからそれぞれのノードアドレスを検索し（ステップ５
０９２）、切り替え制御処理部に通知する（ステップ５
０９３）。また、切り替え制御処理部から状態管理テー
ブルの更新依頼を受信すると（ステップ５１０１）、更
新を実行し（ステップ５１０２）、管理者インタフェー
ス制御部に実行結果を通知する（ステップ５１０３）。
また、管理者インタフェース制御部から状態管理テーブ
ルの更新／検索通知を受信すると（ステップ５１１
１）、更新／検索を実行し（ステップ５１１２）、管理
者インタフェース制御部に実行結果を通知する（ステッ
プ５１１３）。The processing flow of the computer management section 230 is shown in FIGS. 6 and 7. In this embodiment, when the node address of the abnormal computer and the notification of the secondary spare computer search request are received from the switching control processing unit (step 5081), the operation mode of the abnormal computer is updated to abnormal, and the secondary spare computer is updated. The node address is searched (step 5082) and the switching control processing unit is notified (step 5083). On the other hand, when a request for the node addresses of the switching source and switching destination computers is received from the switching control processing unit (step 5).
091), the relationship between the switching source and the switching destination is set as the primary spare and the secondary spare, and the working and the primary spare are sequentially searched for respective node addresses from the state management table (step 5).
092), and notifies the switching control processing unit (step 5).
093). When receiving a request for updating the state management table from the switching control processing unit (step 5101), the update is executed (step 5102) and the execution result is notified to the administrator interface control unit (step 5103).
Further, when the update / search notification of the status management table is received from the administrator interface control unit (step 511).
1), update / search is executed (step 5112), and the execution result is notified to the administrator interface control unit (step 5113).

【００１１】なお、状態管理テーブル２３１の構成例は
図８に示す通りであって、各コンピュータのノードアド
レス情報を保持するノードアドレス欄３０１、動作モー
ド欄３０２、現用／１次予備のノードアドレス欄３０３
からなる。また、管理者インタフェース制御部２５０の
処理フローを図９に示す。本実施例では、コンピュータ
管理部あるいは切り替え制御処理部から表示情報を受信
すると、通知情報を編集、表示する（ステップ７００
１、７００２）。また、入力情報を受信して、コンピュ
ータ管理部へ各種要求を通知する（ステップ７０１
１）。The configuration example of the state management table 231 is as shown in FIG. 8, and the node address column 301 holding the node address information of each computer, the operation mode column 302, the active / primary spare node address column 303
Consists of Further, FIG. 9 shows a processing flow of the administrator interface control unit 250. In this embodiment, when the display information is received from the computer management unit or the switching control processing unit, the notification information is edited and displayed (step 700).
1, 7002). Also, it receives the input information and notifies the computer management unit of various requests (step 701).
1).

【００１２】図１０は、本発明の一実施例におけるコン
ピュータの構成図である。本実施例では、メッセージ転
送部４２０、通信制御処理部４１０等からなるコンピュ
ータに、アドレス管理部４３０、アドレステーブル４３
１、監視処理部４４０、第１タイマ４４１、第２タイマ
４４２、切り替え処理部４５０、ホットスタンバイ処理
部４６０を追加する。なお、１７０は追加部分を示す。
この切り替え処理部４５０の処理フローを図１１〜図１
５に示す。本実施例では、図１１、図１２のように、監
視処理部から現用コンピュータの異常検出の報告を受信
すると（ステップ９０１１）、各部４３０、４４０、４
５０、４６０の動作モードを現用に更新し（ステップ９
０１２）、集中コンピュータ管理装置に異常通知・予備
要求メッセージを送信する（ステップ９０１３、９０１
４）。一方、集中コンピュータ管理装置から異常通知・
予備要求メッセージ受信の応答メッセージを受信すると
（ステップ９０４１）、正常終了するか（ステップ９０
４２のＹＥＳ）、集中コンピュータ管理装置に処理完了
メッセージを転送する（ステップ９０４２のＮＯ、９０
３６）。また、集中コンピュータ管理装置から役割交代
メッセージを受信すると（ステップ９０２１）、保持情
報を読み出し、情報転送メッセージとして１次予備候補
のコンピュータに送信する（ステップ９０２２）。ま
た、２次予備コンピュータから情報転送完了報告メッセ
ージを受信すると（ステップ９０３１、９０３２のＹＥ
Ｓ）、そのコンピュータの動作モードをチェックし（ス
テップ９０３３）、１次予備ならば各部４３０、４４
０、４５０、４６０の動作モードを２次予備に更新し
（ステップ９０３４）、現用ならばアドレス管理部へ１
次予備コンピュータのノードアドレス設定要求を通知し
て（ステップ９０３５）、集中コンピュータ管理装置に
処理完了メッセージを転送する（ステップ９０３６）。FIG. 10 is a block diagram of a computer according to an embodiment of the present invention. In this embodiment, a computer including a message transfer unit 420, a communication control processing unit 410, etc. is provided with an address management unit 430 and an address table 43.
1, a monitoring processing unit 440, a first timer 441, a second timer 442, a switching processing unit 450, and a hot standby processing unit 460 are added. In addition, 170 shows an additional part.
The processing flow of this switching processing unit 450 is shown in FIGS.
5 shows. In this embodiment, as shown in FIGS. 11 and 12, when a report of abnormality detection of the active computer is received from the monitoring processing unit (step 9011), the respective units 430, 440, 4 are transmitted.
The operation modes of 50 and 460 are updated to the current one (step 9
012), transmits an abnormality notification / preliminary request message to the central computer management device (steps 9013, 901).
4). On the other hand, the abnormality notification from the central computer management device
When a response message for receiving the preliminary request message is received (step 9041), the process normally ends (step 90).
42, YES), and transfers the processing completion message to the central computer management device (NO in step 9042, 90).
36). When the role change message is received from the central computer management device (step 9021), the retained information is read out and transmitted as an information transfer message to the computer of the primary preliminary candidate (step 9022). When the information transfer completion report message is received from the secondary spare computer (YE in steps 9031 and 9032).
S), the operation mode of the computer is checked (step 9033), and if it is a primary spare, each unit 430, 44
The operation mode of 0, 450, 460 is updated to the secondary spare (step 9034), and if it is in use, 1 is sent to the address management unit.
The node address setting request of the next spare computer is notified (step 9035), and the processing completion message is transferred to the central computer management device (step 9036).

【００１３】また、図１３のように、切り替え元からそ
のコンピュータの保持情報転送メッセージを受信すると
（ステップ９０５１）、そのメッセージの転送元ノード
アドレスと切り替え元のノードアドレスとを比較し（ス
テップ９０５２）、不一致ならば転送元ノードアドレス
不一致の情報転送完了報告メッセージを転送元に送信す
る（ステップ９０５３）。また、一致するならば受信情
報を格納し（ステップ９０５４）、各部４３０、４４
０、４５０、４６０の動作モードを１次予備に更新し
（ステップ９０５５）、転送元に情報転送完了報告メッ
セージを送信する（ステップ９０５６）。As shown in FIG. 13, when the retained information transfer message of the computer is received from the switching source (step 9051), the transfer source node address of the message is compared with the switching source node address (step 9052). If they do not match, a transfer source node address mismatch information transfer completion report message is transmitted to the transfer source (step 9053). If they match, the received information is stored (step 9054), and each unit 430, 44 is stored.
The operation modes of 0, 450, and 460 are updated to primary spare (step 9055), and an information transfer completion report message is transmitted to the transfer source (step 9056).

【００１４】また、図１４のように、周期的切り替えを
行う場合においては、集中コンピュータ管理装置から役
割交代メッセージを受信すると（ステップ９０６１）、
該当のコンピュータの動作モードをチェックし（ステッ
プ９０６２）、１次予備であれば保持情報を読み出し、
情報転送メッセージとして切り替え先に送信する（ステ
ップ９０６３）。また、現用であれば１次予備コンピュ
ータに現用モードへの切り替え指示を含む情報転送メッ
セージを送信する（ステップ９０６４）。一方、切り替
え先から情報転送完了報告メッセージを受信すると（ス
テップ９０７１、９０７２のＹＥＳ）、転送元の動作モ
ードをチェックし（ステップ９０７３）、現用ならば各
部４３０、４４０、４５０、４６０の動作モードを１次
予備に更新し（ステップ９０７４）、１次予備ならば各
部４３０、４４０、４５０、４６０の動作モードを２次
予備に更新して（ステップ９０７５）、集中コンピュー
タ管理装置に処理完了メッセージを転送する（ステップ
９０７６）。In the case of performing periodic switching as shown in FIG. 14, when a role change message is received from the central computer management unit (step 9061),
The operation mode of the corresponding computer is checked (step 9062), and if it is the primary spare, the retained information is read,
It is transmitted to the switching destination as an information transfer message (step 9063). If it is in use, an information transfer message including an instruction to switch to the active mode is transmitted to the primary spare computer (step 9064). On the other hand, when the information transfer completion report message is received from the switching destination (YES in steps 9071 and 9072), the operation mode of the transfer source is checked (step 9073), and if it is in use, the operation modes of the respective units 430, 440, 450 and 460 are checked. Update to the primary spare (step 9074), if it is the primary spare, update the operation mode of each unit 430, 440, 450, 460 to the secondary spare (step 9075), and transfer the processing completion message to the central computer management device. (Step 9076).

【００１５】また、周期的切り替えを行う場合におい
て、図１５のように、切り替え元からそのコンピュータ
の保持情報転送メッセージを受信すると（ステップ９０
８１）、そのメッセージの転送元ノードアドレスと切り
替え元のノードアドレスを比較し（ステップ９０８
２）、不一致であれば転送元ノードアドレス不一致の情
報転送完了メッセージを転送元に送信する（ステップ９
０８３）。また、一致であれば、さらにモード切り替え
のみ指示かをチェックし（ステップ９０８４）、そうで
あれば各部４３０、４４０、４５０、４６０の動作モー
ドを現用に更新する（ステップ９０８５）。また、そう
でなければ受信情報を格納し（ステップ９０８６）、各
部４３０、４４０、４５０、４６０の動作モードを１次
予備に更新して（ステップ９０８７）、転送元に情報転
送完了報告メッセージを送信する（ステップ９０８
８）。Further, in the case of performing the periodic switching, as shown in FIG. 15, when the retained information transfer message of the computer is received from the switching source (step 90).
81), and compares the transfer source node address of the message with the switching source node address (step 908).
2) If they do not match, the information transfer completion message of the transfer source node address mismatch is transmitted to the transfer source (step 9).
083). If they match, it is further checked whether or not only mode switching is instructed (step 9084), and if so, the operation mode of each unit 430, 440, 450, 460 is updated to the current one (step 9085). If not, the received information is stored (step 9086), the operation mode of each unit 430, 440, 450, 460 is updated to primary reserve (step 9087), and the information transfer completion report message is transmitted to the transfer source. Yes (step 908)
8).

【００１６】また、アドレス管理部４３０の処理フロー
を図１６に示す。本実施例では、集中コンピュータ管理
装置から現用コンピュータノードアドレス通知メッセー
ジを受信すると、アドレステーブルに現用ノードアドレ
スを設定する（ステップ１３０１）。図１７は、本発明
の一実施例におけるコンピュータのアドレステーブル４
３１の構成を示す図である。アドレス管理部４３０は、
図１７に示すように動作モードに応じて二つのノードア
ドレスを設定する。ノードアドレスには、そのコンピュ
ータが現用モードの場合、そのコンピュータのノードア
ドレスが設定され、１次予備モードの場合、そのコンピ
ュータに対応する現用コンピュータのノードアドレスが
設定されている。２次予備及び異常モードの場合、その
コンピュータのノードアドレスのみが設定されている。The processing flow of the address management unit 430 is shown in FIG. In this embodiment, when the working computer node address notification message is received from the central computer management device, the working node address is set in the address table (step 1301). FIG. 17 is a computer address table 4 according to an embodiment of the present invention.
It is a figure which shows the structure of 31. The address management unit 430
As shown in FIG. 17, two node addresses are set according to the operation mode. The node address is set to the node address of the computer when the computer is in the active mode, and is set to the node address of the active computer corresponding to the computer when the computer is in the primary spare mode. In the case of the secondary spare and abnormal mode, only the node address of the computer is set.

【００１７】また、ホットスタンバイ処理部４６０の処
理フローを図１８に示す。本実施例では、現用コンピュ
ータの保持情報を更新する場合、メモリ等に蓄積された
更新情報を情報メッセージとして１次予備コンピュータ
に送信する（ステップ１４０１）。また、その１次予備
コンピュータから情報転送完了報告メッセージを受信し
た場合、異常があれば（ステップ１４２１のＮＯ）、集
中コンピュータ管理装置に異常通知・予備要求メッセー
ジを送信する（ステップ１４２２）。また、現用コンピ
ュータから情報転送メッセージを受信した場合は、メモ
リ等の格納情報を更新し（ステップ１４１１）、現用コ
ンピュータへ情報転送メッセージ・完了報告メッセージ
を送信する（ステップ１４１２）。FIG. 18 shows a processing flow of the hot standby processing unit 460. In the present embodiment, when updating the information held in the active computer, the update information accumulated in the memory or the like is transmitted to the primary spare computer as an information message (step 1401). If an information transfer completion report message is received from the primary spare computer and there is an abnormality (NO in step 1421), an abnormality notification / preliminary request message is transmitted to the central computer management device (step 1422). When the information transfer message is received from the active computer, the information stored in the memory or the like is updated (step 1411) and the information transfer message / completion report message is transmitted to the active computer (step 1412).

【００１８】また、監視処理部４４０の処理フローを図
１９に示す。本実施例では、第２タイマからタイムアウ
トを受信するか、あるいは、１次予備コンピュータに対
応した現用コンピュータからの監視応答メッセージを受
信して異常検出した場合は、切り替え処理部に異常を通
知する（ステップ１５０１）。また、正常応答であれば
第２タイマを停止する（ステップ１５１２）。また、第
１タイマからの割り込みを受信した場合は、１次予備コ
ンピュータに対応した現用コンピュータからの監視メッ
セージを転送し（ステップ１５２１）、その監視メッセ
ージに対する第２タイマを設定する（ステップ１５２
２）。The processing flow of the monitoring processing unit 440 is shown in FIG. In this embodiment, when a timeout is received from the second timer or an abnormality is detected by receiving a monitoring response message from the active computer corresponding to the primary spare computer, the switching processing unit is notified of the abnormality ( Step 1501). If the response is normal, the second timer is stopped (step 1512). When the interrupt from the first timer is received, the monitoring message from the active computer corresponding to the primary spare computer is transferred (step 1521), and the second timer for the monitoring message is set (step 152).
2).

【００１９】次に、本発明のコンピュータ間交信の一実
施例を図２０〜図２５により説明する。図２０、図２１
は、本発明の一実施例におけるコンピュータ異常時の切
り替え概念を示すフローチャートである。本実施例で
は、集中コンピュータ管理装置において、各コンピュー
タから異常報告を受信すると、周期的切り替え動作を停
止し（ステップ１７０１）、コンピュータ動作状態管理
手段（図２の２３０、２３１等）から、２次予備コンピ
ュータのノードアドレスを選択し、現用コンピュータに
２次予備との間で待機２重構成をとるように指示する
（ステップ１７０２）。この指示を受けて、現用／１次
予備コンピュータでは、指定された２次予備コンピュー
タに自機の保持情報と１次予備モードへの変更指示を送
信する（ステップ１７１１）。そして、その送信先から
完了報告を受信すると（ステップ１７１２）、集中コン
ピュータ管理装置に転送する（ステップ１７１３）。集
中コンピュータ管理装置は、その現用／１次予備コンピ
ュータからの完了報告を受信すると（ステップ１７０
３）、周期的切り替え用タイマを再起動する（ステップ
１７０４）。なお、２次予備コンピュータは、動作モー
ド変更指示を受信すると、同時に送信された保持情報を
格納し（ステップ１７２１）、自機の動作モードを１次
予備に更新して（ステップ１７２２）、その保持情報送
信元に完了報告を送信する（ステップ１７２３）。Next, an embodiment of communication between computers of the present invention will be described with reference to FIGS. 20 and 21
FIG. 6 is a flowchart showing a concept of switching when a computer is abnormal in one embodiment of the present invention. In this embodiment, in the centralized computer management device, when an abnormality report is received from each computer, the periodical switching operation is stopped (step 1701), and the computer operation state management means (230, 231 etc. in FIG. 2) causes the secondary operation. The node address of the spare computer is selected, and the active computer is instructed to take a standby dual configuration with the secondary spare (step 1702). In response to this instruction, the working / primary spare computer transmits its own holding information and an instruction to change to the primary spare mode to the designated secondary spare computer (step 1711). When the completion report is received from the destination (step 1712), it is transferred to the central computer management device (step 1713). The central computer management device receives the completion report from the active / primary spare computer (step 170).
3) The timer for periodic switching is restarted (step 1704). Upon receipt of the operation mode change instruction, the secondary spare computer stores the holding information transmitted at the same time (step 1721), updates the operation mode of its own machine to the primary spare (step 1722), and holds the holding information. A completion report is transmitted to the information transmission source (step 1723).

【００２０】次に、現用コンピュータ異常のケースにつ
いてより詳細に述べる。図２２は、本発明の一実施例に
おけるコンピュータ間交信（現用コンピュータ異常のケ
ース）を示す図である。本実施例では、１次予備コンピ
ュータ１１２が現用コンピュータの異常を監視メッセー
ジの受信時間間隔と受信監視応答メッセージの内容によ
り検出（ａ）すると、１次予備コンピュータの動作モー
ドを現用モードに変更（ｂ）し、集中コンピュータ管理
装置１５０に異常通知・予備要求メッセージを転送する
（ｃ）。集中コンピュータ管理装置１５０は、異常通知
・予備要求メッセージを受信すると受信応答メッセージ
を１次予備コンピュータ１１２に転送（ｄ）するととも
に、コンピュータ間の動作モードの周期的切り替え制御
を中断する。また、状態管理テーブルを検索し、コンピ
ュータ１１２をバックアップする２次予備コンピュータ
１４２のノードアドレスを読み出し、その２次予備コン
ピュータ１４２に１次予備コンピュータ１１２のノード
アドレスを通知する、ノードアドレス通知メッセージを
転送する（ｅ）。２次予備コンピュータ１４２は、この
ノードアドレス通知メッセージを受信すると、アドレス
テーブルに通知されたノードアドレスを格納し、設定完
了メッセージを集中コンピュータ管理装置１５０に転送
する（ｆ）。更に、集中コンピュータ管理装置１５０
は、設定完了メッセージを受信すると、新たな現用コン
ピュータ１１２に対し、２次予備コンピュータ１４２と
の間での役割を交代する役割交代メッセージを転送する
（ｇ）。役割交代メッセージを受信した現用コンピュー
タ１１２は、そのコンピュータのメモリ情報／情報蓄積
格納情報などの保有情報を読み出し、２次予備コンピュ
ータ１４２に情報転送メッセージとして転送する
（ｈ）。この情報転送メッセージを受信した２次予備コ
ンピュータは、既に受信済みのノードアドレスに対応す
る新たな現用コンピュータからのメッセージであれば、
受信し受信情報に対応する格納場所に格納して、処理終
了後、そのコンピュータの動作モードを１次予備モード
にして、完了報告メッセージを現用コンピュータ１１２
に転送する（ｉ）。そして、この完了報告メッセージを
受信した現用コンピュータ１１２は、新たな１次予備コ
ンピュータの動作モードを１次予備に更新し、役割交代
処理の完了報告メッセージを集中コンピュータ管理装置
１５０に転送する（ｊ）。この完了報告メッセージを受
信した集中コンピュータ管理装置１５０は、２次予備コ
ンピュータ１４２の動作モードを１次予備に変更し、周
期的動作モード切り替え処理を再開する。一方、異常と
なった現用コンピュータは、修復後は未使用コンピュー
タとして使用される。Next, the case of the active computer abnormality will be described in more detail. FIG. 22 is a diagram showing communication between computers (case of abnormal computer in use) according to an embodiment of the present invention. In this embodiment, when the primary spare computer 112 detects an abnormality in the active computer based on the monitoring message reception time interval and the content of the reception monitoring response message (a), the operation mode of the primary spare computer is changed to the active mode (b). Then, the abnormality notification / preliminary request message is transferred to the central computer management device 150 (c). Upon receiving the abnormality notification / preliminary request message, the central computer management device 150 transfers (d) the reception response message to the primary spare computer 112, and interrupts the periodical switching control of the operation mode between the computers. In addition, a node address notification message that transfers the node address notification message that searches the state management table, reads the node address of the secondary spare computer 142 that backs up the computer 112, and notifies the secondary spare computer 142 of the node address of the primary spare computer 112 (E). Upon receiving this node address notification message, the secondary spare computer 142 stores the notified node address in the address table and transfers the setting completion message to the central computer management device 150 (f). Further, the central computer management device 150
Upon receiving the setting completion message, transfers a role change message for changing the role with the secondary spare computer 142 to the new active computer 112 (g). Upon receiving the role change message, the active computer 112 reads out the possession information such as the memory information / information storage / storage information of the computer and transfers it to the secondary spare computer 142 as an information transfer message (h). The secondary spare computer receiving this information transfer message is a message from the new working computer corresponding to the already received node address.
After receiving and storing in the storage location corresponding to the received information and after the processing is completed, the operation mode of the computer is set to the primary standby mode and the completion report message is sent to the working computer 112.
(I). Then, the active computer 112 that has received this completion report message updates the operation mode of the new primary spare computer to primary spare, and transfers the role change processing completion report message to the central computer management device 150 (j). . Upon receiving the completion report message, the centralized computer management device 150 changes the operation mode of the secondary spare computer 142 to the primary spare and restarts the periodical operation mode switching process. On the other hand, the abnormal working computer is used as an unused computer after restoration.

【００２１】次に、本実施例の周期的切り替えの概念に
ついて述べる。図２３、図２４は、本発明の一実施例に
おけるコンピュータの周期的切り替えの概念を示すフロ
ーチャートである。本実施例では、集中コンピュータ管
理装置において、周期的切り替え用タイマを再起動した
後（ステップ１９０１）、コンピュータ動作状態管理手
段から１次予備および２次予備コンピュータのノードア
ドレスを選択し、それらの間での動作モード交代を指示
する（ステップ１９０２）。この指示を受けた現用／１
次予備コンピュータでは、指定された２次予備コンピュ
ータに自機の保持情報と動作モードの変更指示を送信し
（ステップ１９１１）、その送信先からの完了報告を受
信すると（ステップ１９１２）、自機の動作状態を２次
予備に変更し（ステップ１９１３）、集中コンピュータ
管理装置に完了報告を送信する（ステップ１９１４）。
集中コンピュータ管理装置は、交代指示送信先が１次予
備の場合の完了報告を受信すると（ステップ１９０
３）、次にコンピュータ動作状態管理手段から現用コン
ピュータと２次予備コンピュータのノードアドレを選択
し、それらの間での動作モード交代を指示する（ステッ
プ１９０４）。そして、交代指示送信先が現用の場合の
完了報告を受信すると（ステップ１９０５）、ステップ
１９０１に戻って一連の処理を繰り返す。なお、２次予
備コンピュータでは、動作モード変更指示を受信する
と、現用／１次予備コンピュータから送信された保持情
報を格納し（ステップ１９２１）、自機をその保持情報
送信元の動作状態に変更して（ステップ１９２２）、そ
の送信元に完了報告を送信する（ステップ１９２３）。Next, the concept of periodic switching in this embodiment will be described. 23 and 24 are flowcharts showing the concept of periodical switching of computers in the embodiment of the present invention. In the present embodiment, in the centralized computer management device, after restarting the periodic switching timer (step 1901), the node addresses of the primary spare and secondary spare computers are selected from the computer operating state management means, and between them, The operation mode change is instructed (step 1902). Working / 1 which received this instruction
The next spare computer transmits its own holding information and an instruction to change the operation mode to the designated secondary spare computer (step 1911), and when it receives the completion report from the destination (step 1912), The operating state is changed to secondary standby (step 1913), and a completion report is sent to the central computer management device (step 1914).
The centralized computer management device receives the completion report when the replacement instruction transmission destination is the primary spare (step 190).
3) Next, the node address of the active computer and the secondary spare computer is selected from the computer operation state management means, and the operation mode change between them is instructed (step 1904). When the completion report is received when the replacement instruction transmission destination is the current one (step 1905), the process returns to step 1901 to repeat the series of processes. Upon receipt of the operation mode change instruction, the secondary spare computer stores the holding information transmitted from the active / primary spare computer (step 1921) and changes itself to the operating state of the holding information transmitting source. (Step 1922), the completion report is transmitted to the transmission source (step 1923).

【００２２】次に、周期的切り替えケースについてより
詳細に述べる。図２５は、本発明の一実施例におけるコ
ンピュータ間交信（周期的切り替えケース）を示す図で
ある。集中コンピュータ管理装置１５０は、状態管理テ
ーブル２３１が保持するコンピュータ情報を検索し、先
ず１次予備コンピュータ１１２を検索するとともに２次
予備コンピュータ１４２を検索し、２次予備コンピュー
タ１４２に１次予備コンピュータ１１２のノードアドレ
ス通知用のノードアドレス通知メッセージを転送する
（ア）。２次予備コンピュータ１４２は、このノードア
ドレス通知メッセージを受信すると、そのコンピュータ
内のアドレステーブルにそのノードアドレスを格納し、
設定完了メッセージを集中コンピュータ管理装置１５０
に転送する（イ）。設定完了メッセージを転送された集
中コンピュータ管理装置１５０は、１次予備コンピュー
タ１１２に対し、２次予備コンピュータ１４２との間で
の役割交代を行う役割交代メッセージを転送する
（ウ）。役割交代メッセージを受信した１次予備コンピ
ュータ１１２は、そのコンピュータのメモリ情報と情報
蓄積装置格納情報などの保持情報を読み出し、２次予備
コンピュータ１４２に情報転送メッセージとして転送す
る（エ）。この情報転送メッセージを受信した２次予備
コンピュータは、受信メッセージのノードアドレスとノ
ードアドレス通知メッセージで受信した１次予備コンピ
ュータのノードアドレスを比較し、一致時取り込み、受
信メッセージ内の情報を対応する格納場所に格納する。
処理終了後、そのコンピュータの動作モードを１次予備
コンピュータにして、完了報告メッセージを１次予備コ
ンピュータ１１２に転送する（オ）。そして、この完了
報告メッセージを転送された１次予備コンピュータ１１
２は、動作モードを２次予備モードに変更し役割交代処
理の完了報告メッセージを集中コンピュータ管理装置１
５０に転送する（カ）。この完了報告メッセージを受信
した集中コンピュータ管理装置１５０は、状態管理テー
ブル上のコンピュータ１４２の動作モードを１次予備
に、また、コンピュータ１１２の動作モードを２次予備
に変更する。次に、現用から２次予備への動作モード切
り替え処理を継続し、現用コンピュータ１１１を検索す
るとともに２次予備コンピュータ１１２を検索し、２次
予備コンピュータ１１２に現用コンピュータ１１１のノ
ードアドレス通知用のノードアドレス通知メッセージを
転送する（キ）。２次予備コンピュータ１１２は、この
ノードアドレス通知メッセージを受信すると、そのコン
ピュータ内のアドレステーブルに格納し、設定完了メッ
セージを集中コンピュータ管理装置１５０に転送する
（ク）。設定完了メッセージを転送された集中コンピュ
ータ管理装置１５０は、設定完了メッセージを受信する
と、現用コンピュータ１１１に対し、２次予備コンピュ
ータ１１２との間での役割交代を行う役割交代メッセー
ジを転送する（ケ）。役割交代メッセージを受信した現
用コンピュータ１１１は、そのコンピュータのメモリ情
報と情報蓄積装置格納情報などの保持情報を読み出し、
２次予備コンピュータ１１２に情報転送メッセージとし
て転送する（コ）。この情報転送メッセージを受信した
２次予備コンピュータは、受信メッセージのノードアド
レスとノードアドレス通知メッセージで転送された現用
コンピュータのノードアドレスを比較し、一致時取り込
み、受信メッセージ内の情報を対応する格納場所に格納
して、処理終了後、そのコンピュータの動作モードを現
用モードにして、完了報告メッセージを現用コンピュー
タ１１１に転送する（サ）。そして、この完了報告メッ
セージを転送された現用コンピュータ１１１は、動作モ
ードを２次予備モードに変更し役割交代処理の完了報告
メッセージを集中コンピュータ管理装置１５０に転送す
る（シ）。この完了報告メッセージを受信した集中コン
ピュータ管理装置１５０は、状態管理テーブル上の新た
な現用コンピュータ１１２の動作モードを現用に、ま
た、新たな２次予備コンピュータ１１１の動作モードを
２次予備に変更して、以上の動作モード切り替え処理を
繰り返す。Next, the periodic switching case will be described in more detail. FIG. 25 is a diagram showing communication between computers (periodic switching case) in the embodiment of the present invention. The central computer management device 150 searches the computer information held in the state management table 231, first searches for the primary spare computer 112 and the secondary spare computer 142, and then the secondary spare computer 142 to the primary spare computer 112. The node address notification message for node address notification of is transferred (A). Upon receiving the node address notification message, the secondary spare computer 142 stores the node address in the address table in the computer,
Set completion message is sent to central computer management device 150
Transfer to (a). The centralized computer management device 150, to which the setting completion message is transferred, transfers to the primary spare computer 112 a role change message for changing roles with the secondary spare computer 142 (c). Upon receiving the role change message, the primary spare computer 112 reads out the memory information of the computer and the stored information such as the information storage device storage information and transfers it to the secondary spare computer 142 as an information transfer message (d). The secondary spare computer receiving this information transfer message compares the node address of the received message with the node address of the primary spare computer received in the node address notification message, fetches when they match, and stores the information in the received message correspondingly. Store in place.
After the processing is completed, the operation mode of the computer is set to the primary spare computer, and the completion report message is transferred to the primary spare computer 112 (e). Then, the primary spare computer 11 to which this completion report message is transferred
2 is a central computer management apparatus 1 that changes the operation mode to the secondary standby mode and sends a completion report message of the role change processing.
Transfer to 50 (f). Upon receiving this completion report message, the centralized computer management device 150 changes the operation mode of the computer 142 on the status management table to primary reserve and the operation mode of the computer 112 to secondary reserve. Next, the operation mode switching process from the active to the secondary spare is continued, the active computer 111 is searched, the secondary spare computer 112 is searched, and the secondary spare computer 112 is notified of the node address of the active computer 111. Forward the address notification message (G). Upon receiving this node address notification message, the secondary spare computer 112 stores it in the address table in the computer and transfers the setting completion message to the central computer management device 150 (h). Upon receiving the setting completion message, the centralized computer management device 150, to which the setting completion message is transferred, transfers a role changing message for changing the role with the secondary spare computer 112 to the active computer 111 (K). . Upon receiving the role change message, the active computer 111 reads out the memory information of the computer and the retained information such as the information storage device storage information,
It is transferred to the secondary spare computer 112 as an information transfer message (K). The secondary spare computer receiving this information transfer message compares the node address of the received message with the node address of the active computer transferred in the node address notification message, fetches when they match, and stores the information in the received message at the corresponding storage location. After completion of the processing, the operation mode of the computer is set to the active mode and the completion report message is transferred to the active computer 111 (S). Then, the active computer 111 to which the completion report message is transferred changes the operation mode to the secondary standby mode and transfers the completion report message of the role change processing to the central computer management device 150 (S). Upon receiving the completion report message, the centralized computer management device 150 changes the operation mode of the new active computer 112 on the status management table to the active mode and changes the operation mode of the new secondary spare computer 111 to the secondary spare. Then, the above operation mode switching process is repeated.

【００２３】[0023]

【発明の効果】本発明によれば、次の効果が得られる。（１）正常時でも、現用、１次予備、共用２次予備コン
ピュータ間で周期的に動作モードを切り替えることによ
り、１次予備コンピュータ、２次予備コンピュータの故
障等を速やかに検出できる。更に、定期保守の稼働が不
要になり、保守コストの削減が望める。（２）僅かな共用２次予備コンピュータだけで、コンピ
ュータ異常時に、迅速に任意の共用２次予備コンピュー
タを割り当て、この共用２次予備コンピュータで直ちに
待機２重化冗長構成を形成させることにより、高信頼
で、かつ経済的な冗長システムの構築が可能となる。According to the present invention, the following effects can be obtained. (1) Even in a normal state, the failure of the primary spare computer and the secondary spare computer can be promptly detected by periodically switching the operation mode between the active primary spare computer and the shared secondary spare computer. Furthermore, the operation of regular maintenance becomes unnecessary, and maintenance costs can be reduced. (2) With a small number of shared secondary spare computers, when a computer malfunctions, any shared secondary spare computer can be quickly assigned, and this shared secondary spare computer can immediately form a standby duplex redundant configuration. It is possible to build a reliable and economical redundant system.

[Brief description of drawings]

【図１】本発明の一実施例におけるコンピュータシステ
ムの構成図である。FIG. 1 is a configuration diagram of a computer system according to an embodiment of the present invention.

【図２】本発明の一実施例における集中コンピュータ管
理装置の構成図である。FIG. 2 is a configuration diagram of a centralized computer management apparatus according to an embodiment of the present invention.

【図３】本発明の一実施例における切り替え制御処理部
の処理フローチャートの一部である。FIG. 3 is a part of a processing flowchart of a switching control processing unit according to an embodiment of the present invention.

【図４】本発明の一実施例における切り替え制御処理部
の処理フローチャートの一部である。FIG. 4 is a part of a processing flowchart of a switching control processing unit according to an embodiment of the present invention.

【図５】本発明の一実施例における切り替え制御処理部
の処理フローチャートの一部である。FIG. 5 is a part of a processing flowchart of a switching control processing unit according to an embodiment of the present invention.

【図６】本発明の一実施例におけるコンピュータ管理部
の処理フローチャートの一部である。FIG. 6 is a part of a processing flowchart of a computer management unit according to an embodiment of the present invention.

【図７】本発明の一実施例におけるコンピュータ管理部
の処理フローチャートの一部である。FIG. 7 is a part of a processing flowchart of a computer management unit according to an embodiment of the present invention.

【図８】本発明の一実施例における状態管理テーブルの
構成を示す図である。FIG. 8 is a diagram showing a configuration of a state management table according to an embodiment of the present invention.

【図９】本発明の一実施例における管理者インタフェー
ス制御部の処理フローチャートである。FIG. 9 is a processing flowchart of an administrator interface control unit according to an embodiment of the present invention.

【図１０】本発明の一実施例におけるコンピュータの構
成図である。FIG. 10 is a configuration diagram of a computer according to an embodiment of the present invention.

【図１１】本発明の一実施例における切り替え処理部の
処理フローチャートの一部である。FIG. 11 is a part of a processing flowchart of a switching processing unit according to an embodiment of the present invention.

【図１２】本発明の一実施例における切り替え処理部の
処理フローチャートの一部である。FIG. 12 is a part of a processing flowchart of a switching processing unit according to an embodiment of the present invention.

【図１３】本発明の一実施例における切り替え処理部の
処理フローチャートの一部である。FIG. 13 is a part of a processing flowchart of a switching processing unit according to an embodiment of the present invention.

【図１４】本発明の一実施例における切り替え処理部の
処理フローチャートの一部である。FIG. 14 is a part of a processing flowchart of a switching processing unit according to an embodiment of the present invention.

【図１５】本発明の一実施例における切り替え処理部の
処理フローチャートの一部である。FIG. 15 is a part of a processing flowchart of a switching processing unit according to an embodiment of the present invention.

【図１６】本発明の一実施例におけるアドレス管理部の
処理フローチャートである。FIG. 16 is a processing flowchart of an address management unit according to an embodiment of the present invention.

【図１７】本発明の一実施例におけるコンピュータのア
ドレステーブルの構成を示す図である。FIG. 17 is a diagram showing a configuration of an address table of a computer according to an embodiment of the present invention.

【図１８】本発明の一実施例におけるホットスタンバイ
処理部の処理フローチャートである。FIG. 18 is a processing flowchart of a hot standby processing unit according to an embodiment of the present invention.

【図１９】本発明の一実施例における監視処理部の処理
フローチャートである。FIG. 19 is a processing flowchart of a monitoring processing unit according to an embodiment of the present invention.

【図２０】本発明の一実施例におけるコンピュータ異常
時の切り替え概念を示すフローチャートの一部である。FIG. 20 is a part of a flowchart showing a concept of switching when a computer is abnormal in one embodiment of the present invention.

【図２１】本発明の一実施例におけるコンピュータ異常
時の切り替え概念を示すフローチャートの一部である。FIG. 21 is a part of a flowchart showing the concept of switching when a computer is abnormal in one embodiment of the present invention.

【図２２】本発明の一実施例におけるコンピュータ間交
信（現用コンピュータ異常のケース）を示す図である。FIG. 22 is a diagram showing communication between computers (case of abnormal computer in use) according to an embodiment of the present invention.

【図２３】本発明の一実施例におけるコンピュータの周
期的切り替えの概念を示すフローチャートの一部であ
る。FIG. 23 is a part of a flowchart showing the concept of periodic switching of computers in an embodiment of the present invention.

【図２４】本発明の一実施例におけるコンピュータの周
期的切り替えの概念を示すフローチャートの一部であ
る。FIG. 24 is a part of a flowchart showing the concept of periodic switching of computers in an embodiment of the present invention.

【図２５】本発明の一実施例におけるコンピュータ間交
信（周期的切り替えケース）を示す図である。FIG. 25 is a diagram showing communication between computers (periodic switching case) according to an embodiment of the present invention.

[Explanation of symbols]

１０１，１１１，１２１，１３１：現用コンピュータ、
１０２，１１２，１２２，１３２：１次予備コンピュー
タ、１４１，１４２：共用２次予備コンピュータ、１５
０：集中コンピュータ管理装置、１６０：伝送路。101, 111, 121, 131: active computer,
102, 112, 122, 132: primary spare computer, 141, 142: shared secondary spare computer, 15
0: central computer management device, 160: transmission line.

Claims

[Claims]

1. A system comprising a plurality of computers having a plurality of operation modes, wherein a primary spare computer assigned to an active computer and a secondary spare computer shared by the active and primary spare computers are connected on a transmission line. Then, when an abnormality of the active computer is detected, the primary spare computer is switched to the active computer, and all the retained information is transferred from the active computer to the primary spare computer,
A redundant computer system switching method, characterized in that any of the secondary spare computers is assigned as a new primary spare computer.

2. All the holding information is periodically transmitted from the primary spare computer to the secondary spare computer and from the active computer to the primary spare computer to change the operation mode to the plurality of computers, 2. The redundant computer system switching method according to claim 1, wherein the computer is operated as an active computer.

3. A plurality of computers having a plurality of operation modes, and a centralized computer management device which centrally manages the operation states of the computers and changes and controls the operation modes of the computers when a computer is abnormal Connected via the above-mentioned computer, the computer is assigned an active computer in an operation mode for executing a processing request, and a primary spare computer that operates in synchronization with the active computer when the active computer is abnormal. In a system used as a secondary spare computer shared by a primary spare computer, a) when the centralized computer management device receives a notification from a timer that notifies the opportunity to periodically switch the operation mode of each computer, The process of restarting the timer and b) centralized computer The data management device selects the node addresses of the primary spare computer and the secondary spare computer from the computer operating state management means, and instructs the primary spare computer to change the operating mode between the secondary spare computer and the secondary spare computer. C) A step in which the primary spare computer, to which the operation mode change instruction is transmitted, transmits the information held by the computer and the operation mode change instruction to the primary spare computer to the designated secondary spare computer; Retained information sent from computer 2
The next computer stores the information, changes the operation mode to the primary spare computer, and sends the completion report to the primary spare computer that is the source of the retained information, and (e) receives the completion report from the secondary spare computer. The step of the primary spare computer changing the operation mode to the secondary spare and transmitting the completion report to the central computer management device; (f) the central computer management device receiving the completion report from the primary spare computer The step of receiving the notification, and g) the central computer management device selects the node address of the active computer and the secondary spare computer from the computer operating state management means, and the active computer is allowed to change the operation mode between the secondary spare computer. The step of instructing, and h) the active computer that has received the operation mode change instruction is A step of transmitting the information held by the computer and an instruction to change the operating mode to the active computer to the standby computer, and (b) the secondary standby computer to which the retained information is sent from the active computer stores the information and sets the operating mode to the active mode. Change and send the completion report to the active computer that is the sender of the retained information, and n) The active computer that received the completion report from the newly active computer changes the operation mode to secondary standby and concentrates. A step of transmitting a completion report to the computer management device; a step of receiving the notification from the timer by the central computer management device which has received the completion report from the newly secondary spare computer; When the computer error report is received, the timer is stopped, the operation mode change is interrupted, and the computer operation status management The step of selecting the node address of the secondary spare computer from the stage and instructing the active computer to generate a standby dual redundant configuration with the secondary spare computer; A step of transmitting the holding information of the computer and an instruction to change the operation mode to the working to the designated secondary spare computer, and (f) the secondary spare computer to which the holding information is sent from the working computer stores the information. The process of changing the operation mode to active and transmitting the completion report to the active computer that is the transmission source of the retained information, and y) The active computer that received the completion report from the newly active computer secondary reserves the operational mode. And sending a completion report to the central computer management device, and The centralized computer management device that has received the completion report receives the notification from the timer, and) when the backup of the abnormal computer is completed, restarting the timer and restarting the operation mode change. Redundant computer system switching method.