JPH09186689A

JPH09186689A - Device state controlling method and data communication system

Info

Publication number: JPH09186689A
Application number: JP7341962A
Authority: JP
Inventors: Toshinobu Akitomi; 利伸秋冨
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1995-12-27
Filing date: 1995-12-27
Publication date: 1997-07-15
Anticipated expiration: 2015-12-27
Also published as: JP3419979B2

Abstract

PROBLEM TO BE SOLVED: To simultaneously control the device states of 'operation/standby/stop/ failure', etc., of the plural computers connected with a LAN in all the computers without a time difference. SOLUTION: All the device states 10 of all the computers A to D that a master computer A has is defined as the transmission data of health checks, the broadcast transmissions from a health check transmission mechanism 1 to all the devices are performed at a constant period or when a state is changed, and each computer B to D sends a reply by defining the its own device state as a response signal. The computer A judges the case where the response is not received from the computer B as a failure and the failure is set to the device state 12 of the computer B. By performing the broadcast transmissions for all the device states 10 at the time of the next broadcast transmission from the computer A, the computers C and D are capable of recognizing that the computer B is in a failure state and the device states can be simultaneously controlled in all the computers without a time difference.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】この発明は、ＬＡＮ等に接続
される複数の計算機の動作／待機／停止／故障等の装置
状態を、すべての計算機で時差なく同時に管理する装置
状態管理方法と、この方法を用いたデータ通信システム
に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a device state management method for simultaneously managing the device states such as operation / standby / stop / failure of a plurality of computers connected to a LAN etc. without any time difference, and The present invention relates to a data communication system using the method.

【０００２】[0002]

【従来の技術】例えば、図１３は特開昭６４−４６３４
５号公報に示された従来の状態管理方式を示す図であ
り、図において、１〜６はＬＡＮに接続された端末Ａ〜
Ｅであり、マスタ計算機６が各端末のレディ状態および
ダウン状態を管理するものである。2. Description of the Related Art For example, FIG.
It is a figure which shows the conventional state management system shown by the 5th publication, In the figure, 1-6 are terminals A-connected to LAN.
E, the master computer 6 manages the ready state and the down state of each terminal.

【０００３】図１３において、マスタ計算機はブロード
キャスト送信を行い、端末Ａ〜Ｅからの応答を待つ。各
端末はマスタ計算機からのブロードキャストによりヘル
スチェックを受信すると、識別番号等を設定してマスタ
計算機へ応答を返す。もし端末がダウンしていた場合は
応答が返されないため、マスタ計算機は端末をダウン状
態と判断する。In FIG. 13, the master computer performs broadcast transmission and waits for a response from the terminals A to E. When each terminal receives the health check by broadcasting from the master computer, it sets an identification number and returns a response to the master computer. If the terminal is down, no response is returned, so the master computer determines that the terminal is down.

【０００４】[0004]

【発明が解決しようとする課題】従来の状態管理方式は
以上のように動作するので、システムの監視を行うマス
タ計算機が必要であり、すべての端末の状態を把握して
いるのはマスタ計算機だけで、他の端末は別の端末の状
態がわからないという問題があった。Since the conventional state management system operates as described above, a master computer for monitoring the system is required, and only the master computer knows the states of all terminals. Then, there was a problem that other terminals do not know the status of another terminal.

【０００５】また、ルータ（伝送速度の異なる信号が中
継できる中継器で、一般にはブロードキャスト通信は出
来ないよう設定されている。）等を介して接続された端
末の場合は、ブロードキャスト通信ができなかったり、
応答が遅れてタイムアウトしてしまう場合があり、ヘル
スチェックできない場合があるという問題があった。Broadcast communication cannot be performed in the case of a terminal connected via a router (a relay that can relay signals having different transmission speeds and is generally set so that broadcast communication cannot be performed). Or
There was a problem that the response might be delayed and timed out, and it might not be possible to check the health.

【０００６】また、ブロードキャストは何らかの障害で
一時的に端末へ通知されない場合があり、その場合は、
当該端末を故障とみなしたり、状態変化が認識されない
という問題があった。In some cases, the broadcast may not be temporarily notified to the terminal due to some trouble. In that case,
There is a problem that the terminal is considered to be a failure and the state change is not recognized.

【０００７】また、マスタ計算機が故障した場合は端末
の状態管理ができないという問題があった。Further, when the master computer fails, the state of the terminal cannot be managed.

【０００８】ここで「装置の状態変化」とは、動作状態
の計算機と待機状態の計算機との切り替え（系切替）
や、「故障発生」「起動」等を起因として、その結果、
「停止←→動作」「動作←→待機」「待機←→停止」
「動作←→故障」等の装置（計算機）の状態が変化する
ことである。The term "device state change" as used herein means switching between a computer in an operating state and a computer in a standby state (system switching).
And, as a result of "failure occurrence""startup" etc.,
"Stop ← → Operation""Operation ← → Standby""Standby ← → Stop"
That is, the state of a device (computer) such as "operation ← → failure" changes.

【０００９】この発明は上記のような課題を解決するた
めになされたものであり、全計算機が、すべての計算機
の装置状態を、時差なく同時に管理できる装置状態管理
方法、および、この方法を用いたデータ通信システムを
得ることを目的とする。The present invention has been made in order to solve the above problems, and an apparatus state management method by which all computers can simultaneously manage the apparatus states of all the computers without a time difference, and this method is used. The purpose is to obtain the existing data communication system.

【００１０】また、ルータ装置等によりブロードキャス
ト通信が受信できない計算機や、通信回線速度が遅くデ
ータ送受信に時間がかかる計算機などがある場合でも、
ＬＡＮの構成に影響されず、装置状態管理ができること
を目的とする。Further, even if there is a computer that cannot receive broadcast communication due to a router device or a computer whose communication line speed is slow and data transmission / reception takes time,
The purpose is to be able to manage the device status without being affected by the LAN configuration.

【００１１】また、ブロードキャスト送信中に送信異常
が発生して、装置状態が通知されない場合でも、状態変
化が瞬時に通知され、かつ確実な装置状態管理ができる
ことを目的とする。It is another object of the present invention to be able to instantly be notified of a state change and to perform reliable device state management even when a device abnormality is not notified due to a transmission error during broadcast transmission.

【００１２】また、全装置状態の送信を行っていたマス
タ計算機が故障しても、システム全体の装置状態管理機
能を停止させずに、装置状態管理ができることを目的と
する。It is another object of the present invention to be able to perform device state management without stopping the device state management function of the entire system even if the master computer that has transmitted all device states fails.

【００１３】また、一般のデータ送信において、自計算
機が持つ他計算機の装置状態を利用して不要な応答タイ
ムアウト時間を無くしたり、送信先計算機の装置状態に
応じて論理的な通信切り離し制御する装置状態管理がで
きることを目的とする。Also, in general data transmission, a device for eliminating unnecessary response timeout time by utilizing the device state of another computer possessed by the own computer, or a device for logically disconnecting control according to the device state of the destination computer. The purpose is to be able to manage the state.

【００１４】[0014]

[Means for Solving the Problems]

（１）この発明に係る装置状態管理方法は、複数の計算
機をＬＡＮ等の通信回線に接続したシステムにあって、
一つの計算機がマスタとなり、このマスタ計算機は全て
の計算機の装置状態をヘルスチェックの送信データとし
て、周期的にまたは状態変化時に、スレーブ計算機に対
してブロードキャスト送信し、スレーブ計算機は上記受
信した全装置状態により、自計算機が保持する自計算機
以外の全装置状態を更新すると共に、自計算機の装置状
態を応答信号として返信し、マスタ計算機は上記応答信
号を受信し、応答を返さない計算機を故障状態とみなし
て全計算機の装置状態を更新し、この更新した全装置状
態を次回の送信データとする方法である。(1) A device state management method according to the present invention is a system in which a plurality of computers are connected to a communication line such as a LAN,
One computer becomes the master, and this master computer broadcasts the device status of all computers as health check transmission data to slave computers periodically or when the state changes, and the slave computers receive all the above devices. Depending on the status, all the device statuses other than the own computer held by the own computer are updated, and the device status of the own computer is returned as a response signal, and the master computer receives the above response signal and the computer that does not return a response is in a failure state. This is a method in which the device states of all the computers are updated and the updated all device states are used as the next transmission data.

【００１５】（２）また、上記（１）におてい、マスタ
計算機が周期的にヘルスチェック送信データをブロード
キャスト送信している場合、スレーブ計算機は上記送信
データが所定の時間受信されないと、異常発生とみなし
所定の対応処置をするようにした方法である。(2) Further, in the above (1), when the master computer periodically broadcasts the health check transmission data, the slave computer generates an error if the transmission data is not received for a predetermined time. This is a method in which it is considered that a predetermined countermeasure is taken.

【００１６】（３）また、上記（１）または（２）にお
いて、スレーブ計算機は自計算機に状態変化があると、
その状態変化をブロードキャスト送信するようにした方
法である。(3) Further, in the above (1) or (2), when the slave computer has a state change in its own computer,
In this method, the state change is broadcasted.

【００１７】（４）また、上記（１）または（２）にお
いて、スレーブ計算機は自計算機に状態変化があると、
その状態変化をブロードキャスト送信し、この送信から
所定時間後、１対１で他の計算機に上記状態変化を再送
信し、その応答信号の有無で他計算機の装置状態を確認
するようにした方法である。(4) Further, in the above (1) or (2), when the slave computer has a state change in its own computer,
By broadcasting the status change, after a predetermined time from this transmission, the status change is retransmitted to other computers one by one, and the device status of other computers is confirmed by the presence or absence of the response signal. is there.

【００１８】（５）また、上記（１）〜（４）のいずれ
か１項において、マスタ計算機はスレーブ計算機からの
応答信号が無かった場合、所定時間経過後、その計算機
に対して１対１で全装置状態を再送信して再確認するよ
うにした方法である。(5) Further, in any one of the above items (1) to (4), when there is no response signal from the slave computer, the master computer has a one-to-one correspondence with the computer after a lapse of a predetermined time. This is a method for retransmitting and reconfirming all device states.

【００１９】（６）また、上記（１）〜（５）のいずれ
か１項において、スレーブ計算機はヘルスチェックによ
る全装置状態を一定時間受信できなかった場合、予め決
められた順番によりスレーブ計算機の一つをマスタ計算
機として動作させるようにした方法である。(6) Further, in any one of the above items (1) to (5), when the slave computer cannot receive all the device states due to the health check for a certain period of time, the slave computer of the slave computer is in a predetermined order. This is a method in which one operates as a master computer.

【００２０】（７）また、上記（１）〜（６）のいずれ
か１項において、計算機は、ヘルスチェック以外の一般
のデータを送信する際、自計算機が保持する全装置状態
を参照し、故障・停止等の状態の計算機に対して、上記
送信データを送信しないようにした方法である。(7) In any one of the above items (1) to (6), the computer refers to all device states held by the computer when transmitting general data other than the health check, This is a method in which the transmission data is not transmitted to a computer in a state of failure / stop.

【００２１】（８）また、上記（１）〜（７）のいずれ
か１項の装置状態管理方法を用いたデータ通信システム
としたものである。(8) Further, the data communication system uses the apparatus state management method according to any one of the above (1) to (7).

【００２２】[0022]

BEST MODE FOR CARRYING OUT THE INVENTION

実施の形態１．以下、この発明の実施の形態１を図に基
づいて説明する。図１において、計算機Ａ〜ＤはＬＡＮ
に接続され、お互いの計算機が通信できるシステム構成
で、１は全装置状態１０をブロードキャスト送信し、そ
の応答として各計算機から個別の装置状態１１〜１４を
受信するヘルスチェック送信機構である。Embodiment 1 FIG. Hereinafter, a first embodiment of the present invention will be described with reference to the drawings. In FIG. 1, computers A to D are LANs.
1 is a health check transmission mechanism that broadcasts the entire device state 10 and receives individual device states 11 to 14 from each computer as a response in a system configuration in which the computers can communicate with each other.

【００２３】２は全装置状態１０を受信した時、自計算
機の装置状態を返信するヘルスチェック受信機構、３は
自計算機の装置状態をブロードキャスト送信する装置状
態送信機構である。１０は各計算機の装置状態１１〜１
４をまとめて保持する全装置状態で、メモリに保存され
ている。１１〜１４は計算機Ａ〜Ｄに対応する動作／待
機／停止／故障等の装置状態である。計算機Ａ〜Ｄは全
て同じ構成となっている。Reference numeral 2 is a health check receiving mechanism for returning the device state of the computer itself when receiving all device states 10. Reference numeral 3 is a device state transmitting mechanism for broadcasting the device state of the computer itself. 10 is the device state 11 to 1 of each computer
All device states that collectively hold 4 are stored in the memory. 11 to 14 are device states such as operation / standby / stop / failure corresponding to the computers A to D. All the computers A to D have the same configuration.

【００２４】次に動作について図２のフローチャートと
共に説明する。（１）あらかじめヘルスチェック送信機構１を作動させ
るように定められた計算機Ａ（マスタ計算機）は、自発
的に一定周期または状態変化時に全装置状態１０をブロ
ードキャスト送信する。（Ｓ１）（２）各計算機Ｂ〜Ｄはヘルスチェック受信機構２でブ
ロードキャスト信号を受信すれば応答する。この応答は
自計算機の装置状態を応答信号とするものである。Next, the operation will be described with reference to the flowchart of FIG. (1) The computer A (master computer), which is predetermined to operate the health check transmission mechanism 1, voluntarily broadcasts the entire device state 10 at a constant cycle or when the state changes. (S1) (2) Each of the computers B to D responds if the health check receiving mechanism 2 receives a broadcast signal. This response uses the device status of its own computer as a response signal.

【００２５】（３）計算機Ａは、ヘルスチェック送信機
構１で各計算機Ｂ〜Ｄの応答状態を受信し、各計算機Ｂ
〜Ｄからの応答があるか否かを判断する。（Ｓ３）（４）計算機Ａは各計算機Ｂ〜Ｄからの応答状態（装置
状態１１〜１４）を受信した結果、計算機Ｃ，Ｄから応
答があり、計算機Ｂから応答が無い場合、計算機Ｂの故
障をヘルスチェック送信機構１で検出し、計算機Ｂを故
障状態と設定し、全装置状態１０を更新する。（Ｓ４）(3) The computer A receives the response status of each of the computers B to D by the health check transmission mechanism 1, and each computer B receives the response state.
It is determined whether or not there is a response from ~ D. (S3) (4) As a result of receiving the response states (device states 11 to 14) from the computers B to D, the computer A responds from the computers C and D, and when there is no response from the computer B, the computer B The failure is detected by the health check transmission mechanism 1, the computer B is set to the failure state, and the all-device state 10 is updated. (S4)

【００２６】（５）これにより得られた全装置状態１０
を次回の送信データとして一定時間後に再びブロードキ
ャスト送信する。（Ｓ５）（６）一方、計算機Ｃ〜Ｄは全装置状態１０を受信する
と、ヘルスチェック受信機構２にて、自計算機以外の計
算機の装置状態を内部の全装置状態１０へ設定する。
（Ｓ６）(5) All device states 10 thus obtained
Is transmitted again as a next transmission data after a fixed time. (S5) (6) On the other hand, when the computers C to D receive the all-device state 10, the health check receiving mechanism 2 sets the device states of the computers other than the own computer to the internal all-device state 10.
(S6)

【００２７】（７）また、同時に計算機Ｃ〜Ｄは自計算
機の装置状態を送信元（計算機Ａ）へ返信する。（Ｓ
７）（８）計算機Ａは計算機Ｃ，Ｄの装置状態を受信し、そ
の装置状態を設定し、全装置状態１０を更新する。（Ｓ
８）(7) At the same time, the computers C to D return the device statuses of their own computers to the sender (computer A). (S
7) (8) Computer A receives the device states of computers C and D, sets the device states, and updates all device states 10. (S
8)

【００２８】この結果、計算機Ａと計算機Ｃ〜Ｄは同じ
全装置状態を持つことができ、計算機Ｂの故障を知るこ
とができる。As a result, the computer A and the computers C to D can have the same all device states, and the failure of the computer B can be known.

【００２９】次に計算機Ｂが故障状態から動作状態とな
った場合の動作を図３のフローチャートと共に説明す
る。（１）自計算機（計算機Ｂ）の故障が回復すると、（Ｔ
１）（２）自計算機（計算機Ｂ）の装置状態を装置状態送信
機構３にて、全装置へブロードキャスト送信する。（Ｔ
２）（３）他計算機Ａ，Ｃ，Ｄはヘルスチェック受信機構２
で受信し、計算機Ｂの「故障回復」を全装置状態に設定
する。（Ｔ３）Next, the operation when the computer B is changed from the failure state to the operation state will be described with reference to the flowchart of FIG. (1) When the failure of the computer (computer B) is recovered, (T
1) (2) The device status transmission mechanism 3 broadcasts the device status of its own computer (computer B) to all devices. (T
2) (3) Other computers A, C, D are health check receiving mechanism 2
Then, the “failure recovery” of the computer B is set to the state of all devices. (T3)

【００３０】これにより、計算機Ｂの装置状態がヘルス
チェックの周期とは関係なく状態変化時に全装置で同時
に把握される。As a result, the device state of the computer B can be grasped by all the devices at the same time when the state changes, regardless of the health check cycle.

【００３１】次に計算機Ａが故障した場合の処置を図４
のフローチャートと共に説明する。（１）計算機Ｂ〜Ｄはヘルスチェック受信機構２にて一
定時間内に計算機Ａから全装置状態１０が受信したか否
かを検出する。（Ｕ１）（２）計算機Ｂ〜Ｄは計算機Ａに故障が発生したか、ま
たは、ＬＡＮに故障が発生したとみなす。（３）これにより装置全体での通信不良をいち早く検出
でき、故障時の対応処置を実施することができる。例え
ば、外部警報や内部のソフトウエアへ通知しても良い。Next, FIG. 4 shows the measures to be taken when the computer A fails.
This will be described with reference to a flowchart of (1) The computers B to D detect whether the health check receiving mechanism 2 has received all the device states 10 from the computer A within a fixed time. (U1) (2) Computers B to D consider that a fault has occurred in computer A or a fault has occurred in LAN. (3) As a result, a communication failure in the entire device can be quickly detected, and a countermeasure for a failure can be taken. For example, an external alarm or internal software may be notified.

【００３２】このようにすることで、すべての計算機
が、時差なく最短時間で全計算機の装置状態を管理でき
る。By doing so, all the computers can manage the device states of all the computers in the shortest time without a time difference.

【００３３】以上のように実施の形態１では、ヘルスチ
ェック送信機構１が、すべての装置の装置状態をヘルス
チェックのデータとして送信するため、すべての装置が
時差なく同時に全装置状態を管理することができる。ま
た、ヘルスチェック受信機構２と装置状態送信機構３
が、ヘルスチェックの応答や自計算機の状態変化時に、
自装置状態を送信するので、すべての計算機が、１つの
計算機の状態変化を最短時間で得ることができる。As described above, in the first embodiment, the health check transmission mechanism 1 transmits the device states of all the devices as the data of the health check, so that all the devices manage the state of all the devices at the same time without any time difference. You can Also, the health check receiving mechanism 2 and the device status transmitting mechanism 3
However, at the time of a health check response or a change in the status of the computer,
Since the self-device status is transmitted, all computers can obtain the status change of one computer in the shortest time.

【００３４】実施の形態２．上記実施の形態１では、ヘ
ルスチェック送信機構１にて応答を返さない計算機を故
障状態とみなしていたが、ルータ装置を介したＬＡＮ構
成の場合には、ブロードキャスト通信ができない計算機
や、通信回線速度が遅くデータ送受信に時間がかかる計
算機などが存在する場合がある。Embodiment 2 In the first embodiment, a computer that does not return a response in the health check transmission mechanism 1 is regarded as a failure state. However, in the case of a LAN configuration via a router device, a computer that cannot perform broadcast communication or a communication line speed There may be a computer that is slow and takes time to send and receive data.

【００３５】この実施の形態は、このような場合でも装
置状態を把握することができるようにするものである。
図５はこの実施の形態の要部の構成図で、再認識機構４
を設けたものであり、図５のように、ルータを介してブ
ロードキャストが通信できない場合や、通信速度が遅い
低速回線のある場合に適用する。図６は動作のフローチ
ャートである。In this embodiment, the apparatus state can be grasped even in such a case.
FIG. 5 is a configuration diagram of a main part of this embodiment, in which the rerecognition mechanism 4
This is applied when broadcast cannot be communicated via a router or when there is a low-speed line with a slow communication speed, as shown in FIG. FIG. 6 is a flowchart of the operation.

【００３６】この場合の動作を説明する。（１）計算機Ａのヘルスチェック送信機構１からブロー
ドキャスト送信し、（Ｖ１）（２）各計算機Ｂ〜Ｄは、ヘルスチェック受信機構２で
ブロードキャスト信号を受信すれば応答する。（Ｖ２）The operation in this case will be described. (1) Broadcast transmission from the health check transmission mechanism 1 of the computer A, (V1) (2) Each of the computers B to D responds if the health check reception mechanism 2 receives the broadcast signal. (V2)

【００３７】（３）計算機Ａは、ヘルスチェック送信機
構１で各計算機Ｂ〜Ｄからの応答状態を受信し、各計算
機Ｂ〜Ｄからの応答が所定時間内にあったか否かを判断
する。（Ｖ３）（４）計算機Ａは、全装置状態１０のブロードキャスト
送信に対する応答が無かったり、応答が返るのが遅れた
りした計算機の装置状態を、すぐに故障状態とはせず、
もう一度当該計算機に対し、全装置状態１０を１対１で
送信し、（Ｖ４）(3) The computer A receives the response status from each of the computers B to D by the health check transmission mechanism 1 and judges whether or not the response from each of the computers B to D is within a predetermined time. (V3) (4) The computer A does not immediately set the device state of the computer, which has received no response to the broadcast transmission of the entire device state 10 or has a delayed response, to a failure state,
Send all device statuses 10 to the computer once more, and send (V4)

【００３８】（５）再確認機構４は、所定時間内に応答
があったか否かを判断する。（Ｖ５）この再確認機構４
は、応答のタイムアウト値を多めにし、一定時間間隔で
ＬＡＮの通信負荷を上げない程度で再ヘルスチェックす
る。(5) The reconfirmation mechanism 4 judges whether or not there is a response within a predetermined time. (V5) This reconfirmation mechanism 4
Performs a re-health check by increasing the response timeout value and not increasing the LAN communication load at regular time intervals.

【００３９】（６）計算機Ａのヘルスチェック送信機構
１で各計算機Ｂ〜Ｄの応答状態を受信した結果、計算機
Ｂは応答あり、計算機Ｃは１対１の送信後応答あり、計
算機Ｂは１対１で所定時間経っても応答が無い場合、計
算機Ｂの故障を検出し、計算機Ｂを故障状態と設定し、
全装置状態１０を更新する。（Ｖ６）（７）ステップＶ７〜Ｖ１０は実施の形態１の図２のス
テップＳ５〜Ｓ８と同様であるので説明を省略する。(6) As a result of the health check transmission mechanism 1 of the computer A receiving the response states of the computers B to D, the computer B has a response, the computer C has a one-to-one post-transmission response, and the computer B has 1 If there is no response after a lapse of a predetermined time in the pair 1, the failure of the computer B is detected, the computer B is set to the failure state,
Update all device state 10. (V6) (7) Steps V7 to V10 are the same as steps S5 to S8 of the first embodiment shown in FIG.

【００４０】上記ステップＶ３〜Ｖ６で説明したよう
に、本当に故障した計算機は応答を返さないので故障状
態とみなす。また、遅れて応答を返した計算機は、その
装置状態を有効とし、故障状態とは見なさない。ブロー
ドキャスト通信ができない計算機は、この再確認機構に
よって１対１の通信は可能なので応答を返すため、その
装置情報は有効となる。As described in steps V3 to V6 above, the computer that has really failed does not return a response, so it is considered to be in a failed state. A computer that returns a response with a delay validates the device state and does not regard it as a failure state. A computer that cannot perform broadcast communication returns a response because one-to-one communication is possible by this reconfirmation mechanism, so that the device information is valid.

【００４１】以上のように、再確認機構４がヘルスチェ
ックに応答しない計算機に対して、通常の１対１通信に
てヘルスチェックを行うので、ルータを介した計算機な
どでブロードキャスト通信ができないものや、回線速度
が遅く応答がタイムアウトしてしまうものに対してもヘ
ルスチェックが可能となり、ＬＡＮの構成に依存せず、
任意の計算機構成に対応した装置状態管理ができる。As described above, since the reconfirmation mechanism 4 performs the health check on the computer which does not respond to the health check by the normal one-to-one communication, the computer which is through the router cannot perform the broadcast communication. , Health check is possible even if the line speed is slow and the response times out, regardless of the LAN configuration,
It is possible to manage the device status corresponding to any computer configuration.

【００４２】実施の形態３．上記実施の形態１では、装
置状態送信機構３において、例えば計算機Ａの自装置状
態が変化した時に、ブロードキャスト送信して状態変化
を通知していた。しかし、系切替（動作状態の計算機と
待機状態の計算機との切替え）等による状態変化の場合
は、当該計算機は新しい装置状態で即時に動作しなけれ
ばならないため、ブロードキャスト送信の応答を待たず
に動作することになる。そのため本当に全計算機へ装置
状態の変化が通知されたか不明となる。この実施の形態
はこのような事態を解決するものである。Embodiment 3. In the first embodiment, the device status transmission mechanism 3 broadcasts and notifies the status change when, for example, the status of the computer A itself changes. However, in the case of a state change due to system switching (switching between a computer in the operating state and a computer in the standby state), the computer must immediately operate in the new device state, so do not wait for the broadcast transmission response. It will work. Therefore, it is unknown whether or not all computers have been notified of the change in device status. This embodiment solves such a situation.

【００４３】図７はこの実施の形態の要部を示す構成図
で、３１の装置状態送信機構を設けたものである。図８
は動作のフローチャートである。FIG. 7 is a block diagram showing the main part of this embodiment, in which 31 device status transmission mechanisms are provided. FIG.
Is a flowchart of the operation.

【００４４】次に動作を説明する。（１）マスタ計算機Ａは、系切替等の状態変化を検知す
れば、（Ｗ１）（２）計算機Ａは、自装置状態をヘルスチェック送信機
構１からブロードキャスト送信する。（Ｗ２）Next, the operation will be described. (1) When the master computer A detects a state change such as system switching, (W1) (2) The computer A broadcasts its own state from the health check transmitting mechanism 1. (W2)

【００４５】（３）ブロードキャスト送信後、計算機Ａ
は、装置状態送信機構３１から１対１で計算機Ｂ〜Ｄへ
自装置状態を送信する。（Ｗ３）（４）計算機Ａは装置状態送信機構３１で、各計算機Ｂ
〜Ｄからの応答を個別に受信して、応答があったか否か
を判断する。（Ｗ４）(3) Computer A after broadcast transmission
Sends the device status from the device status transmission mechanism 31 to the computers B to D on a one-to-one basis. (W3) (4) Computer A is the device status transmission mechanism 31, and each computer B
Each of the responses from D to D is individually received and it is determined whether or not there is a response. (W4)

【００４６】（５）計算機Ａのヘルスチェック送信機構
１で各計算機Ｂ〜Ｄの応答状態を受信した結果、計算機
Ｃ，Ｄから応答があり、計算機Ｂから応答がない場合、
計算機Ｂの故障を検出し、計算機Ｂを故障状態と設定
し、全装置状態１０を更新する。（Ｗ５）（６）計算機Ａは、全計算機の装置状態を、次回のブロ
ードキャスト送信で通知する。（Ｗ６）(5) When the health check transmission mechanism 1 of the computer A receives the response states of the computers B to D as a result, there is a response from the computers C and D, but no response from the computer B,
The failure of the computer B is detected, the computer B is set to the failure state, and the all-device state 10 is updated. (W5) (6) The computer A notifies the device states of all the computers by the next broadcast transmission. (W6)

【００４７】このように偶発的な障害によるブロードキ
ャスト送信の不到達が発生し、他計算機へ通知されない
場合でも、後から一定時間間隔で１対１で全装置へ再送
信するようにしたので、計算機Ａの装置状態が各計算機
で不一致になることを回避し、ＬＡＮの通信負荷をあげ
ることなく、系切替等の即時性が必要で重要な装置状態
変化を確実に通知して、信頼性の高い装置状態管理が可
能となる。Even when the non-arrival of the broadcast transmission occurs due to the accidental failure and the other computers are not notified in this way, the data is retransmitted to all the devices at a fixed time interval one-to-one after that. Avoiding inconsistencies in the device status of A on each computer, reliable notification of important device status changes that require immediacy such as system switching without increasing the communication load on the LAN, and are highly reliable. The device status can be managed.

【００４８】実施の形態４．この実施の形態は、ヘルス
チェックを行うマスタ計算機が故障しても、他の計算機
をマスタ計算機としてヘルスチェック送信を引き継ぐよ
うにしたものである。Embodiment 4 In this embodiment, even if the master computer that performs the health check fails, another computer is used as the master computer to take over the health check transmission.

【００４９】図９はこの実施の形態の要部の構成図であ
り、ヘルスチェック送信引継機構５を設けたものであ
る。図１０はマスタ計算機が故障した場合の動作のフロ
ーチャートである。次に動作を説明する。（１）計算機Ｂ〜Ｄ（スレーブ計算機）は計算機Ａ（マ
スタ計算機）から全装置状態を指定時間内に受信しない
場合、（Ｘ１）（２）予め定めた次引継計算機Ｂは、ヘ
ルスチェック送信引継機構５により全装置状態をブロー
ドキャスト送信する。FIG. 9 is a block diagram of the essential parts of this embodiment, in which a health check transmission takeover mechanism 5 is provided. FIG. 10 is a flowchart of the operation when the master computer fails. Next, the operation will be described. (1) When computers B to D (slave computers) do not receive all the device states from computer A (master computer) within the specified time, (X1) (2) The predetermined next takeover computer B takes over the health check transmission. The mechanism 5 broadcasts all device states.

【００５０】このようにして、計算機Ａが故障しても全
装置状態の管理を継続して実施することができる。な
お、ヘルスチェック送信引継機構５では、計算機Ａが故
障した場合に引継を行う計算機の順番を、予め決めてお
き、ヘルスチェック受信機構２でのヘルスチェック未受
信時間を優先順位の高い計算機ほど短くしておく。ヘル
スチェック未受信時間が計算機ごとに少しずつ違うこと
で、複数計算機が同時にヘルスチェック送信を引き継ぐ
ことを避け、順番通りにヘルスチェックを引き継ぐこと
ができる。In this way, even if the computer A fails, it is possible to continuously manage the state of all devices. In the health check transmission and takeover mechanism 5, the order of the computers to take over when the computer A fails is determined in advance, and the health check non-reception time in the health check receiving mechanism 2 is shortened in the higher priority computer. I'll do it. Since the time when the health check is not received differs slightly for each computer, it is possible to avoid taking over the health check transmission from multiple computers at the same time and take over the health check in order.

【００５１】このヘルスチェック引継機構５により、ど
の計算機もマスタ計算機になることができ、マスタ計算
機が不要となる。また、システム全体の装置状態管理が
滞ることなく継続でき、システム全体の信頼性を向上さ
せることができる。With this health check transfer mechanism 5, any computer can be a master computer, and the master computer is not required. Further, the device state management of the entire system can be continued without delay, and the reliability of the entire system can be improved.

【００５２】実施の形態５．この実施の形態は、故障等
のデータ送信の不要な計算機に対してデータ送信を行わ
ないようにし、通信負荷を軽減し、データ送信の効率の
向上を図るものである。Embodiment 5 FIG. In this embodiment, data transmission is not performed to a computer that does not need data transmission due to a failure or the like, the communication load is reduced, and the efficiency of data transmission is improved.

【００５３】図１１はこの実施の形態の要部の構成図で
あり、データ送信制御機構６を設けたものである。図１
２は動作のフローチャートである。FIG. 11 is a block diagram of the essential parts of this embodiment, in which a data transmission control mechanism 6 is provided. FIG.
2 is a flowchart of the operation.

【００５４】このデータ送信制御機構６につて説明する
と、実施の形態１では、各計算機の動作／待機／停止／
故障状態を全装置が管理しているが、図１１に示す通
り、データ送信制御機構６は、計算機からの一般のデー
タを送信する場合、送信先計算機の装置状態（動作・故
障・停止など）を参照して、故障状態や停止状態であっ
た場合には、データ送信せずに即時にエラーを返却する
などの、理論的な切り離し制御を行う。The data transmission control mechanism 6 will be described. In the first embodiment, the operation / standby / stop / stop of each computer is performed.
Although the failure status is managed by all the devices, as shown in FIG. 11, when sending general data from the computer, the data transmission control mechanism 6 is in the device status of the destination computer (operation, failure, stop, etc.). In the case of a failure state or a stop state, theoretical disconnection control is performed, such as immediately returning an error without transmitting data.

【００５５】図１２のフローチャートで説明すると、（１）計算機Ａから一般のデータ送信を実施する際、計
算機Ａのデータ送信制御機構６は全装置状態１０を参照
する。（Ｚ１）（２）参照の結果、故障計算機または停止計算機がある
か否かを判断し、（Ｚ２）（３）故障計算機または停止計算機に対してはデータ送
信せず、（Ｚ３）（４）動作計算機に対しては、通常のデータ送信をす
る。（Ｚ４）The explanation will be given with reference to the flow chart of FIG. 12. (1) When general data transmission is performed from the computer A, the data transmission control mechanism 6 of the computer A refers to the all-device state 10. (Z1) As a result of the reference (2), it is determined whether or not there is a failure computer or a shutdown computer. (Z2) (3) Data is not transmitted to the failure computer or the shutdown computer, and (Z3) (4) Normal data transmission is performed to the motion computer. (Z4)

【００５６】このように計算機Ａから故障中の計算機Ｃ
への一般のデータ送信時に、無駄な応答タイムアウト処
理を無くし、通信処理の負荷を減少させることができ
る。また、例えば、計算機Ａから計算機Ｄへの送信で、
通信はできるが計算機Ｄが停止状態ならば、あえて送信
せずにエラーとすることで、計算機Ｄを理論的に切り離
し、特定の計算機Ｄの試験作業や保守作業を、システム
に影響を与えることなく円滑に行えるようにすることが
できる。In this way, computer A is in failure and computer C is in failure.
It is possible to eliminate unnecessary response time-out processing and reduce the load of communication processing when general data is transmitted to. Also, for example, in the transmission from computer A to computer D,
If communication is possible, but the computer D is in a stopped state, the computer D is theoretically separated by not sending the data and making an error, and the test work and maintenance work of a specific computer D can be performed without affecting the system. It can be done smoothly.

【００５７】以上のように、通常送信時に送信先装置の
装置状態を判断し、論理的に切り離して、故障ならばデ
ータ送信せずに即エラーとするようにしたので、無駄な
応答タイムアウト処理が無くなり、また、計算機の試験
や保守作業をする場合、システムに影響を与えずに行う
ことが可能となる。As described above, the device state of the destination device is judged during normal transmission and logically separated, and if there is a failure, an error is immediately sent without data transmission. Moreover, it becomes possible to perform computer testing and maintenance work without affecting the system.

[Brief description of the drawings]

【図１】この発明の実施の形態１による装置状態管理
システムの構成図である。FIG. 1 is a configuration diagram of a device state management system according to a first embodiment of the present invention.

【図２】この発明の実施の形態１による動作のフロー
チャートである。FIG. 2 is a flowchart of the operation according to the first embodiment of the present invention.

【図３】この発明の実施の形態１による故障復帰時の
フローチャートである。FIG. 3 is a flowchart at the time of failure recovery according to the first embodiment of the present invention.

【図４】この発明の実施の形態１によるホスト計算機
が故障したときのフローチャートである。FIG. 4 is a flowchart when the host computer according to the first embodiment of the present invention fails.

【図５】この発明の実施の形態２による装置状態管理
システムの構成図である。FIG. 5 is a configuration diagram of a device state management system according to a second embodiment of the present invention.

【図６】この発明の実施の形態２による再認識機構の
フローチャートである。FIG. 6 is a flowchart of a re-recognition mechanism according to the second embodiment of the present invention.

【図７】この発明の実施の形態３による装置状態管理
システムの構成図である。FIG. 7 is a configuration diagram of an apparatus state management system according to Embodiment 3 of the present invention.

【図８】この発明の実施の形態３によるホスト計算機
の状態変化時のフローチャートである。FIG. 8 is a flowchart when the state of the host computer changes according to the third embodiment of the present invention.

【図９】この発明の実施の形態４による装置状態管理
システムの構成図である。FIG. 9 is a configuration diagram of an apparatus state management system according to Embodiment 4 of the present invention.

【図１０】この発明の実施の形態４によるマスタ計算
機が故障したときのフローチャートである。FIG. 10 is a flowchart when the master computer according to Embodiment 4 of the present invention fails.

【図１１】この発明の実施の形態５による装置状態管
理システムの構成図である。FIG. 11 is a configuration diagram of a device state management system according to a fifth embodiment of the present invention.

【図１２】この発明の実施の形態５による故障・停止
計算機への送信中止のフローチャートである。FIG. 12 is a flowchart for stopping transmission to a failure / stop computer according to the fifth embodiment of the present invention.

【図１３】従来の装置状態管理システムの構成図であ
る。FIG. 13 is a configuration diagram of a conventional device state management system.

[Explanation of symbols]

１ヘルスチェック送信機構、２ヘルスチェック受信
機構、３、３１装置状態送信機構、４再確認機構、
５ヘルスチェック送信引継機構、６データ送信制御機
構１０全装置状態、１１計算機Ａの装置状態、１２
計算機Ｂの装置状態、１３計算機Ｃの装置状態、１４
計算機Ｄの装置状態。1 health check transmission mechanism, 2 health check reception mechanism, 3, 31 device status transmission mechanism, 4 reconfirmation mechanism,
5 Health Check Transmission Handover Mechanism, 6 Data Transmission Control Mechanism 10 All Device State, 11 Computer A Device State, 12
Device status of computer B, 13 Device status of computer C, 14
Device status of computer D.

Claims

[Claims]

1. In a system in which a plurality of computers are connected to a communication line such as a LAN, one computer serves as a master, and this master computer periodically uses the device states of all the computers as health check transmission data. Or when the status changes, broadcast transmission to the slave computer, the slave computer, by the received all device status,
All the device statuses other than the own computer held by the own computer are updated, the device status of the own computer is returned as a response signal, and the master computer receives the above response signal and considers the computer that does not return a response as a failure state. A device state management method in which the device states of all computers are updated and the updated all device states are used as the next transmission data.

2. The apparatus state management method according to claim 1, wherein when the master computer periodically broadcasts health check transmission data, the slave computer is abnormal if the transmission data is not received for a predetermined time. A device status management method in which it is regarded as an occurrence and a prescribed countermeasure is taken.

3. The apparatus state management method according to claim 1, wherein when the slave computer has a state change in its own computer, the slave computer broadcasts the state change.

4. The apparatus state management method according to claim 1, wherein when the slave computer has a state change in its own computer, the slave computer broadcasts the state change, and after a predetermined time from this transmission, a pair of A device status management method in which the status change is retransmitted to another computer in 1 and the device status of the other computer is confirmed by the presence or absence of a response signal.

5. The apparatus state management method according to claim 1, wherein the master computer, when there is no response signal from the slave computer, waits for a predetermined period of time and then sends a pair to the computer. A device status management method in which all device statuses are retransmitted and reconfirmed in step 1.

6. The device state management method according to claim 1, wherein when the slave computer cannot receive all device states by health check for a certain period of time, the slave computer is in a predetermined order. A device status management method that operates one of the above as a master computer.

7. The apparatus state management method according to claim 1, wherein the computer refers to all apparatus states held by the computer when transmitting general data other than a health check. A device state management method in which the transmission data is not transmitted to a computer in a state such as a failure or a stop.

8. A data communication system using the device status management method according to claim 1.