JPS633361A - Inter-computer monitoring method - Google Patents

Inter-computer monitoring method

Info

Publication number
JPS633361A
JPS633361A JP61144880A JP14488086A JPS633361A JP S633361 A JPS633361 A JP S633361A JP 61144880 A JP61144880 A JP 61144880A JP 14488086 A JP14488086 A JP 14488086A JP S633361 A JPS633361 A JP S633361A
Authority
JP
Japan
Prior art keywords
computer
computers
abnormality
message
receiving
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP61144880A
Other languages
Japanese (ja)
Inventor
Toshiharu Unotsu
宇之津 俊治
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hitachi Engineering Co Ltd
Hitachi Ltd
Original Assignee
Hitachi Engineering Co Ltd
Hitachi Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hitachi Engineering Co Ltd, Hitachi Ltd filed Critical Hitachi Engineering Co Ltd
Priority to JP61144880A priority Critical patent/JPS633361A/en
Publication of JPS633361A publication Critical patent/JPS633361A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To prevent the influence of abnormality of a certain computer from effecting to other ones by separating immediately the abnormal computer after the detection of the abnormality. CONSTITUTION:The timer device 17 of a computer 10 starts a monitor transmission device 11 in a fixed cycle. Thus the device 11 transmits a state monitor message to a computer 20 via a communication controller 16. The computer 10 cuts off the communication channel between both computers 10 and 20 when no answer message is received from the computer 20 even when a fixed time is lapsed. Furthermore a fact that the computer 20 has the abnormality is informed to a computer 30 that transfers a message to the computer 20. At the time of receiving said information on the abnormality of the computer 20, the computer 30 immediately interrupts the communication channel set between both computers 20 and 30. Thus it is possible to prevent the influence of abnormality of the computer 20 from effecting to the computer 30 in advance.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 本発明は、複数の計算機で構成される計算機システムに
係り、特に相互にメツセージ交換を行う網システムに好
適な計算機間の監視方法に関する。
DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a computer system composed of a plurality of computers, and particularly to a monitoring method between computers suitable for a network system that exchanges messages with each other.

〔従来の技術〕[Conventional technology]

従来の計算機関監視方法は、特開昭58−45053号
公報に記載のように、前記応答メツセージが許容時間以
内に返ってこなかった時に、該相手計算機に停止メツセ
ージを送信するとともに自計算機内に該相手計算機異常
を記憶する方法が知られている。
As described in Japanese Unexamined Patent Publication No. 58-45053, the conventional computer monitoring method, when the response message is not returned within a permissible time, sends a stop message to the other computer and also sends a message to the computer itself. A method of storing the abnormality of the other party's computer is known.

〔発明が解決しようとする問題点〕[Problem that the invention seeks to solve]

上記従来技術は、相手計算機の他計算機への異常の波及
を、自計算機から相手計算機に対し停止メツセージを送
信し、該相手計算機を停止させることで、防止しようと
するものである。
The above-mentioned conventional technology attempts to prevent an abnormality from spreading from the other computer to other computers by sending a stop message from the own computer to the other computer and stopping the other computer.

しかし、前記異常状態は、概ね、相手計算機内のソフト
ウェア故障が原因と考えられ、停止メツセージ自体も相
手計算機で受は付けられない場合が多く、又、受は付け
られたとしても他計算機は、相手計算機が停止したこと
を認識していない。
However, the above-mentioned abnormal state is generally considered to be caused by a software failure in the other computer, and the stop message itself is often not accepted by the other computer, and even if it is accepted, the other computer The other computer does not recognize that it has stopped.

本発明の目的は、上記従来技術の問題点を解消し、効果
的な計算機のメツセージ交換を提供することにある。
SUMMARY OF THE INVENTION An object of the present invention is to solve the problems of the prior art described above and to provide an effective message exchange between computers.

〔問題点を解決するための手段〕[Means for solving problems]

上記目的は、自計算機が相手計算機の異常を検知したな
らば、直ちに自計算機と相手計算機との、通信路をしゃ
断するとともに、あらかじめ定義され起他計算機に対し
、相手計算機の異常を連絡し、更に他計算機は該相手計
算機への異常連絡を受信後、直ちに相手計算機との通信
路をしゃ断することにより、計算機網上、相手計算機を
隔離し異常の波及を防止することにより達成される。
The above purpose is that when the own computer detects an abnormality in the other computer, it immediately cuts off the communication path between the own computer and the other computer, and also notifies the other computer defined in advance of the abnormality in the other computer. Furthermore, after receiving the abnormality notification to the other computer, the other computer immediately cuts off the communication path with the other computer, thereby isolating the other computer on the computer network and preventing the spread of the abnormality.

〔作用〕[Effect]

本発明は、次のように動作する。即ち、自計算機はあら
かじめ定義した周期で、あらかじめ定義した監視光相手
計算機に対し、状態監視メツセージを送信し、その応答
メツセージを受信し、該計算機関通信路が正常であるこ
とを確認する。又、あらかじめ定義した状態監視メツセ
ージ送信から応答メツセージ受信までの許容時間内に応
答メツセージが返ってこなければ、相手計算機異常を認
識し、該通信路をしゃ断すると共に、あらかじめ定義し
た関係する他計算機に、該相手計算機異常を連絡する。
The invention operates as follows. That is, the own computer transmits a status monitoring message to a predefined monitoring optical destination computer at a predefined period, receives a response message, and confirms that the computer communication channel is normal. Also, if a response message is not returned within the permissible time from sending the predefined status monitoring message to receiving the response message, it will recognize an abnormality in the other computer, cut off the communication path, and communicate with other related computers defined in advance. , informs the other party of the computer error.

連絡を受けた他計算機はそれぞれ、該相手計算機との通
信路をしゃ断する。
Each of the other computers that received the contact cuts off the communication path with the other computer.

それによって、計算機網システムは、自計算機切り離さ
れたとの同一認識の元に動作を開始することができる。
Thereby, the computer network system can start operating based on the same recognition that its own computer has been disconnected.

〔実施例〕〔Example〕

以下、本発明の一実施例を説明する。 An embodiment of the present invention will be described below.

第1図は、本発明にかかる計算種間監視方法を3台の計
算機について示したものである。3台の計算機A、B、
Cの構造は、同じものとする。
FIG. 1 shows the calculation type monitoring method according to the present invention for three computers. Three computers A, B,
The structure of C is assumed to be the same.

計算機AIOのタイマー装置17は、−定周期で監視用
送信装置11を起動する。監視用送信装置は、起動され
ると計算機B20状態監視用メツセージを、通信制御装
置16を介して計算機B20側に送信し、−定時間カウ
ントし、応答メツセージの受信を待つ、−方、計算機B
20は、状態監視用メツセージを受信すると直ちに応答
メツセージを返す。計算機A10は、応答メツセージを
受信すると、時間カウントをリセットし終了する。しか
し、−定時間カウントするも、計算機B20から応答メ
ツセージが返らなければ、タイムアウトとなり、計算機
AIO〜計算機820間の通信路をしゃ断し、更に計算
機B20とメツセ−ジ交換をしている計算機C30に対
し、計n、機B20が異常であることを連絡する。−方
、計算機C30は、計算機B20異常の連絡を受信する
と直ちに計算機C30〜計算機820間の通信路をしゃ
断する。
The timer device 17 of the computer AIO starts the monitoring transmitting device 11 at regular intervals. When the monitoring transmitting device is activated, it transmits a message for monitoring the status of computer B20 to the computer B20 side via the communication control device 16, - counts a fixed period of time, and waits for receiving a response message.
20 immediately returns a response message upon receiving the status monitoring message. When the computer A10 receives the response message, it resets the time count and ends the process. However, if no response message is returned from computer B20 even after counting for a fixed period of time, a timeout occurs and the communication path between computer AIO and computer 820 is cut off. In response, a total of n calls were made to inform that machine B20 was abnormal. - On the other hand, upon receiving the notification that computer B20 is abnormal, computer C30 immediately cuts off the communication path between computer C30 and computer 820.

第2図に概略動作フロー、第3図、第4図にそれぞれ正
常な場合、異常の場合のタイムチャートを示す。
FIG. 2 shows a schematic operation flow, and FIGS. 3 and 4 show time charts for normal and abnormal cases, respectively.

本実施例によれば、計算機システム上、異常な計算機B
20を切り離すことにより、該異常計算機B20の他計
算機C30に対する異常波及を未然に防止できるという
効果がある。
According to this embodiment, on the computer system, an abnormal computer B
20 has the effect of preventing the abnormality from spreading to other computers C30 from the abnormal computer B20.

〔発明の効果〕〔Effect of the invention〕

本発明によれば、異常検知後、直ちに異常計算機を切り
尊すことができるので、フェイルセーフな計算機システ
ムとすることができるという効果がある。
According to the present invention, since an abnormal computer can be immediately taken care of after an abnormality is detected, it is possible to provide a fail-safe computer system.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明の一実施例による計算種間監視方法を計
算機が3台の場合について具体的に示し゛視力法の概念
図、第3図は本発明に係る計算種間監視方法において、
相手計算機が正常な場合のタイムチャート、第4図は相
手計算機が異常な場合のタイムチャートである。 10・・・計算機X、2o・・・計算機Y、30・・計
算機2.11・・・監視用送信装置、12・・・監視用
受信装置、13・・・業務用送信装置、14・・・業務
用受信装置、15・・・業務プログラム、16・・・通
信制御装置、1〜3・・・通信路、51,61.71・
・・送信線、52・・・タイマー、53,63.73・
・・受信線、第 1 図 L Z  口
FIG. 1 is a conceptual diagram of a visual acuity method that specifically shows a calculation type monitoring method according to an embodiment of the present invention in the case of three computers.
FIG. 4 is a time chart when the other party's computer is normal, and FIG. 4 is a time chart when the other party's computer is abnormal. 10... Computer - Commercial reception device, 15... Business program, 16... Communication control device, 1 to 3... Communication path, 51, 61.71.
...Transmission line, 52...Timer, 53,63.73.
...Reception line, Figure 1 LZ port

Claims (1)

【特許請求の範囲】[Claims] 1、複数の計算機と該計算機間を接続する通信制御装置
とを備え、該複数計算機間で相互にメッセージを交換し
あう計算機システムにおいて、各計算機が一定周期で、
あらかじめ定義された相手計算機に対し、状態監視メッ
セージを送信し該状態監視メッセージを受信して返され
る応答メッセージを受信する手段と、他計算機から送信
された自計算機状態監視メッセージを受信して応答メッ
セージを送信する手段と、状態監視メッセージ送信から
応答メッセージ受信までの時間を計測し、あらかじめ定
義された許容時間になつても該相手計算機からの応答メ
ッセージが受信されない場合は、該相手計算機に対する
通信路をしや断すると同時に該相手計算機と通信路をも
つ他計算機に対し、該相手計算機異常を連絡する手段と
、他計算機から異常計算機の連絡を受信し、該異常計算
機との通信路をしや断する手段を有することを特徴とす
る機算機間監視方法。
1. In a computer system that includes a plurality of computers and a communication control device that connects the computers, and in which messages are exchanged between the plurality of computers, each computer periodically
A means for transmitting a status monitoring message to a predefined partner computer, receiving the status monitoring message, and receiving a response message returned, and a means for receiving a self-computer status monitoring message sent from another computer and receiving a response message. and a means for transmitting the status monitoring message and measuring the time from sending the status monitoring message to receiving the response message, and if a response message is not received from the target computer even after a predefined allowable time, a communication path to the target computer is established. At the same time, a means for notifying other computers that have a communication path with the other computer of the abnormality of the other computer, and a means for receiving notification of the abnormal computer from other computers and closing the communication path with the abnormal computer. 1. A method for monitoring between computers, comprising means for disconnecting the computer.
JP61144880A 1986-06-23 1986-06-23 Inter-computer monitoring method Pending JPS633361A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP61144880A JPS633361A (en) 1986-06-23 1986-06-23 Inter-computer monitoring method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP61144880A JPS633361A (en) 1986-06-23 1986-06-23 Inter-computer monitoring method

Publications (1)

Publication Number Publication Date
JPS633361A true JPS633361A (en) 1988-01-08

Family

ID=15372523

Family Applications (1)

Application Number Title Priority Date Filing Date
JP61144880A Pending JPS633361A (en) 1986-06-23 1986-06-23 Inter-computer monitoring method

Country Status (1)

Country Link
JP (1) JPS633361A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7243257B2 (en) 2002-05-14 2007-07-10 Nec Corporation Computer system for preventing inter-node fault propagation

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7243257B2 (en) 2002-05-14 2007-07-10 Nec Corporation Computer system for preventing inter-node fault propagation

Similar Documents

Publication Publication Date Title
CN101772007B (en) Improved total network signalling tracing system and method
JPS633361A (en) Inter-computer monitoring method
JPH04184656A (en) Automatic switching system
JPS6129966A (en) Monitoring method in exchange of message between computers
JP3341712B2 (en) Switching unit failure handling method
JP3638337B2 (en) Frame relay network and information transmission method in network
JPH10171769A (en) Composite computer system
JPH01276858A (en) Communication adaptor
JPH02105648A (en) Fault detecting line switching system
JPH03117242A (en) Retrial method for data transmission
JP2966579B2 (en) One-way call detection in packet-switched networks
JP2619401B2 (en) Transmission system configuration method
JPH0897937A (en) Method and device for processing accidental state occurrence notice
JPS63206051A (en) Method for processing interruption fault for communication line
JPH0344134A (en) Terminal controller
JPH01144744A (en) System for informing trouble in communication network block
JPS63100842A (en) No communication control system in data communication
JP2953183B2 (en) Maintenance operation module monitoring method
JPS62226742A (en) Polling transmission control system
JPH05206894A (en) Two-route transmission control system
JPH03233731A (en) Temperature abnormality processing system
JPH0457263B2 (en)
KR20000044303A (en) Method for managing link between network management center and mediation device
JPS63148352A (en) Monitor control system
JPH07200105A (en) Power-off system of decentralized processing system