JPH08305600A

JPH08305600A - Supervisory and diagnostic device for fault state of computer system

Info

Publication number: JPH08305600A
Application number: JP7132848A
Authority: JP
Inventors: Hiromichi Hori; 裕通堀
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1995-05-02
Filing date: 1995-05-02
Publication date: 1996-11-22
Anticipated expiration: 2018-11-10
Also published as: JP3464846B2

Abstract

PURPOSE: To provide the supervisory and diagnostic device for a fault state of a computer system in which a fault occurrence location is presented visually by a module configuration diagram of a computer system, the processing with respect to the fault is aided and control data of the fault state are recorded. CONSTITUTION: The device is provided with a fault location detection database D2 cross-referencing a fault signal and a module of a computer system, a fault state storage table T2 storing a fault state for each module, fault location detection processing means 2a, 2b detecting a module at a fault location from the fault signal and the fault location detection database, and fault output processing sections 3a, 3b displaying the result of detection as a module configuration diagram on a display device.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、計算機システムの障害
発生時における障害状況を診断すると共に、障害影響状
況を提示する計算機システムの障害状態の監視診断装置
に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a monitoring and diagnosing device for a fault condition of a computer system, which diagnoses a fault condition when a fault occurs in the computer system and presents a fault influence condition.

【０００２】[0002]

【従来の技術】従来の計算機システムを図３４のブロッ
ク図を基に説明する。なおここでは、１系、２系を備え
た２重化計算機システムを仮定する。2. Description of the Related Art A conventional computer system will be described with reference to the block diagram of FIG. It is assumed here that a dual computer system having one system and two systems.

【０００３】従来の計算機システムＡには、２重化され
た１系計算機演算処理装置ａ及び２系計算機演算処理装
置ｂと、ＣＲＴ表示装置２１と、プリンタ装置２２と、
状態監視装置２３とを有する。１系及び２系計算機演算
処理装置ａ，ｂは、自計算機演算処理装置内で障害が発
生すると障害信号を発生させ、その障害信号を自計算機
演算処理装置内の１系障害信号処理部１ａ及び２系障害
信号処理部１ｂに出力する。各障害信号処理部１ａ，１
ｂはその障害信号を１系障害出力処理部３ａ及び２系障
害出力処理部３ｂに出力する。各障害出力処理部３ａ，
３ｂは、障害信号に応じた障害発生状況のメッセージを
ＣＲＴ表示装置２１、プリンタ装置２２、及び状態監視
装置２３にそれぞれ出力する。In the conventional computer system A, a duplicated system 1 computer arithmetic processing device a and system 2 computer arithmetic processing device b, a CRT display device 21, a printer device 22,
And a state monitoring device 23. The 1-system and 2-system computer arithmetic processing devices a and b generate a fault signal when a fault occurs in the self-computer arithmetic processing device, and the fault signal is transmitted to the 1-system fault signal processing unit 1a in the self-computer arithmetic processing device. The signal is output to the system 2 fault signal processing unit 1b. Each fault signal processing unit 1a, 1
b outputs the fault signal to the 1-system fault output processing unit 3a and the 2-system fault output processing unit 3b. Each failure output processing unit 3a,
3b outputs a message of a failure occurrence status corresponding to the failure signal to the CRT display device 21, the printer device 22, and the status monitoring device 23, respectively.

【０００４】計算機システムの運転監視員は、これらの
各装置、特に状態監視装置２３を監視することにより、
計算機システムに発生した障害を監視する。The operation supervisor of the computer system monitors each of these devices, especially the status monitor 23,
Monitor the failures that have occurred in the computer system.

【０００５】[0005]

【発明が解決しようとする課題】しかし、従来の運転監
視員に表示される障害発生表示内容は、計算機システム
の障害箇所のみであったため、計算機システムの運転監
視員は、計算機システムの障害発生箇所が計算機システ
ムのハードウェア上のどのモジュール部分であるのかを
視覚的に、すなわちモジュール構成図上で知ることがで
きず、またその障害が計算機システムの機能上特にどの
範囲で影響を与えるものなのかも知ることができなかっ
た。そのため、その障害への対応処理の判断を的確に実
施出来なかった。However, since the fault occurrence display content displayed to the conventional operation supervisor is only the fault location of the computer system, the operation supervisor of the computer system determines the fault occurrence location of the computer system. It is not possible to visually know which module part on the hardware of the computer system, that is, on the module configuration diagram, and whether the failure affects the function of the computer system, especially in what range. I couldn't know. Therefore, it was not possible to accurately carry out the judgment of the response processing for the failure.

【０００６】そこで本発明の目的は、計算機システムに
障害が発生時に、その障害の発生箇所を計算機システム
のモジュール構成図によって視覚的に提示すると共に、
その障害に対する処置を支援し、その障害状況の管理デ
ータを記録しうる計算機システムの障害状態の監視診断
装置を提供することを目的とする。Therefore, an object of the present invention is to visually present, when a failure occurs in a computer system, the location of the failure by means of a module configuration diagram of the computer system.
It is an object of the present invention to provide a monitoring and diagnosing device for a failure state of a computer system, which can support the processing for the failure and record management data of the failure state.

【０００７】[0007]

【課題を解決するための手段】本発明の請求項１に係る
計算機システムの障害状態の監視診断装置は、計算機シ
ステムの障害発生時に、その障害信号を入力して、計算
機システムの障害発生状況を周辺装置に出力する計算機
システムの障害状態の監視診断装置であって、障害信号
と計算機システムを構成するモジュールとを対応づける
障害箇所検出データベースと、各モジュール毎に障害状
態を保存しておく障害状態保存テーブルと、障害信号を
障害信号処理手段を介して入力し、障害箇所検出データ
ベースから障害モジュールを検出し、その検出結果を障
害状態保存テーブルに保存する障害箇所検出処理手段
と、検出結果が保存された障害状態保存テーブルをＣＲ
Ｔ表示装置上にモジュール構成図としてグラフィックに
表示すると共に、検出された障害モジュールをＣＲＴ表
示装置上に強調して表示する障害出力処理手段とを備え
ている。According to a first aspect of the present invention, there is provided a computer system fault condition monitoring and diagnosing apparatus which, when a computer system fault occurs, inputs a fault signal to determine a fault occurrence state of the computer system. A fault diagnosis device for monitoring a fault condition of a computer system that outputs to a peripheral device, a fault location detection database that associates a fault signal with a module that constitutes the computer system, and a fault condition that stores the fault condition for each module The storage table and the failure signal are input via the failure signal processing means, the failure module is detected from the failure location detection database, and the detection result is stored in the failure state storage table. CR of the saved failure status storage table
A failure output processing means for graphically displaying a module configuration diagram on the T display device and for displaying the detected failure module in an emphasized manner on the CRT display device is provided.

【０００８】本発明の請求項２に係る計算機システムの
障害状態の監視診断装置は、障害信号とその障害信号に
対応するモジュールの障害により影響を受けるモジュー
ルとを対応付ける障害影響箇所検出データベースと、各
モジュール毎に他のモジュールの障害により自モジュー
ルが影響を受けるか否かを保存する障害状態保存テーブ
ルと、障害箇所検出処理手段により検出された障害モジ
ュールを入力し、障害影響箇所検出データベースから障
害による影響を受けるモジュールを検出し、そのモジュ
ールの状態を障害影響状態保存テーブルに保存する障害
影響箇所検出処理手段とを備え、請求項１に記載の障害
出力処理手段が、前記検出された障害モジュールを前記
モジュール構成図上に強調して出力するものであること
を特徴とする。According to a second aspect of the present invention, there is provided a fault diagnosis and diagnosis apparatus for a fault condition of a computer system, and a fault influence point detection database for associating a fault signal with a module affected by a fault of a module corresponding to the fault signal, For each module, enter the failure state storage table that stores whether or not the own module is affected by the failure of another module, and the failure module detected by the failure location detection processing means, and enter the failure from the failure impact location detection database. The fault output processing unit according to claim 1, further comprising: a fault-affected portion detection processing unit that detects an affected module and stores the state of the module in a fault-affected state storage table. It is characterized in that the output is emphasized on the module configuration diagram.

【０００９】本発明の請求項３に係る計算機システムの
障害状態の監視診断装置は、障害モジュール又は他のモ
ジュールで発生した障害の影響を受けるモジュールによ
り完全に使用不可能となる部分を計算機システムの機能
毎に記述している完全機能使用不可検出データベース
と、障害モジュール又は他のモジュールで発生した障害
の影響を受けるモジュールにより部分的に使用不可能と
なる部分を計算機システムの機能毎に記述している部分
機能使用不可検出データベースと、計算機システムの機
能毎にその機能の状態を保存している完全使用不可状態
保存テーブル及び部分使用不可状態保存テーブルと、障
害箇所検出処理手段により検出された障害箇所及び障害
影響箇所検出処理手段により検出された障害影響箇所を
入力し、完全機能使用不可検出データベース及び部分機
能使用不可検出データベースを基に、完全に使用不可能
となる機能及び部分的に使用不可能となる機能を検出
し、完全使用不可状態保存テーブル及び部分使用不可状
態保存テーブルにそれぞれ記録する障害機能検出処理手
段とを備え、請求項２に記載の障害出力処理手段が、前
記完全使用不可状態保存テーブル及び部分使用不可不可
状態保存テーブルの内容を前記ＣＲＴ表示装置に表示す
るものであることを特徴とする。According to a third aspect of the present invention, there is provided a computer system fault condition monitoring and diagnosing apparatus, wherein a portion of the computer system that is completely unusable by a fault module or a module affected by a fault occurring in another module is used. The complete function unusability detection database that describes each function and the part that is partially unusable due to the module affected by the failure module or the failure that occurred in another module are described for each function of the computer system. Partial function unusability detection database, complete unusable status storage table and partial unusable status storage table that stores the status of each function of the computer system, and failure location detected by failure location detection processing means And the failure-affected area detected by the failure-affected area detection processing means is input to use the full function. Based on the non-detection database and partial function non-usability detection database, it detects functions that are completely unusable and partially unusable, and creates a complete unusable state saving table and a partial unusable state saving table. 3. Fault function detection processing means for respectively recording, wherein the fault output processing means according to claim 2 displays the contents of the complete unusable state storage table and the partial unusable state storage table on the CRT display device. Is characterized in that.

【００１０】本発明の請求項４に係る計算機システムの
障害状態の監視診断装置は、障害箇所、障害影響箇所、
及び障害箇所の障害により影響を受ける機能から障害要
因を調査するために必要とされる調査項目及び障害から
復帰するための一次的な対応手順を記述した一次対応手
順データベースと、障害発生箇所、障害影響箇所、及び
障害箇所の障害により影響を受ける機能を入力し、一次
対応手順データベースを基に、調査項目及び一次的な対
応手順を検出する一次対応手順検出処理手段とを備え、
請求項３に記載の障害出力処理手段が、前記一次対応操
作検出手段により検出された調査項目及び一次的対応手
順をＣＲＴ表示装置又はプリンタ装置に表示するもので
あることを特徴とする。According to a fourth aspect of the present invention, there is provided a monitoring and diagnosing apparatus for a fault condition of a computer system,
And the primary response procedure database that describes the investigation items required to investigate the cause of the failure from the function affected by the failure at the failure location and the primary response procedure for recovering from the failure, and the location where the failure occurred and the failure The affected part and the function affected by the failure at the failure part are input, and the primary response procedure detection processing means for detecting the survey item and the primary response procedure based on the primary response procedure database is provided.
According to a third aspect of the present invention, the failure output processing means displays the investigation item and the primary response procedure detected by the primary response operation detection means on a CRT display device or a printer device.

【００１１】本発明の請求項５に係る計算機システムの
障害状態の監視診断装置は、障害箇所、障害影響箇所、
及び障害箇所の障害により影響を受ける機能を障害履歴
データ記録テーブルに記録すると共に、調査項目を調査
した結果判明した障害原因及び一次的な対応手順を実行
した処置結果を障害履歴データ記録テーブルに記録する
障害履歴管理処理手段と、帳票表示要求を受付け、障害
履歴データ記録テーブルに記録されているデータを、障
害履歴管理用ＣＲＴ装置に帳票表示するとともに、障害
履歴管理用プリンタ装置に帳票出力を行う障害履歴管理
入出力処理手段とを備えたことを特徴とする。According to a fifth aspect of the present invention, there is provided a monitoring and diagnosing apparatus for a fault condition of a computer system,
And the function affected by the failure at the failure location is recorded in the failure history data recording table, and the cause of failure found as a result of investigating the investigation items and the action result of executing the primary countermeasure procedure are recorded in the failure history data recording table. The fault history management processing means for receiving the form display request, the data recorded in the fault history data recording table is displayed on the fault history management CRT device, and the form is output to the fault history management printer device. And a fault history management input / output processing means.

【００１２】[0012]

【作用】本発明の請求項１の係る計算機システムの障害
状態の監視診断装置は、計算機システムの障害信号を障
害信号処理手段を介して入力し障害箇所検出処理手段が
障害箇所検出データベースを基に障害箇所のモジュール
を検出し、その検出結果を障害状態保存テーブルに保存
し、障害出力処理手段がその障害状態保存テーブルを用
いて、ＣＲＴ表示装置上に障害箇所をグラフィック表示
する。According to a first aspect of the present invention, there is provided a fault diagnosis and diagnosis apparatus for a computer system, wherein a fault signal of the computer system is inputted through a fault signal processing means, and the fault point detection processing means is based on a fault point detection database. The module at the fault location is detected, the detection result is stored in the fault state storage table, and the fault output processing means uses the fault state storage table to graphically display the fault location on the CRT display device.

【００１３】このように、計算機システムの障害発生時
に、障害箇所をＣＲＴ表示装置によりグラフィック表示
した計算機システムのハードウェア構成図上に表示する
ことが可能になり障害発生箇所を視覚的に提示し、運転
員及び保守員が障害発生箇所を正確に認識することが可
能になる。As described above, when a failure occurs in the computer system, the failure point can be displayed on the hardware configuration diagram of the computer system in which the CRT display device graphically displays, and the failure point can be visually presented. It enables the operator and maintenance personnel to accurately recognize the location of the failure.

【００１４】本発明の請求項２の係る計算機システムの
障害状態の監視診断装置は、障害影響箇所検出処理手段
が、障害影響箇所検出データベースにより障害による影
響箇所を検出し、その検出結果を障害影響状態保存テー
ブルに保存し、障害出力処理手段が障害影響状態保存テ
ーブルを用いて、ＣＲＴ表示装置上で障害影響箇所をグ
ラフィック表示する。According to a second aspect of the present invention, in the fault diagnosis and diagnosis apparatus for a computer system, the fault-affected-portion detection processing means detects a fault-affected spot by the fault-affected-portion detection database, and uses the detection result as the fault influence. The failure output processing means uses the failure saving status saving table to graphically display the failure affected area on the CRT display device.

【００１５】これによって、計算機システムの障害発生
時の影響箇所をＣＲＴ表示装置によりグラフィック表示
した計算機ハードウェア構成図上に提示することで障害
影響箇所を視覚的に提示し、運転員及び保守員に障害影
響箇所を正確に認識させること。Thus, the affected part when the failure occurs in the computer system is presented on the computer hardware configuration diagram graphically displayed by the CRT display device so that the affected area can be visually presented to the operator and the maintenance staff. Accurately identify the affected area.

【００１６】本発明の請求項３の係る計算機システムの
障害状態の監視診断装置は、障害の発生による影響を計
算機システムの機能単位で検出し、表示するものであ
る。According to a third aspect of the present invention, a fault diagnosis and diagnosis device for a computer system detects and displays the influence of the occurrence of a fault for each functional unit of the computer system.

【００１７】これによって、運転員及び保守員に計算機
システム機能障害状態を正確に認識させ、障害状況の影
響度合い検出を支援する。In this way, the operator and maintenance personnel are made to accurately recognize the computer system functional failure state, and the detection of the degree of influence of the failure state is assisted.

【００１８】本発明の請求項４の係る計算機システムの
障害状態の監視診断装置は、一次対応手順検出処理手段
において一次対応手順データベースにより検出した一次
対応手順を用い、障害出力処理手段がＣＲＴ表示装置で
一次対応手順のメッセージをＣＲＴ表示装置に及びプリ
ンタ装置に出力処理を行う。According to another aspect of the present invention, there is provided a computer system fault condition monitoring and diagnosing apparatus which uses the primary handling procedure detected by the primary handling procedure database in the primary handling procedure detecting processing means, and the fault output processing means uses the CRT display device. Then, the message of the primary handling procedure is output to the CRT display device and the printer device.

【００１９】これによって、計算機システムの障害発生
時に計算機システムの障害発生手段位の障害発生要因調
査のための調査事項及び計算機システム機能障害に対し
ての対応事項等一次対応手順を運転員及び保守員に提示
し、速やかな障害復旧対応を支援することが可能にな
る。Thus, when a failure occurs in the computer system, the check items for investigating the cause of failure at the failure occurrence means of the computer system and the first-order handling procedures such as the countermeasures against the functional failure of the computer system are provided to the operator and the maintenance staff. It will be possible to support prompt recovery from disasters.

【００２０】本発明の請求項５の係る計算機システムの
障害状態の監視診断装置は、障害履歴管理処理手段が、
障害箇所検出処理手段、機能障害検出処理手段、及び一
次対応手順検出処理手段それぞれで検出処理した結果、
及び障害発生日時を障害履歴データ記録テーブルに記録
する。また、障害履歴管理入出力処理手段を介し障害履
歴管理入出力処理手段から障害履歴データの入力処理を
行い、障害履歴管理用プリンタ装置により障害履歴管理
データの帳票の出力を行う。According to a fifth aspect of the present invention, in the fault diagnosis and diagnosis device for a computer system, the fault history management processing means includes:
As a result of detection processing by the fault location detection processing means, the functional fault detection processing means, and the primary response procedure detection processing means,
And the date and time of failure occurrence are recorded in the failure history data recording table. Further, the failure history management input / output processing means performs failure history data input processing from the failure history management input / output processing means, and the failure history management printer device outputs a form of the failure history management data.

【００２１】これによって、計算機システムの障害発生
時に計算機システムの障害発生日時、障害発生部分、障
害発生時の計算機システムのデータ、障害原因、対策処
置等を記録した計算機システムの障害の履歴管理データ
により、計算機システム運転員及び保守員に障害発生時
の障害要因の推定、対応操作の技術的な支援データを蓄
積・提供することが可能になる。With this, when a failure occurs in the computer system, the failure date and time of the computer system, the failure occurrence portion, the data of the computer system at the time of the failure, the cause of the failure, the countermeasure history, etc. , It becomes possible to estimate the cause of failure when a failure occurs and accumulate and provide technical support data for the corresponding operation to the computer system operator and maintenance personnel.

【００２２】[0022]

【実施例】以下、本発明の請求項１に係わる計算機シス
テムの障害状態の監視診断システムの第１実施例を図１
のブロック図を参照して説明する。なお、従来の技術で
述べたように、ここでの計算機システムは１系、２系を
備えた２重化された計算機演算機装置ａ，ｂを仮定す
る。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A first embodiment of a fault diagnosis and diagnosis system for a computer system according to claim 1 of the present invention will be described below with reference to FIG.
Will be described with reference to the block diagram of FIG. As described in the related art, the computer system here is assumed to be the duplicated computer arithmetic units a and b having the first system and the second system.

【００２３】計算機システムＡで発生した障害信号は計
算機演算処理装置ａ，ｂの障害信号処理部１ａ，１ｂを
介して、障害箇所検出処理部２ａ，２ｂに入力される。
障害箇所検出処理部２ａ，２ｂは、入力した障害信号か
ら、その障害信号とその障害箇所のモジュールとを対応
づける障害箇所検出データベースＤ２を基に障害が発生
したモジュールを検出し、検出結果を障害状態保存テー
ブルＴ２に保存する。The fault signal generated in the computer system A is input to the fault location detection processing units 2a and 2b via the fault signal processing units 1a and 1b of the computer arithmetic processing units a and b.
The fault location detection processing units 2a and 2b detect a faulty module from the input fault signal based on a fault location detection database D2 that associates the fault signal with the module at the fault location, and faults the detection result. The state is stored in the state storage table T2.

【００２４】その検出結果は障害出力処理部３ａ，３ｂ
に入力され、障害出力処理部３ａ，３ｂは、計算機シス
テムＡの構成図グラフィックデータベースＧＤからシス
テム構成図を生成し、ＣＲＴ表示装置２１にグラフィッ
ク表示すると共に、その検出結果を出力する。The detection result is the fault output processing units 3a and 3b.
The fault output processing units 3a and 3b generate a system configuration diagram from the configuration diagram graphic database GD of the computer system A, display it graphically on the CRT display device 21, and output the detection result.

【００２５】障害箇所のモジュールは、ＣＲＴ表示装置
２１上でグラフィックシンボルで色替え及び点滅処理
等、強調して表示される。The module at the fault location is highlighted on the CRT display device 21 by a graphic symbol such as color changing and blinking.

【００２６】ここでＣＲＴ表示装置２１上のグラフィッ
ク画面での障害箇所の表示の１例を図２に示す。FIG. 2 shows an example of the display of the fault location on the graphic screen on the CRT display device 21.

【００２７】図２において、１系計算機演算処理装置ａ
内のＣＰＵ２０ａ、二重化されているモジュール１−１
ａ〜１−５ａ、装置からなるモジュールｓ４，ｓ６、そ
れらに対応する２系計算機演算処理装置ｂ内のＣＰＵ２
０ｂ、２重化されているモジュール１−１ｂ〜１−５
ｂ、装置モジュールｓ５，ｓ６、及び２重化されていな
い１系計算機演算処理装置ａと２系計算機演算処理装置
ｂに共通な共有化装置モジュールｓ１〜ｓ３の接続関係
が示されており、ここではモジュール１−１ａとモジュ
ール１−５ｂとに障害が発生していることを示してい
る。In FIG. 2, the system 1 arithmetic processing unit a
CPU 20a in the module, duplicated module 1-1
a to 1-5a, modules s4 and s6 composed of devices, and a CPU 2 in the 2-system computer arithmetic processing device b corresponding to them
0b and duplicated modules 1-1b to 1-5
b, the device modules s5 and s6, and the connection relation of the shared device modules s1 to s3 common to the 1-system computer arithmetic processing device a and the 2-system computer arithmetic processing device b which are not duplicated, are shown. Indicates that a failure has occurred in the module 1-1a and the module 1-5b.

【００２８】ここで障害箇所検出処理部２ａ，２ｂによ
る検出処理の動作を図３、図４、及び図５を基に説明す
る。Here, the operation of the detection processing by the fault location detection processing units 2a and 2b will be described with reference to FIGS. 3, 4 and 5.

【００２９】図３は障害箇所検出処理部２ａ，２ｂによ
る検出処理の動作を示すフローチャートであり、図４は
障害箇所検出データベースＤ２を示す図表であり、図５
は障害状態保存テーブルＴ２を示す図表である。FIG. 3 is a flow chart showing the operation of the detection processing by the fault location detection processing units 2a and 2b, FIG. 4 is a diagram showing the fault location detection database D2, and FIG.
Is a chart showing a failure state storage table T2.

【００３０】図４において、計算機システムを構成する
各モジュールには予め一意のモジュール番号（ｊ＝１〜
ｎ）が割りふられており、また各障害信号にも一意の番
号（１〜ｍ）が割り当てられている。障害箇所検出デー
タベースＤ２には、障害信号種別と各モジュールとの関
係が記述されており、表の中で１が立っているモジュー
ルは障害状態であることを示し、０が立っているモジュ
ールは正常状態であることを示している。In FIG. 4, each module constituting the computer system has a unique module number (j = 1 to 1) in advance.
n) is assigned, and a unique number (1 to m) is also assigned to each fault signal. In the fault location detection database D2, the relation between the fault signal type and each module is described. In the table, the module with 1 stands for a fault state, and the module with 0 stands for normal. It shows that it is in a state.

【００３１】図５において、障害状態保存テーブルＴ２
は現在の各モジュール毎の障害状態を示しており、もし
障害状態ならば１が、もし障害から復帰した状態ならば
０が立てられている。In FIG. 5, a failure state storage table T2
Indicates the current failure status of each module. If the failure status is 1, 1 is set, and if the status is restored from the failure, 0 is set.

【００３２】図３において、まず障害箇所検出処理部２
ａ，２ｂは、障害信号処理部１ａ，１ｂを介して障害信
号を入力し、障害信号番号変数ｉに代入する。（ステッ
プ３１，３２）。次にモジュール番号変数ｊを初期設定
し（ステップ３３）、図４に示す障害箇所検出データベ
ースＤ２の検索を開始する（ステップ３４）。まず表の
中で、最初のモジュール番号とその障害信号番号に対応
する値が１が立っているか否か判定する（ステップ３
５）。もし、１が立っていればそのモジュールは障害し
ていると判定し、図５に示す障害状態保存テーブルＴ２
のそのモジュール欄に障害状態であることを示す１を代
入する（ステップ３６）。もし、０が立っていれば，そ
のモジュールは障害から復帰していると判定し、図５に
示す障害状態保存テーブルＴ２のそのモジュール欄に復
帰状態であることを示す０を代入する（ステップ３７，
３８）。以後これらの処理を全てのモジュール分だけ繰
り返す（ステップ３９，３４）。In FIG. 3, first, a fault location detection processing unit 2
The fault signals a and 2b receive the fault signals via the fault signal processing units 1a and 1b and substitute them into the fault signal number variable i. (Steps 31, 32). Next, the module number variable j is initialized (step 33), and the search of the fault location detection database D2 shown in FIG. 4 is started (step 34). First, in the table, it is determined whether or not the value corresponding to the first module number and its fault signal number is 1 (step 3).
5). If 1 is set, it is determined that the module has a failure, and the failure state storage table T2 shown in FIG.
Substituting 1 into the module column of 1 to indicate a failure state (step 36). If 0 is set, it is determined that the module has recovered from the failure, and 0 indicating the recovery status is substituted into the module column of the failure status storage table T2 shown in FIG. 5 (step 37). ，
38). After that, these processes are repeated for all modules (steps 39 and 34).

【００３３】この障害状態保存テーブルＴ２の設定状態
を、障害出力処理部３ａ，３ｂが各モジュール毎に検出
処理して、図２に示すような各モジュール単位のグラフ
ィカルな構成図上での障害モジュールのシンボルの色替
え及び点滅処理を行う。The fault output processing units 3a and 3b detect the setting state of the fault state storage table T2 for each module, and the fault module on the graphical configuration diagram of each module as shown in FIG. The color change and blinking of the symbol are performed.

【００３４】これによって、計算機システムの障害発生
時に、障害箇所をＣＲＴ表示装置によりグラフィック表
示した計算機システムのハードウェア構成図上に表示す
ることが可能になり障害発生箇所を視覚的に提示し、運
転員及び保守員が障害発生箇所を正確に認識することが
可能になる。Thus, when a failure occurs in the computer system, the failure point can be displayed on the hardware configuration diagram of the computer system in which the CRT display device graphically displays, and the failure point can be visually presented and operated. It becomes possible for the maintenance staff and the maintenance staff to accurately recognize the location of the failure.

【００３５】本発明の請求項２に係わる計算機システム
の障害状態の監視診断システムの第２実施例を図６のブ
ロック図を参照して説明する。A second embodiment of the fault diagnosis and diagnosis system of the computer system according to the second aspect of the present invention will be described with reference to the block diagram of FIG.

【００３６】第２実施例は、障害したモジュールを検出
・表示するだけでなく、例えばあるモジュールの障害に
より他のモジュールが読みとり動作ができなくなる等、
あるモジュールの障害により影響を受ける他のモジュー
ルを検出・表示するものである。The second embodiment not only detects and displays the faulty module, but, for example, the faulty operation of one module renders another module unable to read.
It detects and displays other modules that are affected by the failure of one module.

【００３７】図示するように、障害箇所検出処理部２
ａ，２ｂにより検出された障害モジュールを入力し、そ
の障害モジュールをキーにして、その障害モジュールと
その障害の影響を受けるモジュールとの関係を記述して
いる障害影響箇所検出データベースＤ４を基に影響を受
けるモジュールを検出する障害影響箇所検出処理部４
ａ，４ｂと、各モジュールが影響を受けるか否かを保存
している障害影響状態保存テーブルＴ４とが新たに備え
られている。As shown in the figure, the fault location detection processing unit 2
Input the faulty module detected by a and 2b, and use the faulty module as a key to influence the faulty affected part detection database D4 that describes the relationship between the faulty module and the module affected by the fault. Failure influence point detection processing unit 4 for detecting a module to be received
a and 4b and a failure influence state storage table T4 that stores whether or not each module is affected are newly provided.

【００３８】ＣＲＴ表示装置２１上でのグラフィッフィ
カルな表示画面の例を図７に示す。An example of a graphic display screen on the CRT display device 21 is shown in FIG.

【００３９】図７において、モジュール１−１ａ、１−
７ａは障害箇所を示し、装置ｓ７はモジュール１−７ａ
の障害により影響を受けていることを示している。図中
では、各モジュール間を接続するラインの影響範囲も示
しており、ラインＬ２０，Ｌ２５，Ｌ２６がモジュール
１−７ｂの障害による影響を受けることを示している。In FIG. 7, modules 1-1a, 1-
7a indicates a failure point, and the device s7 is a module 1-7a.
Indicates that they are being affected by the disability. In the figure, the influence range of the line connecting between the modules is also shown, and it is shown that the lines L20, L25, L26 are affected by the failure of the module 1-7b.

【００４０】以下、障害影響箇所検出処理部４ａ，４ｂ
の動作を図８〜１１を基に説明する。Hereinafter, the fault-affected-portion detection processing units 4a and 4b will be described.
The operation will be described with reference to FIGS.

【００４１】図８は障害影響箇所検出処理部４ａ，４ｂ
の動作を示すフローチャートであり、図９は障害影響箇
所検出データベースＤ４を示す図表であり、図１０は障
害影響状態保存テーブルＴ４を示す図表であり、図１１
は図８で説明する共有化モジュールへの影響を検出する
処理を示す図表である。FIG. 8 shows the failure-affected-portion detection processing units 4a and 4b.
11 is a flowchart showing the operation of FIG. 9, FIG. 9 is a chart showing the fault-affected portion detection database D4, FIG. 10 is a chart showing the fault-affected state storage table T4, and FIG.
9 is a chart showing a process of detecting an influence on the sharing module described in FIG. 8.

【００４２】図９において、各モジュールには予め一意
の障害モジュール番号（ｉ＝１〜ｎ）がおり、また各影
響モジュールにも一意の番号（１〜ｎ）が割り当てられ
ている。障害影響箇所検出データベースＤ４には、障害
モジュール番号とそれによって影響を受ける影響モジュ
ールとの関係が記述されており、表の中で１が立ってい
るモジュールはその障害モジュールにより影響を受ける
ものであることを示しており、０が立っているモジュー
ルはその障害モジュールにより影響を受けないものであ
ることを示している。In FIG. 9, each module has a unique failure module number (i = 1 to n) in advance, and each affected module is also assigned a unique number (1 to n). The relationship between the failure module number and the affected module affected by the failure module number is described in the failure influence point detection database D4, and the module for which 1 is set in the table is affected by the failure module. It means that a module set to 0 is not affected by the faulty module.

【００４３】図１０において、障害影響状態保存テーブ
ルＴ４は、各モジュールがある障害により現在の各モジ
ュールが影響を受けるものなのか否かを、もし影響を受
けてるならば１を立てることにより、もしその影響から
復帰した状態ならば０を立てることにより示している。In FIG. 10, the failure influence state storage table T4 indicates whether or not each module is affected by a certain failure, and if it is affected, the value 1 is set. If the state is restored from the influence, it is indicated by setting 0.

【００４４】図８において、まず障害影響箇所検出処理
部４ａ，４ｂは、障害信号処理部２ａ，２ｂを介し障害
モジュール番号を入力し、障害モジュールの番号を示す
障害モジュール番号変数ｉを初期設定する（ステップ８
１）。以降、順番に障害モジュール番号変数ｉを増分し
ていく（ステップ８２）。また、障害の影響を受ける影
響モジュール変数ｊを初期設定する（ステップ８３）。
以降、順番に影響モジュール番号変数ｊを増分していく
（ステップ８４）。次に、障害箇所検出処理部２ａ，２
ｂが図５に示す障害状態保存テーブルＴ２から判定処理
した各モジュールの障害／復帰状態を検出し（ステップ
８５）、障害影響箇所検出データベースＤ４から影響を
受けるモジュールを選び出し、障害の影響が有ると判断
した時には、共有化モジュール判定処理に移る（ステッ
プ８８）。In FIG. 8, the failure-affected-portion detection processing units 4a and 4b first input the failure module number via the failure signal processing units 2a and 2b, and initialize the failure module number variable i indicating the failure module number. (Step 8
1). Thereafter, the failure module number variable i is sequentially incremented (step 82). Further, the influence module variable j affected by the failure is initialized (step 83).
Thereafter, the affected module number variable j is incremented in order (step 84). Next, the failure location detection processing units 2a, 2
b detects the failure / recovery status of each module that has undergone the determination processing from the failure status storage table T2 shown in FIG. 5 (step 85), selects the affected module from the failure affected portion detection database D4, and determines that there is an effect of the failure. When the determination is made, the sharing module determination processing is started (step 88).

【００４５】ここで、共有化モジュール判定処理の動作
を図１１の図表を基に説明する。Here, the operation of the shared module determination process will be described with reference to the chart of FIG.

【００４６】この図表は、２重化されている共有化モジ
ュール毎に１系、２系の各モジュールの障害の状態が記
述されており、両系共に障害である時にのみ、そのモジ
ュール対が障害の影響が有ると判断する。もし影響を受
けるモジュールが２重化されているものならば、対にな
るモジュールもその障害の影響を受けるのかを判定す
る。モジュール対に障害の影響が有ると判断した時に
は、図１０に示す障害影響状態保存テーブルＴ４の該当
モジュールに１を設定することによって、影響有りとす
る（ステップ８９）。This chart describes the failure status of each module of the 1-system and 2-system for each shared module that is duplicated. Only when both systems have a failure, the module pair fails. It is judged that there is an influence of. If the affected module is duplicated, it is determined whether the paired module is also affected by the failure. When it is determined that the module pair is affected by the failure, the module is determined to be affected by setting 1 to the corresponding module in the failure effect state storage table T4 shown in FIG. 10 (step 89).

【００４７】他方、もし対になる相手側のモジュールが
影響を受けないのであれば、障害影響状態保存テーブル
Ｔ４の該当モジュールに０を設定することによって、影
響無しとする（ステップ９０）。On the other hand, if the counterpart module to be paired with is not affected, the affected module is set to 0 in the failure influence state storage table T4 to eliminate the influence (step 90).

【００４８】また、ステップ８６の動作の結果、影響を
受けないと判断した時も障害影響状態保存テーブルＴ４
の該当モジュールに０を設定することによって、影響無
しとする（ステップ８７，９０）。Further, as a result of the operation of step 86, the failure influence state storage table T4 is also determined when it is determined that the influence is not exerted.
By setting 0 in the corresponding module of No., there is no influence (steps 87, 90).

【００４９】以上の動作を障害モジュール及び影響モジ
ュール分だけ繰り返す（ステップ９１，９２）。The above operation is repeated for the faulty module and the influencing module (steps 91 and 92).

【００５０】障害出力処理部３ａ，３ｂは、第１実施例
で説明したように障害検出箇所をグラフィカルに表示す
ると共に、影響を受けるモジュールを障害影響状態保存
テーブルＴ４を基に図７に示すようにグラフィカルに表
示する。The fault output processing units 3a and 3b graphically display the fault detection points as described in the first embodiment, and the affected modules are shown in FIG. 7 based on the fault influence state storage table T4. Graphically.

【００５１】更に、障害影響箇所検出処理部４ａ，４ｂ
は、図１２の図表に示すように障害モジュール及びその
障害モジュールにより影響を受けるモジュールと、それ
らの影響をうけるラインとの関係を示す影響ライン検出
データベースＤＬ４により障害の影響を受けるラインを
選び出す。Further, the fault-affected-portion detection processing units 4a, 4b.
Selects the line affected by the failure from the affected line detection database DL4 indicating the relationship between the faulty module, the module affected by the faulty module, and the line affected by the faulty module, as shown in the chart of FIG.

【００５２】図１２に示すように影響ライン検出データ
ベースＤＬ４は、障害モジュール及びその障害モジュー
ルにより影響を受けるモジュール番号と各ラインの関係
を示している。例えば、モジュール番号（２）であるモ
ジュール１−１ａが障害中であるか又は他モジュールの
故障の影響をうければ、１が立っている。（この場合、
モジュール１−１ａの障害によりラインＬ１及びライン
Ｌ２が影響を受ける）。As shown in FIG. 12, the affected line detection database DL4 shows the relationship between the faulty module, the module number affected by the faulty module, and each line. For example, 1 is set if the module 1-1a having the module number (2) is in failure or is affected by the failure of another module. (in this case,
The failure of the module 1-1a affects the lines L1 and L2).

【００５３】ここで、障害影響箇所検出処理部４ａ，４
ｂが、影響を受けるラインを検出し、それをＣＲＴ表示
装置２１に表示する動作を図１３のフローチャートを基
に説明する。Here, the failure-affected-portion detection processing units 4a, 4
An operation in which the line b detects the affected line and displays it on the CRT display device 21 will be described with reference to the flowchart of FIG.

【００５４】まず障害モジュール及びその障害モジュー
ルにより影響を受けるモジュール番号を示す変数ｉを初
期設定し（ステップ１２１）、以後変数ｉを増分して以
下の判定を行う（ステップ１２２）。他方、各ラインを
示すライン番号変数ｊを初期設定（ステップ１２３）、
ライン番号変数ｊを増分しながら以下の処理を行う（ス
テップ１２４）。次に影響ライン検出データベースＤＬ
４から影響を受けるラインを検出し（ステップ１２
５）、もし影響を受けるラインであれば、その変数ｉを
キーにして前述した障害状態保存テーブルＴ２をひき
（ステップ１２６）、もしその変数ｉのモジュールが障
害中であるならば、図１４に示す影響ライン検出状態保
存テーブルＴＬ４のそのラインに対応する欄に障害によ
り影響を受けるラインあることを示す１を代入する（ス
テップ１２７）。他方、障害状態保存テーブルＴ２をひ
き（ステップ１２６）、もしその変数ｉのモジュールが
障害中でないならば、変数ｉをキーにして前述した障害
影響状態保存テーブルＴ４をひき（ステップ１２８）、
もしその変数ｉのモジュールが影響をうけるモジュール
を示すものであるならば、図１４に示す影響ライン検出
状態保存テーブルＴＬ４のそのラインに対応する欄に障
害により影響を受けるラインあることを示す１を代入す
る（ステップ１２９）。そのライン変数ｉに対応するモ
ジュールが障害もなく、かつ他のモジュールの障害の影
響をうけない場合は、影響ライン検出状態保存テーブル
ＴＬ４のそのラインに対応する欄に障害により影響を受
けるラインでないことを示す０を代入する（ステップ１
２６，１２７，１２８）。First, a variable i indicating a faulty module and a module number affected by the faulty module is initialized (step 121), and then the variable i is incremented to make the following determination (step 122). On the other hand, the line number variable j indicating each line is initialized (step 123),
The following processing is performed while incrementing the line number variable j (step 124). Next, influence line detection database DL
4 to detect the affected line (step 12
5) If it is an affected line, the variable i is used as a key to open the failure state storage table T2 described above (step 126). If the module of the variable i is in failure, then FIG. Substitute 1 indicating that there is a line affected by the failure in the column corresponding to that line in the affected line detection state storage table TL4 shown (step 127). On the other hand, the failure state storage table T2 is pulled (step 126), and if the module of the variable i is not in failure, the variable i is used as a key to pull the failure influence state storage table T4 (step 128),
If the module of the variable i indicates the affected module, the column 1 corresponding to that line in the affected line detection state storage table TL4 shown in FIG. Substitute (step 129). If the module corresponding to the line variable i has no failure and is not affected by the failure of another module, the column corresponding to that line in the affected line detection state storage table TL4 is not a line affected by the failure. Is assigned to 0 (step 1
26, 127, 128).

【００５５】以上全てのモジュール及びラインについ
て、処理を繰り返す（ステップ１３１，１３２）。The above processing is repeated for all the modules and lines (steps 131 and 132).

【００５６】障害出力処理部３ａ，３ｂは、この影響ラ
イン検出状態保存テーブルＴＬ４に基き、障害の影響を
受けるラインを色を替えて，図７に示すようにグラフィ
ックに表示する。The fault output processing units 3a and 3b change the color of the line affected by the fault based on the affected line detection state storage table TL4 and display it graphically as shown in FIG.

【００５７】なお、参考のため図１５に各ライン番号を
表示した画面を、図１６に各モジュール番号を表示した
画面をそれぞれを示す。For reference, FIG. 15 shows a screen displaying each line number, and FIG. 16 shows a screen displaying each module number.

【００５８】これによって、障害発生時の影響箇所をＣ
ＲＴ表示装置によりグラフィックに障害影響箇所を、運
転員及び保守員に障害影響箇所を正確に認識させること
が可能になる。As a result, the point of influence at the time of failure occurrence is C
The RT display device makes it possible to make the operator and maintenance personnel recognize the failure-affected portion in a graphic form accurately.

【００５９】本発明の請求項３に係わる計算機システム
の障害状態の監視診断システムの第３実施例を図１７の
ブロック図を参照して説明する。A third embodiment of the fault diagnosis and diagnosis system for a computer system according to claim 3 of the present invention will be described with reference to the block diagram of FIG.

【００６０】第３実施例は障害の発生による影響を計算
機システムの機能単位で検出し、表示するものである。The third embodiment detects and displays the influence of the occurrence of a failure for each functional unit of the computer system.

【００６１】第３実施例は、第２実施例に加えて、障害
モジュール又は他のモジュールで発生した障害の影響を
受けるモジュールにより完全に使用不可能となる計算機
システムの機能を記述している完全機能使用不可検出デ
ータベースＤ５と、障害モジュール又は他のモジュール
で発生した障害の影響を受けるモジュールにより部分的
に使用不可能となる計算機システムの機能を記述してい
る部分機能使用不可検出データベースＤ６と、障害箇所
検出処理部２ａ，２ｂによる検出箇所、障害影響箇所検
出処理部４ａ，４ｂによる障害影響箇所を基に、完全機
能使用不可検出データベースＤ５及び部分機能使用不可
検出データベースＤ６から、障害により完全に使用不可
能となる計算機システムの機能及び障害により部分的に
使用不可能となる計算機システムの機能を検出し、その
結果を完全使用不可状態保存テーブルＴ５及び部分使用
不可状態保存テーブルＴ６にそれぞれ保存する障害機能
検出処理部５ａ，５ｂが新たに設けられている。The third embodiment, in addition to the second embodiment, describes a function of the computer system which is completely unusable by a failure module or a module affected by a failure occurred in another module. A function unavailability detection database D5 and a partial function unavailability detection database D6 that describes the functions of the computer system that are partially unusable due to the failure module or a module affected by the failure that occurred in another module, Based on the detection points by the failure point detection processing units 2a and 2b and the failure influence points by the failure affected point detection processing sections 4a and 4b, the complete function unusable detection database D5 and the partial function unusable detection database D6 are used to completely detect the failure. It becomes partially unusable due to the function and failure of the computer system. Detecting the capabilities of calculation system, failure function detection processing unit 5a which stores each of which results in complete disabled state storage table T5 and partial disabled state storage table T6, 5b are newly provided.

【００６２】ここで、障害機能検出処理部５ａ，５ｂが
障害により完全に使用不可能となる計算機システムの機
能を検出処理する動作を図１８のフローチャートを基に
説明する。Here, the operation of the faulty function detection processing units 5a and 5b for detecting the function of the computer system which is completely unusable due to a fault will be described with reference to the flowchart of FIG.

【００６３】まず、障害モジュール番号及び影響モジュ
ール番号を示す変数ｉを初期設定し（ステップ２１
１）、変数ｉを増分しながら以下の処理を繰り返す（ス
テップ２１２）。次に障害モジュール及び影響モジュー
ルにより完全に使用不可能となる程の影響を受ける計算
機システムの機能を示す変数ｊを初期設定し（ステップ
２１３）、変数ｊを増分しながら以下の処理を繰り返す
（ステップ２１４）。次に、各変数ｉ，ｊを基に図１９
に示す完全機能使用不可検出データベースＤ５をひき、
変数ｊに対応する機能が変数ｉに対応するモジュールの
障害の影響を受けるか否かを調べる。もし、影響を受け
る場合であるならば、障害状態保存テーブルＴ２で変数
ｉに対応するモジュールが現在障害中かを判断し（ステ
ップ２１６）、影響を受けない場合であるならば、障害
影響状態保存テーブルＴ４によって変数ｉに対応するモ
ジュールが他のモジュールの障害によって影響をうける
か否かを判断し（ステップ２１７）、変数ｉに対応する
モジュールが現在障害中か又は他のモジュールの障害に
よって影響をうける場合には、その機能がその機能を実
現しているモジュールの多重化によってカバーされるか
否かの判断を行う（機能多重化影響判定処理、ステップ
２１８）。機能多重化影響判定処理は、図２０の図表に
示すように各機能毎にその機能を実現しているモジュー
ルに状態が記述されており、その機能を実現している全
てのモジュールが影響有の場合に、その機能が完全に使
用ができないと判断する。もしその機能が完全に使用が
できないと判断されたならば、図２１に示す完全使用不
可状態保存テーブルＴ５のその機能に対応する欄に完全
に使用が不可能になっていることを示す１を立てる（ス
テップ２１９）。First, a variable i indicating a failure module number and an affected module number is initialized (step 21
1) The following processing is repeated while incrementing the variable i (step 212). Next, a variable j indicating the function of the computer system which is so affected as to be completely unusable by the failure module and the influence module is initialized (step 213), and the following processing is repeated while incrementing the variable j (step 213). 214). Next, based on the variables i and j, FIG.
Draw the full-function unusability detection database D5 shown in
It is checked whether the function corresponding to the variable j is affected by the failure of the module corresponding to the variable i. If it is affected, it is judged whether or not the module corresponding to the variable i in the failure state storage table T2 is currently in failure (step 216). If it is not affected, the failure affected state is saved. It is determined whether or not the module corresponding to the variable i is affected by the failure of another module according to the table T4 (step 217), and the module corresponding to the variable i is currently in the failure state or affected by the failure of another module. In the case of receiving the function, it is judged whether or not the function is covered by the multiplexing of the modules realizing the function (function multiplexing influence determining process, step 218). In the function multiplexing influence determination processing, the state is described in the module that realizes each function as shown in the chart of FIG. 20, and all the modules that realize the function have an influence. In that case, it is determined that the function cannot be completely used. If it is determined that the function cannot be completely used, the column 1 corresponding to the function in the completely unusable state storage table T5 shown in FIG. Set up (step 219).

【００６４】他方、ステップ２１７で影響無しとされた
場合又はステップ２１８でモジュールの多重化により完
全にその機能が影響を受けないと判断された場合には、
完全使用不可状態保存テーブルＴ５のその機能に対応す
る欄に障害から復帰していることを示す０を立てる（ス
テップ２２０）。On the other hand, when it is determined that there is no effect in step 217 or when it is determined in step 218 that the function is not completely affected by the module multiplexing,
In the column corresponding to the function of the completely unusable state storage table T5, 0 indicating that the failure is recovered is set (step 220).

【００６５】以降、各モジュール及び各機能毎に以上の
処理を繰り返す（ステップ２２１，２２２）。Thereafter, the above processing is repeated for each module and each function (steps 221 and 222).

【００６６】次に障害機能検出処理部５ａ，５ｂが障害
により部分的に使用不可能となる計算機システムの機能
を検出処理する動作を図２２のフローチャートを基に説
明する。Next, the operation of the faulty function detection processing units 5a and 5b for detecting the function of the computer system which becomes partially unusable due to a fault will be described with reference to the flowchart of FIG.

【００６７】まず、障害モジュール番号及び影響モジュ
ール番号を示す変数ｉを初期設定し（ステップ２５
１）、変数ｉを増分しながら以下の処理を繰り返す（ス
テップ２５２）。次に障害モジュール及び影響モジュー
ルにより部分的に使用不可能となる程の影響を受ける計
算機システムの機能を示す変数ｊを初期設定し（ステッ
プ２５３）、変数ｊを増分しながら以下の処理を繰り返
す（ステップ２５４）。次に、各変数ｉ，ｊを基に図２
３に示す部分機能使用不可検出データベースＤ６をひ
き、変数ｊに対応する機能が変数ｉに対応するモジュー
ルの障害の影響を受けるか否かを調べる。もし、影響を
受ける場合であるならば、障害状態保存テーブルＴ２で
変数ｉに対応するモジュールが現在障害中かを判断し
（ステップ２５６）、影響を受けない場合であるなら
ば、障害影響状態保存テーブルＴ４によって変数ｉに対
応するモジュールが他のモジュールの障害によって影響
をうけるか否かを判断し（ステップ２５７）、変数ｉに
対応するモジュールが現在障害中か又は他のモジュール
の障害によって影響をうける場合には、その機能がその
機能を実現しているモジュールの多重化によってカバー
されるか否かの判断を行う（機能多重化影響判定処理、
ステップ２５８）。機能多重化影響判定処理は、図２０
の図表に示すように各機能毎にその機能を実現している
モジュールに状態が記述されており、全てのその機能を
実現しているモジュールの一部が影響有の場合に、その
機能が部分的に使用できないと判断する。もしその機能
が部分的に使用できないと判断されたならば、図２４に
示す部分使用不可状態保存テーブルＴ６のその機能に対
応する欄に部分的に使用不可能になっていることを示す
１を立てる（ステップ２５９）。First, a variable i indicating a failure module number and an affected module number is initialized (step 25).
1) The following processing is repeated while incrementing the variable i (step 252). Next, the variable j indicating the function of the computer system that is affected by the failure module and the influence module to the extent that it cannot be used is initialized (step 253), and the following processing is repeated while incrementing the variable j ( Step 254). Next, based on the variables i and j, FIG.
The partial function unavailability detection database D6 shown in FIG. 3 is drawn to check whether the function corresponding to the variable j is affected by the failure of the module corresponding to the variable i. If it is affected, it is judged whether or not the module corresponding to the variable i in the failure state saving table T2 is currently in failure (step 256). If it is not affected, the failure affected state is saved. It is determined whether or not the module corresponding to the variable i is affected by the failure of the other module according to the table T4 (step 257), and the module corresponding to the variable i is currently in the failure state or affected by the failure of another module. In the case of receiving, it is judged whether or not the function is covered by the multiplexing of the modules realizing the function (function multiplexing influence determination processing,
Step 258). The function multiplexing influence determination process is shown in FIG.
As shown in the figure, the status is described for each function in the module that realizes that function, and if some of all the modules that realize that function are affected, that function is partially Determine that it cannot be used. If it is determined that the function cannot be partially used, the column 1 corresponding to the function of the partially unusable state storage table T6 shown in FIG. Set up (step 259).

【００６８】他方、ステップ２５７で影響無しとされた
場合又はステップ２５８でモジュールの多重化により部
分的にしかその機能が影響を受けないと判断された場合
には、完全使用不可状態保存テーブルＴ５のその機能に
対応する欄に障害から復帰していることを示す０を立て
る（ステップ２６０）。On the other hand, if it is determined that there is no effect in step 257 or if it is determined in step 258 that the function is only partially affected by the module multiplexing, the complete unusable state storage table T5 is stored. In the column corresponding to the function, 0 indicating that the failure has been recovered is set (step 260).

【００６９】以降、各モジュール及び各機能毎に以上の
処理を繰り返す（ステップ２６１，２６２）。Thereafter, the above processing is repeated for each module and each function (steps 261 and 262).

【００７０】障害出力処理部３ａ，３ｂは、完全使用不
可状態保存テーブルＴ５及び部分使用不可状態保存テー
ブルＴ６を基に、計算機システムのＣＲＴ表示装置２１
上に図２５に示すようなグラフィカルな表示や図２６に
示すような表示を行い、更にプリンタ装置２２に図２７
に示すような印刷を行う。The failure output processing units 3a and 3b use the CRT display device 21 of the computer system based on the completely unusable state storage table T5 and the partially unusable state storage table T6.
A graphical display as shown in FIG. 25 and a display as shown in FIG.
Print as shown in.

【００７１】これによって、障害発生時の計算機システ
ムの機能障害状態を提示することで、運転員及び保守員
に計算機システム機能障害状態を正確に認識させ、障害
状況の影響度合い検出を支援することが可能になる。In this way, by presenting the functional failure status of the computer system at the time of failure occurrence, it is possible for the operator and the maintenance staff to accurately recognize the functional failure status of the computer system and to assist the detection of the influence degree of the failure status. It will be possible.

【００７２】本発明の請求項４に係わる計算機システム
の障害状態の監視診断システムの第４実施例を図２８の
ブロック図を参照して説明する。A fourth embodiment of the fault diagnosis and diagnosis system of the computer system according to claim 4 of the present invention will be described with reference to the block diagram of FIG.

【００７３】図示するように、第４実施例は第３実施例
に加え、障害箇所検出処理部２ａ，２ｂで検出された障
害箇所、障害機能検出処理部５ａ，５ｂにより検出され
た機能、及びそれに対処するための一次手順を記述した
一次対応手順データベースＤ７と、障害箇所検出処理部
２ａ，２ｂで検出された障害箇所及び障害機能検出処理
部５ａ，５ｂで検出された機能を入力し、一次対応手順
データベースＤ７から一次対応手順の検出を行う一次対
応手順検出処理部６ａ，６ｂとが新たに備えらている。As shown in the figure, in addition to the third embodiment, the fourth embodiment has a fault location detected by the fault location detection processing units 2a and 2b, a function detected by the fault function detection processing units 5a and 5b, and The primary correspondence procedure database D7 describing the primary procedure for coping with it, the failure location detected by the failure location detection processing units 2a and 2b, and the function detected by the failure function detection processing units 5a and 5b are input, and the primary Primary correspondence procedure detection processing units 6a and 6b for detecting the primary correspondence procedure from the correspondence procedure database D7 are newly provided.

【００７４】ここで一次対応手順データベースＤ７の持
つ処理項目を下記に示す。（１）計算機障害箇所障害箇所モジュールに併せ、対象モジュールと収納筐体
番号を組合せたデータベースにより収納筐体番号を提示
処理する。（２）一次診断提示処理障害箇所モジュールに併せ、対象モジュールと確認内容
メッセージを組合せたデータベースにより障害箇所モジ
ュールの確認内容メッセージを提示処理する。（３）連絡先提示処理障害箇所モジュール、機能障害状況により連絡先の提示
処理する。The processing items of the primary handling procedure database D7 are shown below. (1) Computer failure location In addition to the failure location module, the storage enclosure number is presented and processed by the database that combines the target module and the storage enclosure number. (2) Primary diagnosis presentation processing In addition to the failure location module, the confirmation content message of the failure location module is presented and processed by the database that combines the target module and the confirmation content message. (3) Contact presenting process The contact presenting process is performed according to the fault location module and the functional fault condition.

【００７５】一次対応手順検出処理部６ａ，６ｂは、図
２９のフローチャート示すように障害箇所検出処理部２
ａ，２ｂで検出された障害箇所及び機能障害検出処理部
５ａ，５ｂで検出された機能を入力し（ステップ３１
１）、一次対応手順データベースＤ７を上述した処理項
目単位に検索する。検索した結果、対応する各出力処理
項目毎に、すなわち計算機障害箇所出力処理（ステップ
３１２）、一次診断提示処理（ステップ３１３）、及び
連絡先提示処理（ステップ３１４）を行う。それぞれの
処理結果は、障害出力処理部３ａ，３ｂを介し、例えば
図３０に示すようにＣＲＴ表示装置２１上に、そして図
３１に示すプリンタ装置２２上に表示される。As shown in the flow chart of FIG. 29, the primary handling procedure detection processing units 6a and 6b are provided with the fault location detection processing unit 2.
The failure location detected in a and 2b and the function detected in the functional failure detection processing units 5a and 5b are input (step 31
1), the primary handling procedure database D7 is searched in units of the above-mentioned processing items. As a result of the search, for each corresponding output process item, that is, the computer fault location output process (step 312), the primary diagnosis presentation process (step 313), and the contact address presentation process (step 314) are performed. The respective processing results are displayed on the CRT display device 21 as shown in FIG. 30 and on the printer device 22 shown in FIG. 31 via the fault output processing units 3a and 3b.

【００７６】これによって、障害発生時に計算機システ
ムの障害発生部位の障害発生要因調査のための調査事項
及び計算機システム機能障害に対しての対応事項等一次
対応手順を運転員及び保守員に提示し、速やかな障害復
旧対応を支援することが可能になる。In this way, when a failure occurs, the operator and the maintenance staff are presented with the first-order measures such as the investigation items for investigating the cause of failure of the failure occurrence part of the computer system and the measures to deal with the malfunction of the computer system. It will be possible to support prompt disaster recovery support.

【００７７】本発明の請求項５に係わる計算機システム
の障害状態の監視診断システムの第５実施例を図３２の
ブロック図を参照して説明する。A fifth embodiment of the fault diagnosis and diagnosis system of the computer system according to the fifth aspect of the present invention will be described with reference to the block diagram of FIG.

【００７８】図示するように、第５実施例は、第４実施
例に加え障害箇所検出処理部２ａ，２ｂで検出された障
害箇所及び障害機能検出処理部５ａ，５ｂで検出された
障害機能の履歴を記録する障害履歴データ記録テーブル
Ｔ８ａ，Ｔ８ｂと、障害履歴データ記録テーブルＴ８
ａ，Ｔ８ｂに記録されたデータの履歴管理を行う障害履
歴管理処理部７ａ，７ｂと、障害履歴データ記録テーブ
ルＴ８ａ，Ｔ８ｂと障害履歴管理用ＣＲＴ装置２４や障
害履歴管理用プリンタ装置２５との間のデータの入出力
を処理する障害履歴管理入出力処理部９ａ，９ｂとが新
たに備えられている。As shown in the figure, the fifth embodiment is different from the fourth embodiment in that the fault location detected by the fault location detection processing units 2a and 2b and the fault function detected by the fault function detection processing units 5a and 5b. Fault history data recording tables T8a and T8b for recording history, and fault history data recording table T8
Between the fault history management processing units 7a and 7b that perform history management of the data recorded in a and T8b, the fault history data recording tables T8a and T8b, the fault history management CRT device 24, and the fault history management printer device 25. Fault log management input / output processing units 9a and 9b for processing input / output of the data are newly provided.

【００７９】障害履歴管理処理部７ａ，７ｂは、障害履
歴データ記録テーブルＴ８ａ，Ｔ８ｂ上で。以下に示す
管理記録項目を管理する。（１）障害機器区分；障害信号により、ＣＰＵ、周辺機
器、入出力装置、その他に区分した障害機器区分を記録
する。（例えば重障害、軽障害、機器障害、検出器障害
等）。（２）障害発生日時；計算機システムの障害信号を入力
処理した時点での日時の記録を行う。（３）警報状態；計算機システムからの障害信号により
出力処理した、計算機障害発生時の検出処理されたデー
タを記録する。（４）障害発生／復帰時のメッセージ出力結果；計算機
システムからの障害信号により出力処理した、障害発生
時のメッセージ出力の記録を行う。（５）機能障害発生／復帰時のメッセージ出力結果；計
算機システムの障害影響箇所モジュール信号により検出
処理した、機能障害メッセージの記録を行う。（６）一次対応手順メッセージ出力結果；一次対応手順
検出処理部により検出処理した、一次対応手順メッセー
ジの記録を行う。The fault history management processing units 7a and 7b are on the fault history data recording tables T8a and T8b. The following management record items are managed. (1) Faulty device classification: The faulty device classification classified into the CPU, the peripheral device, the input / output device, and others is recorded according to the fault signal. (For example, serious obstacle, light obstacle, equipment obstacle, detector obstacle, etc.). (2) Date and time of fault occurrence: The date and time at the time when the fault signal of the computer system is input and processed are recorded. (3) Alarm status: The data processed for detection by the fault signal from the computer system and recorded when the computer fault occurs are recorded. (4) Message output result at the time of failure occurrence / recovery: The message output at the time of failure occurrence, which has been output by the failure signal from the computer system, is recorded. (5) Message output result at the time of occurrence / recovery of a functional failure: The functional failure message detected and processed by the failure-affected point module signal of the computer system is recorded. (6) Output result of primary handling procedure message: The primary handling procedure message detected by the primary handling procedure detection processing unit is recorded.

【００８０】以上にあげた各障害履歴データは、ＣＲＴ
表示装置２１、プリンタ装置２２、及び状態表示装置２
３への出力データであり、障害履歴管理処理部７ａ，７
ｂが入力データとして入力する。障害履歴管理処理部７
ａ，７ｂでは、入力したデータを障害履歴データ記録テ
ーブルＴ８ａ，Ｔ８ｂ用に変換処理を行い、障害履歴デ
ータ記録テーブルＴ８ａ，Ｔ８ｂに記録する。Each fault history data set forth above is recorded on the CRT.
Display device 21, printer device 22, and status display device 2
3 is output data to the fault history management processing units 7a, 7
b is input as input data. Fault history management processing unit 7
In a and 7b, the input data is converted into the fault history data recording tables T8a and T8b and recorded in the fault history data recording tables T8a and T8b.

【００８１】また、障害履歴管理処理部７ａ，７ｂは、
例えばキーボードである障害履歴管理用入出力装置２４
からデータを入力し、障害履歴管理入出力処理部９ａ，
９ｂを介して障害履歴データ記録テーブルＴ８ａ，Ｔ８
ｂに書込む。。The fault history management processing units 7a and 7b are
Fault history management input / output device 24 such as a keyboard
From the fault history management input / output processing unit 9a,
Fault history data recording tables T8a, T8 via 9b
Write to b. .

【００８２】障害履歴管理入出力処理部９ａ，９ｂは、
障害履歴データ記録テーブルＴ８ａ，Ｔ８ｂに記録され
ている各障害履歴管理データを、障害履歴管理用入出力
装置２４からの帳票表示要求により、障害履歴管理用Ｃ
ＲＴ装置２４に帳票表示するとともに、障害履歴管理用
プリンタ装置２５に帳票出力を行う。The fault history management input / output processing units 9a and 9b are
The failure history management data recorded in the failure history data recording tables T8a and T8b are transferred to the failure history management C by a form display request from the failure history management input / output device 24.
The form is displayed on the RT device 24 and the form is output to the fault history management printer device 25.

【００８３】以下に、障害履歴管理入出力処理部９ａ，
９ｂが受け付ける障害履歴管理機能を示す。（１）データ表示機能（２）データ帳票出力機能（３）データ入力機能（４）データ保存機能（５）データ一覧表示・出力機能ここで、図３３に障害履歴データの出力例を示す。Below, the fault history management input / output processing unit 9a,
9b shows a fault history management function accepted by 9b. (1) Data display function (2) Data form output function (3) Data input function (4) Data storage function (5) Data list display / output function Here, FIG. 33 shows an output example of failure history data.

【００８４】これによって、障害発生時に計算機システ
ムの障害発生日時、障害発生部位、障害発生時の計算機
システムのデータ、障害原因、対策処置等を記録した計
算機システムの障害の履歴管理データにより、計算機シ
ステム運転員及び保守員に障害発生時の障害要因の推
定、対応操作の技術的な支援データを蓄積・提供するこ
とが可能になる。Accordingly, when a failure occurs, the failure date and time of the computer system, the failure location, the data of the computer system at the time of the failure, the cause of the failure, the countermeasures and the like are recorded in the failure history management data of the computer system. It becomes possible to estimate the cause of failure when a failure occurs and to store and provide technical support data for the corresponding operation to the operator and maintenance staff.

【００８５】[0085]

【発明の効果】請求項１の発明によれば、計算機システ
ムに障害が発生した場合、計算機システムからの障害信
号を用いて障害発生箇所が計算機システム構成上どの箇
所であるかを視覚的にかつ正確に運転員及び計算機保守
員に情報提供する事が可能となる。According to the first aspect of the present invention, when a failure occurs in the computer system, a failure signal from the computer system is used to visually identify the failure occurrence location in the computer system configuration. It is possible to accurately provide information to operators and computer maintenance personnel.

【００８６】請求項２の発明によれば、計算機のハード
ウェア構成上で障害発生箇所が及ぼす障害の影響箇所を
視覚的にかつ正確に運転員及び計算機保守員に情報提供
することが可能となる。According to the second aspect of the present invention, it is possible to visually and accurately provide the operator and the computer maintenance staff with information on the affected part of the failure caused by the failed part on the hardware configuration of the computer. .

【００８７】請求項３の発明によれば、計算機のハード
ウェア構成上の障害箇所、影響箇所を情報提供すること
に加えて、計算機システムの機能障害状態を正確にかつ
速やかに運転員及び計算機保守員に情報提供することが
可能となる。According to the third aspect of the present invention, in addition to providing information on a fault location and an affected location on the hardware configuration of the computer, the functional fault condition of the computer system can be accurately and promptly maintained by the operator and the computer. It becomes possible to provide information to the staff.

【００８８】請求項４の発明によれば、計算機システム
の障害発生時に障害発生箇所の障害発生要因調査のため
の調査事項及び計算機システムの機能障害状態に対する
対応事項等、一次対応手順を運転員及び計算機保守員に
提示し、速やかな障害復旧対応を支援することが可能と
なる。According to the fourth aspect of the present invention, when a fault occurs in the computer system, a primary response procedure such as investigation items for investigating the cause of the fault at the location where the fault has occurred and items to be dealt with the functional fault state of the computer system It is possible to present it to the computer maintenance staff and support prompt failure recovery.

【００８９】請求項５の発明によれば、計算機システム
の障害発生時に障害発生日時、障害発生箇所、障害発生
時の計算機システムのデータ、障害原因、対策処置等の
記録が可能となる。また、計算機システムの障害の履歴
管理データの蓄積により、運転員及び計算機保守員に計
算機システムの障害発生時の障害要因の推定、対応操作
の技術的な支援データの提供も可能となる。According to the fifth aspect of the present invention, when a failure occurs in the computer system, the failure date and time, the location of the failure, the data of the computer system at the time of the failure, the cause of the failure, the countermeasure, etc. can be recorded. Further, by accumulating the history management data of the failure of the computer system, it becomes possible to estimate the failure factor at the time of the failure of the computer system and provide the technical support data for the corresponding operation to the operator and the computer maintenance staff.

[Brief description of drawings]

【図１】本発明の請求項１に係わる計算機システムの障
害状態の監視診断システムによる第１実施例を示すブロ
ック図。FIG. 1 is a block diagram showing a first embodiment of a fault diagnosis and diagnosis system of a computer system according to claim 1 of the present invention.

【図２】ＣＲＴ表示装置上でグラフィックに表示された
障害箇所を示す図。FIG. 2 is a diagram showing a fault location graphically displayed on a CRT display device.

【図３】障害箇所検出処理部の動作を示すフローチャー
ト。FIG. 3 is a flowchart showing the operation of a fault location detection processing unit.

【図４】障害箇所検出データベースを示す図表。FIG. 4 is a diagram showing a fault location detection database.

【図５】障害状態保存テーブルを示す図表。FIG. 5 is a diagram showing a failure state storage table.

【図６】第２実施例を示すブロック図。FIG. 6 is a block diagram showing a second embodiment.

【図７】ＣＲＴ表示装置上でグラフィックに表示された
他のモジュールの障害により影響を受ける箇所を示す
図。FIG. 7 is a diagram showing a portion affected by a failure of another module which is graphically displayed on the CRT display device.

【図８】障害影響箇所検出処理部の動作を示すフローチ
ャート。FIG. 8 is a flowchart showing the operation of a failure influence point detection processing unit.

【図９】障害影響箇所検出データベースを示す図表。FIG. 9 is a diagram showing a failure influence point detection database.

【図１０】障害影響状態保存テーブルを示す図表。FIG. 10 is a diagram showing a failure influence state storage table.

【図１１】共有化モジュールへの影響を検出する動作を
示す図表。FIG. 11 is a chart showing an operation of detecting an influence on a sharing module.

【図１２】障害の発生により影響を受ける各モジュール
を接続するラインを検出する動作を示すフローチャー
ト。FIG. 12 is a flowchart showing an operation of detecting a line connecting each module affected by the occurrence of a failure.

【図１３】影響ライン検出データベースを示す図表。FIG. 13 is a chart showing an influence line detection database.

【図１４】影響ライン検出状態保存テーブルを示す図
表。FIG. 14 is a diagram showing an affected line detection state storage table.

【図１５】ＣＲＴ表示装置上でグラフィックに表示され
たライン番号を示す図。FIG. 15 is a diagram showing line numbers graphically displayed on a CRT display device.

【図１６】ＣＲＴ表示装置上でグラフィックに表示され
たモジュール番号を示す図。FIG. 16 is a diagram showing module numbers graphically displayed on a CRT display device.

【図１７】第３実施例を示すブロック図。FIG. 17 is a block diagram showing a third embodiment.

【図１８】障害機能検出処理部の動作を示すフローチャ
ート。FIG. 18 is a flowchart showing the operation of the fault function detection processing unit.

【図１９】完全機能使用不可検出データベースを示す図
表。FIG. 19 is a chart showing a full-function unavailability detection database.

【図２０】機能多重化影響判定処理を説明するための図
表。FIG. 20 is a diagram for explaining a function multiplexing influence determination process.

【図２１】完全使用不可状態保存テーブルを示す図表。FIG. 21 is a diagram showing a complete unusable state storage table.

【図２２】障害機能検出処理部の動作を示すフローチャ
ート。FIG. 22 is a flowchart showing the operation of the fault function detection processing unit.

【図２３】部分使用不可検出データベースを示す図表。FIG. 23 is a diagram showing a partially unusable detection database.

【図２４】部分使用不可状態保存テーブルを示す図表。FIG. 24 is a diagram showing a partial unusable state storage table.

【図２５】ＣＲＴ表示装置上にグラフィカルに表示され
た障害の影響を受けた計算機システムの機能を示す図。FIG. 25 is a diagram showing the function of the computer system affected by the failure, which is graphically displayed on the CRT display device.

【図２６】ＣＲＴ表示装置上に表示された障害の影響を
受けた計算機システムの機能を示す図。FIG. 26 is a diagram showing functions of a computer system affected by a failure displayed on a CRT display device.

【図２７】プリンタ装置上に表示された障害の影響を受
けた計算機システムの機能を示す図。FIG. 27 is a diagram showing the functions of a computer system affected by a fault displayed on the printer device.

【図２８】第４実施例を示すブロック図。FIG. 28 is a block diagram showing a fourth embodiment.

【図２９】一次対応手順検出処理部の動作を示すフロー
チャート。FIG. 29 is a flowchart showing the operation of the primary handling procedure detection processing unit.

【図３０】ＣＲＴ表示装置上に表示された一次対応手順
のメッセージを示す図。FIG. 30 is a diagram showing a message of the primary handling procedure displayed on the CRT display device.

【図３１】プリンタ装置上に表示された一次対応手順の
メッセージを示す図。FIG. 31 is a diagram showing a message of a primary handling procedure displayed on the printer device.

【図３２】第５実施例を示すブロック図。FIG. 32 is a block diagram showing a fifth embodiment.

【図３３】障害履歴データの出力表示を示す図。。FIG. 33 is a view showing an output display of fault history data. .

【図３４】従来の技術による計算機システムの障害状態
の監視診断システムを示すブロック図。FIG. 34 is a block diagram showing a monitoring / diagnosing system for a fault condition of a computer system according to a conventional technique.

[Explanation of symbols]

１ａ，１ｂ障害信号処理部２ａ，２ｂ障害箇所検出処理部３ａ，３ｂ障害出力処理部４ａ，４ｂ障害影響箇所検出処理部５ａ，５ｂ障害機能検出処理部６ａ，６ｂ一次対応手順検出処理部７ａ，７ｂ障害履歴管理処理部９ａ，９ｂ障害履歴管理入出力処理部Ｄ２障害箇所検出データベースＤ４障害影響箇所検出データベースＤ５完全機能使用不可検出データベースＤ６部分機能使用不可検出データベースＴ２障害状態保存テーブルＴ４障害影響状態保存テーブルＴ５完全使用不可状態保存テーブルＴ６部分使用不可状態保存テーブルＴ８ａ，Ｔ８ｂ障害履歴データ記録テーブル 1a, 1b Fault signal processing section 2a, 2b Fault location detection processing section 3a, 3b Fault output processing section 4a, 4b Fault affected location detection processing section 5a, 5b Fault function detection processing section 6a, 6b Primary response procedure detection processing section 7a, 7b Fault history management processing unit 9a, 9b Fault history management input / output processing unit D2 Fault location detection database D4 Fault affected location detection database D5 Full function disabled detection database D6 Partial function disabled detection database T2 Fault status storage table T4 Fault affected status Storage table T5 Completely unusable state storage table T6 Partially unusable state storage table T8a, T8b Fault history data recording table

Claims

[Claims]

1. A fault diagnosis and diagnosis system for a fault condition of a computer system, which receives a fault signal when a fault occurs in the computer system and outputs the fault occurrence state of the computer system to a peripheral device. Fault location detection database for associating the modules constituting the, fault state storage table for storing the fault state for each module, and inputting the fault signal through the fault signal processing means to detect the fault location. A fault location detection processing unit that detects a fault module from a database and stores the detection result in the fault state storage table, and a fault state storage table in which the detection result is stored are graphically displayed as a module configuration diagram on a CRT display device. While displaying
A failure diagnosis and monitoring apparatus for a failure state of a computer system, comprising: a failure output processing means for highlighting and displaying the detected failure module on the CRT display device.

2. A fault influence point detection database for associating the fault signal with a module affected by the fault of the module corresponding to the fault signal, and whether or not each module is affected by the fault of another module. A failure state storage table for storing whether or not, and a failure module detected by the failure location detection processing unit are input, a module affected by the failure is detected from the failure impact location detection database, and the status of the module is displayed. And a failure-affected-portion detection processing unit that saves the failure-affected state storage table, wherein the failure output processing unit emphasizes and outputs the detected failure module on the module configuration diagram. The monitoring and diagnosing device for the fault condition of the computer system according to claim 1.

3. A complete function unavailability detection database that describes, for each function of a computer system, a part that is completely unusable by a module affected by a failure that has occurred in the failure module or another module, and For each function of the computer system, a partial function unavailability detection database that describes, for each function of the computer system, a part that is partially unusable due to a failure module or a module affected by a failure that occurs in another module. A complete unusable state storage table and a partial unusable state storage table that store the state of the function, and a failure affected point detected by the failure point detection processing means and a failure detected by the failure affected point detection processing means. Enter the affected part, and detect the database that the full function cannot be used and the partial function that cannot be used. Failure function detection processing means for detecting a completely unusable function and a partially unusable function on the basis of a database and recording them in the completely unusable state saving table and the partially unusable state saving table, respectively. 3. The fault output processing means displays the contents of the completely unusable state storage table and the partially unusable state storage table on the CRT display device. Computer system fault condition monitoring and diagnostic device.

4. An investigation item required for investigating a failure factor from the failure point, the failure affected point, and the function affected by the failure of the failure point, and a primary item for recovering from the failure Enter the primary response procedure database describing the response procedure, the failure occurrence point, the failure affected point, and the function affected by the failure of the failure point, based on the primary response procedure database,
Primary failure handling procedure detection processing means for detecting the investigation item and the primary troubleshooting procedure, wherein the failure output processing means uses the CRT to report the investigation item and the primary troubleshooting procedure detected by the primary handling operation detecting means. The monitoring and diagnosing device for a fault condition of a computer system according to claim 3, which is displayed on a display device or a printer device.

5. The failure location, the failure affected location, and the function affected by the failure of the failure location are recorded in a failure history data recording table, and the cause of failure and the cause found as a result of investigating the investigation item and the A failure history management processing unit that records the result of a procedure in which a primary response procedure is executed in the failure history data recording table, and a form display request are received, and the data recorded in the failure history data recording table is managed as failure history management. 5. A fault diagnosis and diagnosis of a computer system according to claim 4, further comprising: fault history management input / output processing means for displaying a form on a CRT device for use and outputting a form to a fault history management printer device. apparatus.