JPH1074108A

JPH1074108A - Fault detection system

Info

Publication number: JPH1074108A
Application number: JP22990196A
Authority: JP
Inventors: Isamu Yamane; 勇山根
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1996-08-30
Filing date: 1996-08-30
Publication date: 1998-03-17

Abstract

PROBLEM TO BE SOLVED: To allow failures of temporarily failing components to be detected in all monitored systems on the basis of a common, appropriate threshold, by having a monitoring system compare the number of temporary failure occurrences of every component of the monitored systems with the common threshold and decide whether a failure has occurred. SOLUTION: Monitored systems 18A, 18B and 18C transmit temporary failure information when an automatically recoverable temporary failure occurs inside them. A monitoring system is provided with means for receiving and storing the temporary failure information, counting the number of temporary failure occurrences, and comparing the count with a predetermined threshold. When the count is equal to or greater than the threshold, a central control unit 12 judges the relevant component to be faulty, registers this in a failure history database 15, and transfers the information to a fault display control unit 16 so that it is shown on a display device 17. The central control unit 12 then notifies the monitored system 18A of the component failure.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to a failure detection method, and more particularly to a failure detection method for use when a temporary failure occurs in a system.

[0002]

[Prior Art] Even when a failure occurs in a system, if it is a temporary failure that recovers automatically through a retry or the like, it is handled as normal, as though no failure had occurred. Consequently, even if a defective component is mounted, the system was shipped insufficiently adjusted, or quality has degraded through aging or changes in the operating environment, the progress of the degradation cannot be known, the occurrence of a failure cannot be avoided in advance, and the performance and reliability of the system decline.

[0003] For this reason, a failure detection method has long been known in which the number of retries or the number of failure occurrences is stored whenever such a temporary failure occurs, and a degraded state is reported when that number exceeds a threshold (for example, Japanese Patent Application Laid-Open No. 4-165538, titled "Peripheral storage device with deterioration state detection function").

[0004] FIG. 3 is a block diagram of an example of a conventional failure detection system that performs failure detection similar to the above. In the figure, a monitoring system 1 comprises a central control unit 2, a communication control unit 3, a fault display control unit 4 and a display device 5, and detects failures in monitored systems 6A, 6B and 6C. The monitored systems 6A, 6B and 6C have the same configuration, each comprising a central control unit 7, a state information storage unit 8 and a communication control unit 9.

[0005] In this conventional system, the monitoring system 1 receives temporary failure information from the monitored systems 6A, 6B and 6C via the communication control unit 3. Suppose that a temporary failure, one that recovers automatically after being detected, occurs in the monitored system 6A. The failure information is transferred to the central control unit 7. The central control unit 7 identifies the failed part from the input failure information and increments by 1 the error counter value of the corresponding component stored in the state information storage unit 8.

[0006] The central control unit 7 then reads the incremented error counter value of that component from the state information storage unit 8 and compares it with a threshold set in advance in the monitored system 6A. If the value is below the threshold, processing ends; if it equals or exceeds the threshold, the component for which the temporary failure was reported is regarded as faulty and is disconnected from the system. The central control unit 7 further activates the communication control unit 9 to notify the monitoring system 1 of the failure.
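
The conventional flow of paragraphs [0005] and [0006] can be sketched as follows. This is a minimal illustration in Python, not code from the patent: the class name `LegacyMonitoredSystem`, the part identifier `"part-x"` and the threshold value 3 are all assumptions made for the example.

```python
from collections import defaultdict


class LegacyMonitoredSystem:
    """Conventional scheme: each monitored system counts and judges locally."""

    def __init__(self, name, local_threshold):
        self.name = name
        self.local_threshold = local_threshold   # set inside this system only
        self.error_counters = defaultdict(int)   # plays the role of storage unit 8
        self.disconnected = set()

    def on_temporary_failure(self, part_id):
        """Count one temporary failure; return a report for the monitor or None."""
        self.error_counters[part_id] += 1
        count = self.error_counters[part_id]
        if count < self.local_threshold:
            return None                           # below threshold: nothing reported
        self.disconnected.add(part_id)            # regard the part as faulty
        return {"system": self.name, "part": part_id, "count": count}


system_6a = LegacyMonitoredSystem("6A", local_threshold=3)
reports = [system_6a.on_temporary_failure("part-x") for _ in range(3)]
```

In this sketch only the third call produces a report, so the monitoring system never sees the first two temporary failures, and the point at which a report appears depends entirely on the locally configured threshold.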

[0007] When the communication control unit 3 of the monitoring system 1 receives failure information from the monitored systems 6A to 6C, it transfers the information to the central control unit 2. From the transferred failure information, the central control unit 2 identifies the monitored system 6A in which the failure occurred and the nature of the failure, transfers display information to the fault display control unit 4, and has the display device 5 display the failure.

[0008]

[Problems to Be Solved by the Invention] In the conventional failure detection method described above, however, a threshold is set individually in each of the monitored systems 6A to 6C, and the number of temporary failure occurrences is compared against that threshold. Consequently, if the administrator of one of the monitored systems 6A to 6C sets the threshold incorrectly, or sets an inappropriate value deliberately or through negligence, faulty components remain latent, which hinders the operation and maintenance of the system.

[0009] Further, when a component once detected as faulty is repaired and then reused in another monitored system, even if the failure recurs because of a latent defect, the failure is not detected until temporary failures have again been repeated often enough to reach the threshold set in that system.
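
The reuse problem of paragraph [0009] follows from the counters being local to each monitored system. A minimal sketch of that effect, assuming hypothetical serial numbers, a dictionary per system, and a threshold of 3 in both systems (none of these values come from the patent):

```python
THRESHOLD = 3  # assumed per-system threshold, same value in both systems

# Each monitored system keeps its own counters (hypothetical layout).
counters_a = {"SN-001": 3}   # the part was judged faulty in system A
counters_b = {}              # system B has never seen this part


def local_check(counters, serial):
    """Count one temporary failure locally and report whether it is a fault."""
    counters[serial] = counters.get(serial, 0) + 1
    return counters[serial] >= THRESHOLD


# After repair the part moves to system B and fails temporarily again.
first_failure_in_b = local_check(counters_b, "SN-001")
```

Because system B starts its count from zero, the recurrence is not judged a fault here, even though system A's counter for the same part had already reached the threshold.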

[0010] The present invention has been made in view of the above points, and its object is to provide a failure detection method that allows failures of temporarily failing components to be detected in all monitored systems using a common, appropriate threshold.

[0011] Another object of the present invention is to provide a failure detection method that can respond promptly to latent defects.

[0012]

[Means for Solving the Problems] To achieve the above objects, the present invention comprises: one or more monitored systems each having at least a function of transmitting, when an automatically recoverable temporary failure occurs in a component, temporary failure information indicating that the temporary failure has occurred in that component, and a function of disconnecting from the system, on receiving a failure detection notification from outside, the component named in that notification; and a monitoring system comprising a communication control unit that receives the temporary failure information and transmits the failure detection notification, history accumulating means that accumulates the history of components in which a temporary failure has occurred on the basis of the temporary failure information received by the communication control unit, and detecting means that determines whether the number of failure occurrences of the component in which the temporary failure occurred, obtained from the history accumulating means, is equal to or greater than a predetermined threshold and, when it is, judges the component to be faulty and issues the failure detection notification.

[0013] In this invention, the monitoring system analyzes the temporary failure information from the monitored systems, stores the number of temporary failure occurrences for each component, and detects a failure when that number reaches or exceeds the threshold. The number of temporary failure occurrences of every component in every monitored system is therefore compared with a common threshold, and the monitoring system decides whether a failure has occurred.

[0014] In the present invention, the history accumulating means of the monitoring system also retains the history unchanged after a component in which a temporary failure occurred has been judged by the detecting means to be a failed component and has been replaced with a normal component in the monitored system. When the detecting means subsequently receives, via the communication control unit, temporary failure information indicating a temporary failure of the repaired component (the failed component after repair), it retrieves from the history accumulating means the cumulative number of failure occurrences of that component before the repair, judges the repaired component to be faulty, and issues the failure detection notification.

[0015] In this invention, when a component once detected as faulty is repaired and then suffers a temporary failure, the number of temporary failure occurrences becomes the pre-repair count plus 1. This value is greater than the threshold, so a failure is detected immediately.

[0016]

[Embodiments of the Invention] Next, an embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram of one embodiment of the failure detection method according to the present invention. This embodiment comprises a plurality of monitored systems 18A, 18B and 18C, each having a function of transmitting temporary failure information when a temporary failure that can be recovered automatically inside the system occurs, and a monitoring system 11 having means for receiving and accumulating the temporary failure information, counting the accumulated temporary failures, and comparing the count with a preset threshold.

[0017] The monitoring system 11 comprises: a central control unit 12 that exercises overall control of each part of the system and analyzes the temporary failure information of all the monitored systems 18A to 18C; a communication control unit 13 that receives temporary failure information from the monitored systems 18A to 18C; a failure information storage unit 14 that controls the accumulation of analysis results from the central control unit 12 in a failure history database 15; a fault display control unit 16 that receives display information from the central control unit 12; and a display device 17 that displays the display information under the control of the fault display control unit 16.

[0018] Next, the operation of this embodiment will be described with reference also to the flowchart of FIG. 2. The failure detection operation of the monitoring system 11 of FIG. 1 is described taking as an example the case where a temporary failure occurs in the monitored system 18A. The monitored system 18A transmits temporary failure information to the monitoring system 11 (step 21 in FIG. 2). The monitoring system 11 receives the temporary failure information at its communication control unit 13 and transfers it to the central control unit 12 (step 22 in FIG. 2).

[0019] The central control unit 12 analyzes the received temporary failure information and identifies the serial number of the component in which the temporary failure occurred. The central control unit 12 then, via the failure information storage unit 14, updates (accumulates) in the failure history database 15, which covers all monitored systems, the number of temporary failure occurrences recorded for that serial number (step 22 in FIG. 2).

[0020] The central control unit 12 then retrieves the history of the component from the failure history database 15, via the failure information storage unit 14, using the serial number as the key (step 23 in FIG. 2). From the retrieved history, the central control unit 12 compares the component's number of temporary failure occurrences with the threshold set in advance in the monitoring system 11 (step 24 in FIG. 2), and ends processing when the number is less than the threshold.

[0021] When, on the other hand, the number of temporary failure occurrences is equal to or greater than the threshold, the central control unit 12 regards the component as faulty, registers this in the failure history database 15, transfers the information needed for the failure display to the fault display control unit 16, and has the display device 17 indicate that a failure has occurred (step 25 in FIG. 2). The central control unit 12 further notifies the monitored system 18A that transmitted the temporary failure information, via the communication control unit 13, of the component failure (step 26 in FIG. 2).
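
The detection flow of paragraphs [0019] to [0021] can be sketched as follows. This is a minimal illustration, not the patent's implementation: the class `MonitoringSystem`, the message dictionary, the serial number `"SN-001"` and the threshold value 3 are assumptions, and the units 13, 14 and 16 are collapsed into plain method calls and containers.

```python
from collections import defaultdict


class MonitoringSystem:
    """Sketch of monitoring system 11 with one threshold for all systems."""

    def __init__(self, threshold):
        self.threshold = threshold
        self.history = defaultdict(int)  # failure history database 15: serial -> count
        self.faulty = set()              # serials registered as faulty
        self.display = []                # stands in for display device 17

    def on_temporary_failure(self, system_id, serial):
        """Handle one report; return a failure notification or None."""
        self.history[serial] += 1                 # step 22: update the count
        count = self.history[serial]              # step 23: look up the history
        if count < self.threshold:                # step 24: compare
            return None
        self.faulty.add(serial)                   # step 25: register and display
        self.display.append((system_id, serial, count))
        return {"to": system_id, "disconnect": serial}   # step 26: notify


monitor = MonitoringSystem(threshold=3)
results = [monitor.on_temporary_failure("18A", "SN-001") for _ in range(3)]
```

Keying the history by serial number rather than by monitored system is what lets a single threshold apply everywhere: the count follows the component, not the system it happens to be installed in.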

[0022] On receiving the failure notification, the monitored system 18A disconnects the component from the system (step 27 in FIG. 2). When the component named in the failure notification has been replaced with a normal component and the replacement has been incorporated into the system, the monitored system 18A notifies the monitoring system 11 that the failure has been recovered (step 28 in FIG. 2).
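
The monitored-system side of steps 21, 27 and 28 can be sketched as a small class. The names below (`MonitoredSystem`, `outbox`, the message tuples and the serial numbers) are hypothetical; the patent does not specify message formats.

```python
class MonitoredSystem:
    """Sketch of a monitored system such as 18A (all names hypothetical)."""

    def __init__(self, name, parts):
        self.name = name
        self.active_parts = set(parts)
        self.outbox = []                # messages bound for monitoring system 11

    def report_temporary_failure(self, serial):      # step 21
        self.outbox.append(("temporary_failure", self.name, serial))

    def on_failure_notification(self, serial):       # step 27
        self.active_parts.discard(serial)            # disconnect the part

    def replace_part(self, old_serial, new_serial):  # step 28
        self.active_parts.discard(old_serial)
        self.active_parts.add(new_serial)
        self.outbox.append(("recovered", self.name, old_serial, new_serial))


system_18a = MonitoredSystem("18A", ["SN-001", "SN-002"])
system_18a.report_temporary_failure("SN-001")
system_18a.on_failure_notification("SN-001")
system_18a.replace_part("SN-001", "SN-100")
```

Note that the monitored system in this sketch holds no counter and no threshold; it only reports, disconnects on request, and announces the replacement.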

[0023] In the monitoring system 11, the communication control unit 13 receives the recovery notification and transfers it to the central control unit 12. On the basis of the recovery notification, the central control unit 12 controls the fault display control unit 16 to clear the failure indication on the display device 17, and controls the failure information storage unit 14 to register the serial number of the replacement component in the failure history database 15 (step 29 in FIG. 2). The central control unit 12, however, keeps the history of the failed component in the failure history database 15 unchanged.
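
The recovery handling of paragraph [0023] can be sketched as below. The function name `handle_recovery`, the dictionary and set used for the database and the display, and the serial numbers are assumptions for illustration only.

```python
def handle_recovery(history, faulty_on_display, failed_serial, new_serial):
    """Step 29 as sketched here: clear the display, register the new part."""
    faulty_on_display.discard(failed_serial)   # clear the failure indication
    history.setdefault(new_serial, 0)          # register the replacement part
    # The failed part's entry is deliberately left untouched.


history = {"SN-001": 3}
on_display = {"SN-001"}
handle_recovery(history, on_display, "SN-001", "SN-100")
```

After the call the history in this sketch holds both serial numbers: the replacement starts at zero, while the removed part keeps its count of 3.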

[0024] Suppose that the failed component, which was judged faulty in the monitored system 18A and replaced with a normal component, is subsequently repaired and reused in, for example, the monitored system 18B, and that it again suffers a temporary failure there. The monitored system 18B then transmits temporary failure information to the monitoring system 11 (step 21 in FIG. 2).

[0025] The central control unit 12 in the monitoring system 11 analyzes the received temporary failure information, identifies the serial number of the component in which the temporary failure occurred, and updates (accumulates) in the failure history database 15 the number of temporary failure occurrences recorded for that serial number (step 22 in FIG. 2). The central control unit 12 then retrieves the history of the component from the failure history database 15, via the failure information storage unit 14, using the serial number as the key (step 23 in FIG. 2).

[0026] From the serial number of the retrieved component, the central control unit 12 recognizes that the component was previously detected as faulty in the monitored system 18A after exceeding the threshold. Because its number of temporary failure occurrences had already exceeded the threshold when that failure was detected, the central control unit 12 judges, on comparing the count with the threshold, that a failure has occurred even though this is the repaired component's first temporary failure in the monitored system 18B, and performs the failure detection processing from step 25 of FIG. 2 onward on the basis of the failure history.
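
The reuse case of paragraphs [0024] to [0026] can be sketched as follows. The function `report`, the dictionary standing in for the failure history database 15, the serial numbers and the threshold value 3 are assumptions for illustration.

```python
THRESHOLD = 3  # assumed common threshold held by the monitoring system

# Central history keyed by serial number; SN-001 was judged faulty in 18A.
history = {"SN-001": 3}


def report(system_id, serial):
    """Count one temporary failure centrally; return (count, is_fault).

    system_id is accepted only to show that the count does not depend on it.
    """
    history[serial] = history.get(serial, 0) + 1
    count = history[serial]
    return count, count >= THRESHOLD


repaired = report("18B", "SN-001")   # first temporary failure after reuse in 18B
fresh = report("18B", "SN-200")      # a part with no prior history
```

Here the repaired part is judged faulty on its first temporary failure in 18B because its retained count becomes 4, whereas a part with no history starts at 1 and is not.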

[0027]

[Effects of the Invention] As described above, according to the present invention, the number of temporary failure occurrences of every component in every monitored system is compared with a common threshold and the monitoring system decides whether a failure has occurred, so failures of temporarily failing components can be detected in all monitored systems using a common, appropriate threshold.

[0028] Further, according to the present invention, when a component once detected as faulty is repaired and then suffers a temporary failure, a failure is detected immediately, so latent defects can be dealt with promptly.

[Brief description of the drawings]

FIG. 1 is a block diagram of one embodiment of the present invention.

FIG. 2 is a flowchart explaining the operation of the embodiment of FIG. 1.

FIG. 3 is a block diagram of a conventional example.

[Explanation of symbols]

11: monitoring system
12: central control unit
13: communication control unit
14: failure information storage unit
15: failure history database
16: fault display control unit
17: display device
18A, 18B, 18C: monitored systems

Claims

[Claims]

1. A failure detection method comprising: one or more monitored systems each having at least a function of transmitting, when an automatically recoverable temporary failure occurs in a component, temporary failure information indicating that the temporary failure has occurred in that component, and a function of disconnecting from the system, on receiving a failure detection notification from outside, the component named in that notification; and a monitoring system comprising a communication control unit that receives the temporary failure information and transmits the failure detection notification, history accumulating means for accumulating the history of the component in which the temporary failure occurred on the basis of the temporary failure information received by the communication control unit, and detecting means for determining whether the number of failure occurrences of the component in which the temporary failure occurred, obtained from the history accumulating means, is equal to or greater than a predetermined threshold and, when it is, judging the component to be faulty and issuing the failure detection notification.

2. The failure detection method according to claim 1, wherein the history accumulating means of the monitoring system retains the history unchanged even after the component in which the temporary failure occurred has been judged by the detecting means to be a failed component and has been replaced with a normal component in the monitored system, and wherein the detecting means, on subsequently receiving via the communication control unit temporary failure information indicating a temporary failure of a repaired component obtained by repairing the failed component, retrieves from the history accumulating means the cumulative number of failure occurrences of the repaired component before its repair, judges the repaired component to be faulty, and issues the failure detection notification.

3. The failure detection method, wherein the monitoring system comprises: a communication control unit that receives the temporary failure information and transmits the failure detection notification; a history accumulation unit that accumulates the history of components in which a temporary failure has occurred; a central control unit that analyzes the temporary failure information received by the communication control unit, updates in the history accumulation unit the number of failure occurrences of the component in which the temporary failure occurred, compares the updated number of temporary failure occurrences with the threshold to determine whether it is equal to or greater than the threshold, and, when it is, has the communication control unit transmit the failure detection notification to the monitored system that transmitted the temporary failure information; a fault display control unit that receives display information from the central control unit; and a display device that displays the display information under the control of the fault display control unit.