JP6095140B2

JP6095140B2 - Remote monitoring system, remote monitoring method, and program

Info

Publication number: JP6095140B2
Application number: JP2015536594A
Authority: JP
Inventors: 晋二落合
Original assignee: NEC Fielding Ltd
Current assignee: NEC Fielding Ltd
Priority date: 2013-09-11
Filing date: 2014-09-10
Publication date: 2017-03-15
Anticipated expiration: 2034-09-10
Also published as: WO2015037603A1; CN105531680B; JPWO2015037603A1; CN105531680A

Description

［関連出願についての記載］
本発明は、日本国特許出願：特願２０１３−１８８３２４号（２０１３年９月１１日出願）に基づくものであり、同出願の全記載内容は引用をもって本書に組み込み記載されているものとする。
本発明は、遠隔監視システム、遠隔監視方法、及びプログラムに関する。特に、本発明は、監視対象装置における障害をネットワークを介して遠隔検出する技術に関する。[Description of related applications]
The present invention is based on a Japanese patent application: Japanese Patent Application No. 2013-188324 (filed on September 11, 2013), and the entire contents of this application are incorporated and described in this document by reference.
The present invention relates to a remote monitoring system, a remote monitoring method, and a program. In particular, the present invention relates to a technique for remotely detecting a failure in a monitoring target device via a network.

従来、顧客先に設置した情報処理装置等の障害を検出して保守管理するシステムにおいて、以下の２通りの方法が広く用いられている。第１の方法は、顧客の情報処理装置に監視モジュールを導入し、該監視モジュールが障害を検出し通知する方法である。第２の方法は、顧客ネットワーク内に障害監視装置を設置し、該障害監視装置が情報処理装置の障害を検出する方法である。 2. Description of the Related Art Conventionally, the following two methods are widely used in a system for detecting and maintaining a failure of an information processing apparatus installed at a customer site. The first method is a method in which a monitoring module is introduced into an information processing apparatus of a customer, and the monitoring module detects and notifies a failure. The second method is a method in which a failure monitoring device is installed in the customer network, and the failure monitoring device detects a failure of the information processing device.

しかしながら、第１の方法では、情報処理装置の外部から障害を検出することができないため、情報処理装置の異常停止等の障害を検出することができないという問題がある。また、第２の方法では、顧客ネットワーク内に障害監視装置を設置しなければならず、障害監視装置の導入費用が顧客の負担になるという問題がある。 However, the first method has a problem that a failure such as an abnormal stop of the information processing device cannot be detected because a failure cannot be detected from outside the information processing device. Further, in the second method, there is a problem that a trouble monitoring apparatus must be installed in the customer network, and the introduction cost of the trouble monitoring apparatus becomes a burden on the customer.

上記の問題を解消するため、例えば、特許文献１には、複数の顧客サイトに設置したコンピュータシステムを、ネットワークを介して保守管理センタの障害監視サーバにより障害を監視する保守管理システムが開示されている。該保守管理システムによれば、複数の顧客サイトから集められた障害情報を点数化し、点数の高低により障害の重要性や緊急度を定量評価して、保守部品の発送や保守員の手配に関する優先順位付けを一括して管理することが可能である。 In order to solve the above problem, for example, Patent Document 1 discloses a maintenance management system that monitors a failure of a computer system installed at a plurality of customer sites by a failure monitoring server of a maintenance management center via a network. Yes. According to the maintenance management system, the failure information collected from a plurality of customer sites is scored, the importance and urgency of the failure are quantitatively evaluated by the level of the score, and priority is given to dispatch of maintenance parts and arrangement of maintenance personnel. Rankings can be managed collectively.

また、特許文献２には、複数の顧客システムとインターネットを介して接続された保守装置が開示されている。該保守装置は、各顧客システム内の装置の構成情報（型名、製造番号、版数等）の登録手段を有すると共に、その構成情報に基づいて遠隔診断を実行させることができる。また、該保守装置は診断結果に異常を検出した場合に、保守員を自動的に呼び出すこともできる。 Patent Document 2 discloses a maintenance device connected to a plurality of customer systems via the Internet. The maintenance device has a means for registering the configuration information (model name, serial number, version number, etc.) of the device in each customer system, and can execute remote diagnosis based on the configuration information. In addition, the maintenance device can automatically call maintenance personnel when an abnormality is detected in the diagnosis result.

特開２００３−２４１９９９号公報JP 2003-241999 A 特開２００３−３４５６２２号公報JP 2003-345622 A

上記の特許文献の全開示内容は、本書に引用をもって繰り込み記載されているものとする。以下の分析は、本発明の観点から与えられる。 The entire disclosure of the above patent document is incorporated herein by reference. The following analysis is given from the perspective of the present invention.

特許文献１、２で開示された保守管理システムにおいて、特許文献２の図４のように複数の顧客システムに対する診断実行を互いに時間間隔を空けて実施する場合には、それぞれの障害検出、障害解析、及び障害対策を所望の処理能力で実行することが可能である。しかしながら、例えば地震等の緊急時には、複数の顧客システムに対して、障害検出、障害解析、及び障害対策を一斉に実行することが要求される。そのような場合に、特許文献１、２で開示された保守管理システムでは、処理能力が不足する虞がある。 In the maintenance management systems disclosed in Patent Documents 1 and 2, when the diagnosis execution for a plurality of customer systems is performed at intervals of time as shown in FIG. In addition, it is possible to execute a countermeasure against a failure with a desired processing capacity. However, in an emergency such as an earthquake, for example, it is required to simultaneously perform failure detection, failure analysis, and failure countermeasures for a plurality of customer systems. In such a case, the maintenance management systems disclosed in Patent Documents 1 and 2 may have insufficient processing capability.

そこで、本発明は、複数の監視対象装置に対する障害検出、障害解析、及び障害対策の処理能力の向上に貢献しうる遠隔監視システムを提供することを目的とする。 Therefore, an object of the present invention is to provide a remote monitoring system that can contribute to the improvement of processing capability of failure detection, failure analysis, and failure countermeasures for a plurality of monitoring target devices.

第１の視点による遠隔監視システムは、以下の障害監視装置、障害解析装置、及び障害管理装置を含む。該障害監視装置は、複数の監視対象装置にネットワークを介して障害検出コマンドを送信し、各々の前記監視対象装置で実行された前記障害検出コマンドの実行結果を受信して各々の前記監視対象装置の障害の有無を判定し、障害があると判定された前記監視対象装置に関して前記監視対象装置の情報及び前記障害検出コマンドの実行結果を含む障害検出情報を出力する。また、該障害解析装置は、前記障害検出情報を受信し、前記受信された障害検出情報に含まれる障害検出コマンドの実行結果を解析して障害を復旧する障害対処方法を決定し、前記監視対象装置の情報及び前記障害対処方法を含む障害解析情報を出力する。また、該障害管理装置は、前記障害解析情報を受信し、前記受信された障害解析情報に含まれる障害対処方法の実行を管理する。ここで、前記障害監視装置、前記障害解析装置、及び前記障害管理装置の２以上が、互いに異なる前記監視対象装置に対する処理を並行して実行可能に構成される。また、前記障害監視装置は、正系の障害監視装置及び副系の障害監視装置を含む２以上の障害監視装置で構成され、前記正系の障害監視装置及び前記副系の障害監視装置は、夫々前記障害検出情報を出力する。さらに、前記障害解析装置は、前記正系の障害監視装置の異常を検出した場合に、前記正系の障害監視装置と前記副系の障害監視装置との間で、前記正系／副系の設定を切り替える。さらに、前記障害解析装置は、前記正系の障害監視装置の異常を検出しない場合には、前記正系の障害監視装置から送信された前記障害検出情報を使用して障害対処方法を決定し、前記副系の障害監視装置から送信された前記障害検出情報を使用しない。 The remote monitoring system according to the first aspect includes the following failure monitoring device, failure analysis device, and failure management device. The failure monitoring device transmits a failure detection command to a plurality of monitoring target devices via a network, receives an execution result of the failure detection command executed by each of the monitoring target devices, and receives each of the monitoring target devices. The fault detection information including the information on the monitoring target apparatus and the execution result of the fault detection command is output for the monitoring target apparatus determined to have a fault. The failure analysis apparatus receives the failure detection information, analyzes a failure detection command execution result included in the received failure detection information, determines a failure handling method for recovering the failure, and determines the monitoring target Failure analysis information including device information and the failure handling method is output. Further, the failure management apparatus receives the failure analysis information and manages execution of a failure handling method included in the received failure analysis information. Here, two or more of the failure monitoring device, the failure analysis device, and the failure management device are configured to be able to execute processes for different monitoring target devices in parallel. The failure monitoring device is composed of two or more failure monitoring devices including a primary failure monitoring device and a secondary failure monitoring device, and the primary failure monitoring device and the secondary failure monitoring device include: The failure detection information is output respectively . Furthermore, when the failure analysis device detects an abnormality in the primary failure monitoring device, the failure analysis device may be configured to switch between the primary / secondary failure monitoring device between the primary failure monitoring device and the secondary failure monitoring device. Switch settings. Further, when the failure analysis device does not detect an abnormality of the primary failure monitoring device, the failure analysis method is determined using the failure detection information transmitted from the primary failure monitoring device, The failure detection information transmitted from the secondary failure monitoring device is not used.

第２の視点による遠隔監視方法は、以下のステップを含む。即ち、該遠隔監視方法は、複数の監視対象装置にネットワークを介して障害検出コマンドを送信するステップを含む。また、該遠隔監視方法は、各々の前記監視対象装置で実行された前記障害検出コマンドの実行結果を受信して各々の前記監視対象装置の障害の有無を判定し、障害があると判定された前記監視対象装置に関して前記監視対象装置の情報及び前記障害検出コマンドの実行結果を含む障害検出情報を出力する判定ステップを含む。また、該遠隔監視方法は、前記障害検出情報を受信し、前記受信された障害検出情報に含まれる障害検出コマンドの実行結果を解析して障害を復旧する障害対処方法を決定し、前記監視対象装置の情報及び前記障害対処方法を含む障害解析情報を出力する解析ステップを含む。さらに、該遠隔監視方法は、前記障害解析情報を受信し、前記受信された障害解析情報に含まれる障害対処方法の実行を管理する管理ステップを含む。さらに、正系の装置及び副系の装置を含む２以上の装置が前記判定ステップを実行する。さらに、該遠隔監視方法は、前記正系の装置の異常が検出された場合に、前記正系の装置と前記副系の装置との間で、前記正系／副系の設定を切り替えるステップをさらに含む。さらに、該遠隔監視方法は、前記正系の装置の異常が検出されない場合には、前記正系の装置から送信された前記障害検出情報を使用して障害対処方法を決定し、前記副系の装置から送信された前記障害検出情報を使用しないステップを含む。ここで、前記判定ステップ、前記解析ステップ、及び前記管理ステップの２以上が、並行して実行可能である。
本方法は、障害を監視する障害監視装置、障害を解析する障害解析装置、及び障害を管理する障害管理装置という特定の機械に結び付けられている。 The remote monitoring method according to the second aspect includes the following steps. That is, the remote monitoring method includes a step of transmitting a failure detection command to a plurality of monitoring target devices via a network. Further, the remote monitoring method receives the execution result of the failure detection command executed in each of the monitoring target devices, determines whether there is a failure in each of the monitoring target devices, and determines that there is a failure And a determination step of outputting failure detection information including information on the monitoring target device and an execution result of the failure detection command with respect to the monitoring target device. Further, the remote monitoring method receives the failure detection information, analyzes a result of executing a failure detection command included in the received failure detection information, determines a failure handling method for recovering the failure, and determines the monitoring target An analysis step of outputting failure analysis information including device information and the failure handling method. Further, the remote monitoring method includes a management step of receiving the failure analysis information and managing execution of the failure handling method included in the received failure analysis information. Further, two or more devices including a primary device and a secondary device execute the determination step. Further, the remote monitoring method includes a step of switching the setting of the primary / secondary system between the primary system device and the secondary system device when an abnormality of the primary system device is detected. In addition. Furthermore, the remote monitoring method determines a failure handling method using the failure detection information transmitted from the primary device when an abnormality of the primary device is not detected, and A step of not using the failure detection information transmitted from the device. Here, two or more of the determination step, the analysis step, and the management step can be executed in parallel.
The present method is linked to specific machines such as a failure monitoring device that monitors failures, a failure analysis device that analyzes failures, and a failure management device that manages failures.

第３の視点によるプログラムは、複数の監視対象装置の障害を検出するプログラムであって、以下の処理をコンピュータに実行させる。即ち、該プログラムは、複数の監視対象装置にネットワークを介して障害検出コマンドを送信し、各々の前記監視対象装置で実行された前記障害検出コマンドの実行結果を受信して各々の前記監視対象装置の障害の有無を判定し、障害があると判定された前記監視対象装置に関して前記監視対象装置の情報及び前記障害検出コマンドの実行結果を含む障害検出情報を、前記障害検出情報を用いて障害を復旧する障害対処方法を決定する装置に出力する処理、をコンピュータに実行させる。
また、他の視点によるプログラムは、監視対象装置を監視する障害監視装置とネットワークを介して接続する、障害解析装置に実行させるプログラムであって、以下の処理を前記障害解析装置に実行させる。即ち、該プログラムは、障害があると判定された前記監視対象装置に関して前記監視対象装置の情報及び障害検出コマンドの実行結果を含む障害検出情報を受信し、前記受信された障害検出情報に含まれる障害検出コマンドの実行結果を解析して障害を復旧する障害対処方法を決定し、前記監視対象装置の情報及び前記障害対処方法を含む障害解析情報を出力する解析処理、を前記障害解析装置に実行させる。さらに、正系の前記障害監視装置及び副系の前記障害監視装置を含む２以上の前記障害監視装置が前記障害検出情報を出力する。さらに、該プログラムは、前記正系の前記障害監視装置の異常が検出された場合に、前記正系の前記障害監視装置と前記副系の前記障害監視装置との間で、前記正系／副系の設定を切り替える処理を、前記障害解析装置に実行させる。さらに、該プログラムは、前記正系の障害監視装置の異常が検出されない場合には、前記正系の障害監視装置から送信された前記障害検出情報を使用して障害対処方法を決定し、前記副系の障害監視装置から送信された前記障害検出情報を使用しない処理を、前記障害解析装置に実行させる。
なお、これらのプログラムは、コンピュータが読み取り可能な記憶媒体に記録することができる。記憶媒体は、半導体メモリ、ハードディスク、磁気記録媒体、光記録媒体等の非トランジェント（non-transient）なものとすることができる。これらの発明は、コンピュータプログラム製品として具現することも可能である。 The program according to the third aspect is a program for detecting a failure of a plurality of monitoring target devices, and causes a computer to execute the following processing. That is, the program transmits a failure detection command to a plurality of monitoring target devices via a network, receives an execution result of the failure detection command executed by each of the monitoring target devices, and receives each of the monitoring target devices. The failure detection information including the monitoring target device information and the execution result of the failure detection command for the monitoring target device determined to have a failure is determined using the failure detection information. Processing to output to a device that determines a failure handling method to be recovered is executed by a computer.
Further, the program according to another viewpoint, connected through a fault monitoring device and the network to monitor the monitoring target device, a program executed by a failure analysis equipment, to execute the following processing on the fault analysis device. That is, the program receives failure detection information including information on the monitored device and an execution result of the failure detection command regarding the monitored device determined to have a failure, and is included in the received failure detection information. Analyzing the execution result of the failure detection command to determine a failure handling method for recovering the failure, and executing the analysis processing for outputting the failure analysis information including the monitored device information and the failure handling method to the failure analysis device Let Further, two or more fault monitoring apparatuses including the primary fault monitoring apparatus and the secondary fault monitoring apparatus output the fault detection information. Further, when an abnormality of the primary fault monitoring device is detected, the program executes the primary / secondary communication between the primary fault monitoring device and the secondary fault monitoring device. The fault analysis apparatus is caused to execute processing for switching the system setting. Further, when no abnormality is detected in the primary fault monitoring device, the program determines a fault handling method using the fault detection information transmitted from the primary fault monitoring device, and The failure analysis device is caused to execute processing that does not use the failure detection information transmitted from the system failure monitoring device.
Note that these programs can be recorded on a computer-readable storage medium. The storage medium may be non-transient such as a semiconductor memory, a hard disk, a magnetic recording medium, an optical recording medium, or the like. These inventions can also be embodied as computer program products.

遠隔監視システムによれば、複数の監視対象装置に対する障害検出、障害解析、及び障害対策の処理能力の向上に貢献しうる遠隔監視システムを提供することができる。 According to the remote monitoring system, it is possible to provide a remote monitoring system that can contribute to the improvement of processing capability of failure detection, failure analysis, and failure countermeasure for a plurality of monitoring target devices.

一実施形態に係る遠隔監視システムを適用した場合の全体構成を示すブロック図である。It is a block diagram which shows the whole structure at the time of applying the remote monitoring system which concerns on one Embodiment. 一実施形態に係る遠隔監視システムの動作を示すシーケンスチャートである。It is a sequence chart which shows operation | movement of the remote monitoring system which concerns on one Embodiment. 第１の実施形態に係る遠隔監視システムを適用した場合の全体構成を示すブロック図である。It is a block diagram which shows the whole structure at the time of applying the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害監視装置の機能構成を示す図である。It is a figure which shows the function structure of the failure monitoring apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害解析装置の機能構成を示す図である。It is a figure which shows the function structure of the failure analysis apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害管理装置の機能構成を示す図である。It is a figure which shows the function structure of the failure management apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害監視装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the failure monitoring apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害解析装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the failure analysis apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害管理装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the failure management apparatus in the remote monitoring system which concerns on 1st Embodiment. 図９に続くフローチャートである。It is a flowchart following FIG. 第１の実施形態において、障害監視装置の系切り替えが発生する場合の障害監視装置のフローチャートである。6 is a flowchart of a failure monitoring apparatus when system switching of the failure monitoring apparatus occurs in the first embodiment. 第１の実施形態において、障害監視装置の系切り替えが発生する場合の障害解析装置のフローチャートである。5 is a flowchart of a failure analysis device when a system switching of the failure monitoring device occurs in the first embodiment. 第１の実施形態に係る遠隔監視システムにおける障害監視装置の障害検出対象装置データベースの一例である。It is an example of the failure detection object apparatus database of the failure monitoring apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害監視装置の障害検出項目データベースの一例である。It is an example of the failure detection item database of the failure monitoring apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害監視装置のデータ転送先データベースの一例である。It is an example of the data transfer destination database of the failure monitoring apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害解析装置の監視対象装置情報データベースの一例である。It is an example of the monitoring object apparatus information database of the failure analysis apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害解析装置の障害情報データベースの一例である。It is an example of the failure information database of the failure analysis device in the remote monitoring system according to the first embodiment. 第１の実施形態に係る遠隔監視システムにおける障害解析装置の障害監視装置冗長構成情報データベースの一例である。It is an example of the fault monitoring apparatus redundant structure information database of the fault analysis apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害監視装置の障害管理データベースの一例である。It is an example of the failure management database of the failure monitoring apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害監視装置の顧客情報データベースの一例である。It is an example of the customer information database of the failure monitoring apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害監視装置の社員情報データベースの一例である。It is an example of the employee information database of the failure monitoring apparatus in the remote monitoring system which concerns on 1st Embodiment. 第１の実施形態に係る遠隔監視システムにおける障害監視装置の作業予定データベースの一例である。It is an example of the work schedule database of the failure monitoring apparatus in the remote monitoring system which concerns on 1st Embodiment. 第２の実施形態に係る遠隔監視システムを適用した場合の全体構成を示すブロック図である。It is a block diagram which shows the whole structure at the time of applying the remote monitoring system which concerns on 2nd Embodiment. 第２の実施形態に係る遠隔監視システムの動作を示すシーケンスチャートである。It is a sequence chart which shows operation | movement of the remote monitoring system which concerns on 2nd Embodiment. 比較例１に係る遠隔監視システムを適用した場合の全体構成を示すブロック図である。It is a block diagram which shows the whole structure at the time of applying the remote monitoring system which concerns on the comparative example 1. FIG. 比較例１に係る遠隔監視システムの動作を示すシーケンスチャートである。10 is a sequence chart showing the operation of the remote monitoring system according to Comparative Example 1.

まず、一実施形態の概要について説明する。なお、実施形態の概要の説明において付記した図面参照符号は専ら理解を助けるための例示であり、図示の態様に限定することを意図するものではない。 First, an outline of one embodiment will be described. Note that the reference numerals of the drawings added in the description of the outline of the embodiment are merely examples for helping understanding, and are not intended to be limited to the illustrated modes.

一実施形態における遠隔監視システム１は、図１に示すように、以下の障害監視装置１００、障害解析装置２００、及び障害管理装置３００を含む。該障害監視装置１００は、複数の監視対象装置１０ａ〜ｂにネットワーク２を介して障害検出コマンド３ａ〜ｂを送信し、各々の監視対象装置で実行された障害検出コマンドの実行結果４ａ〜ｂを受信して各々の監視対象装置１０ａ〜ｂの障害の有無を判定し、障害があると判定された監視対象装置に関して監視対象装置の情報（１０ａに対応するＡ０００１；図１（Ｂ）参照）及び障害検出コマンドの実行結果４ａ〜ｂを含む障害検出情報５ａ〜ｂを出力する。また、該障害解析装置２００は、障害検出情報５ａ〜ｂを受信し、受信された障害検出情報５ａ〜ｂに含まれる障害検出コマンドの実行結果４ａ〜ｂを解析して障害を復旧する障害対処方法を決定し、監視対象装置の情報及び障害対処方法（６ａに含まれる７ａ；図１（Ｃ）参照）を含む障害解析情報６ａ〜ｂを出力する。また、該障害管理装置３００は、障害解析情報６ａ〜ｂを受信し、受信された障害解析情報６ａ〜ｂに含まれる障害対処方法（６ａに含まれる７ａ；図１（Ｃ）参照）の実行を管理する。ここで、図２に示すように、障害監視装置１００、障害解析装置２００、及び障害管理装置３００の２以上が、互いに異なる監視対象装置１０ａ〜ｂに対する処理を並行して実行可能に構成される（図２に示すように、期間ＴＡにおいて障害監視装置１００と障害解析装置２００、期間ＴＢにおいて障害解析装置２００と障害管理装置３００、が夫々並行して実行されている）。 As shown in FIG. 1, the remote monitoring system 1 according to an embodiment includes the following failure monitoring device 100, failure analysis device 200, and failure management device 300. The failure monitoring device 100 transmits failure detection commands 3a to 3b to the plurality of monitoring target devices 10a to 10b via the network 2, and the execution results 4a to b of the failure detection commands executed by each of the monitoring target devices. Each of the monitoring target devices 10a and 10b is received to determine whether or not there is a failure. Information on the monitoring target device (A0001 corresponding to 10a; see FIG. 1B) regarding the monitoring target device determined to have a failure and Fault detection information 5a-b including fault detection command execution results 4a-b is output. The failure analysis apparatus 200 receives the failure detection information 5a-b, analyzes the failure detection command execution results 4a-b included in the received failure detection information 5a-b, and recovers the failure. The method is determined, and the failure analysis information 6a-b including information on the monitoring target device and the failure handling method (7a included in 6a; see FIG. 1C) is output. Further, the failure management apparatus 300 receives the failure analysis information 6a-b and executes the failure handling method (7a included in 6a; see FIG. 1C) included in the received failure analysis information 6a-b. Manage. Here, as illustrated in FIG. 2, two or more of the failure monitoring device 100, the failure analysis device 200, and the failure management device 300 are configured to be able to execute processes for different monitoring target devices 10 a and 10 b in parallel. (As shown in FIG. 2, the failure monitoring apparatus 100 and the failure analysis apparatus 200 are executed in parallel during the period TA, and the failure analysis apparatus 200 and the failure management apparatus 300 are executed in parallel during the period TB).

上記の構成により、複数の監視対象装置１０ａ〜ｂに対して遠隔で障害検出、障害解析、及び障害対策を行う遠隔監視システム１において、遠隔監視システム１を構成する障害監視装置１００、障害解析装置２００、及び障害管理装置３００の２以上が処理を並行して実行可能に構成される。従って、全体として処理能力が向上した遠隔監視システムを提供することが可能になる。 With the above configuration, in the remote monitoring system 1 that remotely performs failure detection, failure analysis, and failure countermeasures on the plurality of monitoring target devices 10a and 10b, the failure monitoring device 100 and the failure analysis device that configure the remote monitoring system 1 Two or more of 200 and the failure management apparatus 300 are configured to be able to execute processes in parallel. Therefore, it is possible to provide a remote monitoring system with improved processing capability as a whole.

上記の遠隔監視システムにおいて、障害解析装置２００は、複数の障害検出コマンドの実行結果に対応した障害対処方法を予め定めた障害情報データベース（図５の２０５）を備えることが好ましい。そして、障害解析装置２００は、障害監視装置１００から受信した障害検出情報５ａ〜ｂに含まれる障害検出コマンドの実行結果４ａ〜ｂを用いて障害情報データベース（図５の２０５）を検索して障害を復旧する障害対処方法（図１（Ｃ）の７ａ等）を決定するようにしてもよい。 In the above remote monitoring system, the failure analysis apparatus 200 preferably includes a failure information database (205 in FIG. 5) in which a failure handling method corresponding to the execution results of a plurality of failure detection commands is determined. Then, the failure analysis device 200 searches the failure information database (205 in FIG. 5) using the failure detection command execution results 4a and b included in the failure detection information 5a and b received from the failure monitoring device 100, and detects the failure. A failure handling method (such as 7a in FIG. 1C) may be determined.

上記の遠隔監視システムにおいて、障害情報データベース（図５、図１７の２０５）には各々の障害対処方法に対応した被疑部品情報９がさらに定められていることが好ましい。そして、障害解析装置２００は障害検出情報５ａ〜ｂに含まれる障害検出コマンドの実行結果４ａ〜ｂを用いて障害情報データベース（図５の２０５）を検索して被疑部品情報９ａ〜ｂを取得し、障害解析情報６ａ〜ｂに被疑部品情報９ａ〜ｂを付加し、障害管理装置３００は受信された障害解析情報６ａ〜ｂに含まれる被疑部品情報９ａ〜ｂに基づいて被疑部品の手配の指示を送出するようにしてもよい。 In the remote monitoring system described above, it is preferable that suspected part information 9 corresponding to each failure handling method is further defined in the failure information database (205 in FIGS. 5 and 17). Then, the failure analysis apparatus 200 searches the failure information database (205 in FIG. 5) using the failure detection command execution results 4a and b included in the failure detection information 5a and b to obtain the suspected component information 9a and b. Then, the suspicious component information 9a-b is added to the failure analysis information 6a-b, and the failure management apparatus 300 instructs to arrange the suspicious component based on the suspected component information 9a-b included in the received failure analysis information 6a-b. May be sent out.

上記の遠隔監視システムにおいて、障害情報データベース（図５、図１７の２０５）には各々の障害対処方法に対応した保守員派遣要請情報１９がさらに定められていることが好ましい。そして、障害解析装置２００は障害検出情報５ａ〜ｂに含まれる障害検出コマンドの実行結果４ａ〜ｂを用いて障害情報データベース（図５の２０５）を検索して保守員派遣要請情報１９ａ〜ｂを取得し、障害解析情報６ａ〜ｂに保守員派遣要請情報１９ａ〜ｂを付加し、障害管理装置３００は受信された障害解析情報６ａ〜ｂに含まれる保守員派遣要請情報１９ａ〜ｂに基づいて保守員の派遣を指示する通知を保守員端末２２ａに送信するようにしてもよい。 In the above remote monitoring system, it is preferable that maintenance information dispatch request information 19 corresponding to each failure handling method is further defined in the failure information database (205 in FIGS. 5 and 17). Then, the failure analysis apparatus 200 searches the failure information database (205 in FIG. 5) using the failure detection command execution results 4a-b included in the failure detection information 5a-b, and obtains maintenance staff dispatch request information 19a-b. The maintenance manager dispatch request information 19a-b is added to the failure analysis information 6a-b, and the failure management apparatus 300 is based on the maintenance staff dispatch request information 19a-b included in the received failure analysis information 6a-b. You may make it transmit the notification which instruct | indicates dispatch of a maintenance worker to the maintenance worker terminal 22a.

上記の遠隔監視システムにおいて、障害管理装置３００は、複数の監視対象装置１０ａ〜ｂの各々に対応した顧客端末２０ａ〜ｂのアドレスを記憶した顧客情報データベース（図６、図２０の２０８）を備えることが好ましい。そして、障害管理装置３００は、受信された障害解析情報（例えば、６ａ）に含まれる監視対象装置の情報（図１（Ｂ）のＡ０００１）を用いて顧客情報データベース（図６、図２０の２０８）を検索して監視対象装置１０ａに対応した顧客端末２０ａのアドレスを取得し、顧客端末２０ａのアドレスに監視対象装置１０ａの障害を通知するようにしてもよい。 In the remote monitoring system described above, the failure management apparatus 300 includes a customer information database (208 in FIGS. 6 and 20) that stores addresses of customer terminals 20a and 20b corresponding to each of the plurality of monitoring target apparatuses 10a and 10b. It is preferable. Then, the failure management apparatus 300 uses the information on the monitoring target device (A0001 in FIG. 1B) included in the received failure analysis information (for example, 6a), and the customer information database (208 in FIGS. 6 and 20). ) To obtain the address of the customer terminal 20a corresponding to the monitoring target device 10a, and notify the address of the customer terminal 20a of the failure of the monitoring target device 10a.

上記の遠隔監視システムにおいて、図３に示すように、障害監視装置は、正系の障害監視装置１００ｍ及び副系の障害監視装置１００ｓを含む２以上の障害監視装置で構成されることが好ましい。ここで、障害解析装置２００は、正系の障害監視装置１００ｍの異常を検出した場合に、正系の障害監視装置１００ｍと副系の障害監視装置１００ｓとの間で、正系／副系の設定を切り替え、障害解析装置２００は、正系の障害監視装置から送信された障害検出情報を使用して障害対処方法を決定し、副系の障害監視装置から送信された障害検出情報を使用しないことが好ましい。 In the above remote monitoring system, as shown in FIG. 3, the failure monitoring device is preferably composed of two or more failure monitoring devices including a primary failure monitoring device 100m and a secondary failure monitoring device 100s. Here, when the failure analysis apparatus 200 detects an abnormality in the primary fault monitoring apparatus 100m, the fault analysis apparatus 200 establishes a primary / subordinate fault monitoring apparatus 100m between the primary fault monitoring apparatus 100m and the secondary fault monitoring apparatus 100s. The settings are switched, and the failure analysis device 200 determines a failure handling method using the failure detection information transmitted from the primary failure monitoring device, and does not use the failure detection information transmitted from the secondary failure monitoring device. It is preferable.

上記の遠隔監視システムにおいて、上記の複数の障害監視装置（１００ｍ、１００ｓ等）は、夫々定期的に稼動通知情報８ａ〜ｂを障害解析装置２００に送信するように設定され、障害解析装置２００は、２以上の障害監視装置（１００ｍ、１００ｓ等）の各々から稼動通知情報８ａ〜ｂを受信できるか否かを判定し、稼動通知情報を受信できない障害監視装置が異常であると検出するようにしてもよい。 In the above remote monitoring system, the plurality of failure monitoring devices (100m, 100s, etc.) are set to periodically transmit the operation notification information 8a to 8b to the failure analysis device 200. It is determined whether or not the operation notification information 8a to 8b can be received from each of two or more failure monitoring devices (100m, 100s, etc.), and it is detected that the failure monitoring device that cannot receive the operation notification information is abnormal. May be.

上記の遠隔監視システムにおいて、図３に示すように、障害監視装置（１００ｍ、１００ｓ）は閉域ＩＰ（ＩｎｔｅｒｎｅｔＰｒｏｔｏｃｏｌ）網５０を介して複数の監視対象装置（１０ａ〜ｂ）からの障害検出コマンド実行結果４ａ〜ｂを受信することが好ましい。 In the above remote monitoring system, as shown in FIG. 3, the failure monitoring devices (100m, 100s) execute failure detection commands from a plurality of monitoring target devices (10a-b) via a closed IP (Internet Protocol) network 50. It is preferable to receive results 4a-b.

一実施形態における遠隔監視方法は、図１、図２のいずれかに示すように、以下のステップを含む。即ち、該遠隔監視方法は、複数の監視対象装置１０ａ〜ｂにネットワーク２を介して障害検出コマンド３ａ〜ｂを送信するステップを含む。また、該遠隔監視方法は、各々の監視対象装置１０ａ〜ｂで実行された障害検出コマンドの実行結果４ａ〜ｂを受信して各々の監視対象装置１０ａ〜ｂの障害の有無を判定し、障害があると判定された監視対象装置に関して監視対象装置の情報及び障害検出コマンドの実行結果４ａ〜ｂを含む障害検出情報５ａ〜ｂを出力する判定ステップ（Ｓ１００ａ、Ｓ１００ｂ）を含む。また、該遠隔監視方法は、障害検出情報５ａ〜ｂを受信し、受信された障害検出情報５ａ〜ｂに含まれる障害検出コマンドの実行結果４ａ〜ｂを解析して障害を復旧する障害対処方法を決定し、監視対象装置の情報及び障害対処方法を含む障害解析情報６ａ〜ｂを出力する解析ステップ（Ｓ１０１ａ、Ｓ１０１ｂ）を含む。さらに、該遠隔監視方法は、障害解析情報６ａ〜ｂを受信し、受信された障害解析情報６ａ〜ｂに含まれる障害対処方法の実行を管理する管理ステップ（Ｓ１０２ａ、Ｓ１０２ｂ）を含む。ここで、判定ステップ、解析ステップ、及び管理ステップの２以上が、並行して実行可能である（図２では、判定ステップＳ１００ｂと解析ステップＳ１０１ａ、解析ステップＳ１０１ｂと管理ステップＳ１０２ａが夫々並行して実行されている）。 The remote monitoring method in one embodiment includes the following steps as shown in either FIG. 1 or FIG. That is, the remote monitoring method includes a step of transmitting failure detection commands 3a to 3b to the plurality of monitoring target devices 10a to 10b via the network 2. In addition, the remote monitoring method receives the execution result 4a-b of the failure detection command executed in each of the monitoring target devices 10a-b, determines whether there is a failure in each of the monitoring target devices 10a-b, and And a determination step (S100a, S100b) for outputting failure detection information 5a-b including information on the monitoring target device and execution results 4a-b of the failure detection command for the monitoring target device determined to be present. Further, the remote monitoring method receives failure detection information 5a-b, analyzes the failure detection command execution results 4a-b included in the received failure detection information 5a-b, and recovers the failure. And an analysis step (S101a, S101b) for outputting failure analysis information 6a-b including information on the monitoring target device and a failure handling method. Further, the remote monitoring method includes management steps (S102a, S102b) for receiving the failure analysis information 6a-b and managing the execution of the failure handling method included in the received failure analysis information 6a-b. Here, two or more of the determination step, the analysis step, and the management step can be executed in parallel (in FIG. 2, the determination step S100b and the analysis step S101a, and the analysis step S101b and the management step S102a are executed in parallel, respectively. Have been).

以下、各実施形態について、図面を参照して詳しく説明する。 Hereinafter, each embodiment will be described in detail with reference to the drawings.

［第１の実施形態］
（第１の実施形態の構成）
第１の実施形態の構成について、図３〜図６を参照しながら詳細に説明する。図３は、第１の実施形態に係る遠隔監視システム１１の全体構成を示すブロック図である。図３に示すように、遠隔監視システム１１は、複数の顧客サイト（顧客サイトＡ、顧客サイトＢ等）に夫々設置された監視対象装置１０ａ〜ｂを遠隔監視システム１１が設置された監視センタから遠隔で監視する装置である。ここで、監視対象装置１０ａ〜ｂは、例えば、サーバ、ネットワーク機器、ＰＣ（Personal Computer）等の情報処理装置である。[First Embodiment]
(Configuration of the first embodiment)
The configuration of the first embodiment will be described in detail with reference to FIGS. FIG. 3 is a block diagram showing the overall configuration of the remote monitoring system 11 according to the first embodiment. As shown in FIG. 3, the remote monitoring system 11 includes monitoring target devices 10a and 10b installed at a plurality of customer sites (customer site A, customer site B, etc.) from the monitoring center where the remote monitoring system 11 is installed. This is a remote monitoring device. Here, the monitoring target devices 10a and 10b are information processing devices such as servers, network devices, and PCs (Personal Computers), for example.

ここで、各顧客サイトの顧客は、自社ネットワーク内に監視用設備の設置を望まずに、遠隔監視システム１１を提供する業者に対して自社内の情報処理装置の保守管理を依頼している。監視センタの遠隔監視システム１１は、図３に示すように、２つの障害監視装置１００ｍ、１００ｓ、障害解析装置２００、及び障害管理装置３００により構成される。上記４つの装置は、いずれも中央処理装置ＣＰＵ（Central Processing Unit）が搭載されたサーバ機器である。以降、障害監視装置は、正系／副系の区別しない場合は単に「障害監視装置」という。 Here, the customer of each customer site does not want to install the monitoring facility in the company network, but requests the supplier who provides the remote monitoring system 11 to maintain and manage the information processing apparatus in the company. As shown in FIG. 3, the remote monitoring system 11 of the monitoring center includes two failure monitoring devices 100m and 100s, a failure analysis device 200, and a failure management device 300. Each of the four devices is a server device on which a central processing unit CPU (Central Processing Unit) is mounted. Hereinafter, the failure monitoring device is simply referred to as a “failure monitoring device” when there is no distinction between the primary system and the secondary system.

障害監視装置（１００ｍ、１００ｓ）は、閉域ＩＰ網５０を介して監視対象装置１０ａ〜ｂの障害検出を行い、監視対象装置１０ａ〜ｂから異常を検出した場合、障害検出情報５ａ〜ｂを障害解析装置２００に通知する機能を有している。閉域ＩＰ網５０としては、例えば、ＶＰＮ（Virtual Private Network）等が適用可能である。尚、障害監視装置は、正系の障害監視装置１００ｍ及び副系の障害監視装置１００ｓによる冗長構成になっている。 The fault monitoring devices (100m, 100s) detect faults of the monitoring target devices 10a and 10b via the closed IP network 50, and when faults are detected from the monitoring target devices 10a and 10b, the fault detection information 5a and b are faulty. It has a function of notifying the analysis apparatus 200. As the closed IP network 50, for example, a VPN (Virtual Private Network) or the like is applicable. The fault monitoring apparatus has a redundant configuration including a primary fault monitoring apparatus 100m and a secondary fault monitoring apparatus 100s.

障害解析装置２００は、障害監視装置（１００ｍ、１００ｓ）から通知された障害検出情報５ａ〜ｂについて解析を行い、解析結果を障害解析情報６ａ〜ｂとして障害管理装置３００に通知する機能を有している。ここで、障害検出情報５ａ及び障害解析情報６ａは、監視対象装置１０ａに対応し、障害検出情報５ｂ及び障害解析情報６ｂは、監視対象装置１０ｂに対応している。 The failure analysis device 200 has a function of analyzing the failure detection information 5a-b notified from the failure monitoring device (100m, 100s) and notifying the failure management device 300 of the analysis result as failure analysis information 6a-b. ing. Here, the failure detection information 5a and the failure analysis information 6a correspond to the monitoring target device 10a, and the failure detection information 5b and the failure analysis information 6b correspond to the monitoring target device 10b.

障害監視装置（１００ｍ、１００ｓ）は、一方が正系、他方が副系の障害監視装置として機能する。図３では、障害監視装置１００ｍ、障害監視装置１００ｓが夫々正系の障害監視装置、副系の障害監視装置として設定されている。障害解析装置２００は、正系の障害監視装置１００ｍから転送された障害検出情報５ａ〜ｂのみを解析の対象とする。但し、障害解析装置２００が正系の障害監視装置１００ｍの異常を検出した場合は、正系／副系を切り替える。即ち、副系の障害監視装置であった１００ｓが正系に切り替わり、正系の障害監視装置であった１００ｍが副系に切り替わる。 The fault monitoring device (100m, 100s) functions as a fault monitoring device in which one is a primary system and the other is a secondary system. In FIG. 3, the failure monitoring device 100m and the failure monitoring device 100s are set as a primary failure monitoring device and a secondary failure monitoring device, respectively. The failure analysis device 200 uses only the failure detection information 5a-b transferred from the primary failure monitoring device 100m as an analysis target. However, when the failure analysis device 200 detects an abnormality in the primary failure monitoring device 100m, the primary / sub system is switched. That is, 100s that was the fault monitoring device of the secondary system is switched to the primary system, and 100m that was the fault monitoring device of the primary system is switched to the secondary system.

障害管理装置３００は、障害解析装置２００から転送された障害解析情報６ａ〜ｂを基に、顧客端末２０ａ〜ｂへ障害内容の通知、保守員への出動指示の通知、及び保守部品（被疑部品）の選定を行う機能を有している。 Based on the failure analysis information 6a-b transferred from the failure analysis device 200, the failure management device 300 notifies the customer terminals 20a-b of the failure content, the notification of the dispatch instruction to the maintenance staff, and the maintenance component (suspected component). ) Selection function.

（障害監視装置の構成）
次に、図４を参照しながら、障害監視装置（１００ｍ、１００ｓ）の構成について詳細に説明する。障害監視装置（１００ｍ、１００ｓ）は、システム障害検出部１０１、障害検出情報作成部１０２、稼動通知情報定期作成部１０３、及びデータ転送部１０４を備えている。上記の各部は、障害監視装置が備えたＣＰＵにより実行されるソフトウェアモジュールである。また、障害監視装置（１００ｍ、１００ｓ）は、データベースとして障害検出対象装置ＤＢ２０１、障害検出項目ＤＢ２０２、及びデータ転送先ＤＢ２０３を備えている（ここで、ＤＢはデータベースの略。以下、同じ）。上記のデータベースは、障害監視装置の記憶部（不図示）に格納される。(Configuration of fault monitoring device)
Next, the configuration of the failure monitoring apparatus (100m, 100s) will be described in detail with reference to FIG. The failure monitoring apparatus (100m, 100s) includes a system failure detection unit 101, a failure detection information creation unit 102, an operation notification information regular creation unit 103, and a data transfer unit 104. Each of the above-described units is a software module that is executed by a CPU provided in the failure monitoring apparatus. Further, the failure monitoring device (100m, 100s) includes a failure detection target device DB 201, a failure detection item DB 202, and a data transfer destination DB 203 as databases (here, DB is an abbreviation of a database; the same applies hereinafter). The above database is stored in a storage unit (not shown) of the failure monitoring apparatus.

システム障害検出部１０１は、障害検出対象装置ＤＢ（図１３の２０１）を参照し、障害検出コマンド３ａ〜ｂを実行する監視対象装置１０ａ〜ｂのＩＰアドレス、及び監視対象装置コード（監視対象装置を特定するための一意コード）を取得する。 The system failure detection unit 101 refers to the failure detection target device DB (201 in FIG. 13), and the IP addresses of the monitoring target devices 10a to 10b that execute the failure detection commands 3a to 3b and the monitoring target device code (monitoring target device) To get a unique code).

そして、システム障害検出部１０１は、障害検出対象装置ＤＢ２０１から取得した監視対象装置のＩＰアドレスに対して、閉域ＩＰ網５０を介し、障害検出項目ＤＢ（図１４の２０２）から取得した障害検出用コマンド３ａ〜ｂを送信する。そして、各監視対象装置１０ａ〜ｂで実行された障害検出用コマンドの実行結果４ａ〜ｂを受信する。受信された障害検出用コマンドの実行結果４ａ〜ｂは、障害検出項目ＤＢ２０２の障害判定閾値と比較すること等により、正常か異常かの判定を行う。異常と判定した場合は、障害検出項目ＤＢ２０２の障害コードを取得する。 Then, the system failure detection unit 101 detects the failure detected from the failure detection item DB (202 in FIG. 14) via the closed IP network 50 with respect to the IP address of the monitoring target device acquired from the failure detection target device DB 201. Send commands 3a-b. And the execution result 4a-b of the command for fault detection performed by each monitoring object apparatus 10a-b is received. The received failure detection command execution results 4a and 4b are determined to be normal or abnormal by comparing them with the failure determination threshold value in the failure detection item DB 202, for example. If it is determined as abnormal, the failure code in the failure detection item DB 202 is acquired.

次に、取得済みの各種情報（障害検出用コマンド実行日時、監視対象装置のＩＰアドレス、監視対象装置コード、障害検出用コマンドの実行結果４ａ〜ｂ、障害コード）を引数として、障害検出情報作成部１０２を起動する。障害検出情報作成部１０２は、所定のディレクトリに障害検出情報５ａ〜ｂのファイルを作成する。 Next, failure detection information creation is performed using various types of acquired information (failure detection command execution date and time, monitored device IP address, monitored device code, failure detection command execution results 4a and 4b, failure code) as arguments. The unit 102 is activated. The failure detection information creation unit 102 creates files of failure detection information 5a-b in a predetermined directory.

稼動通知情報定期作成部１０３については、図１１〜図１２のフローチャートを説明するときに後述することとする。 The operation notification information regular creation unit 103 will be described later when the flowcharts of FIGS. 11 to 12 are described.

データ転送部１０４は、障害解析装置２００に障害検出情報５ａ〜ｂを転送する機能を有している。データ転送部１０４は、所定のディレクトリに障害検出情報作成部１０２で作成された障害検出情報５ａ〜ｂのファイルが存在するか否かを判定し、存在する場合、データ転送先ＤＢ（図１５の２０３）を参照し、転送先となる障害解析装置２００のＩＰアドレスを取得する。そして、データ転送部１０４は、障害解析装置２００のＩＰアドレスに対し、障害検出情報５ａ〜ｂの転送を実行する。転送された障害検出情報５ａ〜ｂは、障害解析装置２００の所定のディレクトリに保存される。 The data transfer unit 104 has a function of transferring the failure detection information 5a to 5b to the failure analysis apparatus 200. The data transfer unit 104 determines whether or not the file of the failure detection information 5a-b created by the failure detection information creation unit 102 exists in a predetermined directory. If there is, the data transfer destination DB (FIG. 15). 203), the IP address of the failure analysis apparatus 200 as the transfer destination is acquired. Then, the data transfer unit 104 transfers the failure detection information 5a to 5b to the IP address of the failure analysis device 200. The transferred failure detection information 5a-b is stored in a predetermined directory of the failure analysis apparatus 200.

障害検出対象装置ＤＢ２０１は、図１３に示すように、監視対象装置を管理するためのデータベースである。障害検出対象装置ＤＢ２０１は、監視対象装置（１０ａ〜ｂ等）のＩＰアドレス、監視対象装置コードを有している。障害検出対象装置ＤＢ２０１は、システム障害検出部１０１が閉域ＩＰ網５０を介し、障害検出コマンド３ａ〜ｂを実行する装置を特定するために使用される。 The failure detection target device DB 201 is a database for managing monitoring target devices as shown in FIG. The failure detection target device DB 201 has the IP address of the monitoring target device (10a-b, etc.) and the monitoring target device code. The failure detection target device DB 201 is used by the system failure detection unit 101 to identify a device that executes the failure detection commands 3a and 3b via the closed IP network 50.

障害検出項目ＤＢ２０２は、図１４に示すように、障害検出コマンド３ａ〜ｂ及び障害検出コマンドの実行結果４ａ〜ｂの判定に使用する情報を管理するためのデータベースである。障害検出項目ＤＢ２０２は、監視対象装置コード、障害検出コマンド、障害判定閾値、障害コードを有している。障害検出項目ＤＢ２０２は、閉域ＩＰ網５０を介して送信する障害検出コマンド３ａ〜ｂの取得、障害判定閾値の取得、及び障害コードを取得するために使用される。 As shown in FIG. 14, the failure detection item DB 202 is a database for managing information used for determining the failure detection commands 3a and 3b and the execution results 4a and 4b of the failure detection command. The failure detection item DB 202 includes a monitoring target device code, a failure detection command, a failure determination threshold, and a failure code. The failure detection item DB 202 is used for acquiring the failure detection commands 3a and 3b transmitted via the closed IP network 50, acquiring the failure determination threshold, and acquiring the failure code.

データ転送先ＤＢ２０３は、図１５に示すように、作成された障害検出情報５ａ〜ｂの転送先を管理する。データ転送先ＤＢ２０３は、データ転送部１０４が障害検出情報５ａ〜ｂを障害解析装置２００に転送する際に使用される。尚、データ転送先ＤＢ２０３は、稼動通知情報８ａ〜ｂを転送する際にも同様に使用される（詳細は後述する）。 As shown in FIG. 15, the data transfer destination DB 203 manages the transfer destinations of the created failure detection information 5a-b. The data transfer destination DB 203 is used when the data transfer unit 104 transfers the failure detection information 5a to 5b to the failure analysis apparatus 200. The data transfer destination DB 203 is similarly used when transferring the operation notification information 8a-b (details will be described later).

（障害解析装置の構成）
次に、図５を参照しながら、障害解析装置２００の構成について詳細に説明する。障害解析装置２００は、データ解析部１０５、稼動通知情報監視部１０６、障害監視装置系切り替え部１０７、及びデータ転送部１０８を備えている。上記の各部は、障害解析装置２００が備えたＣＰＵにより実行されるソフトウェアモジュールである。また、障害解析装置２００は、データベースとして、監視対象装置情報ＤＢ２０４、障害情報ＤＢ２０５、及び障害監視装置冗長構成情報ＤＢ２０６を備えている。(Configuration of failure analysis device)
Next, the configuration of the failure analysis apparatus 200 will be described in detail with reference to FIG. The failure analysis device 200 includes a data analysis unit 105, an operation notification information monitoring unit 106, a failure monitoring device system switching unit 107, and a data transfer unit 108. Each of the above units is a software module that is executed by a CPU provided in the failure analysis apparatus 200. The failure analysis device 200 includes a monitoring target device information DB 204, a failure information DB 205, and a failure monitoring device redundant configuration information DB 206 as databases.

データ解析部１０５は、障害監視装置（１００ｍ等）から転送された障害検出情報５ａ〜ｂを解析し、障害が発生した装置情報の特定、障害対処方法の決定、現地作業の要否の判定、及び被疑部品の特定を行う機能を有している。データ解析部１０５は、所定のディレクトリに障害監視装置（１００ｍ等）から転送された障害検出情報５ａ〜ｂのファイルが存在するか否かを判定する。障害検出情報５ａ〜ｂのファイルが存在しない場合は終了し、存在する場合は障害検出情報５ａ〜ｂを参照し、障害検出情報５ａ〜ｂに記載された各種情報（障害検出用コマンド実行日時、監視対象装置のＩＰアドレス、監視対象装置コード、障害検出用コマンドの実行結果４ａ〜ｂ、障害コード）を取得する。また、データ解析部１０５は障害検出情報５ａ〜ｂが正系の障害監視装置から送信されたものか、副系の障害監視装置から送信されたものかの判定を行う（詳細は後述）。 The data analysis unit 105 analyzes the failure detection information 5a to 5b transferred from the failure monitoring device (100m, etc.), identifies the device information in which the failure has occurred, determines the failure handling method, determines the necessity of field work, And a function of identifying the suspected part. The data analysis unit 105 determines whether or not the files of the failure detection information 5a to 5b transferred from the failure monitoring device (100m or the like) exist in a predetermined directory. If the failure detection information 5a-b file does not exist, the process ends. If the failure detection information 5a-b file exists, the failure detection information 5a-b is referred to, and various information described in the failure detection information 5a-b (failure detection command execution date and time, The monitoring target device IP address, the monitoring target device code, the failure detection command execution results 4a to 4b, and the failure code) are acquired. Further, the data analysis unit 105 determines whether the failure detection information 5a-b is transmitted from the primary failure monitoring device or the secondary failure monitoring device (details will be described later).

データ解析部１０５は、障害検出情報５ａ〜ｂが正系から送信されたものであると判定した場合には、障害検出情報５ａ〜ｂから取得した監視対象装置コードをキーとして監視対象装置情報ＤＢ（図１６の２０４）を検索し、監視対象装置の情報（装置型番、製品名）を取得する。 If the data analysis unit 105 determines that the failure detection information 5a to b is transmitted from the main system, the monitoring target device information DB is used with the monitoring target device code acquired from the failure detection information 5a to b as a key. (204 in FIG. 16) is searched, and information (device model number, product name) of the monitoring target device is acquired.

続いて、データ解析部１０５は、所得した監視対象装置コードと、障害コードをキーとして障害情報ＤＢ（図１７の２０５）を検索し、障害対処方法、現地作業の要否、及び被疑部品コード（被疑部品を特定するための一意のコード）を取得する。続いて、データ解析部１０５は、上記の取得した情報（障害検出用コマンド実行日時、監視対象装置のＩＰアドレス、監視対象装置コード、障害検出用コマンドの実行結果４ａ〜ｂ、障害コード、監視対象装置の情報（装置型番、製品名）、障害対処方法、現地作業の要否、被疑部品コード）を基に、所定のディレクトリに障害管理装置３００に転送する障害解析情報６ａ〜ｂのファイルを作成する。 Subsequently, the data analysis unit 105 searches the failure information DB (205 in FIG. 17) using the monitored device code that has been acquired and the failure code as a key, the failure handling method, the necessity of field work, and the suspicious part code ( A unique code for identifying the suspicious part is acquired. Subsequently, the data analysis unit 105 performs the above-described acquired information (failure detection command execution date and time, monitoring target device IP address, monitoring target device code, failure detection command execution results 4a to b, failure code, monitoring target. Create failure analysis information 6a-b files to be transferred to the failure management device 300 in a predetermined directory based on device information (device model number, product name), failure handling method, necessity of field work, suspected part code) To do.

稼動通知情報監視部１０６及び障害監視装置系切り替え部１０７についての詳細は、図１１〜図１２のフローチャートを説明するときに後述する。 Details of the operation notification information monitoring unit 106 and the failure monitoring apparatus system switching unit 107 will be described later when the flowcharts of FIGS. 11 to 12 are described.

次に、データ転送部１０８はデータ解析部１０５が作成した障害解析情報６ａ〜ｂのファイルを障害管理装置３００に転送する機能を有している。データ転送部１０８は、所定のディレクトリ内に障害解析情報６ａ〜ｂのファイルの有無を判定する。障害解析情報６ａ〜ｂのファイルが存在しない場合は終了する。障害解析情報６ａ〜ｂのファイルが存在する場合、障害解析情報６ａ〜ｂのファイルを障害管理装置３００に転送する。 Next, the data transfer unit 108 has a function of transferring the files of the failure analysis information 6 a to 6 b created by the data analysis unit 105 to the failure management apparatus 300. The data transfer unit 108 determines whether there is a file of the failure analysis information 6a-b in a predetermined directory. If the failure analysis information 6a-b file does not exist, the process ends. If the failure analysis information 6a-b file exists, the failure analysis information 6a-b file is transferred to the failure management apparatus 300.

障害対象装置情報ＤＢ２０４は、図１６に示すように、監視対象装置を特定するために使用するデータベースであり、監視対象装置コード、監視対象装置の型番及び製品名を管理する。障害対象装置情報ＤＢ２０４は、データ解析部１０５が障害検出情報５ａ〜ｂのファイルから取得した監視対象装置コードをキーとして監視対象装置に関する情報を取得する際に使用される。 As shown in FIG. 16, the failure target device information DB 204 is a database used for specifying a monitoring target device, and manages the monitoring target device code, the model number of the monitoring target device, and the product name. The failure target device information DB 204 is used when the data analysis unit 105 acquires information about the monitoring target device using the monitoring target device code acquired from the failure detection information 5a-b file as a key.

障害情報ＤＢ２０５は、図１７に示すように、発生した障害の情報を管理するためのデータベースであり、監視対象装置コード、障害コード、障害対処方法、被疑部品コード（被疑部品情報）、及び現地作業の要否（保守員派遣要請情報）を管理する。障害情報ＤＢ２０５は、データ解析部１０５が障害検出情報５ａ〜ｂから取得した監視対象装置コード、障害コードをキーとして、障害対処方法、現地作業の要否、被疑部品コードを取得するために使用される。尚、障害監視装置冗長構成情報ＤＢ（図１８の２０６）については、後述する。 As shown in FIG. 17, the failure information DB 205 is a database for managing information on a failure that has occurred, and includes a monitoring target device code, a failure code, a failure handling method, a suspicious component code (suspected component information), and field work Management (maintenance staff dispatch request information) is managed. The failure information DB 205 is used to acquire a failure handling method, necessity of field work, and suspected component code using the monitoring target device code and failure code acquired from the failure detection information 5a-b by the data analysis unit 105 as keys. The The failure monitoring apparatus redundant configuration information DB (206 in FIG. 18) will be described later.

（障害管理装置の構成）
次に、図６を参照しながら、障害管理装置３００の構成について詳細に説明する。障害管理装置３００は、メール送信部１０９、保守員出動指示部１１０、保守部品手配部１１１を備えている。上記の各部は、障害管理装置３００が備えたＣＰＵにより実行されるソフトウェアモジュールである。また、障害管理装置３００は、データベースとして、障害管理ＤＢ２０７、顧客情報ＤＢ２０８、社員情報ＤＢ２０９、及び作業予定ＤＢ２１０を備えている。(Configuration of fault management device)
Next, the configuration of the failure management apparatus 300 will be described in detail with reference to FIG. The failure management apparatus 300 includes a mail transmission unit 109, a maintenance staff dispatch instruction unit 110, and a maintenance parts arrangement unit 111. Each of the above units is a software module executed by the CPU provided in the failure management apparatus 300. Further, the failure management apparatus 300 includes a failure management DB 207, a customer information DB 208, an employee information DB 209, and a work schedule DB 210 as databases.

メール送信部１０９は、遠隔監視システム１１が検出した障害に関する情報を、インターネット６０を介し、顧客端末２０ａ〜ｂに送信する機能を有している。メール送信部１０９は定期的に障害管理ＤＢ（図１９の２０７）を参照し、メール送信済みフラグ情報が未実施である障害情報の有無を判定する。そして、メール送信が未実施である障害情報が存在する場合は、障害管理ＤＢ２０７からメール送信が未実施である障害情報についての各種情報（障害検出用コマンド実行日時、監視対象装置のＩＰアドレス、監視対象装置コード、障害検出用コマンドの実行結果４ａ〜ｂ、障害コード、監視対象装置の情報（装置型番、製品名）、障害対処方法、現地作業の要否、被疑部品コード）を取得し、取得した該情報に基づいて障害情報を通知するメールを作成する。 The mail transmission unit 109 has a function of transmitting information regarding a failure detected by the remote monitoring system 11 to the customer terminals 20a and 20b via the Internet 60. The mail transmission unit 109 periodically refers to the failure management DB (207 in FIG. 19), and determines whether there is failure information for which the email transmission completed flag information is not implemented. If there is failure information for which mail transmission has not been performed, various information about the failure information for which mail transmission has not been performed from the failure management DB 207 (failure detection command execution date and time, IP address of the monitoring target device, monitoring information) Acquire and acquire the target device code, failure detection command execution results 4a-b, failure code, monitored device information (device model number, product name), failure handling method, necessity of local work, suspected component code) A mail for notifying failure information is created based on the information.

続いて、メール送信部１０９は、監視対象装置コードをキーとして顧客情報ＤＢ（図２０の２０８）を検索してメールの送信先アドレスを取得し、メール送信を行う。メール送信部１０９は、メール送信が終了した際に、障害管理ＤＢ（図１９の２０７）内の当該障害情報に対応したメール送信済みフラグを「１」に設定する。 Subsequently, the mail transmission unit 109 searches the customer information DB (208 in FIG. 20) using the monitoring target device code as a key, acquires the mail transmission destination address, and performs mail transmission. When mail transmission ends, the mail transmission unit 109 sets a mail transmission completed flag corresponding to the failure information in the failure management DB (207 in FIG. 19) to “1”.

保守員出動指示部１１０は、障害の対処として現地作業が必要となる場合、現地（顧客サイト）に派遣する保守員を自動的にアサインする機能である。保守員出動指示部１１０は、定期的に障害管理ＤＢ２０７を参照し、現地作業の要否が要であり（即ち、データが「１」の場合）、且つ、保守員要請済みフラグが未実施（即ち、データが「０」）である障害情報の有無を判定する。上記の条件に該当する障害情報の登録が存在しない場合は終了する。該当する障害情報が存在する場合、監視対象装置コードをキーとして顧客情報ＤＢ（図２０の２０８）を検索して対応する顧客の情報（保守拠点コード、顧客名、顧客住所、顧客電話番号）を取得する。続いて、取得した保守拠点コードをキーとして社員情報ＤＢ（図２１の２０９）を検索し、担当拠点に所属する社員の情報（社員コード、社員名、保守員メールアドレス）を取得する。 The maintenance staff dispatch instruction section 110 is a function that automatically assigns maintenance staff dispatched to the site (customer site) when field work is required to deal with a failure. The maintenance staff dispatch instruction unit 110 periodically refers to the failure management DB 207 to determine whether or not the field work is necessary (that is, when the data is “1”), and the maintenance staff requested flag is not implemented ( That is, the presence / absence of failure information whose data is “0”) is determined. If no failure information corresponding to the above condition exists, the process ends. If the corresponding failure information exists, the customer information DB (208 in FIG. 20) is searched using the monitored device code as a key, and the corresponding customer information (maintenance base code, customer name, customer address, customer telephone number) is obtained. get. Subsequently, the employee information DB (209 in FIG. 21) is searched using the acquired maintenance base code as a key, and information (employee code, employee name, maintenance staff mail address) of the employee belonging to the base in charge is acquired.

続いて、保守員出動指示部１１０は、作業予定ＤＢ（図２２の２１０）を参照し、作業予定が登録されていない社員に作業予定情報（監視対象装置コード、社員コード、社員名、作業開始日時）を登録し、作業予定を登録した社員のメールアドレスに上記作業予定情報をメールにて通知する。続いて、保守員出動指示部１１０は、障害管理ＤＢ（図１９の２０７）内の該当する障害情報に保守員要請済みフラグを設定する（データを「１」にする）。 Subsequently, the maintenance staff dispatch instruction unit 110 refers to the work schedule DB (210 in FIG. 22), and schedules work schedule information (monitored device code, employee code, employee name, work start) to an employee whose work schedule is not registered. Date and time) is registered, and the work schedule information is notified by email to the email address of the employee who registered the work schedule. Subsequently, the maintenance staff dispatch instruction unit 110 sets a maintenance engineer requested flag (data is set to “1”) in the corresponding fault information in the fault management DB (207 in FIG. 19).

次に、障害管理ＤＢ２０７において該当する障害情報に被疑部品コードの登録が存在する場合は、監視対象装置コード、装置型番、被疑部品コードを引数として保守部品手配部１１１を起動する。保守部品手配部１１１は、障害復旧のために必要となる被疑部品コードに対応した保守部品を顧客サイトに送付するための機能を有している。保守部品手配部１１１は、監視対象装置コードをキーとして顧客情報ＤＢ（図２０の２０８）を検索し、部品送付先の顧客情報（顧客名、顧客住所、顧客電話番号）を取得する。そして、保守部品手配部１１１は、保守部品倉庫端末２１に上記顧客住所に被疑部品コードに対応する保守部品を配送する指示を通知するメールを送信する。 Next, when there is a suspicious component code registered in the failure information in the failure management DB 207, the maintenance component arrangement unit 111 is activated with the monitored device code, device model number, and suspicious component code as arguments. The maintenance part arrangement unit 111 has a function for sending a maintenance part corresponding to a suspected part code required for failure recovery to a customer site. The maintenance parts arrangement unit 111 searches the customer information DB (208 in FIG. 20) using the monitoring target device code as a key, and acquires the customer information (customer name, customer address, customer telephone number) of the parts delivery destination. Then, the maintenance part arranging unit 111 transmits an email notifying the maintenance parts warehouse terminal 21 of an instruction to deliver the maintenance part corresponding to the suspected part code to the customer address.

障害管理ＤＢ２０７は、図１９に示すように、障害情報を管理するためのデータベースであり、監視対象装置コード、障害検出コマンド実行日時、装置型番、製品名、監視対象装置ＩＰアドレス、障害検出コマンドの実行結果４ａ〜ｂ、障害監視システムコード、障害コード、障害対処方法、現地作業要否、被疑部品コード、メール送信済みフラグ、保守員要請済みフラグを管理する。障害管理ＤＢ２０７は、メール送信部１０９が顧客端末２０ａ〜ｂに対して送信するメールの作成時、保守員出動指示部１１０が保守員の予定登録を実施する時、及び保守部品手配部１１１が保守部品の登録を行う時に参照される。 The failure management DB 207 is a database for managing failure information, as shown in FIG. 19, and includes a monitoring target device code, a failure detection command execution date, a device model number, a product name, a monitoring target device IP address, and a failure detection command. Execution results 4a-b, fault monitoring system code, fault code, fault handling method, necessity of field work, suspected part code, mail transmission completed flag, maintenance staff requested flag are managed. The failure management DB 207 is created when the mail transmission unit 109 creates a mail to be transmitted to the customer terminals 20a and 20b, when the maintenance staff dispatch instruction section 110 performs the scheduled registration of the maintenance staff, and when the maintenance parts arrangement section 111 performs the maintenance. Referenced when registering parts.

顧客情報ＤＢ２０８は、図２０に示すように、顧客情報を管理するためのデータベースであり、監視対象装置コード、保守拠点コード、顧客名、顧客住所、顧客電話番号、顧客メールアドレスを管理する。顧客情報ＤＢ２０８は、保守員出動指示部１１０が保守員の出動先情報を取得する際、及び保守部品手配部１１１が保守部品送信先情報を取得する際に使用される。 As shown in FIG. 20, the customer information DB 208 is a database for managing customer information, and manages a monitoring target device code, a maintenance base code, a customer name, a customer address, a customer telephone number, and a customer mail address. The customer information DB 208 is used when the maintenance staff dispatch instruction section 110 acquires maintenance staff dispatch destination information and when the maintenance parts arrangement section 111 acquires maintenance parts transmission destination information.

社員情報ＤＢ２０９は、図２１に示すように、社員情報（保守員）を管理するためのデータベースであり、保守拠点コード、社員コード、社員名、保守員メールアドレスを管理する。社員情報ＤＢ２０９は、保守員出動指示部１１０が障害発生装置の保守拠点に属する保守員情報を取得する際に使用される。 As shown in FIG. 21, the employee information DB 209 is a database for managing employee information (maintenance personnel), and manages maintenance base codes, employee codes, employee names, and maintenance personnel email addresses. The employee information DB 209 is used when the maintenance staff dispatch instruction unit 110 acquires maintenance staff information belonging to the maintenance base of the failure occurrence device.

作業予定ＤＢ２１０は、図２２に示すように、作業予定を管理するためのデータベースであり、監視対象装置コード、社員コード、社員名、作業開始日時を管理する。作業予定ＤＢ２１０は、保守員出動指示部１１０が現地作業指示を行う際に使用される。 As shown in FIG. 22, the work schedule DB 210 is a database for managing work schedules, and manages monitoring target device codes, employee codes, employee names, and work start dates and times. The work schedule DB 210 is used when the maintenance staff dispatch instruction unit 110 issues a field work instruction.

（第１の実施形態の動作）
次に、第１の実施形態の動作について、図２を参照しながら説明する。図２は、図３に示した第１の実施形態の動作を示すシーケンスチャートである（即ち、図２は、図１、図３の構成に対する共通のシーケンスチャートである）。図２において、遠隔監視システム１１は、２つの顧客サイトに設置した監視対象装置１０ａ〜ｂの障害検出を一斉に実施する場合を想定している。尚、一斉に障害検出を開始する監視対象装置の数は、２に限定されず、任意数が適用可能である。また、正系の障害監視装置として、１００ｍが設定されているものとする。(Operation of the first embodiment)
Next, the operation of the first embodiment will be described with reference to FIG. FIG. 2 is a sequence chart showing the operation of the first embodiment shown in FIG. 3 (that is, FIG. 2 is a common sequence chart for the configurations of FIGS. 1 and 3). In FIG. 2, the remote monitoring system 11 assumes the case where the failure detection of the monitoring target devices 10a and 10b installed at two customer sites is simultaneously performed. Note that the number of monitoring target devices that start failure detection all at once is not limited to two, and an arbitrary number is applicable. Further, it is assumed that 100 m is set as the primary failure monitoring apparatus.

図２において、障害監視装置１００ｍから２つの障害対象装置１０ａ〜ｂに対して、夫々障害検出コマンド３ａ〜ｂが連続して送信される。障害対象装置１０ａ〜ｂは、夫々障害検出コマンド３ａ〜ｂを受けて、障害検出コマンド３ａ〜ｂに対応する診断処理を実行する。障害対象装置１０ａは、診断処理が終了すると障害検出コマンドの実行結果４ａを障害監視装置１００ｍに送信する。障害検出コマンドの実行結果４ａは、診断処理におけるログ情報である。ここで、ログ情報は監視対象装置１０ａで収集した情報を全て含んでいてもよいし、障害解析に有効なものだけを抽出した情報であってもよい。或いは、障害検出コマンドの実行結果４ａは、単に障害検出コマンド３ａ〜ｂの実行結果の正常／異常を表すものであってもよい。同様に、監視対象装置１０ｂは診断処理が終了すると障害検出コマンドの実行結果４ｂを障害監視装置１００ｍに送信する。 In FIG. 2, failure detection commands 3a and 3b are successively transmitted from the failure monitoring device 100m to the two failure target devices 10a and 10b, respectively. The failure target devices 10a and 10b receive the failure detection commands 3a and 3b, respectively, and execute diagnostic processing corresponding to the failure detection commands 3a and 3b. The failure target device 10a transmits the execution result 4a of the failure detection command to the failure monitoring device 100m when the diagnosis process is completed. The failure detection command execution result 4a is log information in the diagnosis process. Here, the log information may include all information collected by the monitoring target device 10a, or may be information obtained by extracting only information effective for failure analysis. Alternatively, the failure detection command execution result 4a may simply indicate the normality / abnormality of the failure detection command 3a-b execution results. Similarly, the monitoring target device 10b transmits the execution result 4b of the failure detection command to the failure monitoring device 100m when the diagnosis process is completed.

障害監視装置１００ｍは、障害検出コマンドの実行結果４ａを受信すると、それに対応して障害検出コマンドの実行結果４ａの判定（Ｓ１００ａ）、及び障害検出情報５ａの送信を行う。次に、障害解析装置２００は、障害検出情報５ａを受信し、障害検出情報５ａの解析（Ｓ１０１ａ）、及び障害解析情報６ａの送信を行う。次に、障害管理装置３００は、障害解析情報６ａを受信し、障害解析情報６ａによる管理（Ｓ１０２ａ）を行う。同様に、障害検出コマンドの実行結果４ｂに対応して、障害監視装置１００ｍ、障害解析装置２００、及び障害管理装置３００は夫々、判定（Ｓ１００ｂ）、解析（Ｓ１０１ｂ）、及び管理（Ｓ１０２ｂ）の処理を行う。ここで、図２に示すように、遠隔監視システム１１は、障害監視装置１００ｍ、障害解析装置２００、及び障害管理装置３００の３つの装置の夫々に判定、解析、管理の３つの処理を分担して実行させるように構成している。これにより、図２の期間ＴＡでは、障害監視装置１００ｍによる判定（Ｓ１００ｂ）と障害解析装置２００による解析（Ｓ１０１ａ）とを並行して実行することが可能である。また、期間ＴＢでは、障害解析装置２００による解析（Ｓ１０１ｂ）と障害管理装置３００による管理（Ｓ１０２ａ）とを並行して実行することが可能である。このように、遠隔監視システム１１内では、２以上の装置を並行動作させることにより全体の処理能力の向上を図っている。 Upon receiving the failure detection command execution result 4a, the failure monitoring apparatus 100m determines the failure detection command execution result 4a (S100a) and transmits the failure detection information 5a accordingly. Next, the failure analysis device 200 receives the failure detection information 5a, analyzes the failure detection information 5a (S101a), and transmits the failure analysis information 6a. Next, the failure management apparatus 300 receives the failure analysis information 6a and performs management (S102a) using the failure analysis information 6a. Similarly, in response to the execution result 4b of the failure detection command, the failure monitoring device 100m, the failure analysis device 200, and the failure management device 300 perform determination (S100b), analysis (S101b), and management (S102b) processing, respectively. I do. Here, as shown in FIG. 2, the remote monitoring system 11 shares the three processes of determination, analysis, and management with each of the three devices, the failure monitoring device 100m, the failure analysis device 200, and the failure management device 300. Are configured to be executed. Thereby, in the period TA in FIG. 2, the determination by the failure monitoring device 100m (S100b) and the analysis by the failure analysis device 200 (S101a) can be executed in parallel. In the period TB, the analysis by the failure analysis device 200 (S101b) and the management by the failure management device 300 (S102a) can be executed in parallel. Thus, in the remote monitoring system 11, the overall processing capability is improved by operating two or more devices in parallel.

次に、図７を参照しながら、図２で示した障害監視装置１００ｍの動作についてより詳細に説明する。図７は、第１の実施形態に係る遠隔監視システム１１における障害監視装置１００ｍの動作を示すフローチャートであり、監視対象装置１０ａのみに対する障害監視装置１００ｍの動作を示している。尚、以下の動作説明において、前述した障害監視装置の構成で説明した内容と重複する説明を省略することがある。 Next, the operation of the failure monitoring apparatus 100m shown in FIG. 2 will be described in more detail with reference to FIG. FIG. 7 is a flowchart showing the operation of the failure monitoring device 100m in the remote monitoring system 11 according to the first embodiment, and shows the operation of the failure monitoring device 100m for only the monitoring target device 10a. In the following description of the operation, a description that overlaps with the content described in the configuration of the failure monitoring apparatus described above may be omitted.

図７において、ステップＡ１〜Ａ４は、システム障害検出部１０１を起動すると共に、監視対象装置１０ａに対して障害検出コマンド３ａを送信している部分である。具体的には、障害検出対象装置ＤＢ（図１３の２０１）を参照して、監視対象装置コードＡ０００１、及びそのＩＰアドレス１０．０．０．１を取得する。次に、監視対象装置コードＡ０００１をキーとして障害検出項目ＤＢ（図１４の２０２）を検索する。図１４において監視対象装置コードＡ０００１に対応する３つのコマンドＴｅｓｔＣｏｍｍａｎｄ００１＿１、ＴｅｓｔＣｏｍｍａｎｄ００１＿２、ＴｅｓｔＣｏｍｍａｎｄ００１＿３を取得し、これらを障害検出コマンド３ａとして監視対象装置１０ａに送信している。近年、情報処理装置が多機能化しテスト内容が多岐に亘るため、障害検出コマンド３ａが複数のコマンドを含む傾向にある。 In FIG. 7, steps A1 to A4 are parts for starting the system failure detection unit 101 and transmitting a failure detection command 3a to the monitoring target device 10a. Specifically, referring to the failure detection target device DB (201 in FIG. 13), the monitoring target device code A0001 and its IP address 10.0.0.1 are acquired. Next, the failure detection item DB (202 in FIG. 14) is searched using the monitoring target device code A0001 as a key. In FIG. 14, three commands TestCommand001_1, TestCommand001_2, and TestCommand001_3 corresponding to the monitoring target apparatus code A0001 are acquired and transmitted to the monitoring target apparatus 10a as a failure detection command 3a. In recent years, since information processing apparatuses have become multifunctional and test contents are diverse, the failure detection command 3a tends to include a plurality of commands.

次に、図７のステップＡ５〜Ａ１２は、図２の判定（Ｓ１００ａ）に相当する部分である。ステップＡ５において、障害検出コマンドの実行結果４ａを受信する。障害検出コマンドの実行結果４ａは、例えば、上記した３つのコマンドＴｅｓｔＣｏｍｍａｎｄ００１＿１、ＴｅｓｔＣｏｍｍａｎｄ００１＿２、ＴｅｓｔＣｏｍｍａｎｄ００１＿３の各々に対するログ情報である。ステップＡ６は、各ログ情報から障害検出コマンドの実行結果４ａが正常であると判定した場合には処理を終了する。一方、障害検出コマンドの実行結果４ａが異常であると判定した場合には、以下のステップＡ７〜Ａ１２を実行する。 Next, steps A5 to A12 in FIG. 7 are portions corresponding to the determination (S100a) in FIG. In step A5, the failure detection command execution result 4a is received. The failure detection command execution result 4a is, for example, log information for each of the three commands TestCommand001_1, TestCommand001_2, and TestCommand001_3. Step A6 ends the process when it is determined from the log information that the failure detection command execution result 4a is normal. On the other hand, when it is determined that the failure detection command execution result 4a is abnormal, the following steps A7 to A12 are executed.

ここで、ステップＡ６において、上記した３つのコマンドの実行結果に対し、図１４に示すように障害判定閾値ｖａｌ００１＿１、ｖａｌ００１＿２、ｖａｌ００１＿３が定められている場合には、各コマンドの実行結果と対応する障害判定閾値が比較され、各コマンドの実行結果の正常／異常が判定される。或いは、障害判定閾値に代わって各コマンドの実行結果の期待値が定められ、該期待値との一致／不一致によって各コマンドの実行結果の正常／異常が判定される場合もある。 Here, in step A6, when the failure determination thresholds val001_1, val001_2, and val001_3 are defined for the execution results of the above three commands as shown in FIG. 14, the failure corresponding to the execution result of each command is determined. The determination threshold values are compared to determine whether the execution result of each command is normal or abnormal. Alternatively, an expected value of the execution result of each command may be determined instead of the failure determination threshold value, and normal / abnormal of the execution result of each command may be determined based on a match / mismatch with the expected value.

ステップＡ７〜Ａ８によって生成される障害検出情報５ａは、前述したように、障害検出用コマンド実行日時、監視対象装置のＩＰアドレス、監視対象装置コード、障害検出用コマンドの実行結果４ａ、障害コードを含んでいる。ここで、障害検出用コマンドの実行結果４ａが大容量のログ情報を含む場合には、障害検出情報５ａに書き込む障害検出用コマンドの実行結果４ａは、ログ情報のファイル名のみを登録するようにし、該ログ情報のファイルを、別途、障害解析装置２００に転送するようにしてもよい。また、障害検出情報５ａの障害コードは、図１４に示すように各コマンドに対応して定義される。例えば、ＴｅｓｔＣｏｍｍａｎｄ００１＿１、ＴｅｓｔＣｏｍｍａｎｄ００１＿２、ＴｅｓｔＣｏｍｍａｎｄ００１＿３の実行結果が異常の場合、夫々の障害コードとして、ＳＧ００１＿１、ＳＧ００１＿２、ＳＧ００１＿３が定義される。 As described above, the failure detection information 5a generated in steps A7 to A8 includes the failure detection command execution date and time, the monitored device IP address, the monitored device code, the failure detection command execution result 4a, and the failure code. Contains. Here, when the failure detection command execution result 4a includes a large amount of log information, the failure detection command execution result 4a written to the failure detection information 5a registers only the file name of the log information. The log information file may be separately transferred to the failure analysis apparatus 200. Further, the failure code of the failure detection information 5a is defined corresponding to each command as shown in FIG. For example, when the execution results of TestCommand001_1, TestCommand001_2, and TestCommand001_1 are abnormal, SG001_1, SG001_2, and SG001_3 are defined as respective failure codes.

ステップＡ９〜Ａ１２は、生成された障害検出情報５ａを障害解析装置２００に転送する処理を行っている部分である。ここで、ステップＡ１１では、データ転送先ＤＢ（図１５の２０３）を参照して、障害解析装置２００のＩＰアドレス１９２．１６８．０．１を取得している。 Steps A <b> 9 to A <b> 12 are parts in which processing for transferring the generated failure detection information 5 a to the failure analysis apparatus 200 is performed. Here, in step A11, the IP address 192.168.0.1 of the failure analysis apparatus 200 is acquired with reference to the data transfer destination DB (203 in FIG. 15).

次に、図８を参照しながら、図２で示した障害解析装置２００の動作についてより詳細に説明する。図８は、第１の実施形態に係る遠隔監視システム１１における障害解析装置２００の動作を示すフローチャートであり、監視対象装置１０ａのみに対する障害監視装置２００の動作を示しており、図２の解析（Ｓ１０１ａ）の処理に対応している。尚、以下の動作説明において、前述した障害解析装置の構成で説明した内容と重複する説明を省略することがある。 Next, the operation of the failure analysis apparatus 200 shown in FIG. 2 will be described in more detail with reference to FIG. FIG. 8 is a flowchart showing the operation of the failure analysis device 200 in the remote monitoring system 11 according to the first embodiment, showing the operation of the failure monitoring device 200 only for the monitoring target device 10a. This corresponds to the processing of S101a). In the following description of the operation, a description that overlaps the content described in the configuration of the failure analysis apparatus described above may be omitted.

図８において、ステップＡ１３でデータ解析部１０５を起動した後、ステップＡ１４〜Ａ１５で障害監視装置１００ｍから転送された障害検出情報５ａから情報取得を行っている。次に、ステップＡ１６では、障害監視装置冗長構成情報ＤＢ（図１８の２０６）を参照し、障害監視装置１００ｍ（障害監視装置コードは１００）は正系であると判定する。よって、ステップＡ１７では、障害検出情報５ａが正系の障害監視装置からの情報であると判定する。次に、ステップＡ１８では、監視対象装置１０ａの監視対象装置コードＡ０００１をキーとして監視対象装置情報ＤＢ（図１６の２０４）を検索することにより、製品名Ｅｘｐｒｅｓｓ５８００ＸＸＸＸＸＸＸＸ、及び装置型番Ｎ１２３４５を取得している。 In FIG. 8, after starting the data analysis unit 105 in step A13, information is acquired from the failure detection information 5a transferred from the failure monitoring apparatus 100m in steps A14 to A15. Next, in step A16, the failure monitoring device redundant configuration information DB (206 in FIG. 18) is referred to, and it is determined that the failure monitoring device 100m (failure monitoring device code is 100) is the main system. Therefore, in step A17, it is determined that the failure detection information 5a is information from the primary failure monitoring device. Next, in step A18, the product name Express5800XXXXXX and the device model number N12345 are acquired by searching the monitoring target device information DB (204 in FIG. 16) using the monitoring target device code A0001 of the monitoring target device 10a as a key. Yes.

次に、ステップＡ１９では、監視対象装置１０ａの監視対象装置コードＡ０００１及び障害検出情報５ａの障害コードＳＧ００１＿１、ＳＧ００１＿２、ＳＧ００１＿３をキーとして障害情報ＤＢ（図１７の２０５）を検索する。その結果、上記のキーに対して障害情報ＤＢ２０５がヒットし、障害対処方法７ａとして「ＸＸＸの交換が必要」、被疑部品コード（被疑部品情報）９ａとしてＡ００１、現地作業の要否（保守員派遣要請情報）１９ａとして１（現地作業が要）の情報が取得される。尚、監視対象装置コード及び障害コードをキーとして障害情報ＤＢ２０５を検索した際にヒットしない場合には、例えば、障害検索コマンドの実行結果４ａのログ情報を直接解析することにより、障害対処方法等を決定するようにしてもよい。 Next, in step A19, the failure information DB (205 in FIG. 17) is searched using the monitoring target device code A0001 of the monitoring target device 10a and the failure codes SG001_1, SG001_2, SG001_3 of the failure detection information 5a as keys. As a result, the failure information DB 205 hits the above key, the failure handling method 7a is “requires replacement of XXX”, the suspected component code (suspected component information) 9a is A001, the necessity of local work (dispatching maintenance personnel) Request information) 19a is acquired as 1 (requires local work). If there is no hit when the failure information DB 205 is searched using the monitoring target device code and the failure code as keys, for example, by directly analyzing the log information of the execution result 4a of the failure search command, a failure handling method or the like can be obtained. It may be determined.

次に、ステップＡ２０〜Ａ２３は、障害解析結果に基づいて、障害解析情報６ａを生成し、障害管理装置３００へ送信する部分である。 Next, steps A20 to A23 are parts for generating the failure analysis information 6a based on the failure analysis result and transmitting it to the failure management apparatus 300.

次に、図９、図１０を参照しながら、図２で示した障害管理装置３００の動作についてより詳細に説明する。図９、図１０は、第１の実施形態に係る遠隔監視システム１１における障害管理装置３００の動作を示すフローチャートであり、監視対象装置１０ａのみに対する障害管理装置３００の動作を示しており、図２の管理（Ｓ１０２ａ）の処理に対応している。尚、以下の動作説明において、前述した障害管理装置の構成で説明した内容と重複する説明を省略することがある。 Next, the operation of the failure management apparatus 300 shown in FIG. 2 will be described in more detail with reference to FIGS. 9 and 10 are flowcharts showing the operation of the failure management apparatus 300 in the remote monitoring system 11 according to the first embodiment, showing the operation of the failure management apparatus 300 for only the monitoring target device 10a. This corresponds to the management (S102a) process. In the following description of the operation, a description that overlaps with the content described in the configuration of the failure management apparatus described above may be omitted.

図９において、ステップＡ２４は、障害解析装置２００から障害解析情報６ａを受信して、障害管理ＤＢ（図１９の２０７）に出力している。図１９を参照すると、障害管理ＤＢ２０７の１行目に障害解析情報６ａに対応した情報が書き込まれていることが分かる。尚、メール送信済みフラグ及び保守員要請済みフラグは、未実施であることを示す「０」に初期設定される。また、コマンド実行結果として、ログファイル名が書き込まれている。 In FIG. 9, step A24 receives the failure analysis information 6a from the failure analysis apparatus 200 and outputs it to the failure management DB (207 in FIG. 19). Referring to FIG. 19, it can be seen that information corresponding to the failure analysis information 6a is written in the first row of the failure management DB 207. Note that the mail transmission completed flag and the maintenance personnel requested flag are initially set to “0” indicating that the flag has not been implemented. The log file name is written as the command execution result.

次に、ステップＡ２５〜Ａ３０は、顧客端末２０ａに障害連絡メールを送信する部分である。具体的には、ステップＡ２６〜Ａ２７で障害管理ＤＢ（図１９の２０７）の１行目を参照してメール送信済みフラグが０（即ち、メール送信が未実施）であると判定し、ステップＡ２８で障害管理ＤＢ２０７の１行目のデータを情報取得する。続いて、ステップＡ２９において、ステップＡ２８で取得した監視対象装置コードＡ０００１をキーとして顧客情報ＤＢ２０８を検索し、顧客メールアドレス「Ａｓｙａ＠ＸＸＸ．ｃｏｍ」を取得する。そして、ステップＡ３０で障害連絡メールを顧客端末２０ａのメールアドレスＡｓｙａ＠ＸＸＸ．ｃｏｍに送信する。 Next, steps A25 to A30 are parts for transmitting a trouble contact mail to the customer terminal 20a. Specifically, in steps A26 to A27, the first line of the failure management DB (207 in FIG. 19) is referred to, and it is determined that the mail transmission flag is 0 (that is, mail transmission has not been performed). As a result, information of the data in the first row of the failure management DB 207 is acquired. Subsequently, in step A29, the customer information DB 208 is searched using the monitoring target device code A0001 acquired in step A28 as a key, and the customer mail address “Asia@XXX.com” is acquired. In step A30, the trouble contact mail is sent to the mail address Asya @ XXX. com.

次に、ステップＡ３１〜Ａ３７は、保守員端末２２ａに出動要請を行う部分である。具体的には、ステップＡ３１で保守員出動指示部１１０を起動する。そして、ステップＡ３２において、前述したステップＡ２８と同様に、障害管理ＤＢ（図１９の２０７）の１行目の情報を取得する。そして、ステップＡ３３において、上記取得した情報において現地作業要否が「１」であることから、保守員出動が必要であると判定する。 Next, steps A31 to A37 are parts for making a request for dispatch to the maintenance staff terminal 22a. Specifically, the maintenance staff dispatch instruction unit 110 is activated in step A31. In step A32, the information on the first line of the failure management DB (207 in FIG. 19) is acquired as in step A28 described above. In step A33, since the necessity of field work is “1” in the acquired information, it is determined that the maintenance staff needs to be dispatched.

そして、ステップＡ３４において、前述したステップＡ２９と同様に、監視対象装置コードＡ０００１をキーとして顧客情報ＤＢ（図２０の２０８）を検索して顧客の保守拠点コード「ＨＫ００１」を取得する。次に、ステップＡ３５において、保守拠点コード「ＨＫ００１」をキーとして社員情報ＤＢ（図２１の２０９）を検索し、保守拠点コード「ＨＫ００１」に対応した拠点の保守員の社員コード「１１１１１１１」、保守員メールアドレス「ｓｈａｉｎＡ＠ｔｅｓｔ．ｃｏｍ」を取得する。次に、ステップＡ３６において、社員コード「１１１１１１１」の作業予定ＤＢ（図２２の２１０）の行に、作業予定を登録する（図２２に示すように、作業開始時刻２０１３年１月１５日９時が書き込まれる）。続いて、ステップＡ３７において、保守員端末２２ａのメールアドレス「ｓｈａｉｎＡ＠ｔｅｓｔ．ｃｏｍ」に、作業予定メールを送信する。該作業予定メールにより、出動先の顧客住所、障害情報、作業開始時刻などの情報が通知される。 In step A34, as in step A29 described above, the customer information DB (208 in FIG. 20) is searched using the monitoring target apparatus code A0001 as a key to obtain the customer maintenance base code “HK001”. Next, in step A35, the employee information DB (209 in FIG. 21) is searched using the maintenance base code “HK001” as a key, and the employee code “1111111” of the maintenance staff at the base corresponding to the maintenance base code “HK001” is stored. Member email address “chainA@test.com” is acquired. Next, in step A36, the work schedule is registered in the row of the work schedule DB (210 in FIG. 22) of the employee code “1111111” (work start time January 15, 2013, 9:00 as shown in FIG. 22). Is written). Subsequently, in step A37, the work schedule mail is transmitted to the mail address “chainA@test.com” of the maintenance staff terminal 22a. Information such as the customer's address of the destination, the failure information, and the work start time is notified by the work schedule mail.

次に、図１０のステップＡ３８〜Ａ４１は、保守部品倉庫端末２１に保守部品の手配を行う部分である。具体的には、ステップＡ３８において、取得済みの障害管理ＤＢ２０７の１行目の情報のうち、被疑部品コードが登録されているか否かを判定する。図１９の障害管理ＤＢ２０７の１行目の情報では、被疑部品コード「Ａ００１」が登録されているので、ステップＡ３８では保守部品が必要と判定される。次に、ステップＡ３９で保守部品手配部１１１を起動する。次に、ステップＡ４０で、監視対象装置コードＡ０００１をキーとして顧客情報ＤＢ２０８を検索し顧客名「Ａ社」、顧客住所「東京都ＸＸＸ」を取得する。そして、ステップＡ４１において、保守部品倉庫端末２１に部品配送指示メールを送信する。該部品配送指示メールは、顧客住所「東京都ＸＸＸ」に、被疑部品コード「Ａ００１」に対応する保守部品を配送する指示を行うものである。 Next, steps A38 to A41 in FIG. 10 are parts for arranging maintenance parts at the maintenance parts warehouse terminal 21. Specifically, in step A38, it is determined whether or not the suspected part code is registered in the information on the first line of the acquired failure management DB 207. In the information on the first line of the failure management DB 207 in FIG. 19, the suspected part code “A001” is registered, so it is determined in step A38 that a maintenance part is necessary. Next, the maintenance parts arrangement unit 111 is activated at step A39. Next, in step A40, the customer information DB 208 is searched by using the monitoring target apparatus code A0001 as a key to acquire the customer name “Company A” and the customer address “Tokyo XXX”. In step A41, a parts delivery instruction mail is transmitted to the maintenance parts warehouse terminal 21. The parts delivery instruction mail is an instruction to deliver a maintenance part corresponding to the suspected part code “A001” to the customer address “Tokyo XXX”.

次に、図１１、図１２を参照しながら、障害監視装置の正系と副系の切り替え動作について説明する。図３に示すように、遠隔監視システム１１の障害監視システムは正系と副系を有する冗長構成になっており、図３では、１００ｍ、１００ｓが夫々正系の障害監視装置、副系の障害監視装置に設定されている状態になっている。 Next, the switching operation between the primary system and the secondary system of the failure monitoring apparatus will be described with reference to FIGS. As shown in FIG. 3, the fault monitoring system of the remote monitoring system 11 has a redundant configuration having a primary system and a secondary system. In FIG. 3, 100m and 100s are the primary system fault monitoring device and the secondary system fault, respectively. The monitoring device is set.

図１１は障害監視装置の系切り替えが発生する場合の障害監視装置のフローチャートである。遠隔監視システム１１が備える２つの障害監視装置１００ｍ、１００ｓのうち、図１１は、障害監視装置１００ｍの動作を示している（１００ｓの動作も同様である）。図１１において、ステップＢ１で稼動通知情報定期作成部１０３が定期的に起動する。定期的に起動する間隔は、予め所定の値に設定されている。ステップＢ２で稼動通知情報定期作成部１０３により稼動通知情報８ａが生成される。稼動通知情報８ａには、稼動通知情報定期作成部１０３の起動日時、及び障害監視装置コードの情報が書き込まれる。 FIG. 11 is a flowchart of the failure monitoring apparatus when system switching of the failure monitoring apparatus occurs. Of the two failure monitoring devices 100m and 100s provided in the remote monitoring system 11, FIG. 11 shows the operation of the failure monitoring device 100m (the operation of 100s is the same). In FIG. 11, the operation notification information regular creation unit 103 is periodically activated in step B1. The interval for periodically starting is set to a predetermined value in advance. In step B2, the operation notification information periodic creation unit 103 generates operation notification information 8a. In the operation notification information 8a, information on the activation date and time of the operation notification information periodic creation unit 103 and the fault monitoring device code are written.

次に、ステップＢ３〜Ｂ６は、生成した稼動通知情報８ａを障害解析装置２００に転送する部分である。具体的には、ステップＢ３においてデータ転送部１０４を定期起動し、ステップＢ４で稼動通知情報８ａのファイルの有無を判定する。ステップＢ４で稼動通知情報８ａのファイルが有ると判定された場合は、ステップＢ５でデータ転送先ＤＢ２０３から障害解析装置２００のＩＰアドレス「１９２．１６８．０．１」を取得する。そして、ステップＢ６で稼動通知情報８ａを障害解析装置２００のＩＰアドレスに送信する。 Next, steps B <b> 3 to B <b> 6 are parts for transferring the generated operation notification information 8 a to the failure analysis apparatus 200. Specifically, the data transfer unit 104 is periodically started in step B3, and the presence / absence of a file of the operation notification information 8a is determined in step B4. If it is determined in step B4 that there is a file of the operation notification information 8a, the IP address “192.168.0.1” of the failure analysis apparatus 200 is acquired from the data transfer destination DB 203 in step B5. In step B6, the operation notification information 8a is transmitted to the IP address of the failure analysis apparatus 200.

次に、図１２は障害監視装置の系切り替えが発生する場合の障害解析装置２００のフローチャートである。障害解析装置２００は、障害監視装置（１００ｍ、１００ｓ）の各々が生成した稼動通知情報８ａ、８ｂの受信を試みて、受信できた場合には送信側の障害監視装置が正常に稼動していると判断する。ここで、障害監視装置１００ｍ、１００ｓの障害監視装置コードは、夫々１００、２００である。 Next, FIG. 12 is a flowchart of the failure analysis apparatus 200 when system switching of the failure monitoring apparatus occurs. The failure analysis device 200 attempts to receive the operation notification information 8a and 8b generated by each of the failure monitoring devices (100m, 100s). If the failure notification device 200 receives the notification, the failure monitoring device on the transmission side is operating normally. Judge. Here, the fault monitoring device codes of the fault monitoring devices 100m and 100s are 100 and 200, respectively.

図１２のステップＢ７〜Ｂ１０は、上記した各障害監視装置の稼動の判断を行う部分である。具体的には、ステップＢ７において稼動通知情報監視部１０６を定期起動する。続いて、障害監視装置冗長構成情報ＤＢ２０６を参照し、障害監視装置コード１００、２００を検索して、障害監視装置コード１００（即ち、障害監視装置１００ｍ）が、正系の障害監視装置であり、障害監視装置コード２００（即ち、障害監視装置１００ｓ）が、副系の障害監視装置であるという情報を取得する。 Steps B7 to B10 in FIG. 12 are parts for determining the operation of each failure monitoring apparatus. Specifically, the operation notification information monitoring unit 106 is periodically activated in step B7. Subsequently, the failure monitoring device redundancy configuration information DB 206 is referred to, the failure monitoring device codes 100 and 200 are searched, and the failure monitoring device code 100 (that is, the failure monitoring device 100m) is the primary failure monitoring device. Information indicating that the failure monitoring device code 200 (that is, the failure monitoring device 100s) is a secondary failure monitoring device is acquired.

次に、ステップＢ９で稼動通知情報８ａ〜ｂの受信を試みる。ステップＢ１０において、稼動通知情報８ａ〜ｂの未着がない場合（即ち、稼動通知情報８ａ〜ｂが共に受信できた場合）は、終了する。一方、正系の障害監視装置からの稼動通知情報８ａが未着の場合（即ち、受信できない場合）、ステップＢ１１において障害監視装置系切り替え部１０７を起動し、障害監視装置の正系と副系の切り替えを行う。そして、ステップＢ１２において、障害監視装置冗長構成情報ＤＢ２０６の１行目において、正系の障害監視装置コードを２００に、副系の障害監視装置コードを１００に更新する。 Next, reception of operation notification information 8a-b is tried at step B9. In step B10, when there is no arrival of the operation notification information 8a-b (that is, when the operation notification information 8a-b can be received together), the process ends. On the other hand, when the operation notification information 8a from the primary failure monitoring device has not arrived (that is, when it cannot be received), the failure monitoring device system switching unit 107 is activated in step B11, and the primary and secondary systems of the failure monitoring device are activated. Switch. In step B12, the primary failure monitoring device code is updated to 200 and the secondary failure monitoring device code is updated to 100 in the first row of the failure monitoring device redundant configuration information DB 206.

（比較例１）
次に、図２５、図２６を参照しながら、比較例１について説明する。図２５は、比較例１に係る遠隔監視システムの全体構成を示すブロック図である。図２５を図３と比較すると分かるように、図２５は図３の遠隔監視システム１１を遠隔監視装置１３に置き換えた構成になっている。比較例１では、第１の実施形態の障害監視装置１００ｍ、障害解析装置２００、及び障害管理装置３００の各機能を、１台の遠隔監視装置１３で実行するように構成している。また、比較例１では、遠隔監視装置１３と監視対象装置１０ａ〜ｂとは、閉域ＩＰ網５０ではなくインターネット６０を介して接続される。(Comparative Example 1)
Next, Comparative Example 1 will be described with reference to FIGS. FIG. 25 is a block diagram showing the overall configuration of the remote monitoring system according to Comparative Example 1. As can be seen by comparing FIG. 25 with FIG. 3, FIG. 25 has a configuration in which the remote monitoring system 11 of FIG. In Comparative Example 1, each function of the failure monitoring device 100m, the failure analysis device 200, and the failure management device 300 of the first embodiment is configured to be executed by one remote monitoring device 13. In the first comparative example, the remote monitoring device 13 and the monitoring target devices 10a and 10b are connected not via the closed IP network 50 but via the Internet 60.

図２６は、比較例１の動作を示すシーケンスチャートである。図２６において、遠隔監視装置１３による判定（Ｓ３００ａ）、解析（Ｓ３０１ａ）、管理（Ｓ３０２ａ）、判定（Ｓ３００ｂ）、解析（Ｓ３０１ｂ）、及び管理（Ｓ３０２ｂ）が順次処理される。ここで、各々の処理内容は、図２の判定（Ｓ１００ａ）、解析（Ｓ１０１ａ）、管理（Ｓ１０２ａ）、判定（Ｓ１００ｂ）、解析（Ｓ１０１ｂ）、及び管理（Ｓ１０２ｂ）と同様である。 FIG. 26 is a sequence chart showing the operation of the first comparative example. In FIG. 26, determination (S300a), analysis (S301a), management (S302a), determination (S300b), analysis (S301b), and management (S302b) by the remote monitoring device 13 are sequentially processed. Here, each processing content is the same as the determination (S100a), analysis (S101a), management (S102a), determination (S100b), analysis (S101b), and management (S102b) in FIG.

（第１の実施形態と比較例１の比較）
次に、第１の実施形態と比較例１を対比することにより、第１の実施形態で得られる効果について説明する。図２（第１の実施形態）と図２６（比較例１）を比較すると、図２の場合のほうが、期間ＴＡ、ＴＢにおいて処理を並列に実行させることにより、全体の処理能力が向上し、全体の処理時間を短縮する効果があることが分かる。従って、緊急時に複数の監視対象装置に対して一斉に障害検出をさせる場合には、並列処理が可能な第１の実施形態のほうが、顧客に対する障害連絡メールを早く送ることができ、保守員に対して早く出動指示を出すことができ、保守部品の手配を早く行うことができる。尚、第１の実施形態では、監視対象装置が２つの場合について例示したが、監視対象装置の数が増えた場合、並列動作をさせる効果は顕著になる。(Comparison between the first embodiment and Comparative Example 1)
Next, the effects obtained in the first embodiment will be described by comparing the first embodiment with the comparative example 1. Comparing FIG. 2 (first embodiment) and FIG. 26 (Comparative Example 1), in the case of FIG. 2, the overall processing capability is improved by executing the processes in parallel during the periods TA and TB. It turns out that there exists an effect which shortens the whole processing time. Therefore, in the case where a plurality of devices to be monitored are simultaneously detected in an emergency, the first embodiment capable of parallel processing can send a failure notification mail to the customer earlier, and can be used for maintenance personnel. On the other hand, a dispatch instruction can be issued early, and maintenance parts can be arranged quickly. In the first embodiment, the case where there are two monitoring target devices is illustrated. However, when the number of monitoring target devices increases, the effect of performing the parallel operation becomes significant.

また、比較例１の場合、遠隔監視装置１３の不具合が発生すると遠隔監視ができない状態になってしまう虞がある。一方、第１の実施形態の場合には、障害監視装置を正系と副系の冗長構成とし、その切り替えを障害解析装置２００で制御するようにしている。このように構成することで、正系の障害監視装置に何らかのトラブルが発生し動作不能の状態になったとしても、副系の障害監視装置が代わって障害監視を行うようにすることが可能になるという効果が得られる。 Moreover, in the case of the comparative example 1, when the malfunction of the remote monitoring apparatus 13 generate | occur | produces, there exists a possibility that it may be in the state which cannot perform remote monitoring. On the other hand, in the case of the first embodiment, the failure monitoring apparatus has a redundant configuration of the primary system and the secondary system, and the switching is controlled by the failure analysis apparatus 200. By configuring in this way, even if some trouble occurs in the primary fault monitoring device and it becomes inoperable, the secondary fault monitoring device can perform fault monitoring instead. The effect of becoming is obtained.

また、比較例１では、遠隔監視装置１３と複数の監視対象装置１０ａ〜ｂとがインターネット６０を介して障害検出コマンドの送受信及び障害検出コマンドの実行結果の送受信を行っているため、監視対象装置において診断処理と別の秘匿性の高い処理とが並行して実行されているような場合、障害検出コマンドの実行結果（例えば、ログ情報）に秘匿性の高いデータが含まれることが懸念される。一方、第１の実施形態では、遠隔監視システム１１と複数の監視対象装置１０ａ〜ｂとがＶＰＮ等の閉域ＩＰ網５０を介して障害検出コマンドの送受信及び障害検出コマンドの実行結果の送受信を行っているため、障害検出コマンドの実行結果（例えば、ログ情報）に秘匿性の高いデータが含まれたとしても、外部からのアクセスされないようにセキュリティを確保することが可能であるという効果が得られる。 In the first comparative example, the remote monitoring device 13 and the plurality of monitoring target devices 10a to 10b transmit / receive the failure detection command and the transmission / reception of the execution result of the failure detection command via the Internet 60. When a diagnostic process and another highly confidential process are executed in parallel, the execution result (for example, log information) of the failure detection command may contain highly confidential data. . On the other hand, in the first embodiment, the remote monitoring system 11 and the plurality of monitoring target devices 10a to 10b perform transmission / reception of a failure detection command and transmission / reception of the execution result of the failure detection command via the closed IP network 50 such as VPN. Therefore, even if highly confidential data is included in the execution result (for example, log information) of the failure detection command, it is possible to ensure the security so that it is not accessed from the outside. .

［第２の実施形態］
次に、第２の実施形態について、図２３、図２４を参照しながら説明する。図２３は第２の実施形態に係る遠隔監視システム１２を適用した場合の全体構成を示すブロック図である。第２の実施形態では、第１の実施形態の障害解析装置２００及び障害管理装置３００の機能を、障害解析管理装置４００の１つの装置で動作させるように構成している。[Second Embodiment]
Next, a second embodiment will be described with reference to FIGS. FIG. 23 is a block diagram showing an overall configuration when the remote monitoring system 12 according to the second embodiment is applied. In the second embodiment, the functions of the failure analysis device 200 and the failure management device 300 of the first embodiment are configured to operate with one device of the failure analysis management device 400.

図２４は、第２の実施形態の動作を示すシーケンスチャートである。図２４において、判定（Ｓ２００ａ）、解析（Ｓ２０１ａ）、管理（Ｓ２０２ａ）、判定（Ｓ２００ｂ）、解析（Ｓ２０１ｂ）、及び管理（Ｓ２０２ｂ）の各々の処理内容は、図２の判定（Ｓ１００ａ）、解析（Ｓ１０１ａ）、管理（Ｓ１０２ａ）、判定（Ｓ１００ｂ）、解析（Ｓ１０１ｂ）、及び管理（Ｓ１０２ｂ）と同様である。図２４では、期間ＴＡにおいて、障害監視装置１００による判定（Ｓ２００ｂ）と障害解析管理装置４００による解析（Ｓ２０１ａ）とを並列動作させることができる。但し、図２（第１の実施形態）の期間ＴＢの並列動作は得られない。 FIG. 24 is a sequence chart showing the operation of the second embodiment. In FIG. 24, the processing contents of determination (S200a), analysis (S201a), management (S202a), determination (S200b), analysis (S201b), and management (S202b) are the same as the determination (S100a) and analysis of FIG. This is the same as (S101a), management (S102a), determination (S100b), analysis (S101b), and management (S102b). In FIG. 24, the determination by the failure monitoring device 100 (S200b) and the analysis by the failure analysis management device 400 (S201a) can be performed in parallel during the period TA. However, the parallel operation in the period TB of FIG. 2 (first embodiment) cannot be obtained.

以上のように、第２の実施形態によれば、第１の実施形態に比べて並列化できる動作が半減するものの、比較例１に比べると処理能力を向上させることができる。 As described above, according to the second embodiment, although the operation that can be parallelized is halved compared to the first embodiment, the processing capability can be improved as compared with the first comparative example.

また、第２の実施形態の遠隔監視システム１２は、少ない装置数で構成することができるので（第１の実施形態の４つに対し、第２の実施形態の遠隔監視システム１２は３つ）、低コストの遠隔監視システムとすることができるという効果が得られる。 Further, the remote monitoring system 12 of the second embodiment can be configured with a small number of devices (three remote monitoring systems 12 of the second embodiment compared to four of the first embodiment). Thus, it is possible to obtain a low-cost remote monitoring system.

尚、第１の実施形態において、障害監視装置（１００ｍ、１００ｓ）による処理（図７、図１１の各ステップ）は、障害監視装置の記憶部（不図示）にプログラムとして格納され、障害監視装置（１００ｍ、１００ｓ）が備えたＣＰＵにより呼び出されて実行される。また、障害解析装置２００による処理（図８、図１２の各ステップ）は、障害解析装置の記憶部（不図示）にプログラムとして格納され、障害解析装置２００が備えたＣＰＵにより呼び出されて実行される。また、障害管理装置３００による処理（図９、図１０の各ステップ）は、障害管理装置の記憶部（不図示）にプログラムとして格納され、障害管理装置３００が備えたＣＰＵにより呼び出されて実行される。また、第２の実施形態の障害解析管理装置４００による処理も、障害解析管理装置の記憶部（不図示）にプログラムとして格納され、障害解析管理装置４００が備えたＣＰＵにより呼び出されて実行される。また、上記した各プログラムは、ネットワークを介してダウンロードするか、或いは、プログラムを記憶した記憶媒体を用いて、更新することができる。 In the first embodiment, processing (steps in FIGS. 7 and 11) by the failure monitoring device (100m, 100s) is stored as a program in a storage unit (not shown) of the failure monitoring device. It is called and executed by the CPU provided in (100m, 100s). Further, the processing (each step in FIGS. 8 and 12) by the failure analysis device 200 is stored as a program in a storage unit (not shown) of the failure analysis device, and is called and executed by a CPU provided in the failure analysis device 200. The Further, the processing (steps in FIGS. 9 and 10) by the failure management apparatus 300 is stored as a program in a storage unit (not shown) of the failure management apparatus, and is called and executed by the CPU provided in the failure management apparatus 300. The Further, the processing by the failure analysis management device 400 of the second embodiment is also stored as a program in a storage unit (not shown) of the failure analysis management device, and is called and executed by the CPU provided in the failure analysis management device 400. . Each program described above can be downloaded via a network or updated using a storage medium storing the program.

上記の実施形態の一部又は全部は、以下の形態のようにも記載されうるが、以下には限られない。 Part or all of the above-described embodiments can be described as in the following forms, but is not limited to the following.

（形態１）複数の監視対象装置にネットワークを介して障害検出コマンドを送信し、各々の前記監視対象装置で実行された前記障害検出コマンドの実行結果を受信して各々の前記監視対象装置の障害の有無を判定し、障害があると判定された前記監視対象装置に関して前記監視対象装置の情報及び前記障害検出コマンドの実行結果を含む障害検出情報を出力する障害監視装置と、
前記障害検出情報を受信し、前記受信された障害検出情報に含まれる障害検出コマンドの実行結果を解析して障害を復旧する障害対処方法を決定し、前記監視対象装置の情報及び前記障害対処方法を含む障害解析情報を出力する障害解析装置と、
前記障害解析情報を受信し、前記受信された障害解析情報に含まれる障害対処方法の実行を管理する障害管理装置と、
を備え、
前記障害監視装置、前記障害解析装置、及び前記障害管理装置の２以上が、互いに異なる前記監視対象装置に対する処理を並行して実行可能に構成された、遠隔監視システム。(Mode 1) A failure detection command is transmitted to a plurality of monitoring target devices via a network, and an execution result of the failure detection command executed by each of the monitoring target devices is received to detect a failure of each of the monitoring target devices. A fault monitoring device that outputs fault detection information including information on the monitoring target device and an execution result of the fault detection command regarding the monitoring target device determined to have a fault,
The failure detection information is received, the execution result of the failure detection command included in the received failure detection information is analyzed, a failure handling method for recovering the failure is determined, and the information on the monitored device and the failure handling method are determined. A failure analysis device that outputs failure analysis information including
A failure management device that receives the failure analysis information and manages execution of a failure handling method included in the received failure analysis information;
With
A remote monitoring system in which two or more of the failure monitoring device, the failure analysis device, and the failure management device are configured to be able to execute processes for different monitoring target devices in parallel.

（形態２）前記障害解析装置は、
複数の障害検出コマンドの実行結果に対応した障害対処方法を予め定めた障害情報データベースを備え、
前記障害監視装置から受信した障害検出情報に含まれる障害検出コマンドの実行結果を用いて前記障害情報データベースを検索して障害を復旧する障害対処方法を決定する、形態１に記載の遠隔監視システム。(Mode 2) The failure analysis apparatus comprises:
Provided with a failure information database in which failure handling methods corresponding to the execution results of a plurality of failure detection commands are predetermined,
The remote monitoring system according to mode 1, wherein a fault handling method for searching for the fault information database and determining a fault is determined by using an execution result of a fault detection command included in the fault detection information received from the fault monitoring device.

（形態３）前記障害情報データベースには各々の前記障害対処方法に対応した被疑部品情報がさらに定められており、
前記障害解析装置は前記障害検出情報に含まれる障害検出コマンドの実行結果を用いて前記障害情報データベースを検索して被疑部品情報を取得し、前記障害解析情報に前記被疑部品情報を付加し、
前記障害管理装置は前記受信された障害解析情報に含まれる被疑部品情報に基づいて被疑部品の手配の指示を送出する、形態２に記載の遠隔監視システム。(Mode 3) The failure information database further includes suspected part information corresponding to each failure handling method,
The failure analysis device searches the failure information database using the execution result of the failure detection command included in the failure detection information to obtain the suspected component information, adds the suspected component information to the failure analysis information,
The remote monitoring system according to mode 2, wherein the failure management apparatus sends an instruction to arrange a suspected component based on suspected component information included in the received failure analysis information.

（形態４）前記障害情報データベースには各々の前記障害対処方法に対応した保守員派遣要請情報がさらに定められており、
前記障害解析装置は前記障害検出情報に含まれる障害検出コマンドの実行結果を用いて前記障害情報データベースを検索して保守員派遣要請情報を取得し、前記障害解析情報に前記保守員派遣要請情報を付加し、
前記障害管理装置は前記受信された障害解析情報に含まれる保守員派遣要請情報に基づいて保守員の派遣を指示する通知を保守員端末に送信する、形態２または３に記載の遠隔監視システム。(Mode 4) The failure information database further defines maintenance staff dispatch request information corresponding to each failure handling method,
The failure analysis device searches the failure information database using the execution result of the failure detection command included in the failure detection information to obtain maintenance staff dispatch request information, and adds the maintenance staff dispatch request information to the failure analysis information. Add
The remote monitoring system according to claim 2 or 3, wherein the failure management device transmits a notification instructing dispatch of a maintenance staff to a maintenance staff terminal based on maintenance staff dispatch request information included in the received failure analysis information.

（形態５）前記障害管理装置は、
前記複数の監視対象装置の各々に対応した顧客端末のアドレスを記憶した顧客情報データベースを備え、
前記受信された障害解析情報に含まれる監視対象装置の情報を用いて前記顧客情報データベースを検索して前記監視対象装置に対応した顧客端末のアドレスを取得し、前記顧客端末のアドレスに前記監視対象装置の障害を通知する、形態１乃至４のいずれか一に記載の遠隔監視システム。(Mode 5) The failure management apparatus is:
A customer information database storing addresses of customer terminals corresponding to the plurality of devices to be monitored;
The customer information database is searched by using the information on the monitoring target device included in the received failure analysis information to obtain the address of the customer terminal corresponding to the monitoring target device, and the monitoring target is added to the address of the customer terminal. The remote monitoring system according to any one of Embodiments 1 to 4, which notifies a device failure.

（形態６）前記障害監視装置は、正系の障害監視装置及び副系の障害監視装置を含む２以上の障害監視装置で構成され、
前記障害解析装置は、前記正系の障害監視装置の異常を検出した場合に、前記正系の障害監視装置と前記副系の障害監視装置との間で、前記正系／副系の設定を切り替え、
前記障害解析装置は、前記正系の障害監視装置から送信された前記障害検出情報を使用して障害対処方法を決定し、前記副系の障害監視装置から送信された前記障害検出情報を使用しない、形態１乃至５のいずれか一に記載の遠隔監視システム。(Mode 6) The failure monitoring device is composed of two or more failure monitoring devices including a primary failure monitoring device and a secondary failure monitoring device,
When the failure analysis device detects an abnormality in the primary failure monitoring device, the failure analysis device sets the primary / secondary system between the primary failure monitoring device and the secondary failure monitoring device. switching,
The failure analysis device determines a failure handling method using the failure detection information transmitted from the primary failure monitoring device, and does not use the failure detection information transmitted from the secondary failure monitoring device. The remote monitoring system according to any one of Forms 1 to 5.

（形態７）前記複数の障害監視装置は、夫々定期的に稼動通知情報を前記障害解析装置に送信するように設定され、
前記障害解析装置は、前記２以上の障害監視装置の各々から稼動通知情報を受信できるか否かを判定し、前記稼動通知情報を受信できない障害監視装置が異常であると検出する、形態６に記載の遠隔監視システム。(Mode 7) Each of the plurality of failure monitoring devices is set to periodically transmit operation notification information to the failure analysis device,
In mode 6, the failure analysis device determines whether or not operation notification information can be received from each of the two or more failure monitoring devices, and detects that the failure monitoring device that cannot receive the operation notification information is abnormal. The remote monitoring system described.

（形態８）複数の監視対象装置にネットワークを介して障害検出コマンドを送信し、各々の前記監視対象装置で実行された前記障害検出コマンドの実行結果を受信して各々の前記監視対象装置の障害の有無を判定し、障害があると判定された前記監視対象装置に関して前記監視対象装置の情報及び前記障害検出コマンドの実行結果を含む障害検出情報を出力する障害監視装置と、
前記障害検出情報を受信し、前記受信された障害検出情報に含まれる障害検出コマンドの実行結果を解析して障害を復旧する障害対処方法を決定すると共に、前記障害対処方法の実行を管理する障害解析管理装置と、
を備え、
前記障害監視装置、及び前記障害解析管理装置が、異なる前記監視対象装置に対する処理を並行して実行可能に構成された、遠隔監視システム。(Mode 8) A failure detection command is transmitted to a plurality of monitoring target devices via a network, and an execution result of the failure detection command executed on each of the monitoring target devices is received to detect a failure of each of the monitoring target devices. A fault monitoring device that outputs fault detection information including information on the monitoring target device and an execution result of the fault detection command regarding the monitoring target device determined to have a fault,
A fault that receives the fault detection information, analyzes a result of executing a fault detection command included in the received fault detection information, determines a fault handling method for recovering the fault, and manages execution of the fault handling method; An analysis management device;
With
A remote monitoring system in which the failure monitoring device and the failure analysis management device are configured to be able to execute processes for different monitoring target devices in parallel.

（形態９）前記障害監視装置は閉域ＩＰ（ＩｎｔｅｒｎｅｔＰｒｏｔｏｃｏｌ）網を介して前記複数の監視対象装置からの障害検出コマンド実行結果を受信する、形態１乃至８のいずれか一に記載の遠隔監視システム。 (Mode 9) The remote monitoring system according to any one of modes 1 to 8, wherein the fault monitoring apparatus receives a fault detection command execution result from the plurality of monitoring target apparatuses via a closed IP (Internet Protocol) network. .

（形態１０）複数の監視対象装置にネットワークを介して障害検出コマンドを送信するステップと、
各々の前記監視対象装置で実行された前記障害検出コマンドの実行結果を受信して各々の前記監視対象装置の障害の有無を判定し、障害があると判定された前記監視対象装置に関して前記監視対象装置の情報及び前記障害検出コマンドの実行結果を含む障害検出情報を出力する判定ステップと、
前記障害検出情報を受信し、前記受信された障害検出情報に含まれる障害検出コマンドの実行結果を解析して障害を復旧する障害対処方法を決定し、前記監視対象装置の情報及び前記障害対処方法を含む障害解析情報を出力する解析ステップと、
前記障害解析情報を受信し、前記受信された障害解析情報に含まれる障害対処方法の実行を管理する管理ステップと、
を含み、
前記判定ステップ、前記解析ステップ、及び前記管理ステップの２以上が、並行して実行可能である遠隔監視方法。(Mode 10) Sending a failure detection command to a plurality of monitoring target devices via a network;
Receiving an execution result of the failure detection command executed in each of the monitoring target devices, determining whether or not each of the monitoring target devices has a failure, and regarding the monitoring target device determined to have a failure, the monitoring target A determination step of outputting failure detection information including device information and an execution result of the failure detection command;
The failure detection information is received, the execution result of the failure detection command included in the received failure detection information is analyzed, a failure handling method for recovering the failure is determined, and the information on the monitored device and the failure handling method are determined. An analysis step for outputting failure analysis information including
A management step for receiving the failure analysis information and managing execution of a failure handling method included in the received failure analysis information;
Including
A remote monitoring method in which two or more of the determination step, the analysis step, and the management step can be performed in parallel.

（形態１１）複数の監視対象装置の障害を検出するプログラムであって、
複数の監視対象装置にネットワークを介して障害検出コマンドを送信し、各々の前記監視対象装置で実行された前記障害検出コマンドの実行結果を受信して各々の前記監視対象装置の障害の有無を判定し、障害があると判定された前記監視対象装置に関して前記監視対象装置の情報及び前記障害検出コマンドの実行結果を含む障害検出情報を、前記障害検出情報を用いて障害を復旧する障害対処方法を決定する装置に出力する処理、をコンピュータに実行させるプログラム。(Mode 11) A program for detecting a failure of a plurality of monitoring target devices,
A failure detection command is transmitted to a plurality of monitoring target devices via a network, and an execution result of the failure detection command executed on each of the monitoring target devices is received to determine whether each of the monitoring target devices has a failure. And a failure handling method for recovering the failure using the failure detection information, the failure detection information including the monitoring target device information and the execution result of the failure detection command regarding the monitoring target device determined to have a failure. A program that causes a computer to execute processing to be output to a determining device.

本発明は、複数の監視対象装置（サーバ装置、ネットワーク機器、コンピュータ等の情報処理装置；空調システム等）の障害をネットワークを介して遠隔検出し、障害情報を解析して、被疑部品の配送や保守員の派遣などの対策を自動で行う遠隔監視システムに適用することができる。 The present invention remotely detects a failure of a plurality of monitoring target devices (information processing devices such as server devices, network devices, and computers; air conditioning systems, etc.) via a network, analyzes the failure information, It can be applied to a remote monitoring system that automatically performs measures such as dispatch of maintenance personnel.

なお、本発明の全開示（請求の範囲及び図面を含む）の枠内において、さらにその基本的技術思想に基づいて、実施形態の変更・調整が可能である。また、本発明の請求の範囲の枠内において種々の開示要素（各請求項の各要素、各実施形態の各要素、各図面の各要素等を含む）の多様な組み合わせないし選択が可能である。すなわち、本発明は、請求の範囲及び図面を含む全開示、技術的思想にしたがって当業者であればなし得るであろう各種変形、修正を含むことは勿論である。特に、本書に記載した数値範囲については、当該範囲内に含まれる任意の数値ないし小範囲が、別段の記載のない場合でも具体的に記載されているものと解釈されるべきである。 Note that, within the scope of the entire disclosure (including claims and drawings) of the present invention, the embodiments can be changed and adjusted based on the basic technical concept. Various combinations or selections of various disclosed elements (including each element of each claim, each element of each embodiment, each element of each drawing, etc.) are possible within the scope of the claims of the present invention. . That is, the present invention naturally includes various variations and modifications that could be made by those skilled in the art according to the entire disclosure including the claims and the drawings, and the technical idea. In particular, with respect to the numerical ranges described in this document, any numerical value or small range included in the range should be construed as being specifically described even if there is no specific description.

１、１１：遠隔監視システム
２：ネットワーク
３ａ〜ｂ：障害検出コマンド
４ａ〜ｂ：障害検出コマンドの実行結果
５ａ〜ｂ：障害検出情報
６ａ〜ｂ：障害解析情報
７ａ：障害対処方法
８ａ〜ｂ：稼動通知情報
９、９ａ〜ｂ：被疑部品情報
１０ａ〜ｂ：監視対象装置
１２：遠隔監視システム
１３：遠隔監視装置
１９、１９ａ〜ｂ：保守員派遣要請情報
２０ａ〜ｂ：顧客端末
２１：保守部品倉庫端末
２２ａ：保守員端末
５０：閉域ＩＰ網
６０：インターネット
１００：障害監視装置
１００ｍ：（正系の）障害監視装置
１００ｓ：（副系の）障害監視装置
１０１：システム障害検出部
１０２：障害検出情報作成部
１０３：稼動通知情報定期作成部
１０４：データ転送部
１０５：データ解析部
１０６：稼動通知情報監視部
１０７：障害監視装置系切り替え部
１０８：データ転送部
１０９：メール送信部
１１０：保守員出動指示部
１１１：保守部品手配部
２００：障害解析装置
２０１：障害検出対象装置データベース
２０２：障害検出項目データベース
２０３：データ転送先データベース
２０４：監視対象装置情報データベース
２０５：障害情報データベース
２０６：障害監視装置冗長構成情報データベース
２０７：障害管理データベース
２０８：顧客情報データベース
２０９：社員情報データベース
２１０：作業予定データベース
３００：障害管理装置
４００：障害解析管理装置1, 11: Remote monitoring system 2: Network 3a-b: Failure detection command 4a-b: Failure detection command execution result 5a-b: Failure detection information 6a-b: Failure analysis information 7a: Failure handling method 8a-b: Operation notification information 9, 9a-b: Suspected component information 10a-b: Monitoring target device 12: Remote monitoring system 13: Remote monitoring device 19, 19a-b: Maintenance staff dispatch request information 20a-b: Customer terminal 21: Maintenance component Warehouse terminal 22a: maintenance worker terminal 50: closed IP network 60: Internet 100: failure monitoring device 100m: (primary) failure monitoring device 100s: (secondary) failure monitoring device 101: system failure detection unit 102: failure detection Information creation unit 103: Operation notification information periodic creation unit 104: Data transfer unit 105: Data analysis unit 106: Operation notification information monitoring unit 107: Fault monitoring device system disconnection Replacement unit 108: Data transfer unit 109: Mail transmission unit 110: Maintenance staff dispatch instruction unit 111: Maintenance part arrangement unit 200: Failure analysis device 201: Failure detection target device database 202: Failure detection item database 203: Data transfer destination database 204 : Monitoring target device information database 205: Failure information database 206: Failure monitoring device redundant configuration information database 207: Failure management database 208: Customer information database 209: Employee information database 210: Work schedule database 300: Failure management device 400: Failure analysis management apparatus

Claims

A failure detection command is transmitted to a plurality of monitoring target devices via a network, and an execution result of the failure detection command executed on each of the monitoring target devices is received to determine whether each of the monitoring target devices has a failure. A fault monitoring device that outputs fault detection information including information on the monitored device and an execution result of the fault detection command regarding the monitored device determined to have a fault;
The failure detection information is received, the execution result of the failure detection command included in the received failure detection information is analyzed, a failure handling method for recovering the failure is determined, and the information on the monitored device and the failure handling method are determined. A failure analysis device that outputs failure analysis information including
A failure management device that receives the failure analysis information and manages execution of a failure handling method included in the received failure analysis information;
With
Two or more of the failure monitoring device, the failure analysis device, and the failure management device are configured to be able to execute processes for the different monitoring target devices in parallel,
The fault monitoring apparatus is composed of two or more fault monitoring apparatuses including a primary fault monitoring apparatus and a secondary fault monitoring apparatus, and the primary fault monitoring apparatus and the secondary fault monitoring apparatus are respectively described above. Output fault detection information,
When the failure analysis device detects an abnormality in the primary failure monitoring device, the failure analysis device sets the primary / secondary system between the primary failure monitoring device and the secondary failure monitoring device. and switching shall,
If the failure analysis device does not detect an abnormality of the primary failure monitoring device, the failure analysis device determines a failure handling method using the failure detection information transmitted from the primary failure monitoring device, and A remote monitoring system that does not use the failure detection information transmitted from a system failure monitoring device .

The failure analysis device
Provided with a failure information database in which failure handling methods corresponding to the execution results of a plurality of failure detection commands are predetermined,
2. The remote monitoring system according to claim 1, wherein a failure handling method for searching the failure information database to determine a failure is determined by using an execution result of a failure detection command included in the failure detection information received from the failure monitoring apparatus. .

The failure information database further defines suspected part information corresponding to each of the failure handling methods,
The failure analysis device searches the failure information database using the execution result of the failure detection command included in the failure detection information to obtain the suspected component information, adds the suspected component information to the failure analysis information,
The remote monitoring system according to claim 2, wherein the failure management apparatus sends an instruction to arrange a suspected component based on suspected component information included in the received failure analysis information.

The failure information database further defines maintenance staff dispatch request information corresponding to each of the failure handling methods,
The failure analysis device searches the failure information database using the execution result of the failure detection command included in the failure detection information to obtain maintenance staff dispatch request information, and adds the maintenance staff dispatch request information to the failure analysis information. Add
The remote monitoring system according to claim 2 or 3, wherein the failure management apparatus transmits a notification instructing dispatch of a maintenance staff to a maintenance staff terminal based on maintenance staff dispatch request information included in the received failure analysis information. .

The failure management device is:
A customer information database storing addresses of customer terminals corresponding to the plurality of devices to be monitored;
The customer information database is searched by using the information on the monitoring target device included in the received failure analysis information to obtain the address of the customer terminal corresponding to the monitoring target device, and the monitoring target is added to the address of the customer terminal. The remote monitoring system according to any one of claims 1 to 4, which notifies a device failure.

The plurality of failure monitoring devices are each set to periodically transmit operation notification information to the failure analysis device,
The failure analysis device determines whether or not operation notification information can be received from each of the two or more failure monitoring devices, and detects that the failure monitoring device that cannot receive the operation notification information is abnormal. The remote monitoring system according to any one of 1 to 5 .

The remote monitoring system according to any one of claims 1 to 6 , wherein the failure monitoring device receives a failure detection command execution result from the plurality of monitoring target devices via a closed IP (Internet Protocol) network.

Transmitting a failure detection command to a plurality of monitored devices via a network;
Receiving an execution result of the failure detection command executed in each of the monitoring target devices, determining whether or not each of the monitoring target devices has a failure, and regarding the monitoring target device determined to have a failure, the monitoring target A determination step of outputting failure detection information including device information and an execution result of the failure detection command;
The failure detection information is received, the execution result of the failure detection command included in the received failure detection information is analyzed, a failure handling method for recovering the failure is determined, and the information on the monitored device and the failure handling method are determined. An analysis step for outputting failure analysis information including
A management step for receiving the failure analysis information and managing execution of a failure handling method included in the received failure analysis information;
Including
2 or more devices executing the determining step comprising the apparatus and the sub system of the apparatus of the main system,
When an abnormality of the primary device is detected, switching between the primary / sub system settings between the primary device and the secondary device ;
If an abnormality of the primary device is not detected, a failure handling method is determined using the failure detection information transmitted from the primary device, and the failure detection transmitted from the secondary device is detected. Steps without information,
Further including
A remote monitoring method in which two or more of the determination step, the analysis step, and the management step can be performed in parallel.

Connected through a fault monitoring device and the network to monitor the monitoring target device, a program executed by a failure analysis equipment,
The failure detection information including the monitoring target device information and the failure detection command execution result for the monitoring target device determined to have a failure is received, and the failure detection command execution result included in the received failure detection information Analyzing the error and determining a failure handling method for recovering the failure, causing the failure analysis device to execute analysis processing for outputting the failure analysis information including the monitoring target device information and the failure handling method,
2 or more of the fault monitoring device including the fault monitoring device of the fault monitoring device and a sub system of the main system outputs the fault detection information,
Processing for switching the setting of the primary system / sub system between the primary system fault monitoring apparatus and the secondary system fault monitoring apparatus when an abnormality of the primary system fault monitoring apparatus is detected When,
If no abnormality is detected in the primary failure monitoring device, a failure handling method is determined using the failure detection information transmitted from the primary failure monitoring device, and the secondary failure monitoring device A program that causes the failure analysis apparatus to execute processing that does not use the transmitted failure detection information .