JP2014178797A

JP2014178797A - Monitoring control device and method, built-in control device, and computer program

Info

Publication number: JP2014178797A
Application number: JP2013051243A
Authority: JP
Inventors: Shuichi Terauchi; 秀一寺内
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2013-03-14
Filing date: 2013-03-14
Publication date: 2014-09-25
Anticipated expiration: 2033-03-14
Also published as: JP6123375B2

Abstract

PROBLEM TO BE SOLVED: To provide a monitoring control device and others which can correctly and quickly detect a failure of a device even if a failure occurs in a monitoring source device that monitors a failure of a monitoring target devices.SOLUTION: A monitoring control device 1 receives first determination information 101 via a communication unit 3, and stores the received first determination information 101 in a storage unit 4, then when it is verified that the storage unit 4 stores also second determination information, determines that a failure has occurred in a resource of a monitoring target. When it is verified that the storage unit 4 stores neither the first determination information 101 nor the second determination information, the monitoring control device 1 determines that a failure has occurred in a resource of a monitoring source according to a value of a counter.

Description

本発明は、電子機器等に搭載された各種ハードウェアデバイスを監視する技術分野に関する。 The present invention relates to a technical field of monitoring various hardware devices mounted on an electronic device or the like.

組み込み制御装置やサーバ装置、パーソナルコンピュータ等の電子機器は、常に安定して稼動することが求められている。特に組み込み制御装置やサーバ装置においては、２４時間、３６５日安定して稼動することを要求される。 Electronic devices such as embedded control devices, server devices, and personal computers are required to always operate stably. In particular, embedded control devices and server devices are required to operate stably for 24 hours and 365 days.

また、組み込み制御装置やサーバ装置は、安定して稼動することだけでなく、障害が発生した際に、障害から迅速に復旧することを要求される。その理由は、障害が発生した際に、例えば、メーカは、サーバ装置の障害発生から復旧までに要する時間が短ければ短いほど、データ等の損失を最小限に留めることができるからである。 Moreover, the embedded control device and the server device are required not only to operate stably but also to quickly recover from the failure when a failure occurs. The reason is that, when a failure occurs, for example, the manufacturer can minimize the loss of data or the like as the time required from the failure occurrence to the recovery of the server device is shorter.

このような背景から、メーカは、常に電子機器を安定して稼動すると共に、万一、電子機器に発生した障害に対する被害を極小化したいという要求がある。このため、電子機器の安定稼動と効率的なシステム運用するための技術は、多岐にわたり存在する。 Against this background, manufacturers are constantly required to operate electronic devices stably and to minimize damage to failures that occur in electronic devices. For this reason, there are various technologies for stable operation of electronic devices and efficient system operation.

例えば、特許文献１は、オペレーティングシステム（Ｏｐｅｒａｔｉｎｇ＿Ｓｙｓｔｅｍ：以降、「ＯＳ」と称する）動作状態確認システム、確認対象装置、ＯＳ動作状態確認装置、ＯＳ動作状態確認方法およびプログラムに関する技術を開示する。 For example, Patent Document 1 discloses a technique related to an operating system (Operating_System: hereinafter referred to as “OS”) operation state confirmation system, a confirmation target device, an OS operation state confirmation device, an OS operation state confirmation method, and a program.

特許文献１に開示されたＯＳ動作状態確認システムは、監視の対象である装置（以降、「監視対象装置」と称する）と、監視対象装置を監視するＯＳ動作状態確認装置とが通信ネットワークを介して相互に通信可能に接続する。 In the OS operation state confirmation system disclosed in Patent Document 1, a device to be monitored (hereinafter referred to as “monitoring target device”) and an OS operation state confirmation device that monitors the monitoring target device are connected via a communication network. To communicate with each other.

ＯＳ動作状態確認装置は、監視対象装置と定期的に通信すると共に、その通信が途切れた際に、監視対象装置に不具合が生じたと判別する。 The OS operation state confirmation device periodically communicates with the monitoring target device and determines that a failure has occurred in the monitoring target device when the communication is interrupted.

監視対象装置に不具合が生じたと判別した場合に、ＯＳ動作状態確認装置は、監視対象装置にＩＰ（Ｉｎｔｅｒｎｅｔ＿Ｐｒｏｔｏｃｏｌ）パケットを発行するピング（Ｐｉｎｇ）コマンドを実行する。 When it is determined that a failure has occurred in the monitoring target device, the OS operation state confirmation device executes a ping command that issues an IP (Internet_Protocol) packet to the monitoring target device.

さらに、ＯＳ動作状態確認装置は、監視対象装置からピングに対する返信を受け付けた場合に、監視対象装置のＯＳが正常でありながら過負荷等の何らかの状態にあると判別する。一方で、ＯＳ動作状態確認装置は、監視対象装置からピングに対する返信を受け付けていない場合に、監視対象装置が故障したと判別する。 Further, when the OS operation status confirmation device receives a reply to the ping from the monitoring target device, it determines that the OS of the monitoring target device is in a certain state such as an overload while being normal. On the other hand, the OS operation state confirmation device determines that the monitoring target device has failed when a reply to the ping is not received from the monitoring target device.

特開２０１２−０３８２５７号公報JP 2012-038257 A

ところで、一般的に知られた、例えば、組み込み制御装置の故障を検出する手法としては、係る制御装置が正常に動作していることを示すハートビート情報が更新されたか否かを判別することによって、その制御装置の故障を検出する方法が存在する。 By the way, generally known, for example, as a method for detecting a failure of an embedded control device, by determining whether or not heartbeat information indicating that the control device is operating normally has been updated. There is a method for detecting a failure of the control device.

より具体的に、係る故障を検出する手法は、相互に通信可能に接続された２つの制御装置間においてハートビート情報を送受信すると共に、そのハートビート情報が更新されたか否かを判別する。 More specifically, in the method of detecting such a failure, heartbeat information is transmitted and received between two control devices that are communicably connected to each other, and whether or not the heartbeat information has been updated is determined.

さらに、係る故障を検出する手法は、ハートビート情報が更新されていないと判別した場合に、組み込み制御装置が故障したと検出する。 Furthermore, the method for detecting such a failure detects that the embedded control device has failed when it is determined that the heartbeat information has not been updated.

しかしながら、例えば、監視元となる（つまり、監視する側の）組み込み制御装置の通信部が故障している場合に、監視元の組み込み制御装置は、ハートビート情報を送受信することができない。そのため、監視元の組み込み制御装置は、監視対象である（つまり、監視される側の）組み込み制御装置に障害が発生したと判別する。 However, for example, when the communication unit of the embedded control device serving as the monitoring source (that is, the monitoring side) has failed, the monitoring source embedded control device cannot transmit or receive heartbeat information. Therefore, the monitoring source embedded control apparatus determines that a failure has occurred in the embedded control apparatus that is the monitoring target (that is, the monitored side).

即ち、係る手法では、監視対象である組み込み制御装置が故障したと誤検出する場合がある。そのため、正常に動作する監視対象の組み込み制御装置は、システムから切り離され、その一方で、故障した監視元の組み込み制御装置は、自装置が組み込まれているシステムにおいて継続して動作することとなる。 That is, according to such a method, there is a case in which it is erroneously detected that the embedded control device that is the monitoring target has failed. Therefore, the monitoring target embedded control device that operates normally is disconnected from the system, while the failed monitoring source embedded control device continues to operate in the system in which the device itself is embedded. .

また、上述した一般的に知られた係る手法と同様に、特許文献１に開示されたＯＳ動作状態確認装置は、監視元であるＯＳ動作状態確認装置の受信部が故障している場合に、監視対象装置からピングに対する返信を受け付けることができない。そのため、ＯＳ動作状態確認装置は、監視対象装置が故障したと判別する。 Similarly to the generally known method described above, the OS operation state confirmation device disclosed in Patent Document 1 is used when the reception unit of the OS operation state confirmation device that is the monitoring source is out of order. A reply to the ping cannot be received from the monitoring target device. For this reason, the OS operation state confirmation device determines that the monitoring target device has failed.

即ち、特許文献１では、監視元のＯＳ動作状態確認装置が故障しているか否かを判別することについて考慮されておらず、監視対象装置が故障していない場合であっても、その監視対象装置が故障していると検出する（つまり、誤検出する）場合がある。 That is, in Patent Document 1, it is not considered to determine whether or not the OS operating state confirmation device as the monitoring source is out of order, and even if the monitoring target device is not out of order, the monitoring target There is a case where it is detected that the device is broken (that is, erroneously detected).

本発明の主たる目的は、装置間において装置の故障を監視する際に、監視元の装置が故障した場合であっても、正確且つ速やかに装置の故障を検出することが可能な監視制御装置等を提供することにある。 The main object of the present invention is to monitor a control device that can detect a device failure accurately and promptly even when the device that is the monitoring source fails when monitoring device failures between devices. Is to provide.

上記の課題を達成すべく、本発明に係る監視制御装置は、以下の構成を備えることを特徴とする。 In order to achieve the above object, a monitoring control apparatus according to the present invention has the following configuration.

即ち、本発明に係る監視制御装置は、
通信部を介して監視対象のリソースが正常に動作していることを示す第１の判別情報を受信すると共に、該第１の判別情報を記憶部に格納し、その記憶部に前記第１の判別情報、または該第１の判別情報と異なる第２の判別情報の有無を判別し、
前記第２の判別情報があると判別した場合に、前記監視対象のリソースに障害が発生したと判別し、
前記第１及び前記第２の判別情報が無い場合に、カウンタの値に応じて、監視元のリソースに障害が発生したと判別することを特徴とする。 That is, the monitoring control device according to the present invention is
The first determination information indicating that the monitored resource is operating normally is received via the communication unit, and the first determination information is stored in the storage unit, and the first determination information is stored in the storage unit. Determining whether there is discrimination information or second discrimination information different from the first discrimination information;
When it is determined that there is the second determination information, it is determined that a failure has occurred in the monitored resource,
When there is no first and second determination information, it is determined that a failure has occurred in a monitoring source resource according to a counter value.

或いは、上記に示す監視制御装置を含む組み込み制御装置によっても、同目的を達成される。 Alternatively, the same object can be achieved by an embedded control device including the monitoring control device described above.

また、同目的を達成すべく、本発明に係る監視制御方法は、以下の構成を備えることを特徴とする。 In order to achieve the object, a monitoring control method according to the present invention is characterized by having the following configuration.

即ち、本発明に係る監視制御方法は、
通信部を介して監視対象のリソースが正常に動作していることを示す第１の判別情報を受信すると共に、該第１の判別情報を記憶部に格納し、その記憶部に前記第１の判別情報、または該第１の判別情報と異なる第２の判別情報の有無を判別し、
前記第２の判別情報があると判別した場合に、前記監視対象のリソースに障害が発生したと判別し、
前記第１及び前記第２の判別情報が無い場合に、カウンタの値に応じて、監視元のリソースに障害が発生したと判別することを特徴とする。 That is, the supervisory control method according to the present invention is:
The first determination information indicating that the monitored resource is operating normally is received via the communication unit, and the first determination information is stored in the storage unit, and the first determination information is stored in the storage unit. Determining whether there is discrimination information or second discrimination information different from the first discrimination information;
When it is determined that there is the second determination information, it is determined that a failure has occurred in the monitored resource,
When there is no first and second determination information, it is determined that a failure has occurred in a monitoring source resource according to a counter value.

尚、同目的は、上記の各構成を有する監視制御装置及び方法を、コンピュータによって実現するコンピュータ・プログラム、及びそのコンピュータ・プログラムが格納されている、コンピュータ読み取り可能な記憶媒体によっても達成される。 This object is also achieved by a computer program that implements the monitoring and control apparatus and method having the above-described configurations by a computer, and a computer-readable storage medium that stores the computer program.

本発明によれば、装置間において装置の故障を監視する際に、監視元の装置が故障した場合であっても、正確且つ速やかに装置の故障を検出することが可能な監視制御装等を提供することができる。 According to the present invention, when monitoring a device failure between devices, even if the monitoring source device fails, a monitoring control device or the like that can detect the device failure accurately and promptly is provided. Can be provided.

本発明の第１の実施形態における監視制御装置の構成を示すブロック図である。It is a block diagram which shows the structure of the monitoring control apparatus in the 1st Embodiment of this invention. 本発明の第２の実施形態における監視制御装置を含む組み込み制御装置の構成を示すブロック図である。It is a block diagram which shows the structure of the embedded control apparatus containing the monitoring control apparatus in the 2nd Embodiment of this invention. 本発明の第２の実施形態における監視制御装置が行う「第１の時間周期」毎に第１の判別情報を送信する動作を示すフローチャートである。It is a flowchart which shows the operation | movement which transmits 1st discrimination | determination information for every "1st time period" which the monitoring control apparatus in the 2nd Embodiment of this invention performs. 本発明の第２の実施形態における監視制御装置が行う第１の判別情報を受信する動作を示すフローチャートである。It is a flowchart which shows the operation | movement which receives the 1st discrimination | determination information which the monitoring control apparatus in the 2nd Embodiment of this invention performs. 本発明の第２の実施形態における監視制御装置が行う「第２の時間周期」毎に受信した第１の判別情報、または第２の判別情報の有無を判別する動作を示すフローチャートである。It is a flowchart which shows the operation | movement which discriminate | determines the presence or absence of the 1st discrimination | determination information received for every "2nd time period" performed by the monitoring control apparatus in the 2nd Embodiment of this invention, or the 2nd discrimination | determination information. 本発明の第２の実施形態における監視制御装置が行う「第２の時間周期」毎に受信した第１の判別情報、または第２の判別情報の有無を判別する動作を示すフローチャートである。It is a flowchart which shows the operation | movement which discriminate | determines the presence or absence of the 1st discrimination | determination information received for every "2nd time period" performed by the monitoring control apparatus in the 2nd Embodiment of this invention, or the 2nd discrimination | determination information. 本発明の第３の実施形態における監視制御装置の構成を示すブロック図である。It is a block diagram which shows the structure of the monitoring control apparatus in the 3rd Embodiment of this invention. 本発明に係る各実施形態を実現可能な情報処理装置のハードウェア構成を例示的に説明するブロック図である。It is a block diagram explaining illustratively the hardware constitutions of the information processor which can realize each embodiment concerning the present invention.

以下、本発明の実施形態について図面を参照して詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

＜第１の実施形態＞
図１は、本発明の第１の実施形態における監視制御装置１の構成を示すブロック図である。図１において、監視制御装置１は、監視部２を有する。 <First Embodiment>
FIG. 1 is a block diagram showing the configuration of the monitoring control device 1 according to the first embodiment of the present invention. In FIG. 1, the monitoring control device 1 has a monitoring unit 2.

より具体的に、監視部２は、通信部３を介して第１の判別情報１０１を受信する。さらに、監視部２は、第１の判別情報１０１を受信するのに応じて、受信した第１の判別情報１０１を記憶部４に格納する。 More specifically, the monitoring unit 2 receives the first determination information 101 via the communication unit 3. Furthermore, the monitoring unit 2 stores the received first determination information 101 in the storage unit 4 in response to receiving the first determination information 101.

ここで、第１の判別情報１０１とは、監視制御装置１によって監視されるリソースが正常に稼動（動作）していることを示す情報を含む。また、第１の判別情報１０１は、所定の時間周期（以下、説明の便宜上、「第１の時間周期」と称する）毎に送信されることとする。 Here, the first determination information 101 includes information indicating that the resource monitored by the monitoring control device 1 is operating (operating) normally. The first discrimination information 101 is transmitted every predetermined time period (hereinafter referred to as “first time period” for convenience of explanation).

また、リソースとは、例えば、組み込み制御装置、サーバ装置及びパーソナルコンピュータ等の電子機器や電子機器に搭載されたハードディスクドライブ（Ｈａｒｄ＿ｄｉｓｋ＿ｄｒｉｖｅ：以降、「ＨＤＤ」と称する）、ネットワークカード（Ｎｅｔｗｏｒｋ＿Ｉｎｔｅｒｆａｃｅ＿Ｃａｒｄ：以降、「ＮＩＣ」と称する）、アプリケーション等である。 The resource includes, for example, an electronic device such as an embedded control device, a server device, and a personal computer, a hard disk drive (Hard_disk_drive: hereinafter referred to as “HDD”), a network card (Network_Interface_Card: hereinafter, “ NIC ”), applications, and the like.

次に、監視部２は、送信された第１の判別情報１０１を受信した時、記憶部４に第１の判別情報１０１を格納する。さらに、監視部２は、記憶部４に第１の判別情報１０１、または後述する第２の判別情報の有無を判別（判断）する。即ち、監視部２は、所定の時間周期（以下、説明の便宜上、「第２の時間周期」と称する）毎に第１の判別情報１０１の有無を判別する、または第２の判別情報の有無を判別する。 Next, when the monitoring unit 2 receives the transmitted first determination information 101, the monitoring unit 2 stores the first determination information 101 in the storage unit 4. Furthermore, the monitoring unit 2 determines (determines) the presence or absence of the first determination information 101 or second determination information described later in the storage unit 4. That is, the monitoring unit 2 determines the presence / absence of the first determination information 101 or the presence / absence of the second determination information every predetermined time period (hereinafter, referred to as “second time period” for convenience of explanation). Is determined.

ここで、第２の時間周期とは、第１の時間周期よりも長い時間周期である。より具体的に、第２の時間周期は、第１の時間周期毎に格納される第１の判別情報１０１の有無を判別可能な時間周期である。 Here, the second time period is a time period longer than the first time period. More specifically, the second time period is a time period in which the presence or absence of the first determination information 101 stored for each first time period can be determined.

監視部２は、第１の判別情報１０１、または第２の判別情報の有無を判別した結果、記憶部４に第１の判別情報１０１があると判別した場合に、カウンタの値をクリア（リセット、つまり、値を「０」にセット）すると共に、記憶部４に格納された第１の判別情報１０１を消去する。 When the monitoring unit 2 determines that the first determination information 101 or the second determination information is present, and determines that the first determination information 101 exists in the storage unit 4, the monitoring unit 2 clears (resets) the counter value. That is, the value is set to “0”) and the first discrimination information 101 stored in the storage unit 4 is erased.

ここで、カウンタは、例えば、監視部２が含む構成を採用してもよいし、記憶部４が含む構成を採用してもよい。また、監視部２がカウンタの値をクリア、インクリメント及びデクリメントする技術自体は、現在では一般的な技術を採用することができるので、本実施形態における詳細な説明は省略する（以下、各実施形態においても同様）。 Here, for example, the configuration included in the monitoring unit 2 or the configuration included in the storage unit 4 may be employed as the counter. Further, as the technique itself for the monitoring unit 2 to clear, increment, and decrement the counter value, a general technique can be adopted at present, and a detailed description in this embodiment will be omitted (hereinafter, each embodiment). The same applies to the above).

監視部２は、第１の判別情報１０１、または第２の判別情報の有無を判別した結果、記憶部４に第２の判別情報があると判別した場合に、監視制御装置１が監視するリソースに障害（故障）が発生したと判別する。即ち、監視部２は、監視制御装置１が監視するリソースの障害（故障）を検出する。 When the monitoring unit 2 determines that there is the second determination information in the storage unit 4 as a result of determining the presence or absence of the first determination information 101 or the second determination information, the resource monitored by the monitoring control device 1 It is determined that a failure has occurred. That is, the monitoring unit 2 detects a failure (failure) of the resource monitored by the monitoring control device 1.

一方で、監視部２は、第１の判別情報１０１、または第２の判別情報の有無を判別する。監視部２は、記憶部４に第１の判別情報１０１及び第２の判別情報が無いと判別した場合に、カウンタの値を参照すると共に、その値を「１」インクリメント（増加）する。 On the other hand, the monitoring unit 2 determines the presence / absence of the first determination information 101 or the second determination information. When the monitoring unit 2 determines that the first determination information 101 and the second determination information are not present in the storage unit 4, the monitoring unit 2 refers to the value of the counter and increments (increases) the value by “1”.

監視部２は、インクリメントしたカウンタの値が「１」（第１の値）の場合に、第２の判別情報を記憶部４に格納する。 The monitoring unit 2 stores the second determination information in the storage unit 4 when the incremented counter value is “1” (first value).

より具体的に、カウンタの値が第１の値の場合に、監視部２は、例えば、監視制御装置１を含む組み込み制御装置（自装置、つまり、リソース）が正常に動作しているか否かを判別する。 More specifically, when the counter value is the first value, the monitoring unit 2 determines whether, for example, the embedded control device (the own device, that is, the resource) including the monitoring control device 1 is operating normally. Is determined.

監視部２は、判別した結果、係る自装置が正常に動作している場合に、第２の判別情報を生成すると共に、生成した第２の判別情報を記憶部４に格納する。 As a result of the determination, the monitoring unit 2 generates the second determination information and stores the generated second determination information in the storage unit 4 when the device itself operates normally.

尚、以下の説明において、説明の便宜上、監視対象の装置を監視する側を「監視元」と称する（以下、各実施形態においても同様）。 In the following description, for convenience of explanation, a side that monitors a device to be monitored is referred to as a “monitoring source” (hereinafter, the same applies to each embodiment).

監視部２は、インクリメントしたカウンタの値が「２」（第２の値）の場合に、例えば、監視制御装置１を含む組み込み制御装置（自装置）に障害（故障）が発生したと判別する。即ち、監視部２は、自装置（つまり、監視元である装置）の障害を検出する。 When the incremented counter value is “2” (second value), for example, the monitoring unit 2 determines that a failure (failure) has occurred in the embedded control device (self device) including the monitoring control device 1. . That is, the monitoring unit 2 detects a failure of the own device (that is, a device that is a monitoring source).

尚、上述した本実施形態では、説明の便宜上、一例として、監視部２は、カウンタの値を「１」インクリメントする構成を例に説明した。しかしながら本発明に係る実施形態は、係る構成に限定されない。 In the present embodiment described above, for the sake of convenience of explanation, as an example, the monitoring unit 2 has been described with a configuration in which the counter value is incremented by “1”. However, the embodiment according to the present invention is not limited to such a configuration.

監視部２は、例えば、カウンタの値を「２」インクリメントしてもよいし、「３」インクリメントしてもよい。即ち、第２の値と第１の値とは、異なる値を示す構成を採用することができる。例えば、第２の値は、第１の値より大きい値を示す構成を採用してもよい。 For example, the monitoring unit 2 may increment the counter value by “2” or “3”. That is, a configuration in which the second value and the first value are different from each other can be employed. For example, a configuration in which the second value is larger than the first value may be adopted.

監視部２は、監視対象の装置、または監視元の装置の障害を検出した際に、その障害を通知する。 When the monitoring unit 2 detects a failure of a monitoring target device or a monitoring source device, the monitoring unit 2 notifies the failure.

例えば、組み込み制御装置が備える不図示のディスプレイ等のユーザインタフェースに障害が発生したことを表示する構成を採用してもよいし、複数の電子機器によって冗長化している場合に、正常に動作する電子機器に切り替える構成を採用してもよい。但し、本実施形態を例に説明する本発明は、これらの構成に限定されない（以下、各実施形態においても同様）。 For example, a configuration that displays that a failure has occurred in a user interface such as a display (not shown) provided in the embedded control device may be adopted, or an electronic device that operates normally when redundant is provided by a plurality of electronic devices. You may employ | adopt the structure switched to an apparatus. However, the present invention described using this embodiment as an example is not limited to these configurations (hereinafter, the same applies to each embodiment).

通信部３は、伝送媒体を介して電子機器と通信可能に接続する。また、通信部３は、各種情報を送受信する。即ち、通信部３は、伝送媒体を介して第１の判別情報１０１を送受信する。 The communication unit 3 is communicably connected to an electronic device via a transmission medium. Moreover, the communication part 3 transmits / receives various information. That is, the communication unit 3 transmits / receives the first discrimination information 101 via the transmission medium.

より具体的に、一例として、通信部３は、一般的に知られたＲＳ−２３２Ｃ等のデータバスやＬＡＮ（Ｌｏｃａｌ＿Ａｒｅａ＿Ｎｅｔｗｏｒｋ：以降、「ＬＡＮ」と称する）等を採用することができる。但し、本実施形態を例に説明する本発明は、これらの構成に限定されない（以下の実施形態においても同様）。 More specifically, as an example, the communication unit 3 may employ a generally known data bus such as RS-232C, LAN (Local_Area_Network: hereinafter referred to as “LAN”), or the like. However, the present invention described using this embodiment as an example is not limited to these configurations (the same applies to the following embodiments).

記憶部４は、読み書き可能な一時記憶媒体である。即ち、記憶部４は、揮発性の記憶デバイスである。 The storage unit 4 is a readable / writable temporary storage medium. That is, the storage unit 4 is a volatile storage device.

より具体的に、一例として、記憶部４は、読み込み制御装置やサーバ装置等の電子機器に搭載された主記憶装置等の揮発性記憶素子を採用することができる。但し、本発明は、これらに限定されない（以下の実施形態においても同様）。 More specifically, as an example, the storage unit 4 may employ a volatile storage element such as a main storage device mounted on an electronic device such as a read control device or a server device. However, the present invention is not limited to these (the same applies to the following embodiments).

尚、上述した本実施形態では、説明の便宜上、一例として、記憶部４は、揮発性の記憶デバイスを採用する構成を例に説明した。しかしながら本発明に係る実施形態は、係る構成に限定されず、記憶部４は、読み書き可能な不揮発性記憶デバイスを採用してもよい。 In the above-described embodiment, for the sake of convenience of explanation, the storage unit 4 has been described as an example of a configuration that employs a volatile storage device. However, the embodiment according to the present invention is not limited to such a configuration, and the storage unit 4 may adopt a readable / writable nonvolatile storage device.

その場合に、より具体的に、一例として、記憶部４は、電子機器に搭載されたＨＤＤ等の不揮発性記憶素子を採用することができる。 In that case, more specifically, as an example, the storage unit 4 may employ a nonvolatile storage element such as an HDD mounted on an electronic device.

このように本実施の形態に係る監視制御装置１によれば、装置間において装置の故障を監視する際に、監視元の装置が故障した場合であっても、正確且つ速やかに装置の故障を検出することができる。その理由は、以下に述べる通りである。 As described above, according to the monitoring control device 1 according to the present embodiment, when monitoring a device failure between devices, even if the monitoring source device fails, the device failure is accurately and promptly detected. Can be detected. The reason is as described below.

即ち、監視制御装置１の監視部２は、監視対象の装置が故障したと判別した場合に、監視元のリソース（自装置）が正常に動作しているか否かを判別する。監視部２は、自装置が正常に動作していると判別した場合に、監視対象のリソースが故障したと判別する。一方で、監視部２は、自装置が正常に動作していないと判別した場合に、監視対象のリソースが故障したのではなく自装置が故障したと判別するからである。即ち、監視部２は、自装置が故障したことによって、監視対象のリソースが故障したと判別すること（つまり、誤検出）を阻止することができる。よって、例えば、監視部２は、故障して自装置が誤った判断によって、正常に動作している監視対象のリソースを停止、あるいはリセットするよう誤った指示を行うことによる二次的な不具合をも阻止することができる。 That is, when the monitoring unit 2 of the monitoring control device 1 determines that the monitoring target device has failed, the monitoring unit 2 determines whether or not the monitoring source resource (own device) is operating normally. If the monitoring unit 2 determines that the device itself is operating normally, the monitoring unit 2 determines that the resource to be monitored has failed. On the other hand, when the monitoring unit 2 determines that the own device is not operating normally, it determines that the own device has failed, not the resource to be monitored has failed. That is, the monitoring unit 2 can prevent the monitoring target resource from being determined to have failed due to the failure of the own device (that is, erroneous detection). Thus, for example, the monitoring unit 2 may detect a secondary problem caused by giving an erroneous instruction to stop or reset a monitoring target resource that is operating normally, based on an erroneous determination that the device itself has failed. Can also be prevented.

＜第２の実施形態＞
次に、上述した本発明の第１の実施形態に係る監視制御装置１を基本とする第２の実施形態について説明する。以下の説明においては、本実施形態に係る特徴的な部分を中心に説明する。その際、上述した各実施形態と同様な構成については、同一の参照番号を付すことにより、重複する説明は省略する。 <Second Embodiment>
Next, a second embodiment based on the monitoring control device 1 according to the first embodiment of the present invention described above will be described. In the following description, the characteristic part according to the present embodiment will be mainly described. At this time, the same reference numerals are assigned to the same configurations as those in the above-described embodiments, and duplicate descriptions are omitted.

図２は、本発明の第２の実施形態における監視制御装置１１（第１の監視制御装置１１ａ、第２の監視制御装置１１ｂ）を含む組み込み制御装置１０（第１の組み込み制御装置１０ａ、第２の組み込み制御装置１０ｂ）の構成を示すブロック図である。 FIG. 2 shows an embedded controller 10 (first embedded controller 10a, first monitor controller 11a, second monitor controller 11b) including the monitor controller 11 (first monitor controller 11a, second monitor controller 11b) according to the second embodiment of the present invention. It is a block diagram which shows the structure of 2 built-in control apparatuses 10b).

図２において、第１の組み込み制御装置１０ａと第２の組み込み制御装置１０ｂとは、第１の通信部３ａと第２の通信部３ｂとを介して相互に通信可能に接続する。 In FIG. 2, a first embedded control device 10a and a second embedded control device 10b are connected to each other via a first communication unit 3a and a second communication unit 3b so that they can communicate with each other.

本実施形態では、第１の実施形態において説明した監視制御装置１に、さらに、生成部１２（第１の生成部１２ａ、第２の生成部１２ｂ）を適用した場合について説明する。 In the present embodiment, a case where the generation unit 12 (the first generation unit 12a and the second generation unit 12b) is further applied to the monitoring control apparatus 1 described in the first embodiment will be described.

尚、以下の説明において、説明の便宜上、第１の組み込み制御装置１０ａと第２の組み込み制御装置１０ｂとを総称して組み込み制御装置１０と称する。第１の監視制御装置１１ａと第２の監視制御装置１１ｂとを総称して監視制御装置１１と称する。 In the following description, for convenience of explanation, the first embedded control device 10a and the second embedded control device 10b are collectively referred to as the embedded control device 10. The first monitoring control device 11a and the second monitoring control device 11b are collectively referred to as the monitoring control device 11.

さらに、第１の監視部２ａと第２の監視部２ｂとを総称して監視部２と称する。第１の通信部３ａと第２の通信部３ｂとを総称して通信部３と称する。第１の記憶部４ａと第２の記憶部４ｂとを総称して記憶部４と称する。第１の生成部１２ａと第２の生成部１２ｂとを総称して生成部１２と称する。 Further, the first monitoring unit 2a and the second monitoring unit 2b are collectively referred to as the monitoring unit 2. The first communication unit 3a and the second communication unit 3b are collectively referred to as the communication unit 3. The first storage unit 4a and the second storage unit 4b are collectively referred to as a storage unit 4. The first generation unit 12a and the second generation unit 12b are collectively referred to as a generation unit 12.

生成部１２は、監視対象の（つまり、監視される）リソースが正常に動作しているか否かを判別すると共に、判別した結果、係るリソースが正常に動作していると判別した場合に、第１の判別情報１０１を生成する。 The generation unit 12 determines whether or not the resource to be monitored (that is, monitored) is operating normally, and when the determination unit determines that the resource is operating normally as a result of the determination, 1 discrimination information 101 is generated.

尚、生成部１２が監視対象のリソースが正常に動作しているか否かを判別する技術自体は、現在では一般的な技術を採用することができるので、本実施形態における詳細な説明は省略する（以下、各実施形態においても同様）。 Note that the technology itself by which the generation unit 12 determines whether or not the resource to be monitored is operating normally can employ a general technology, and thus a detailed description in this embodiment is omitted. (Hereinafter, the same applies to each embodiment).

さらに、生成部１２は、通信部３を介して第１の判別情報１０１を第１の時間周期毎に送信する。 Furthermore, the generation unit 12 transmits the first determination information 101 via the communication unit 3 for each first time period.

より具体的に、例えば、第１の監視制御装置１１ａから第２の監視制御装置１１ｂに第１の判別情報１０１を送信する場合に、第１の生成部１２ａは、第１の通信部３ａを介して第２の監視制御装置１１ｂに送信する。さらに、第２の監視制御装置１１ｂは、第２の通信部３ｂを介して第１の判別情報１０１を受信する。 More specifically, for example, when the first determination information 101 is transmitted from the first monitoring control device 11a to the second monitoring control device 11b, the first generation unit 12a sets the first communication unit 3a to To the second monitoring control device 11b. Furthermore, the second monitoring control device 11b receives the first determination information 101 via the second communication unit 3b.

次に、本発明の第２の実施形態に係るより具体的な監視制御装置１１の動作について説明する。 Next, a more specific operation of the monitoring control device 11 according to the second embodiment of the present invention will be described.

図３は、本発明の第２の実施形態における監視制御装置１１が行う動作を示すフローチャートである。図３に示す「第１の時間周期」毎に第１の判別情報１０１を送信するフローチャートに沿って監視制御装置１１の動作手順を説明する。 FIG. 3 is a flowchart showing an operation performed by the monitoring control device 11 according to the second embodiment of the present invention. The operation procedure of the monitoring control apparatus 11 will be described with reference to a flowchart for transmitting the first discrimination information 101 for each “first time period” shown in FIG.

ここでは、以下の説明において、説明の便宜上、一例として、第２の監視制御装置１１ｂを含む第２の組み込み制御装置１０ｂが第１の組み込み制御装置１０ａを監視する際の動作を説明する。 Here, in the following description, for convenience of explanation, an operation when the second embedded control device 10b including the second monitoring control device 11b monitors the first embedded control device 10a will be described as an example.

また、第１の監視制御装置１１ａと第２の監視制御装置１１ｂとは、動作に差異が無いこととする。 Further, it is assumed that there is no difference in operation between the first monitoring control device 11a and the second monitoring control device 11b.

さらに、第１の判別情報１０１と第２の判別情報とは、一般的に知られたハートビート情報である。 Further, the first discrimination information 101 and the second discrimination information are generally known heartbeat information.

説明の便宜上、上述した構成を例に説明するが、これに限定されない（以下の実施形態においても同様）。 For convenience of explanation, the configuration described above will be described as an example, but the present invention is not limited to this (the same applies to the following embodiments).

ステップＳ１：
第１の生成部１２ａは、監視対象のリソースが正常に動作しているか否かを判別する。 Step S1:
The first generation unit 12a determines whether or not the monitoring target resource is operating normally.

ステップＳ２において「ＹＥＳ」：
第１の生成部１２ａは、監視対象のリソースが正常に動作しているか否かを判別した結果、監視対象のリソースが正常に動作していると判別した場合に、処理をステップＳ３に進める。 “YES” in step S2:
As a result of determining whether the monitoring target resource is operating normally as a result of determining whether the monitoring target resource is operating normally, the first generation unit 12a advances the processing to step S3.

ステップＳ２において「ＮＯ」：
第１の生成部１２ａは、監視対象のリソースが正常に動作しているか否かを判別した結果、監視対象のリソースが正常に動作していないと判別した場合に、処理を終了する。 “NO” in step S2:
As a result of determining whether or not the monitoring target resource is operating normally, the first generation unit 12a ends the process when determining that the monitoring target resource is not operating normally.

ステップＳ３：
第１の生成部１２ａは、第１の判別情報１０１を生成すると共に、第１の通信部３ａを介して第２の組み込み制御装置１０ｂに生成した第１の判別情報１０１を送信する。 Step S3:
The first generation unit 12a generates the first determination information 101 and transmits the generated first determination information 101 to the second embedded control device 10b via the first communication unit 3a.

ステップＳ４：
第１の生成部１２ａは、第１の時間周期の時間が経過するまで待つ（つまり、Ｗａｉｔする）。即ち、第１の生成部１２ａは、一定時間置きに第１の判別情報１０１を第２の監視制御装置１１ｂに送れるように待ち合わせる。 Step S4:
The first generation unit 12a waits (that is, waits) until the time of the first time period elapses. That is, the first generation unit 12a waits so that the first determination information 101 can be sent to the second monitoring control device 11b at regular intervals.

ステップＳ５：
第１の生成部１２ａは、第１の時間周期の時間が正しく待ち合わせられているか否かを判別する。 Step S5:
The first generation unit 12a determines whether or not the time of the first time period is correctly queued.

ステップＳ６において「ＮＯ」：
第１の生成部１２ａは、第１の時間周期の時間が経過していないと判別した場合に、処理をステップＳ５に戻す。 “NO” in step S6:
If the first generation unit 12a determines that the first time period has not elapsed, the process returns to step S5.

即ち、第１の生成部１２ａは、もう一度、第１の時間周期の時間が正しく待ち合わせられているか否かを判別する。 In other words, the first generation unit 12a once again determines whether or not the time of the first time period is correctly queued.

ステップＳ６において「ＹＥＳ」：
第１の生成部１２ａは、第１の時間周期の時間が経過したと判別した場合に、処理をステップＳ１に戻す。 "YES" in step S6:
If the first generation unit 12a determines that the time of the first time period has elapsed, the process returns to step S1.

即ち、第１の生成部１２ａは、もう一度、監視対象のリソースが正常に動作しているか否かを判別する処理ステップＳ１に分岐する。 That is, the first generation unit 12a once again branches to the processing step S1 for determining whether or not the resource to be monitored is operating normally.

次に、図４は、本発明の第２の実施形態における監視制御装置１１が行う動作を示すフローチャートである。図４に示す第１の判別情報１０１を受信するフローチャートに沿って監視制御装置１１の動作手順を説明する。 Next, FIG. 4 is a flowchart showing an operation performed by the monitoring control device 11 in the second embodiment of the present invention. The operation procedure of the monitoring control apparatus 11 will be described along the flowchart for receiving the first discrimination information 101 shown in FIG.

ステップＳ１１：
第２の監視制御装置１１ｂの第２の監視部２ｂは、第２の通信部３ｂを介して第１の判別情報１０１を受信したか否かを判別する。 Step S11:
The second monitoring unit 2b of the second monitoring control device 11b determines whether or not the first determination information 101 has been received via the second communication unit 3b.

ステップＳ１２において「ＮＯ」：
第２の監視部２ｂは、第２の通信部３ｂを介して第１の判別情報１０１を受信したか否かを判別した結果、第１の判別情報１０１を受信していないと判別した場合に、処理をステップＳ１１に戻す。 “NO” in step S12:
When the second monitoring unit 2b determines whether or not the first determination information 101 has been received as a result of determining whether or not the first determination information 101 has been received via the second communication unit 3b. The process returns to step S11.

ステップＳ１２において「ＹＥＳ」：
第２の監視部２ｂは、第２の通信部３ｂを介して第１の判別情報１０１を受信したか否かを判別した結果、第１の判別情報１０１を受信していると判別した場合に、処理をステップＳ１３に進める。 “YES” in step S12:
When the second monitoring unit 2b determines whether the first determination information 101 has been received as a result of determining whether the first determination information 101 has been received via the second communication unit 3b. Then, the process proceeds to step S13.

ステップＳ１３：
第２の監視部２ｂは、受信した第１の判別情報１０１を第２の記憶部４ｂに格納する。 Step S13:
The second monitoring unit 2b stores the received first discrimination information 101 in the second storage unit 4b.

次に、図５Ａ及び図５Ｂは、本発明の第２の実施形態における監視制御装置１１が行う動作を示すフローチャートである。図５Ａ及び図５Ｂに示す「第２の時間周期」毎に受信した第１の判別情報１０１、または第２の判別情報の有無を判別するフローチャートに沿って監視制御装置１１の動作手順を説明する。 Next, FIG. 5A and FIG. 5B are flowcharts showing operations performed by the monitoring control device 11 according to the second embodiment of the present invention. The operation procedure of the monitoring control apparatus 11 will be described with reference to the flowchart for determining the presence / absence of the first determination information 101 or the second determination information received every “second time period” shown in FIGS. 5A and 5B. .

ステップＳ２１：
第２の監視部２ｂは、第２の時間周期の時間が経過するまで待つ（つまり、Ｗａｉｔする）。即ち、第２の監視部２ｂは、一定時間置きに第１の判別情報１０１が第２の記憶部４ｂにあるか否かを判別できるように待ち合わせる。 Step S21:
The second monitoring unit 2b waits (that is, waits) until the time of the second time period elapses. That is, the second monitoring unit 2b waits so that it can be determined whether or not the first determination information 101 exists in the second storage unit 4b at regular intervals.

ステップＳ２２：
第２の監視部２ｂは、第２の時間周期の時間が正しく待ち合わせられているか否かを判別する。 Step S22:
The second monitoring unit 2b determines whether or not the time of the second time period is correctly queued.

ステップＳ２３において「ＮＯ」：
第２の監視部２ｂは、第２の時間周期の時間が経過していないと判別した場合に、処理をステップＳ２２に戻す。 “NO” in step S23:
If the second monitoring unit 2b determines that the second time period has not elapsed, the process returns to step S22.

即ち、第２の監視部２ｂは、もう一度、第２の時間周期の時間が正しく待ち合わせられているか否かを判別する。 In other words, the second monitoring unit 2b determines once again whether or not the time of the second time period is properly queued.

ステップＳ２３において「ＹＥＳ」：
第２の監視部２ｂは、第２の時間周期の時間が経過したと判別した場合に、処理をステップＳ２４に進める。 “YES” in step S23:
If the second monitoring unit 2b determines that the time of the second time period has elapsed, the process proceeds to step S24.

ステップＳ２４：
第２の監視部２ｂは、第２の記憶部４ｂに第１の判別情報１０１があるか否かを判別する。 Step S24:
The second monitoring unit 2b determines whether or not the first determination information 101 exists in the second storage unit 4b.

ステップＳ２５において「ＹＥＳ」：
第２の監視部２ｂは、第２の記憶部４ｂに第１の判別情報１０１があるか否かを判別した結果、第１の判別情報１０１があると判別した場合に、処理をステップＳ２６に進める。 “YES” in step S25:
The second monitoring unit 2b determines whether there is the first determination information 101 in the second storage unit 4b. As a result, if the second monitoring unit 2b determines that the first determination information 101 exists, the process proceeds to step S26. Proceed.

ステップＳ２５において「ＮＯ」：
第２の監視部２ｂは、第２の記憶部４ｂに第１の判別情報１０１があるか否かを判別した結果、第１の判別情報１０１が無いと判別した場合に、処理をステップＳ２７に進める。 “NO” in step S25:
The second monitoring unit 2b determines whether there is the first determination information 101 in the second storage unit 4b. As a result, if the second monitoring unit 2b determines that there is no first determination information 101, the process proceeds to step S27. Proceed.

ステップＳ２６：
第２の監視部２ｂは、カウンタの値をクリア（リセット）すると共に、記憶部４に格納された第１の判別情報１０１を消去する。 Step S26:
The second monitoring unit 2b clears (resets) the value of the counter and erases the first determination information 101 stored in the storage unit 4.

ステップＳ２７：
第２の監視部２ｂは、第２の記憶部４ｂに第２の判別情報があるか否かを判別する。 Step S27:
The second monitoring unit 2b determines whether there is second determination information in the second storage unit 4b.

ステップＳ２８において「ＹＥＳ」：
第２の監視部２ｂは、第２の記憶部４ｂに第２の判別情報があるか否かを判別した結果、第２の判別情報があると判別した場合に、処理をステップＳ２９に進める。 "YES" in step S28:
If the second monitoring unit 2b determines whether there is the second determination information in the second storage unit 4b and determines that there is the second determination information, the second monitoring unit 2b advances the process to step S29.

ステップＳ２８において「ＮＯ」：
第２の監視部２ｂは、第２の記憶部４ｂに第２の判別情報があるか否かを判別した結果、第２の判別情報が無いと判別した場合に、処理をステップＳ３０に進める。 “NO” in step S28:
If the second monitoring unit 2b determines whether there is the second determination information in the second storage unit 4b and determines that there is no second determination information, the process proceeds to step S30.

ステップＳ２９：
第２の監視部２ｂは、監視対象の第１の組み込み制御装置１０ａに障害（故障）が発生したと判別する。即ち、第２の監視部２ｂは、第１の組み込み制御装置１０ａの障害（故障）を検出する。 Step S29:
The second monitoring unit 2b determines that a failure (failure) has occurred in the first embedded control device 10a to be monitored. That is, the second monitoring unit 2b detects a failure (failure) in the first embedded control device 10a.

つまり、監視元である第２の組み込み制御装置１０ｂ（自装置）は、正常に動作していることを意味する。一方で、監視対象である第１の組み込み制御装置１０ａは、障害が発生したことを意味する。 That is, it means that the second embedded control device 10b (own device) that is the monitoring source is operating normally. On the other hand, the first embedded control apparatus 10a that is the monitoring target means that a failure has occurred.

ステップＳ３０：
第２の監視部２ｂは、カウンタの値を参照すると共に、その値を「１」インクリメント（増加）する。 Step S30:
The second monitoring unit 2b refers to the value of the counter and increments (increases) the value by “1”.

ステップＳ３１：
第２の監視部２ｂは、インクリメントしたカウンタの値を判別する。即ち、第２の監視部２ｂは、インクリメントしたカウンタの値が「１」であるか否かを判別する。 Step S31:
The second monitoring unit 2b determines the incremented counter value. That is, the second monitoring unit 2b determines whether or not the incremented counter value is “1”.

ステップＳ３２において「ＹＥＳ」：
第２の監視部２ｂは、インクリメントしたカウンタの値が「１」であるか否かを判別した結果、カウンタの値が「１」であると判別した場合に、処理をステップＳ３３に進める。 “YES” in step S32:
As a result of determining whether or not the incremented counter value is “1”, the second monitoring unit 2 b advances the process to step S <b> 33 when determining that the counter value is “1”.

ステップＳ３２において「ＮＯ」：
第２の監視部２ｂは、インクリメントしたカウンタの値が「１」であるか否かを判別した結果、カウンタの値が「１」でないと判別した場合に、処理をステップＳ３６に進める。 "NO" in step S32:
As a result of determining whether or not the incremented counter value is “1”, the second monitoring unit 2b advances the process to step S36 when it is determined that the counter value is not “1”.

ステップＳ３３：
第２の監視部２ｂは、監視元である第２の組み込み制御装置１０ｂ（自装置）が正常に動作しているか否かを判別する。 Step S33:
The second monitoring unit 2b determines whether or not the second embedded control device 10b (own device) that is the monitoring source is operating normally.

ステップＳ３４において「ＹＥＳ」：
第２の監視部２ｂは、第２の組み込み制御装置１０ｂが正常に動作しているか否かを判別した結果、第２の組み込み制御装置１０ｂが正常に動作していると判別した場合に、処理をステップＳ３５に進める。 "YES" in step S34:
The second monitoring unit 2b determines whether the second embedded control device 10b is operating normally as a result of determining whether the second embedded control device 10b is operating normally. Advances to step S35.

ステップＳ３４において「ＮＯ」：
第２の監視部２ｂは、第２の組み込み制御装置１０ｂが正常に動作しているか否かを判別した結果、第２の組み込み制御装置１０ｂが正常に動作していないと判別した場合に、処理を終了する。 "NO" in step S34:
The second monitoring unit 2b determines whether or not the second embedded control device 10b is operating normally as a result of determining whether or not the second embedded control device 10b is operating normally. Exit.

ステップＳ３５：
第２の監視部２ｂは、第２の判別情報を生成すると共に、生成した第２の判別情報を第２の記憶部４ｂに格納する。 Step S35:
The second monitoring unit 2b generates second determination information and stores the generated second determination information in the second storage unit 4b.

ステップＳ３６：
第２の監視部２ｂは、インクリメントしたカウンタの値が「２」であるか否かを判別する。 Step S36:
The second monitoring unit 2b determines whether or not the incremented counter value is “2”.

ステップＳ３７において「ＹＥＳ」：
第２の監視部２ｂは、インクリメントしたカウンタの値が「２」であるか否かを判別した結果、カウンタの値が「２」であると判別した場合に、処理をステップＳ３８に進める。 “YES” in step S37:
As a result of determining whether or not the incremented counter value is “2”, the second monitoring unit 2b advances the process to step S38 when it is determined that the counter value is “2”.

ステップＳ３７において「ＮＯ」：
第２の監視部２ｂは、インクリメントしたカウンタの値が「２」であるか否かを判別した結果、カウンタの値が「２」でないと判別した場合に、処理を終了する。 "NO" in step S37:
As a result of determining whether or not the incremented counter value is “2”, the second monitoring unit 2b ends the process when it is determined that the counter value is not “2”.

ステップＳ３８：
第２の監視部２ｂは、監視元である第２の組み込み制御装置１０ｂ（自装置）に障害（故障）が発生したと判別する。即ち、第２の監視部２ｂは、第２の組み込み制御装置１０ｂの障害（故障）を検出する。 Step S38:
The second monitoring unit 2b determines that a failure (failure) has occurred in the second embedded control device 10b (own device) that is the monitoring source. That is, the second monitoring unit 2b detects a failure (failure) in the second embedded control device 10b.

ステップＳ３９：
第２の監視部２ｂは、監視元、または監視対象に発生した障害を検出したか否かを判別する。 Step S39:
The second monitoring unit 2b determines whether or not a failure that has occurred in the monitoring source or the monitoring target has been detected.

即ち、第２の監視部２ｂは、監視元である第２の組み込み制御装置１０ｂ、または監視対象の第１の組み込み制御装置１０ａに発生した障害を検出したか否かを判別する。 That is, the second monitoring unit 2b determines whether or not a failure has occurred in the second embedded control device 10b that is the monitoring source or the first embedded control device 10a that is the monitoring target.

ステップＳ４０において「ＹＥＳ」：
第２の監視部２ｂは、監視元、または監視対象に発生した障害を検出したか否かを判別した結果、その障害を検出した場合に、処理を終了する。 "YES" in step S40:
The second monitoring unit 2b determines whether or not a failure that has occurred in the monitoring source or the monitoring target has been detected, and ends the process when the failure is detected.

ステップＳ４０において「ＮＯ」：
第２の監視部２ｂは、監視元、または監視対象に発生した障害を検出したか否かを判別した結果、その障害を検出していない場合に、処理をステップＳ２１に戻す。 “NO” in step S40:
The second monitoring unit 2b returns the process to step S21 if the failure is not detected as a result of determining whether or not the failure that has occurred in the monitoring source or the monitoring target has been detected.

尚、上述した本実施形態では、説明の便宜上、一例として、監視制御装置１１は、相互に通信可能に接続された組み込み制御装置を監視する構成を例に説明した。しかしながら本発明に係る実施形態は、係る構成に限定されず、監視制御装置１１は、１つ以上の組み込み制御装置を監視する構成を採用してもよい。 In the embodiment described above, for the sake of convenience of explanation, as an example, the monitoring control apparatus 11 has been described by taking an example of a configuration that monitors embedded control apparatuses that are communicably connected to each other. However, the embodiment according to the present invention is not limited to such a configuration, and the monitoring control device 11 may adopt a configuration for monitoring one or more embedded control devices.

その場合に、監視部２は、１つ以上の監視制御装置１１から受信した第１の判別情報１０１を識別可能なように記憶部４に格納する構成を採用してもよい。 In this case, the monitoring unit 2 may adopt a configuration in which the first determination information 101 received from one or more monitoring control devices 11 is stored in the storage unit 4 so that it can be identified.

このように本実施の形態に係る監視制御装置１１によれば、第１の実施形態において説明した効果を享受できると共に、さらに、速やかに装置の故障を検出することができる。その理由は、監視制御装置１１は、さらに、生成部１２を有するからである。生成部１２は、監視部２とは別に自装置が正常に動作するか否かを判別すると共に、自装置が正常に動作する場合には、第１の判別情報１０１を監視元の装置に送信することができるからである。そのため、装置間で相互に障害を監視する場合に、監視制御装置１１は、監視部２と生成部１２とのそれぞれが所定の動作をすることによって、より速やかに障害を検出することができる。 As described above, according to the monitoring control device 11 according to the present embodiment, the effects described in the first embodiment can be enjoyed, and further, the failure of the device can be detected promptly. The reason is that the monitoring control device 11 further includes a generation unit 12. The generation unit 12 determines whether or not the own device operates normally separately from the monitoring unit 2, and transmits the first determination information 101 to the monitoring source device when the own device operates normally. Because it can be done. Therefore, when monitoring a failure between devices, the monitoring control device 11 can detect the failure more quickly by the monitoring unit 2 and the generation unit 12 performing predetermined operations.

＜第３の実施形態＞
次に、上述した本発明の第２の実施形態に係る監視制御装置１１を基本とする第３の実施形態について説明する。以下の説明においては、本実施形態に係る特徴的な部分を中心に説明する。その際、上述した各実施形態と同様な構成については、同一の参照番号を付すことにより、重複する説明は省略する。 <Third Embodiment>
Next, a third embodiment based on the monitoring control device 11 according to the second embodiment of the present invention described above will be described. In the following description, the characteristic part according to the present embodiment will be mainly described. At this time, the same reference numerals are assigned to the same configurations as those in the above-described embodiments, and duplicate descriptions are omitted.

図６は、本発明の第３の実施形態における監視制御装置２１の構成を示すブロック図である。 FIG. 6 is a block diagram showing the configuration of the monitoring control device 21 in the third embodiment of the present invention.

図６において、監視制御装置２１は、生成部１２と、更新部２２と、監視部２３と、検出部２４とを有する。 In FIG. 6, the monitoring control device 21 includes a generation unit 12, an update unit 22, a monitoring unit 23, and a detection unit 24.

尚、更新部２２、監視部２３及び検出部２４は、上述した第２の実施形態における監視部２に相当する。 Note that the update unit 22, the monitoring unit 23, and the detection unit 24 correspond to the monitoring unit 2 in the second embodiment described above.

即ち、更新部２２、監視部２３及び検出部２４は、上述した第２の実施形態における監視部２の処理を実行する。 That is, the update unit 22, the monitoring unit 23, and the detection unit 24 execute the process of the monitoring unit 2 in the second embodiment described above.

より具体的に、更新部２２は、図４においてステップＳ１１乃至ステップＳ１３の処理を実行する。さらに、監視部２３は、図５Ａ及び図５ＢにおいてステップＳ２１乃至ステップＳ２９の処理を実行する。また、検出部２４は、図５Ａ及び図５ＢにおいてステップＳ３０乃至ステップＳ４０の処理を実行する。 More specifically, the update unit 22 executes the processes of steps S11 to S13 in FIG. Furthermore, the monitoring unit 23 executes the processes of steps S21 to S29 in FIGS. 5A and 5B. Moreover, the detection part 24 performs the process of step S30 thru | or step S40 in FIG. 5A and 5B.

このように本実施の形態に係る監視制御装置２１によれば、上述した各実施形態において説明した効果を享受できると共に、さらに、より速やかに装置の故障を検出することができる。その理由は、監視制御装置２１は、第２の実施形態において説明した監視部２が行う処理を分散すると共に、分散した処理を更新部２２、監視部２３及び検出部２４が実行するからである。 As described above, according to the monitoring control device 21 according to the present embodiment, the effects described in the above-described embodiments can be enjoyed, and further, the failure of the device can be detected more promptly. The reason is that the monitoring control device 21 distributes the processing performed by the monitoring unit 2 described in the second embodiment, and the updating unit 22, the monitoring unit 23, and the detection unit 24 execute the distributed processing. .

（ハードウェア構成例）
上述した実施形態において図１、図２、図６に示した各部は、ソフトウェアプログラムの機能（処理）単位（ソフトウェアモジュール）と捉えることができる。但し、これらの図面に示した各部の区分けは、説明の便宜上の構成であり、実装に際しては、様々な構成が想定され得る。この場合のハードウェア環境の一例を、図７を参照して説明する。 (Hardware configuration example)
In the embodiment described above, each unit shown in FIGS. 1, 2, and 6 can be regarded as a function (processing) unit (software module) of a software program. However, the division of each part shown in these drawings is a configuration for convenience of explanation, and various configurations can be assumed for mounting. An example of the hardware environment in this case will be described with reference to FIG.

図７は、本発明の模範的な実施形態に係る監視制御装置を実行可能な情報処理装置３００（コンピュータ）の構成を例示的に説明する図である。即ち、図７は、図１に示した監視制御装置１、或いは、図２に示した監視制御装置１１、図６に示した監視制御装置２１の全体または一部の監視制御装置を実現可能なサーバやホストコンピュータ等のコンピュータ（情報処理装置）の構成であって、上述した実施形態における各機能を実現可能なハードウェア環境を表す。 FIG. 7 is a diagram illustrating an exemplary configuration of an information processing device 300 (computer) that can execute the monitoring control device according to the exemplary embodiment of the present invention. That is, FIG. 7 can implement the monitoring control apparatus 1 shown in FIG. 1, the monitoring control apparatus 11 shown in FIG. 2, or the whole or a part of the monitoring control apparatus 21 shown in FIG. 6. This is a configuration of a computer (information processing apparatus) such as a server or a host computer, and represents a hardware environment capable of realizing each function in the above-described embodiment.

図７に示した情報処理装置３００は、ＣＰＵ（Ｃｅｎｔｒａｌ＿Ｐｒｏｃｅｓｓｉｎｇ＿Ｕｎｉｔ）３０１、ＲＯＭ（Ｒｅａｄ＿Ｏｎｌｙ＿Ｍｅｍｏｒｙ）３０２、ＲＡＭ（Ｒａｎｄｏｍ＿Ａｃｃｅｓｓ＿Ｍｅｍｏｒｙ）３０３、ハードディスク３０４（記憶装置）、並びに外部装置との通信インタフェース（Ｉｎｔｅｒｆａｃｅ：以降、「Ｉ／Ｆ」と称する）３０５、ＣＤ−ＲＯＭ（Ｃｏｍｐａｃｔ＿Ｄｉｓｃ＿Ｒｅａｄ＿Ｏｎｌｙ＿Ｍｅｍｏｒｙ）等の記憶媒体３０７に格納されたデータを読み書き可能なリーダライタ３０８を備え、これらの構成がバス３０６（通信線）を介して接続された一般的なコンピュータである。 The information processing apparatus 300 illustrated in FIG. 7 includes a CPU (Central_Processing_Unit) 301, a ROM (Read_Only_Memory) 302, a RAM (Random_Access_Memory) 303, a hard disk 304 (storage device), and a communication interface (Interface: hereinafter “I”). / F ”) 305, and a reader / writer 308 capable of reading and writing data stored in a storage medium 307 such as a CD-ROM (Compact_Disc_Read_Only_Memory), and these components are connected via a bus 306 (communication line). It is a general computer.

そして、上述した各実施形態を例に説明した本発明は、図７に示した情報処理装置３００に対して、その実施形態の説明において参照したブロック構成図（図１、図２、図６）或いはフローチャート（図３、図４、図５Ａ、図５Ｂ）の機能を実現可能なコンピュータ・プログラムを供給した後、そのコンピュータ・プログラムを、当該ハードウェアのＣＰＵ３０１に読み出して解釈し実行することによって達成される。また、当該装置内に供給されたコンピュータ・プログラムは、読み書き可能な揮発性の記憶メモリ（ＲＡＭ３０３）またはハードディスク３０４等の不揮発性の記憶デバイスに格納すれば良い。 The present invention described by taking each of the embodiments described above as an example is a block configuration diagram (FIGS. 1, 2, and 6) referred to in the description of the information processing apparatus 300 illustrated in FIG. Alternatively, it is achieved by supplying a computer program capable of realizing the functions of the flowcharts (FIGS. 3, 4, 5A, and 5B), reading the computer program to the CPU 301 of the hardware, and interpreting and executing the computer program. Is done. The computer program supplied to the apparatus may be stored in a readable / writable volatile storage memory (RAM 303) or a nonvolatile storage device such as the hard disk 304.

また、前記の場合において、当該ハードウェア内へのコンピュータ・プログラムの供給方法は、ＣＤ−ＲＯＭ等の各種記憶媒体３０７を介して当該装置内にインストールする方法や、インターネット等の通信回線を介して外部よりダウンロードする方法等のように、現在では一般的な手順を採用することができる。そして、このような場合において、本発明は、係るコンピュータ・プログラムを構成するコード或いは、そのコードが格納された記憶媒体３０７によって構成されると捉えることができる。 In the above case, the computer program can be supplied to the hardware by a method of installing the computer program via various storage media 307 such as a CD-ROM or a communication line such as the Internet. Currently, a general procedure can be adopted, such as a method of downloading from the outside. In such a case, the present invention can be understood to be configured by a code constituting the computer program or a storage medium 307 in which the code is stored.

本発明は、上述した各実施形態には限定されない。本発明は、パーソナルコンピュータ、ＰＯＳ（Ｐｏｉｎｔ＿Ｏｆ＿Ｓａｌｅ）システム、サーバ、複写機、複合機等のリソースの障害を監視する機能を搭載する各種の電子機器に適用可能である。 The present invention is not limited to the embodiments described above. The present invention can be applied to various electronic devices equipped with a function for monitoring a failure of a resource such as a personal computer, a POS (Point_Of_Sale) system, a server, a copier, and a multifunction peripheral.

１監視制御装置
２監視部
２ａ第１の監視部
２ｂ第２の監視部
３通信部
３ａ第１の通信部
３ｂ第２の通信部
４記憶部
４ａ第１の記憶部
４ｂ第２の記憶部
１０組み込み制御装置
１０ａ第１の組み込み制御装置
１０ｂ第２の組み込み制御装置
１１監視制御装置
１１ａ第１の監視制御装置
１１ｂ第２の監視制御装置
１２生成部
１２ａ第１の生成部
１２ｂ第２の生成部
２１監視制御装置
２２更新部
２３監視部
２４検出部
１０１第１の判別情報
３００情報処理装置
３０１ＣＰＵ
３０２ＲＯＭ
３０３ＲＡＭ
３０４ハードディスク
３０５通信インタフェース
３０６バス
３０７記憶媒体
３０８リーダライタ DESCRIPTION OF SYMBOLS 1 Monitoring control apparatus 2 Monitoring part 2a 1st monitoring part 2b 2nd monitoring part 3 Communication part 3a 1st communication part 3b 2nd communication part 4 Memory | storage part 4a 1st memory | storage part 4b 2nd memory | storage part 10 Embedded control device 10a First embedded control device 10b Second embedded control device 11 Monitoring control device 11a First monitoring control device 11b Second monitoring control device 12 Generation unit 12a First generation unit 12b Second generation unit DESCRIPTION OF SYMBOLS 21 Monitoring control apparatus 22 Update part 23 Monitoring part 24 Detection part 101 1st discrimination | determination information 300 Information processing apparatus 301 CPU
302 ROM
303 RAM
304 Hard Disk 305 Communication Interface 306 Bus 307 Storage Medium 308 Reader / Writer

Claims

The first determination information indicating that the monitored resource is operating normally is received via the communication unit, and the first determination information is stored in the storage unit, and the first determination information is stored in the storage unit. Determining whether there is discrimination information or second discrimination information different from the first discrimination information;
When it is determined that there is the second determination information, it is determined that a failure has occurred in the monitored resource,
A monitoring control apparatus, wherein when there is no first and second determination information, it is determined that a failure has occurred in a monitoring source resource according to a counter value.

When there is no first and second determination information, the value of the counter is referred to and the value is increased.
When the increased value is the first value, it is determined whether or not the monitoring source resource is operating normally, and when it is determined that the resource is operating normally, the second determination information is And generating the generated second discrimination information in the storage unit,
The monitoring control apparatus according to claim 1, wherein when the increased value is a second value different from the first value, it is determined that a failure has occurred in the monitoring source resource.

A generation unit that determines whether or not the resource to be monitored is operating normally and generates the first determination information for each first time period when the resource is operating normally, The monitoring control device according to claim 1, further comprising: a monitoring control device according to claim 1.

The monitoring and control device includes:
The presence / absence of the first determination information or the second determination information in the storage unit is determined every second time period different from the first time period. 4. The monitoring control device according to any one of 3.

The first discrimination information and the second discrimination information are:
5. The monitoring control apparatus according to claim 1, wherein the monitoring control apparatus is heartbeat information.

An embedded control device comprising the monitoring control device according to any one of claims 1 to 5.

The first determination information indicating that the monitored resource is operating normally is received via the communication unit, and the first determination information is stored in the storage unit, and the first determination information is stored in the storage unit. Determining whether there is discrimination information or second discrimination information different from the first discrimination information;
When it is determined that there is the second determination information, it is determined that a failure has occurred in the monitored resource,
A monitoring control method comprising: determining that a failure has occurred in a monitoring source resource in accordance with a counter value when there is no first and second determination information.

When there is no first and second determination information, the value of the counter is referred to and the value is increased.
When the increased value is the first value, it is determined whether or not the monitoring source resource is operating normally, and when it is determined that the resource is operating normally, the second determination information is And generating the generated second discrimination information in the storage unit,
The monitoring control method according to claim 7, wherein when the increased value is a second value different from the first value, it is determined that a failure has occurred in the monitoring source resource.

A computer program for controlling the operation of the monitoring control device, the computer program
The first determination information indicating that the monitored resource is operating normally is received via the communication unit, and the first determination information is stored in the storage unit, and the first determination information is stored in the storage unit. Determining whether there is discrimination information or second discrimination information different from the first discrimination information;
If it is determined that there is the second determination information, it is determined that a failure has occurred in the monitored resource; and
A computer program for causing a computer to realize a function of determining that a failure has occurred in a monitoring source resource in accordance with a counter value when there is no first and second determination information.

When there is no first and second determination information, the value of the counter is referred to and the value is increased.
When the increased value is the first value, it is determined whether or not the monitoring source resource is operating normally, and when it is determined that the resource is operating normally, the second determination information is And generating the generated second discrimination information in the storage unit,
10. The computer program product according to claim 9, wherein when the increased value is a second value different from the first value, it is determined that a failure has occurred in the monitoring source resource.