JP2008259047A - Network monitoring system, and centralized monitoring apparatus

Info

Publication number: JP2008259047A
Application number: JP2007100840A
Authority: JP
Inventors: Kimihiko Ariga; 公彦有賀
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2007-04-06
Filing date: 2007-04-06
Publication date: 2008-10-23

Abstract

PROBLEM TO BE SOLVED: To easily construct a distributed monitoring environment when integrally monitoring a large-scale network.

SOLUTION: When monitoring a large-scale network, it is difficult for a single monitoring apparatus to perform all monitoring because of the load imposed on it. Therefore, a plurality of distributed processing apparatuses 300 are installed, each of which performs only status confirmation processing for the network equipment 10 to be monitored and watches for status changes in that equipment, so that the network can be monitored with one monitoring apparatus 100B and the plurality of distributed processing apparatuses 300.

COPYRIGHT: (C)2009, JPO&INPIT

Description

The present invention relates to a network monitoring system and a centralized monitoring apparatus, and more particularly to a network monitoring system and a centralized monitoring apparatus with excellent reliability.

Conventionally, when a large-scale network is monitored, a configuration such as that shown in FIG. 1 is generally used. FIG. 1 is a block diagram of a distributed monitoring system. In FIG. 1, the network devices to be monitored (monitoring target devices) 10-1 to 10-10 are watched by a centralized monitoring apparatus 100A together with distributed monitoring apparatuses 200-1 and 200-2, each of which is responsible for part of the monitoring target devices, so that the network as a whole is monitored. In FIG. 1, the distributed monitoring apparatus 200-1 monitors the network devices 10-1 to 10-5, and the distributed monitoring apparatus 200-2 monitors the network devices 10-6 to 10-10.

In this form of monitoring, the centralized monitoring apparatus 100A and the distributed monitoring apparatuses 200 each hold the node list of the entire network shown in FIG. 2. FIG. 2 is a diagram explaining the node list. In FIG. 2, the distributed monitoring apparatus 200-1 holds a list 20 that includes the network devices 10-6 to 10-10, which it does not itself monitor. Similarly, the distributed monitoring apparatus 200-2 holds a list that includes the network devices 10-1 to 10-5, which it does not itself monitor.

The operation of this system will be described with reference to FIG. 3. FIG. 3 is a diagram explaining the functional blocks and operation of the distributed monitoring system. FIG. 3 shows only the processing related to one network device (namely 10-1) among the monitoring target devices shown in FIG. 1.

Network monitoring should ideally be performed in an integrated manner, and for that purpose it is desirable for a single monitoring apparatus to monitor all the network devices (10-1 to 10-10). However, as the network grows and the number of monitoring target devices increases, the load of sending and receiving monitoring packets rises, and it becomes very difficult to monitor every device with one monitoring apparatus. For example, when 1000 devices are monitored and a failure must be detected within 5 minutes, status confirmation packets must be exchanged with about 3 devices every second. If a failure or similar event leaves many devices unable to respond, packet processing and failure detection consume so many resources that the monitoring apparatus becomes heavily loaded and may no longer be able to monitor the network normally. It is therefore necessary to install a plurality of monitoring apparatuses in the network and build a distributed monitoring system.
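The figure of about 3 devices per second follows from dividing the device count by the detection window. The sketch below shows that arithmetic, assuming each device is polled exactly once per window; the function name `required_poll_rate` is illustrative and does not appear in the source.

```python
def required_poll_rate(device_count: int, detection_window_s: float) -> float:
    """Devices that must be polled per second to visit every device once per window."""
    if detection_window_s <= 0:
        raise ValueError("detection window must be positive")
    return device_count / detection_window_s


# 1000 devices, failure detection within 5 minutes (300 seconds).
rate = required_poll_rate(1000, 5 * 60)
print(f"{rate:.2f} devices per second")  # prints "3.33 devices per second"
```

Retries and timeouts for unresponsive devices would add to this baseline, which is the overload scenario the paragraph describes.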

In FIG. 3, the centralized monitoring apparatus 100A includes a monitoring function unit 110A and a communication unit 120A. The monitoring function unit 110A includes a target node list 111A and a failure determination condition table 112A. The distributed monitoring apparatus 200-1 includes a monitoring function unit 210 and a communication unit 220. Like the monitoring function unit 110A, the monitoring function unit 210 includes a target node list 211 and a failure determination condition table 212. The network device 10-1 includes a communication unit 12 and a control unit 15.

In the distributed monitoring system shown in FIG. 3, monitoring the network device 10-1 requires that information on the network device 10-1 be registered in both the centralized monitoring apparatus 100A and the distributed monitoring apparatus 200-1. The network configuration and the failure determination conditions must likewise be registered in both. In the centralized monitoring apparatus 100A and the distributed monitoring apparatus 200-1, the operator writes the monitoring target node information into the target node lists 111A and 211 based on the registered target node information. The operator also writes the failure determination conditions into the failure determination condition tables 112A and 212 based on the registered failure determination conditions. The monitoring function unit 110A of the centralized monitoring apparatus 100A then performs target node determination processing with reference to the target node list 111A and the failure determination condition table 112A (S101). Likewise, the monitoring function unit 210 of the distributed monitoring apparatus 200-1 performs target node determination processing with reference to the target node list 211 and the failure determination condition table 212 (S201). Through this processing, the centralized monitoring apparatus 100A and the distributed monitoring apparatus 200-1 understand the basic configuration of the network and autonomously control their failure detection and device monitoring functions according to the failures that occur. The centralized monitoring apparatus 100A and the distributed monitoring apparatus 200-1 also provide information that helps the network monitoring staff identify the failure location and restore service.

Under normal conditions, the actual monitoring is performed by the distributed monitoring apparatus 200-1. The distributed monitoring apparatus 200-1 performs status confirmation processing on the network device 10-1, which was determined in step S201 to be subject to status confirmation (S202). This status confirmation is performed mainly by Ping (Packet Internet Groper). The network device 10-1 receives the status confirmation data (S203), generates in its control unit 15 data indicating its current device status as a response to the confirmation data (S204), and transmits that status data (S206). The content of the data actually transmitted is very limited and carries only enough information to judge whether the target device is operating. The distributed monitoring apparatus 200-1 receives the status data from the target node (S207) and determines from the received data whether a failure has occurred (S208). If the failure determination in step S208 concludes that a failure has occurred in the network, the apparatus changes the screen display, issues an alarm (S209), and so on, according to its settings.

Furthermore, the distributed monitoring apparatus 200-1 notifies the centralized monitoring apparatus 100A that a failure has occurred in the network (S211). The centralized monitoring apparatus 100A receives the failure notification (S212), checks the data received from the distributed monitoring apparatus 200-1 (S213), determines the failure state (S214), and then, as the distributed monitoring apparatus 200-1 did, changes the screen display and issues an alarm (S216). It also notifies the other distributed monitoring apparatus 200-2 that a failure has occurred (S217), so that the information is kept consistent across the monitoring apparatuses.

In other words, in the conventional monitoring system, the failure notification sent from the distributed monitoring apparatus 200-1 to the centralized monitoring apparatus 100A is data generated from the basic configuration of the network and the conditions for judging a failure. To generate this failure data, the centralized monitoring apparatus 100A and the distributed monitoring apparatus 200-1 must both hold information such as the basic configuration of the network, and that data must be identical across all of the monitoring apparatuses. If the data held by the centralized monitoring apparatus 100A and a distributed monitoring apparatus 200 ever disagree, the distributed monitoring apparatus 200 may fail to detect an actual failure, or the centralized monitoring apparatus 100A may analyze the notified information, not judge it to be a failure, and issue no alarm. The distributed monitoring apparatus 200 has almost the same functions as the centralized monitoring apparatus 100A and can perform processing such as determining the basic structure of the network and the failure state. As a result, however, the specifications required of the equipment are high, and close attention must be paid to the consistency of the setting information.

To monitor a large-scale network, it is necessary to install a plurality of monitoring apparatuses, have each monitor part of the network, and make them operate in cooperation with one another.

For the centralized and distributed monitoring apparatuses to cooperate in this way, overlapping functions must be implemented in both. The overlapping functions referred to here are the functions that grasp the network configuration from the network device information, configuration information, and failure determination conditions and operate accordingly.

Consequently, large-scale software must be installed on both the centralized and the distributed monitoring apparatuses, and the required machine specifications are also high. In addition, the design and construction work needed to introduce that software is costly.

Moreover, because the distributed monitoring apparatuses and the centralized monitoring apparatus operate in conjunction, all of them must be given uniform settings for the monitoring system to operate normally and smoothly, and revising the monitoring apparatuses after a configuration change made once the network has been built is extremely laborious.

If consistency is lost, the failure events detected by the individual monitoring apparatuses may differ, which seriously disrupts the failure identification process carried out by the network administrator and also lengthens the time needed for recovery. For a monitoring system introduced to identify failure locations and shorten recovery time, this is a fatal problem.

The present invention solves the above problems by introducing a distributed processing apparatus, which is a distributed monitoring apparatus with reduced functions.

By narrowing down the functions required of the distributed monitoring apparatus and making it small in scale, the specifications required of the apparatus itself are reduced and, at the same time, the cost of introduction is lowered. The distributed monitoring apparatus is also made highly subordinate: it acts only as an agent operating on instructions from the centralized monitoring apparatus, and no settings are made on the individual apparatuses. Inconsistencies therefore do not arise, and the problem is solved.

Specifically, network monitoring can be performed without registering in the distributed processing apparatus the network device information, configuration information, and failure determination conditions that a conventional distributed monitoring apparatus required.

The above problem can be solved by a network monitoring system comprising a centralized monitoring apparatus and a plurality of distributed processing apparatuses, in which the centralized monitoring apparatus selects, based on the distances between a network device constituting the network and the plurality of distributed processing apparatuses, a first distributed processing apparatus to communicate with the network device and transmits the address of the network device to the first distributed processing apparatus; the first distributed processing apparatus periodically transmits a first packet to check the state of the network device and, when the state of the network device has changed, transmits a second packet to the centralized monitoring apparatus; and the centralized monitoring apparatus issues an alarm when it receives the second packet.

The problem can also be solved by a centralized monitoring apparatus that is connected to a plurality of distributed processing apparatuses, selects, based on the distances between a network device constituting the network and the plurality of distributed processing apparatuses, a first distributed processing apparatus to communicate with the network device, transmits the address of the network device to the first distributed processing apparatus, and issues an alarm when it receives a packet from the first distributed processing apparatus.

The present invention greatly reduces the cost of monitoring a large-scale network. The cost referred to here includes the hardware cost that can be saved by trimming the functions of the distributed monitoring apparatus into a distributed processing apparatus, but it includes even more the cost of operating and maintaining the monitoring apparatuses (that is, personnel cost). With lower personnel cost, monitoring of a large-scale network becomes easy to realize, and even a large network can be monitored all the way to its edges, so the reliability of the network improves.

Embodiments of the present invention will now be described in detail using examples, with reference to FIGS. 4 to 18. In the following examples, the same parts are given the same reference numerals and their description is not repeated. FIG. 4 is a block diagram of the distributed monitoring system. FIG. 5 is a diagram explaining the node lists. FIG. 6 is a diagram explaining the functional blocks and operation of the distributed monitoring system. FIG. 7 is a block diagram explaining the configuration of the distributed monitoring network. FIG. 8 is a diagram explaining the network device information table. FIG. 9 is a diagram explaining the network configuration information table. FIG. 10 is a diagram explaining the failure determination condition table. FIG. 11 is a diagram explaining the network topology. FIG. 12 is a flowchart explaining the target node list creation processing of the centralized monitoring apparatus. FIG. 13 is a diagram explaining the node list generation process. FIG. 14 is a diagram explaining the generation of the state holding table. FIG. 15 is a diagram explaining the state holding table. FIG. 16 is a diagram explaining the arrangement of a plurality of distributed monitoring systems on a network. FIGS. 17 and 18 are diagrams explaining the arrangement of distributed processing apparatuses on the same network.

In FIG. 4, the distributed monitoring system 500 comprises a centralized monitoring apparatus 100B, two distributed processing apparatuses 300, and ten network devices 10 to be monitored. The distributed processing apparatus 300-1 performs communication processing with the network devices 10-1 to 10-5, and the distributed processing apparatus 300-2 performs communication processing with the network devices 10-6 to 10-10. The distributed monitoring system 500 of FIG. 4 resembles the distributed monitoring system of FIG. 1 in configuration. However, the node lists held by the centralized monitoring apparatus 100B and the distributed processing apparatuses 300 differ, as shown in FIG. 5.

In FIG. 5, FIG. 5(a) shows the node list 20 held by the centralized monitoring apparatus 100B, FIG. 5(b) shows the node list 20A-1 held by the distributed processing apparatus 300-1, and FIG. 5(c) shows the node list 20A-2 held by the distributed processing apparatus 300-2. The node list 20 held by the centralized monitoring apparatus 100B is identical to the node list 20 of FIG. 2. In contrast, the node list 20A held by each distributed processing apparatus 300 lists only the network devices 10 with which that apparatus communicates. In other words, the distributed processing apparatus 300 does not know the overall configuration of the network.

Returning to FIG. 4, the distributed processing apparatuses 300-1 and 300-2 implement a part of the functions performed by the distributed monitoring apparatuses 200-1 and 200-2 in the conventional configuration, and handle part of the processing as agents of the centralized monitoring apparatus.

The main reason it is considered unrealistic to monitor a large-scale network with a single monitoring apparatus is that processing the status confirmation packets for the network devices 10 places a heavy load on that apparatus. The configuration of this embodiment solves the problem by having the distributed processing apparatuses 300-1 and 300-2 share the processing of the status confirmation packets for the network devices 10-1 to 10-10.

In FIG. 6, the centralized monitoring apparatus 100B comprises a target node setting unit 130, a monitoring function unit 110B, a storage unit 140, and a communication unit 120B. The distributed processing apparatus 300 comprises a target node registration unit 310, a communication unit 330, a storage unit 320, and a monitoring function unit 340. The network device 10 is configured as described above. The target node setting unit 130 of the centralized monitoring apparatus 100B holds a target node list 111B and a node list transfer condition table 132. The storage unit 140 of the centralized monitoring apparatus 100B holds a target node list 141. The monitoring function unit 110B of the centralized monitoring apparatus 100B holds a failure determination condition table 112B. The storage unit 320 of the distributed processing apparatus 300 holds a target node list 321 and a target node state holding table 322.

Compared with the centralized monitoring apparatus 100A, the centralized monitoring apparatus 100B additionally has the target node setting unit 130 and the storage unit 140. The target node setting unit 130 performs the processing that divides all the monitoring target nodes registered in the centralized monitoring apparatus 100B between the distributed processing apparatuses 300-1 and 300-2, and the storage unit 140 is the area in which those nodes are stored. The operator of the centralized monitoring apparatus 100B registers in the target node list 111B all the nodes that the centralized monitoring apparatus 100B is to monitor. The operator also registers the node list transfer conditions in the node list transfer condition table 132. These conditions determine how the target node list generated by the centralized monitoring apparatus 100B is distributed among the distributed processing apparatuses 300. The operator further registers the failure determination conditions in the failure determination condition table 112B.

The node list transfer condition table 132 consists of pairs of a distributed processing apparatus 300-1 or 300-2 to which a node list is transferred and the upper limit on the number of nodes that apparatus can process. Its purpose is to distribute the target nodes according to differences in capability when the target node list 201 registered in the centralized monitoring apparatus 100B is allocated to the distributed processing apparatuses 300 in the network.

The centralized monitoring apparatus 100B generates target node lists based on the target node list 111B and the node list transfer condition table 132 (S111) and records them in the target node list 141. The centralized monitoring apparatus 100B distributes the node lists to the relevant distributed processing apparatuses 300-1 and 300-2 based on the segment information of the network. In doing so, it uses the network topology information held by the centralized monitoring apparatus 100B to distribute the nodes effectively.
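The source does not spell out the distribution algorithm. As one possible reading, the sketch below assigns each node to the nearest distributed processing apparatus that still has spare capacity. The greedy strategy, the use of hop counts as the distance, the function name `allocate_nodes`, and all sample values are assumptions made for illustration.

```python
from typing import Dict, List, Tuple


def allocate_nodes(
    nodes: List[str],
    capacity: Dict[str, int],
    distance: Dict[Tuple[str, str], int],
) -> Dict[str, List[str]]:
    """Assign each node to the nearest processor that still has spare capacity."""
    remaining = dict(capacity)
    assignment: Dict[str, List[str]] = {proc: [] for proc in capacity}
    for node in nodes:
        candidates = [proc for proc, free in remaining.items() if free > 0]
        if not candidates:
            raise ValueError(f"no capacity left for node {node}")
        # Nearest processor wins; the processor name breaks ties deterministically.
        nearest = min(candidates, key=lambda proc: (distance[(proc, node)], proc))
        assignment[nearest].append(node)
        remaining[nearest] -= 1
    return assignment


# Hypothetical transfer conditions: processor -> upper limit on node count.
capacity = {"300-1": 2, "300-2": 3}
# Hypothetical hop counts between processors and network devices.
distance = {
    ("300-1", "10-1"): 0, ("300-1", "10-2"): 1,
    ("300-1", "10-3"): 2, ("300-1", "10-4"): 3,
    ("300-2", "10-1"): 2, ("300-2", "10-2"): 3,
    ("300-2", "10-3"): 0, ("300-2", "10-4"): 1,
}
plan = allocate_nodes(["10-1", "10-2", "10-3", "10-4"], capacity, distance)
print(plan)
```

A greedy pass is simple but not optimal in general: an early node can take the last slot of a processor that a later node needs more.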

When the distribution is complete, the communication unit 120B transfers each target node list to the corresponding distributed processing apparatus 300 (S112).

On receiving the target node list at its communication unit 330 (S113), the distributed processing apparatus 300 passes the list to the target node registration unit 310. The target node registration unit 310 updates the target node list 321 in the storage unit 320 based on the received list (S114). The monitoring function unit 340 of the distributed processing apparatus 300 performs status confirmation on the target nodes 10 registered in the target node list 321 (S116). The monitoring function unit 340 receives status data from the target node 10 (S117). Here, receiving status data includes a timeout that occurs when no response is received within a predetermined time.

After receiving the status data from the target node 10, the monitoring function unit 340 determines the state of the target node 10 (S118). Specifically, it determines whether the status data from the node has changed compared with the status data sent by the same device the previous time. When status confirmation is performed by Ping, the health of a device is basically judged by whether it responded, so the criterion in step S118 is whether the target device responded at the previous status confirmation.

If the state determination finds that the state has changed (the device responded last time but not this time, or did not respond last time but responded this time), the distributed processing apparatus 300 notifies the centralized monitoring apparatus 100B of the node whose state changed (S119). The notification carries only the minimum information the centralized monitoring apparatus 100B needs to identify that node (information that uniquely identifies the node whose state changed, specifically its IP address); the current state of the node and the like are not transferred. In step S118, the monitoring function unit 340 makes its determination by referring to the target node state holding table 322 recorded in the storage unit 320, and then updates that table. The default value in the target node state holding table 322 is "responded" for every target node, meaning normal.
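The state-change rule described in steps S116 to S119 can be sketched as follows. The class and function names are hypothetical, the probe is passed in as a plain callable rather than a real Ping so the example runs without network access, and the addresses are taken from a documentation range.

```python
from typing import Callable, Dict, Iterable, List


class StateHoldingTable:
    """Remembers, per IP address, whether the node responded at the last check."""

    def __init__(self, target_ips: Iterable[str]) -> None:
        # Default for every target node is "responded" (True), meaning normal.
        self.responded: Dict[str, bool] = {ip: True for ip in target_ips}

    def record(self, ip: str, responded_now: bool) -> bool:
        """Store the new result and return True if it differs from the last one."""
        changed = self.responded[ip] != responded_now
        self.responded[ip] = responded_now
        return changed


def poll_once(table: StateHoldingTable, probe: Callable[[str], bool]) -> List[str]:
    """Probe every target and return the IP addresses whose state changed."""
    changed_ips = []
    for ip in list(table.responded):
        if table.record(ip, probe(ip)):
            changed_ips.append(ip)
    return changed_ips


# Simulated probe results for three polling rounds.
rounds = [
    {"192.0.2.1": True, "192.0.2.2": True},
    {"192.0.2.1": True, "192.0.2.2": False},
    {"192.0.2.1": True, "192.0.2.2": False},
]
table = StateHoldingTable(["192.0.2.1", "192.0.2.2"])
notifications = [poll_once(table, lambda ip, r=r: r[ip]) for r in rounds]
print(notifications)  # [[], ['192.0.2.2'], []]
```

Only the second round produces a notification: the third round sees the same "no response" result as before, so nothing is sent, and the notification itself carries only the IP address.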

When the centralized monitoring apparatus 100B receives the IP address of the node whose state changed from the distributed processing apparatus 300 (S121), it checks whether the node reported by the distributed processing apparatus 300-1 is under its own management, that is, whether it is included in the target node list 111B (S123). If the node is under its management, the centralized monitoring apparatus 100B reconfirms how the state of the reported node has changed (S122). On receiving the state of the node (S124), the centralized monitoring apparatus 100B determines the failure state (S125) and performs processing such as issuing an alarm (S126).

The failure state determination process of step 124 is a process in which the centralized monitoring device 100B acquires the state of the target node 10 and determines whether the node is in a failure state. If the node state re-check of step 122 uses Ping, the determination is based on the presence or absence of a response; if the state is collected by SNMP (Simple Network Management Protocol), it is based on the I/F state and the like.

The processing of step 122 in the centralized monitoring device 100B exists to discard information about devices that the device itself does not cover. Such information may be transferred to it when, for example, a plurality of centralized monitoring devices 100B have to be installed in one network and an incorrect setting is made in the plurality of distributed processing devices 100B.
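The handling of a state change notification described above can be sketched as follows. This is only an illustrative sketch, not the implementation in the specification: the class name `CentralMonitor`, the `recheck` callback (standing in for a Ping or SNMP query), and the returned strings are all hypothetical, and the target node list 111B is modeled as a plain set of IP addresses.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Set


@dataclass
class CentralMonitor:
    """Hypothetical model of the centralized monitoring device 100B."""

    target_nodes: Set[str]          # stands in for the target node list 111B
    recheck: Callable[[str], bool]  # assumed probe: True if the node responds
    alarms: List[str] = field(default_factory=list)

    def on_state_change(self, ip: str) -> str:
        # Discard notifications for nodes that are not under management.
        if ip not in self.target_nodes:
            return "discarded"
        # The notification carries only the IP address, so the central
        # device has to re-check the current state of the node itself.
        if self.recheck(ip):
            return "recovered"
        # No response on re-check: treat as a failure and raise an alarm.
        self.alarms.append(ip)
        return "alarm"


monitor = CentralMonitor(
    target_nodes={"192.0.2.1", "192.0.2.2"},
    recheck=lambda ip: ip == "192.0.2.2",  # only 192.0.2.2 answers in this demo
)
results = [
    monitor.on_state_change(ip)
    for ip in ("192.0.2.1", "192.0.2.2", "198.51.100.9")
]
```

Keeping the probe as an injected callback reflects the text's point that the re-check may be either Ping or SNMP; the discard branch is what protects one centralized monitoring device from notifications meant for another.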

The network configuration of the distributed monitoring system 500 will be described in detail with reference to FIG. 7. In FIG. 7, the distributed monitoring system 500 consists of the centralized monitoring device 100B, the distributed processing device 300-1, and the network device 10-1 located at the center (segment 1); the network device 10-2 located at site A (segment 2); the distributed processing device 300-2 and the network device 10-3 located at site B; the network device 10-4 located at site C (segment 4); and segments 5 to 7, which connect the network devices 10.

The network device information table held by the centralized monitoring device 100B and the distributed processing devices 300 will be described with reference to FIG. 8. The table held by the centralized monitoring device 100B lists all the network devices 10 under its control, whereas the table held by a distributed processing device 300 lists only the network devices 10 under that distributed processing device 300. In FIG. 8, the network device information table 80 consists of a type 81, a name 82, an I/F number 83, an I/F type 84, and an IP address 85. The network device information table 80 may be the same as the node list 20.

The network configuration table 90 held by the centralized monitoring device 100B will be described with reference to FIG. 9. The network configuration table 90 consists of a communication device 91, an I/F number 92, and a connection destination 93. The communication device 91 column covers the centralized monitoring device 100B, the distributed processing devices 300, and the network devices 10.

The failure determination condition table 112B held by the centralized monitoring device 100B will be described with reference to FIG. 10. The failure determination condition table 112B consists of a name 1121, an I/F number 1122, and a monitoring target 1123.

The network as seen from the distributed processing devices will be described with reference to FIG. 11. The network configuration in FIG. 11 is the same as in FIG. 7. However, FIG. 11(a) shows the arrangement of the network devices 10 as seen from the distributed processing device 300-1, and FIG. 11(b) shows the arrangement as seen from the distributed processing device 300-2. In FIG. 11, the distributed processing devices 300-1 and 300-2 are in the first layer. For the distributed processing device 300-1, the network device 10-1 is in the second layer, the network devices 10-2 and 10-3 are in the third layer, and the network device 10-4 is in the fourth layer. For the distributed processing device 300-2, the network device 10-3 is in the second layer, the network devices 10-4 and 10-1 are in the third layer, and the network device 10-1 is in the fourth layer.

Step 111 of FIG. 6 will be described in more detail with reference to FIG. 12. The target node setting unit 130 reads a target node from the target node list 111B (S501). The target node setting unit 130 refers to the node transfer condition table 132 and determines whether a condition matches (S502). If a condition matches, the target node setting unit 130 moves on to processing the next node (S504). If no condition matches in step 502, the target node setting unit 130 calculates the network distances (S506) and determines whether the calculated results include equal distances (S507). If the distances are not equal, the node is added to the target node list of the nearer distributed processing device (S508) and processing moves to step 504. If the distances are equal in step 507, the node is added to the target node list of a randomly chosen distributed processing device (S509) and processing moves to step 504.

In FIG. 13, FIG. 13(a) is the node list held by the centralized monitoring device 100B. FIG. 13(b) shows the transfer conditions input to the centralized monitoring device 100B. FIG. 13(c) is the part of the transfer node list determined from the transfer conditions that is held by the distributed processing device 300-1. FIG. 13(d) is the part of the transfer node list determined from the transfer conditions that is held by the distributed processing device 300-2. FIG. 13(e) is the list of transfer nodes left undecided by the transfer conditions. FIG. 13(f) shows the distances from the distributed processing devices 300 to the undecided network device 10-4. FIG. 13(g) is the node list finally transferred to the distributed processing device 300-1. FIG. 13(h) is the node list finally transferred to the distributed processing device 300-2.

The network device 10-4 shown in FIG. 13(e) is a device whose distributed processing device is left undecided by the operator's input. The centralized monitoring device 100B calculates the distance from each distributed processing device 300 to the network device 10-4 and, as shown in FIG. 13(f), determines that the distributed processing device 300-2 should handle it.
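The assignment flow of FIG. 12 and the example of FIG. 13 can be sketched as follows. This is a sketch under stated assumptions, not the method as specified: the text does not define how the network distance is computed, so a breadth-first hop count is assumed here; the transfer conditions are modeled as a direct mapping from node to distributed processing device; and the link list and the contents of `transfer_conditions` are hypothetical values chosen to be consistent with the layers described for FIG. 11. The function names `hop_count` and `assign_nodes` are illustrative.

```python
import random
from collections import deque
from typing import Dict, List, Optional, Sequence, Tuple


def hop_count(links: Sequence[Tuple[str, str]], start: str, goal: str) -> Optional[int]:
    """Breadth-first hop count between two devices (assumed distance metric)."""
    adjacency: Dict[str, set] = {}
    for a, b in links:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        if node == goal:
            return dist
        for nxt in adjacency.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None  # goal is not reachable from start


def assign_nodes(nodes, transfer_conditions, processors, links, rng=random):
    """Build one target node list per distributed processing device."""
    lists: Dict[str, List[str]] = {p: [] for p in processors}
    for node in nodes:                                    # S501
        if node in transfer_conditions:                   # S502: condition matches
            lists[transfer_conditions[node]].append(node)
            continue                                      # S504: next node
        # S506: distance from every distributed processing device to the node.
        dist = {p: hop_count(links, p, node) for p in processors}
        reachable = {p: d for p, d in dist.items() if d is not None}
        if not reachable:
            continue  # assumption: an unreachable node is left unassigned
        best = min(reachable.values())
        nearest = [p for p in processors if reachable.get(p) == best]
        # S507-S509: unique nearest device, or a random pick among ties.
        chosen = nearest[0] if len(nearest) == 1 else rng.choice(nearest)
        lists[chosen].append(node)
    return lists


# Hypothetical links and operator input, for illustration only.
links = [("300-1", "10-1"), ("10-1", "10-2"), ("10-1", "10-3"),
         ("10-3", "10-4"), ("300-2", "10-3")]
transfer_conditions = {"10-1": "300-1", "10-2": "300-1", "10-3": "300-2"}
node_lists = assign_nodes(["10-1", "10-2", "10-3", "10-4"],
                          transfer_conditions, ["300-1", "300-2"], links)
```

With these assumed links, the network device 10-4 is two hops from 300-2 and three hops from 300-1, so it is added to the list for 300-2, matching the outcome described for FIG. 13(f).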

FIG. 14 explains the processing of steps 116 to 118 of FIG. 6 in detail. The monitoring function unit 340 of the distributed processing device 300 issues a Ping (S601), waits to receive the Ping response (S602), and refers to the state holding table 322 to determine whether the state has changed (S603). If step 603 finds no state change, processing moves to the next node (S604). If step 603 finds a state change, a state change notification is sent to the centralized monitoring device (S605), the state holding table 322 is updated (S606), and processing moves to the next node (S604).

In FIG. 15, the state holding table 322 consists of a network device name and the presence or absence of a response. The monitoring function unit 340 compares the state holding table 322 with the presence or absence of a Ping response and, when the state has changed, sends a state change notification and updates the state holding table 322.
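One polling pass of the distributed processing device, as described for FIG. 14 and FIG. 15, might look like the sketch below. The names `poll_once`, `ping`, and `notify` are hypothetical; `ping` stands in for an actual ICMP probe and `notify` for the transmission to the centralized monitoring device. The table is keyed by IP address here for simplicity, although FIG. 15 describes it as keyed by network device name.

```python
from typing import Callable, Dict, Iterable


def poll_once(
    nodes: Iterable[str],
    ping: Callable[[str], bool],     # assumed probe: True if a response arrived
    state_table: Dict[str, bool],    # stands in for the state holding table 322
    notify: Callable[[str], None],   # assumed sender to the central device
) -> None:
    """Check every node once and report only the nodes whose state changed."""
    for ip in nodes:
        responded = ping(ip)                     # S601-S602
        # A node with no entry yet defaults to "response present" (normal).
        previous = state_table.get(ip, True)
        if responded != previous:                # S603: state changed
            notify(ip)                           # S605: send the IP address only
            state_table[ip] = responded          # S606: update the table
        # S604: continue with the next node


table: Dict[str, bool] = {}
sent = []
nodes = ["192.0.2.1", "192.0.2.2"]

poll_once(nodes, lambda ip: ip != "192.0.2.2", table, sent.append)  # .2 goes down
poll_once(nodes, lambda ip: ip != "192.0.2.2", table, sent.append)  # no change
poll_once(nodes, lambda ip: True, table, sent.append)               # .2 recovers
```

Because only transitions are reported, the second pass sends nothing even though 192.0.2.2 is still down, which is how the scheme keeps notification traffic to the centralized monitoring device low.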

In FIG. 16, two centralized monitoring devices 100B-1 and 100B-2, three distributed processing devices 300-1 to 300-3, and two network devices 10-1 and 10-4 are located at the center; the network device 10-2 is located at site A, the network device 10-3 at site B, and the network device 10-5 at site X. The centralized monitoring device 100B-1 forms a first monitoring system with the distributed processing devices 300-1 and 300-2 and monitors the network devices 10-1, 10-2, and 10-3. The centralized monitoring device 100B-2 forms a second monitoring system with the distributed processing device 300-3 and monitors the network devices 10-4 and 10-5.

In FIG. 17, the centralized monitoring device 100B, two distributed processing devices 300-1 and 300-2, and two network devices 10-1 and 10-5 are located at the center; the network device 10-2 is located at site A, the network device 10-3 at site B, the network device 10-4 at site C, and the network device 10-6 at site X. The monitoring system of FIG. 17 places a plurality of distributed processing devices and the centralized monitoring device at a single site and monitors a large-scale network centrally. In this form, the distributed processing devices are treated purely as processing devices for reducing the load on the centralized monitoring device.

In FIG. 18, the centralized monitoring device 100B, the distributed processing device 300-1, and two network devices 10-1 and 10-5 are located at the center; the network device 10-2 is located at site A, the distributed processing device 300-2 and the network device 10-3 at site B, the network device 10-4 at site C, and the network device 10-6 at site X. The monitoring system of FIG. 18 distributes a plurality of distributed processing devices across the monitored network, with the centralized monitoring device performing overall management. In this form, distributing the distributed processing devices themselves within the network can also be expected to reduce the amount of traffic used for monitoring.

FIG. 1 is a block diagram of a distributed monitoring system.
FIG. 2 is a diagram explaining a node list.
FIG. 3 is a diagram explaining the functional blocks and operation of the distributed monitoring system.
FIG. 4 is a block diagram of a distributed monitoring system.
FIG. 5 is a diagram explaining a node list.
FIG. 6 is a diagram explaining the functional block diagram and operation of the distributed monitoring system.
FIG. 7 is a block diagram explaining the configuration of the distributed monitoring network.
FIG. 8 is a diagram explaining the network device information table.
FIG. 9 is a diagram explaining the network configuration information table.
FIG. 10 is a diagram explaining the failure determination condition table.
FIG. 11 is a diagram explaining the network topology.
FIG. 12 is a flowchart explaining the target node list creation process of the centralized monitoring device.
FIG. 13 is a diagram explaining the node list generation process.
FIG. 14 is a diagram explaining generation of the state holding table.
FIG. 15 is a diagram explaining the state holding table.
FIG. 16 is a diagram explaining the arrangement of a plurality of distributed monitoring systems on a network.
FIG. 17 is a diagram explaining an arrangement (part 1) of distributed processing devices on the same network.
FIG. 18 is a diagram explaining an arrangement (part 2) of distributed processing devices on the same network.

Explanation of symbols

10: network device, 12: communication unit, 15: control unit, 20: node list, 100: centralized monitoring device, 110: monitoring function unit, 120: communication unit, 130: target node setting unit, 140: storage unit, 200: distributed monitoring device, 210: monitoring function unit, 220: communication unit, 300: distributed processing device, 310: target node registration unit, 320: storage unit, 330: communication unit, 340: monitoring control unit, 500: distributed monitoring system.

Claims

In a network monitoring system composed of a centralized monitoring device and a plurality of distributed processing devices,
The centralized monitoring device selects, based on the distances between a network device constituting the network and the plurality of distributed processing devices, a first distributed processing device that communicates with the network device, and transmits the address of the network device to the first distributed processing device,
The first distributed processing device periodically transmits a first packet to the network device to check its state, and transmits a second packet to the centralized monitoring device when the state of the network device has changed from the previous period,
The network monitoring system, wherein the centralized monitoring device issues an alarm when receiving the second packet.

The network monitoring system according to claim 1,
The centralized monitoring device, when receiving the second packet, determines whether or not the network device targeted by the second packet is under management.

The network monitoring system according to claim 1 or 2,
The centralized monitoring device, when receiving the second packet, checks the state of the network device targeted by the second packet.

In a centralized monitoring device connected to a plurality of distributed processing devices,
A first distributed processing device that communicates with a network device is selected based on the distances between the network device constituting the network and the plurality of distributed processing devices, and the address of the network device is transmitted to the first distributed processing device,
A centralized monitoring device that issues an alarm when a packet is received from the first distributed processing device.

The centralized monitoring device according to claim 4,
The centralized monitoring device, when receiving the second packet, determines whether or not a network device targeted for the packet is under management.

The centralized monitoring device according to claim 4 or 5,
The centralized monitoring device, when receiving the second packet, confirms a state of a network device targeted by the packet.