JP2014036310A

JP2014036310A - Apparatus and method for evaluating effect

Info

Publication number: JP2014036310A
Application number: JP2012175992A
Authority: JP
Inventors: Keiji Nomura; 啓仁野村; Takehiro Kawada; 丈浩川田; Hiroshi Yamada; 博司山田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2012-08-08
Filing date: 2012-08-08
Publication date: 2014-02-24

Abstract

PROBLEM TO BE SOLVED: To easily evaluate effects on a network or services by events.SOLUTION: An acquisition unit acquires log information of network equipment and configuration information of a network included in the network. An estimation unit estimates a range and a service affected by the event which has occurred in the network on the basis of the log information and the configuration information of the network acquired by the acquisition unit. An inspection unit inspects a content of the effect, by the event occurred in the network, on the service in the range on the network estimated by the estimation unit. A notification unit notifies a network administrator of an inspection result from the inspection unit.

Description

この発明は、影響評価装置及び影響評価方法に関する。 The present invention relates to an influence evaluation apparatus and an influence evaluation method.

従来、ネットワーク（ＮＷ：Network）のシステム管理者が障害などのイベントや、イベントの予兆を検出して、その原因を推定するための情報として、ＮＷ機器の管理情報が用いられている。例えば、システム管理者は、ＲＦＣ１１５６、ＲＦＣ１２１３で標準化されているＭＩＢ（Management Information Base：管理情報ベース）情報をＳＮＭＰ（Simple Network Management Protocol）によりＮＷ経由で取得して管理することで、イベントや、イベントの予兆を検出して、原因を推定する。ここで、ＭＩＢ情報を取得して管理するためのツールは数多く知られており、例えば、ＴＷＳＮＭＰＭａｎａｇｅｒなどが知られている。 2. Description of the Related Art Conventionally, NW device management information is used as information for a network (NW) system administrator to detect an event such as a failure or a sign of an event and estimate the cause of the event. For example, a system administrator acquires and manages MIB (Management Information Base) information standardized by RFC1156 and RFC1213 via NW by SNMP (Simple Network Management Protocol), thereby managing events and events. Detect the signs of, and estimate the cause. Here, many tools for acquiring and managing MIB information are known, for example, TWSNMP Manager is known.

また、例えば、システム管理者は、各ＮＷ機器のログを収集して管理するＳｙｓｌｏｇコレクタを用いて、イベントや、イベントの予兆を検出して、原因を推定する。ここで、Ｓｙｓｌｏｇコレクタは、システムの動作状況やメッセージなどのログを記録するプログラムとして一般的に利用されているＳｙｓｌｏｇが備えるログの送受信機能が利用されており、例えば、ＫｉｗｉＳｙｓｌｏｇＳｅｒｖｅｒなどが知られている。なお、ログの送受信機能のプロトコルは、ＲＦＣ５４２４によって標準化されている。 In addition, for example, the system administrator uses an Syslog collector that collects and manages logs of each NW device to detect an event or an event precursor and estimate the cause. Here, the Syslog collector uses a log transmission / reception function provided in Syslog, which is generally used as a program for recording logs such as system operation statuses and messages. For example, Kiwi Syslog Server is known. ing. Note that the protocol of the log transmission / reception function is standardized by RFC5424.

さらに、上述したＳＮＭＰマネージャやＳｙｓｌｏｇコレクタによって取得された管理情報をトリガとして、予め設定されたアクションを自動的に実行する技術も実用化されている。例えば、あるＳｙｓｌｏｇメッセージが観測された場合や、監視しているＭＩＢオブジェクトの値が閾値を超えた場合のアクションとして、Ｓｈｏｗコマンドを送信して追加情報を取得するように定義しておく。これにより、トリガが発生すると自動的に対象機器の状況をさらに詳細に知るための補足情報が取得される。このような技術としては、例えば、ＣｉｓｃｏＩＯＳ（登録商標）に搭載されているＥＥＭ（Embedded Event Manager）や、ＪＵＮＯＳに搭載されているＪＵＮＯＳｃｒｉｐｔなどが知られている。 Furthermore, a technique for automatically executing a preset action using the management information acquired by the SNMP manager or the Syslog collector as a trigger has been put into practical use. For example, when a certain Syslog message is observed or when the value of the monitored MIB object exceeds a threshold value, a Show command is transmitted and additional information is acquired. Thereby, when a trigger occurs, supplementary information for automatically knowing the status of the target device in more detail is acquired. As such a technique, for example, EEM (Embedded Event Manager) installed in Cisco IOS (registered trademark), JUNOScript installed in JUNOS, and the like are known.

J.Case et al, "A Simple Network Management Protocol(SNMP)", Network Working Group, RFC1157, May 1990, ［online］,［平成２４年６月２６日検索］、インターネット＜http://tools.ietf.org/html/rfc1157＞J. Case et al, "A Simple Network Management Protocol (SNMP)", Network Working Group, RFC1157, May 1990, [online], [searched June 26, 2012], Internet <http: //tools.ietf .org / html / rfc1157> twise labo., "TWSNMP Manager", [online]、[平成２４年６月２６日検索]インターネット＜http://www.twise.co.jp/twsnmp.html＞twise labo., "TWSNMP Manager", [online], [Search June 26, 2012] Internet <http://www.twise.co.jp/twsnmp.html> R.Gerhards, "The Syslog Protocol", Network Working Group, RFC5424, March 2009, ［online］,［平成２４年６月２６日検索］、インターネット＜http://tools.ietf.org/html/rfc5424＞R. Gerhards, "The Syslog Protocol", Network Working Group, RFC5424, March 2009, [online], [searched June 26, 2012], Internet <http://tools.ietf.org/html/rfc5424> ＣＩＳＣＯ, "EEM（Embedded Event Manager）機能", [online]、[平成２４年６月２６日検索]インターネット＜http://www.cisco.com/cisco/web/support/JP/102/1020/1020031_eem.html＞CISCO, "EEM (Embedded Event Manager) Function", [online], [Search June 26, 2012] Internet <http://www.cisco.com/cisco/web/support/JP/102/1020/ 1020031_eem.html> JUNIPER NETWORKS, "Junos XML Management Protocol（Junoscript）Download Software", [online]、[平成２４年６月２６日検索]インターネット＜http://www.juniper.net/support/xml/junoscript/index.html＞JUNIPER NETWORKS, "Junos XML Management Protocol (Junoscript) Download Software", [online], [Search June 26, 2012] Internet <http://www.juniper.net/support/xml/junoscript/index.html >

しかしながら、上述した従来技術では、イベントがネットワークやサービスに与える影響を評価することが困難であった。例えば、上述した従来技術では、イベントの影響を評価するためには、ＮＷ管理者が、イベントの詳細情報に基づいてＮＷ上で影響が及ぶ範囲を推定し、適切な評価手段を選択して評価を行わなければならず、イベントがネットワークやサービスに与える影響を評価することが困難であった。また、例えば、上述した従来技術では、イベントに関する情報のみでなく、管理者がＮＷの物理構成及び論理構成の情報を把握しておく必要があり、ＮＷ管理者にとっては高度な知識が求められる上、多くの稼動が必要となり、イベントがネットワークやサービスに与える影響を評価することが困難であった。 However, with the above-described conventional technology, it has been difficult to evaluate the influence of an event on a network or a service. For example, in the above-described conventional technology, in order to evaluate the influence of an event, the NW administrator estimates the range of influence on the NW based on the detailed information of the event, and selects and evaluates an appropriate evaluation means. It was difficult to assess the impact of events on networks and services. In addition, for example, in the above-described conventional technology, it is necessary for an administrator to grasp not only information related to an event but also information on the physical configuration and logical configuration of the NW, which requires advanced knowledge for the NW administrator. Many operations were required, and it was difficult to evaluate the impact of events on networks and services.

そこで、本願に係る技術は、上述した従来技術の問題に鑑みてなされたものであって、イベントがネットワークやサービスに与える影響を容易に評価することを可能にする影響評価装置及び影響評価方法を提供することを目的とする。 Therefore, the technology according to the present application has been made in view of the above-described problems of the prior art, and an impact evaluation device and an impact evaluation method that enable an event to easily evaluate the impact on a network or service. The purpose is to provide.

上述した課題を解決し、目的を達成するため、本願に係る影響評価装置は、ネットワークに含まれる複数のネットワーク機器に関するログ情報及び前記ネットワークにおける前記複数のネットワーク機器の構成を示す構成情報を取得する取得部と、前記取得部によって取得された前記ログ情報及び前記ネットワークの構成情報に基づいて、前記ネットワークに発生した障害によって影響を受ける当該ネットワーク上の範囲及びサービスを推定する推定部と、前記推定部によって推定された前記ネットワーク上の範囲における前記サービスに対して、前記ネットワークに発生した障害が与える影響の内容を検査する検査部と、前記検査部による検査結果を前記ネットワークの管理者に対して通知する通知部とを備えることを特徴とする。 In order to solve the above-described problems and achieve the object, the impact evaluation apparatus according to the present application acquires log information regarding a plurality of network devices included in a network and configuration information indicating configurations of the plurality of network devices in the network. An estimation unit that estimates a range and a service on the network affected by a failure that has occurred in the network based on the log information and the configuration information of the network acquired by the acquisition unit, and the estimation An inspection unit that inspects the content of the influence of the failure that has occurred on the network for the service in the range on the network estimated by the unit, and the inspection result by the inspection unit to the administrator of the network And a notification unit for notifying.

本願に係る装置は、イベントがネットワークやサービスに与える影響を容易に評価することを可能にする。 The apparatus according to the present application makes it possible to easily evaluate the influence of an event on a network or a service.

図１は、第１の実施形態に係るネットワークの構成の一例を示す図である。FIG. 1 is a diagram illustrating an example of a network configuration according to the first embodiment. 図２は、第１の実施形態に係る影響評価装置の構成の一例を示す図である。FIG. 2 is a diagram illustrating an example of the configuration of the impact evaluation apparatus according to the first embodiment. 図３は、第１の実施形態に係るログ情報記憶部によって記憶される情報の一例を示す図である。FIG. 3 is a diagram illustrating an example of information stored by the log information storage unit according to the first embodiment. 図４は、第１の実施形態に係る構成情報記憶部によって記憶される情報の一例を示す図である。FIG. 4 is a diagram illustrating an example of information stored by the configuration information storage unit according to the first embodiment. 図５は、第１の実施形態に係るルール情報記憶部によって記憶される情報の一例を示す図である。FIG. 5 is a diagram illustrating an example of information stored by the rule information storage unit according to the first embodiment. 図６は、第１の実施形態に係るテスト情報記憶部によって記憶される情報の一例を示す図である。FIG. 6 is a diagram illustrating an example of information stored by the test information storage unit according to the first embodiment. 図７は、第１の実施形態に係る影響評価装置を含むネットワーク内における処理の流れを説明するための模式図である。FIG. 7 is a schematic diagram for explaining the flow of processing in the network including the impact evaluation apparatus according to the first embodiment. 図８は、第１の実施形態に係る通知部によって通知されるイベントレポートの一例を示す図である。FIG. 8 is a diagram illustrating an example of an event report notified by the notification unit according to the first embodiment. 図９は、第１の実施形態に係る影響評価装置による処理の手順を示すフローチャートである。FIG. 9 is a flowchart illustrating a procedure of processing performed by the impact evaluation apparatus according to the first embodiment.

以下に添付図面を参照して、本願の開示する影響評価装置及び影響評価方法の実施形態を詳細に説明する。なお、本願の開示する影響評価装置及び影響評価方法は、以下の実施形態により限定されるものではない。 Exemplary embodiments of an impact evaluation apparatus and an impact evaluation method disclosed in the present application will be described below in detail with reference to the accompanying drawings. In addition, the impact evaluation apparatus and the impact evaluation method which this application discloses are not limited by the following embodiment.

［第１の実施形態］
［ネットワークの構成］
まず、第１の実施形態に係る影響評価装置が適用されるネットワークの構成の一例を説明する。図１は、第１の実施形態に係るネットワーク１０の構成の一例を示す図である。図１に示すように、ネットワーク１０は、中継網１１と、ユーザセグメント１２ａと、ユーザセグメント１２ｂと、サーバセグメント１３ａと、サーバセグメント１３ｂとを有する。そして、ネットワーク１０は、ＣＭＤＢ（Configuration Management DataBase）２０、ＥＭ（Event Manager）３０及び影響評価装置１００が接続される。なお、ネットワーク１０の構成は、図１に示す例に限定されるものではない。例えば、ユーザセグメント及びサーバセグメントは図示された数に限られず、多数のユーザセグメント及びサーバセグメントが、中継網１１に接続されている場合であってもよい。 [First Embodiment]
[Network configuration]
First, an example of a network configuration to which the impact evaluation apparatus according to the first embodiment is applied will be described. FIG. 1 is a diagram illustrating an example of a configuration of a network 10 according to the first embodiment. As shown in FIG. 1, the network 10 includes a relay network 11, a user segment 12a, a user segment 12b, a server segment 13a, and a server segment 13b. The network 10 is connected to a CMDB (Configuration Management DataBase) 20, an EM (Event Manager) 30, and an impact assessment apparatus 100. The configuration of the network 10 is not limited to the example shown in FIG. For example, the number of user segments and server segments is not limited to the illustrated number, and a large number of user segments and server segments may be connected to the relay network 11.

中継網１１は、例えば、図１に示すように、ルータＲ１〜ルータＲ４を有し、ユーザセグメント１２ａと、ユーザセグメント１２ｂと、サーバセグメント１３ａと、サーバセグメント１３ｂとの間で実行される通信を中継する。なお、図１に示す中継網１１はあくまでも一例であり、図１に示す例に限定されるものではない。 For example, as shown in FIG. 1, the relay network 11 includes routers R1 to R4, and performs communication performed between the user segment 12a, the user segment 12b, the server segment 13a, and the server segment 13b. Relay. The relay network 11 shown in FIG. 1 is merely an example, and is not limited to the example shown in FIG.

ユーザセグメント１２ａ及びユーザセグメント１２ｂは、中継網１１を介してサーバセグメント１３ａ及びサーバセグメント１３ｂに含まれるサーバと通信を実行するユーザ端末をそれぞれ有する。また、ユーザセグメント１２ａ及びユーザセグメント１２ｂは、影響評価装置１００によって利用されるリファレンスサイトＲＳ１及びリファレンスサイトＲＳ２をそれぞれ有する。 The user segment 12a and the user segment 12b have user terminals that communicate with the servers included in the server segment 13a and the server segment 13b via the relay network 11, respectively. Moreover, the user segment 12a and the user segment 12b respectively have a reference site RS1 and a reference site RS2 that are used by the impact evaluation apparatus 100.

リファレンスサイトＲＳ１及びリファレンスサイトＲＳ２は、影響評価装置１００によって送信されたテストコマンドを受信して、受信したテストコマンドに対応するプログラムを実行する。具体的には、リファレンスサイトＲＳ１及びリファレンスサイトＲＳ２は、影響評価装置１００によって送信されたテストコマンドに基づいて、障害などのイベントによって影響を受けるサービスの品質をテストする。例えば、リファレンスサイトＲＳ１及びリファレンスサイトＲＳ２は、インターネット制御メッセージプロトコル（ＩＣＭＰ：Internet Control Message Protocol）を用いたＰｉｎｇやＴｒａｃｅＲｏｕｔｅによる計測や、ＨＴＴＰ（HyperText Transfer Protocol）などを実行することにより、遅延や疎通性などをテストする。リファレンスサイトＲＳ１及びリファレンスサイトＲＳ２は、例えば、セグメントやサーバなどである。 The reference site RS1 and the reference site RS2 receive the test command transmitted by the impact evaluation apparatus 100 and execute a program corresponding to the received test command. Specifically, the reference site RS1 and the reference site RS2 test the quality of services affected by an event such as a failure based on the test command transmitted by the impact evaluation apparatus 100. For example, the reference site RS1 and the reference site RS2 perform delay or communication by executing measurement using Ping or Trace Route using the Internet Control Message Protocol (ICMP), HTTP (HyperText Transfer Protocol), or the like. Test sex and more. Reference site RS1 and reference site RS2 are a segment, a server, etc., for example.

サーバセグメント１３ａ及びサーバセグメント１３ｂは、各種サービスを提供するサービス提供サーバＳ１及びサービス提供サーバＳ２をそれぞれ有する。サービス提供サーバＳ１及びサービス提供サーバＳ２は、ユーザセグメント１２ａ及びユーザセグメント１２ｂに含まれるユーザ端末に対して各種サービスを提供する。 The server segment 13a and the server segment 13b respectively have a service providing server S1 and a service providing server S2 that provide various services. The service providing server S1 and the service providing server S2 provide various services to the user terminals included in the user segment 12a and the user segment 12b.

ここで、ネットワーク１０に含まれる中継網１１及び各セグメント（ユーザセグメント１２ａ、１２ｂ及びサーバセグメント１３ａ、１３ｂ）は、それぞれ共通の管理下において運用されるネットワークの集合「ＡＳ（Autonomous System：自律システム）」である。 Here, the relay network 11 and each segment (user segments 12a and 12b and server segments 13a and 13b) included in the network 10 are a set of networks “AS (Autonomous System) that are operated under common management. Is.

また、ネットワーク１０に含まれる各ネットワーク機器（例えば、中継網１１に含まれるルータＲ１〜Ｒ４や、各セグメントと中継網１１を接続するルータなど：以下、ＮＷ機器と記す場合がある）は、それぞれＳｙｓｌｏｇプロトコルが適用され、所定のイベントが発生した場合に、Ｓｙｓｌｏｇメッセージを出力し、出力したＳｙｓｌｏｇメッセージをＥＭ３０に送信する。例えば、各ネットワーク機器は、データの中継処理を行った場合や、ポート間のリンクがダウンする事イベントであるリンクダウンが発生した場合や、ポート間のリンクが形成されるイベントであるリンクアップが発生した場合等に、Ｓｙｓｌｏｇメッセージを出力する。 In addition, each network device included in the network 10 (for example, routers R1 to R4 included in the relay network 11, routers connecting the segments to the relay network 11, and the like; hereinafter may be referred to as NW devices), respectively, When the Syslog protocol is applied and a predetermined event occurs, a Syslog message is output, and the output Syslog message is transmitted to the EM 30. For example, each network device has a data relay process, a link down that is an event that a link between ports goes down, or a link up that is an event that forms a link between ports. If it occurs, a Syslog message is output.

このとき、各ＮＷ機器は、「タイムスタンプ」と、「イベントが発生した箇所」と、「発生したイベントの内容」とを含むＳｙｓｌｏｇメッセージを出力する。ここでいう「タイムスタンプ」とは、例えば、Ｓｙｓｌｏｇメッセージを出力した時刻、又は、イベントが発生した時刻等を示す。また、「イベントが発生した箇所」とは、例えば、ルータや、ルータを構成する構成部品等を示す。ルータの構成部品の例としては、ポートや、複数のポートを有するラインカード等が挙げられる。 At this time, each NW device outputs a Syslog message including “time stamp”, “location where the event has occurred”, and “content of the event that has occurred”. Here, the “time stamp” indicates, for example, the time when the Syslog message is output or the time when the event occurs. Further, “location where an event has occurred” indicates, for example, a router, a component constituting the router, or the like. Examples of router components include ports, line cards having a plurality of ports, and the like.

例えば、ルータＲ１がポート１ａを有し、ルータＲ２がポート２ａを有し、ポート１ａとポート２ａとが接続されていたが、ポート１ａとポート２ａとの間においてリンクダウンが発生したものとする。かかる場合には、ルータＲ１は、例えば、「２０１１年７月１５日１０時３０分１０秒ルータＲ１ポート１ａリンクダウン」といったＳｙｓｌｏｇメッセージを出力する。また、ルータＲ２も同様のＳｙｓｌｏｇメッセージを出力する。 For example, it is assumed that the router R1 has the port 1a, the router R2 has the port 2a, and the port 1a and the port 2a are connected, but a link down occurs between the port 1a and the port 2a. . In such a case, the router R1 outputs a Syslog message such as “July 15, 2011 10:30:10 router R1 port 1a link down”. Router R2 also outputs a similar Syslog message.

また、各ＮＷ機器は、それぞれＭＩＢ情報（Management Information Base）を有する。例えば、各ＮＷ機器は、ＲＦＣ１２１３によって定義された自装置のリソース状況やポートごとの管理情報を有する。一例を挙げると、各ＮＷ機器は、システムやインターフェースなどの監視対象グループごとに複数のオブジェクトを有するＭＩＢ情報を管理する。ここで、ＭＩＢ情報は、適宜更新される。 Each NW device has MIB information (Management Information Base). For example, each NW device has the resource status of its own device defined by RFC 1213 and management information for each port. For example, each NW device manages MIB information having a plurality of objects for each monitoring target group such as a system or an interface. Here, the MIB information is updated as appropriate.

さらに、各ＮＷ機器は、ＳｙｓｌｏｇメッセージやＭＩＢ情報をトリガとして自装置の状態を詳細に調べるためのコマンドを実行するスクリプト（Event Trigger Script：以下、ＥＴＳと記す場合がある）がそれぞれ実装されている。例えば、各ＮＷ機器は、ＣＩＳＣＯＥＥＭや、Ｊｕｎｉｐｅｒｓｃｒｉｐｔなどが実装される。一例を挙げると、各ＮＷ機器は、所定のＳｙｓｌｏｇメッセージ（例えば、リンクダウンなど）やＭＩＢ情報（例えば、エラー以外の理由で破棄された受信パケット数が所定の閾値を超えたなど）をトリガとして、Ｓｈｏｗコマンドを実行して自装置の状態の詳細情報を収集する。 Furthermore, each NW device has a script (Event Trigger Script: hereinafter sometimes referred to as ETS) for executing a command for examining the state of the own device in detail using a Syslog message or MIB information as a trigger. . For example, CISCO EEM, Juniper script, etc. are mounted on each NW device. For example, each NW device uses a predetermined Syslog message (for example, link down) or MIB information (for example, the number of received packets discarded for a reason other than an error exceeds a predetermined threshold) as a trigger. , The Show command is executed to collect detailed information on the status of the device itself.

ＣＭＤＢ２０は、各ＮＷ機器に設定されている設定情報を各ＮＷ機器から取得して、管理する。これにより、ＣＭＤＢ２０は、各ＮＷ機器が有するポート間の接続関係に関する情報をネットワーク１０の構成情報として取得することができる。例えば、ＣＭＤＢ２０は、ネットワーク１０の構成情報として、ルータＲ１のポート１ａとルータＲ２のポート２ａとが接続されていることを示す情報等を取得することができる。 The CMDB 20 acquires and manages setting information set in each NW device from each NW device. As a result, the CMDB 20 can acquire information regarding the connection relationship between the ports of each NW device as the configuration information of the network 10. For example, the CMDB 20 can acquire information indicating that the port 1a of the router R1 and the port 2a of the router R2 are connected as the configuration information of the network 10.

ＥＭ３０は、Ｓｙｓｌｏｇメッセージ収集機能と、Ｓｙｓｌｏｇメッセージ以外のシステムログ収集機能と、イベント管理機能とを有する。具体的には、ＥＭ３０は、Ｓｙｓｌｏｇコレクタを有し、各ＮＷ機器からＳｙｓｌｏｇメッセージを収集する。また、ＥＭ３０は、Ｓｙｓｌｏｇメッセージ以外のログ情報であるシステムログ情報として、ＭＩＢ情報や、ＥＴＳの結果（Ｓｈｏｗコマンドなどによって収集された詳細情報）などを収集する。例えば、ＥＭ３０がＳＮＭＰマネージャを実装し、各ＮＷ機器に実装されたＳＮＭＰエージェントに対してリクエストコマンドを発行することにより、ＭＩＢ情報を収集する。また、例えば、ＥＭ３０は、ＴＦＴＰ（Trivial File Transfer Protocol）が実装され、ＴＦＴＰリクエストメッセージを各ＮＷ装置に対して送信することにより、ＥＴＳの結果などを収集する。 The EM 30 has a Syslog message collection function, a system log collection function other than the Syslog message, and an event management function. Specifically, the EM 30 has a Syslog collector and collects Syslog messages from each NW device. Further, the EM 30 collects MIB information, ETS results (detailed information collected by the Show command), and the like as system log information that is log information other than the Syslog message. For example, the EM 30 implements an SNMP manager, and collects MIB information by issuing a request command to an SNMP agent installed in each NW device. Further, for example, the EM 30 is mounted with a TFTP (Trivial File Transfer Protocol), and collects ETS results and the like by transmitting a TFTP request message to each NW device.

そして、ＥＭ３０は、収集したＳｙｓｌｏｇメッセージやシステムログ情報を統合して、イベント詳細情報として整理して、保持する。例えば、ＥＭ３０は、「いつ」、「どのＮＷ機器」が、「どのような状態か」を示すイベント詳細情報を保持する。 Then, the EM 30 integrates the collected Syslog messages and system log information, and organizes and holds them as event detailed information. For example, the EM 30 holds detailed event information indicating “when” and “which NW device” is “what state”.

影響評価装置１００は、ＣＭＤＢ２０によって収集されたネットワーク１０の構成情報と、ＥＭ３０によって収集されたログ情報とを用いて、イベントによって影響を受けるネットワーク上の範囲とサービスとを解析する。 The impact evaluation apparatus 100 analyzes the range and service on the network affected by the event using the configuration information of the network 10 collected by the CMDB 20 and the log information collected by the EM 30.

［影響評価装置の構成］
以下、図２を用いて、第１の実施形態に係る影響評価装置１００の構成について説明する。図２は、第１の実施形態に係る影響評価装置１００の構成の一例を示す図である。図２に示すように、影響評価装置１００は、Ｉ／Ｆ（Interface）部１１０と、入力部１２０と、表示部１３０と、通信部１４０と、記憶部１５０と、制御部１６０とを有する。Ｉ／Ｆ部１１０は、入力部１２０、表示部１３０及び通信部１４０と、制御部１６０との各種情報のやり取りを制御する。入力部１２０は、影響評価装置１００の管理者から各種情報の入力操作を受付ける。例えば、入力部１２０は、後述する制御部１６０によるリファレンスサイトの実行プログラムの選択に用いられるルール情報の入力操作を受付ける。なお、ルール情報については後に詳述する。表示部１３０は、各種情報を表示する表示デバイスであり、例えば液晶ディスプレイ等である。 [Configuration of impact assessment equipment]
Hereinafter, the configuration of the impact evaluation apparatus 100 according to the first embodiment will be described with reference to FIG. FIG. 2 is a diagram illustrating an example of the configuration of the impact evaluation apparatus 100 according to the first embodiment. As illustrated in FIG. 2, the impact evaluation apparatus 100 includes an I / F (Interface) unit 110, an input unit 120, a display unit 130, a communication unit 140, a storage unit 150, and a control unit 160. The I / F unit 110 controls the exchange of various types of information between the input unit 120, the display unit 130, the communication unit 140, and the control unit 160. The input unit 120 receives various information input operations from the administrator of the impact assessment apparatus 100. For example, the input unit 120 accepts an input operation of rule information used for selecting a reference site execution program by the control unit 160 described later. The rule information will be described later in detail. The display unit 130 is a display device that displays various types of information, such as a liquid crystal display.

記憶部１５０は、図２に示すように、ログ情報記憶部１５１と、構成情報記憶部１５２と、ルール情報記憶部１５３と、テスト情報１５４とを有する。記憶部１５０は、例えば、ＲＡＭ（Random Access Memory）、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、又は、ハードディスク、光ディスク等の記憶装置などである。 As illustrated in FIG. 2, the storage unit 150 includes a log information storage unit 151, a configuration information storage unit 152, a rule information storage unit 153, and test information 154. The storage unit 150 is, for example, a semiconductor memory device such as a RAM (Random Access Memory) or a flash memory, or a storage device such as a hard disk or an optical disk.

ログ情報記憶部１５１は、ネットワーク１０に含まれるＮＷ機器のログ情報を記憶する。具体的には、ログ情報記憶部１５１は、後述する制御部１６０によってＥＭ３０から取得されたイベント詳細情報を記憶する。図３は、第１の実施形態に係るログ情報記憶部１５１によって記憶される情報の一例を示す図である。例えば、ログ情報記憶部１５１は、図３に示すように、「日付」、「時刻」、「ＮＷ機器」及び「状態」を対応付けたイベント詳細情報を記憶する。 The log information storage unit 151 stores log information of NW devices included in the network 10. Specifically, the log information storage unit 151 stores detailed event information acquired from the EM 30 by the control unit 160 described later. FIG. 3 is a diagram illustrating an example of information stored by the log information storage unit 151 according to the first embodiment. For example, as illustrated in FIG. 3, the log information storage unit 151 stores detailed event information in which “date”, “time”, “NW device”, and “state” are associated with each other.

ここで、「日付」とは、ネットワーク１０においてイベントが発生した日付を示す。また、「時刻」とは、ネットワーク１０においてイベントが発生した時刻を示す。また、「ＮＷ機器」とは、イベントが発生したＮＷ機器を示す。また、「状態」とは、イベントの内容を示す。すなわち、ログ情報記憶部１５１によって記憶されるログ情報は、「いつ」、「どの機器」で「どのようなイベント」が発生したかを意味する情報である。 Here, “date” indicates the date on which an event occurred in the network 10. “Time” indicates the time at which an event occurred in the network 10. The “NW device” indicates the NW device in which an event has occurred. The “state” indicates the content of the event. That is, the log information stored by the log information storage unit 151 is information indicating “what” and “what event” has occurred in “which device”.

図２に戻って、構成情報記憶部１５２は、ネットワーク１０の構成を示す構成情報を記憶する。具体的には、構成情報記憶部１５２は、後述する制御部１６０によってＣＭＤＢ２０から取得された構成情報を記憶する。図４は、第１の実施形態に係る構成情報記憶部１５２によって記憶される情報の一例を示す図である。例えば、構成情報記憶部１５２は、図４に示すように、「ＮＷ機器」、「構成部品」、「接続先ＮＷ機器」及び「構成部品」を対応付けた構成情報を記憶する。 Returning to FIG. 2, the configuration information storage unit 152 stores configuration information indicating the configuration of the network 10. Specifically, the configuration information storage unit 152 stores configuration information acquired from the CMDB 20 by the control unit 160 described later. FIG. 4 is a diagram illustrating an example of information stored in the configuration information storage unit 152 according to the first embodiment. For example, as illustrated in FIG. 4, the configuration information storage unit 152 stores configuration information in which “NW device”, “component”, “connection destination NW device”, and “component” are associated with each other.

一例を挙げると、構成情報記憶部１５２は、図４に示すように、「ＮＷ機器：Ｒ１、構成部品：ポート１ａ」に「接続先ＮＷ機器：Ｒ２、構成部品：ポート２ａ」が対応付けられた構成情報を記憶する。すなわち、上述した構成情報は、ネットワーク１０において、「ルータＲ１」の「ポート１ａ」に「ルータＲ２」の「ポート２ａ」が接続されていることを意味する。 For example, in the configuration information storage unit 152, as shown in FIG. 4, “NW device: R1, component: port 1a” is associated with “connection destination NW device: R2, component: port 2a”. Stored configuration information. That is, the above-described configuration information means that “port 2a” of “router R2” is connected to “port 1a” of “router R1” in network 10.

図２に戻って、ルール情報記憶部１５３は、後述する制御部１６０によって実行されるリファレンスサイトの実行プログラムの選択に用いられるルール情報を記憶する。具体的には、ルール情報記憶部１５３は、リファレンスサイトによって実行されるテスト内容を決定するために予め設定されるルール情報を記憶する。図５は、第１の実施形態に係るルール情報記憶部１５３によって記憶される情報の一例を示す図である。例えば、ルール情報記憶部１５３は、図５に示すように、「イベント」、「範囲」、「サービス」及び「テスト内容」を対応付けたルール情報を記憶する。 Returning to FIG. 2, the rule information storage unit 153 stores rule information used for selecting an execution program for a reference site executed by the control unit 160 described later. Specifically, the rule information storage unit 153 stores rule information set in advance in order to determine the test content executed by the reference site. FIG. 5 is a diagram illustrating an example of information stored by the rule information storage unit 153 according to the first embodiment. For example, as illustrated in FIG. 5, the rule information storage unit 153 stores rule information in which “event”, “range”, “service”, and “test content” are associated with each other.

ここで、「イベント」とは、ネットワーク１０において発生しうるイベントの内容を示す。また、「範囲」とは、ネットワーク１０において発生したイベントが影響する範囲を示す。また、「サービス」とは、ネットワーク１０において発生したイベントが影響するサービスを示す。また、「テスト内容」とは、リファレンスサイトに実行させるテストの内容を示す。すなわち、ルール情報記憶部１５１によって記憶されるルール情報は、イベント内容と、当該イベントによって影響をうける範囲及びサービス内容との組み合わせごとに、リファレンスサイトにどのようなテストを実行させるかを示す情報である。テスト内容としては、例えば、遅延テストや、疎通性テストなどが挙げられる。 Here, the “event” indicates the content of an event that can occur in the network 10. The “range” indicates a range in which an event occurring in the network 10 affects. The “service” indicates a service affected by an event occurring in the network 10. “Test content” indicates the content of a test to be executed by the reference site. That is, the rule information stored by the rule information storage unit 151 is information indicating what tests are to be executed by the reference site for each combination of the event content, the range affected by the event, and the service content. is there. Examples of the test contents include a delay test and a communication test.

図２に戻って、テスト情報記憶部１５４は、後述する制御部１６０によってリファレンスサイトに対して送信されるテスト実行コマンドに指定されるテストプログラムの情報を記憶する。図６は、第１の実施形態に係るルール情報記憶部１５４によって記憶される情報の一例を示す図である。例えば、テスト情報記憶部１５４は、図６に示すように、「テスト内容」に「プログラム」を対応付けたテスト情報を記憶する。ここで、「テスト内容」とは、リファレンスサイトに実行させるテストの内容を示す。また、「プログラム」とは、テスト内容に対応するプログラムの情報を示す。 Returning to FIG. 2, the test information storage unit 154 stores information on the test program specified in the test execution command transmitted to the reference site by the control unit 160 described later. FIG. 6 is a diagram illustrating an example of information stored by the rule information storage unit 154 according to the first embodiment. For example, as illustrated in FIG. 6, the test information storage unit 154 stores test information in which “program” is associated with “test content”. Here, “test content” indicates the content of a test to be executed by the reference site. The “program” indicates program information corresponding to the test content.

図２に戻って、制御部１６０は、取得部１６１と、推定部１６２と、検査部１６３と、解析部１６４と、通知部１６５とを有する。ここで、制御部１６０は、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）などの電子回路やＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）などの集積回路である。 Returning to FIG. 2, the control unit 160 includes an acquisition unit 161, an estimation unit 162, an inspection unit 163, an analysis unit 164, and a notification unit 165. Here, the control unit 160 is an electronic circuit such as a CPU (Central Processing Unit) or an MPU (Micro Processing Unit), or an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).

取得部１６１は、ネットワーク１０に含まれる複数のネットワーク機器に関するログ情報及びネットワーク１０における複数のネットワーク機器の構成を示す構成情報を取得する。具体的には、取得部１６１は、ログ情報として、Ｓｙｓｌｏｇの情報、ＭＩＢ情報及びＳｈｏｗコマンドの出力情報を含む情報を取得する。例えば、取得部１６１は、ＣＭＤＢ２０から構成情報を取得する。また、取得部１６１は、ＥＭ３０からイベント詳細情報を取得する。ここで、取得部１６１は、例えば、ＴＦＴＰリクエストを用いて構成情報やイベント詳細情報を取得する。そして、取得部１６１は、取得したイベント詳細情報をログ情報記憶部１５１に格納し、構成情報を構成情報記憶部１５２に格納する。 The acquisition unit 161 acquires log information regarding a plurality of network devices included in the network 10 and configuration information indicating configurations of the plurality of network devices in the network 10. Specifically, the acquisition unit 161 acquires information including Syslog information, MIB information, and output information of a Show command as log information. For example, the acquisition unit 161 acquires configuration information from the CMDB 20. The acquisition unit 161 acquires event detailed information from the EM 30. Here, the acquisition unit 161 acquires configuration information and event detailed information using, for example, a TFTP request. Then, the acquisition unit 161 stores the acquired event detailed information in the log information storage unit 151 and stores the configuration information in the configuration information storage unit 152.

推定部１６２は、取得部１６１によって取得されたログ情報及びネットワーク１０の構成情報に基づいて、ネットワーク１０に発生した障害によって影響を受ける当該ネットワーク１０上の範囲及びサービスを推定する。具体的には、推定部１６２は、ログ情報及び前記ネットワークの構成情報に基づいて、当該ネットワーク上でサービスを提供する複数のサービス提供サーバ及び前記複数のサービス提供サーバからサービス提供を受ける複数のユーザ端末のうち、ネットワーク１０に発生した障害によって影響を受けるサービス提供サーバ及びユーザ端末を推定する。 Based on the log information acquired by the acquisition unit 161 and the configuration information of the network 10, the estimation unit 162 estimates the range and service on the network 10 that are affected by the failure that has occurred in the network 10. Specifically, the estimation unit 162 includes a plurality of service providing servers that provide services on the network and a plurality of users that receive services from the plurality of service providing servers, based on log information and network configuration information. Among the terminals, a service providing server and a user terminal that are affected by a failure occurring in the network 10 are estimated.

例えば、推定部１６２は、ログ情報記憶部１５１によって記憶されたイベント詳細情報を参照して、ネットワーク１０に発生したイベントを特定する。そして、推定部１６２は、構成情報記憶部１５２によって記憶されたネットワークの構成情報を参照して、特定したイベントごとに、イベントによって影響を受ける範囲と、サービス提供サーバによって提供されるサービスのうちイベントによって影響を受けるサービスとを推定する。 For example, the estimation unit 162 identifies an event that has occurred in the network 10 with reference to the event detailed information stored by the log information storage unit 151. Then, the estimation unit 162 refers to the network configuration information stored in the configuration information storage unit 152, and for each identified event, the range that is affected by the event and the event among the services provided by the service providing server Estimate the services affected by

検査部１６３は、推定部１６２によって推定されたネットワーク１０上の範囲におけるサービスに対して、ネットワーク１０に発生した障害が与える影響の内容を検査する。具体的には、検査部１６３は、ネットワーク１０に発生した障害と、推定部１６２によって推定されたネットワーク１０上の範囲との組み合わせに基づいて、リファレンスサイトに実行させる検査内容を決定し、決定した検査内容の実行コマンドをリファレンスサイトに実行させることで、サービスに対して障害が与える影響の内容を検査する。 The inspecting unit 163 inspects the content of the influence of the failure that has occurred in the network 10 on the service in the range on the network 10 estimated by the estimating unit 162. Specifically, the inspection unit 163 determines and determines the inspection contents to be executed by the reference site based on the combination of the failure that has occurred in the network 10 and the range on the network 10 estimated by the estimation unit 162. By causing the reference site to execute the inspection content execution command, the content of the effect of the failure on the service is inspected.

例えば、検査部１６３は、ルール情報記憶部１５３によって記憶されたルール情報と、テスト情報記憶部１５４によって記憶されたテスト情報とを参照して、推定部１６２によって影響範囲とサービスが推定されたイベントごとに、テスト内容及びプログラムを選択する。そして、検査部１６３は、選択したプログラムを実行させるテスト実行コマンドをリファレンスサイトＲＳ１及び／又はリファレンスサイトＲＳ２に送信してテストを実行させる。 For example, the inspection unit 163 refers to the rule information stored in the rule information storage unit 153 and the test information stored in the test information storage unit 154, and the event whose influence range and service are estimated by the estimation unit 162 Each time, the test content and program are selected. Then, the inspection unit 163 transmits a test execution command for executing the selected program to the reference site RS1 and / or the reference site RS2 to execute the test.

解析部１６４は、検査部１６３による検査結果に基づいて、推定部１６２によって推定されたネットワーク上の範囲におけるサービスに対して、ネットワークに発生した障害が与える影響の内容を解析する。具体的には、解析部１６４は、リファレンスサイトＲＳ１及び／又はリファレンスサイトＲＳ２のテスト結果を取得して、取得した結果を解析することで、イベントがサービスに対して与える影響を評価する。 Based on the inspection result by the inspection unit 163, the analysis unit 164 analyzes the content of the influence of the failure occurring in the network on the service in the network range estimated by the estimation unit 162. Specifically, the analysis unit 164 acquires the test result of the reference site RS1 and / or the reference site RS2, and analyzes the acquired result to evaluate the influence of the event on the service.

通知部１６５は、検査部１６３による検査結果をネットワークの管理者に対して通知する。具体的には、通知部１６５は、解析部１６４による解析結果とイベントとを組み合わせたイベントレポートを生成し、生成したイベントレポートをネットワーク１０の管理者に対して通知する。例えば、通知部１６５は、イベントごとに生成したイベントレポートを表示部１３０にて表示させることで、ネットワーク１０の管理者に検査結果を通知する。 The notification unit 165 notifies the network administrator of the inspection result by the inspection unit 163. Specifically, the notification unit 165 generates an event report that combines the analysis result of the analysis unit 164 and the event, and notifies the administrator of the network 10 of the generated event report. For example, the notification unit 165 notifies the administrator of the network 10 of the inspection result by causing the display unit 130 to display an event report generated for each event.

以上、第１の実施形態に係る影響評価装置１００の構成について説明した。以下、上述した影響評価装置１００を含むネットワーク内における処理の流れについて、図７を用いて説明する。図７は、第１の実施形態に係る影響評価装置１００を含むネットワーク１０内における処理の流れを説明するための模式図である。 The configuration of the impact evaluation apparatus 100 according to the first embodiment has been described above. Hereinafter, the flow of processing in the network including the above-described impact evaluation apparatus 100 will be described with reference to FIG. FIG. 7 is a schematic diagram for explaining the flow of processing in the network 10 including the impact evaluation apparatus 100 according to the first embodiment.

例えば、図７に示すように、ステップ１において、ネットワーク（ＮＷ）にイベントが発生すると、ＥＭ３０は、システムログ収集機能によってＭＩＢ情報やＳｈｏｗコマンド出力の情報などを収集する。また、同様に、ＥＭ３０は、Ｓｙｓｌｏｇ収集機能によってＳｙｓｌｏｇメッセージを収集する。そして、ステップ２において、ＥＭ３０は、イベント管理機能によって、収集したシステムログ情報及びＳｙｓｌｏｇメッセージの情報とを統合して、「いつ」、「どの機器」が「どのような状態か」を示すイベント詳細情報を生成して保持する。 For example, as shown in FIG. 7, when an event occurs in the network (NW) in step 1, the EM 30 collects MIB information, Show command output information, and the like by the system log collection function. Similarly, the EM 30 collects Syslog messages using the Syslog collection function. Then, in step 2, the EM 30 integrates the collected system log information and the Syslog message information by the event management function, and the event details indicating “when” and “which device” is “what state”. Generate and maintain information.

その後、影響評価装置１００の取得部１６１が、ＣＭＤＢ２０からネットワークの構成情報を取得するとともに、ＥＭ３０からイベント詳細情報を取得する。そして、ステップ３において、影響評価装置１００の推定部１６２が、取得部１６１によって取得されたネットワークの構成情報と、イベント詳細情報とからどの機器・どのサービスがどのように影響を受けるかを推定する。 Thereafter, the acquisition unit 161 of the impact evaluation apparatus 100 acquires network configuration information from the CMDB 20 and event detailed information from the EM 30. In step 3, the estimation unit 162 of the impact evaluation apparatus 100 estimates which device and which service are affected from the network configuration information acquired by the acquisition unit 161 and the event detailed information. .

そして、ステップ４において、影響評価装置１００の検査部１６３が、推定部１６２の推定結果と、ルール情報とを用いて、「どのリファレンスサイトが」、「ＮＷのどの箇所へ」、「どのテストを実施するか」をそれぞれ決定する。そして、検査部１６３は、リファレンスサイトに対してテスト実行コマンドを送信する。 In step 4, the inspection unit 163 of the impact evaluation apparatus 100 uses the estimation result of the estimation unit 162 and the rule information to determine “which reference site”, “to which part of the NW”, “which test Decide whether or not to implement. Then, the inspection unit 163 transmits a test execution command to the reference site.

テスト実行コマンドを受信したリファレンスサイトは、ステップ５において、テスト実行コマンドで指示された範囲に対して、Ｐｉｎｇ、ＴｒａｃｅＲｏｕｔｅ、Ｃａｐｔｕｒｅなどのテストを実行する。そして、リファレンスサイトは、テスト結果を影響評価装置１００に対して送信する。 In step 5, the reference site that has received the test execution command executes tests such as Ping, Trace Route, and Capture for the range designated by the test execution command. Then, the reference site transmits the test result to the impact evaluation apparatus 100.

そして、ステップ６において、影響評価装置１００がテスト結果を受信すると、解析部１６４は、テスト結果に基づいて、イベントがサービスに与える影響を解析する。例えば、解析部１６４は、「遅延」や、「疎通性」などを解析する。 In step 6, when the impact evaluation apparatus 100 receives the test result, the analysis unit 164 analyzes the influence of the event on the service based on the test result. For example, the analysis unit 164 analyzes “delay”, “communication”, and the like.

そして、ステップ７において、通知部１６５は、解析部１６４の解析結果に基づいて、「イベント名」、「イベント詳細」、「影響評価」などの情報を含むイベントレポートを作成して、作成したイベントレポートをＮＷの管理者に対して通知する。 In step 7, the notification unit 165 creates an event report including information such as “event name”, “event details”, and “impact assessment” based on the analysis result of the analysis unit 164, and creates the event The report is notified to the NW administrator.

以上、影響評価装置１００を含むネットワーク内の処理の流れについて説明した。以下、図１に示すネットワーク１０のルータＲ３とルータＲ４との間にリンク故障が発生した場合の一例について説明する。 The flow of processing in the network including the impact evaluation apparatus 100 has been described above. Hereinafter, an example when a link failure occurs between the router R3 and the router R4 of the network 10 illustrated in FIG. 1 will be described.

上記した場合、推定部１６２は、取得部１６１によって取得されたイベント詳細情報からリンク故障の詳細な情報を抽出する。例えば、推定部１６２は、イベント詳細情報に含まれる「中継網１１のルータ４のポートがダウン」とする情報を抽出する。そして、推定部１６２は、ネットワーク１０の構成情報を参照して、中継網１１の各ＮＷ機器が同一のＡＳに所属していることから、「中継網１１の各ＮＷ機器においてルートが再計算される可能性がある」と判定する。 In the case described above, the estimation unit 162 extracts detailed information on the link failure from the event detailed information acquired by the acquisition unit 161. For example, the estimation unit 162 extracts information indicating that “the port of the router 4 of the relay network 11 is down” included in the event detailed information. Then, the estimation unit 162 refers to the configuration information of the network 10 and since each NW device of the relay network 11 belongs to the same AS, “the route is recalculated in each NW device of the relay network 11”. It is possible to determine.

さらに、推定部１６２は、構成情報を参照して、ユーザセグメント１２ａ、１２ｂ及びサーバセグメント１３ａ、１３ｂの通信が中継網１１を介していることから、「ユーザセグメント１２ａに対する及びユーザセグメント１２ｂに対するサービスサーバＳ１のサービスが影響を受ける」と推定する。同様に、推定部１６２は、「ユーザセグメント１２ａに対する及びユーザセグメント１２ｂに対するサービスサーバＳ２のサービスが影響を受ける」と推定する。 Further, the estimation unit 162 refers to the configuration information, and since the communication of the user segments 12a and 12b and the server segments 13a and 13b is via the relay network 11, the “service server for the user segment 12a and for the user segment 12b” It is estimated that the service of S1 is affected. Similarly, the estimation unit 162 estimates that “the service of the service server S2 for the user segment 12a and the user segment 12b is affected”.

そして、検査部１６３は、「ルータＲ４のポートがダウンして、ルータＲ３との間のリンクが故障している」ことと、「ユーザセグメント１２ａに対する及びユーザセグメント１２ｂに対するサービスサーバＳ１のサービスが影響を受ける」及び「ユーザセグメント１２ａに対する及びユーザセグメント１２ｂに対するサービスサーバＳ２のサービスが影響を受ける」こととから、ルール情報を用いてテスト内容を決定する。 Then, the inspection unit 163 determines that “the port of the router R4 is down and the link between the router R3 is broken” and “the service of the service server S1 for the user segment 12a and the user segment 12b is affected. The contents of the test are determined using the rule information from “Receiving” and “Service server S2 service to user segment 12a and user segment 12b is affected”.

例えば、検査部１６３は、「サービスサーバＳ１からユーザセグメント１２ａ及びユーザセグメント１２ｂへの疎通性」を検証するテストと、「サービスサーバＳ２からユーザセグメント１２ａ及びユーザセグメント１２ｂへの疎通性」を検証するテストと、「サービスサーバＳ２からユーザセグメント１２ｂへのサービス応答時間」を検証するテストとを実行することを決定する。 For example, the inspection unit 163 verifies the “communication from the service server S1 to the user segment 12a and the user segment 12b” and the “communication from the service server S2 to the user segment 12a and the user segment 12b”. It is decided to execute the test and the test for verifying the “service response time from the service server S2 to the user segment 12b”.

そして、検証部１６３は、テスト情報を参照して各テスト内容に対応するプログラムを決定し、該当するリファレンスサイトへ決定したテスト実行コマンドを送信する。例えば、検査部１６３は、「サービスサーバＳ１へＰｉｎｇして、応答の有無を記録させるテスト」のテスト実行コマンドをリファレンスサイトＲＳ１に対して送信する。また、検査部１６３は、「サービスサーバＳ１へＰｉｎｇして、応答の有無を記録させるテスト」のテスト実行コマンドをリファレンスサイトＲＳ２に対して送信する。また、検査部１６３は、「サービスサーバＳ２へＰｉｎｇして、応答の有無と、応答時間とを記録させるテスト」のテスト実行コマンドをリファレンスサイトＲＳ２に対して送信する。 Then, the verification unit 163 determines a program corresponding to each test content with reference to the test information, and transmits the determined test execution command to the corresponding reference site. For example, the inspection unit 163 transmits a test execution command “Ping to the service server S1 to record the presence / absence of a response” to the reference site RS1. In addition, the inspection unit 163 transmits a test execution command “Ping to the service server S1 to record presence / absence of a response” to the reference site RS2. In addition, the inspection unit 163 transmits a test execution command of “Ping to the service server S2 to record presence / absence of response and response time” to the reference site RS2.

そして、解析部１６３は、各リファレンスサイトから受信したテスト結果を解析して、各サービスに対するイベントの影響を評価する。例えば、テスト結果が、「サービスサーバＳ１からリファレンスサイトＲＳ１に対する応答が有り」、「サービスサーバＳ１からリファレンスサイトＲＳ２に対する応答が有り」、「サービスサーバＳ２からリファレンスサイトＲＳ２に対する応答が有り」、「サービスサーバＳ２からリファレンスサイトＲＳ２に対する応答時間が所定の時間より長い」であった場合、解析部１６３は、「ユーザセグメント１２ｂに含まれるユーザ端末に対してサーバＳ２から提供されるサービスの通信に遅延が発生する」と解析する。 And the analysis part 163 analyzes the test result received from each reference site, and evaluates the influence of the event with respect to each service. For example, the test results are “a response from the service server S1 to the reference site RS1”, “a response from the service server S1 to the reference site RS2”, “a response from the service server S2 to the reference site RS2”, “service When the response time from the server S2 to the reference site RS2 is longer than a predetermined time, the analyzing unit 163 indicates that “the communication of the service provided from the server S2 to the user terminal included in the user segment 12b is delayed. Analyzes.

そして、通知部１６５は、解析部１６４による解析結果と、イベント詳細情報とからイベントレポートを生成して、ネットワーク１０の管理者に対して通知する。図８は、第１の実施形態に係る通知部１６５によって通知されるイベントレポートの一例を示す図である。例えば、通知部１６５は、図８に示すように、「イベント名：リンク故障」と、「イベント詳細：中継網１１のルータＲ４のポートがダウンし、ルータＲ３とルータＲ４との間のリンク故障」と、「影響：ユーザセグメント１２ｂに含まれるユーザ端末に対してサーバＳ２によって提供されるサービスの通信に遅延発生」とが記載されたイベントレポートを生成する。そして、通知部１６５は、表示部１３０に生成したイベントレポートを表示させる。 Then, the notification unit 165 generates an event report from the analysis result by the analysis unit 164 and the event detailed information, and notifies the administrator of the network 10. FIG. 8 is a diagram illustrating an example of an event report notified by the notification unit 165 according to the first embodiment. For example, as shown in FIG. 8, the notification unit 165 displays “event name: link failure” and “event details: the port of the router R4 of the relay network 11 goes down, and the link failure between the router R3 and the router R4. And “Influence: Delay in communication of service provided by server S2 to user terminals included in user segment 12b” is generated. Then, the notification unit 165 causes the display unit 130 to display the generated event report.

［影響評価装置による処理の手順］
次に、第１の実施形態に係る影響評価装置１００による処理の手順について、図９を用いて説明する。図９は、第１の実施形態に係る影響評価装置１００による処理の手順を示すフローチャートである。なお、図９においては、ネットワーク１０にイベントが発生して、ＥＭ３０がイベント詳細情報を生成した後の処理について示す。 [Procedure for processing by impact assessment device]
Next, a processing procedure performed by the impact evaluation apparatus 100 according to the first embodiment will be described with reference to FIG. FIG. 9 is a flowchart illustrating a procedure of processing performed by the impact evaluation apparatus 100 according to the first embodiment. FIG. 9 shows processing after an event occurs in the network 10 and the EM 30 generates event detailed information.

図９に示すように、第１の実施形態に係る影響評価装置１００においては、ネットワーク１０にイベントが発生すると、取得部１６１が、イベント詳細情報及びネットワーク１０の構成情報を取得する（ステップＳ１０１）。そして、推定部１６２が、取得部１６１によって取得されたイベント詳細情報及びネットワークの構成情報を用いて、イベントによって影響を受けるＮＷ機器及びサービスを推定する（ステップＳ１０２）。 As shown in FIG. 9, in the impact evaluation apparatus 100 according to the first embodiment, when an event occurs in the network 10, the acquisition unit 161 acquires event detailed information and configuration information of the network 10 (step S101). . Then, the estimation unit 162 estimates the NW device and service affected by the event using the event detailed information and the network configuration information acquired by the acquisition unit 161 (step S102).

そして、検査部１６３が、推定部１６２によって推定されたＮＷ機器及びサービスと、イベントの内容とから、ルール情報に基づいて、テスト実行するリファレンスサイト及びテストプログラムを選択する（ステップＳ１０３）。そして、検査部１６３は、選択したリファレンスサイトに対してテスト実行コマンドを送信する（ステップＳ１０４）。 Then, the inspection unit 163 selects a reference site and a test program to be tested based on the rule information from the NW device and service estimated by the estimation unit 162 and the content of the event (step S103). Then, the inspection unit 163 transmits a test execution command to the selected reference site (step S104).

そして、リファレンスサイトからテスト結果を受信すると（ステップＳ１０５肯定）、解析部１６４は、サービスに対するイベントの影響を解析する（ステップＳ１０６）。そして、通知部１６５は、解析部１６４による解析結果からイベントレポートを生成して、ＮＷ管理者に通知して（ステップＳ１０７）、処理を終了する。なお、ステップＳ１０５において、リファレンスサイトからのテスト結果を受信するまで、影響評価装置１００は待機状態となる（ステップＳ１０５否定）。 When the test result is received from the reference site (Yes at Step S105), the analysis unit 164 analyzes the influence of the event on the service (Step S106). And the notification part 165 produces | generates an event report from the analysis result by the analysis part 164, notifies an NW administrator (step S107), and complete | finishes a process. In step S105, the impact evaluation apparatus 100 is in a standby state until a test result from the reference site is received (No in step S105).

［第１の実施形態の効果］
上述したように、第１の実施形態によれば、取得部１６１は、ネットワーク１０に含まれるネットワーク機器のログ情報及びネットワーク１０の構成情報を取得する。そして、推定部１６２は、取得部１６１によって取得されたログ情報及びネットワーク１０の構成情報に基づいて、ネットワーク１０に発生したイベントによって影響を受ける当該ネットワーク１０上の範囲及びサービスを推定する。そして、検査部１６３は、推定部１６２によって推定されたネットワーク１０上の範囲におけるサービスに対して、ネットワーク１０に発生したイベントが与える影響の内容を検査する。そして。通知部１６５は、検査部１６３による検査結果をネットワーク１０の管理者に対して通知する。従って、第１の実施形態に係る影響評価装置１００は、イベントがネットワークやサービスに与える影響を容易に評価することを可能にする。 [Effect of the first embodiment]
As described above, according to the first embodiment, the acquisition unit 161 acquires log information of network devices included in the network 10 and configuration information of the network 10. Based on the log information acquired by the acquisition unit 161 and the configuration information of the network 10, the estimation unit 162 estimates the range and service on the network 10 that are affected by the event that has occurred in the network 10. Then, the inspection unit 163 inspects the content of the influence of the event occurring in the network 10 on the service in the range on the network 10 estimated by the estimation unit 162. And then. The notification unit 165 notifies the administrator of the network 10 of the inspection result by the inspection unit 163. Therefore, the influence evaluation apparatus 100 according to the first embodiment makes it possible to easily evaluate the influence of an event on a network or a service.

例えば、従来技術では、イベントの影響を評価するためには、ＮＷ管理者が、イベントの詳細情報に基づいてＮＷ上で影響が及ぶ範囲を推定し、適切な評価手段を選択して評価を行なっていたが、第１の実施形態に係る影響評価装置１００は、イベントに合わせて適切な評価を行うことができ、イベントがネットワークやサービスに与える影響を容易に評価することを可能にする。 For example, in the prior art, in order to evaluate the influence of an event, the NW administrator estimates the range of influence on the NW based on the detailed information of the event, and selects and evaluates an appropriate evaluation means. However, the influence evaluation apparatus 100 according to the first embodiment can perform an appropriate evaluation according to the event, and can easily evaluate the influence of the event on the network and the service.

また、例えば、従来技術では、イベントに関する情報のみでなく、管理者がＮＷの物理構成及び論理構成の情報を把握しておく必要があり、ＮＷ管理者にとっては高度な知識が求められる上、多くの稼動が必要であったが、第１の実施形態に係る影響評価装置１００は、イベント検知からその影響評価までを自動的に実施することができ、イベントがネットワークやサービスに与える影響を容易に評価することを可能にする。 In addition, for example, in the prior art, it is necessary for an administrator to grasp not only information related to an event but also information on the physical configuration and logical configuration of the NW, which requires a high level of knowledge for the NW administrator. However, the impact evaluation apparatus 100 according to the first embodiment can automatically perform the event detection to the impact assessment, and easily influence the event on the network and the service. Make it possible to evaluate.

また、第１の実施形態によれば、推定部１６２は、ログ情報及びネットワーク１０の構成情報に基づいて、当該ネットワーク１０上でサービスを提供する複数のサービス提供サーバ及び複数のサービス提供サーバからサービス提供を受ける複数のユーザ端末のうち、前記ネットワークに発生した障害によって影響を受けるサービス提供サーバ及びユーザ端末を推定する。従って、第１の実施形態に係る影響評価装置１００は、イベントによる影響の範囲を詳細に推定することを可能にする。 Further, according to the first embodiment, the estimation unit 162 is configured to provide services from a plurality of service providing servers and a plurality of service providing servers that provide services on the network 10 based on the log information and the configuration information of the network 10. A service providing server and user terminals that are affected by a failure occurring in the network are estimated from among a plurality of user terminals to be provided. Therefore, the influence evaluation apparatus 100 according to the first embodiment makes it possible to estimate in detail the range of influence caused by an event.

また、第１の実施形態によれば、検査部１６３は、ネットワーク１０に発生した障害と、推定部１６２によって推定されたネットワーク上１０の範囲との組み合わせに基づいて、リファレンスサイトに実行させる検査内容を決定し、決定した検査内容の実行コマンドをリファレンスサイトに実行させることで、サービスに対して障害が与える影響の内容を検査する。従って、第１の実施形態に係る影響評価装置１００は、種々のイベントに対応した適切な検査を実行することを可能にする。 In addition, according to the first embodiment, the inspection unit 163 causes the reference site to perform inspection contents based on the combination of the failure that has occurred in the network 10 and the range on the network 10 estimated by the estimation unit 162. And the execution of the determined inspection contents is executed by the reference site, thereby inspecting the contents of the influence of the failure on the service. Therefore, the influence evaluation apparatus 100 according to the first embodiment makes it possible to execute appropriate inspections corresponding to various events.

また、第１の実施形態によれば、解析部１６４は、検査部１６３による検査結果に基づいて、推定部１６２によって推定されたネットワーク１０上の範囲におけるサービスに対して、ネットワーク１０に発生した障害が与える影響の内容を解析する。そして、通知部１６５は、解析部１６４による解析結果と障害とを組み合わせたレポートを生成し、生成したレポートをネットワーク１０の管理者に対して通知する。従って、第１の実施形態に係る影響評価装置１００は、イベントがサービスに与える影響の詳細をネットワーク管理者に提供することができ、ネットワーク管理者が適切かつ迅速にアクションをとることを可能にする。 Further, according to the first embodiment, the analysis unit 164 causes a failure that has occurred in the network 10 to a service in the range on the network 10 estimated by the estimation unit 162 based on the inspection result by the inspection unit 163. Analyze the content of the impact of. Then, the notification unit 165 generates a report that combines the analysis result by the analysis unit 164 and the failure, and notifies the generated report to the administrator of the network 10. Therefore, the impact evaluation apparatus 100 according to the first embodiment can provide details of the impact of the event on the service to the network administrator, and enables the network administrator to take appropriate and prompt actions. .

また、第１の実施形態によれば、取得部１６１は、ログ情報として、Ｓｙｓｌｏｇの情報、ＭＩＢ情報及びＳｈｏｗコマンドの出力情報を含む情報を取得する。従って、第１の実施形態に係る影響評価装置１００は、ネットワーク機器の状態に関して種々の情報を取得することができ、様々なイベントを検出することを可能にする。 Further, according to the first embodiment, the acquisition unit 161 acquires information including Syslog information, MIB information, and output information of the Show command as log information. Therefore, the impact evaluation apparatus 100 according to the first embodiment can acquire various information regarding the state of the network device, and can detect various events.

［第２の実施形態］
これまで第１の実施形態について説明したが、本願に係る技術は第１の実施形態に限定されるものではない。すなわち、第１の実施形態は、その他の様々な形態で実施されることが可能であり、種々の省略、置き換え、変更を行うことができる。 [Second Embodiment]
Although the first embodiment has been described so far, the technology according to the present application is not limited to the first embodiment. That is, the first embodiment can be implemented in various other forms, and various omissions, replacements, and changes can be made.

上述した第１の実施形態では、イベントレポートをネットワーク１０の管理者に通知する場合について説明した。しかしながら、実施形態はこれに限定されるものではなく、例えば、複数のネットワーク管理者それぞれにイベントレポートを通知する場合であってもよい。例えば、図１に示すユーザセグメント１２ａ及びユーザセグメント１２ｂそれぞれの管理者に対してイベントレポートを通知する場合であってもよい。これにより、例えば、第２の実施形態に係る影響評価装置１００は、イベントによって影響を受けるユーザに対して迅速に通知を行うことができる。また、例えば、第２の実施形態に係る影響評価装置１００は、各ネットワーク管理者によるネットワークの設計やサーバの構成などをより適切に行わせることを可能にする。 In the first embodiment described above, the case where the event report is notified to the administrator of the network 10 has been described. However, the embodiment is not limited to this. For example, the event report may be notified to each of a plurality of network administrators. For example, the event report may be notified to the managers of the user segment 12a and the user segment 12b shown in FIG. Thereby, for example, the impact evaluation apparatus 100 according to the second embodiment can quickly notify the user who is affected by the event. Further, for example, the impact evaluation apparatus 100 according to the second embodiment enables each network administrator to more appropriately perform network design, server configuration, and the like.

[システム構成等]
例えば、各装置の分散・統合の具体的形態（例えば、図２の形態）は図示のものに限られず、その全部又は一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的又は物理的に分散・統合することができる。一例を挙げると、取得部１６１と推定部１６２とを一つの処理部として統合してもよく、一方、検査部１６３を、リファレンスサイト及び検査内容を決定する決定部と、決定された検査内容を決定されたリファレンスサイトに実行させる実行制御部とに分散してもよい。 [System configuration, etc.]
For example, the specific form of distribution / integration of each device (for example, the form shown in FIG. 2) is not limited to the one shown in the figure, and all or a part thereof can be changed in arbitrary units according to various loads and usage conditions. Functionally or physically distributed and integrated. For example, the acquisition unit 161 and the estimation unit 162 may be integrated as one processing unit, while the inspection unit 163 includes a reference site and a determination unit that determines the inspection content, and the determined inspection content. You may distribute to the execution control part performed by the determined reference site.

また、ＥＭ３０に備えられたＳｙｓｌｏｇ収集機能、Ｓｙｓｌｏｇ以外のシステムログ収集機能、イベント詳細情報の管理機能は、同一の装置によって実行される場合であってもよく、或いは、それぞれ異なる装置で実行される場合であってもよい。 Further, the Syslog collection function, the system log collection function other than Syslog, and the event detailed information management function provided in the EM 30 may be executed by the same device, or may be executed by different devices. It may be the case.

これらの実施例やその変形は、本願に係る技術に含まれると同様に、特許請求の範囲に記載された発明とその均等の範囲に含まれるものである。 These embodiments and modifications thereof are also included in the invention according to the claims and equivalents thereof, if included in the technology according to the present application.

１０ネットワーク
１１中継網
１２ａ、１２ｂユーザセグメント
１３ａ、１３ｂサーバセグメント
２０ＣＭＤＢ
３０ＥＭ
１００影響評価装置
１６１取得部
１６２推定部
１６３検査部
１６４解析部
１６５通知部 10 network 11 relay network 12a, 12b user segment 13a, 13b server segment 20 CMDB
30 EM
DESCRIPTION OF SYMBOLS 100 Impact evaluation apparatus 161 Acquisition part 162 Estimation part 163 Inspection part 164 Analysis part 165 Notification part

Claims

An acquisition unit that acquires log information about a plurality of network devices included in a network and configuration information indicating a configuration of the plurality of network devices in the network;
Based on the log information acquired by the acquisition unit and the configuration information of the network, an estimation unit that estimates a range and service on the network affected by a failure that has occurred in the network;
An inspection unit that inspects the content of the influence of a failure that has occurred in the network on the service in the range on the network estimated by the estimation unit;
A notification unit for notifying the network administrator of the inspection result by the inspection unit;
An impact evaluation apparatus comprising:

The estimation unit is based on the log information and the configuration information of the network, among a plurality of service providing servers that provide services on the network and a plurality of user terminals that receive services from the plurality of service providing servers, The impact evaluation apparatus according to claim 1, wherein a service providing server and a user terminal that are affected by a failure occurring in the network are estimated.

The inspection unit determines an inspection content to be executed by a reference site based on a combination of a failure occurring in the network and a range on the network estimated by the estimation unit, and executes an execution command of the determined inspection content. The influence evaluation apparatus according to claim 1, wherein the influence evaluation apparatus inspects the content of the influence of the failure on the service by being executed by the reference site.

Based on the inspection result by the inspection unit, the service unit in the range estimated on the network by the estimation unit, further comprising an analysis unit for analyzing the content of the effect of the failure that has occurred in the network,
The said notification part produces | generates the report which combined the analysis result by the said analysis part, and the said failure, and notifies the produced | generated report with respect to the administrator of the said network. The impact evaluation apparatus according to one.

The impact evaluation apparatus according to claim 1, wherein the acquisition unit acquires information including Syslog information, MIB information, and output information of a Show command as the log information.

An impact assessment method executed by an impact assessment device that assesses the impact of a failure occurring in a network,
An acquisition step of acquiring log information regarding a plurality of network devices included in the network and configuration information indicating a configuration of the plurality of network devices in the network;
Based on the log information acquired by the acquisition step and the configuration information of the network, an estimation step of estimating a range and a service on the network affected by a failure that has occurred in the network;
An inspection step of inspecting the contents of the influence of a failure occurring in the network with respect to the service in the range on the network estimated by the estimation step;
A notification step of notifying an inspection result of the inspection step to an administrator of the network;
An impact assessment method characterized by including