JP2020005138A

JP2020005138A - Inference device of failure influence range, inference device of failure cause, inference method and program of failure cause

Info

Publication number: JP2020005138A
Application number: JP2018123350A
Authority: JP
Inventors: Osamu Kamatani; Osamu Akashi; Takahiro Yamaguchi; Fumio Teraoka; Hiroshi Mikami
Original assignee: Nippon Telegraph and Telephone Corp; Keio University
Current assignee: Nippon Telegraph and Telephone Corp; Keio University
Priority date: 2018-06-28
Filing date: 2018-06-28
Publication date: 2020-01-09
Anticipated expiration: 2038-06-28
Also published as: JP7032251B2

Abstract

To provide a failure cause inference method that does not require all failure propagation paths to be calculated in advance and that can narrow down failure causes automatically. SOLUTION: When failure information about an element constituting a network is input, the inference method selects the elements whose state is affected from the stored dependency rules, detects state propagation rules based on the selected elements, and estimates the range of elements affected by the failure information. When a failure situation concerning the network is input, the method selects one or more elements related to the failure situation from the dependency rules, retrieves one or more states that each selected element can take, detects state propagation rules based on the selected elements and the retrieved states, and performs cause estimation, which estimates a list of the elements that cause the failure situation together with the states of those elements. SELECTED DRAWING: Figure 1

Description

The present disclosure relates to a technique for inferring the failure influence range and the failure cause using a network ontology.

Server and network virtualization technologies have made network configurations increasingly complex. In a large-scale, dynamic system, it is difficult for a network administrator to grasp the entire system accurately, and the cost of locating the cause of a failure and its influence range is rising. As a result, demand for automated failure cause inference is growing.

Non-Patent Document 1 infers the root cause of a failure by applying general-purpose analysis rules to an IT system configuration modeled with an ontology. Each general-purpose analysis rule consists of a condition part and a conclusion part. The condition part specifies the type of an IT device, the type of a resource that makes up the device, and the type of event generated from that resource. The conclusion part describes, as the result of the analysis, a device type, the type of its resource, and the type of event generated from that resource. Defined this way, a general-purpose analysis rule can express conditions on the topology as well as co-occurrence conditions on failure events.

Non-Patent Document 2 uses an ontology to express how abnormality information propagates between system components. The method uses two kinds of rules: rules that derive dependencies and rules for state propagation. For influence range inference, the rules are expanded in order starting from the component that is the cause, and the state is propagated along them, which identifies the affected components. For failure cause inference, all dependencies and state propagation rules are expanded, and every path along which an abnormal state propagates through the system is calculated. Cause candidate components are then enumerated by extracting only the paths that include the failure state currently occurring.

Patent Document 1: Japanese Patent No. 5845133

Non-Patent Document 1: Yutaka Kudo, Tomohiro Morimura, Kiminori Sugauchi, Tetsuya Masuishi, Norihisa Komoda. Rule construction method and analysis execution method for failure cause analysis. IEEJ Transactions on Electronics, Information and Systems (C), Vol. 132, No. 10, pp. 1689-1697, 2012.
Non-Patent Document 2: H. Dihowski, O. Holub, and J. Rojcek. Knowledge-Based Fault Propagation in Building Automation System. In Proceedings of 2016 International Conference on Systems Informatics, Modelling and Simulation (SIMS), pp. 124-132, June 2016.
Non-Patent Document 3: Kei Mikami, Shinji Kawaguchi, Ryota Oshima, Kenta Shimamatsu, Takao Kondo, Osamu Kamatani, Osamu Akashi, Kunitake Kaneko, Fumio Teraoka. A study on a network management method using the network ontology Bonsai. IEICE Technical Report, IN2016-86, January 2017.
Non-Patent Document 4: Shinji Kawaguchi. "KANVAS: An Open Information Sharing Platform Using Ontologies for Utilizing Network Knowledge." Master's thesis, School of Science for Open and Environmental Systems, Graduate School of Science and Technology, Keio University, March 2016.

The method of Non-Patent Document 1 targets only failure cause inference and does not consider failure influence range inference. The method of Non-Patent Document 2 implements both failure cause inference and failure influence range inference, but to do so all rules must be expanded in advance to match the system configuration. The method of Non-Patent Document 2 therefore has a first problem: as the system to be analyzed grows larger and more complex, the numbers of components and inference rules increase, which makes it difficult to calculate the failure propagation paths in advance.

In addition, although the methods of both documents can infer failure causes, the cause candidates they present must be verified manually. This leads to a second problem: narrowing down the failure cause is difficult.

To solve the first problem, an object of the present invention is to provide a failure influence range inference device, a failure cause inference device, a failure influence range inference method, a failure cause inference method, and a program that make it unnecessary to calculate all failure propagation paths in advance, even when the system to be analyzed becomes large and complex.
To solve the second problem, another object of the present invention is to provide a failure cause inference device, a failure cause inference method, and a program that can narrow down failure causes automatically.

To achieve the above objects, the failure influence range inference device and method according to the present invention expand and hold dependency rules and state propagation rules in accordance with network configuration information.

Specifically, the failure influence range inference device according to the present invention includes:
setting means in which are set a dependency rule describing the dependency between two elements, among a plurality of elements constituting a network, whose states depend on each other, and a state propagation rule describing the content of the state that propagates between the two elements;
storage means for collecting information about the configuration of the network, and extracting and storing the dependency rules that correspond to the configuration of the network; and
range estimation means for, when failure information about an element is input, selecting from the stored dependency rules the elements whose state is affected, detecting the state propagation rules based on the selected elements, and estimating the range of elements affected by the failure information.

The failure influence range inference method according to the present invention performs:
a setting procedure of setting a dependency rule describing the dependency between two elements, among a plurality of elements constituting a network, whose states depend on each other, and a state propagation rule describing the content of the state that propagates between the two elements;
a storage procedure of collecting information about the configuration of the network, and extracting and storing the dependency rules that correspond to the configuration of the network; and
a range estimation procedure of, when failure information about an element is input, selecting from the stored dependency rules the elements whose state is affected, detecting the state propagation rules based on the selected elements, and estimating the range of elements affected by the failure information.
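The range estimation procedure can be pictured as a walk that starts at the failed element and follows only the dependency rules reachable from it. The Python sketch below is a minimal illustration under stated assumptions, not the patented implementation: the element names, the `DEPENDENTS` and `PROPAGATES` tables, and the function `estimate_influence_range` are all hypothetical, and state propagation is simplified to a state-to-state mapping.

```python
from collections import deque

# Hypothetical expanded dependency rules: the state of each key
# affects the state of every element in its list.
DEPENDENTS = {
    "switch3": ["switch3-if-up"],
    "switch3-if-up": ["link-router3-switch3"],
    "link-router3-switch3": ["subnet3"],
    "subnet3": ["path-host2-web"],
    "path-host2-web": ["web-service"],
}

# Hypothetical, simplified state propagation rules:
# upstream state -> state induced in the dependent element.
PROPAGATES = {"down": "down", "degraded": "degraded"}


def estimate_influence_range(failed_element, failed_state):
    """Return {element: induced state} for every element reached from
    the failed element by following dependency rules breadth-first."""
    affected = {}
    queue = deque([(failed_element, failed_state)])
    while queue:
        element, state = queue.popleft()
        induced = PROPAGATES.get(state)
        if induced is None:
            continue  # no state propagation rule matches this state
        for dependent in DEPENDENTS.get(element, []):
            if dependent == failed_element or dependent in affected:
                continue  # already handled; also guards against cycles
            affected[dependent] = induced
            queue.append((dependent, induced))
    return affected


if __name__ == "__main__":
    for element, state in estimate_influence_range("switch3", "down").items():
        print(element, state)
```

Because the walk starts from the reported element, only the rules along the affected path are consulted; nothing has to be computed for unrelated parts of the network.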

The failure influence range inference device according to the present invention may further include failure information response means for returning, as the response to the input failure information, the influence on network services within the estimated range of elements.

The failure cause inference device according to the present invention, on the other hand, includes:
setting means in which are set a dependency rule describing the dependency between two elements, among a plurality of elements constituting a network, whose states depend on each other, and a state propagation rule describing the content of the state that propagates between the two elements;
storage means for collecting information about the configuration of the network, and extracting and storing the dependency rules that correspond to the configuration of the network; and
cause estimation means for, when a failure situation concerning the network is input, selecting from the stored dependency rules one or more elements related to the failure situation, retrieving one or more states that each selected element can take, detecting the state propagation rules based on the selected elements and the retrieved states, and performing cause estimation, which estimates a list of the elements that cause the failure situation and the states of those elements.

The failure cause inference method according to the present invention performs:
a setting procedure of setting a dependency rule describing the dependency between two elements, among a plurality of elements constituting a network, whose states depend on each other, and a state propagation rule describing the content of the state that propagates between the two elements;
a storage procedure of collecting information about the configuration of the network, and extracting and storing the dependency rules that correspond to the configuration of the network; and
a cause estimation procedure of, when a failure situation concerning the network is input, selecting from the stored dependency rules one or more elements related to the failure situation, retrieving one or more states that each selected element can take, detecting the state propagation rules based on the selected elements and the retrieved states, and estimating a list of the elements that cause the failure situation and the states of those elements.
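The cause estimation procedure runs in the opposite direction: starting from the observed failure situation, it looks up the elements that the failing element depends on, tries each state those elements can take, and keeps the (element, state) pairs for which a state propagation rule reproduces the observed state. The following Python sketch is one possible reading of those steps. Every name in it (`DEPENDS`, `KIND`, `POSSIBLE_STATES`, `PROPAGATION`, `estimate_causes`) and all of the sample data are assumptions made for illustration.

```python
# Hypothetical expanded dependency rules as (upstream, downstream) pairs.
DEPENDS = [
    ("switch3", "link-r3-s3"),
    ("router3", "link-r3-s3"),
    ("link-r3-s3", "subnet3"),
    ("subnet3", "web-service"),
]

# Hypothetical kind of each element and the states each kind can take.
KIND = {
    "switch3": "node",
    "router3": "node",
    "link-r3-s3": "link",
    "subnet3": "subnet",
    "web-service": "service",
}
POSSIBLE_STATES = {
    "node": ["ok", "down"],
    "link": ["ok", "down"],
    "subnet": ["ok", "unreachable"],
    "service": ["ok", "unavailable"],
}

# Hypothetical state propagation rules:
# (upstream kind, upstream state, downstream kind) -> downstream state.
PROPAGATION = {
    ("node", "down", "link"): "down",
    ("link", "down", "subnet"): "unreachable",
    ("subnet", "unreachable", "service"): "unavailable",
}


def estimate_causes(element, state):
    """Return the (element, state) pairs that can explain the given
    failure situation, found by following dependencies in reverse."""
    causes = []
    seen = {(element, state)}
    frontier = [(element, state)]
    while frontier:
        down_elem, down_state = frontier.pop()
        for up_elem, dependent in DEPENDS:
            if dependent != down_elem:
                continue
            # Try every state the upstream element can take.
            for up_state in POSSIBLE_STATES[KIND[up_elem]]:
                key = (KIND[up_elem], up_state, KIND[down_elem])
                if PROPAGATION.get(key) != down_state:
                    continue  # this state would not produce the symptom
                if (up_elem, up_state) in seen:
                    continue
                seen.add((up_elem, up_state))
                causes.append((up_elem, up_state))
                frontier.append((up_elem, up_state))
    return causes


if __name__ == "__main__":
    for cause in estimate_causes("web-service", "unavailable"):
        print(cause)
```

The result is a flat list of candidate pairs; in this sample data both `router3` and `switch3` remain as candidates because either one going down would take the shared link down.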

The failure influence range inference technique and the failure cause inference technique periodically collect the structure of the target network, and expand and hold only the dependency rules and state propagation rules that correspond to changes in that structure. It is therefore unnecessary to calculate all failure propagation paths in advance. Accordingly, the present invention can provide a failure influence range inference device, a failure cause inference device, a failure influence range inference method, and a failure cause inference method that make it unnecessary to calculate all failure propagation paths in advance, even when the system to be analyzed becomes large and complex.

The failure cause inference device according to the present invention may further include failure situation response means that causes the cause estimation means to perform the cause estimation on the assumption of another failure situation similar to the input failure situation, thereby estimating another list of the elements that cause the other failure situation and the states of those elements, and that returns, as the response to the input failure situation, the result of excluding from the list the items common to the list and the other list.

The failure cause inference device performs cause estimation based on an assumed failure situation and compares the result with the cause estimation for the current situation, thereby narrowing down the failure causes automatically. Accordingly, the present invention can provide a failure cause inference device and a failure cause inference method that can narrow down failure causes automatically.
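The narrowing step amounts to removing from the candidate list every item that also appears in the list estimated for the assumed failure situation. A minimal Python sketch follows; the function name `narrow_causes` and the sample candidates are hypothetical, and the sketch shows only the exclusion itself, not how the similar failure situation is chosen.

```python
def narrow_causes(candidates, other_candidates):
    """Exclude every (element, state) pair that also appears in the
    list estimated for the other, assumed failure situation.
    The order of the remaining candidates is preserved."""
    shared = set(other_candidates)
    return [item for item in candidates if item not in shared]


if __name__ == "__main__":
    # Hypothetical candidates for the input failure situation.
    current = [("router1", "down"), ("switch3", "down"), ("link-s3-h2", "down")]
    # Hypothetical candidates for the assumed, similar failure situation.
    assumed = [("router1", "down"), ("switch1", "down")]
    print(narrow_causes(current, assumed))
```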

The program according to the present invention causes a computer to function as the failure influence range inference device or as the failure cause inference device. The failure influence range inference device and the failure cause inference device according to the present invention can also be implemented with a computer and a program, and the program can be recorded on a recording medium or provided over a network.

The present invention can provide a failure influence range inference device, a failure cause inference device, a failure influence range inference method, a failure cause inference method, and a program that make it unnecessary to calculate all failure propagation paths in advance, even when the system to be analyzed becomes large and complex.
The present invention can also provide a failure cause inference device, a failure cause inference method, and a program that can narrow down failure causes automatically.

A block diagram illustrating the failure influence range inference device and the failure cause inference device according to the present invention.
A configuration diagram illustrating the overall structure of the network ontology Bonsai.
A diagram illustrating an example network at the IP level.
An instance diagram illustrating an IP network configuration.
A diagram illustrating an example network at the physical level.
An instance diagram illustrating a physical network configuration.
An instance diagram illustrating a network service configuration.
A diagram illustrating the instance representation of switch 3 in the physical network configuration.
A diagram illustrating the instance representation of the upstream interface of switch 3 in the physical network configuration.
A diagram illustrating the instance representation of the link between router 3 and switch 3 in the physical network configuration.
A diagram illustrating the instance representation of the link between switch 3 and host 2 in the physical network configuration.
A diagram illustrating the instance representation of the link between router 3 and switch 3 in the logical network configuration.
A diagram illustrating the instance representation of subnet 3 in the IP network configuration.
A diagram illustrating the instance representation of the communication path from host 2 to the Web server.
A diagram illustrating the instance representation of host 3 in the IP network configuration.
A diagram illustrating the instance representation of host 3 in the IP network configuration.
A diagram illustrating the instance representation of the Web service in the network service configuration.
A diagram illustrating dependency rule 1 (PhysicalNode -> PhysicalInterface).
A diagram illustrating dependency rule 2 (PhysicalInterface -> PhysicalLink).
A diagram illustrating dependency rule 3 (PhysicalLink -> LogicalLink).
A diagram illustrating dependency rule 4 (LogicalLink -> IPSubnet).
A diagram illustrating dependency rule 5 (IPSubnet -> IPPath).
A diagram illustrating dependency rule 6 (IPPath -> NetworkService).
A diagram illustrating state propagation rule 1 (PhysicalNode -> PhysicalInterface).
A diagram illustrating state propagation rule 2 (PhysicalInterface -> PhysicalLink).
A diagram illustrating state propagation rule 3 (PhysicalLink -> LogicalLink).
A diagram illustrating state propagation rule 4 (LogicalLink -> IPSubnet).
A diagram illustrating state propagation rule 5 (IPSubnet -> IPPath).
A diagram illustrating state propagation rule 6 (IPPath -> NetworkService).
A diagram illustrating the instance representation of switch 3 in the physical network configuration after the dependency rules are applied.
A diagram illustrating the instance representation of the upstream interface of switch 3 in the physical network configuration after the dependency rules are applied.
A diagram illustrating the instance representation of the link between router 3 and switch 3 in the physical network configuration after the dependency rules are applied.
A diagram illustrating the instance representation of the link between router 3 and switch 3 in the logical network configuration after the dependency rules are applied.
A diagram illustrating the instance representation of subnet 3 in the IP network configuration after the dependency rules are applied.
A diagram illustrating the instance representation of the communication path from host 2 to the Web server after the dependency rules are applied.
A diagram illustrating the failure influence range inference method according to the present invention.
A diagram illustrating the influence range inference rule.
A diagram illustrating the failure cause inference method according to the present invention.
A diagram illustrating cause inference rule (1).
A diagram illustrating cause inference rule (2-1).
A diagram illustrating cause inference rule (2-2).
A diagram illustrating cause inference rule (3).
A diagram illustrating an example of the FaultNode class used in the failure cause inference method according to the present invention.
A diagram illustrating an example of cause narrowing in the failure cause inference method according to the present invention.
A diagram illustrating the cause narrowing procedure in the failure cause inference method according to the present invention.
A diagram illustrating the fault tree narrowing rule.
A diagram showing an example of an overview of the KANVAS architecture.
A diagram showing an example of the internal structure of the KIG.
A diagram showing an example of an IP network configuration.
A diagram showing an example of a physical network configuration.
A diagram showing an example of a flowchart of the network configuration detection method.
A diagram showing an example of the network configuration obtained after step S102.
A diagram showing an example of the network configuration obtained after step S103.
A diagram showing an example of the network configuration obtained after step S104.
A diagram showing an example of the network configuration obtained after step S105.
A diagram showing an example of the network configuration obtained after step S106.

Embodiments of the present invention will be described with reference to the accompanying drawings. The embodiments described below are examples of the present invention, and the present invention is not limited to them. In this specification and the drawings, components with the same reference numerals denote the same components.

The inference device of the present embodiment is characterized by including:
setting means in which are set a dependency rule describing the dependency between two elements, among a plurality of elements constituting a network, whose states depend on each other, and a state propagation rule describing the content of the state that propagates between the two elements;
storage means for collecting information about the configuration of the network, and extracting and storing the dependency rules that correspond to the configuration of the network; and
range estimation means for, when failure information about an element is input, selecting from the stored dependency rules the elements whose state is affected, detecting the state propagation rules based on the selected elements, and estimating the range of elements affected by the failure information, and/or
cause estimation means for, when a failure situation concerning the network is input, selecting from the stored dependency rules one or more elements related to the failure situation, retrieving one or more states that each selected element can take, detecting the state propagation rules based on the selected elements and the retrieved states, and performing cause estimation, which estimates a list of the elements that cause the failure situation and the states of those elements.
Here, an "element" means a node, interface, link, path, subnet, service, or the like that constitutes the network.

1. Module Configuration
FIG. 1 shows a block diagram of the inference device. The network configuration information collection module 11 automatically collects network configuration information, converts it into an instance representation based on the network ontology Bonsai, and stores it in the network configuration information RDF (Resource Description Framework) storage 21. A specific example of the network configuration information collection module 11 is described later.

Bonsai is a domain ontology based on RDF and OWL (Web Ontology Language) (Non-Patent Document 3). FIG. 2 shows an overview of the network configuration ontology defined by Bonsai. Bonsai consists of five layers and six categories. The first layer defines the physical network configuration, the second layer the logical network configuration, the third layer the IP network configuration, the fourth layer the overlay network configuration, and the fifth layer the network service configuration and the operation network configuration. Each time the network configuration information collection module 11 detects a change in the network configuration, it stores the network configuration information in the network configuration information RDF storage 21.
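The layer and category structure described above can be written down as a small lookup table. The Python sketch below is only an illustration of that structure; the identifiers `Category`, `LAYER_OF`, and `categories_in_layer` are hypothetical and are not taken from the Bonsai ontology itself.

```python
from enum import Enum


class Category(Enum):
    """The six configuration categories named in the description."""
    PHYSICAL_NETWORK = "physical network configuration"
    LOGICAL_NETWORK = "logical network configuration"
    IP_NETWORK = "IP network configuration"
    OVERLAY_NETWORK = "overlay network configuration"
    NETWORK_SERVICE = "network service configuration"
    OPERATION_NETWORK = "operation network configuration"


# Layer number of each category; the fifth layer holds two categories.
LAYER_OF = {
    Category.PHYSICAL_NETWORK: 1,
    Category.LOGICAL_NETWORK: 2,
    Category.IP_NETWORK: 3,
    Category.OVERLAY_NETWORK: 4,
    Category.NETWORK_SERVICE: 5,
    Category.OPERATION_NETWORK: 5,
}


def categories_in_layer(layer):
    """Return the categories defined in the given layer (1 to 5)."""
    return [cat for cat, number in LAYER_OF.items() if number == layer]


if __name__ == "__main__":
    for layer in range(1, 6):
        names = ", ".join(cat.value for cat in categories_in_layer(layer))
        print(f"layer {layer}: {names}")
```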

The rule input module 25 converts the dependency rules and state propagation rules defined by the system administrator into SPIN (SPARQL Inferencing Notation) format and stores them in the SPIN dependency rule storage 22 and the SPIN state propagation rule storage 24, respectively.

When the SPIN inference engine 12 detects a change in the information stored in the network configuration information RDF storage 21, it applies the SPIN dependency rules to the network configuration information and stores the result in the dependency rule expansion storage 23.
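The expansion step can be illustrated without an RDF or SPIN library by treating the configuration as a set of (subject, predicate, object) triples and each dependency rule as a class-level pattern. The Python sketch below is a simplified stand-in under stated assumptions: the predicate names `type`, `hasInterface`, `attachedTo`, and `affects`, the rule encoding, and the `ExpansionStore` class are all hypothetical and do not reproduce actual Bonsai vocabulary or SPIN syntax. Only the class names PhysicalNode, PhysicalInterface, and PhysicalLink come from the rule titles in the drawings.

```python
# Hypothetical configuration triples standing in for RDF statements.
CONFIG = {
    ("switch3", "type", "PhysicalNode"),
    ("switch3-if1", "type", "PhysicalInterface"),
    ("link-r3-s3", "type", "PhysicalLink"),
    ("switch3", "hasInterface", "switch3-if1"),
    ("switch3-if1", "attachedTo", "link-r3-s3"),
}

# Hypothetical class-level dependency rules:
# (source class, linking predicate, target class).
RULES = [
    ("PhysicalNode", "hasInterface", "PhysicalInterface"),
    ("PhysicalInterface", "attachedTo", "PhysicalLink"),
]


def expand_dependency_rules(config, rules):
    """Apply class-level rules to instance triples and return the
    instance-level dependencies as (source, "affects", target)."""
    types = {s: o for s, p, o in config if p == "type"}
    expanded = set()
    for s, p, o in config:
        for src_cls, pred, dst_cls in rules:
            if p == pred and types.get(s) == src_cls and types.get(o) == dst_cls:
                expanded.add((s, "affects", o))
    return expanded


class ExpansionStore:
    """Re-expands the rules only when the configuration has changed."""

    def __init__(self, rules):
        self.rules = rules
        self.expanded = set()
        self._last_snapshot = None

    def on_config(self, config):
        snapshot = frozenset(config)
        if snapshot == self._last_snapshot:
            return False  # nothing changed, keep the stored expansion
        self._last_snapshot = snapshot
        self.expanded = expand_dependency_rules(snapshot, self.rules)
        return True


if __name__ == "__main__":
    store = ExpansionStore(RULES)
    store.on_config(CONFIG)
    for triple in sorted(store.expanded):
        print(triple)
```

Keeping the expanded dependencies in a store that is refreshed on change mirrors the role the description gives to the dependency rule expansion storage 23: queries at failure time read the stored result instead of re-deriving it.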

The input/output module 26 serves as the interface for receiving failure information from system users and returning inference results to them. There are two kinds of failure information. The first specifies the location of a failure in the network (failure designation information), for example "network switch 1 has failed". The second describes a failure situation (failure situation information), for example "the Web server cannot be browsed from host 1".
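The two kinds of failure information can be modeled as separate record types, with the input/output side choosing the inference to run from the type of record it receives. This is a minimal Python sketch; the class names, field names, and the `route` function are hypothetical, and the two return values simply label the influence range inference and the failure cause inference described in this section.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FailureDesignation:
    """Names the failed element, e.g. "network switch 1 has failed"."""
    element: str
    state: str


@dataclass(frozen=True)
class FailureSituation:
    """Describes a symptom, e.g. "the Web server cannot be browsed from host 1"."""
    source: str
    target: str
    symptom: str


def route(info):
    """Pick the inference to run for a piece of failure information."""
    if isinstance(info, FailureDesignation):
        return "influence_range_inference"
    if isinstance(info, FailureSituation):
        return "failure_cause_inference"
    raise TypeError(f"unsupported failure information: {info!r}")


if __name__ == "__main__":
    print(route(FailureDesignation("network-switch1", "failed")))
    print(route(FailureSituation("host1", "web-server", "cannot browse")))
```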

Upon receiving failure designation information from the input/output module 26, the influence range inference module 13 converts it into a Bonsai-based representation and queries the dependency rule expansion storage 23 to infer the range affected by the failure. In doing so, it stores the results of temporarily applying dependency rules in the temporary dependency rule expansion storage 27.

Upon receiving failure situation information from the input/output module 26, the failure cause inference module 14 queries the dependency rule expansion storage 23 and traces the dependencies backward from the failure situation to infer the cause of the failure, obtaining failure cause candidates in a tree structure. In doing so, it stores the results of temporarily applying state propagation rules in the temporary state propagation storage 28.
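Tracing dependencies backward naturally yields a tree whose root is the observed failure and whose leaves are the deepest cause candidates. The Python sketch below shows one way to build such a tree. The drawings mention a FaultNode class, but its actual structure is not given in this section, so the fields used here, the `UPSTREAM` table, and both functions are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class FaultNode:
    """One node of the cause candidate tree (hypothetical layout)."""
    element: str
    children: list = field(default_factory=list)


# Hypothetical reverse dependencies: element -> elements it depends on.
UPSTREAM = {
    "web-service": ["path-host2-web"],
    "path-host2-web": ["subnet3"],
    "subnet3": ["link-r3-s3"],
    "link-r3-s3": ["router3", "switch3"],
}


def build_fault_tree(element, _path=()):
    """Build the tree of cause candidates below the given element by
    following dependencies in reverse; elements already on the current
    path are skipped so that cyclic dependencies terminate."""
    path = _path + (element,)
    node = FaultNode(element)
    for upstream in UPSTREAM.get(element, []):
        if upstream in path:
            continue
        node.children.append(build_fault_tree(upstream, path))
    return node


def leaf_elements(node):
    """Return the elements at the leaves, i.e. the deepest candidates."""
    if not node.children:
        return [node.element]
    found = []
    for child in node.children:
        found.extend(leaf_elements(child))
    return found


if __name__ == "__main__":
    tree = build_fault_tree("web-service")
    print(leaf_elements(tree))
```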

前記設定手段は、ＳＰＩＮ推論エンジン１２、ルール入力モジュール２５、ＳＰＩＮ依存関係ルールストレージ２２、及びＳＰＩＮ状態伝搬ルールストレージ２４に相当する。前記保管手段は、ネットワーク構成情報収集モジュール１１、ＳＰＩＮ推論エンジン１２、ネットワーク構成情報ＲＤＦストレージ２１、及び依存関係ルール展開ストレージ２３に相当する。前記範囲推定手段は、ＳＰＩＮ推論エンジン１２、影響範囲推論モジュール１３、入出力モジュール２６、依存関係ルール展開ストレージ２３、及び一時的依存関係ルール展開ストレージ２７に相当する。前記原因推定手段は、ＳＰＩＮ推論エンジン１２、障害原因推論モジュール１４、依存関係ルール展開ストレージ２３、及び一時的状態伝搬ストレージ２８に相当する。 The setting means corresponds to the SPIN inference engine 12, the rule input module 25, the SPIN dependency relation rule storage 22, and the SPIN state propagation rule storage 24. The storage means corresponds to the network configuration information collection module 11, the SPIN inference engine 12, the network configuration information RDF storage 21, and the dependency relationship expansion storage 23. The range estimating means corresponds to the SPIN inference engine 12, the influence range inference module 13, the input / output module 26, the dependency rule expansion storage 23, and the temporary dependency rule expansion storage 27. The cause estimating means corresponds to the SPIN inference engine 12, the failure cause inference module 14, the dependency rule expansion storage 23, and the temporary state propagation storage 28.

本実施形態の推論装置は、前記原因推定手段に前記障害状況に類似する他の障害状況を仮定して前記原因推定を行わせ、前記他の障害状況の原因となる前記要素と該要素の状態の他のリストを推定し、前記リストと前記他のリストに共通する項目を前記リストから除外した結果を、入力された前記障害状況の応答とする障害状況応答手段をさらに備える。 The inference device of this embodiment further includes failure status response means. This means causes the cause estimation means to perform the cause estimation while assuming another failure status similar to the input failure status, estimates another list of the elements, and the states of those elements, that cause the other failure status, and returns, as the response to the input failure status, the result of excluding from the list the items common to the list and the other list.
前記障害状況応答手段は、障害原因絞込モジュール１５であり、複数の障害状況情報から推論した複数の障害原因候補から障害原因を絞り込む。 The failure status response means is the failure cause narrowing module 15, which narrows down the failure cause from the multiple failure cause candidates inferred from multiple pieces of failure status information.

２．ネットワーク構成例
図３に本明細書で例として用いるネットワークのＩＰレベルの構成を示す。図４にこのネットワークのＩＰネットワーク構成をＢｏｎｓａｉに基づくインスタンスとして表した図を示す。図５にこのネットワークの物理レベルの構成を示す。図６にこのネットワークの物理ネットワーク構成をＢｏｎｓａｉに基づくインスタンスとして表した図を示す。図７にホスト３上で動作するｗｅｂサーバ１のネットワークサービス構成をＢｏｎｓａｉに基づくインスタンスとして表した図を示す。論理ネットワーク構成、オーバレイネットワーク構成および運用ネットワーク構成に関するインスタンス図は省略する。 2. Network Configuration Example FIG. 3 shows an IP-level configuration of a network used as an example in this specification. FIG. 4 is a diagram showing an IP network configuration of this network as an instance based on Bonsai. FIG. 5 shows the configuration of the physical level of this network. FIG. 6 is a diagram showing the physical network configuration of this network as an instance based on Bonsai. FIG. 7 is a diagram showing a network service configuration of the web server 1 operating on the host 3 as an instance based on Bonsai. Instance diagrams relating to the logical network configuration, the overlay network configuration, and the operation network configuration are omitted.

図８にスイッチ３の物理ネットワーク構成におけるインスタンス表現を示す。このインスタンスの名称はｓｗｉｔｃｈ３＿ｐである。図９にスイッチ３の上流インタフェースの物理ネットワーク構成におけるインスタンス表現を示す。このインスタンスの名称はｉｆ＿ｓ３＿ｕ＿ｐである。図１０にルータ３とスイッチ３間のリンクの物理ネットワークにおけるインスタンス表現を示す。このインスタンスの名称はｌｉｎｋ＿ｒ３ｓ３＿ｐである。図１１にスイッチ３とホスト２間のリンクの物理ネットワークにおけるインスタンス表現を示す。このインスタンスの名称はｌｉｎｋ＿ｓ３ｈ２＿ｐである。図１２にルータ３とスイッチ３間のリンクの論理ネットワークにおけるインスタンス表現を示す。このインスタンスの名称はｌｉｎｋ＿ｒ３ｓ３＿ｌである。図１３にＩＰサブネット３のＩＰネットワーク構成におけるインスタンス表現を示す。このインスタンスの名称はｓｕｂｎｅｔ３＿ｉである。図１４にホスト２とｗｅｂサーバ１間のＩＰ経路のＩＰネットワーク構成におけるインスタンス表現を示す。このインスタンスの名称はｐａｔｈ＿ｈ２ｗ１＿ｉである。図１５にホスト３のＩＰネットワーク構成におけるインスタンス表現を示す。このインスタンスの名称はｈｏｓｔ３＿ｉである。図１６にホスト３のサービスネットワーク構成におけるインスタンス表現を示す。このインスタンスの名称はｈｏｓｔ３＿ｓである。図１７にｗｅｂサーバ１のサービスネットワーク構成におけるインスタンス表現を示す。このインスタンスの名称はｗｅｂ１＿ｓである。 FIG. 8 shows the instance representation of switch 3 in the physical network configuration; the instance is named switch3_p. FIG. 9 shows the instance representation of the upstream interface of switch 3 in the physical network configuration; the instance is named if_s3_u_p. FIG. 10 shows the instance representation of the link between router 3 and switch 3 in the physical network; the instance is named link_r3s3_p. FIG. 11 shows the instance representation of the link between switch 3 and host 2 in the physical network; the instance is named link_s3h2_p. FIG. 12 shows the instance representation of the link between router 3 and switch 3 in the logical network; the instance is named link_r3s3_l. FIG. 13 shows the instance representation of IP subnet 3 in the IP network configuration; the instance is named subnet3_i. FIG. 14 shows the instance representation of the IP route between host 2 and web server 1 in the IP network configuration; the instance is named path_h2w1_i. FIG. 15 shows the instance representation of host 3 in the IP network configuration; the instance is named host3_i. FIG. 16 shows the instance representation of host 3 in the service network configuration; the instance is named host3_s. FIG. 17 shows the instance representation of web server 1 in the service network configuration; the instance is named web1_s.

３．依存関係ルールと状態伝搬ルール
依存関係ルールは、互いの状態に影響を及ぼす（依存関係にある）２つの要素を記載したものである。例えば、図１８に示すＳＰＩＮ依存関係ルールは、物理インタフェースが物理ノードに接続している場合、物理ノードの状態が物理インタフェースの状態に影響を与えることを示している。図１９に示すＳＰＩＮ依存関係ルールは、物理リンクが物理インタフェースに接続している場合、物理インタフェースの状態が物理リンクの状態に影響を与えることを示している。図２０に示すＳＰＩＮ依存関係ルールは、論理リンクが物理リンク上で動作している場合、物理リンクの状態が論理リンクの状態に影響を与えることを示している。図２１に示すＳＰＩＮ依存関係ルールは、ＩＰサブネットが論理リンク上で動作している場合、論理リンクの状態がＩＰサブネットの状態に影響を与えることを示している。図２２に示すＳＰＩＮ依存関係ルールは、ＩＰ経路がＩＰサブネットにより構成されている場合、ＩＰサブネットの状態がＩＰ経路の状態に影響を与えることを示している。図２３に示すＳＰＩＮ依存関係ルールは、サービスがＩＰ経路の端点となる論理ノード上で動作している場合、ＩＰ経路の状態がサービスの状態に影響を与えることを示している。 3. Dependency Rules and State Propagation Rules A dependency rule describes two elements that are influencing each other (in a dependent relationship). For example, the SPIN dependency relationship rule shown in FIG. 18 indicates that when a physical interface is connected to a physical node, the state of the physical node affects the state of the physical interface. The SPIN dependency relation rule shown in FIG. 19 indicates that when a physical link is connected to a physical interface, the state of the physical interface affects the state of the physical link. The SPIN dependency relation rule shown in FIG. 20 indicates that the state of the physical link affects the state of the logical link when the logical link operates on the physical link. The SPIN dependency relation rule shown in FIG. 21 indicates that when the IP subnet operates on the logical link, the state of the logical link affects the state of the IP subnet. The SPIN dependency relation rule shown in FIG. 22 indicates that when an IP route is configured by an IP subnet, the status of the IP subnet affects the status of the IP route. The SPIN dependency relationship rule shown in FIG. 23 indicates that when a service is operating on a logical node that is an end point of an IP route, the status of the IP route affects the status of the service.

状態伝搬ルールは、依存関係ルールに記載された２つの要素間で影響する具体的な状態内容を記載したものである。例えば、図２４に示すＳＰＩＮ状態伝搬ルールは、物理ノードが“ＮｏｄｅＤｏｗｎ”という状態を持ち、この物理ノードと物理インタフェース間に依存関係がある場合、物理インタフェースには“ＩｆＤｏｗｎ”という状態が伝搬することを示している。図２５に示すＳＰＩＮ状態伝搬ルールは、物理インタフェースが“ＩｆＤｏｗｎ”という状態を持ち、この物理インタフェースと物理リンク間に依存関係がある場合、物理リンクには“ＬｉｎｋＤｏｗｎ”という状態が伝搬することを示している。図２６に示すＳＰＩＮ状態伝搬ルールは、物理リンクが“ＬｉｎｋＤｏｗｎ”という状態を持ち、この物理リンクと論理リンク間に依存関係がある場合、論理リンクには“ＬｉｎｋＤｏｗｎ”という状態が伝搬することを示している。図２７に示すＳＰＩＮ状態伝搬ルールは、論理リンクが“ＬｉｎｋＤｏｗｎ”という状態を持ち、この論理リンクとＩＰサブネット間に依存関係がある場合、ＩＰサブネットには“ＮｅｔＤｏｗｎ”という状態が伝搬することを示している。図２８に示すＳＰＩＮ状態伝搬ルールは、ＩＰサブネットが“ＮｅｔＤｏｗｎ”という状態を持ち、このＩＰサブネットとＩＰ経路間に依存関係がある場合、ＩＰ経路には“Ｕｎｒｅａｃｈａｂｌｅ”という状態が伝搬することを示している。図２９に示すＳＰＩＮ状態伝搬ルールは、ＩＰ経路が“Ｕｎｒｅａｃｈａｂｌｅ”という状態を持ち、このＩＰ経路とネットワークサービス間に依存関係がある場合、ネットワークサービスには“Ｕｎａｖａｉｌａｂｌｅ”という状態が伝搬することを示している。 A state propagation rule describes the specific states that propagate between the two elements named in a dependency rule. For example, the SPIN state propagation rule shown in FIG. 24 indicates that when a physical node has the state "NodeDown" and there is a dependency between that physical node and a physical interface, the state "IfDown" propagates to the physical interface. The rule shown in FIG. 25 indicates that when a physical interface has the state "IfDown" and there is a dependency between that physical interface and a physical link, the state "LinkDown" propagates to the physical link. The rule shown in FIG. 26 indicates that when a physical link has the state "LinkDown" and there is a dependency between that physical link and a logical link, the state "LinkDown" propagates to the logical link. The rule shown in FIG. 27 indicates that when a logical link has the state "LinkDown" and there is a dependency between that logical link and an IP subnet, the state "NetDown" propagates to the IP subnet. The rule shown in FIG. 28 indicates that when an IP subnet has the state "NetDown" and there is a dependency between that IP subnet and an IP route, the state "Unreachable" propagates to the IP route. The rule shown in FIG. 29 indicates that when an IP route has the state "Unreachable" and there is a dependency between that IP route and a network service, the state "Unavailable" propagates to the network service.
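The chain of state propagation rules described above can be pictured as a lookup table. The following is a minimal Python sketch, not the SPIN rules of FIGS. 24 to 29 themselves (those figures are not reproduced in the text); the class labels and the names `STATE_PROPAGATION` and `propagate_chain` are assumptions made for illustration.

```python
# Hypothetical encoding of the state propagation rules of FIGS. 24 to 29.
# Key:   (class of the influencing element, its state, class of the dependent element)
# Value: state that propagates to the dependent element.
STATE_PROPAGATION = {
    ("PhysicalNode", "NodeDown", "PhysicalInterface"): "IfDown",
    ("PhysicalInterface", "IfDown", "PhysicalLink"): "LinkDown",
    ("PhysicalLink", "LinkDown", "LogicalLink"): "LinkDown",
    ("LogicalLink", "LinkDown", "IPSubnet"): "NetDown",
    ("IPSubnet", "NetDown", "IPPath"): "Unreachable",
    ("IPPath", "Unreachable", "NetworkService"): "Unavailable",
}


def propagate_chain(classes, initial_state):
    """Follow the rules along a chain of element classes and return each state."""
    states = [initial_state]
    for src, dst in zip(classes, classes[1:]):
        nxt = STATE_PROPAGATION.get((src, states[-1], dst))
        if nxt is None:  # no rule matches, so propagation stops here
            break
        states.append(nxt)
    return states


chain = ["PhysicalNode", "PhysicalInterface", "PhysicalLink", "LogicalLink",
         "IPSubnet", "IPPath", "NetworkService"]
states = propagate_chain(chain, "NodeDown")
```

Keying each rule on both element classes keeps the two "LinkDown" rules (physical link to logical link, logical link to IP subnet) distinct.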

４．依存関係のルール展開
図１に示す本推論装置において、ＳＰＩＮ依存関係ストレージにはすでに図１８から図２３に示したＳＰＩＮ依存関係ルールが格納されているとする。このとき、図８から図１７に示したネットワーク構成情報がネットワーク構成情報ＲＤＦストレージ２１に入力されたとする。するとＳＰＩＮ推論エンジン１２は入力されたネットワーク構成情報にＳＰＩＮ依存関係ルールを適用し、その結果を依存関係ルール展開ストレージ２３に格納する。図８に示すインスタンス表現は、図１８に示すＳＰＩＮ依存関係ルールと図９に示すインスタンス表現により、図３０のように書き換えられる。この例では７行目と８行目が元のインスタンス表現に追加されている。図９に示すインスタンス表現は、図１９に示すＳＰＩＮ依存関係ルールおよび図１０と図１１に示すインスタンス表現により、図３１のように書き換えられる。この例では７行目が元のインスタンス表現に追加されている。図１０に示すインスタンス表現は、図２０に示すＳＰＩＮ依存関係ルールと図１２に示すインスタンスにより、図３２のように書き換えられる。この例では７行目が元のインスタンス表現に追加されている。図１２に示すインスタンス表現は、図２１に示すＳＰＩＮ依存関係ルールと図１３に示すインスタンス表現により、図３３のように書き換えられる。この例では８行目が元のインスタンス表現に追加されている。図１３に示すインスタンス表現は、図２２に示すＳＰＩＮ依存関係ルールと図１４に示すインスタンス表現により、図３４のように書き換えられる。この例では９行目が元のインスタンス表現に追加されている。図１４に示すインスタンス表現は、図２３に示すＳＰＩＮ依存関係ルールと図１７に示すインスタンス表現により、図３５のように書き換えられる。この例では７行目が元のインスタンス表現に追加されている。 4. Rule Expansion of Dependencies. In the inference device shown in FIG. 1, assume that the SPIN dependency rules shown in FIGS. 18 to 23 are already stored in the SPIN dependency rule storage 22, and that the network configuration information shown in FIGS. 8 to 17 is then input to the network configuration information RDF storage 21. The SPIN inference engine 12 applies the SPIN dependency rules to the input network configuration information and stores the result in the dependency rule expansion storage 23. The instance representation shown in FIG. 8 is rewritten as shown in FIG. 30 by the SPIN dependency rule shown in FIG. 18 and the instance representation shown in FIG. 9; in this example, the seventh and eighth lines are added to the original instance representation. The instance representation shown in FIG. 9 is rewritten as shown in FIG. 31 by the SPIN dependency rule shown in FIG. 19 and the instance representations shown in FIGS. 10 and 11; in this example, the seventh line is added. The instance representation shown in FIG. 10 is rewritten as shown in FIG. 32 by the SPIN dependency rule shown in FIG. 20 and the instance shown in FIG. 12; in this example, the seventh line is added. The instance representation shown in FIG. 12 is rewritten as shown in FIG. 33 by the SPIN dependency rule shown in FIG. 21 and the instance representation shown in FIG. 13; in this example, the eighth line is added. The instance representation shown in FIG. 13 is rewritten as shown in FIG. 34 by the SPIN dependency rule shown in FIG. 22 and the instance representation shown in FIG. 14; in this example, the ninth line is added. The instance representation shown in FIG. 14 is rewritten as shown in FIG. 35 by the SPIN dependency rule shown in FIG. 23 and the instance representation shown in FIG. 17; in this example, the seventh line is added.

ネットワーク構成情報に変化がない限り、本推論装置はここまでの処理を完了した段階でシステム利用者からの入力待ちとなる。 As long as there is no change in the network configuration information, the inference apparatus waits for an input from the system user at the stage where the processing up to this point is completed.
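As a rough illustration of the rule expansion in this section, the sketch below derives `hasCausalRelationship` triples from configuration triples in plain Python. It is an assumption-laden stand-in for the SPIN engine: the linking properties `attachedTo`, `runsOn`, `composedOf`, and `locatedIn`, the class labels, and the instance `rack1` are hypothetical and do not come from the Bonsai ontology or the figures. Only the instance names of section 2 and the `hasCausalRelationship` property appear in the text.

```python
# Class of each instance (class labels are illustrative assumptions).
TYPES = {
    "switch3_p": "PhysicalNode",
    "if_s3_u_p": "PhysicalInterface",
    "link_r3s3_p": "PhysicalLink",
    "link_r3s3_l": "LogicalLink",
    "subnet3_i": "IPSubnet",
    "path_h2w1_i": "IPPath",
}

# Configuration triples (subject, property, object); property names are hypothetical.
CONFIG = [
    ("if_s3_u_p", "attachedTo", "switch3_p"),
    ("link_r3s3_p", "attachedTo", "if_s3_u_p"),
    ("link_r3s3_l", "runsOn", "link_r3s3_p"),
    ("subnet3_i", "runsOn", "link_r3s3_l"),
    ("path_h2w1_i", "composedOf", "subnet3_i"),
    ("switch3_p", "locatedIn", "rack1"),  # matches no dependency rule
]

# Dependency rules: when "subject --property--> object" holds for these classes,
# the state of the object affects the state of the subject.
RULES = {
    ("PhysicalInterface", "attachedTo", "PhysicalNode"),
    ("PhysicalLink", "attachedTo", "PhysicalInterface"),
    ("LogicalLink", "runsOn", "PhysicalLink"),
    ("IPSubnet", "runsOn", "LogicalLink"),
    ("IPPath", "composedOf", "IPSubnet"),
}


def expand(config, types, rules):
    """Return the derived triples (cause, 'hasCausalRelationship', affected)."""
    derived = set()
    for subj, prop, obj in config:
        if (types.get(subj), prop, types.get(obj)) in rules:
            derived.add((obj, "hasCausalRelationship", subj))
    return sorted(derived)


expanded = expand(CONFIG, TYPES, RULES)
```

Because the expansion depends only on the configuration and the dependency rules, it can be computed once and reused until the configuration changes, which matches the waiting state described above.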

５．影響範囲推論
本推論装置の範囲推定手段が行う動作について説明する。スイッチ３が故障した場合のネットワークサービスレベルでの影響範囲推論の手順を図３６に示す。その際に使用する影響範囲推論ルールを図３７に示す。 5. Influence Range Inference. The operation performed by the range estimation means of the inference device is described below. FIG. 36 shows the procedure for inferring the range of influence at the network service level when switch 3 fails. FIG. 37 shows the influence range inference rules used in that procedure.

（ステップ１）入出力モジュール２６は「スイッチ３が故障した場合のネットワークサービスレベルの障害範囲」という要求を影響範囲推論モジュール１３に送信する。
（ステップ２）影響範囲推論モジュール１３はこの情報を「ｓｗｉｔｃｈ３＿ｐ − ｈａｓＳｔａｔｅ −“ＮｏｄｅＤｏｗｎ”」という障害情報トリプルに変換し、依存関係ルール展開ストレージ２３に送信する。
（ステップ３）依存関係ルール展開ストレージ２３は一時的依存関係ルール展開ストレージ２７にオントロジ更新を送信する。
（ステップ４）一時的依存関係ルール展開ストレージ２７はオントロジ更新通知をＳＰＩＮ推論エンジン１２に送信する。
（ステップ５）ＳＰＩＮ推論エンジン１２は追加された障害情報トリプルに基づきＳＰＩＮ状態伝搬ルールを展開し、その結果として得られたトリプルを一時的依存関係ルール展開ストレージ２７に追加する。ＳＰＩＮ推論エンジン１２が追加するトリプルは以下のとおりである。
・ｉｆ＿ｓ３＿ｕ＿ｐ − ｈａｓＳｔａｔｅ −“ＩｆＤｏｗｎ”
・ｌｉｎｋ＿ｒ３ｓ２＿ｐ − ｈａｓＳｔａｔｅ −“ＬｉｎｋＤｏｗｎ”
・ｌｉｎｋ＿ｒ３ｓ２＿ｌ − ｈａｓＳｔａｔｅ −“ＬｉｎｋＤｏｗｎ”
・ｓｕｂｎｅｔ３＿ｉ − ｈａｓＳｔａｔｅ −“ＮｅｔＤｏｗｎ”
・ｐａｔｈ＿ｈ２ｈ３ − ｈａｓＳｔａｔｅ −“Ｕｎｒｅａｃｈａｂｌｅ”
・ｗｅｂ１＿ｓ − ｈａｓＳｔａｔｅ −“Ｕｎａｖａｉｌａｂｌｅ”
（ステップ６）一時的依存関係ルール展開ストレージ２７はトリプル追加通知を影響範囲推論モジュール１３に送信する。
（ステップ７）影響範囲推論モジュール１３は、ＳＰＩＮ推論エンジン１２によるトリプル追加によって状態に変更があったインスタンスを得るため、状態変更インスタンス要求を一時的依存関係ルール展開ストレージ２７に送信する。
（ステップ８）一時的依存関係ルール展開ストレージ２７は状態に変更があったインスタンスを影響範囲推論モジュール１３に送信する。状態に変更があったインスタンスは以下のとおりである。
・ｓｗｉｔｃｈ３＿ｐ − ｈａｓＳｔａｔｅ −“ＮｏｄｅＤｏｗｎ”
・ｉｆ＿ｓ３＿ｕ＿ｐ − ｈａｓＳｔａｔｅ −“ＩｆＤｏｗｎ”
・ｌｉｎｋ＿ｒ３ｓ２＿ｐ − ｈａｓＳｔａｔｅ −“ＬｉｎｋＤｏｗｎ”
・ｌｉｎｋ＿ｒ３ｓ２＿ｌ − ｈａｓＳｔａｔｅ −“ＬｉｎｋＤｏｗｎ”
・ｓｕｂｎｅｔ３＿ｉ − ｈａｓＳｔａｔｅ −“ＮｅｔＤｏｗｎ”
・ｐａｔｈ＿ｈ３ｈ２ − ｈａｓＳｔａｔｅ −“Ｕｎｒｅａｃｈａｂｌｅ”
・ｗｅｂ１＿ｓ − ｈａｓＳｔａｔｅ −“Ｕｎａｖａｉｌａｂｌｅ”
（ステップ９）影響範囲推論モジュール１３はステップ２で追加されたトリプルを消去するため、リセット要求を一時的依存関係ルール展開ストレージ２７に送信する。
（ステップ１０）一時的依存関係ルール展開ストレージ２７はステップ２で追加されたトリプルを消去し、リセット応答を影響範囲推論モジュール１３に送信する。
（ステップ１１）この結果、ネットワークサービスレベルでの影響は「ｗｅｂ１＿ｓ − ｈａｓＳｔａｔｅ −“Ｕｎａｖａｉｌａｂｌｅ”」であるので、影響範囲推論モジュール１３は入出力モジュール２６に「ｗｅｂ１閲覧不能」という応答を返す。 (Step 1) The input/output module 26 sends the request "failure range at the network service level when switch 3 has failed" to the influence range inference module 13.
(Step 2) The influence range inference module 13 converts this request into the failure information triple "switch3_p - hasState - "NodeDown"" and sends it to the dependency rule expansion storage 23.
(Step 3) The dependency rule expansion storage 23 sends an ontology update to the temporary dependency rule expansion storage 27.
(Step 4) The temporary dependency rule expansion storage 27 sends an ontology update notification to the SPIN inference engine 12.
(Step 5) The SPIN inference engine 12 expands the SPIN state propagation rules based on the added failure information triple and adds the resulting triples to the temporary dependency rule expansion storage 27. The triples added by the SPIN inference engine 12 are as follows.
・if_s3_u_p - hasState - "IfDown"
・link_r3s2_p - hasState - "LinkDown"
・link_r3s2_l - hasState - "LinkDown"
・subnet3_i - hasState - "NetDown"
・path_h2h3 - hasState - "Unreachable"
・web1_s - hasState - "Unavailable"
(Step 6) The temporary dependency rule expansion storage 27 sends a triple addition notification to the influence range inference module 13.
(Step 7) To obtain the instances whose state was changed by the triples that the SPIN inference engine 12 added, the influence range inference module 13 sends a state change instance request to the temporary dependency rule expansion storage 27.
(Step 8) The temporary dependency rule expansion storage 27 sends the instances whose state has changed to the influence range inference module 13. These instances are as follows.
・switch3_p - hasState - "NodeDown"
・if_s3_u_p - hasState - "IfDown"
・link_r3s2_p - hasState - "LinkDown"
・link_r3s2_l - hasState - "LinkDown"
・subnet3_i - hasState - "NetDown"
・path_h3h2 - hasState - "Unreachable"
・web1_s - hasState - "Unavailable"
(Step 9) To erase the triple added in step 2, the influence range inference module 13 sends a reset request to the temporary dependency rule expansion storage 27.
(Step 10) The temporary dependency rule expansion storage 27 erases the triple added in step 2 and sends a reset response to the influence range inference module 13.
(Step 11) As a result, the influence at the network service level is "web1_s - hasState - "Unavailable"", so the influence range inference module 13 returns the response "web1 cannot be browsed" to the input/output module 26.
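Steps 1 to 11 amount to forward chaining over the expanded dependencies in a temporary store that is discarded afterwards. The following Python sketch illustrates that idea under stated assumptions: it replaces the SPIN engine and the storages with in-memory dictionaries, uses the instance names of section 2 (link_r3s3_p, link_r3s3_l, path_h2w1_i), and the names `DEPENDS`, `KIND`, `RULES`, and `infer_impact` are hypothetical.

```python
from collections import deque

# cause -> elements whose state it affects (expanded hasCausalRelationship edges)
DEPENDS = {
    "switch3_p": ["if_s3_u_p"],
    "if_s3_u_p": ["link_r3s3_p"],
    "link_r3s3_p": ["link_r3s3_l"],
    "link_r3s3_l": ["subnet3_i"],
    "subnet3_i": ["path_h2w1_i"],
    "path_h2w1_i": ["web1_s"],
}

# Kind of each instance (labels are illustrative).
KIND = {
    "switch3_p": "node", "if_s3_u_p": "interface", "link_r3s3_p": "plink",
    "link_r3s3_l": "llink", "subnet3_i": "subnet", "path_h2w1_i": "path",
    "web1_s": "service",
}

# Simplified state propagation rules: (state of cause, kind of dependent) -> new state
RULES = {
    ("NodeDown", "interface"): "IfDown",
    ("IfDown", "plink"): "LinkDown",
    ("LinkDown", "llink"): "LinkDown",
    ("LinkDown", "subnet"): "NetDown",
    ("NetDown", "path"): "Unreachable",
    ("Unreachable", "service"): "Unavailable",
}


def infer_impact(element, state, level=None):
    """Return {instance: state} for every instance affected by the given fault."""
    temp = {element: state}            # steps 2-3: fault triple goes into a temporary store
    queue = deque([element])
    while queue:                       # step 5: expand the state propagation rules
        src = queue.popleft()
        for dst in DEPENDS.get(src, []):
            new_state = RULES.get((temp[src], KIND[dst]))
            if new_state is not None and dst not in temp:
                temp[dst] = new_state
                queue.append(dst)
    changed = dict(temp)               # steps 7-8: read back the changed instances
    temp.clear()                       # steps 9-10: reset the temporary store
    if level is not None:              # step 11: keep only the requested level
        changed = {e: s for e, s in changed.items() if KIND[e] == level}
    return changed


service_impact = infer_impact("switch3_p", "NodeDown", level="service")
```

Only the part of the graph reachable from the failed element is visited, so no failure propagation path has to be computed in advance.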

６．障害原因推論
本推論装置の原因推定手段が行う動作について説明する。ホスト２からｗｅｂ１が閲覧不能である場合の障害原因推論の手順を図３８に示す。このとき使用する障害推論ルールを図３９から図４２に示す。またこのルールで使用するＦａｕｌｔＮｏｄｅクラスを図４３に示す。 6. Failure Cause Inference. The operation performed by the cause estimation means of the inference device is described below. FIG. 38 shows the failure cause inference procedure for the case where web1 cannot be browsed from host 2. The failure inference rules used are shown in FIGS. 39 to 42, and the FaultNode class used in these rules is shown in FIG. 43.

（ステップ１）入出力モジュール２６は「ホスト２からｗｅｂ１が閲覧不能」という情報を障害原因推論モジュール１４に送信する。
（ステップ２）障害原因推論モジュール１４は、ｗｅｂ１＿ｓの状態に依存関係をもつインスタンスを検索するため、「？ − ｈａｓＣａｕｓａｌＲｅｌａｔｉｏｎｓｈｉｐ − ｗｅｂ１＿ｓ」という依存インスタンス要求を依存関係ルール展開ストレージ２３に送信する。
（ステップ３）依存関係ルール展開ストレージ２３は依存インスタンス応答として「ｈｏｓｔ３＿ｉ − ｈａｓＣａｕｓａｌＲｅｌａｔｉｏｎｓｈｉｐ − ｗｅｂ１＿ｓ」を障害原因推論モジュール１４に送信する。
（ステップ４）障害原因推論モジュール１４は、ｈｏｓｔ３＿ｉが取り得る状態を得るため「ｈｏｓｔ３＿ｉ − ｐｏｓｓｉｂｌｅＳｔａｔｅｓ − ？」という状態候補要求を依存関係ルール展開ストレージ２３に送信する。
（ステップ５）依存関係ルール展開ストレージ２３は、ネットワーク構成情報ＲＤＦストレージ２１が保管する、図１５に示すｈｏｓｔ３＿ｉのインスタンス表現に基づき、ｈｏｓｔ３＿ｉが取り得る状態として“ＮｏｄｅＤｏｗｎ”を状態候補応答として障害原因推論モジュール１４に送信する。
（ステップ６）障害原因推論モジュール１４は、ｈｏｓｔ３＿ｉが“ＮｏｄｅＤｏｗｎ”という状態になったとき、ｗｅｂ１＿ｓが“Ｕｎａｖａｉｌａｂｌｅ”という状態になるかを確認するため、「ｈｏｓｔ３＿ｉ − ｈａｓＳｔａｔｅ −“ＮｏｄｅＤｏｗｎ”」というトリプルを仮定トリプルとして依存関係ルール展開ストレージ２３に送信する。
（ステップ７）依存関係ルール展開ストレージ２３は仮定トリプルを追加し、ＳＰＩＮ推論エンジン１２にオントロジ更新通知を送信する。
（ステップ８）ＳＰＩＮ推論エンジン１２は追加されたトリプルに対してＳＰＩＮ状態伝搬ルールを適用し、「ｗｅｂ１＿ｓ − ｈａｓＳｔａｔｅ −“Ｕｎａｖａｉｌａｂｌｅ”というトリプルを得るので、これを一時的状態伝搬ストレージに追加する。
（ステップ９）一時的状態伝搬ストレージは障害原因推論モジュール１４にトリプル追加通知を送信する。
（ステップ１０）障害原因推論モジュール１４は状態更新があったインスタンスを得るため、「？ − ｈａｓＳｔａｔｅ − ？」を状態更新インスタンス要求として一時的状態伝搬ストレージに送信する。
（ステップ１１）一時的状態伝搬ストレージは、状態変更があったインスタンスとして以下を障害原因推論モジュール１４に送信する。
・ｈｏｓｔ３＿ｉ − ｈａｓＳｔａｔｅ −“ＮｏｄｅＤｏｗｎ”
・ｗｅｂ１＿ｓ − ｈａｓＳｔａｔｅ −“Ｕｎａｖａｉｌａｂｌｅ”
（ステップ１２）この結果、「ｈｏｓｔ３＿ｉ − ｈａｓＳｔａｔｅ −“ＮｏｄｅＤｏｗｎ”」から「ｗｅｂ１＿ｓ − ｈａｓＳｔａｔｅ −“Ｕｎａｖａｉｌａｂｌｅ”」に遷移可能であることが分かるため、障害原因推論モジュール１４は「ｈｏｓｔ３＿ｉ − ｈａｓＳｔａｔｅ −“ＮｏｄｅＤｏｗｎ”」が「ｗｅｂ１＿ｓ − ｈａｓＳｔａｔｅ −“Ｕｎａｖａｉｌａｂｌｅ”」の原因候補であることを知る。次に障害原因推論モジュール１４は、ステップ６で追加した仮定トリプルを取り消すため、リセット要求を一時的状態伝搬ストレージに送信する。
（ステップ１３）一時的状態伝搬ストレージ２８はステップ６からステップ８で更新した内容を消去し、リセット応答を障害原因推論モジュール１４に送信する。ステップ５で複数の状態候補が返ってきた場合、それぞれの状態についてステップ６からステップ１３を繰り返す。また、ステップ３で複数の依存インスタンスが返ってきた場合、それぞれのインスタンスについてステップ４からステップ１３を繰り返す。以上の処理の結果、原因候補のインスタンスが確定する。この例では「ｈｏｓｔ３＿ｉ − ｈａｓＳｔａｔｅ −“ＮｏｄｅＤｏｗｎ”」である。次に障害原因推論モジュール１４は確定した原因候補インスタンス「ｈｏｓｔ３＿ｉ − ｈａｓＳｔａｔｅ −“ＮｏｄｅＤｏｗｎ”」の原因を得るため、ステップ２に戻り、「？ − ｈａｓＣａｕｓａｌＲｅｌａｔｉｏｎｓｈｉｐ − ｈｏｓｔ３＿ｉ」を依存インスタンス要求として依存関係ルール展開ストレージ２３に送信する。以降、原因候補インスタンスが得られなくなるまで上記の手順を再帰的に繰り返す。
（ステップ１４）以上の結果、障害原因推論モジュール１４は図４４−（ｂ）に示すフォルトツリーを得るので、これを入出力モジュール２６に送信する。 (Step 1) The input/output module 26 sends the information "web1 cannot be browsed from host 2" to the failure cause inference module 14.
(Step 2) To find instances on which the state of web1_s depends, the failure cause inference module 14 sends the dependent instance request "? - hasCausalRelationship - web1_s" to the dependency rule expansion storage 23.
(Step 3) The dependency rule expansion storage 23 sends "host3_i - hasCausalRelationship - web1_s" to the failure cause inference module 14 as the dependent instance response.
(Step 4) To obtain the states that host3_i can take, the failure cause inference module 14 sends the state candidate request "host3_i - possibleStates - ?" to the dependency rule expansion storage 23.
(Step 5) Based on the instance representation of host3_i shown in FIG. 15, which is held in the network configuration information RDF storage 21, the dependency rule expansion storage 23 sends "NodeDown" to the failure cause inference module 14 as the state candidate response, that is, as a state that host3_i can take.
(Step 6) To check whether web1_s enters the "Unavailable" state when host3_i enters the "NodeDown" state, the failure cause inference module 14 sends the triple "host3_i - hasState - "NodeDown"" to the dependency rule expansion storage 23 as an assumed triple.
(Step 7) The dependency rule expansion storage 23 adds the assumed triple and sends an ontology update notification to the SPIN inference engine 12.
(Step 8) The SPIN inference engine 12 applies the SPIN state propagation rules to the added triple, obtains the triple "web1_s - hasState - "Unavailable"", and adds it to the temporary state propagation storage 28.
(Step 9) The temporary state propagation storage 28 sends a triple addition notification to the failure cause inference module 14.
(Step 10) To obtain the instances whose state has been updated, the failure cause inference module 14 sends "? - hasState - ?" to the temporary state propagation storage 28 as a state update instance request.
(Step 11) The temporary state propagation storage 28 sends the following to the failure cause inference module 14 as the instances whose state has changed.
・host3_i - hasState - "NodeDown"
・web1_s - hasState - "Unavailable"
(Step 12) This shows that a transition from "host3_i - hasState - "NodeDown"" to "web1_s - hasState - "Unavailable"" is possible, so the failure cause inference module 14 learns that "host3_i - hasState - "NodeDown"" is a cause candidate for "web1_s - hasState - "Unavailable"". The failure cause inference module 14 then sends a reset request to the temporary state propagation storage 28 to withdraw the assumed triple added in step 6.
(Step 13) The temporary state propagation storage 28 erases the content updated in steps 6 to 8 and sends a reset response to the failure cause inference module 14. If multiple state candidates were returned in step 5, steps 6 to 13 are repeated for each state. If multiple dependent instances were returned in step 3, steps 4 to 13 are repeated for each instance. This processing fixes the cause candidate instances; in this example, the result is "host3_i - hasState - "NodeDown"". Next, to obtain the cause of the confirmed cause candidate "host3_i - hasState - "NodeDown"", the failure cause inference module 14 returns to step 2 and sends "? - hasCausalRelationship - host3_i" to the dependency rule expansion storage 23 as a dependent instance request. The procedure is then repeated recursively until no further cause candidate instances are obtained.
(Step 14) As a result, the failure cause inference module 14 obtains the fault tree shown in FIG. 44(b) and sends it to the input/output module 26.

上記の手順では、まずステップ２〜３にて入力された障害状況と直接依存関係のあるインスタンスを得る。次に、ステップ４〜１３において上記で得られた各インスタンスから依存関係のあるインスタンスを次々とたどっていく。ステップ４〜１３の内部では、まずステップ４〜５においてインスタンスが取り得る状態を得る。次に、ステップ６〜１１において上記で得られた各状態から入力された障害状況へ遷移できるかを調べる。この過程において、ステップ８において状態伝搬ルールが展開され、ステップ１２〜１３において展開結果を削除する。以上のように、入力された障害情報と依存関係のあるインスタンスとその状態についてのみ状態伝搬ルールを展開し、その結果を調べた後に直ちに展開結果を削除する。これにより実行時のメモリ消費量を抑えている。 In the above procedure, first, an instance having a direct dependency with the failure status input in steps 2 and 3 is obtained. Next, in steps 4 to 13, the instances having a dependency relationship are sequentially traced from the respective instances obtained above. Inside steps 4 to 13, first, in steps 4 and 5, a state that the instance can take is obtained. Next, in Steps 6 to 11, it is checked whether or not each of the states obtained as described above can transition to the input failure state. In this process, the state propagation rule is developed in step 8, and the developed result is deleted in steps 12 and 13. As described above, the state propagation rules are developed only for the instances and their states that have a dependency relationship with the input fault information, and the development results are deleted immediately after checking the results. This reduces the memory consumption during execution.
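The backward search summarized above can be sketched as a recursive function: for each element that the symptom depends on, each possible state is hypothesized, tested against the propagation rules, discarded, and, if confirmed, explored further. This Python sketch is illustrative only. The dependency edges, possible states, and one-step rule table below are assumptions (FIGS. 39 to 44 are not reproduced in the text), and the check in step 8 is simplified to a single rule lookup rather than a full SPIN rule expansion.

```python
# affected element -> elements it directly depends on (reverse hasCausalRelationship)
CAUSES = {
    "web1_s": ["host3_i", "path_h2w1_i"],
    "path_h2w1_i": ["subnet3_i"],
}

# States each element can take (the possibleStates lookup of steps 4-5).
POSSIBLE_STATES = {
    "host3_i": ["NodeDown"],
    "path_h2w1_i": ["Unreachable"],
    "subnet3_i": ["NetDown"],
}

# (cause, state of cause, affected element) -> state propagated to the affected element
PROPAGATES = {
    ("host3_i", "NodeDown", "web1_s"): "Unavailable",
    ("path_h2w1_i", "Unreachable", "web1_s"): "Unavailable",
    ("subnet3_i", "NetDown", "path_h2w1_i"): "Unreachable",
}


def fault_tree(element, state, seen=frozenset()):
    """Build a tree of cause candidates for `element` being in `state`."""
    seen = seen | {element}                       # guard against dependency cycles
    children = []
    for cause in CAUSES.get(element, []):         # steps 2-3: dependent instances
        if cause in seen:
            continue
        for candidate in POSSIBLE_STATES.get(cause, []):   # steps 4-5
            temp = {cause: candidate}             # step 6: assumed triple
            result = PROPAGATES.get((cause, candidate, element))  # step 8
            if result is not None:
                temp[element] = result
            confirmed = temp.get(element) == state           # steps 10-12
            temp.clear()                          # step 13: discard the expansion
            if confirmed:
                children.append(fault_tree(cause, candidate, seen))
    return {"element": element, "state": state, "causes": children}


tree = fault_tree("web1_s", "Unavailable")
```

The temporary dictionary holds at most one hypothesis and its consequence at a time, which mirrors the memory-saving point made in the paragraph above.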

７．原因の絞込
本推論装置の障害状況応答手段が行う動作について説明する。ホスト２からｗｅｂ１が閲覧不能であることが分かったと同時に、ホスト１からはｗｅｂ１の閲覧が可能であることが分かったとする。このような情報を利用した障害原因絞り込みの手順を図４５に示す。その際に使用する絞り込みルールを図４６に示す。 7. Narrowing Down the Cause. The operation performed by the failure status response means of the inference device is described below. Suppose it is found that web1 cannot be browsed from host 2 and, at the same time, that web1 can be browsed from host 1. FIG. 45 shows the procedure for narrowing down the failure cause using such information. FIG. 46 shows the narrowing rules used in that procedure.

（ステップ１）入出力モジュール２６は「ｈｏｓｔ２からｗｅｂ１が閲覧不能」という情報を原因推論モジュールに送信する。
（ステップ２）障害原因推論モジュール１４は図３８に示した手順でｈｏｓｔ２からｗｅｂ１が閲覧不能である原因候補を得る。
（ステップ３）結果として障害原因推論モジュール１４は図４４−（ｂ）のフォルトツリーを得る。
（ステップ４）障害原因推論モジュール１４はｈｏｓｔ２のフォルトツリーを入出力モジュール２６に送信する。
（ステップ５）次に入出力モジュール２６は「ｈｏｓｔ１からｗｅｂ１が閲覧不能」という情報を原因推論モジュールに送信する。
（ステップ６）障害原因推論モジュール１４は図３８に示した手順でｈｏｓｔ１からｗｅｂ１が閲覧不能である原因候補を得る。
（ステップ７）結果として障害原因推論モジュール１４は図４４−（ａ）のフォルトツリーを得る。
（ステップ８）障害原因推論モジュール１４はｈｏｓｔ１のフォルトツリーを入出力モジュール２６に送信する。
（ステップ９）入出力モジュール２６はｈｏｓｔ２のフォルトツリー（障害状態）とｈｏｓｔ１のフォルトツリー（正常状態）とともに原因絞込要求を原因絞込モジュールに送信する。
（ステップ１０）原因推論モジュールは障害状態のフォルトツリーと正常状態のフォルトツリーにより、以下のようにして原因を絞り込む。図４４−（ａ）と（ｂ）を比較すると、下線を付した行が両方に現れている。実際にはホスト１からｗｅｂ１は閲覧可能であるので、ホスト１のフォルトツリーに現れているインスタンスは正常に動作している。したがって、下線を付した行はホスト２のフォルトツリーから削除することができる。その結果、図４４−（ｃ）の結果を得ることができる。太字で示した行は物理レベルでの障害原因候補を示す。原因絞込モジュールは結果を障害原因推論モジュール１４に送信する。
（ステップ１１）障害原因推論モジュール１４は絞込後のｈｏｓｔ２のフォルトツリーを入出力モジュール２６に送信する。 (Step 1) The input/output module 26 sends the information "web1 cannot be browsed from host2" to the failure cause inference module 14.
(Step 2) The failure cause inference module 14 obtains, by the procedure shown in FIG. 38, the cause candidates for web1 being unbrowsable from host2.
(Step 3) As a result, the failure cause inference module 14 obtains the fault tree of FIG. 44(b).
(Step 4) The failure cause inference module 14 sends the host2 fault tree to the input/output module 26.
(Step 5) Next, the input/output module 26 sends the information "web1 cannot be browsed from host1" (a failure status assumed for comparison) to the failure cause inference module 14.
(Step 6) The failure cause inference module 14 obtains, by the procedure shown in FIG. 38, the cause candidates for web1 being unbrowsable from host1.
(Step 7) As a result, the failure cause inference module 14 obtains the fault tree of FIG. 44(a).
(Step 8) The failure cause inference module 14 sends the host1 fault tree to the input/output module 26.
(Step 9) The input/output module 26 sends a cause narrowing request to the cause narrowing module, together with the host2 fault tree (failure state) and the host1 fault tree (normal state).
(Step 10) The cause narrowing module narrows down the cause using the fault tree of the failure state and the fault tree of the normal state, as follows. Comparing FIG. 44(a) and FIG. 44(b), the underlined rows appear in both. Since web1 can in fact be browsed from host 1, the instances that appear in the host 1 fault tree are operating normally. The underlined rows can therefore be deleted from the host 2 fault tree, which yields the result of FIG. 44(c). The rows shown in bold indicate failure cause candidates at the physical level. The cause narrowing module sends the result to the failure cause inference module 14.
(Step 11) The failure cause inference module 14 sends the narrowed-down host2 fault tree to the input/output module 26.
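The narrowing in step 10 is, in essence, a set difference: candidates that also appear in the fault tree of the host that can still reach web1 are known to be healthy and are removed. A minimal Python sketch follows. The candidate lists are hypothetical placeholders, not the contents of FIG. 44, and the instance names `router3_p` and `switch1_p` are invented for illustration.

```python
def narrow_causes(failing, healthy):
    """Drop every candidate that also appears in the healthy host's candidate list."""
    healthy_set = set(healthy)
    return [cand for cand in failing if cand not in healthy_set]


# Hypothetical flattened fault trees as (instance, state) pairs.
host2_candidates = [            # web1 cannot be browsed from host 2 (observed)
    ("host3_i", "NodeDown"),
    ("switch3_p", "NodeDown"),
    ("router3_p", "NodeDown"),
]
host1_candidates = [            # web1 unbrowsable from host 1 (assumed, in fact browsable)
    ("host3_i", "NodeDown"),
    ("router3_p", "NodeDown"),
    ("switch1_p", "NodeDown"),
]

remaining = narrow_causes(host2_candidates, host1_candidates)
```

Preserving the order of the failing host's list keeps the result aligned with the original fault tree when it is shown to the user.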

［ネットワーク構成情報収集モジュールの動作例］
ネットワーク構成情報収集モジュール１１としては、例えば非特許文献４等に記載される、ＫＡＮＶＡＳアーキテクチャを備えるＫＡＮＶＡＳシステムが例示できる。図４７に、ＫＡＮＶＡＳシステムの構成例を示す。ＫＡＮＶＡＳシステムは、情報収集装置として機能するＫＡＮＶＡＳＩｎｆｏｒｍａｔｉｏｎＣｏｌｌｅｃｔｏｒ（ＫＩＣ）３０、ストレージサーバ装置として機能するＫＡＮＶＡＳＳｔｏｒａｇｅＳｅｒｖｅｒ（ＫＳＳ）２０、アクセスサーバ装置として機能するＫＡＮＶＡＳＡｃｃｅｓｓＳｅｒｖｅｒ（ＫＡＳ）１０、及びＫＡＮＶＡＳＩｎｓｔａｎｃｅＧｅｎｅｒａｔｏｒ（ＫＩＧ）５０という４つの主要なモジュールを備える。 [Operation example of network configuration information collection module]
Examples of the network configuration information collection module 11 include a KANVAS system having the KANVAS architecture described in, for example, Non-Patent Document 4. FIG. 47 shows a configuration example of the KANVAS system. The KANVAS system has four main modules: the KANVAS Information Collector (KIC) 30, which functions as an information collecting device; the KANVAS Storage Server (KSS) 20, which functions as a storage server device; the KANVAS Access Server (KAS) 10, which functions as an access server device; and the KANVAS Instance Generator (KIG) 50.

ＫＡＳ１０は、管理者４２及びユーザ４３といったエンドノードの使用可能なアプリケーションである。
ＫＳＳ２０は論理的にはＡＳ（ＡｕｔｏｎｏｍｏｕｓＳｙｓｔｅｍ：統一された管理ポリシによって運用されているネットワークの範囲）ネットワーク４４に１つ存在する。負荷分散のため物理的には複数のノードに存在してもよいが、論理的には１つであるとする。ＫＡＳ１０とＫＩＣ３０はＡＳネットワーク４４の規模により、ＡＳネットワーク４４内に１つ設置される場合もあれば、負荷分散のため複数設置される場合もある。 The KAS 10 is an application that can be used by end nodes such as the administrator 42 and the user 43.
One KSS 20 logically exists in an AS (Autonomous System: network range operated by a unified management policy) network 44. Although it may physically exist in a plurality of nodes for load distribution, it is assumed that there is only one logically. Depending on the size of the AS network 44, the KAS 10 and the KIC 30 may be provided one in the AS network 44, or may be provided in plurals for load distribution.

ＫＩＣ３０は、ＡＳネットワーク４４から経路情報や機器情報などのさまざまなネットワーク情報を集める。例えば、経路情報はＲＦＣ２３２８で規定されるＯＳＰＦ（ＯｐｅｎＳｈｏｒｔｅｓｔＰａｔｈＦｉｒｓｔ）やＲＦＣ４２７１で規定されるＢＧＰ−４（ＢｏｒｄｅｒＧａｔｅｗａｙＰｒｏｔｏｃｏｌ４）などのプロトコルに参加して収集する。また、機器情報や統計情報は、ＲＦＣ３４１６で規定されるＳＮＭＰ（ＳｉｍｐｌｅＮｅｔｗｏｒｋＭａｎａｇｅｍｅｎｔＰｒｏｔｏｃｏｌ）やＲＦＣ６２４１で規定されるＮＥＴＣＯＮＦによって各機器にアクセスして収集する。ネットワークのフロー情報は、ＲＦＣ３１７６で規定されるｓＦｌｏｗやＲＦＣ３９５４で規定されるＮｅｔＦｌｏｗを利用して収集する。障害情報は、ＣＬＩＮＥＸ（特許文献１）などを利用して収集する。そして得られた情報をネットワークオントロジＢｏｎｓａｉのインスタンスとして表現し、知識ベース（ネットワーク構成情報ＲＤＦストレージ２１）に格納する。 The KIC 30 collects various network information such as route information and device information from the AS network 44. For example, the route information is collected by participating in a protocol such as OSPF (Open Shortest Path First) defined by RFC2328 and BGP-4 (Border Gateway Protocol 4) defined by RFC4271. In addition, device information and statistical information are collected by accessing each device using SNMP (Simple Network Management Protocol) defined by RFC3416 or NETCONF defined by RFC6241. Network flow information is collected using sFlow defined by RFC3176 and NetFlow defined by RFC3954. The failure information is collected using CLINEX (Patent Document 1) or the like. The obtained information is expressed as an instance of the network ontology Bonsai and stored in the knowledge base (network configuration information RDF storage 21).

ＫＩＧ５０は、ＡＳネットワーク４４に接続した機器で動作し、指定された対象範囲（たとえば企業内ネットワーク）のネットワーク構成を自動的に収集してネットワーク構成を検出し、ＫＳＳ２０のデータベース（ネットワーク構成情報ＲＤＦストレージ２１）に格納する。 The KIG 50 runs on a device connected to the AS network 44, automatically collects information on the network configuration within a specified target range (for example, a corporate network) to detect that configuration, and stores it in the database of the KSS 20 (the network configuration information RDF storage 21).

図４８にＫＩＧの内部構造を示す。ＫＩＧ５０は、ＡＰＩモジュール５１と、記憶情報取得部として機能するＫＳＳインタフェースモジュール５２と、コントローラモジュール５３と、追加情報取得部として機能するＭＩＢ（Ｍａｎａｇｅｍｅｎｔｉｎｆｏｒｍａｔｉｏｎｂａｓｅ）取得モジュール５４及びサービス判定モジュール５５と、ネットワーク構成検出部として機能するインスタンス生成モジュール５６と、を備える。ＫＩＧ５０は、コンピュータにプログラムを実行させることで、ＫＩＧ５０に備わる各機能部を実現させたものであってもよい。 FIG. 48 shows the internal structure of the KIG. The KIG 50 includes an API module 51; a KSS interface module 52 that functions as a stored information acquisition unit; a controller module 53; a MIB (Management Information Base) acquisition module 54 and a service determination module 55 that function as additional information acquisition units; and an instance generation module 56 that functions as a network configuration detection unit. The functional units of the KIG 50 may be implemented by causing a computer to execute a program.

ＡＰＩモジュール５１はユーザ４３やアプリケーション４１がＫＩＧ５０にアクセスするためのインタフェースを提供する。ＫＳＳインタフェースモジュール５２は、ＫＩＧ５０がＫＳＳ２０のデータベースに蓄えられている情報を得たり、ＫＩＧ５０が検出したネットワーク構成の情報をＫＳＳ２０のデータベースに蓄える処理を行う。これにより、ＫＳＳインタフェースモジュール５２はＡＳＡ３のネットワーク情報が格納されたデータベースから取得することができる。 The API module 51 provides an interface for the user 43 and the application 41 to access the KIG 50. The KSS interface module 52 performs a process in which the KIG 50 obtains information stored in the database of the KSS 20 and stores information of the network configuration detected by the KIG 50 in the database of the KSS 20. As a result, the KSS interface module 52 can acquire from the database in which the ASA3 network information is stored.

The KSS interface module 52 may acquire information from any database of the KSS 20. For example, FIG. 47 shows a case in which the KSS 20 has two kinds of databases: an RDF database (network configuration information RDF storage 21), which stores data described in RDF (Resource Description Framework), and a time-series database, which stores time-series data. In this case, the KSS interface module 52 may acquire information from either the RDF database or the time-series database.

The controller module 53 controls the operation of the KIG 50. The MIB acquisition module 54 accesses network devices via SNMP and acquires the various MIBs defined in RFC 1213 and RFC 3418. The service determination module 55 accesses network devices and determines whether services are running on them. Based on the information obtained by the MIB acquisition module 54 and the service determination module 55, the instance generation module 56 expresses the network configuration of the target range as instances of Bonsai and stores them in the KSS 20.

The operation of the KIG is described below using an example. FIG. 49 shows an example of an AS network viewed in terms of IP subnets. This AS network consists of five routers (router #1 to router #5) and six IP subnets (subnet #1 to subnet #6).

A server machine, an HTTP server (Web server), an SMTP server (mail server), a DNS server, the KSS 20, and the KIG 50 are connected to subnet #1. A DHCP server runs on router #5. OSPF is assumed to run between the routers as the routing protocol, and the KIC 30 is assumed to have collected the OSPF LSDB (Link State Database) and stored it in the KSS 20.

FIG. 50 shows the physical configuration of the above AS network. Every subnet is built around a switch (switch SW#1 to switch SW#5). Subnets #4 and #5 are VLANs (Virtual LANs) that share switch SW#4.

Of the devices connected to subnet #1 (the server machine, HTTP server, SMTP server, DNS server, KSS 20, and KIG 50), the HTTP server and the SMTP server are virtual machines running on the server machine. Wi-Fi access point #1 is installed on subnet #6, and host #7 connects to it via Wi-Fi. LLDP (Link Layer Discovery Protocol), defined in IEEE 802.1AB, is assumed to run on each switch and on access point #1.

Suppose that a user or an application starts the network configuration instance generation module 56 via the API module 51 of the KIG 50. The target range in which the network configuration is to be detected is specified by, for example, a network prefix. In this example, a network prefix that covers subnets #1 to #6 is assumed to have been specified.

FIG. 51 shows a flowchart of the network configuration detection method according to the present embodiment. In this method, the KIG 50 executes, in order, a stored information acquisition procedure (S101), an additional information acquisition procedure (S102 to S107), and a network configuration detection procedure (S108 to S109).
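The three-phase flow can be pictured as a small pipeline in which each step receives the information gathered so far. The following is only an illustrative sketch: `run_detection` and the placeholder steps are hypothetical names, not part of the KIG, and real steps would query the KSS or the devices.

```python
def run_detection(stored_steps, additional_steps, detection_steps):
    """Run the three procedures in order; each step sees the info gathered so far."""
    info = {}
    for phase in (stored_steps, additional_steps, detection_steps):
        for step in phase:
            # Each step returns a dict of newly learned facts to merge in.
            info.update(step(info))
    return info

# Placeholder steps (hypothetical); they only mimic the data flow of S101, S102, S108.
s101 = lambda info: {"routers": ["router1", "router2"]}
s102 = lambda info: {"interfaces": {r: [] for r in info["routers"]}}
s108 = lambda info: {"instances": sorted(info["interfaces"])}

result = run_detection([s101], [s102], [s108])
```

The point of the ordering is that later steps (S102 onward) can only address devices that an earlier step has already discovered.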

Step S101. Acquisition of IP network configuration information (1): acquisition of IP network configuration information from the routers in the target range.
In the example of FIG. 50, the controller module 53 of the KIG 50 accesses the KSS 20 via the KSS interface module 52 and acquires information on the network configuration of the target range from the database. If the LSDB of the target range is stored in the database of the KSS 20, the controller module 53 obtains the LSDB, which lists the IP addresses of routers #1 to #5. The KIG 50 thereby learns the IP nodes (reference sign C3 in FIG. 2) of routers #1 to #5.

Step S102. Acquisition of IP network configuration information (2): acquisition of interface information and inter-router relationship information for the routers in the target range.
In the example of FIG. 50, the controller module 53 of the KIG 50 next queries routers #1 to #5 by SNMP via the MIB acquisition module 54 and obtains information on the interfaces of each router (interface type, IP address, netmask, and so on). For example, it refers to MIB objects such as ifType, ipAdEntAddr, and ipAdEntNetMask. The KIG 50 thereby obtains the IP subnets C1, IP interfaces C2, and IP network configuration C4 for routers #1 to #5.

That is, the following information, shown in FIG. 52, is obtained.
- Router #1, router #2, and router #3 are connected to subnet #1.
- Router #2, router #4, and router #5 are connected to subnet #2.
- Router #3 is connected to subnet #3.
- Router #4 is connected to subnet #4.
- Router #4 is connected to subnet #5.
- Router #5 is connected to subnet #6.
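One way to derive such a subnet-to-router table from the address and netmask pairs read in step S102 is sketched below. It assumes the SNMP results have already been fetched into plain tuples; the function name `routers_by_subnet` and the sample addresses are illustrative and do not come from the source.

```python
import ipaddress
from collections import defaultdict

def routers_by_subnet(interfaces):
    """Group routers by the IP subnet each interface belongs to.

    interfaces: iterable of (router_name, ip_address, netmask) tuples,
    e.g. values read from ipAdEntAddr and ipAdEntNetMask.
    """
    result = defaultdict(set)
    for router, addr, mask in interfaces:
        # strict=False accepts a host address and returns its enclosing network.
        net = ipaddress.ip_network(f"{addr}/{mask}", strict=False)
        result[net].add(router)
    return dict(result)

# Illustrative documentation-range addresses (assumed, not from the source).
sample = [
    ("router1", "192.0.2.1", "255.255.255.192"),
    ("router2", "192.0.2.2", "255.255.255.192"),
    ("router2", "192.0.2.65", "255.255.255.192"),
    ("router4", "192.0.2.66", "255.255.255.192"),
]
subnets = routers_by_subnet(sample)
```

A router that appears under two subnets, such as `router2` here, is what links those subnets in the IP network configuration.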

Step S103. Acquisition of IP network configuration information (3): acquisition of information on Layer-3 devices other than routers in the target range.
In the example of FIG. 50, the controller module 53 of the KIG 50 queries routers #1 to #5 by SNMP via the MIB acquisition module 54 and obtains a table mapping the IP addresses of the devices connected to each subnet to their MAC addresses. For example, it refers to MIB objects such as ipNetToMediaTable. The KIG 50 thereby obtains the IP node C3, logical node D3, and physical node E3 of each device.

That is, the following information, shown in FIG. 53, is obtained.
- Five devices are connected to subnet #1 in addition to routers #1, #2, and #3, the KIG 50, and the KSS 20.
- Two devices are connected to subnet #2 in addition to routers #2, #4, and #5.
- Two devices are connected to subnet #3 in addition to router #3.
- Two devices are connected to subnet #4 in addition to router #4.
- Two devices are connected to subnet #5 in addition to router #4.
- Four devices are connected to subnet #6 in addition to router #5.
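The per-subnet device lists can be obtained by filtering the IP-to-MAC entries against the known router addresses. The sketch below assumes the ipNetToMediaTable rows are already available as `(ip, mac)` pairs; `non_router_devices` and all addresses are hypothetical.

```python
import ipaddress

def non_router_devices(arp_entries, subnets, router_ips):
    """Return {subnet: sorted [(ip, mac), ...]} for entries that are not routers."""
    nets = [ipaddress.ip_network(s) for s in subnets]
    routers = {ipaddress.ip_address(r) for r in router_ips}
    found = {net: set() for net in nets}
    for ip_text, mac in arp_entries:
        ip = ipaddress.ip_address(ip_text)
        if ip in routers:
            continue  # routers were already discovered in earlier steps
        for net in nets:
            if ip in net:
                # A set removes duplicates reported by several routers.
                found[net].add((ip_text, mac.lower()))
                break
    return {str(net): sorted(devs) for net, devs in found.items()}
```

Because several routers on the same subnet usually report the same neighbor, deduplicating on the normalized `(ip, mac)` pair keeps the device count per subnet accurate.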

Step S104. Acquisition of physical/logical network configuration information (1): acquisition of information on Layer-2 devices in the target range.
In the example of FIG. 50, the controller module 53 of the KIG 50 queries each router by SNMP via the MIB acquisition module 54 and obtains LLDP information. For example, it refers to objects such as lldpRemTable defined in LLDP-MIB. For Layer-2 devices that have an IP address, such as switches, that address is also obtained. The KIG 50 thereby obtains the physical nodes E3 that function as connecting devices.

That is, the following information, shown in FIG. 54, is obtained.
- Switch SW#1 is connected to subnet #1.
- Switch SW#2 is connected to subnet #2.
- Switch SW#3 is connected to subnet #3.
- Switch SW#4-1 is connected to subnet #4.
- Switch SW#4-2 is connected to subnet #5.
- Switch SW#5 and access point #1 are connected to subnet #6.

Step S105. Acquisition of physical/logical network configuration information (2): acquisition of relationship information between Layer-3 devices and Layer-2 devices in the target range.
The controller module 53 acquires, from the network devices that control routes in the target range, information on the downstream network that each device controls, and uses it to identify the logical network configuration of the target range. In the example of FIG. 50, the controller module 53 of the KIG 50 queries switches SW#1 to SW#5 and access point #1 by SNMP via the MIB acquisition module 54 and obtains the MAC addresses of the devices connected to each switch and to access point #1. For example, it refers to MIB objects such as dot1dTpPortTable. By matching these results against the MAC addresses obtained in step S103, the KIG 50 obtains the logical nodes D3, logical interfaces D2, logical links D1, logical network configuration D4, physical interfaces E2, physical links E1, and physical network configuration E4.

That is, the following information, shown in FIG. 55, is obtained.
- Router #1, router #2, router #3, the KSS 20, the KIG 50, host #1-1, host #1-2, host #1-3, and host #1-4 are connected to switch SW#1.
- Router #2, router #4, router #5, and host #2 are connected to switch SW#2.
- Switch SW#4-1 and switch SW#4-2 are physically a single switch (switch SW#4).
- Subnet #4 and subnet #5 are VLANs and share switch SW#4.
- Router #4, host #4, and host #5 are connected to switch SW#4.
- Host #6 and access point #1 are connected to switch SW#5.
- Host #7 is connected to access point #1.
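The matching of switch-side MAC addresses against the IP-to-MAC table from step S103 might look like the sketch below. It is a simplification under stated assumptions: the function name and data shapes are hypothetical, and it ignores the fact that a real switch also learns MAC addresses on uplink ports, which would have to be filtered out.

```python
def attach_devices_to_switches(switch_macs, ip_by_mac):
    """Join MACs seen on each switch with the IP-to-MAC table from step S103.

    switch_macs: {switch_name: [mac, ...]} learned from each switch.
    ip_by_mac:   {mac: ip} built from the routers' IP-to-MAC tables.
    Returns (attached, unknown): IPs resolved per switch, and MACs with no match.
    """
    # Normalize case so that vendor formatting differences do not break the join.
    known = {mac.lower(): ip for mac, ip in ip_by_mac.items()}
    attached, unknown = {}, {}
    for switch, macs in switch_macs.items():
        for mac in macs:
            key = mac.lower()
            if key in known:
                attached.setdefault(switch, []).append(known[key])
            else:
                unknown.setdefault(switch, []).append(key)
    return attached, unknown
```

Unmatched MAC addresses are kept rather than dropped, since they may belong to Layer-2-only devices that never appear in a router's IP-to-MAC table.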

Step S106. Distinguishing logical network configuration information from physical network configuration information.
In the example of FIG. 50, the controller module 53 of the KIG 50 queries each host by SNMP via the MIB acquisition module 54 to determine whether a hypervisor providing a virtual machine environment is running and, if so, which virtual machines are running on it. For example, it uses objects such as vmMIB from the MIB defined in RFC 7666. As a result, the KIG 50 learns that host #1-2 and host #1-3 run as virtual machines (VMs) on host #1-1. That is, the information shown in FIG. 56 is obtained.

Step S107. Acquisition of network service configuration information.
In the example of FIG. 50, the controller module 53 of the KIG 50 accesses, via the service determination module 55, each terminal (that is, each host) in the target range and identifies the network service configuration B3 of the target range. Here, access means, for example, accessing the ports that correspond to network servers on routers #1 to #5 and hosts #1-1 to #7. The port to access depends on the service: for example, port 80 for an HTTP server, port 25 for an SMTP server, port 53 for a DNS server, and port 67 for a DHCP server. Specifically, for services that use TCP (Transmission Control Protocol), such as an HTTP server or an SMTP server, the module attempts to establish a TCP connection to the corresponding port number.

If the TCP connection is established, the corresponding server is judged to be running. If an ICMP (Internet Control Message Protocol) port unreachable message is returned or the TCP connection cannot be established, the corresponding server is judged not to be running.
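The TCP check described here amounts to a connect attempt with a timeout. A minimal sketch using Python's standard `socket` module follows; the function name `tcp_service_running` and the default timeout are assumptions, not part of the service determination module.

```python
import socket

def tcp_service_running(host, port, timeout=2.0):
    """Judge a TCP service as running if a connection can be established."""
    try:
        # create_connection performs the TCP handshake; the socket is closed on exit.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, unreachable hosts or ports, and timeouts.
        return False
```

For instance, `tcp_service_running(host, 80)` would correspond to the HTTP server check and `tcp_service_running(host, 25)` to the SMTP server check. A successful handshake only shows that something is listening on the port, not that it speaks the expected protocol.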

For services that use UDP (User Datagram Protocol), such as a DNS server or a DHCP server, a UDP segment is sent to the corresponding port number. If an ICMP port unreachable message or a segment indicating an error is returned, the corresponding server is judged not to be running. Otherwise, the corresponding server is judged to be running.
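The UDP rule can be approximated with a connected UDP socket, on which the operating system typically reports an ICMP port unreachable as a socket error. This is a sketch under assumptions: `udp_service_running`, the one-byte payload, and the timeout are illustrative, it is IPv4 only, and it does not inspect protocol-level error replies, so any reply counts as "running".

```python
import socket

def udp_service_running(host, port, payload=b"\x00", timeout=1.0):
    """Judge a UDP service as running unless an error comes back."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.settimeout(timeout)
        try:
            # Connecting the UDP socket lets ICMP errors surface on send/recv.
            sock.connect((host, port))
            sock.send(payload)
            sock.recv(2048)
            return True   # some reply arrived
        except socket.timeout:
            return True   # silence: no error was reported, so judged running
        except OSError:
            return False  # e.g. connection refused caused by ICMP port unreachable
```

Because silence is treated as success, a firewall that drops ICMP messages makes every probed port look open. That limitation is inherent in the rule as stated, not specific to this sketch.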

As a result, the KIG 50 learns that an HTTP server runs on host #1-2, an SMTP server on host #1-3, a DNS server on host #1-4, and a DHCP server on router #5. The KIG 50 thereby obtains the service entities B2, network services B1, and network service configuration B3 of the network service B. That is, the physical configuration information shown in FIG. 56 is obtained.

Through steps S102 to S107, the KIG 50 identifies the devices in the target range using the information acquired by the KSS interface module 52 and then acquires, from those devices, the parts of the network configuration of the target range that the KSS interface module 52 could not acquire.

Step S108.
The controller module 53 of the KIG 50 passes the obtained network configuration information to the instance generation module 56. The instance generation module 56 expresses the network configuration of the target range in a predetermined format and generates instances.
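As an illustration of what expressing the configuration "in a predetermined format" could involve, the sketch below turns a subnet membership table into RDF-style subject, predicate, object triples. The `ex:` prefix and the names `ex:IPSubnet`, `ex:IPNode`, and `ex:connectedTo` are placeholders invented for this example; they are not the actual Bonsai vocabulary, which is not reproduced in this section.

```python
def to_triples(subnet_members):
    """Turn {subnet: {node, ...}} into a list of (subject, predicate, object) triples."""
    triples = []
    for subnet, nodes in sorted(subnet_members.items()):
        triples.append((f"ex:{subnet}", "rdf:type", "ex:IPSubnet"))
        for node in sorted(nodes):
            triples.append((f"ex:{node}", "rdf:type", "ex:IPNode"))
            triples.append((f"ex:{node}", "ex:connectedTo", f"ex:{subnet}"))
    # A node on several subnets yields repeated type triples; drop them, keeping order.
    return list(dict.fromkeys(triples))

triples = to_triples({"subnet1": {"router1", "router3"}, "subnet3": {"router3"}})
```

In a real deployment these triples would be serialized and handed to the RDF store; plain tuples are used here only to keep the example free of third-party libraries.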

Step S109.
The controller module 53 of the KIG 50 sends the resulting instance representation to the KSS 20 via the KSS interface module 52.

[Effects of the Invention]
- Using the network ontology Bonsai as the data model makes both fault influence range inference and fault cause inference possible.
- In fault influence range inference, the dependency rules are expanded when the network configuration is determined, so they do not need to be expanded at run time. This reduces memory consumption at run time and allows fast execution.
- In fault cause inference, analysis rules are not expanded for instances that have no dependency relationship or for state propagation unrelated to the symptom. This avoids an exhaustive search and makes the method applicable to large-scale systems.
- In fault cause inference, the candidate causes can be narrowed down using multiple pieces of observation information.
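The ideas in this list can be illustrated with a plain dependency graph. The sketch below is not the SPIN rule machinery of the disclosure; it is a simplified stand-in in which dependencies are expanded once into an "affects" map, the influence range is a forward traversal of that map, and causes are narrowed by intersecting the elements reachable backward from each observed symptom (which assumes a single common cause). All function names and the sample topology are hypothetical.

```python
from collections import deque

def expand_dependencies(depends_on):
    """Precompute, once per network configuration, which elements each element affects."""
    affects = {}
    for element, deps in depends_on.items():
        affects.setdefault(element, set())
        for dep in deps:
            affects.setdefault(dep, set()).add(element)
    return affects

def _reachable(graph, start):
    """Breadth-first search returning every node reachable from start, including start."""
    seen, queue = {start}, deque([start])
    while queue:
        for nxt in graph.get(queue.popleft(), ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen

def influence_range(affects, failed):
    """Elements whose state may be affected when `failed` fails."""
    return _reachable(affects, failed) - {failed}

def narrow_causes(depends_on, symptoms):
    """Intersect the candidate causes of several observed symptoms.

    Only elements that a symptom depends on are visited, so unrelated
    parts of the network are never explored.
    """
    candidate_sets = [_reachable(depends_on, s) for s in symptoms]
    return set.intersection(*candidate_sets) if candidate_sets else set()

# Hypothetical topology: two services on two VMs on one server behind one switch.
depends_on = {
    "http": ["vm2"], "smtp": ["vm3"],
    "vm2": ["server"], "vm3": ["server"],
    "server": ["sw1"],
}
affects = expand_dependencies(depends_on)
```

With this topology, a failure of `server` reaches both virtual machines and both services, while observing failures of `http` and `smtp` together leaves only `server` and `sw1` as common candidate causes.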

[Gist of the Invention]
- In fault influence range inference, the dependency rules are expanded against the network configuration information in advance and retained.
- In fault cause inference, analysis rules are not expanded for instances that have no dependency relationship or for state propagation unrelated to the symptom.
- In fault cause inference, the candidate causes can be narrowed down using multiple pieces of observation information.

10: KANVAS Access Server (KAS)
11: Network configuration information collection module
12: SPIN inference engine
13: Influence range inference module
14: Failure cause inference module
15: Failure cause narrowing module
20: KANVAS Storage Server (KSS)
21: Network configuration information RDF storage
22: SPIN dependency rule storage
23: Dependency rule expansion storage
24: SPIN state propagation rule storage
25: Rule input module
26: Input/output module
27: Temporary dependency rule expansion storage
30: KANVAS Information Collector (KIC)
44: Network
50: KANVAS Instance Generator (KIG)
51: API module
52: KSS interface module
53: Controller module
54: MIB acquisition module
55: Service determination module
56: Instance generation module

Claims

1. A fault influence range inference device comprising:
setting means for setting a dependency rule, which describes a dependency relationship between two elements whose states depend on each other among a plurality of elements constituting a network, and a state propagation rule, which describes the content of the state propagated between the two elements;
storage means for collecting information on the configuration of the network, and extracting and storing the dependency rules according to the configuration of the network; and
range estimating means for, when failure information on one of the elements is input, selecting from the stored dependency rules the elements that have an effect on the state, detecting the state propagation rule based on the selected elements, and estimating the range of elements affected by the failure information.

2. The fault influence range inference device according to claim 1, further comprising failure information response means for returning, as a response to the input failure information, the effect on the services of the network derived from the estimated range of elements.

3. A failure cause inference device comprising:
setting means for setting a dependency rule, which describes a dependency relationship between two elements whose states depend on each other among a plurality of elements constituting a network, and a state propagation rule, which describes the content of the state propagated between the two elements;
storage means for collecting information on the configuration of the network, and extracting and storing the dependency rules according to the configuration of the network; and
cause estimating means for, when a failure situation of the network is input, selecting from the stored dependency rules one or more elements related to the failure situation, searching for one or more states that the selected elements can take, detecting the state propagation rule based on the selected elements and the retrieved states, and performing cause estimation that estimates a list of the elements causing the failure situation and the states of those elements.

4. The failure cause inference device according to claim 3, further comprising failure situation response means for causing the cause estimating means to perform the cause estimation for another failure situation assumed to be similar to the failure situation, thereby estimating another list of the elements causing the other failure situation and the states of those elements, and for returning, as a response to the input failure situation, the result of excluding from the list the items common to the list and the other list.

5. A fault influence range inference method comprising performing:
a setting procedure of setting a dependency rule, which describes a dependency relationship between two elements whose states depend on each other among a plurality of elements constituting a network, and a state propagation rule, which describes the content of the state propagated between the two elements;
a storage procedure of collecting information on the configuration of the network, and extracting and storing the dependency rules according to the configuration of the network; and
a range estimation procedure of, when failure information on one of the elements is input, selecting from the stored dependency rules the elements that have an effect on the state, detecting the state propagation rule based on the selected elements, and estimating the range of elements affected by the failure information.

6. A failure cause inference method comprising performing:
a setting procedure of setting a dependency rule, which describes a dependency relationship between two elements whose states depend on each other among a plurality of elements constituting a network, and a state propagation rule, which describes the content of the state propagated between the two elements;
a storage procedure of collecting information on the configuration of the network, and extracting and storing the dependency rules according to the configuration of the network; and
a cause estimation procedure of, when a failure situation of the network is input, selecting from the stored dependency rules one or more elements related to the failure situation, searching for one or more states that the selected elements can take, detecting the state propagation rule based on the selected elements and the retrieved states, and estimating a list of the elements causing the failure situation and the states of those elements.

7. A program for causing a computer to function as the fault influence range inference device according to claim 1.

8. A program for causing a computer to function as the failure cause inference device according to claim 3.