JP4572211B2

JP4572211B2 - Network system, network relay device

Info

Publication number: JP4572211B2
Application number: JP2007090431A
Authority: JP
Inventors: 亘史高村
Original assignee: NEC Computertechno Ltd
Current assignee: NEC Computertechno Ltd
Priority date: 2007-03-30
Filing date: 2007-03-30
Publication date: 2010-11-04
Anticipated expiration: 2027-03-30
Also published as: JP2008252458A

Description

本発明は、ネットワーク・システム、ネットワーク中継装置に関する。 The present invention relates to a network system and a network relay device.

ネットワークを監視するシステムとして、ＳＮＭＰ（Simple Network Management Protocol）やＲＭＯＮ（Remote Network Monitoring）という機能を有するものが知られるが、これらのシステムでは、ネットワーク関連情報を収集して、ネットワークを監視する。ＳＮＭＰでは、ＭＩＢ（Management Information Base）と呼ばれる情報ＩＤを定義し、この定義に従って自装置の情報を格納すると共に、他装置からのリクエストに応じてその格納値を返す。ＲＭＯＮでは自動的に特定の情報を取得して、装置に蓄積する仕組みを有し、かつ、リクエストに応じなくともデータを配信する仕組みを備える。また、両技術ともトラップという機能を備え、ネットワーク中継装置ではない専用のマネージャ装置がそれを受け取り、その情報を利用するのみである。 As systems for monitoring a network, those having a function called Simple Network Management Protocol (SNMP) or Remote Network Monitoring (RMON) are known. In these systems, network-related information is collected to monitor the network. In SNMP, an information ID called MIB (Management Information Base) is defined, information on the own device is stored according to this definition, and a stored value is returned in response to a request from another device. RMON has a mechanism for automatically acquiring specific information and storing it in the apparatus, and also has a mechanism for distributing data without responding to a request. In addition, both technologies have a function called a trap, and a dedicated manager device that is not a network relay device receives the information and only uses the information.

ネットワーク関連の障害を解析する場合、ネットワーク機器や端末装置に備えられたログ採取機能により採取されたログ情報を元に、発生原因ならびに原因装置を推測することが主である。しかし、ネットワークでの障害については接続相手の有する情報も考慮する必要があり、障害発生装置のログ情報だけでは解析においては不十分であることがほとんどである。また、機器の有する情報収集機能は常に所望のデータがとられているわけでもなく、管理者が所望のデータ取得のコマンドを入力することで収集することがほとんどである。そのため、ネットワーク内の機器が同調して障害発生の前後における時間帯の情報を記録することは困難であった。 When analyzing a network-related failure, it is mainly to infer the cause and the cause device based on the log information collected by the log collection function provided in the network device or terminal device. However, regarding the failure in the network, it is necessary to consider the information held by the connection partner, and the log information of the failure occurrence device alone is often insufficient for analysis. In addition, the information collection function of the device does not always take the desired data, but it is almost always collected by the administrator inputting a desired data acquisition command. For this reason, it is difficult to record time zone information before and after the occurrence of a failure in synchronization with devices in the network.

特開２００２−２５２６１４号公報（特許文献１参照）には、「記録媒体」の発明が記載されている。この発明は、ネットワークの状態を監視する処理をコンピュータに機能させるプログラムを記録したコンピュータ読み取り可能な記録媒体である。コンピュータを、以下の手段として機能させるプログラムを記録する。監視内容設定手段は、監視内容を設定する。監視手段は、前記監視内容設定手段によって設定された監視内容にてネットワークを監視する。監視内容変更手段は、前記監視手段の監視結果に応じて、前記監視内容設定手段によって設定された監視内容を変更する。 Japanese Patent Laid-Open No. 2002-252614 (see Patent Document 1) describes the invention of “recording medium”. The present invention is a computer-readable recording medium that records a program that causes a computer to function to monitor a network state. A program for causing the computer to function as the following means is recorded. The monitoring content setting means sets the monitoring content. The monitoring means monitors the network with the monitoring content set by the monitoring content setting means. The monitoring content changing unit changes the monitoring content set by the monitoring content setting unit according to the monitoring result of the monitoring unit.

特開２００５−１６７３４７号公報（特許文献２参照）には、「ネットワーク監視プログラム」の発明が記載されている。この発明は、ネットワーク上の障害発生箇所を検出するためのネットワーク監視プログラムである。コンピュータを、以下の手段として機能させる。記憶手段は、前記ネットワーク上で障害の発生原因となり得る要素が予め分類され、分類された要素に対して、前記ネットワークを介した通信の異常を示す事象が対応付けられた障害箇所判定テーブルを記憶する。通信状況監視手段は、前記ネットワーク上の他の機器との間の通信状況を監視する。異常検出手段は、前記通信状況監視手段で検出された通信内容から異常を示す事象を検出する。障害箇所判定手段は、前記障害箇所判定テーブルを参照し、前記異常検出手段で検出された事象の発生原因となる要素を判定する。障害情報出力手段は、前記障害箇所判定手段での判定結果を示す障害情報を出力する。 Japanese Patent Laying-Open No. 2005-167347 (see Patent Document 2) describes the invention of a “network monitoring program”. The present invention is a network monitoring program for detecting a fault occurrence location on a network. The computer is caused to function as the following means. The storage means stores in advance a failure location determination table in which elements that may cause failures on the network are classified in advance, and events indicating an abnormality in communication via the network are associated with the classified elements. To do. The communication status monitoring means monitors the communication status with other devices on the network. The abnormality detection unit detects an event indicating an abnormality from the communication content detected by the communication status monitoring unit. The failure location determination unit refers to the failure location determination table and determines an element that causes the occurrence of the event detected by the abnormality detection unit. The failure information output means outputs failure information indicating the determination result by the failure location determination means.

特開２００５−２８６６８４号公報（特許文献３参照）には、「計測システム」の発明が記載されている。計測システムは、複数のネットワーク装置からなるネットワーク内のトラフィック情報を把握するためにネットワークを流れるトラフィック情報を出力するネットワーク装置と任意のネットワーク装置からトラフィック情報を収集し、ネットワークの性能を把握する計測サーバとで構成される。計測サーバにおいては、算出する手段は、ネットワーク装置から収集されたトラフィック情報をもとに任意のネットワーク装置から出力されるパケットの集合体であるフローの統計情報を算出する。同定する手段は、フローの統計情報が更なる監視を要求する条件を満たしたときに、そのフローを構成するパケットが通過する別なネットワーク装置を同定する。要求する手段は、前記同定されたネットワーク装置から、前記更なる監視を要求するフロー情報を監視するためにトラフィック監視を要求する。 Japanese Patent Laying-Open No. 2005-286684 (see Patent Document 3) describes an invention of a “measurement system”. The measurement system is a measurement server that collects traffic information from a network device that outputs traffic information that flows through the network and an arbitrary network device to grasp traffic information in the network that consists of multiple network devices, and grasps the network performance It consists of. In the measurement server, the calculating means calculates statistical information of a flow that is an aggregate of packets output from an arbitrary network device based on traffic information collected from the network device. The identifying means identifies another network device through which a packet constituting the flow passes when the statistical information of the flow satisfies a condition that requires further monitoring. Requesting means requests traffic monitoring from the identified network device to monitor flow information requesting further monitoring.

特開２００２−２５２６１４号公報JP 2002-252614 A 特開２００５−１６７３４７号公報JP 2005-167347 A 特開２００５−２８６６８４号公報JP 2005-286684 A

本発明の課題は、一のネットワーク中継装置が障害兆候を検出したときに、他のネットワーク中継装置へ、監視強化を要請することである。本発明の他の課題は、一のネットワーク中継装置が、他のネットワーク中継装置から、監視強化を要請されたときに、当該一のネットワーク中継装置において、監視強化を実施することである。 An object of the present invention is to request monitoring enhancement from another network relay device when one network relay device detects a failure sign. Another object of the present invention is to perform monitoring strengthening in one network relay device when the one network relay device is requested to strengthen monitoring from another network relay device.

以下に、［発明を実施するための最良の形態］で使用される番号・符号を用いて、［課題を解決するための手段］を説明する。これらの番号・符号は、［特許請求の範囲］の記載と［発明を実施するための最良の形態］との対応関係を明らかにするために括弧付きで付加されたものである。ただし、それらの番号・符号を、［特許請求の範囲］に記載されている発明の技術的範囲の解釈に用いてはならない。 [Means for Solving the Problems] will be described below using the numbers and symbols used in [Best Mode for Carrying Out the Invention]. These numbers and symbols are added in parentheses in order to clarify the correspondence between the description of [Claims] and [Best Mode for Carrying Out the Invention]. However, these numbers and symbols should not be used for the interpretation of the technical scope of the invention described in [Claims].

本発明によるネットワーク・システムは、ネットワーク・データを中継するｎ個のネットワーク中継装置（１０１〜１０４）を有する。一のネットワーク中継装置（１０１）は、自装置内で監視情報を収集する装置内情報収集部（２０４）と、前記監視情報を蓄積する情報記録部（２０５）と、前記監視情報が、障害兆候を示したときに、他の［ｎ−１］個のネットワーク中継装置（１０２〜１０４）へ、当該障害兆候が検出された旨を通知する兆候通知処理部（２０３）とを具備する。前記他の［ｎ−１］個のネットワーク中継装置（１０２〜１０４）は、それぞれ、自装置内で監視情報を収集すると共に、前記一のネットワーク中継装置（１０１）から、障害兆候が検出された旨を通知されたときに、追加の監視情報を収集する装置内情報収集部と、前記監視情報及び前記追加の監視情報を蓄積する情報記録部とを具備する。 The network system according to the present invention includes n network relay apparatuses (101 to 104) that relay network data. One network relay device (101) includes an in-device information collecting unit (204) that collects monitoring information in the own device, an information recording unit (205) that accumulates the monitoring information, and the monitoring information includes a failure indication. , A sign notification processing unit (203) for notifying the other [n−1] network relay apparatuses (102 to 104) that the trouble sign has been detected. Each of the other [n−1] network relay devices (102 to 104) collects monitoring information within the device itself, and a failure sign is detected from the one network relay device (101). An in-device information collection unit that collects additional monitoring information when notified to the effect, and an information recording unit that accumulates the monitoring information and the additional monitoring information.

本発明によるネットワーク中継装置（１０１）においては、装置内情報収集部（２０４）は、自装置内で監視情報を収集する。情報記録部（２０５）は、前記監視情報を蓄積する。兆候通知処理部（２０３）は、前記監視情報が、障害兆候を示したときに、他のネットワーク中継装置（１０２〜１０４）へ、当該障害兆候が検出された旨を通知する。 In the network relay device (101) according to the present invention, the in-device information collection unit (204) collects monitoring information within the own device. The information recording unit (205) stores the monitoring information. When the monitoring information indicates a failure sign, the sign notification processing unit (203) notifies the other network relay devices (102 to 104) that the failure sign has been detected.

本発明によるネットワーク中継装置（１０１）においては、リスト（１０５）は、他のネットワーク中継装置（１０２〜１０４）と共有され、障害兆候の種別が定義される。兆候検出条件テーブル（２０６）は、前記種別ごとに障害兆候の検出条件を定義する。前記兆候通知処理部（２０３）は、前記兆候検出条件テーブル（２０６）に基づいて、いずれかの種別の障害兆候が検出されたときに、他のネットワーク中継装置（１０２〜１０４）へ、当該種別の障害兆候が検出された旨を通知する。 In the network relay device (101) according to the present invention, the list (105) is shared with other network relay devices (102 to 104), and the type of failure sign is defined. The sign detection condition table (206) defines fault sign detection conditions for each type. Based on the sign detection condition table (206), the sign notification processing unit (203) sends the type to the other network relay devices (102 to 104) when any kind of fault sign is detected. Notify that a failure sign has been detected.

本発明によるネットワーク中継装置（１０１）においては、前記兆候検出条件テーブル（２０６）は、前記種別ごとに障害兆候の収束条件を定義する。前記兆候通知処理部（２０３）は、前記検出された旨を通知した後に、前記兆候検出条件テーブル（２０６）に基づいて、当該種別の障害兆候が収束したときには、他のネットワーク中継装置（１０２〜１０４）へ、当該種別の障害兆候が収束した旨を通知する。 In the network relay device (101) according to the present invention, the sign detection condition table (206) defines a failure sign convergence condition for each type. After the notification of the detection, the sign notification processing unit (203), based on the sign detection condition table (206), when the failure sign of the type has converged, other network relay devices (102 to 102). 104) that the failure indication of the type has converged.

本発明によるネットワーク中継装置（１０１）においては、監視強化情報テーブル（２０７）は、前記兆候検出条件テーブル（２０６）に基づいて、いずれかの種別の障害兆候が検出されたときに、その種別の障害兆候に関連した追加の監視情報を索引するために参照される。前記装置内情報収集部（２０４）は、いずれかの種別の障害兆候が検出されたときには、当該監視強化情報テーブル（２０７）に基づいて、追加の監視情報を収集する。 In the network relay device (101) according to the present invention, the monitoring strengthening information table (207), when any type of failure sign is detected based on the sign detection condition table (206), Referenced to index additional monitoring information related to failure signs. The in-device information collection unit (204) collects additional monitoring information based on the enhanced monitoring information table (207) when any type of failure sign is detected.

本発明によるネットワーク中継装置（１０１）においては、前記兆候検出条件テーブル（２０６）は、前記種別ごとに障害兆候の収束条件を定義する。前記装置内情報収集部（２０４）は、いずれかの種別の障害兆候が検出された後に、前記兆候検出条件テーブル（２０６）に基づいて、当該種別の障害兆候が収束したときには、前記監視強化情報テーブル（２０７）に基づいて、追加の監視情報を収集することを停止する。 In the network relay device (101) according to the present invention, the sign detection condition table (206) defines a failure sign convergence condition for each type. The in-device information collection unit (204), when any type of failure sign has been detected and when the type of failure sign has converged based on the indication detection condition table (206), Collecting additional monitoring information is stopped based on the table (207).

本発明によるネットワーク中継装置（１０１）においては、装置内情報収集部（２０４）は、自装置内で監視情報を収集する。情報記録部（２０５）は、前記監視情報を蓄積する。前記装置内情報収集部（２０４）は、他のネットワーク中継装置（１０２〜１０４）から、障害兆候が検出された旨を通知されたときに、追加の監視情報を収集する。 In the network relay device (101) according to the present invention, the in-device information collection unit (204) collects monitoring information within the own device. The information recording unit (205) stores the monitoring information. The in-device information collection unit (204) collects additional monitoring information when notified from other network relay devices (102 to 104) that a failure sign has been detected.

本発明によるネットワーク中継装置（１０１）においては、リスト（１０５）は、他のネットワーク中継装置（１０２〜１０４）と共有され、障害兆候の種別が定義される。監視強化情報テーブル（２０７）は、他のネットワーク中継装置（１０２〜１０４）から、いずれかの種別の障害兆候が検出された旨を通知されたときに、その種別の障害兆候に関連した追加の監視情報を索引するために参照される。前記装置内情報収集部（２０４）は、他のネットワーク中継装置（１０２〜１０４）から、いずれかの種別の障害兆候が検出された旨を通知されたときには、前記監視強化情報テーブル（２０７）に基づいて、その種別の障害兆候に関連した追加の監視情報を収集する。 In the network relay device (101) according to the present invention, the list (105) is shared with other network relay devices (102 to 104), and the type of failure sign is defined. When the enhanced network information table (207) is notified from the other network relay devices (102 to 104) that a failure sign of any type has been detected, an additional information related to the failure sign of that type is added. Referenced to index monitoring information. When the in-device information collecting unit (204) is notified from the other network relay devices (102 to 104) that any type of failure sign has been detected, the in-device information collecting unit (204) stores the in-monitoring information table (207). Based on that, collect additional monitoring information related to that type of failure indication.

本発明によるネットワーク中継装置（１０１）においては、前記装置内情報収集部（２０４）は、他のネットワーク中継装置（１０２〜１０４）から、いずれかの種別の障害兆候が検出された旨を通知された場合において、その後、当該他のネットワーク中継装置から、前記種別の障害兆候が収束した旨を通知されたときには、前記監視強化情報テーブル（２０７）に基づいて、追加の監視情報を収集することを停止する。 In the network relay device (101) according to the present invention, the in-device information collection unit (204) is notified from the other network relay devices (102 to 104) that any type of failure sign has been detected. In this case, after that, when the other network relay device is notified that the failure sign of the type has converged, the additional monitoring information is collected based on the enhanced monitoring information table (207). Stop.

本発明によれば、一のネットワーク中継装置が障害兆候を検出したときに、他のネットワーク中継装置へ、監視強化を要請することができる。また、一のネットワーク中継装置が、他のネットワーク中継装置から、監視強化を要請されたときに、当該一のネットワーク中継装置において、監視強化を実施することができる。 According to the present invention, when one network relay device detects a failure sign, it is possible to request another network relay device to enhance monitoring. Further, when one network relay device is requested to enhance monitoring from another network relay device, the one network relay device can perform the monitoring enhancement.

本実施の形態では、ネットワークにおける障害発生の兆候を、ネットワークに繋がる複数の中継装置で検出し、その検出を契機に中継装置内部の情報収集範囲を拡張するための仕組みを提供する。第一に、ネットワークに繋がる複数の中継装置では、各中継装置において、自中継装置の監視情報を収集する機能を備える。第二に、各中継装置は、それぞれが障害兆候を検出する機能を有する。それぞれの中継装置が、任意の条件で障害兆候を検出できるようにする。第三に、障害兆候を検出した中継装置は、その障害兆候を他の中継装置へ通知する機能を具備する。このとき、ネットワークでは、複数の中継装置で同調して情報収集範囲を拡張することにより、障害発生前後における詳細な監視情報を収集する。 In the present embodiment, a mechanism for detecting a failure occurrence sign in a network with a plurality of relay apparatuses connected to the network and expanding the information collection range inside the relay apparatus in response to the detection is provided. First, a plurality of relay devices connected to the network have a function of collecting monitoring information of the own relay device in each relay device. Secondly, each relay device has a function of detecting a failure sign. Each relay apparatus can detect a failure sign under an arbitrary condition. Third, the relay device that has detected the failure sign has a function of notifying the other relay device of the failure sign. At this time, in the network, detailed monitoring information before and after the occurrence of the failure is collected by extending the information collection range in synchronization with a plurality of relay devices.

本実施の形態では、ネットワークにおける障害発生の兆候パターンを、複数の中継装置で共通に定義しておく。また、ネットワーク内共通の障害兆候パターンに対応させて、各中継装置では、監視情報とその閾値とをマッピングした条件（アサート条件と呼ぶ）を定義する。そして、アサート条件に当てはまる場合に、障害兆候の発生を検出したと判断する。一方、障害兆候が収束したと判断する閾値も定義し、同様にデアサート条件も定義する。このデアサート条件に当てはまる場合には、障害兆候が収束したと判断する。後述するように、障害兆候パターンと監視情報とのマッピングは１対１である必要はなく、一つの障害兆候につき、複数の監視情報をマッピングしても良い。この場合、複数条件を複合的に判断するために、個別の条件判定式を演算し、その演算結果を兆候検出・収束の判断に用いる。 In this embodiment, a failure occurrence pattern in a network is defined in common by a plurality of relay apparatuses. Further, in association with a common failure symptom pattern in the network, each relay apparatus defines a condition (referred to as an assert condition) in which the monitoring information and its threshold value are mapped. When the assert condition is satisfied, it is determined that the occurrence of the failure sign is detected. On the other hand, a threshold value for determining that the failure sign has converged is also defined, and similarly, a deassertion condition is defined. If this deassertion condition is met, it is determined that the failure sign has converged. As will be described later, the mapping between the failure sign pattern and the monitoring information does not have to be one-to-one, and a plurality of pieces of monitoring information may be mapped for one failure sign. In this case, in order to judge a plurality of conditions in a composite manner, individual condition judgment formulas are computed, and the computation results are used for sign detection / convergence judgment.

ある中継装置にて監視情報がアサート条件に当てはまった場合、その中継装置にて障害兆候検出を意味する「兆候通知パケット」を生成してネットワークへ向けて配信する。このとき、「兆候通知パケット」を受信する他の中継装置では、監視強化モードに移行して、監視情報の収集範囲を拡大する。その後、当該中継装置にて、監視情報がデアサート条件に当てはまった場合には、障害の兆候が収束したとして、障害兆候収束を意味する「兆候収束パケット」を生成して、ネットワークへ向けて配信する。このとき、「兆候収束パケット」を受信する他の中継装置では、通常監視モードに戻り、監視情報の収集範囲の拡大を解除する。 When the monitoring information satisfies the assert condition in a certain relay device, the relay device generates a “sign indication packet” that means failure sign detection and distributes it to the network. At this time, other relay apparatuses that receive the “indication notification packet” shift to the monitoring enhancement mode and expand the collection range of the monitoring information. After that, when the monitoring information matches the deassertion condition, the relay device generates a “signature convergence packet” that means convergence of the failure sign and distributes it to the network, assuming that the sign of the failure has converged. . At this time, other relay apparatuses that receive the “sign indication convergence packet” return to the normal monitoring mode and cancel the expansion of the monitoring information collection range.

自中継装置でアサート条件に当てはまった時、または他中継装置から「兆候通知パケット」を受信した時には、その障害兆候種別に応じて自中継装置の情報収集対象を追加し、監視を強化する。情報収集対象は、全てのネットワーク中継装置で共通にすることもできるが、個別に定義可能である。この情報収集を強化する動作は、自中継装置でアサート条件に当てはまった場合には、その後に自中継装置において、デアサート条件に当てはまるまで続くものとする。また、他中継装置から「兆候通知パケット」を受信した場合には、当該他中継装置から「兆候収束パケット」を受信するまで続けられるものとする。障害兆候の発生から収束まで、取得強化された監視情報は、複数の中継装置において、それぞれ個別に蓄積される。蓄積された監視情報は、障害の原因究明調査において利用することができる。 When the own relay device satisfies the assert condition or receives a “sign indication packet” from another relay device, the information collection target of the own relay device is added according to the failure symptom type to enhance monitoring. The information collection target can be common to all network relay apparatuses, but can be defined individually. If the assertion condition is met at the own relay apparatus, the operation for enhancing the information collection is continued until the deassertion condition is met at the own relay apparatus. In addition, when a “sign indication packet” is received from another relay device, it is continued until a “sign indication packet” is received from the other relay device. From the occurrence of a failure sign to convergence, the acquired and enhanced monitoring information is individually stored in each of the plurality of relay apparatuses. The accumulated monitoring information can be used in the investigation of the cause of the failure.

次に、図面を参照して、本発明の実施の形態について詳細に説明する。図１を参照すると、本発明の実施の形態は、一つのネットワーク中継装置１０１に、他のネットワーク中継装置１０２〜１０４が接続されたネットワーク構成になっている。図示するように、ネットワーク中継装置１０１は、他のネットワーク中継装置１０２〜１０４と共に、障害兆候共通定義リスト１０５を共有する。ネットワーク中継装置１０１〜１０４は、ネットワークデータの中継を行う装置で、受信データに記録された送信先／送信元を識別する情報を元に、ネットワークデータを中継する。障害兆候定義リスト１０５は、ネットワーク中継装置１０１〜１０４で発生し得る障害事象をリストにしたものである。 Next, embodiments of the present invention will be described in detail with reference to the drawings. Referring to FIG. 1, the embodiment of the present invention has a network configuration in which one network relay device 101 is connected to other network relay devices 102 to 104. As illustrated, the network relay device 101 shares the failure symptom common definition list 105 with the other network relay devices 102 to 104. The network relay devices 101 to 104 are devices that relay network data, and relay network data based on information for identifying a transmission destination / transmission source recorded in received data. The failure sign definition list 105 is a list of failure events that can occur in the network relay devices 101 to 104.

図２に、障害兆候共通定義リスト１０５の説明図を示す。図示する障害兆候共通定義リスト１０５は、障害事象に一意の識別子を付与したリストであり、ネットワーク中継装置１０１〜１０４で共有される。図２において、［兆候１］に対しては障害兆候識別子”１”が付与されており、［兆候２］に対しては障害兆候識別子”２”が、［兆候３］に対しては障害兆候識別子”３”が、［兆候４］に対しては障害兆候識別子”４”が、それぞれ付与されている。 FIG. 2 is an explanatory diagram of the failure sign common definition list 105. The failure sign common definition list 105 shown is a list in which a unique identifier is assigned to a failure event, and is shared by the network relay devices 101 to 104. In FIG. 2, a failure sign identifier “1” is assigned to [indication 1], a failure sign identifier “2” is assigned to [sign 2], and a failure sign is assigned to [sign 3]. The identifier “3” is assigned with the failure sign identifier “4” for [sign 4].

図３に、ネットワーク中継装置１０１のブロック説明図を示す。ここでは、図１に示した四つのネットワーク中継装置１０１〜１０４のうち、ネットワーク中継装置１０１のみを図示するが、他の三つのネットワーク中継装置１０２〜１０４も、ネットワーク中継装置１０１と同様のブロック構成を有している。ただし、パケットスイッチ処理部の性能や、ポートの数などは、ネットワーク中継装置１０１〜１０４間で全く同一では無い。図３に示すように、ネットワーク中継装置１０１は、パケットスイッチ処理部２０２と、兆候通知検出／発行処理部２０３と、装置内情報収集部２０４と、情報記録部２０５とを有している。 FIG. 3 shows a block explanatory diagram of the network relay device 101. Here, only the network relay device 101 is illustrated among the four network relay devices 101 to 104 illustrated in FIG. 1, but the other three network relay devices 102 to 104 are configured in the same block configuration as the network relay device 101. have. However, the performance of the packet switch processing unit, the number of ports, and the like are not exactly the same between the network relay apparatuses 101 to 104. As illustrated in FIG. 3, the network relay device 101 includes a packet switch processing unit 202, a sign notification detection / issue processing unit 203, an in-device information collection unit 204, and an information recording unit 205.

装置内情報収集部２０４は、パケットスイッチ処理部２０２の情報レジスタ２０９に格納された統計情報や装置情報を収集する。そして、これらの監視情報を、情報記録部２０５に記録する。また、これらの監視情報を、情報記録部２０５から読み出すことができる。収集可能な監視情報は監視情報定義テーブル２０８に定義され、定常的に監視をする監視情報の識別番号も同テーブル２０８に定義される。また、装置内情報収集部２０４は、兆候検出条件テーブル２０６に定義されるアサート条件に一致した場合、兆候通知検出／発行処理部２０３に対して、障害兆候を検出したとして「兆候検出通知」を発行する。その後、兆候検出条件テーブル２０６に定義されるデアサート条件に一致した場合には、装置内情報収集部２０４は、兆候通知検出／発行処理部２０３に対して、兆候が収束したとして「兆候収束通知」を発行する。 The in-device information collection unit 204 collects statistical information and device information stored in the information register 209 of the packet switch processing unit 202. Then, the monitoring information is recorded in the information recording unit 205. Further, the monitoring information can be read from the information recording unit 205. The monitoring information that can be collected is defined in the monitoring information definition table 208, and the identification number of the monitoring information that is constantly monitored is also defined in the table 208. Further, if the in-device information collection unit 204 matches the assert condition defined in the sign detection condition table 206, the in-device information collection unit 204 notifies the sign notification detection / issuance processing unit 203 that a sign of failure has been detected. Issue. Thereafter, when the deassertion condition defined in the sign detection condition table 206 is met, the in-device information collection unit 204 determines that the sign has converged to the sign notification detection / issue processing unit 203 as “sign sign convergence notification”. Is issued.

兆候通知検出／発行処理部２０３は、装置内情報収集部２０４から「兆候検出通知」を受信した場合には、「兆候通知パケット」を生成し、この「兆候通知パケット」をパケットスイッチ処理部２０２へ送出する。このとき、パケットスイッチ処理部２０２は、他の中継装置１０２〜１０４へ「兆候通知パケット」を配信する。一方、兆候通知検出／発行処理部２０３は、装置内情報収集部２０４から「兆候収束通知」を受信した場合には、「兆候収束パケット」を生成し、この「兆候収束パケット」をパケットスイッチ処理部２０２へ送出する。このとき、パケットスイッチ処理部２０２は、他の中継装置１０２〜１０４へ「兆候収束パケット」を配信する。 When receiving the “sign indication detection notification” from the in-device information collection unit 204, the indication notification detection / issue processing unit 203 generates an “indication notification packet” and transmits the “indication notification packet” to the packet switch processing unit 202. To send. At this time, the packet switch processing unit 202 distributes the “sign indication packet” to the other relay apparatuses 102 to 104. On the other hand, when receiving the “signature convergence notification” from the in-device information collection unit 204, the sign notification detection / issuance processing unit 203 generates a “signature convergence packet” and performs packet switch processing on the “signature convergence packet”. To the unit 202. At this time, the packet switch processing unit 202 distributes the “signature convergence packet” to the other relay apparatuses 102 to 104.

パケットスイッチ処理部２０２は、ネットワーク中継装置１０１のポート２１０から入力されるネットワークデータを解析し、送信先のポート２１０を特定し、そのポート２１０へネットワークデータを出力する。また、パケットスイッチ処理部２０２は、ポート２１０から入力されるネットワークデータが、「兆候通知パケット」であるか否かも解析する。パケットスイッチ処理部２０２は、「兆候通知パケット」に該当するネットワークデータを発見すると、この「兆候通知パケット」を、兆候通知検出／発行処理部２０３へ送出する。兆候通知検出／発行処理部２０３は、「兆候通知パケット」に記述された兆候ＩＤ及び送信元を示す識別情報を、装置内情報収集部２０４に渡す。装置内情報収集部２０４は、監視強化情報テーブル２０７及び監視情報定義テーブル２０８を参照して、監視情報の収集範囲を拡大する。 The packet switch processing unit 202 analyzes the network data input from the port 210 of the network relay apparatus 101, specifies the destination port 210, and outputs the network data to the port 210. The packet switch processing unit 202 also analyzes whether the network data input from the port 210 is a “sign indication packet”. When the packet switch processing unit 202 finds network data corresponding to the “sign indication packet”, the packet switch processing unit 202 sends the “sign indication packet” to the indication notification detection / issue processing unit 203. The sign notification detection / issuance processing unit 203 passes the sign ID described in the “sign sign notification packet” and the identification information indicating the transmission source to the in-device information collection unit 204. The in-device information collection unit 204 refers to the monitoring enhancement information table 207 and the monitoring information definition table 208 to expand the monitoring information collection range.

その後、「兆候通知パケット」を配信したネットワーク中継装置から「兆候収束パケット」が配信された場合には、パケットスイッチ処理部２０２にて、「兆候収束パケット」が発見される。この「兆候収束パケット」は、兆候通知検出／発行処理部２０３へ送出される。兆候通知検出／発行処理部２０３は、「兆候収束パケット」に記述された兆候ＩＤ及び送信元を示す識別情報を、装置内情報収集部２０４に渡す。装置内情報収集部２０４は、監視強化情報テーブル２０７を参照して、該当する兆候識別子に対応する監視強化情報の取得処理を停止する。 Thereafter, when the “signature convergence packet” is delivered from the network relay device that delivered the “signature notification packet”, the “signature convergence packet” is found by the packet switch processing unit 202. This “signature convergence packet” is sent to the sign notification detection / issue processor 203. The sign notification detection / issue processing unit 203 passes the sign ID described in the “sign sign convergence packet” and the identification information indicating the transmission source to the in-device information collection unit 204. The in-device information collection unit 204 refers to the monitoring enhancement information table 207 and stops the acquisition processing of the monitoring enhancement information corresponding to the corresponding sign identifier.

図４を参照すると、ネットワーク中継装置１０１が有する兆候検出条件テーブル２０６の詳細説明図が示されている。図４において、兆候検出条件テーブル２０６は、ネットワーク中継装置１０１による障害兆候を検出する条件（アサート条件）および障害の兆候が収束したものと判断する条件（デアサート条件）を定義する情報を保持している。図中、兆候ＩＤの欄は、障害兆候共通定義リスト１０５で定義された障害兆候の識別子を示している。監視情報ＩＤの欄は、その障害兆候に関連する監視情報の識別子を示している。判定閾値の欄は、その監視情報がアサート条件に当てはまる閾値とデアサート条件に当てはまる閾値とを示している。検出判定演算式の欄は、複数種類の監視情報を用いて、アサート条件とデアサート条件とを判定する場合の論理式を示している。 Referring to FIG. 4, a detailed explanatory diagram of the sign detection condition table 206 included in the network relay device 101 is shown. In FIG. 4, the sign detection condition table 206 holds information defining conditions for detecting a failure sign by the network relay device 101 (assertion condition) and conditions for determining that the fault sign has converged (deassertion condition). Yes. In the figure, the sign ID column indicates the fault sign identifier defined in the common fault sign definition list 105. The column of the monitoring information ID indicates an identifier of monitoring information related to the failure sign. The determination threshold value column indicates a threshold value that the monitoring information applies to the assertion condition and a threshold value that applies to the deassertion condition. The column of the detection determination arithmetic expression indicates a logical expression in the case of determining the assert condition and the deassert condition using a plurality of types of monitoring information.

図５を参照すると、ネットワーク中継装置１０１が有する監視強化情報テーブル２０７の詳細説明図が示されている。図５において、監視強化情報テーブル２０７は、アサート条件が成立したときに追加取得する監視情報に関する情報を保持している。図中、兆候ＩＤの欄は、障害兆候共通定義リスト１０５で定義された障害兆候の識別子を示している。また、監視情報ＩＤの欄は、ネットワーク中継装置１０１にて障害兆候を検出したとき、又は、他のネットワーク中継装置１０２〜１０４から「兆候通知パケット」を受信したときに、追加取得する監視情報の識別子を示している。 Referring to FIG. 5, a detailed explanatory diagram of the monitoring enhancement information table 207 included in the network relay apparatus 101 is shown. In FIG. 5, the monitoring enhancement information table 207 holds information related to monitoring information that is additionally acquired when the assert condition is satisfied. In the figure, the sign ID column indicates the fault sign identifier defined in the common fault sign definition list 105. The column of the monitoring information ID is a field for monitoring information to be additionally acquired when a failure symptom is detected by the network relay device 101 or when a “sign indication packet” is received from another network relay device 102 to 104. Indicates an identifier.

図６を参照すると、ネットワーク中継装置１０１が有する監視情報定義テーブル２０８の詳細説明図が示されている。図６において、監視情報定義テーブル２０８は、ネットワーク中継装置１０１の装置内情報収集部２０４で収集可能な監視情報群を定義している。図中、監視情報ＩＤの欄は、収集可能な監視情報の識別子を示している。監視情報の欄は、収集可能な監視情報の内容を示している。定常監視フラグの欄は、その監視情報が、定常運用時に収集するものか、監視強化時に収集するものかを示している。図６の監視情報定義テーブル２０８を参照すると、監視情報ＩＤ＝”１”〜”５”で示される［監視情報１］〜［監視情報５］では、定常監視フラグが、”Ｖ（Ｖａｌｉｄ）”と表示されている。従って、これらの監視情報は、常に収集される。一方、監視情報ＩＤ＝”６”〜”９”で示される［監視情報６］〜［監視情報９］では、定常監視フラグが、”ＯＦＦ”と表示されている。従って、これらの監視情報は、監視強化モード時にのみ収集され得る。 Referring to FIG. 6, a detailed explanatory diagram of the monitoring information definition table 208 included in the network relay device 101 is shown. In FIG. 6, a monitoring information definition table 208 defines monitoring information groups that can be collected by the in-device information collection unit 204 of the network relay device 101. In the figure, the monitoring information ID column indicates identifiers of monitoring information that can be collected. The monitoring information column shows the contents of monitoring information that can be collected. The column of the steady monitoring flag indicates whether the monitoring information is collected during steady operation or collected during monitoring enhancement. Referring to the monitoring information definition table 208 in FIG. 6, in [monitoring information 1] to [monitoring information 5] indicated by the monitoring information ID = “1” to “5”, the steady monitoring flag is “V (Valid)”. Is displayed. Therefore, such monitoring information is always collected. On the other hand, in [monitoring information 6] to [monitoring information 9] indicated by the monitoring information ID = “6” to “9”, the steady monitoring flag is displayed as “OFF”. Therefore, these pieces of monitoring information can be collected only in the monitoring enhancement mode.

次に、図７〜１１のフローチャートを参照して、本実施の形態におけるネットワーク中継装置１０１の動作について詳細に説明する。まず、図７の初期設定処理について説明する。図７における初期設定では、まず、監視情報定義テーブル２０８を設定する（ステップＡ１）。ここでは、ネットワーク中継装置１０１で収集可能な監視情報のリストを生成し、各監視情報に対して一意の識別子を付与する。また、監視情報の中で、常時監視の対象とするものに対しては、定常監視フラグを立てる。定常監視フラグが立った監視情報は、障害兆候共通定義リスト１０５で定義される兆候を検出するために利用することができる。 Next, the operation of the network relay device 101 in the present embodiment will be described in detail with reference to the flowcharts of FIGS. First, the initial setting process of FIG. 7 will be described. In the initial setting in FIG. 7, first, the monitoring information definition table 208 is set (step A1). Here, a list of monitoring information that can be collected by the network relay apparatus 101 is generated, and a unique identifier is assigned to each monitoring information. In addition, a steady monitoring flag is set for monitoring information that is to be constantly monitored. The monitoring information in which the steady monitoring flag is set can be used to detect the signs defined in the failure sign common definition list 105.

次に、兆候検出条件テーブル２０６を設定する（ステップＡ２）。障害兆候共通定義リスト１０５で定義された兆候に対応させて、判定閾値及び演算式を作成する。判定閾値では、監視情報定義テーブル２０８で、定常監視フラグが立っている監視情報の中から当該障害事象に関連する情報を結びつけ、さらにその監視情報のアサート条件／デアサート条件の閾値を設定する。図４の兆候検出条件テーブル２０６では、兆候ＩＤ”１”の障害兆候検出に、［監視情報１］と［監視情報３］とが複合的に用いられている。［監視情報１］については、［監視情報１］が２０回／秒以上になるとアサート条件が成立し、［監視情報１］が１０回／秒以下になるとデアサート条件が成立する。また、［監視情報３］については、［監視情報３］が４０個以上になるとアサート条件が成立し、［監視情報３］が２５個以下になるとデアサート条件が成立する。兆候ＩＤ”２”の障害兆候検出には、［監視情報２］と［監視情報３］とが複合的に用いられている。［監視情報２］については、［監視情報２］が８Ｋパケット以上になるとアサート条件が成立し、［監視情報２］が６Ｋパケット以下になるとデアサート条件が成立する。また、［監視情報３］については、［監視情報３］が１０００個以上になるとアサート条件が成立し、［監視情報３］が４００個以下になるとデアサート条件が成立する。兆候ＩＤ”３”の障害兆候検出には、［監視情報１］と［監視情報４］とが複合的に用いられている。［監視情報１］については、［監視情報１］が５０回／秒以上になるとアサート条件が成立し、［監視情報１］が２０回／秒以下になるとデアサート条件が成立する。また、［監視情報４］については、［監視情報４］が”ｔｒｕｅ”になるとアサート条件が成立し、［監視情報４］が”ｆａｌｓｅ”になるとデアサート条件が成立する。兆候ＩＤ”４”の障害兆候検出には、［監視情報５］のみが用いられている。［監視情報５］が１０００フレーム／秒以上になるとアサート条件が成立し、［監視情報５］が５００フレーム／秒以下になるとデアサート条件が成立する。 Next, the sign detection condition table 206 is set (step A2). A determination threshold value and an arithmetic expression are created in association with the signs defined in the failure sign common definition list 105. In the determination threshold value, information related to the failure event is linked from the monitoring information in which the steady monitoring flag is set in the monitoring information definition table 208, and the threshold value of the assertion condition / deassertion condition of the monitoring information is set. In the sign detection condition table 206 of FIG. 4, [monitoring information 1] and [monitoring information 3] are used in combination for detecting the fault sign of the sign ID “1”. As for [monitoring information 1], the assertion condition is satisfied when [monitoring information 1] is 20 times / second or more, and the deassertion condition is satisfied when [monitoring information 1] is 10 times / second or less. As for [monitoring information 3], the assert condition is satisfied when [monitor information 3] is 40 or more, and the deassert condition is satisfied when [monitor information 3] is 25 or less. [Monitoring information 2] and [Monitoring information 3] are used in combination for detection of a fault sign of the sign ID “2”. As for [monitoring information 2], the assertion condition is satisfied when [monitoring information 2] is 8K packets or more, and the deassertion condition is satisfied when [monitoring information 2] is 6K packets or less. As for [monitoring information 3], the assert condition is satisfied when [monitor information 3] is 1000 or more, and the deassert condition is satisfied when [monitor information 3] is 400 or less. [Monitoring information 1] and [Monitoring information 4] are used in combination for the detection of the fault sign of the sign ID “3”. As for [monitoring information 1], the assertion condition is satisfied when [monitoring information 1] is 50 times / second or more, and the deassertion condition is satisfied when [monitoring information 1] is 20 times / second or less. As for [monitoring information 4], when [monitoring information 4] becomes “true”, the assertion condition is satisfied, and when [monitoring information 4] becomes “false”, the deassertion condition is satisfied. Only the [monitoring information 5] is used for detecting the failure sign of the sign ID “4”. The assert condition is satisfied when [monitor information 5] is 1000 frames / second or more, and the deassert condition is satisfied when [monitor information 5] is 500 frames / second or less.

兆候検出条件テーブル２０６の設定において、一つの障害事象に対して、複数の監視情報を結びつける場合には、複合的な判断を行うために、判定演算式を定義する。図４の兆候検出条件テーブル２０６では、兆候ＩＤ”１”のアサート条件を、［監視情報１］のアサート条件が成立し、かつ、［監視情報３］のアサート条件が成立したときとしている。また、兆候ＩＤ”２”のアサート条件は、［監視情報２］のアサート条件、または、［監視情報３］のアサート条件のいずれかが成立したときとしている。また、兆候ＩＤ”３”のアサート条件は、［監視情報１］のアサート条件が成立し、かつ、［監視情報４］のアサート条件が成立したときとしている。兆候ＩＤ”４”では論理式が規定されていない。これは、障害兆候パターンと監視情報とのマッピングが１対１であるためである。 In the setting of the sign detection condition table 206, when a plurality of pieces of monitoring information are linked to one failure event, a determination calculation formula is defined in order to make a composite determination. In the sign detection condition table 206 of FIG. 4, the assertion condition of the sign ID “1” is set when the assertion condition of [monitoring information 1] is satisfied and the assertion condition of [monitoring information 3] is satisfied. The assertion condition of the sign ID “2” is set when either the assertion condition of [monitoring information 2] or the assertion condition of [monitoring information 3] is satisfied. The assertion condition of the sign ID “3” is set when the assertion condition of [monitoring information 1] is satisfied and the assertion condition of [monitoring information 4] is satisfied. No logical expression is defined for the sign ID “4”. This is because there is a one-to-one mapping between the failure sign pattern and the monitoring information.

最後に、監視情報強化テーブル２０７を設定する（ステップＡ３）。ここでは、障害兆候を検出した際に追加収集する監視情報を設定する。監視情報定義テーブル２０８の情報群の中から、定常監視フラグが立っている識別子を有する監視情報以外の監視情報の中から、追加収集する監視情報を選択する。選択した監視情報を、それぞれの兆候事象に対して設定する。図５の監視情報強化テーブル２０７では、兆候ＩＤ”１”に対して、［監視情報６］が設定されている。また、兆候ＩＤ”２”に対しては、［監視情報７］が、兆候ＩＤ”３”に対しては、［監視情報８］と［監視情報９］とが、兆候ＩＤ”４”に対しては、［監視情報６］と［監視情報９］とが設定されている。 Finally, the monitoring information strengthening table 207 is set (step A3). Here, monitoring information to be additionally collected when a failure sign is detected is set. From the information group of the monitoring information definition table 208, monitoring information to be additionally collected is selected from the monitoring information other than the monitoring information having the identifier for which the steady monitoring flag is set. Set the selected monitoring information for each symptom event. In the monitoring information strengthening table 207 of FIG. 5, [monitoring information 6] is set for the sign ID “1”. For the sign ID “2”, the “monitoring information 7” is for the sign ID “3”, and for the sign ID “3”, the “monitoring information 8” and the “monitoring information 9” are for the sign ID “4”. In this case, [monitoring information 6] and [monitoring information 9] are set.

ネットワーク中継装置１０１は、他のネットワーク中継装置１０２〜１０４と、兆候ＩＤを用いて障害兆候の発生と収束とを連絡し合う仕組みになっている。従って、ネットワーク中継装置１０１が有するテーブル２０６〜２０８の内容と、他のネットワーク中継装置１０２〜１０４が有する兆候検出条件テーブル、監視強化情報テーブル、及び、監視情報定義テーブルの内容とは、同一であるとは限らない。例えば、ネットワーク中継装置１０１が有する兆候検出条件テーブル２０６の内容と、他のネットワーク中継装置１０２〜１０４が有する兆候検出条件テーブルの内容とを比較すると、兆候ＩＤの欄は一致するが、他の監視情報ＩＤの欄、判定閾値の欄、及び、検出判定演算式の欄は必ずしも一致しない。同様に、ネットワーク中継装置１０１が有する監視強化情報テーブル２０７の内容と、他のネットワーク中継装置１０２〜１０４が有する監視強化情報テーブルの内容とを比較すると、兆候ＩＤの欄は一致するが、監視情報ＩＤの欄は必ずしも一致しない。さらに、監視情報定義テーブルの内容は、ネットワーク中継装置の機種・性能等に依存するので、ネットワーク中継装置間で大きく異なる場合もあり得る。 The network relay device 101 is configured to communicate with the other network relay devices 102 to 104 about the occurrence and convergence of the failure sign using the sign ID. Therefore, the contents of the tables 206 to 208 included in the network relay apparatus 101 are the same as the contents of the sign detection condition table, the monitoring enhancement information table, and the monitoring information definition table included in the other network relay apparatuses 102 to 104. Not necessarily. For example, when the contents of the sign detection condition table 206 included in the network relay apparatus 101 and the contents of the sign detection condition table included in the other network relay apparatuses 102 to 104 are compared, the signs ID column matches, but other monitoring The information ID column, the determination threshold value column, and the detection determination calculation formula column do not necessarily match. Similarly, when the contents of the enhanced monitoring information table 207 of the network relay apparatus 101 and the contents of the enhanced monitoring information tables of the other network relay apparatuses 102 to 104 are compared, the signs ID column matches, but the monitoring information The ID column does not necessarily match. Furthermore, since the contents of the monitoring information definition table depend on the model, performance, etc. of the network relay device, there may be a great difference between the network relay devices.

つぎに、図８の定常監視処理について説明する。まず、監視情報定義テーブル２０８での定常監視フラグが立っている種別の監視情報を取得する（ステップＢ１）。そして、この監視情報が兆候検出条件テーブル２０６のアサート条件に一致するかを演算する（ステップＢ２）。ここでアサート条件に一致しない場合はステップＢ１の処理に戻り、引き続き定常監視情報を取得する。アサート条件に一致した場合は、自中継装置にて障害の兆候が発生したことを意味する通知をネットワーク全体に配信するために通知処理を実施する（ステップＢ３）。そして、通知処理フローに処理が渡る。 Next, the steady monitoring process of FIG. 8 will be described. First, the type of monitoring information for which the regular monitoring flag is set in the monitoring information definition table 208 is acquired (step B1). Then, it is calculated whether this monitoring information matches the assertion condition of the sign detection condition table 206 (step B2). If the assert condition is not met, the process returns to step B1, and the steady monitoring information is continuously acquired. When the assert condition is met, notification processing is performed in order to deliver a notification indicating that a failure sign has occurred in the own relay apparatus to the entire network (step B3). Then, the process passes to the notification process flow.

図９に、通知処理のフローチャートを示す。通知処理では、まずパケットデータを生成する（ステップＣ１）。パケットデータには、自中継装置の識別情報、および障害兆候共通定義リスト１０５に定義された障害兆候の識別子が含まれる。そして、生成されたパケットデータをネットワークに向けて配信する（ステップＣ２）。発信先は同報でも個別通報でもよく、個別通報の場合は送信先リストを別途生成しておき、このリストに基づいて送信することでもよい。通知処理を完了した後、若しくは、通知処理と平行して、ネットワーク中継装置１０１にて、監視強化処理を開始する（ステップＢ５）。 FIG. 9 shows a flowchart of the notification process. In the notification process, packet data is first generated (step C1). The packet data includes the identification information of the own relay device and the failure symptom identifier defined in the failure symptom common definition list 105. Then, the generated packet data is distributed toward the network (step C2). The transmission destination may be broadcast or individual notification. In the case of individual notification, a transmission destination list may be separately generated and transmitted based on this list. After completing the notification process or in parallel with the notification process, the network relay apparatus 101 starts the monitoring enhancement process (step B5).

図１０に、監視強化処理のフローチャートを示す。監視強化処理では、監視情報定義テーブル２０８に定義された定常監視情報に加えて、監視強化情報テーブル２０７に定義された追加取得監視情報を取得し、情報記録部２０５に蓄積をする（ステップＤ１）。そして、兆候検出条件テーブル２０６に定義されているデアサート条件に一致するか否かを判断する（ステップＤ２）。一致しない場合、障害兆候が収束していないものとして、引き続き監視強化状態におかれ、ステップＤ１の処理に戻される。デアサート条件に一致した場合には、自中継装置での障害の兆候が収束したとして、通知処理を実施する（ステップＤ３）。この通知処理は、図９に示したステップＣ１〜Ｃ２の処理と同様で、兆候収束の情報を埋め込み配信する。通知処理を実施すると同時に、ネットワーク中継装置１０１は、追加取得監視情報の取得を停止する。 FIG. 10 shows a flowchart of the monitoring enhancement process. In the monitoring enhancement process, in addition to the steady monitoring information defined in the monitoring information definition table 208, additional acquisition monitoring information defined in the monitoring enhancement information table 207 is obtained and stored in the information recording unit 205 (step D1). . Then, it is determined whether or not the deassertion condition defined in the sign detection condition table 206 is met (step D2). If they do not coincide with each other, it is assumed that the failure sign has not converged, and the monitoring is continued to be enhanced, and the process returns to step D1. If the deasserting condition is met, the notification process is performed assuming that the failure sign in the own relay apparatus has converged (step D3). This notification process is the same as the process of steps C1 and C2 shown in FIG. 9, and signs convergence information is embedded and distributed. At the same time as performing the notification process, the network relay apparatus 101 stops acquiring the additional acquisition monitoring information.

図８において、監視強化処理（ステップＢ４）を抜けると、定常監視を終了させるか否かを受け付ける（ステップＢ５）。継続して監視するのであればステップＢ１に戻り定常監視を続行する。定常監視処理を終了するのであれば、定常監視処理を停止する。 In FIG. 8, after exiting the monitoring enhancement process (step B4), it is accepted whether or not the regular monitoring is terminated (step B5). If the monitoring is to be continued, the process returns to step B1 to continue the steady monitoring. If the steady monitoring process is to be terminated, the steady monitoring process is stopped.

なお、他中継装置１０２〜１０４で障害兆候の発生／収束を検出した場合には、他中継装置１０２〜１０４から兆候通知及び収束通知が発行される。このとき、ネットワーク中継装置１０１では、兆候通知受信処理を実行する。図１１に、兆候通知受信処理のフローチャートを示す。図１１のフローチャートは、兆候通知の受信によって開始される（ステップＥ１）。続いて、受信した兆候通知を解析する。監視強化情報テーブル２０７を参照して、兆候通知に含まれる兆候の”ＩＤ”から、追加取得する監視情報の”ＩＤ”を検索する。監視情報の”ＩＤ”を検索したら、監視情報定義テーブル２０８を参照して、当該監視情報の”ＩＤ”から、追加取得する監視情報の内容を検索する。追加取得する監視情報の内容が判明したら、ネットワーク中継装置１０１は、当該監視情報を追加取得し、蓄積する（ステップＥ２）。その後、兆候通知を発した中継装置から収束通知が配信されたかを調べる（ステップＥ３）。収束通知を受信していなければ、ステップＥ２に戻り、監視情報の追加取得を継続する。収束通知を受信したら、ステップＥ２で実施していた監視情報の追加取得／蓄積の処理を終了する（ステップＥ４）。 When the other relay apparatuses 102 to 104 detect the occurrence / convergence of the failure sign, the other relay apparatuses 102 to 104 issue a sign notification and a convergence notification. At this time, the network relay device 101 executes a symptom notification reception process. FIG. 11 shows a flowchart of the sign notification receiving process. The flowchart of FIG. 11 is started upon reception of a sign notification (step E1). Subsequently, the received sign notification is analyzed. With reference to the monitoring enhancement information table 207, the “ID” of the monitoring information to be additionally acquired is searched from the “ID” of the indication included in the indication of the indication. When the “ID” of the monitoring information is searched, the monitoring information definition table 208 is referred to and the contents of the monitoring information to be additionally acquired are searched from the “ID” of the monitoring information. When the contents of the additionally acquired monitoring information are found, the network relay apparatus 101 additionally acquires and accumulates the monitoring information (step E2). Thereafter, it is checked whether or not a convergence notification is delivered from the relay device that has issued the sign notification (step E3). If the convergence notification has not been received, the process returns to step E2 and continues to acquire additional monitoring information. When the convergence notification is received, the monitoring information additional acquisition / accumulation process performed in step E2 is terminated (step E4).

各ネットワーク中継装置１０１〜１０４が有する兆候検出条件テーブル、監視強化情報テーブル、及び、監視情報定義テーブルに対して、一元的に、それらの内容を書き込むセットアップ端末を接続することができる。この場合には、セットアップ端末にのみ障害兆候共通定義リスト１０５を保持させることにより、ネットワーク中継装置１０１〜１０４の構成要素から障害兆候共通定義リスト１０５を省略することができる。 A setup terminal for writing the contents thereof can be centrally connected to the sign detection condition table, the monitoring enhancement information table, and the monitoring information definition table included in each of the network relay devices 101 to 104. In this case, the failure symptom common definition list 105 can be omitted from the components of the network relay devices 101 to 104 by holding the failure symptom common definition list 105 only in the setup terminal.

以上説明したように、本実施の形態によれば、ネットワーク中継装置で、共通の障害兆候を定義しているため、当該兆候パターンに関連させて、ネットワーク中継装置の特徴に合わせた監視情報の収集が可能になる。また、一部のネットワーク中継装置で検出された障害兆候を、ネットワーク全体に通報することができるので、ほぼ同時に多くのネットワーク中継装置で、同調して監視情報の収集範囲を拡大することができる。 As described above, according to the present embodiment, since a common failure sign is defined in the network relay device, collection of monitoring information according to the characteristics of the network relay device in relation to the sign pattern Is possible. In addition, since a failure sign detected by some network relay devices can be reported to the entire network, a large number of network relay devices can simultaneously synchronize and expand the collection range of monitoring information.

図１は、本発明の実施の形態におけるネットワーク構成を示す図である。FIG. 1 is a diagram showing a network configuration in the embodiment of the present invention. 図２は、障害兆候共通定義リスト１０５の説明図である。FIG. 2 is an explanatory diagram of the failure sign common definition list 105. 図３は、ネットワーク中継装置１０１のブロック説明図である。FIG. 3 is a block explanatory diagram of the network relay apparatus 101. 図４は、兆候検出条件テーブル２０６の詳細説明図である。FIG. 4 is a detailed explanatory diagram of the sign detection condition table 206. 図５は、監視強化情報テーブル２０７の詳細説明図である。FIG. 5 is a detailed explanatory diagram of the monitoring enhancement information table 207. 図６は、監視情報定義テーブル２０８の詳細説明図である。FIG. 6 is a detailed explanatory diagram of the monitoring information definition table 208. 図７は、初期設定処理のフローチャートである。FIG. 7 is a flowchart of the initial setting process. 図８は、定常監視処理のフローチャートである。FIG. 8 is a flowchart of the steady monitoring process. 図９は、通知処理のフローチャートである。FIG. 9 is a flowchart of the notification process. 図１０は、監視強化処理のフローチャートである。FIG. 10 is a flowchart of the monitoring enhancement process. 図１１は、兆候通知受信処理のフローチャートである。FIG. 11 is a flowchart of the sign notification receiving process.

Explanation of symbols

１０１〜１０４ネットワーク中継装置
１０５障害兆候共通定義リスト
２０２パケットスイッチ処理部
２０３兆候通知検出／発行処理部
２０４装置内情報収集部
２０５情報記録部
２０６兆候検出条件テーブル
２０７監視強化情報テーブル
２０８監視情報定義テーブル
２０９情報レジスタ
２１０ポート 101-104 Network relay device 105 Fault indication common definition list 202 Packet switch processing unit 203 Indication notification detection / issue processing unit 204 In-device information collection unit 205 Information recording unit 206 Indication detection condition table 207 Monitoring enhancement information table 208 Monitoring information definition table 209 Information register 210 port

Claims

A network system having n network relay devices for relaying network data,
One network relay device
In-device information collection unit that collects monitoring information in its own device;
An information recording unit for storing the monitoring information;
When the monitoring information indicates a failure sign, a sign notification processing unit for notifying that the failure sign is detected to the other [n−1] network relay devices,
Each of the other [n−1] network relay devices is
In-device information collection unit that collects monitoring information in the device itself, and collects additional monitoring information when notified from the one network relay device that a failure sign is detected;
A network system comprising: an information recording unit for storing the monitoring information and the additional monitoring information.

In-device information collection unit that collects monitoring information in its own device;
An information recording unit for storing the monitoring information;
When the monitoring information indicates a failure sign, a sign notification processing unit that notifies other network relay devices that the failure sign is detected ;
A monitoring enhancement information table in which additional monitoring information related to the failure sign is defined, and
The in-device information collection unit
A network relay device that collects additional monitoring information based on the monitoring strengthening information table when the failure sign is detected .

A list that is shared with other network relay devices and that defines the type of failure indication;
A sign detection condition table that defines detection conditions for fault signs for each type, and
The sign notification processing unit
3. The network relay according to claim 2, wherein when any type of failure sign is detected based on the sign detection condition table, the other network relay apparatus is notified that the type of failure sign is detected. apparatus.

The sign detection condition table defines a failure sign convergence condition for each type,
The sign notification processing unit
After notifying that it has been detected, based on the indication detection condition table, when the failure indication of the type has converged, it notifies other network relay devices that the failure indication of the type has converged. 3. The network relay device according to 3.

The enhanced monitoring information table is defined such that additional monitoring information related to the failure indication can be indexed by the type of the failure indication,
The in-device information collection unit
The network relay device according to claim 3, wherein when any type of failure sign is detected, additional monitoring information is collected based on the monitoring strengthening information table.

The sign detection condition table defines a failure sign convergence condition for each type,
The in-device information collection unit
After any type of failure sign is detected, based on the sign detection condition table,
The network relay device according to claim 5, wherein when the failure sign of the type converges, the collection of additional monitoring information is stopped based on the monitoring enhancement information table.

In-device information collection unit that collects monitoring information in its own device;
An information recording unit for storing the monitoring information;
The in-device information collection unit
A network relay device that collects additional monitoring information when notified by another network relay device that a failure sign has been detected.

A list that is shared with other network relay devices and that defines the type of failure indication;
A monitoring enhancement information table for indexing additional monitoring information related to a failure sign of that type when notified by another network relay device that a failure sign of any type has been detected;
The in-device information collection unit
When it is notified from another network relay device that a failure sign of any type has been detected, additional monitoring information related to the failure sign of that type is collected based on the enhanced monitoring information table. Item 8. The network relay device according to Item 7.

The in-device information collection unit
When another network relay device is notified that a failure sign of any type has been detected, and after that, the other network relay device is notified that the failure sign of the type has converged. The network relay device according to claim 8, wherein collection of additional monitoring information is stopped based on the monitoring enhancement information table.