JP2014222794A

JP2014222794A - State monitoring apparatus

Info

Publication number: JP2014222794A
Application number: JP2013100884A
Authority: JP
Inventors: 大志時水; Hiroshi Tokimizu
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2013-05-13
Filing date: 2013-05-13
Publication date: 2014-11-27
Anticipated expiration: 2033-05-13
Also published as: JP6099478B2

Abstract

PROBLEM TO BE SOLVED: To provide a state monitoring apparatus capable of detecting abnormality of a portion of a physical layer of an apparatus connected to a network.SOLUTION: An apparatus 1 comprises: a self apparatus inside state collection unit 108 which collects states of respective portions in the apparatus; a state processing unit 100 which stores the collected states in a state monitoring database unit 101; an other-apparatus inside state collection unit 109 which collects states of portions of other apparatuses through a network 4 to store the collected states by the state processing unit 100 in the state monitoring database unit 101; an abnormal position detection unit 102 which detects an abnormal portion of the self apparatus or the other apparatus from the states of portions stored in the state monitoring database unit 101 while referring to an abnormality specifying database 104 defining a reference state for abnormality detection so as to specify an abnormal portion; and a display unit/setting unit 103 which displays the detected abnormal portion.

Description

この発明は、ネットワークに接続された装置の状態監視を行い、異常部位の特定を行う状態監視装置に関するものである。 The present invention relates to a state monitoring device that monitors the state of a device connected to a network and identifies an abnormal part.

ネットワークの異常部位を特定する方法として、特許文献１では、ネットワーク上のネットワーク層やトランスポート層の通信状態を監視し、予め用意された障害の発生原因となる要素と通信の異常を示す事象を対応付けした障害箇所の判定テーブルにより、異常を検出することで、障害発生箇所を判定する。さらに、複数の装置から判定結果を示す障害情報を収集し、複数の障害情報に共通する要因から、障害発生箇所を自動で特定している。
具体的には、再生パケットの有無、重複受信パケットの有無、データの欠落の有無、Ａｃｋ（確認応答）の応答時間、リセット信号などをモニタすることで、通信の異常を検出し、どの異常事象がどのコネクションで発生しているかを障害発生箇所の判定テーブルにより、障害発生箇所を判定している。 As a method for specifying an abnormal part of a network, Patent Document 1 monitors a communication state of a network layer or a transport layer on a network, and prepares an event indicating an abnormality of communication and an element that causes a failure prepared in advance. A failure occurrence location is determined by detecting an abnormality using the associated failure location determination table. Furthermore, failure information indicating determination results is collected from a plurality of devices, and the location where the failure has occurred is automatically identified from factors common to the plurality of failure information.
Specifically, by monitoring the presence / absence of regenerated packets, presence / absence of duplicate received packets, presence / absence of data loss, response time of Ack (acknowledgment response), reset signal, etc., communication abnormalities are detected and which abnormal events The failure occurrence location is determined by the failure occurrence location determination table as to which connection is occurring.

ここで、特許文献１の方法では、各々のコネクションで検出された異常を示す事象の違いを基に障害発生箇所を判定していたために、通信相手の装置やコネクションの数が少ない場合には、十分な障害情報を収集できず、障害発生箇所を正確に判定できなくなるという問題があった。
そこで、特許文献２では、コネクションの数に関係なく、ネットワーク上の障害発生箇所を判定する方法として、内部ネットワークと外部ネットワークとの間のルータの物理アドレスを記憶するとともに、他の装置との通信状態を監視することで、その監視により検出された通信内容から通信の異常を検出した場合、記憶したアドレスと一致しない場合、そのルータ以外の内部ネットワークに接続された装置との間の通信にて障害が発生したと判定する。 Here, in the method of Patent Document 1, because the failure location is determined based on the difference in the event indicating abnormality detected in each connection, when the number of communication counterpart devices and connections is small, There was a problem that sufficient failure information could not be collected and the location where the failure occurred could not be accurately determined.
Therefore, in Patent Document 2, as a method of determining a failure occurrence location on the network regardless of the number of connections, the physical address of the router between the internal network and the external network is stored and communication with other devices is performed. By monitoring the status, if a communication error is detected from the communication content detected by the monitoring, and if it does not match the stored address, communication with a device connected to the internal network other than the router Determine that a failure has occurred.

特許文献３の方法では、複数の装置のいずれかまたはバスが故障したとき、システム内の複数の装置のいずれかより障害情報が出力されたときは、その障害情報に基づいて、複数の装置の接続状態と障害要因の関連性が予め記憶されているデータベースを検索し、故障発生箇所とその故障要因に関連する所定の動作を行うシステム内の装置接続構成を示す障害表示情報を生成、表示して、保守者に提示して障害の発生を判定する。 In the method of Patent Document 3, when any of a plurality of devices or a bus fails, and when failure information is output from any of a plurality of devices in the system, the plurality of devices are based on the failure information. Search the database in which the relationship between the connection status and the failure factor is stored in advance, and generate and display the failure display information indicating the location of the failure and the device connection configuration in the system that performs the predetermined operation related to the failure factor And presenting it to the maintenance person to determine the occurrence of the failure.

特開２００５−１６７３４７号公報（第６〜９頁、第１図）Japanese Patent Laying-Open No. 2005-167347 (pages 6-9, FIG. 1) 特開２００７−２７４２８２号公報（第５〜８頁、第１図）JP 2007-274282 A (pages 5 to 8, FIG. 1) 特開２００８−２１７１８８号公報（第６〜７頁、第１図）JP 2008-217188 A (pages 6-7, FIG. 1)

しかしながら、特許文献１と特許文献２の障害発生箇所の特定手段では、ネットワークの異常がどの装置間のコネクションで発生しているか特定できるものの、障害発生箇所の判定テーブルが不明確なため、障害発生箇所が自装置、装置間の伝送路、もしくは相手側装置かを明確に特定できない。
例えば、障害発生機器分類テーブルに、全てのコネクションで異常を検出時の障害発生機器として、隣接伝送路と定義されているが、自装置自身のコネクションができないため、自装置自身で障害が発生している可能性も否定できない。また、特定のＩＰアドレス、ポートで異常検出時の障害発生機器として、相手側装置と定義されているが、相手側の該当ポートに接続された伝送路に問題がある可能性も否定できない。
そのため、定義された障害発生機器を交換したとしても、障害の早期復旧ができないという問題がある。 However, although the means for identifying the location of failure in Patent Document 1 and Patent Document 2 can identify the connection between the devices in which the network abnormality has occurred, the failure occurrence location determination table is unclear. It is not possible to clearly identify whether the location is its own device, a transmission path between devices, or a counterpart device.
For example, in the faulty device classification table, an adjacent transmission path is defined as a faulty device when an error is detected in all connections. The possibility of being undeniable cannot be denied. Further, although a partner device is defined as a failure-occurring device when an abnormality is detected at a specific IP address and port, the possibility that there is a problem in the transmission path connected to the corresponding port on the partner side cannot be denied.
For this reason, there is a problem that even if the defined failure occurrence device is replaced, the failure cannot be recovered early.

また、上記従来技術では、障害発生した装置を特定できる可能性はあるものの、装置内のどの部位が異常なのかを検出する方法がないため、障害発生箇所の詳細な特定ができない。
具体的には、従来技術では、通信状態の監視対象が、応答（Ａｃｋ）の番号、シーケンス番号、データ長、再送回数、重複受信回数、遅延状況、およびパケットロスなどネットワーク層やトランスポート層であり、物理層が監視対象でなかった。
通常、ネットワークで障害が発生する主要因は、物理層の故障や経年劣化によるものであるため、従来技術では物理層の異常の検出ができない。物理層の例としては、光ネットワークの場合、伝送媒体である光ファイバ、光の経路を切換える光スイッチ、光を送受信し光／電気の変換をする光送受信器、電気／論理信号の変換をするＰＨＹチップ（ＰｈｙｓｉｃａｌＬａｙｅｒＣｈｉｐ）、フレームの転送処理をするＬ２／Ｌ３スイッチ、フレームを処理するための基準クロックとなる水晶発振器やクロックドライバなどである。
そのため、従来技術での障害発生箇所の特定方法の場合、障害発生時の交換対象が広範囲にわたり、さらに異常部位の特定に時間がかかるため、保守費用が高額になるという問題があった。 In addition, although there is a possibility that the faulty device can be specified in the above-described conventional technology, since there is no method for detecting which part of the device is abnormal, the fault location cannot be specified in detail.
Specifically, in the prior art, the communication status is monitored at the network layer or transport layer such as the response (Ack) number, sequence number, data length, number of retransmissions, number of duplicate receptions, delay status, and packet loss. Yes, the physical layer was not monitored.
Normally, the main cause of a failure in a network is a physical layer failure or deterioration over time, so that the conventional technology cannot detect a physical layer abnormality. As an example of the physical layer, in the case of an optical network, an optical fiber that is a transmission medium, an optical switch that switches an optical path, an optical transceiver that transmits and receives light and performs optical / electrical conversion, and electrical / logical signal conversion A PHY chip (Physical Layer Chip), an L2 / L3 switch that performs frame transfer processing, a crystal oscillator that serves as a reference clock for processing the frame, a clock driver, and the like.
Therefore, in the case of the conventional method for identifying the location where a failure has occurred, there is a problem that the replacement cost at the time of the failure is wide, and it takes time to identify the abnormal portion, which increases the maintenance cost.

さらに、障害が物理層の一過性の故障の場合、障害発生した装置を解析しても障害が簡単に再現せず、障害発生の真の原因が特定できず、障害解析に膨大な時間と費用がかかるという問題があった。また、上記従来技術では、ネットワークに異常が発生してから、障害発生箇所を切り分けているため、継続してネットワークを正常に運用することができないという問題があった。 Furthermore, if the failure is a transient failure in the physical layer, even if the failed device is analyzed, the failure cannot be easily reproduced, the true cause of the failure cannot be identified, and the failure analysis takes an enormous amount of time. There was a problem that it was expensive. In addition, the above-described prior art has a problem that the network cannot be operated normally continuously because the failure location is isolated after an abnormality has occurred in the network.

また、特許文献３では、装置内部の装置で故障が発生した場合、装置内部の障害要因と障害が発生した装置の接続関係を示す装置構成情報により、故障した装置を特定しているが、故障が発生してからの復旧作業となるため、継続してシステムを正常に運用することができないという問題があった。
さらに、故障した装置までは特定できていても、装置内部の部位まで特定できないため、特許文献１と特許文献２と同様に、障害発生した装置を解析しても障害が簡単に再現せず、障害発生の真の原因が特定できず、障害解析に膨大な時間と費用がかかるという問題があった。 Further, in Patent Document 3, when a failure occurs in a device inside the device, the failed device is identified by the device configuration information indicating the connection relationship between the failure factor inside the device and the device in which the failure has occurred. Since this was a recovery work after the occurrence of the problem, there was a problem that the system could not be operated normally continuously.
Furthermore, even if the faulty device can be identified, it is not possible to identify the part inside the device. Therefore, as in Patent Document 1 and Patent Document 2, the failure is not easily reproduced even if the failed device is analyzed. There was a problem that the true cause of the failure occurrence could not be identified, and the failure analysis took enormous time and cost.

この発明は、上述のような課題を解決するためになされたものであり、ネットワークに接続された装置の物理層の部位の異常を検出できる状態監視装置を得ることを目的とする。 The present invention has been made to solve the above-described problems, and an object of the present invention is to obtain a state monitoring device capable of detecting an abnormality in a physical layer portion of a device connected to a network.

この発明に係わる状態監視装置においては、自状態監視装置を構成する各部位の状態を収集する内部状態収集部、この内部状態収集部により収集された各部位の状態が格納された状態監視データベース、部位ごとの異常を判定するための基準を予め定義した異常特定用データベース、及び状態監視データベースに格納された各部位の状態と異常特定用データベースの該当部位の基準とを比較し、該当部位が異常かどうかを判定する異常箇所検出部を備えたものである。 In the state monitoring device according to the present invention, an internal state collection unit that collects the state of each part constituting the own state monitoring device, a state monitoring database in which the state of each part collected by the internal state collection unit is stored, Compare the condition of each part stored in the abnormality monitoring database with the criteria for judging abnormality for each part in advance and the condition monitoring database and the corresponding part in the abnormality identification database. It is provided with the abnormal part detection part which determines whether it is.

この発明によれば、自状態監視装置を構成する各部位の状態を収集する内部状態収集部、この内部状態収集部により収集された各部位の状態が格納された状態監視データベース、部位ごとの異常を判定するための基準を予め定義した異常特定用データベース、及び状態監視データベースに格納された各部位の状態と異常特定用データベースの該当部位の基準とを比較し、該当部位が異常かどうかを判定する異常箇所検出部を備えたので、装置内の各部位で障害が発生した場合に異常部位を即座に特定することができる。 According to this invention, the internal state collection unit that collects the state of each part constituting the self-state monitoring apparatus, the state monitoring database that stores the state of each part collected by the internal state collection unit, and the abnormality for each part Compare the condition of each part stored in the abnormality identification database, which is defined in advance with the criteria for determining the condition, and the corresponding part standard in the abnormality identification database, and determine whether the corresponding part is abnormal Since the abnormal location detecting unit is provided, the abnormal site can be immediately identified when a failure occurs in each site in the apparatus.

この発明の実施の形態１による状態監視装置の構成を示すブロック図である。It is a block diagram which shows the structure of the state monitoring apparatus by Embodiment 1 of this invention. この発明の実施の形態１による状態監視装置のハードウェア構成を示すブロック図である。It is a block diagram which shows the hardware constitutions of the state monitoring apparatus by Embodiment 1 of this invention. この発明の実施の形態１による状態監視装置の状態監視データベース部の一例を示す図である。It is a figure which shows an example of the state monitoring database part of the state monitoring apparatus by Embodiment 1 of this invention. この発明の実施の形態１による状態監視装置の異常特定用データベースの一例を示す図である。It is a figure which shows an example of the abnormality identification database of the state monitoring apparatus by Embodiment 1 of this invention. この発明の実施の形態１による状態監視装置の閾値格納部の一例を示す図である。It is a figure which shows an example of the threshold value storage part of the state monitoring apparatus by Embodiment 1 of this invention. この発明の実施の形態１による状態監視装置の装置内部の異常部位を特定する処理を示すフローチャートである。It is a flowchart which shows the process which specifies the abnormal site | part inside the apparatus of the state monitoring apparatus by Embodiment 1 of this invention. この発明の実施の形態２による状態監視装置のネットワークを経由して他装置内部の状態を取得する構成を示す説明図である。It is explanatory drawing which shows the structure which acquires the state inside another apparatus via the network of the state monitoring apparatus by Embodiment 2 of this invention. この発明の実施の形態２による状態監視装置の装置内部の状態を他装置に通知するためのフレームのフォーマットの一例を示す図である。It is a figure which shows an example of the format of the flame | frame for notifying the state inside the apparatus of the state monitoring apparatus by Embodiment 2 of this invention to another apparatus. この発明の実施の形態２によるネットワーク上の状態監視装置のハードウェア構成を示すブロック図である。It is a block diagram which shows the hardware constitutions of the state monitoring apparatus on the network by Embodiment 2 of this invention. この発明の実施の形態２による状態監視装置の状態監視データベース部の一例を示す図である。It is a figure which shows an example of the state monitoring database part of the state monitoring apparatus by Embodiment 2 of this invention. この発明の実施の形態２による状態監視装置の異常特定用データベースの一例を示す図である。It is a figure which shows an example of the abnormality identification database of the state monitoring apparatus by Embodiment 2 of this invention. この発明の実施の形態２による状態監視装置の閾値格納部の一例を示す図である。It is a figure which shows an example of the threshold value storage part of the state monitoring apparatus by Embodiment 2 of this invention. この発明の実施の形態２による光ネットワーク上の状態監視装置のハードウェア構成を示すブロック図である。It is a block diagram which shows the hardware constitutions of the state monitoring apparatus on the optical network by Embodiment 2 of this invention. この発明の実施の形態２による光ネットワーク上の状態監視装置の異常特定用データベースの一例を示す図である。It is a figure which shows an example of the database for abnormality specification of the state monitoring apparatus on the optical network by Embodiment 2 of this invention. この発明の実施の形態３による状態監視装置のあるパラメータの発生回数の閾値の自動算出を示す概念図である。It is a conceptual diagram which shows the automatic calculation of the threshold value of the generation frequency of a certain parameter of the state monitoring apparatus by Embodiment 3 of this invention. この発明の実施の形態３による状態監視装置の設定した閾値を超えた場合の異常検出を示す概念図である。It is a conceptual diagram which shows the abnormality detection when the threshold value set by the state monitoring apparatus by Embodiment 3 of this invention is exceeded.

実施の形態１．
以下、本発明の実施の形態１を図に基づいて説明する。なお、この実施の形態によりこの発明が限定されるものではない。
図１は、この発明の実施の形態１による状態監視装置の構成を示すブロック図である。
図１において、装置１（状態監視装置）はネットワーク４に接続され、次のように構成されている。
自装置内部状態収集部１０８（内部状態収集部）は、後述する図２の装置内部の各ブロックについてのパラメータの状態を収集する。状態処理部１００は、自装置内部状態収集部１０８で収集した装置内部の状態（各部位の各パラメータの状態）を状態監視データベース部１０１（状態監視データベース）に保存する。状態監視データベース部１０１は、装置内部の物理層を含む各部位の状態を管理する。 Embodiment 1 FIG.
Embodiment 1 of the present invention will be described below with reference to the drawings. Note that the present invention is not limited to the embodiments.
FIG. 1 is a block diagram showing a configuration of a state monitoring apparatus according to Embodiment 1 of the present invention.
In FIG. 1, a device 1 (state monitoring device) is connected to a network 4 and configured as follows.
The own apparatus internal state collection unit 108 (internal state collection unit) collects parameter states for each block in the apparatus of FIG. The state processing unit 100 stores the internal state of the device (the state of each parameter of each part) collected by the internal device internal state collection unit 108 in the state monitoring database unit 101 (state monitoring database). The state monitoring database unit 101 manages the state of each part including the physical layer inside the apparatus.

異常特定用データベース部１０４（異常特定用データベース）は、装置内部の状態から異常部位を特定するためのパラメータの基準状態を格納したデータベースである。閾値格納部１０５（異常閾値データベース）は、装置内の各部位の各パラメータが数値データであるときに、異常かどうかを判断するための判定基準となる閾値を格納する。
異常箇所検出部１０２は、状態監視データベース部１０１の装置内部の状態と、異常特定用データベース部１０４の異常部位を特定するためのパラメータの基準状態とを比較することで、異常の発生の有無を判定し、異常発生時、異常を検出する。
表示部／設定部１０３は、異常が検出された場合、表示することにより、異常が発生したことを通知する。 The abnormality identifying database unit 104 (abnormality identifying database) is a database that stores a reference state of parameters for identifying an abnormal part from the state inside the apparatus. The threshold storage unit 105 (abnormal threshold database) stores a threshold that serves as a determination criterion for determining whether there is an abnormality when each parameter of each part in the apparatus is numerical data.
The abnormal part detection unit 102 compares the internal state of the apparatus of the state monitoring database unit 101 with the reference state of a parameter for specifying the abnormal part of the abnormality specifying database unit 104 to determine whether or not an abnormality has occurred. Judges and detects an abnormality when an abnormality occurs.
When an abnormality is detected, the display / setting unit 103 notifies that an abnormality has occurred by displaying the abnormality.

受信部１０６は、他装置から内部状態を示すデータを格納した内部状態通知フレームを受信し、他装置内部状態収集部１０９に渡す。他装置内部状態収集部１０９は、受け取った他装置の内部状態を状態処理部１００に渡し、状態監視データベース部１０１に保管させる。
状態通知部１１０は、状態処理部１００から自装置の内部状態を貰い、内部状態通知フレームを作成する。送信部１０７は、他装置へ、自装置の内部状態を示すデータを格納した内部状態通知フレームを送信する。 The receiving unit 106 receives an internal state notification frame storing data indicating the internal state from another device, and passes it to the other device internal state collecting unit 109. The other device internal state collection unit 109 passes the received internal state of the other device to the state processing unit 100 and stores it in the state monitoring database unit 101.
The state notification unit 110 receives the internal state of the device from the state processing unit 100 and creates an internal state notification frame. The transmitting unit 107 transmits an internal state notification frame storing data indicating the internal state of the own device to another device.

ここで、自装置内部状態収集部１０８、状態処理部１００、状態監視データベース部１０１、異常箇所検出部１０２、異常特定用データベース部１０４、閾値格納部１０５、他装置内部状態収集部１０９、状態通知部１１０は、例えば、後述する図２でのロジック部１０で実現される。 Here, its own device internal state collection unit 108, state processing unit 100, state monitoring database unit 101, abnormal part detection unit 102, abnormality identification database unit 104, threshold storage unit 105, other device internal state collection unit 109, state notification The unit 110 is realized by, for example, the logic unit 10 in FIG.

図２は、この発明の実施の形態１による状態監視装置のハードウェア構成を示すブロック図である。
図２において、発振器部１７、１８、１９は、水晶発振器やクロックドライバなどによって構成され、それぞれフレームを処理するための基準クロックを発振する。Ｌ２スイッチ部１５は、レイヤ２のフレームの転送処理を行う。Ｌ３スイッチ部１６は、レイヤ３のフレームの転送処理を行う。ＰＨＹ（ＰｈｙｓｉｃａｌＬａｙｅｒ、物理層）部１３、１４は、電気／論理信号の変換をする。ロジック部１０は、ＰＬＤ（ＰｒｏｇｒａｍｍａｂｌｅＬｏｇｉｃＤｅｖｉｃｅ）またはＡＳＩＣ（ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）で実現され、図１に示す各部を形成するために用いられる。電気Ｉ／Ｆ部１１、１２は、ネットワーク４とのインターフェースを形成する。 FIG. 2 is a block diagram showing a hardware configuration of the state monitoring apparatus according to Embodiment 1 of the present invention.
In FIG. 2, oscillator units 17, 18, and 19 are configured by a crystal oscillator, a clock driver, and the like, and each oscillates a reference clock for processing a frame. The L2 switch unit 15 performs layer 2 frame transfer processing. The L3 switch unit 16 performs a layer 3 frame transfer process. PHY (Physical Layer, physical layer) units 13 and 14 convert electrical / logical signals. The logic unit 10 is realized by PLD (Programmable Logic Device) or ASIC (Application Specific Integrated Circuit), and is used to form each unit shown in FIG. The electrical I / F units 11 and 12 form an interface with the network 4.

図３は、この発明の実施の形態１による状態監視装置の状態監視データベース部の一例を示す図である。
図３において、状態監視データベース部１０１は、図２のＰＨＹ部１３、ＰＨＹ部１４でのリンク、クロックロス、位相ずれ、Ｌ２スイッチ部１５でのパケットロス、位相ずれ、Ｌ３スイッチ部１６での再送回数、クロックロス、位相ずれ、発振器部１７のクロック、位相ずれなど、各部位ごとに、その内部状態を示すパラメータを保存している。パラメータのうち、位相ずれなど、数値で表されるものについては、閾値格納部１０５を参照して、格納される状態を大、中、小のいずれかで保存する。 FIG. 3 is a diagram showing an example of a state monitoring database unit of the state monitoring apparatus according to Embodiment 1 of the present invention.
In FIG. 3, the state monitoring database unit 101 includes links, clock loss, phase shift, packet loss at the L2 switch unit 15, phase shift, and retransmission at the L3 switch unit 16 in the PHY unit 13 and PHY unit 14 in FIG. Parameters indicating the internal state of each part, such as the number of times, clock loss, phase shift, clock of the oscillator unit 17, phase shift, and the like are stored. Of the parameters, those represented by numerical values such as phase shifts are stored with reference to the threshold storage unit 105 and stored in a large, medium, or small state.

図４は、この発明の実施の形態１による状態監視装置の異常特定用データベースの一例を示す図である。
図４において、異常特定用データベース部１０４は、異常発生条件と異常部位の対応関係を格納している。異常発生条件として、装置内の部位に対応させて、異常と判定するための各パラメータの基準状態を保存している。ここで、異常の状態として、軽度な異常を軽故障、重度な異常を重故障と定義される。軽故障は、異常があるものの、継続して動作可能な状態を示す。一方、重故障は、システムが継続動作不可能な状態を示す。位相ずれなど、数値で表されるものについては、閾値格納部１０５を参照して、パラメータの基準状態として、大、中、小のいずれかが格納されている。
なお、異常発生条件として、複数の部位で異常が発生する場合に、異常発生部位の組合せと想定故障部位の対応関係を定義している。これにより、複数の部位で異常が発生した場合にも、本当の異常箇所の部位を特定可能になる。 FIG. 4 is a diagram showing an example of the abnormality specifying database of the state monitoring apparatus according to Embodiment 1 of the present invention.
In FIG. 4, the abnormality specifying database unit 104 stores the correspondence between the abnormality occurrence condition and the abnormal part. As an abnormality occurrence condition, a reference state of each parameter for determining an abnormality is stored in correspondence with a part in the apparatus. Here, as an abnormal state, a minor abnormality is defined as a minor failure, and a severe abnormality is defined as a major failure. A minor failure indicates a state in which operation is possible even though there is an abnormality. On the other hand, a serious failure indicates a state in which the system cannot be continuously operated. For a numerical value such as a phase shift, one of large, medium, and small is stored as a parameter reference state with reference to the threshold value storage unit 105.
Note that, as an abnormality occurrence condition, when an abnormality occurs in a plurality of parts, a correspondence relationship between the combination of the abnormality occurrence parts and the assumed failure part is defined. Thereby, even when an abnormality occurs in a plurality of parts, it is possible to specify the part of the true abnormality part.

図５は、この発明の実施の形態１による状態監視装置の閾値格納部の一例を示す図である。
図５において、閾値格納部１０５は、装置内の各部位の各パラメータが異常かどうかを判断するための判定基準となる閾値を格納する。図５ではパラメータごとに、許容できない異常、許容可能な異常、正常の各数値（閾値）が格納されている。この閾値は、表示部／設定部１０３経由で設定可能である。 FIG. 5 is a diagram showing an example of the threshold storage unit of the state monitoring apparatus according to Embodiment 1 of the present invention.
In FIG. 5, the threshold storage unit 105 stores a threshold serving as a determination criterion for determining whether or not each parameter of each part in the apparatus is abnormal. In FIG. 5, numerical values (threshold values) of unacceptable abnormality, acceptable abnormality, and normal are stored for each parameter. This threshold value can be set via the display / setting unit 103.

次に、実施の形態１の装置内部の異常部位を特定するための動作について説明する。
装置１が、例えば図２の部位を持つ構成において、図３に示す自装置内部の部位のパラメータを持つ場合、自装置内部状態収集部１０８が、図２の各部位の各パラメータの状態を収集する。次いで、状態処理部１００が、自装置内部状態収集部１０８により収集した装置内部の状態（各部位の各パラメータの状態）を状態監視データベース部１０１に保存する。
このとき、位相ずれなど、数値で表されるものについては、閾値格納部１０５を参照して、格納される状態を決める。 Next, an operation for specifying an abnormal site inside the apparatus according to the first embodiment will be described.
For example, when the apparatus 1 has the configuration shown in FIG. 2 and has the parameters of the internal part shown in FIG. 3, the internal state collection unit 108 collects the status of each parameter of each part shown in FIG. To do. Next, the state processing unit 100 stores the internal state of the apparatus (the state of each parameter of each part) collected by the internal apparatus internal state collection unit 108 in the state monitoring database unit 101.
At this time, with respect to a numerical value such as a phase shift, the threshold storage unit 105 is referred to determine a stored state.

異常箇所検出部１０２は、状態監視データベース部１０１に保存された装置内部の部位ごと及びパラメートごとの状態と、異常特定用データベース部１０４の異常発生条件の部位ごと及びパラメータごとの基準状態とを比較することで、異常の発生の有無を判定し、異常発生時、異常部位を検出する。異常部位を検出した場合、表示部／設定部１０３にて表示することにより、異常が発生したことを保守員に通知する。 The abnormal part detection unit 102 compares the state of each part and parameter inside the apparatus stored in the state monitoring database unit 101 with the reference state for each part and parameter of the abnormality occurrence condition of the abnormality specifying database unit 104. By doing so, it is determined whether or not an abnormality has occurred, and an abnormal part is detected when the abnormality occurs. When an abnormal part is detected, it is displayed on the display / setting unit 103 to notify maintenance personnel that an abnormality has occurred.

異常特定用データベース部１０４で、異常発生条件と異常部位の対応関係を記憶しておくことで、例えば、軽故障の条件１の場合、ＰＨＹ１３で位相ずれが発生（レベル：中）し、その他のパラメータが問題なければ、異常部位としてＰＨＹ部１３が特定可能である。
また、例えば、重故障の条件３の場合、ＰＨＹ１３とＰＨＹ１４で位相ずれが発生（レベル：大）し、発振器部１７でも位相ずれが発生（レベル：大）しているため、異常部位として発振器部１７が特定可能である。 By storing the correspondence relationship between the abnormality occurrence condition and the abnormal part in the abnormality specifying database unit 104, for example, in the case of a minor failure condition 1, a phase shift occurs (level: medium) in the PHY 13 and other If there is no problem with the parameters, the PHY unit 13 can be specified as an abnormal part.
Further, for example, in the case of the serious failure condition 3, a phase shift occurs in the PHY 13 and PHY 14 (level: large), and a phase shift also occurs in the oscillator unit 17 (level: large). 17 can be specified.

このように、事前に異常発生条件を設定しておき、各部位の状態を自装置内部状態収集部１０８で逐次収集し、状態監視データベース部１０１に状態処理部１００にて逐次保存しておき、異常特定用データベース部１０４と状態監視データベース部１０１とを異常箇所検出部１０２にて、逐次比較することにより、異常部位が即時検出可能となる。 In this way, abnormality occurrence conditions are set in advance, the state of each part is sequentially collected by the internal device internal state collection unit 108, and is sequentially stored in the state monitoring database unit 101 by the state processing unit 100, By comparing the abnormality specifying database unit 104 and the state monitoring database unit 101 sequentially with the abnormal part detecting unit 102, the abnormal part can be immediately detected.

次に、図６のフローチャートを用いて、異常箇所検出部１０２により異常部位を特定するための処理フローについて説明する。
ステップＳ１で、異常箇所検出部１０２が、異常箇所検出処理を開始する。
次いで、ステップＳ２で、状態監視データベース部１０１の部位ごと及びパラメータごとの状態と、異常特定用データベース部１０４の異常発生条件の部位ごと及びパラメータごとの基準状態とを比較し、一致するかどうかを判定する。
一致しなければ、ステップＳ１に戻り、一致すれば、ステップＳ３で、異常特定用データベース部１０４の異常発生条件のパラメータの基準状態が一致する異常部位を特定し、終了する。 Next, a processing flow for specifying an abnormal part by the abnormal part detection unit 102 will be described with reference to the flowchart of FIG.
In step S1, the abnormal part detection unit 102 starts an abnormal part detection process.
Next, in step S2, the state for each part and parameter in the state monitoring database unit 101 is compared with the reference state for each part and parameter in the abnormality occurrence condition in the abnormality specifying database unit 104 to determine whether they match. judge.
If they do not match, the process returns to step S1, and if they match, in step S3, an abnormal part where the reference state of the parameter of the abnormality occurrence condition in the abnormality specifying database unit 104 matches is specified, and the process ends.

実施の形態１によれば、このように、装置内部の各部位毎、各パラメータ毎の状態を一括で管理し、各パラメータ毎に、正常か、許容可能な異常か、許容できない異常かの閾値を事前に設定し、ある部位で異常が発生した場合、その閾値を超えた異常発生条件のパラメータの組合せと想定故障部位の対応関係から、異常箇所の部位を特定することが可能になる。
ここで、異常発生条件である複数の部位で異常が発生していても、事前に異常発生部位の組合せと想定故障部位の対応関係を定義することで、本当の異常箇所の部位を特定可能になる。
また、装置内の部位は、ネットワーク層やトランスポート層に限らず、物理層であっても異常検出が可能になる。 According to the first embodiment, the state of each part and each parameter in the apparatus is collectively managed as described above, and a threshold value indicating whether each parameter is normal, acceptable abnormality, or unacceptable abnormality. When an abnormality occurs in a certain part, it is possible to specify the part of the abnormal part from the correspondence between the parameter combination of the abnormality occurrence condition exceeding the threshold and the assumed failure part.
Here, even if an abnormality has occurred at multiple sites that are abnormal conditions, it is possible to identify the actual abnormal site by defining the correspondence between the combination of the abnormal site and the assumed failure site in advance. Become.
In addition, the site in the apparatus is not limited to the network layer and the transport layer, and an abnormality can be detected even in the physical layer.

なお、異常特定用データベース部１０４に重故障の発生条件だけでなく、軽故障の発生条件も定義することにより、軽故障発生時に事前に異常部位を把握でき、装置の保守・点検時、故障や異常が発生しそうな箇所を事前に代替品に交換することにより、システムの継続稼動が可能となる。 In addition, by defining not only the occurrence conditions of major faults but also the occurrence conditions of minor faults in the fault identification database unit 104, the faulty part can be grasped in advance when minor faults occur. The system can be continuously operated by exchanging a place where an abnormality is likely to occur in advance.

上述の図３の説明では、位相ずれなどの数値データを大小で表現したが、数値そのものであってもよく、その場合には、異常箇所検出部１０２は、数値データを閾値格納部１０５の閾値と比較することになる。 In the description of FIG. 3 described above, the numerical data such as the phase shift is expressed by the magnitude, but it may be a numerical value itself. In this case, the abnormal part detection unit 102 converts the numerical data to the threshold value of the threshold value storage unit 105. Will be compared.

実施の形態２．
実施の形態１では、自装置内部の異常を検出できるものの、その異常が本当に自装置に起因する異常か、もしくは、ネットワーク上の他装置に起因する異常かを判断できないという問題がある。
そこで、実施の形態２では、実施の形態１にて、自装置の内部状態だけでなく、他装置の内部状態を収集できるよう、ネットワークを経由して、各装置の内部状態を収集し、どの装置のどの部位が異常になっているかを判断可能とする。
なお、自装置の内部状態を収集する方法は、実施の形態１と同様のため、その説明を省略する。 Embodiment 2. FIG.
In the first embodiment, although an abnormality in the own device can be detected, there is a problem that it cannot be determined whether the abnormality is actually caused by the own device or an abnormality caused by another device on the network.
Therefore, in the second embodiment, the internal state of each device is collected via the network so that not only the internal state of the own device but also the internal state of other devices can be collected in the first embodiment. It is possible to determine which part of the apparatus is abnormal.
Note that the method for collecting the internal state of the own apparatus is the same as that in the first embodiment, and thus the description thereof is omitted.

図７は、この発明の実施の形態２による状態監視装置のネットワークを経由して他装置内部の状態を取得する構成を示す説明図である。
図７において、１、４は図１におけるものと同一のものである。図７では、ネットワーク４には、装置１と同様の装置２及び装置３が接続されている。内部状態通知フレーム１ａは、装置１が送信し、装置２、３が受信するフレームである。内部状態通知フレーム２ａは、装置２が送信し、装置１、３が受信するフレームである。内部状態通知フレーム３ａは、装置３が送信し、装置１、２が受信するフレームである。 FIG. 7 is an explanatory diagram showing a configuration for acquiring the internal state of another apparatus via the network of the state monitoring apparatus according to Embodiment 2 of the present invention.
In FIG. 7, 1 and 4 are the same as those in FIG. In FIG. 7, a device 2 and a device 3 similar to the device 1 are connected to the network 4. The internal state notification frame 1a is a frame transmitted by the device 1 and received by the devices 2 and 3. The internal state notification frame 2a is a frame transmitted by the device 2 and received by the devices 1 and 3. The internal state notification frame 3a is a frame transmitted by the device 3 and received by the devices 1 and 2.

図８は、この発明の実施の形態２による状態監視装置の装置内部の状態を他装置に通知するためのフレームのフォーマットの一例を示す図である。
図８において、内部状態通知フレーム１ａ、２ａ、３ａのフォーマットを示す。それぞれ、Ｌ２ヘッダ、ペイロード（データ部）を有し、ペイロードに、内部状態通知フレームであることを示す識別子５と、各装置１、２、３の内部状態を示すデータである装置内部
状態１ｂ、２ｂ、３ｂが入っている。 FIG. 8 is a diagram showing an example of a frame format for notifying other devices of the state inside the state monitoring device according to the second embodiment of the present invention.
FIG. 8 shows the format of the internal state notification frames 1a, 2a, and 3a. Each has an L2 header and a payload (data part), and an identifier 5 indicating that it is an internal state notification frame in the payload, and a device internal state 1b that is data indicating the internal state of each device 1, 2, 3, Contains 2b and 3b.

図９は、この発明の実施の形態２によるネットワーク上の状態監視装置のハードウェア構成を示すブロック図である。
図９において、１〜４は図７におけるものと、１０〜１９は図２におけるものとそれぞれ同一のものである。２０〜２９はそれぞれ１０〜１９と同じものである。装置１〜３は、同じ内部構成である。 FIG. 9 is a block diagram showing a hardware configuration of the state monitoring apparatus on the network according to the second embodiment of the present invention.
9, 1 to 4 are the same as those in FIG. 7, and 10 to 19 are the same as those in FIG. 20 to 29 are the same as 10 to 19, respectively. The devices 1 to 3 have the same internal configuration.

図１０は、この発明の実施の形態２による状態監視装置の状態監視データベース部の一例を示す図である。
図１０において、図３と同様の状態監視データベース部２０１には、自装置の内部状態を示すデータとともに、他装置の内部状態を示すデータも保存される。 FIG. 10 is a diagram showing an example of the state monitoring database unit of the state monitoring apparatus according to Embodiment 2 of the present invention.
In FIG. 10, the state monitoring database unit 201 similar to FIG. 3 stores data indicating the internal state of the own device as well as data indicating the internal state of the other device.

図１１は、この発明の実施の形態２による状態監視装置の異常特定用データベースの一例を示す図である。
図１１において、図４と同様の異常特定用データベース部２０４には、異常と判定する各パラメータの条件として、自装置及び他装置のものが格納されている。 FIG. 11 is a diagram showing an example of the abnormality specifying database of the state monitoring apparatus according to Embodiment 2 of the present invention.
In FIG. 11, the abnormality specifying database unit 204 similar to FIG. 4 stores the parameters of the own device and other devices as the conditions of each parameter determined to be abnormal.

図１２は、この発明の実施の形態２による状態監視装置の閾値格納部の一例を示す図である。
図１２において、閾値格納部２０５には、自装置及び他装置について、ネットワーク上の装置内の各部位の各パラメータが異常かどうかを判断するための判定基準となる閾値が格納されている。 FIG. 12 is a diagram showing an example of the threshold storage unit of the state monitoring apparatus according to Embodiment 2 of the present invention.
In FIG. 12, the threshold value storage unit 205 stores a threshold value that serves as a determination criterion for determining whether or not each parameter of each part in the device on the network is abnormal for the own device and other devices.

図１３は、この発明の実施の形態２による光ネットワーク上の状態監視装置のハードウェア構成を示すブロック図である。
図１３において、１〜３、１０、１３〜１９、２０、２３〜２９は図９におけるものと同一のものである。図１３では、ネットワーク６は、光のネットワークである。また、装置１〜３は、それぞれ、光信号の経路を切換える光スイッチ部３０１〜３０３を介して、ネットワーク６に接続される。各装置１〜３は、それぞれ、光を送受信し、光／電気の変換を行う光／電気変換部によって、光スイッチ部３０１〜３０３に接続されている。装置１は、光／電気変換部１１ａ、１２ａを有し、装置２は、光／電気変換部２１ａ、２２ａを有する。 FIG. 13 is a block diagram showing a hardware configuration of the state monitoring device on the optical network according to the second embodiment of the present invention.
In FIG. 13, 1 to 10, 10, 13 to 19, 20, 23 to 29 are the same as those in FIG. In FIG. 13, the network 6 is an optical network. Further, the devices 1 to 3 are connected to the network 6 via optical switch units 301 to 303 for switching optical signal paths, respectively. Each of the devices 1 to 3 is connected to the optical switch units 301 to 303 by an optical / electrical conversion unit that transmits / receives light and performs optical / electrical conversion. The device 1 includes optical / electrical converters 11a and 12a, and the device 2 includes optical / electrical converters 21a and 22a.

図１４は、この発明の実施の形態２による光ネットワーク上の状態監視装置の異常特定用データベースの一例を示す図である。
図１４において、図１１と同様の異常特定用データベース部３０４には、図１３の構成について、異常と判定する各パラメータの条件として、自装置及び他装置のものが格納されている。 FIG. 14 is a diagram showing an example of the abnormality specifying database of the state monitoring device on the optical network according to the second embodiment of the present invention.
14, the abnormality specifying database unit 304 similar to that in FIG. 11 stores, for the configuration in FIG. 13, the parameters of the own device and other devices as conditions for each parameter determined to be abnormal.

次に、実施の形態２の異常箇所検出部１０２による、装置内部の異常部位を特定するための動作について説明する。
なお、図９及び図１３の構成でも、異常箇所検出部１０２の動作は同じであるから、ここでは図９の構成について説明する。 Next, an operation for identifying an abnormal part inside the apparatus by the abnormal part detection unit 102 according to the second embodiment will be described.
9 and FIG. 13, the operation of the abnormal point detection unit 102 is the same, so the configuration of FIG. 9 will be described here.

図７の構成で、ネットワーク上に３台の装置がある場合、自装置１が、装置２、装置３の内部状態を収集する方法について説明する。なお、ネットワークの装置の台数については、台数を制限するものではない。
各装置は、状態処理部１００にて、定期的に、図１の状態監視データベース部１０１で収集した自装置の内部状態を他装置に通知するため、状態処理部１００で、各装置の内部
状態を、状態通知部１１０に渡す。状態通知部１１０は、図８に示す、装置の内部状態を通知することを示す識別子５をフレームに追加し、内部状態通知フレームを生成し、送信部１０７に渡す。送信部１０７は、ネットワーク４上に、内部状態通知フレームを送信する。ここで、装置２は内部状態通知フレーム２ａを、装置３は内部状態通知フレーム３ａを送信する。 In the configuration of FIG. 7, when there are three devices on the network, a method in which the own device 1 collects the internal states of the devices 2 and 3 will be described. Note that the number of network devices is not limited.
Each device periodically notifies the internal state of its own device collected by the state monitoring database unit 101 in FIG. 1 at the state processing unit 100, so that the state processing unit 100 uses the internal state of each device. Is passed to the state notification unit 110. The status notification unit 110 adds an identifier 5 shown in FIG. 8 indicating notification of the internal status of the apparatus to the frame, generates an internal status notification frame, and passes it to the transmission unit 107. The transmission unit 107 transmits an internal state notification frame on the network 4. Here, the device 2 transmits an internal state notification frame 2a, and the device 3 transmits an internal state notification frame 3a.

装置１は、装置２、装置３から送信された内部状態通知フレーム２ａ、内部状態通知フレーム３ａを受信部１０６で受信し、他装置内部状態収集部１０９に渡す。他装置内部状態収集部１０９は、フレーム内の識別子５から当該フレームが内部状態通知フレームであることを認識し、フレーム内にある内部状態データを抽出する。
また、フレーム内にある送信元アドレスから、得られる内部状態がどの装置かを抽出する。このようにして、装置２内部状態２ｂと装置３内部状態３ｂを抽出する。
状態処理部１００では、収集した装置２内部状態２ｂと装置３内部状態３ｂを状態監視データベース部１０１に保存する。このようにして、自装置１は、ネットワーク上にある装置２、装置３の内部状態を収集可能となる。 The device 1 receives the internal state notification frame 2 a and the internal state notification frame 3 a transmitted from the devices 2 and 3 by the receiving unit 106 and passes them to the other device internal state collection unit 109. The other apparatus internal state collection unit 109 recognizes that the frame is the internal state notification frame from the identifier 5 in the frame, and extracts internal state data in the frame.
Also, it is extracted from the source address in the frame which device is in the obtained internal state. In this way, the device 2 internal state 2b and the device 3 internal state 3b are extracted.
The state processing unit 100 stores the collected device 2 internal state 2 b and device 3 internal state 3 b in the state monitoring database unit 101. In this way, the own device 1 can collect the internal states of the devices 2 and 3 on the network.

例えば、図９のネットワーク構成において、図１０に示す各装置内部のパラメータを持つ場合、装置１（自装置）のパラメータは、自装置の自装置内部状態収集部１０８で収集した内部状態であり、装置２、装置３のパラメータは、図７、図８において、ネットワーク経由で、収集した内部状態である。 For example, in the network configuration of FIG. 9, when the parameters of each device shown in FIG. 10 are included, the parameters of the device 1 (own device) are internal states collected by the own device internal state collection unit 108 of the own device, The parameters of the devices 2 and 3 are the internal states collected via the network in FIGS.

例えば、図１１に、ネットワーク上に装置が３台ある場合の異常特定用データベース部２０４の一例を示す。異常特定用データベース部２０４は、自装置の内部の異常発生条件だけでなく、ネットワーク上の他装置の異常と判定する各パラメータの条件も保存している。 For example, FIG. 11 shows an example of the abnormality specifying database unit 204 when there are three devices on the network. The abnormality specifying database unit 204 stores not only the abnormality occurrence conditions inside the apparatus itself but also the conditions of each parameter determined to be an abnormality of another apparatus on the network.

このように、自装置だけでなく、ネットワーク上の装置の異常発生条件と異常部位の対応関係を記憶しておくことで、ネットワーク上のどの装置のどの部位で異常が発生しているかを特定可能となる。
例えば、軽故障の条件３の場合で、装置１でＰＨＹ部１３の位相ずれ（レベル：中）、装置２でＰＨＹ部２４の位相ずれ（レベル：中）、発振器部２７の位相ずれ（レベル：中）が発生した場合、装置１の内部状態だけでは、実際には装置２の発振器部２７で異常が発生しているのにもかかわらず、装置１のＰＨＹ部１３が異常となっていると診断を誤ることになる。
そこで、異常特定用データベース部２０４で、装置１だけでなく装置２の発生条件も管理し、異常部位を対応付けしておくことで、実際に異常が発生している異常発生部位、装置２の発振器部２７が特定可能となる。 In this way, it is possible to specify which part of a device on the network where the abnormality has occurred by storing the correspondence between the abnormality occurrence condition and the abnormal part of the device on the network as well as the own device. It becomes.
For example, in the case of minor failure condition 3, the phase shift of the PHY unit 13 (level: medium) in the device 1, the phase shift of the PHY unit 24 (level: medium) in the device 2, and the phase shift of the oscillator unit 27 (level: level). Medium) occurs, the PHY unit 13 of the device 1 is abnormal even though the internal state of the device 1 alone is actually abnormal in the oscillator unit 27 of the device 2. The diagnosis will be wrong.
Therefore, the abnormality identification database unit 204 manages not only the apparatus 1 but also the generation conditions of the apparatus 2, and associates the abnormal part so that the abnormal part where the abnormality actually occurs, The oscillator unit 27 can be specified.

ここで、異常特定用データベース部２０４で管理する装置の対象は、ネットワーク上の全装置だけでなく、一部の装置、もしくは隣接装置のみでも構わない。 Here, the target of the device managed by the abnormality specifying database unit 204 may be not only all devices on the network, but also some devices or only neighboring devices.

実施の形態２によれば、自装置のみならず、他装置についても異常発生部位を特定するように構成したので、他装置の異常発生部位の特定ができ、これにより自装置の異常発生部位の特定の誤りを防ぐことができる。 According to the second embodiment, since it is configured to identify the abnormality occurrence site not only in the own device but also in the other device, the abnormality occurrence site of the other device can be specified, and thereby the abnormality occurrence site of the own device can be identified. Certain errors can be prevented.

実施の形態３．
実施の形態１、実施の形態２では、各部位の各パラメータの閾値は温度、周囲温度などの外部環境などにより妥当な設定値が異なるため、設定値の決定は一定期間、各パラメータの定常状態の値をモニターし、モニターした情報を基に妥当な設定値を検討し、手動で設定する必要があり、ある程度装置を理解していないと妥当な設定値を設定できないとい
う問題がある。
そこで、実施の形態３は、閾値の設定を手動ではなく、自動で設定することにより、閾値の設定の手間を省くようにした。
なお、実施の形態３の異常部位の特定方法については、実施の形態１、２と同様のため、その説明を省略する。 Embodiment 3 FIG.
In the first embodiment and the second embodiment, since the appropriate threshold value of each parameter of each part varies depending on the external environment such as temperature and ambient temperature, the setting value is determined for a certain period of time in a steady state of each parameter. Therefore, there is a problem that it is necessary to set an appropriate setting value manually based on the monitored information, and to set the setting value manually unless the device is understood to some extent.
Therefore, in the third embodiment, the setting of the threshold value is automatically set instead of manually, so that the trouble of setting the threshold value is saved.
Note that the method for identifying an abnormal site according to the third embodiment is the same as in the first and second embodiments, and a description thereof will be omitted.

図１５は、この発明の実施の形態３による状態監視装置のあるパラメータの発生回数の閾値の自動算出を示す概念図である。
図１５において、縦軸に発生数、横軸に時間を取り、あるパラメータをグラフに表示している。図１５では、あるパラメータについて、状態確認タイミングでのグラフの確認ポイントがいずれも閾値以下になっている。 FIG. 15 is a conceptual diagram showing automatic calculation of a threshold value for the number of occurrences of a certain parameter in the state monitoring apparatus according to Embodiment 3 of the present invention.
In FIG. 15, the vertical axis represents the number of occurrences and the horizontal axis represents time, and certain parameters are displayed in a graph. In FIG. 15, the confirmation points of the graph at the state confirmation timing for a certain parameter are all equal to or less than the threshold value.

図１６は、この発明の実施の形態３による状態監視装置の設定した閾値を超えた場合の異常検出を示す概念図である。
図１６において、縦軸に発生数、横軸に時間を取り、あるパラメータをグラフに表示している。あるパラメータが、ある状態確認タイミングで過渡状態になった場合を示している。 FIG. 16 is a conceptual diagram showing abnormality detection when the threshold value set by the state monitoring apparatus according to Embodiment 3 of the present invention is exceeded.
In FIG. 16, the vertical axis represents the number of occurrences and the horizontal axis represents time, and certain parameters are displayed in a graph. A case is shown in which a certain parameter becomes a transient state at a certain state confirmation timing.

図１５は、実施の形態３において、あるパラメータの発生回数の閾値を自動で算出するための概念図である。各装置の異常箇所検出部１０２にて、内部状態のパラメータの発生頻度を定期的に監視し、監視した発生頻度の平均値と、３σ（σ：標準偏差）を自動算出し、算出した３σを閾値として、閾値格納部１０５、２０５に設定する。 FIG. 15 is a conceptual diagram for automatically calculating the threshold of the number of occurrences of a certain parameter in the third embodiment. The abnormal part detection unit 102 of each device periodically monitors the frequency of occurrence of internal state parameters, automatically calculates the average value of the monitored occurrence frequency and 3σ (σ: standard deviation), and calculates the calculated 3σ. As threshold values, threshold storage units 105 and 205 are set.

ここで、図１６において、設定した閾値を超えた場合、異常として異常箇所検出部１０２にて検出可能となる。 Here, in FIG. 16, when the set threshold value is exceeded, the abnormal point detection unit 102 can detect the abnormality.

実施の形態３によれば、自動で閾値格納部の閾値の設定が可能となり、閾値の設定の手間を省くことが可能となる。 According to the third embodiment, it is possible to automatically set the threshold value of the threshold value storage unit, and it is possible to save the labor of setting the threshold value.

なお、本発明は、その発明の範囲内において、各実施の形態を自由に組み合わせたり、各実施の形態を適宜、変形、省略することが可能である。 It should be noted that the present invention can be freely combined with each other within the scope of the invention, and each embodiment can be appropriately modified or omitted.

１、２、３装置、１ａ、２ａ、３ａ内部状態通知フレーム、１ｂ装置１内部状態、２ｂ装置２内部状態、３ｂ装置３内部状態、４、６ネットワーク、５識別子、１０、２０ロジック部、１１、１２、２１、２２電気Ｉ／Ｆ部、１１ａ、１２ａ、２１ａ、２２ａ光／電気変換部、１３、１４、２３、２４ＰＨＹ部、１５、２５Ｌ２スイッチ部、１６、２６Ｌ３スイッチ部、
１７、１８、１９、２７、２８、２９発振器部、１００状態処理部、１０１、２０１状態監視データベース部、１０２異常箇所検出部、１０３表示部／設定部、１０４、２０４、３０４異常特定用データベース部、１０５、２０５閾値格納部、１０６受信部、１０７送信部、１０８自装置内部状態収集部、１０９他装置内部状態収集部、１１０状態通知部、
３０１、３０２、３０３光スイッチ部。 1, 2, 3 devices, 1a, 2a, 3a internal status notification frame, 1b device 1 internal status, 2b device 2 internal status, 3b device 3 internal status, 4, 6 network, 5 identifier, 10, 20 logic unit, 11 12, 21, 22 Electrical I / F section, 11a, 12a, 21a, 22a Light / electrical conversion section, 13, 14, 23, 24 PHY section, 15, 25 L2 switch section, 16, 26 L3 switch section,
17, 18, 19, 27, 28, 29 Oscillator unit, 100 state processing unit, 101, 201 state monitoring database unit, 102 abnormal part detection unit, 103 display unit / setting unit, 104, 204, 304 abnormality specifying database unit , 105, 205 threshold storage unit, 106 receiving unit, 107 transmitting unit, 108 own device internal state collecting unit, 109 other device internal state collecting unit, 110 state notifying unit,
301, 302, 303 Optical switch part.

Claims

An internal state collection unit that collects the state of each part constituting the self-monitoring device,
A state monitoring database in which the state of each part collected by the internal state collection unit is stored;
A database for abnormality identification in which criteria for determining an abnormality for each part are defined in advance,
And an abnormal point detection unit that compares the state of each part stored in the state monitoring database with the reference of the corresponding part of the abnormality specifying database and determines whether the corresponding part is abnormal. A state monitoring device.

2. The state monitoring apparatus according to claim 1, wherein a minor abnormality and a severe abnormality are defined as the abnormality in the abnormality specifying database.

The state monitoring apparatus according to claim 1 or 2, wherein the abnormality specifying database is configured to specify an abnormal part that is a cause even when a plurality of parts show abnormality.

The state notification part which notifies the state of each said part collected by the said internal state collection part to another state monitoring apparatus via a network is provided, The one of Claims 1-3 characterized by the above-mentioned. The state monitoring device described in the item.

Other device internal state collection unit that collects the state of each part of other state monitoring device via a network,
The status monitoring database stores the status of each part of the other status monitoring device collected by the internal status collection unit of the other device. The state monitoring device described in the item.

The abnormality specifying database defines in advance a standard for determining an abnormality for each part of the other state monitoring device,
The abnormal part detection unit compares the state of each part of the other state monitoring device stored in the state monitoring database with the reference of the corresponding part of the abnormality specifying database, and determines whether the part is abnormal. The state monitoring apparatus according to claim 5, wherein:

7. An abnormality threshold database storing threshold values for creating a reference for the corresponding part of the abnormality specifying database when the state of each part is numerical data. The state monitoring device according to any one of the above.

The abnormal part detection unit periodically monitors the state of each part in the state monitoring database, automatically calculates the threshold value from the occurrence frequency of the normal state for each part, and stores the threshold value in the abnormal threshold database. The state monitoring apparatus according to claim 7.