JP3523979B2

JP3523979B2 - Fault tracing method

Info

Publication number: JP3523979B2
Application number: JP06879497A
Authority: JP
Inventors: 康博田中; 春人添田
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1997-03-21
Filing date: 1997-03-21
Publication date: 2004-04-26
Anticipated expiration: 2017-03-21
Also published as: JPH10271113A

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】この発明は、システム内で発
生した障害の原因解析を行うため、障害情報をトレース
する障害トレース方法に関するものであり、特に、ＡＴ
Ｍインタフェースを備えた複数の計算機、ルータ、ＣＬ
ＡＤ（セル組立分解装置）等のＡＴＭ端末が、ＵＮＩ
（ユーザネットワークインタフェース）を介してＡＴＭ
（非同期転送モード）網を構成するＡＴＭスイッチ装置
に接続されて、ＡＴＭを用いて相互に通信するＡＴＭ通
信システムに適用可能な障害トレース方法に関するもの
である。BACKGROUND OF THE INVENTION This invention performs cause analysis of failures in the system, which relates to the fault trace how tracing the fault information, in particular, AT
Multiple computers with M interface, router, CL
ATM terminals such as AD (cell assembling and disassembling equipment) are UNI
ATM via (user network interface)
Is connected to the ATM switch device constituting the (Asynchronous Transfer Mode) network, but about the fault trace how applicable to ATM communication system to communicate with each other using ATM.

【０００２】[0002]

【従来の技術】図１８は例えば特開昭６３−８３８４３
号公報に開示された従来の障害トレース方法を示す概念
図であり、図において、３１はトレース情報をトレース
バッファ３２に書き込むトレースバッファへの書き込み
部、３３はトレースバッファ３２が満杯になる度にその
トレースバッファ３２の内容をトレースファイル３４に
書き込むトレースファイルへの書き込み部、３５はトレ
ースファイルへの書き込み部３３だけに動作の開始・停
止を指令するトレース制御部である。３６は障害情報を
障害情報バッファ３７に書き込んだ後、この書き込みの
終了を障害種別判定部３８に通知する障害情報バッファ
への書き込み部、３８は障害情報バッファ３７に書き込
まれている最新の障害情報のうち近辺トレース部３９が
必要な障害であるかどうかを判定し、必要であれば近辺
トレース部３９を起動する障害種別判定部である。３９
はトレースバッファ３２から現時点の近辺トレース情報
を読み出して障害情報バッファ３７に書き込んだ後、こ
の書き込み終了を障害情報ファイルへの書き込み部４０
に通知する近辺トレース部である。４０は障害情報バッ
ファ３７に書き込まれている最新の障害情報と関連の近
辺トレース情報を障害情報ファイル４１に書き込む障害
情報ファイルへの書き込み部である。尚、障害情報バッ
ファへの書き込み部３６、障害情報バッファ３７、障害
種別判定部３８、近辺トレース部３９、障害情報ファイ
ルへの書き込み部４０、障害情報ファイル４１により障
害情報収集部を構成する。2. Description of the Related Art FIG. 18 shows, for example, Japanese Patent Laid-Open No. 63-83843.
FIG. 3 is a conceptual diagram showing a conventional fault tracing method disclosed in Japanese Patent Publication No. JP-A-2003-264, in which 31 is a trace buffer writing unit for writing trace information to a trace buffer 32, and 33 is a trace buffer writing unit for each trace buffer 32 when the trace buffer 32 is full. A trace file writing unit that writes the contents of the trace buffer 32 to the trace file 34, and a trace control unit 35 that instructs only the trace file writing unit 33 to start and stop the operation. Reference numeral 36 denotes a failure information buffer writing unit that notifies the failure type determination unit 38 of the completion of the writing after writing the failure information in the failure information buffer 37, and 38 denotes the latest failure information written in the failure information buffer 37. Of these, the near-side trace unit 39 determines whether or not it is a necessary fault, and is a fault type determination unit that activates the near-trace unit 39 if necessary. 39
Reads out the near trace information at the present time from the trace buffer 32 and writes it in the fault information buffer 37, and then writes this write end to the fault information file writing unit 40.
This is a trace part in the vicinity to notify the. Reference numeral 40 is a failure information file writing unit that writes the latest failure information written in the failure information buffer 37 and related trace information to the failure information file 41. The failure information buffer writing unit 36, the failure information buffer 37, the failure type determination unit 38, the neighborhood trace unit 39, the failure information file writing unit 40, and the failure information file 41 constitute a failure information collection unit.

【０００３】次に動作について説明する。先ず、トレー
スバッファへの書き込み部３１は、この方法を適用する
システム内でトレース情報の収集が必要な事象が発生す
る度に、該当のトレース情報をトレースバッファ３２に
書き込んでいく。そして、トレースバッファ３２が満杯
になる度に、トレースファイルへの書き込み部３３は、
その内容をトレースファイル３４に書き込んでいく。ト
レース制御部３５は、システム外部からトレース停止の
コマンドを受けると、トレースファイルへの書き込み部
３３だけに動作（トレースファイルへの書き込み動作）
の開始・停止を指令する。従って、トレースファイルへ
の書き込み部３３がその動作を停止してもトレースバッ
ファへの書き込み部３１の動作はそのまま続行される。
この結果、トレースバッファ３２が満杯になってもトレ
ースファイル３４への退避は行われず、最新のトレース
情報が最古のトレース情報に順次上書きされ、トレース
バッファ３２の容量分の最新のトレース情報だけがトレ
ースバッファ３２に残される状態となる。Next, the operation will be described. First, the trace buffer writing unit 31 writes the corresponding trace information into the trace buffer 32 each time an event requiring collection of the trace information occurs in the system to which this method is applied. Then, each time the trace buffer 32 becomes full, the trace file writing unit 33
The contents are written in the trace file 34. When the trace control unit 35 receives a trace stop command from outside the system, the trace control unit 35 operates only in the trace file write unit 33 (write operation in the trace file).
Command to start and stop. Therefore, even if the trace file writing unit 33 stops its operation, the operation of the trace buffer writing unit 31 is continued.
As a result, even when the trace buffer 32 is full, the trace file 34 is not saved, the latest trace information is sequentially overwritten with the oldest trace information, and only the latest trace information of the capacity of the trace buffer 32 is stored. The trace buffer 32 remains.

【０００４】これと並行して、障害情報収集部３６〜４
１によって、当該システム内で発生した所定の情報に対
し、近辺トレース情報を付した障害情報の収集が行われ
る。即ち、障害情報バッファへの書き込み部３６は、当
該システム内で障害が発生する度に、その障害コードや
発生時刻などを含む障害情報を障害情報バッファ３７に
書き込んだ後、この書き込みの終了を障害種別判定部３
８に通知する。この書き込み終了の通知を受けた障害種
別判定部３８は、障害情報バッファ３７に書き込まれて
いる最新の障害情報を読み出して、その中に含まれる障
害コードを所定の障害コードと照合することにより、近
辺トレースが必要な障害であるかどうかを判定し、必要
であれば近辺トレース部３９を起動する。起動された近
辺トレース部３９は、トレースバッファ３２から現時点
の近辺トレース情報を所定項目数分読み出して障害情報
バッファ３７に書き込んだ後、この書き込み終了を障害
情報ファイルへの書き込み部４０に通知する。この書き
込み終了の通知を受けた障害情報ファイルへの書き込み
部４０は、障害情報バッファ３７に書き込まれている最
新の障害情報と関連の近辺トレース情報を障害情報ファ
イル４１に書き込む。In parallel with this, the failure information collecting units 36-4
1, the fault information with the near trace information is collected with respect to the predetermined information generated in the system. That is, the failure information buffer writing unit 36 writes failure information including a failure code and time of occurrence to the failure information buffer 37 every time a failure occurs in the system, and then terminates the writing. Type determination unit 3
Notify 8. Upon receiving the notification of the completion of writing, the failure type determination unit 38 reads the latest failure information written in the failure information buffer 37 and compares the failure code included therein with a predetermined failure code, It is determined whether or not the near trace is a necessary fault, and if necessary, the near trace unit 39 is activated. The activated neighborhood trace unit 39 reads out a predetermined number of neighborhood trace information from the trace buffer 32 for writing in the fault information buffer 37, and then notifies the fault information file writing unit 40 of the completion of the writing. The failure information file writing unit 40 that has received the write end notification writes the latest failure information written in the failure information buffer 37 and the near trace information related thereto in the failure information file 41.

【０００５】尚、その他本願に関連する先行技術文献と
して特開平５−２９８１４４号公報がある。Incidentally, as another prior art document related to the present application, there is JP-A-5-298144.

【０００６】[0006]

【発明が解決しようとする課題】従来の障害トレース方
法は以上のように構成されているので、このようなトレ
ース情報の収集方法をＡＴＭ通信網に適用した場合、Ａ
ＴＭセル廃棄やセルスイッチバッファのオーバーフロ
ー、あるいはＵＰＣ（ＵｓａｇｅＰａｒａｍｅｔｅｒ
Ｃｏｎｔｒｏｌ）の監視機能にかかってユーザセルが
廃棄される等のネットワーク障害は、障害が起こる原因
が必ずしも障害を検出した装置にあるわけではないた
め、頻繁に繰り返される可能性が大きく、これらの障害
が発生し始めるとトレースファイルのほとんどはこれら
の障害情報で埋め尽くされ、障害発生の継続時間や最初
に発生した時の情報が消失し、ネットワーク障害の原因
解析に対して妨げとなるなどの課題があった。Since the conventional fault tracing method is configured as described above, when applying such a trace information collecting method to an ATM communication network,
TM cell discard, cell switch buffer overflow, or UPC (Usage Parameter)
A network failure such as a user cell being discarded due to the control function of the Control) is likely to be repeated frequently because the cause of the failure does not necessarily exist in the device that has detected the failure. When the error occurs, most of the trace files are filled with these failure information, and the duration of the failure and the information when it first occurred are lost, which hinders the analysis of the cause of the network failure. was there.

【０００７】この発明は上記のような課題を解決するた
めになされたもので、頻繁に発生する障害をトレースす
る場合に、障害情報のトレースに抑止をかける一方、障
害発生回数や最新または最旧の障害発生時刻などの重要
な障害情報は取りこぼさないようにトレースし、かつ継
続発生期間が長い障害に対して、トレース領域を抑える
ことができる障害トレース方法を得ることを目的とす
る。The present invention has been made to solve the above problems, and when tracing frequently occurring faults, while tracing the fault information is suppressed, the number of fault occurrences and the latest or oldest fault are traced. important failure information including failure time is traced to prevent dropped and the and splicing
Keep trace area down for long-lived failures
It is an object of the present invention to provide fault trace how the capable.

【０００８】[0008]

【課題を解決するための手段】請求項１記載の発明に係
る障害トレース方法は、システム内で発生した障害を検
出する検出ステップと、この検出ステップが検出した障
害内容をトレースするトレースステップと、上記検出ス
テップが検出した障害が繰り返し継続的に発生する場合
には、上記トレースステップがトレースする障害内容を
抑止するトレース抑止ステップとを備えた障害トレース
方法において、トレース抑止ステップが、トレースステ
ップに、最初に発生した障害は、障害内容と障害発生時
刻をトレースさせ、最初の障害が発生してから所定監視
期間内に発生した障害は、障害内容、障害発生回数、及
び上記所定監視期間内に発生した最新障害発生時刻をト
レースさせ、さらに継続して発生した障害は、上記所定
監視期間を順次ｎ倍した監視期間内に発生した障害の障
害内容、発生回数、及び最新障害発生時刻をトレースさ
せるようにしたものである。According to a first aspect of the present invention, there is provided a fault tracing method including a detecting step of detecting a fault occurring in a system, and a tracing step of tracing the fault content detected by the detecting step. when said detecting step occurs continuously repeated failure detected, the fault trace a trace suppression step of suppressing the fault contents above trace step traces
In the method, the trace suppression step is a trace step.
First, the failure that occurred first is the content of the failure and when the failure occurred.
Trace the time and monitor after the first failure
The failures that occurred during the period are as follows:
And the latest failure occurrence time within the specified monitoring period
If a race is continued and an obstacle that occurs continuously continues,
Failure of failures that occurred during the monitoring period, which is obtained by multiplying the monitoring period by n times.
Traces the contents of harm, the number of occurrences, and the latest failure occurrence time.
It was made to be able to .

【０００９】請求項２記載の発明に係る障害トレース方
法は、トレース抑止ステップが、トレースステップに、
最初に発生した障害は、障害内容と障害発生時刻をトレ
ースさせ、最初の障害が発生してから所定監視期間内に
発生した障害は、障害内容、障害発生回数、及び上記所
定監視期間内に発生した最旧障害発生時刻をトレースさ
せ、さらに継続して発生した障害は、上記所定監視期間
を順次ｎ倍した監視期間内に発生した障害の障害内容、
発生回数、最旧障害発生時刻、及び最新障害発生時刻を
トレースさせるようにしたものである。 In the fault tracing method according to the second aspect of the present invention, the trace inhibiting step includes the tracing step,
For the first failure, trace the failure description and failure occurrence time.
Within the specified monitoring period after the first failure.
The failure that occurred is the content of the failure, the number of times the failure occurred, and
The earliest failure occurrence time that occurred within the fixed monitoring period is traced.
In addition, the faults that have occurred continuously are for the specified monitoring period.
The failure content of the failure that occurred within the monitoring period that is sequentially multiplied by n,
The number of occurrences, the oldest failure occurrence time, and the latest failure occurrence time
It is designed to be traced.

【００１０】請求項３記載の発明に係る障害トレース方
法は、監視期間に上限値を設け、上記監視期間が上記上
限値に到達すると該上限値に固定するようにしたもので
ある。 In the fault tracing method according to the third aspect of the present invention , an upper limit value is set for the monitoring period, and the monitoring period is above the upper limit.
When the limit value is reached, the upper limit value is fixed.
is there.

【００１１】請求項４記載の発明に係る障害トレース方
法は、検出ステップが検出した障害が所定期間発生しな
くなった場合にトレースの抑止を解除するトレース抑止
解除ステップを備えるようにしたものである。 In the fault tracing method according to the present invention, the fault detected by the detecting step does not occur for a predetermined period.
Trace suppression that cancels the trace suppression
A release step is provided.

【００１２】[0012]

【発明の実施の形態】以下、この発明の実施例ではない
が、これに関連する参考例について説明する。参考例１．図１はＡＴＭ通信システムを示す構成図であり、図にお
いて、１１はＡＴＭ通信システムのＡＴＭスイッチ装
置、１１−１はＡＴＭスイッチ装置１１のマイクロプロ
セッサ、１１−２はＡＴＭスイッチ装置１のメモリ、１
１−３はＡＴＭスイッチ装置１１のＡＴＭスイッチカー
ド、１１−４はＡＴＭスイッチ装置１１のユーザインタ
フェースカードであり、このユーザインタフェースカー
ド１１−４はＡＴＭインタフェース（ＵＮＩ（ユーザネ
ットワークインタフェース）またはＮＮＩ（ネットワー
クネットワークインタフェース））を提供するポート１
１−４−１が複数実装されている。このように、ＡＴＭ
スイッチ装置１１は、１つのマイクロプロセッサ１１−
１、メモリ１１−２、ＡＴＭスイッチカード１１−３、
及びＡＴＭインタフェースカード１１−４から構成され
ている。BEST MODE FOR CARRYING OUT THE INVENTION The following is not an embodiment of the present invention.
However, a reference example related to this will be described. Reference example 1. FIG. 1 is a block diagram showing an ATM communication system. In the figure, 11 is an ATM switch device of the ATM communication system, 11-1 is a microprocessor of the ATM switch device 11, 11-2 is a memory of the ATM switch device 1,
Reference numeral 1-3 is an ATM switch card of the ATM switch device 11, 11-4 is a user interface card of the ATM switch device 11, and this user interface card 11-4 is an ATM interface (UNI (user network interface) or NNI (network network)). Interface)) providing port 1
A plurality of 1-4-1 are mounted. In this way, ATM
The switch device 11 includes one microprocessor 11-
1, memory 11-2, ATM switch card 11-3,
And an ATM interface card 11-4.

【００１３】マイクロプロセッサ１１−１は、メモリ１
１−２、ＡＴＭスイッチカード１１−３、及びユーザイ
ンタフェースカード１１−４の各ポート１１−４−１に
対してアクセス可能であり、ＡＴＭ通信に必要なプロト
コル処理は、マイクロプロセッサ１１−１、ＡＴＭスイ
ッチカード１１−３、及びユーザインタフェースカード
１１−４が分担する。また、マイクロプロセッサ１１−
１は、その周辺回路であるメモリ１１−２、ＡＴＭスイ
ッチカード１１−３、ユーザインタフェースカードの障
害を検出するとともに、ＡＴＭスイッチカード１１−３
及びユーザインタフェースカード１１−４から割り込み
またはポーリングで検出された障害情報をトレースす
る。 The microprocessor 11-1 has a memory 1
1-2, the ATM switch card 11-3, and each port 11-4-1 of the user interface card 11-4 are accessible, and the protocol processing required for ATM communication is performed by the microprocessor 11-1 and the ATM. The switch card 11-3 and the user interface card 11-4 are shared. Also, the microprocessor 11-
1 detects a failure of the memory 11-2, the ATM switch card 11-3, and the user interface card, which are the peripheral circuits, and the ATM switch card 11-3.
Also, the failure information detected by interruption or polling from the user interface card 11-4 is traced.

【００１４】１２はＡＴＭ通信システムの端末装置、１
２−１は端末装置１２のＣＰＵ、１２−２は端末装置１
２の主メモリ、１２−３はＡＴＭスイッチ装置１１とイ
ンタフェースするための端末装置１２のＡＴＭインタフ
ェースである。３はＡＴＭスイッチ装置１１と端末装置
１２間、及びＡＴＭスイッチ装置１１間をつなぐ、光フ
ァイバまたはツイストベアケーブルからなるデータ伝送
路である。このように、端末装置１２は、ＣＰＵ１２−
１、主メモリ１２−２、及びＡＴＭインタフェース１２
−３から構成され、この端末装置１２がＡＴＭスイッチ
装置１１にデータ伝送路３を介してＵＮＩ接続されてい
る。 Reference numeral 12 denotes a terminal device of an ATM communication system, 1
2-1 is the CPU of the terminal device 12, 12-2 is the terminal device 1
2 is a main memory, and 12-3 is an ATM interface of the terminal device 12 for interfacing with the ATM switch device 11. Reference numeral 3 is a data transmission line formed by an optical fiber or a twist bear cable, which connects the ATM switch device 11 and the terminal device 12 and the ATM switch device 11 together. Thus, the terminal device 12 has the CPU 12-
1, main memory 12-2, and ATM interface 12
-3, the terminal device 12 is UNI-connected to the ATM switch device 11 via the data transmission line 3.

【００１５】ＣＰＵ１２−１は、主メモリ１２−２及び
ＡＴＭインタフェース１２−３に対してアクセス可能で
あり、ＡＴＭ通信に必要なプロトコル処理は、このＣＰ
Ｕ１２−１及びＡＴＭインタフェース１２−３が分担す
る。また、ＣＰＵ１２−１は、その周辺回路である主メ
モリ１２−２、ＡＴＭインタフェース１２−３の障害を
検出するともに、ＡＴＭインタフェース１２−３から割
り込みまたはポーリングで検出された障害情報をトレー
スする。 The CPU 12-1 can access the main memory 12-2 and the ATM interface 12-3, and the protocol processing required for ATM communication is this CP.
U12-1 and ATM interface 12-3 share. Further, the CPU 12-1 detects a failure of the main memory 12-2 and the ATM interface 12-3 which are its peripheral circuits, and traces failure information detected by interrupt or polling from the ATM interface 12-3.

【００１６】図２はこの参考例１による障害トレース方
法を示す概念図であり、図において、１は障害トレース
情報を障害トレースバッファ２に書き込みを行うととも
に、障害トレース識別部５に障害情報を出力する障害ト
レースバッファへの書き込み部である。３は障害トレー
スバッファ２の内容を障害トレースファイル４に書き込
みを行う障害トレースファイルへの書き込み部である。
５は障害トレースバッファへの書き込み部１から出力さ
れた障害情報を識別（トレースすべき障害か否かを判
断）し、トレースすべき障害である場合には、トレース
抑止制御部６に障害情報を出力する障害トレース識別部
である。６は障害トレース識別部５から出力された障害
情報に基づいて、監視タイマ７（発生監視タイマ７ａ，
未発生監視タイマ７ｂ）を使用してトレース抑止処理を
実行するトレース抑止制御部である。８はトレース抑止
制御部６がトレースすべき障害であると判断すると、障
害の発生時刻や発生回数などの付加的な情報を障害トレ
ースバッファ２の内容に書き込む近辺トレース部であ
る。 FIG . 2 is a conceptual diagram showing a fault tracing method according to the first reference example . In FIG . 2, 1 writes fault trace information in a fault trace buffer 2 and outputs fault information to a fault trace identifying section 5. It is a writing unit to the failure trace buffer. 3 is a write-in portion of the fault trace file for writing the contents of the fault trace buffer 2 to the fault trace file 4.
Reference numeral 5 identifies the failure information output from the failure trace buffer writing unit 1 (determines whether or not the failure is to be traced), and if the failure is to be traced, the failure information is sent to the trace suppression control unit 6. This is a fault trace identification section for output. Reference numeral 6 is a monitoring timer 7 (occurrence monitoring timer 7a, based on the failure information output from the failure trace identifying unit 5).
This is a trace suppression control unit that executes the trace suppression process using the non-occurrence monitoring timer 7b). Reference numeral 8 is a near trace unit that, when the trace suppression control unit 6 determines that it is a fault to be traced, writes additional information such as the fault occurrence time and the number of occurrences to the contents of the fault trace buffer 2.

【００１７】尚、障害トレースバッファへの書き込み部
１、障害トレースファイルへの書き込み部３、障害トレ
ース識別部５、トレース抑止制御部６、及び監視タイマ
７は、ＡＴＭスイッチ装置１１のマイクロプロセッサ１
１−１、及び端末装置１２のＣＰＵ１２−１内に実現さ
れ、障害トレースバッファ２及び障害トレースファイル
４は、ＡＴＭスイッチ装置１１のメモリ１２−２、及び
端末装置１２の主メモリ１２−２により実現されるもの
である。 The fault trace buffer writing unit 1, fault trace file writing unit 3, fault trace identifying unit 5, trace suppression control unit 6, and monitoring timer 7 are used by the microprocessor 1 of the ATM switch device 11.
1-1 and the CPU 12-1 of the terminal device 12, the fault trace buffer 2 and the fault trace file 4 are realized by the memory 12-2 of the ATM switch device 11 and the main memory 12-2 of the terminal device 12. It is what is done.

【００１８】次に動作について説明する。図３はこの参
考例１による障害トレース方法の処理動作を説明するた
めのフローチャート、図４は発生監視タイマ処理動作を
説明するためのフローチャート、図５は未発生監視タイ
マ処理動作を説明するためのフローチャートである。以
下、これら図３から図５に基づいて処理の流れを説明す
る。 [0018] Next, the operation will be described. Figure 3 is this reference
FIG. 4 is a flowchart for explaining the processing operation of the fault tracing method according to Consideration 1 , FIG. 4 is a flowchart for explaining the occurrence monitoring timer processing operation, and FIG. 5 is a flowchart for explaining the non-occurrence monitoring timer processing operation. The process flow will be described below with reference to FIGS. 3 to 5.

【００１９】ＡＴＭ通信システムに障害が発生すると
（ステップＳＴ１、検出ステップ）、マイクロプロセッ
サ１１−１またはＣＰＵ１２−１は、発生した障害情報
を収集する。障害トレースバッファへの書き込み部１が
収集した障害情報を障害トレースバッファ２に書き込み
を行う（ステップＳＴ２、検出ステップ）と同時に、障
害トレース識別部５に障害情報を出力する。 When a failure occurs in the ATM communication system (step ST1, detection step), the microprocessor 11-1 or the CPU 12-1 collects the failure information that has occurred. The fault information collected by the fault trace buffer writing unit 1 is written in the fault trace buffer 2 (step ST2, detection step), and at the same time, the fault information is output to the fault trace identifying unit 5.

【００２０】障害トレース識別部５は、その障害情報を
受けると障害の種別（この種別はシステムに応じて種々
変更されるものである）からトレースすべき障害である
か否かの判定を行う（ステップＳＴ３、トレース抑止ス
テップ）。（Ａ）トレースすべき障害でない場合障害トレース識別部５がトレースすべき障害でないと判
断する場合には、何も処理を行わずに処理を終了する。 The fault trace identification unit 5, it is determined whether the failure to be traced from the type of disorder when receiving the fault information (in which those variously changed according to the type system) ( Step ST3, trace inhibiting step). (A) When it is not a failure to be traced When the failure trace identifying unit 5 determines that it is not a failure to be traced, the processing is terminated without performing any processing.

【００２１】（Ｂ）トレースすべき障害である場合障害トレース識別部５がトレースすべき障害であると判
断する場合には、障害トレース識別部５はトレース抑止
制御部６に障害情報を出力する。トレース抑止制御部６
は、障害トレース識別部５から出力された障害情報を受
け取ると、その障害がトレース抑止中か否かを判断する
（ステップＳＴ４、トレース抑止ステップ）。 [0021] When the (B) If a fault should be traced fault trace identification unit 5 is determined to be failure to be traced, the fault trace identifying unit 5 outputs the failure information to the trace inhibit control unit 6. Trace suppression control unit 6
Upon receiving the fault information output from the fault trace identifying section 5, the device determines whether or not the fault is under trace inhibition (step ST4, trace inhibition step).

【００２２】（ａ）トレース抑止中でない場合トレース抑止中でない場合（即ち、障害が初回発生の場
合）には、近辺トレース部８は、障害発生カウントを１
とし（ステップＳＴ５、トレース抑止ステップ）、また
障害発生時刻を現在時刻に設定して（ステップＳＴ６、
トレース抑止ステップ）、障害の発生回数及び発生時刻
を障害トレースバッファ２の内容に書き込み、障害トレ
ースファイルへの書き込み部３が、障害情報とともに障
害の発生回数及び発生時刻が書き込まれた障害トレース
バッファ２の内容を障害トレースファイル４に書き込む
（ステップＳＴ７、トレースステップ）。そして、発生
監視タイマ７ａの処理を起動（開始）する（ステップＳ
Ｔ８、トレース抑止ステップ）。 [0022] If the case is not in (a) trace deterrence is not in trace suppression (that is, if the failure of the first generation), the vicinity of the trace unit 8, the failure count 1
(Step ST5, trace suppression step), and the failure occurrence time is set to the current time (step ST6,
Trace suppression step), the occurrence frequency and the occurrence time of the failure are written in the contents of the error trace buffer 2, and the error trace file writing unit 3 writes the failure occurrence number and the occurrence time together with the failure information. Is written in the failure trace file 4 (step ST7, trace step). Then, the process of the occurrence monitoring timer 7a is activated (started) (step S
T8, trace suppression step).

【００２３】発生監視タイマ７ａの処理は、図４に示す
ように行われる。先ず、発生監視タイマ７ａの処理が開
始すると、監視タイマ７による起動か否かを判断する
（ステップＳＴ１０１、トレース抑止ステップ）。監視
タイマ７による起動でない場合（即ち、トレース抑止中
でない場合における発生監視タイマ７ａ処理の直接起動
である場合）、監視タイマ７をトレース抑止中（トレー
ス抑止状態）にセットし（ステップＳＴ１０２、トレー
ス抑止ステップ）、障害発生カウントを０にリセットし
（ステップＳＴ１０６、トレース抑止ステップ）、発生
監視タイマ７ａの初期値をＴ１として（ステップＳＴ１
０７、トレース抑止ステップ）発生監視タイマ７ａを起
動する（ステップＳＴ１０８、トレース抑止ステッ
プ）。 The process of generating the monitoring timer 7a is performed as shown in FIG. First, when the process of the occurrence monitoring timer 7a is started, it is determined whether or not the monitoring timer 7 is activated (step ST101, trace inhibiting step). If it is not activated by the monitoring timer 7 (that is, if the occurrence monitoring timer 7a processing is directly activated when the trace suppression is not being performed), the monitoring timer 7 is set to the trace suppression (trace suppression state) (step ST102, trace suppression). Step), the failure occurrence count is reset to 0 (step ST106, trace inhibiting step), and the initial value of the occurrence monitoring timer 7a is set to T1 (step ST1).
07, trace inhibiting step) The occurrence monitoring timer 7a is started (step ST108, trace inhibiting step).

【００２４】その後、障害が所定時間（Ｔ２）発生しな
い場合にトレース抑止制御部６のトレース抑止状態を解
除するため、未発生監視タイマ７ｂの初期値にＴ２を設
定し（ステップＳＴ９、トレース抑止解除ステップ）、
未発生監視タイマ７ｂを起動する（ステップＳＴ１０、
トレース抑止解除ステップ）。 [0024] Thereafter, since a failure to release the trace inhibited state of the trace inhibit control unit 6 when a predetermined time (T2) does not occur, set T2 to the initial value of the non-occurrence monitoring timer 7b (step ST9, trace inhibit release Step),
The non-occurrence monitoring timer 7b is activated (step ST10,
Trace suppression release step).

【００２５】未発生監視タイマ７ｂの処理は、図５に示
すように行われる。未発生監視タイマ７ｂの未発生監視
期間が満了すると、該当する障害の発生監視タイマ７ａ
を停止するとともに（ステップＳＴ２０１、トレース抑
止解除ステップ）、該当する障害の未発生監視タイマ７
ｂを停止し（ステップＳＴ２０２、トレース抑止解除ス
テップ）、該当する障害のトレース抑止中をリセットす
る（ステップＳＴ２０３、トレース抑止解除ステッ
プ）。 The processing of the non-occurrence monitoring timer 7b is performed as shown in FIG. When the non-occurrence monitoring period of the non-occurrence monitoring timer 7b expires, the corresponding fault occurrence monitoring timer 7a
Stop (step ST201, trace suppression release step), and the non-occurrence monitoring timer 7 for the corresponding fault.
b is stopped (step ST202, trace suppression cancellation step), and the trace suppression in progress of the corresponding fault is reset (step ST203, trace suppression cancellation step).

【００２６】（ｂ）トレース抑止中である場合トレース抑止中である場合には、既に発生監視タイマ７
ａの処理が起動している状態であり、直接トレース処理
は行わずに、障害発生カウントを更新する。即ち、新た
な障害発生回数分だけ障害発生カウントを加算し（ステ
ップＳＴ１１、トレース抑止ステップ）、障害発生時刻
に現在時刻を設定するのみとする（ステップＳＴ１２、
トレース抑止ステップ）。 [0026] (b) in the case when it is being traced deterrence is in trace deterrence, already occurrence monitoring timer 7
Since the process of a is activated, the failure occurrence count is updated without directly performing the trace process. That is, the failure occurrence count is incremented by the new failure occurrence count (step ST11, trace suppression step), and the current time is set as the failure occurrence time (step ST12,
Trace suppression step).

【００２７】このトレース抑止中である場合における発
生監視タイマ７ａの処理は、図４に示すように行われ
る。先ず、監視タイマ７による起動か否かを判断し（ス
テップ１０１）、（この場合、監視タイマ７による起動
であるので）監視タイマ７による起動であると判断され
ると、次に障害発生カウントが０か否かを判定する（ス
テップＳＴ１０３、トレース抑止ステップ）。障害発生
カウントが０である場合は、トレース処理は行わずに、
新たな発生監視期間を開始するために発生監視タイマ７
ａの初期値をＴ１として（ステップＳＴ１０７）発生監
視タイマ７ａを起動する（ステップＳＴ１０８）。障害
発生カウントが０である場合は、障害コード、障害発生
カウント、障害発生時刻からトレース内容（障害トレー
スバッファの内容）を組み立て（ステップＳＴ１０４、
トレース抑止ステップ）、障害トレースファイルへの書
き込み部３により障害トレースファイル４へ書き込みが
実行される（ステップＳＴ１０５、トレースステッ
プ）。 The process of generating the monitoring timer 7a in the case is in the trace suppression is performed as shown in FIG. First, it is determined whether or not the monitoring timer 7 is activated (step 101). If it is determined that the monitoring timer 7 is activated (in this case, the monitoring timer 7 is activated), then the failure occurrence count is determined. It is determined whether or not it is 0 (step ST103, trace suppression step). If the failure occurrence count is 0, trace processing is not performed and
Occurrence monitoring timer 7 to start a new occurrence monitoring period
The initial value of a is set to T1 (step ST107), and the generation monitoring timer 7a is started (step ST108). If the failure occurrence count is 0, the trace contents (contents of the failure trace buffer) are assembled from the failure code, the failure occurrence count, and the failure occurrence time (step ST104,
(Trace suppressing step), and writing to the failure trace file 4 is executed by the failure trace file writing unit 3 (step ST105, trace step).

【００２８】尚、この参考例１では、障害発生時刻は障
害発生毎に毎回現在時刻で更新されるので、トレースさ
れる障害発生時刻は最新の障害発生時刻である。図６は
トレース抑止中に障害発生回数及び障害発生時刻をトレ
ースする場合を説明するための図である。この場合、障
害発生回数（発生監視タイマ７ａの発生監視期間Ｔ１中
に発生した障害の回数）は５回、トレース発生時刻は最
新の障害発生時刻ｔ５が記録（トレース）される。 In the reference example 1 , since the failure occurrence time is updated with the current time every time a failure occurs, the traced failure occurrence time is the latest failure occurrence time. FIG. 6 is a diagram for explaining a case where the number of times of failure occurrence and the time of failure occurrence are traced during trace suppression. In this case, the number of times of failure occurrence (the number of failures that occurred during the occurrence monitoring period T1 of the occurrence monitoring timer 7a) is 5, and the latest failure occurrence time t5 is recorded (traced) as the trace occurrence time.

【００２９】その後、障害発生カウントを０にリセット
し（ステップＳＴ１０６、トレース抑止ステップ）、発
生監視タイマ７ａの初期値をＴ１として（ステップＳＴ
１０７）発生監視タイマ７ａを起動する（ステップＳＴ
１０８）。 [0029] Then, reset the fault count to zero (step ST 106, trace suppression step), the initial value of the occurrence monitoring timer 7a as T1 (step ST
107) Starts the occurrence monitoring timer 7a (step ST
108).

【００３０】その後、トレース抑止中でない場合と同様
に、未発生監視タイマ７ｂの初期値にＴ２を設定し（ス
テップＳＴ９）、未発生監視タイマ７ｂを起動する（ス
テップＳＴ１０）。 [0030] Thereafter, similarly to the case not being traced deterrence, set T2 to the initial value of the non-occurrence monitoring timer 7b (step ST9), activates the non-occurrence monitoring timer 7b (step ST10).

【００３１】図７はこの参考例１による障害トレース方
法によりトレースする一例を示す図である。図７におい
て、発生監視タイマ７ａの発生監視期間をＴ１とし、未
発生監視タイマ７ｂの未発生監視期間をＴ２としてい
る。最初の障害は無条件にトレースされているが、次の
発生監視期間Ｔ１に発生した障害２件はまとめてトレー
スされ、同じようなトレース抑止が３回継続している。
その後、しばらく障害が発生していないが、未発生監視
タイマの未発生監視期間Ｔ２が経過するまでに障害が発
生しているのでトレース抑止が継続する。しかし、２回
目の発生監視期間Ｔ２で障害が発生していないのでトレ
ース抑止状態は解除され、その後の最初の障害で無条件
にトレースが行われる。 FIG . 7 is a diagram showing an example of tracing by the fault tracing method according to the first reference example . In FIG. 7, the occurrence monitoring period of the occurrence monitoring timer 7a is T1, and the non-occurrence monitoring period of the non-occurrence monitoring timer 7b is T2. The first failure is unconditionally traced, but the two failures that occurred in the next occurrence monitoring period T1 are collectively traced, and similar trace suppression continues three times.
After that, although no failure has occurred for a while, the failure has occurred before the non-occurrence monitoring period T2 of the non-occurrence monitoring timer elapses, so the trace suppression continues. However, since no fault has occurred in the second occurrence monitoring period T2, the trace inhibition state is released, and the trace is unconditionally performed at the first fault thereafter.

【００３２】以上のように、この参考例１によれば、ト
レース抑止制御部６が監視タイマ７を制御して、発生し
た障害の障害情報の全てをトレースせずに、発生した障
害の障害コード、発生回数、発生時刻をトレースするよ
うにしたので、頻繁に発生する障害に対してもトレース
領域をこれで埋め尽くされることなく、発生傾向がつか
め、障害発生原因の究明（または故障解析）を容易に行
うことができる。 As described above, according to the first reference example , the trace suppression control unit 6 controls the monitoring timer 7 to trace all the fault information of the fault that has occurred, and the fault code of the fault that has occurred. Since the number of occurrences and the time of occurrence are traced, even for frequently occurring faults, the trace area is not filled with this, the tendency of occurrence can be grasped, and the cause of fault occurrence (or failure analysis) can be investigated. It can be done easily.

【００３３】参考例２．上記参考例１では、トレース抑止期間中に発生した障害
のうち、最新の障害発生時刻をトレースしていたが、こ
の参考例２では、トレース抑止期間中に発生した障害の
うち、最も古い障害の発生時刻をトレースするものであ
る。図８はそのような参考例２による障害トレース方法
の処理動作を説明するためのフローチャートである。
尚、障害トレース方法を実現するための構成は、上記参
考例１で説明した図２と同様であるため、その説明は省
略する。 Reference Example 2. In Reference Example 1 above, the latest failure occurrence time was traced among the failures that occurred during the trace suppression period, but in Reference Example 2 , among the failures that occurred during the trace suppression period, the oldest failure occurred. The time of occurrence is traced. FIG. 8 is a flow chart for explaining the processing operation of the fault tracing method according to the second reference example .
Note that the configuration for realizing the fault trace method, the ginseng
Since it is similar to FIG. 2 described in Consideration 1 , the description thereof is omitted.

【００３４】上記参考例１における障害情報発生（ステ
ップＳＴ１）、障害トレースバッファへの書き込み（ス
テップＳＴ２）、トレースすべき障害か否か判定（ステ
ップＳＴ３）（Ａ）トレースすべき障害でない場合処理を終了（Ｂ）トレースすべき障害である場合障害がトレース抑止中か否かを判断（ステップＳＴ４）（ａ）トレース抑止中でない場合障害発生カウントを１（ステップＳＴ５）、障害発生時
刻を現在時刻に設定（ステップＳＴ６）、障害トレース
ファイル４に書き込み（ステップＳＴ７）、発生監視タ
イマ７ａ処理を起動（ステップＳＴ８）、未発生監視タ
イマ７ｂの初期値にＴ２を設定（ステップＳＴ９）、未
発生監視タイマ７ｂを起動（ステップＳＴ１０）の処理
（処理ステップ）は、図８に示すこの参考例２の処理動
作でも同様である。 [0034] Reference Example fault information generated in 1 (step ST1), writing to the fault trace buffer (step ST2), the determination whether or not a failure to be traced (step ST3) (A) is not a fault to be traced when the processing End (B) If it is a fault to be traced, it is judged whether or not the fault is being trace-suppressed (step ST4). (A) If trace is not being suppressed, the fault occurrence count is 1 (step ST5), and the fault occurrence time is set to the current time. Setting (step ST6), writing to the fault trace file 4 (step ST7), starting processing of the occurrence monitoring timer 7a (step ST8), setting T2 to the initial value of the non-occurrence monitoring timer 7b (step ST9), non-occurrence monitoring timer 7b startup processing (step ST10) (process steps), of the reference example 2 shown in FIG. 8 The same applies to the physical behavior.

【００３５】（ｂ）トレース抑止中である場合トレース抑止中である場合には、直接トレース処理は行
わずに、障害発生カウントを更新する。即ち、新たな障
害発生回数分だけ障害発生カウントを加算し（ステップ
ＳＴ１１）、そして、障害発生カウントが１か否かを判
断し（ステップＳＴ１２Ａ、トレース抑止ステップ）、
障害発生時刻には障害発生カウントが１の場合のみ現在
時刻を設定する（ステップＳＴ１３Ａ、トレース抑止ス
テップ）。 [0035] (b) in the case when it is being traced deterrence is being traced suppression, without direct tracing, updates the failure count. That is, the failure occurrence count is added by the new failure occurrence count (step ST11), and it is determined whether or not the failure occurrence count is 1 (step ST12A, trace inhibiting step).
The current time is set to the failure occurrence time only when the failure occurrence count is 1 (step ST13A, trace inhibiting step).

【００３６】このように、この参考例２では、トレース
される障害発生時刻は最旧の障害発生時刻である。図９
はトレース抑止中に障害発生回数及び障害発生時刻をト
レースする場合を説明するための図である。この場合、
障害発生回数（発生監視タイマ７ａの発生監視期間Ｔ１
中に発生した障害の回数）は５回、トレース発生時刻は
最旧の障害発生時刻ｔ１が記録（トレース）される。 [0036] In this manner, in this Reference Example 2, failure time to be traced is a failure time of the oldest. Figure 9
FIG. 9 is a diagram for explaining a case where the number of times of failure occurrence and the time of failure occurrence are traced during trace suppression. in this case,
Failure occurrence frequency (occurrence monitoring period T1 of the occurrence monitoring timer 7a
The number of failures that occurred during the recording is 5 times, and the oldest failure occurrence time t1 is recorded (traced) as the trace occurrence time.

【００３７】その後の処理未発生監視タイマ７ｂの初期
値にＴ２を設定（ステップＳＴ９）、未発生監視タイマ
７ｂを起動（ステップＳＴ１０）も上記参考例１と同様
である。 The subsequent processing sets the T2 to the initial value of the non-occurrence monitoring timer 7b (step ST9), start the non-occurrence monitoring timer 7b (step ST10) is the same as in Reference Example 1.

【００３８】以上のように、この参考例２によれば、最
初の発生障害を無条件にトレースすることができるの
で、最初の障害の発生時刻から次の障害の発生時刻まで
の時間間隔が重要な障害に対して、高い障害解析性を発
揮することができる。 As described above, according to the second reference example , since the first fault occurring can be unconditionally traced, the time interval from the time when the first fault occurs to the time when the next fault occurs is important. It is possible to exhibit high failure analysis performance for various failures.

【００３９】実施の形態１．次にこの発明の実施の形態について説明する。上記参考
例１では、発生監視タイマ７ａの処理において、無条件
に発生監視タイマ７ａの初期値をＴ１として発生監視タ
イマ７ａを起動していたが、この発明の実施の形態１で
は、トレース抑止が継続する限り、発生監視タイマの初
期値を前回の発生監視タイマの初期値の順次ｎ（実数）
倍するものである。 Embodiment 1. Next, an embodiment of the present invention will be described. Above reference
In the example 1 , in the process of the occurrence monitoring timer 7a, the occurrence monitoring timer 7a is unconditionally started with the initial value of the occurrence monitoring timer 7a set to T1, but in the first embodiment of the present invention , the trace suppression continues. As long as the initial value of the occurrence monitoring timer is the same as the initial value of the previous occurrence monitoring timer, n (real number)
It doubles.

【００４０】図１０はそのようなこの発明の実施の形態
１による発生監視タイマの処理を説明するためのフロー
チャートである。この処理ステップのうち監視タイマ７
による起動か否かを判断（ステップＳＴ１０１）、監視
タイマ７による起動でない場合の監視タイマ７をトレー
ス抑止中にセット（ステップＳＴ１０２）、障害発生カ
ウントが０か否かの判定（ステップＳＴ１０３）、障害
発生カウントが０である場合のトレース内容の組み立て
（ステップＳＴ１０４）、障害トレースファイルへの書
き込み（ステップＳＴ１０５）、障害発生カウントを０
にリセット（ステップＳＴ１０６）は、図４に示した上
記参考例１に係る発生監視タイマ７ａの処理と同様であ
る。 [0040] Figure 10 is an embodiment of the such the present invention
6 is a flowchart for explaining the processing of the occurrence monitoring timer according to No. 1 ; Monitoring timer 7 in this processing step
(Step ST101), if the monitoring timer 7 is not activated, the monitoring timer 7 is set during trace suppression (step ST102), and it is determined whether the failure occurrence count is 0 (step ST103). Assembling the trace contents when the occurrence count is 0 (step ST104), writing to the failure trace file (step ST105), and setting the failure occurrence count to 0
The resetting (step ST106) is similar to the process of the occurrence monitoring timer 7a according to the first reference example shown in FIG.

【００４１】この実施の形態１では、障害発生カウント
を０にリセットした後（ステップＳＴ１０６）、発生監
視タイマ７ａの初期値を前回の発生監視タイマの初期値
のｎ倍に設定して（ステップＳＴ１０７Ｂ、トレース抑
止ステップ）、発生監視タイマ７ａを起動する（ステッ
プＳＴ１０８、トレース抑止ステップ）。 In the first embodiment , after resetting the failure occurrence count to 0 (step ST106), the initial value of the occurrence monitoring timer 7a is set to n times the initial value of the previous occurrence monitoring timer (step ST107B). , Trace inhibiting step) and the generation monitoring timer 7a is started (step ST108, trace inhibiting step).

【００４２】図１１はこの発明の実施の形態１による障
害トレース方法によりトレースする一例を示す図であ
る。図１１に示すように、障害がトレース抑止期間中に
発生している限り、発生監視タイマ７ａの発生監視期間
がｎ倍されていく。一定時間障害が発生しない場合はト
レース抑止状態を解除するようになる。 [0042] Figure 11 is a diagram showing an example of tracing the fault trace method according to the first embodiment of the present invention. As shown in FIG. 11, as long as the failure occurs during the trace suppression period, the occurrence monitoring period of the occurrence monitoring timer 7a is multiplied by n. If no failure occurs for a certain period of time, the trace suppression status will be released.

【００４３】以上のように、この実施の形態１によれ
ば、トレース抑止期間を指数関数的に増加させることが
できるので、一旦発生すると継続発生期間が長い障害に
対して、トレース領域を更に抑えることができる。 As described above, according to the first embodiment, since the trace inhibition period can be exponentially increased, the trace area is further suppressed for a fault having a long continuous generation period once it occurs. be able to.

【００４４】実施の形態２．この発明の実施の形態２は、トレース抑止期間中に発生
した障害のうち最も古い障害発生時刻と最も新しい障害
発生時刻を書き込む（トレース）点で上記実施の形態１
とは異なるものである。図１２はそのようなこの発明の
実施の形態２による障害トレース方法の処理動作を説明
するためのフローチャートである。尚、上記参考例１及
び実施の形態１で説明した処理動作と重複する処理動作
については、その説明を省略する。 Embodiment 2. Second embodiment of the invention, the above-described embodiment in writing newest failure time the oldest fault occurrence time of the failure during the trace prohibition period (trace) point 1
Is different from. FIG. 12 shows such an invention.
9 is a flowchart for explaining the processing operation of the fault tracing method according to the second embodiment . The description of the processing operations that overlap with the processing operations described in the first reference example and the first embodiment will be omitted.

【００４５】（Ｂ）トレースすべき障害である場合（ａ）トレース抑止中でない場合トレース抑止中でない場合には、近辺トレース部８は、
障害発生カウントを１とし（ステップＳＴ５）、また最
旧障害発生時刻と最新障害発生時刻に現在時刻を設定し
て（ステップＳＴ６Ｃ、トレース抑止ステップ）、障害
の発生回数及び発生時刻を障害トレースバッファ２の内
容に書き込み、障害トレースファイルへの書き込み部３
が、障害情報とともに障害の発生回数及び発生時刻が書
き込まれた障害トレースバッファ２の内容を障害トレー
スファイル４に書き込む（ステップＳＴ７）。次に、発
生監視タイマ７ａの処理を起動（開始）する（ステップ
ＳＴ８）。その後、障害が所定時間（Ｔ２）発生しない
場合にトレース抑止制御部６のトレース抑止状態を解除
するため、未発生監視タイマ７ｂの初期値にＴ２を設定
し（ステップＳＴ９）、未発生監視タイマ７ｂを起動す
る（ステップＳＴ１０）。 [0045] (B) when the case if a fault should be traced (a) not being traced deterrence not being traced suppression, Near trace section 8,
The failure occurrence count is set to 1 (step ST5), the current time is set to the oldest failure occurrence time and the latest failure occurrence time (step ST6C, trace suppression step), and the failure occurrence count and occurrence time are set to the failure trace buffer 2 Writing to the contents of and writing to fault trace file 3
However, the contents of the fault trace buffer 2 in which the number of times the fault has occurred and the time when the fault occurred are written to the fault trace file 4 together with the fault information (step ST7). Next, the process of the occurrence monitoring timer 7a is activated (started) (step ST8). After that, in order to cancel the trace suppression state of the trace suppression control unit 6 when the failure does not occur for a predetermined time (T2), T2 is set to the initial value of the non-occurrence monitoring timer 7b (step ST9), and the non-occurrence monitoring timer 7b is set. Is activated (step ST10).

【００４６】（ｂ）トレース抑止中である場合トレース抑止中である場合には、障害発生カウントを更
新した後、即ち、新たな障害発生回数分だけ障害発生カ
ウントを加算（ステップＳＴ１１）した後、最新障害発
生時刻に現在時刻を設定し（ステップＳＴ１２Ｃ、トレ
ース抑止ステップ）、最旧障害発生時刻には障害発生カ
ウントが１の場合のみ（ステップＳＴ１３Ｃ、トレース
抑止ステップ）現在時刻に設定する（ステップＳＴ１４
Ｃ、トレース抑止ステップ）。その後、トレース抑止中
でない場合と同様に、未発生監視タイマ７ｂの初期値に
Ｔ２を設定し（ステップＳＴ９）、未発生監視タイマ７
ｂを起動する（ステップＳＴ１０）。 [0046] (b) in the case when it is being traced deterrence is being traced suppression, after updating the failure count, i.e., after only a new failure number of times adding the failure count (step ST11), The current time is set to the latest failure occurrence time (step ST12C, trace suppression step), and the current time is set only to the oldest failure occurrence time when the failure occurrence count is 1 (step ST13C, trace suppression step) (step ST14).
C, trace suppression step). Thereafter, as in the case where the trace suppression is not being performed, T2 is set to the initial value of the non-occurrence monitoring timer 7b (step ST9), and the non-occurrence monitoring timer 7 is set.
b is started (step ST10).

【００４７】以上のように、この実施の形態２によれ
ば、上記実施の形態１において、トレース抑止期間中に
発生した障害のうち最旧障害発生時刻と最新障害発生時
刻をトレースするようにしたので、上記実施の形態１に
おける長いトレース抑止期間中で、障害が平均的に発生
しているのか、バースト的に発生しているかの情報を得
ることができ、継続発生期間が長い障害に対してもトレ
ース領域を抑えることができるとともに、高い障害解析
性を発揮することができる。 As described above, according to the second embodiment , the oldest failure occurrence time and the latest failure occurrence time among the failures that occurred during the trace inhibition period in the first embodiment are traced. Therefore, during the long trace suppression period in the above-described first embodiment, it is possible to obtain information as to whether the faults are occurring on average or in bursts, and for a fault with a long continuous occurrence period. The trace area can be suppressed and high failure analysis performance can be exhibited.

【００４８】実施の形態３．この発明の実施の形態３は、発生監視タイマ７ａの初期
値に上限値を設け、この上限値に達すると発生監視タイ
マ７ａの初期値を上限値に固定する点で上記実施の形態
１及び上記実施の形態２とは異なるものである。図１３
はそのようなこの発明の実施の形態３による発生監視タ
イマの処理を説明するためのフローチャートである。
尚、上記参考例１、実施の形態１、及び上記実施の形態
２で説明した処理動作と重複する処理動作については、
その説明を省略する。 Embodiment 3. Third embodiment of the invention, an upper limit is provided to the initial value of the occurrence monitoring timer 7a, the occurrence monitoring timer 7a initial value at the point of fixing to the upper limit value of the above embodiment reaches the upper limit value
The second embodiment is different from the first embodiment and the second embodiment . FIG.
Is a flow chart for explaining the process of the occurrence monitoring timer according to the third embodiment of the present invention.
In addition, the reference example 1 , the first embodiment , and the above embodiment
Regarding the processing operation that overlaps with the processing operation described in 2 ,
The description is omitted.

【００４９】この実施の形態３は、障害発生カウントを
０にリセットした後（ステップＳＴ１０６）、発生監視
タイマ７ａの発生監視期間が上限値に到達したか否かを
判断し（ステップＳＴ１０７Ｄ、トレース抑止ステッ
プ）、上限値に到達していない場合には、発生監視タイ
マ７ａの初期値に前回の発生監視タイマの初期値のｎ倍
を設定して（ステップＳＴ１０８Ｄ）、発生監視タイマ
７ａを起動し（ステップＳＴ１１０Ｄ）、一方、上限値
に到達している場合には、発生監視タイマ７ａの初期値
に上限値を再設定して（ステップＳＴ１０９Ｄ、トレー
ス抑止ステップ）、発生監視タイマ７ａを起動する（ス
テップＳＴ１１０Ｄ）。 In the third embodiment, after resetting the failure occurrence count to 0 (step ST106), it is judged whether the occurrence monitoring period of the occurrence monitoring timer 7a has reached the upper limit value (step ST107D, trace inhibition). If the upper limit has not been reached, the initial value of the occurrence monitoring timer 7a is set to n times the initial value of the previous occurrence monitoring timer (step ST108D), and the occurrence monitoring timer 7a is activated ( On the other hand, if the upper limit value is reached, the upper limit value is reset to the initial value of the occurrence monitoring timer 7a (step ST109D, trace suppression step), and the occurrence monitoring timer 7a is activated (step ST110D). ST110D).

【００５０】以上のように、この実施の形態３によれ
ば、上記実施の形態１及び上記実施の形態２において、
発生監視タイマ７ａの初期値に上限値を設け、この上限
値に達すると発生監視タイマ７ａの初期値を上限値に固
定するようにしたので、トレース抑止期間が極端に大き
くなり障害解析に役立たなくなることを未然に防止する
ことができ、また、タイマＨ／Ｗの計数値を大きくする
ことによるＨ／Ｗコストの増加を抑えることができる。 As described above, according to the third embodiment, in the first embodiment and the second embodiment ,
An upper limit value is set for the initial value of the occurrence monitoring timer 7a, and when the upper limit value is reached, the initial value of the occurrence monitoring timer 7a is fixed to the upper limit value, so that the trace suppression period becomes extremely large and is not useful for failure analysis. This can be prevented in advance, and an increase in H / W cost due to an increase in the count value of the timer H / W can be suppressed.

【００５１】参考例３．この発明に関連する参考例について再び説明するが、こ
の参考例３は、トレース抑止期間中（発生監視タイマ７
ａの発生監視期間中）に発生した障害を、各要因毎にト
レースするのではなく、複数の要因をまとめて各ビット
が１つの要因に対応するようにトレースする点で上記参
考例１とは異なるものである。図１４はそのような参考
例３による発生監視タイマの処理を説明するためのフロ
ーチャートである。尚、上記参考例１で説明した処理動
作と重複する処理動作については、その説明を省略す
る。 Reference Example 3. Reference examples related to the present invention will be described again.
Reference example 3 of the above is during the trace suppression period (occurrence monitoring timer 7
the ginseng disorders that occurred occurred during the monitoring period) of a, instead of tracing every factor, each bit together multiple factors in that traces to correspond to a factor
This is different from Case 1 . Figure 14 is such a reference
11 is a flowchart for explaining processing of an occurrence monitoring timer according to example 3 . The description of the processing operation that is the same as the processing operation described in Reference Example 1 will be omitted.

【００５２】（ｂ）トレース抑止中である場合トレース抑止中である場合には、直接トレース処理は行
わずに、障害発生カウントを更新する。即ち、新たな障
害発生回数分だけ障害発生カウントを加算する（ステッ
プＳＴ１１）。そして、障害発生時刻に現在時刻に設定
するのみとする（ステップＳＴ１２）。その後、発生し
た障害がまとめてトレースすべきものか否かを判断し
（ステップＳＴ１３Ｅ、トレースステップ）、まとめて
トレースすべきものである場合は、障害要因バッファの
内容を、蓄積された障害要因バッファの内容と障害要因
レジスタの内容とのビット単位の論理和で更新する（ス
テップＳＴ１４Ｅ、トレースステップ）。その後、トレ
ース抑止中でない場合と同様に、未発生監視タイマ７ｂ
の初期値にＴ２を設定し（ステップＳＴ９）、未発生監
視タイマ７ｂを起動する（ステップＳＴ１０）。 [0052] (b) in the case when it is being traced deterrence is being traced suppression, without direct tracing, updates the failure count. That is, the failure occurrence count is added by the new failure occurrence count (step ST11). Then, the failure occurrence time is only set to the current time (step ST12). Thereafter, it is judged whether or not the generated faults should be traced together (step ST13E, trace step), and if they are traced together, the contents of the fault factor buffer are changed to the accumulated contents of the fault factor buffer. And the contents of the failure factor register are updated by a bitwise logical sum (step ST14E, trace step). After that, as in the case where the trace is not being suppressed, the non-occurrence monitoring timer 7b
T2 is set to the initial value of (step ST9), and the non-occurrence monitoring timer 7b is started (step ST10).

【００５３】以上のように、この参考例３によれば、上
記参考例１において、発生監視タイマ７ａの監視期間中
に発生した障害を、各要因毎にトレースするのではな
く、複数の要因をまとめて各ビットが１つの要因に対応
させてトレースするようにしたので、発生すると複数の
要因がセットされる可能性がある障害（例えば、ＳＤＨ
（同期データハイアラーキ）で定義されている障害（Ｂ
ＩＰ８エラーやＬＯＳ、ＬＯＦなど）など）を、まとめ
てトレースすることになるので、相互に関連してはいる
が要因毎にトレースをとるとトレースサイズが膨大にな
るのを未然に防止することができる。 As described above, according to the third reference example, in the first reference example , the failure occurring during the monitoring period of the occurrence monitoring timer 7a is not traced for each factor but a plurality of factors are detected. Since each bit is traced in association with one factor, a plurality of factors may be set when an error occurs (eg SDH).
Failures defined in (Synchronous Data Hierarchy) (B
(IP8 error, LOS, LOF, etc.) will be traced together, so it is possible to prevent the trace size from becoming enormous if traces are taken for each factor although they are related to each other. it can.

【００５４】参考例４．この発明に関連する参考例４は、マイクロプロセッサ１
１−１またはＣＰＵ１２−１が障害を検出するタイミン
グとして、障害発生毎に検出するのではなく、障害検出
タイミングをＴ３時間毎にしか行わずに検出頻度を抑え
るとともに、トレースタイミングも発生監視期間毎とす
る点で上記参考例１とは異なるものである。 Reference Example 4. A reference example 4 related to the present invention is a microprocessor 1
The timing at which the 1-1 or the CPU 12-1 detects a fault is not detected every time a fault occurs, but the fault detection timing is only performed every T3 hours to suppress the detection frequency, and the trace timing is also generated every occurrence monitoring period. This is different from Reference Example 1 above.

【００５５】図１５はそのようなこの参考例４による障
害検出処理および検出割り込みマスクタイマ処理を説明
するためのフローチャートである。図１５（ａ）に示す
障害検出処理は、上記図３に示した障害情報発生の処理
（ステップＳＴ１）に該当する。この処理においては、
障害検出割り込みが発生すると（ステップＳＴ３０１、
検出ステップ）、検出割り込み抑止中か否かの判断を行
う（ステップＳＴ３０２、検出ステップ）。検出割り込
み抑止中でなければ何もしないが、検出割り込み抑止中
であれば、該当する障害の検出割り込みに対してマスク
をかけ（ステップＳＴ３０３、検出ステップ）、検出割
り込みマスクタイマの初期値にＴ３を設定するとともに
（ステップＳＴ３０４、検出ステップ）、検出割り込み
マスクタイマを起動する（ステップＳＴ３０５、検出ス
テップ）。図１５（ｂ）に示す検出割り込みマスクタイ
マ処理は、検出割り込みマスクタイマの満了により起動
され（ステップＳＴ４０１、トレースステップ）、該当
する障害の検出割り込みマスクを解除する（ステップＳ
Ｔ４０２、トレースステップ）。 FIG . 15 is a flow chart for explaining the fault detection processing and detection interrupt mask timer processing according to the reference example 4 . The fault detection process shown in FIG. 15A corresponds to the fault information generation process (step ST1) shown in FIG. In this process,
When a fault detection interrupt occurs (step ST301,
Detection step), it is determined whether or not the detection interrupt is being suppressed (step ST302, detection step). If the detection interrupt is not suppressed, nothing is done, but if the detection interrupt is being suppressed, the detection interrupt of the corresponding fault is masked (step ST303, detection step), and T3 is set to the initial value of the detection interrupt mask timer. While setting (step ST304, detection step), the detection interrupt mask timer is activated (step ST305, detection step). The detection interrupt mask timer process shown in FIG. 15B is started when the detection interrupt mask timer expires (step ST401, trace step), and the detection interrupt mask of the corresponding fault is released (step S).
T402, trace step).

【００５６】以上のように、この参考例４によれば、障
害を検出するタイミングとして、障害検出タイミングを
Ｔ３時間毎にしか行わずに検出頻度を抑えるとともに、
トレースタイミングも発生監視期間毎としたので、発生
頻度が高く詳細な発生時刻よりは発生傾向が障害解析に
重要な障害に対しては、検出処理の負荷を軽減すること
ができる。 As described above, according to the fourth reference example , as the timing of detecting a fault, the fault detection timing is only performed every T3 hours, and the detection frequency is suppressed.
Since the trace timing is also set for each occurrence monitoring period, it is possible to reduce the load of the detection process for a failure that is more frequently occurring and the occurrence tendency is more important for failure analysis than the detailed occurrence time.

【００５７】参考例５．この発明に関連する参考例５は、上記参考例１とは異な
り、障害トレース方法設定テーブル９を備え、この障害
トレース方法設定テーブル９では、障害要因毎に監視期
間毎の障害トレースとするか、発生検出毎に毎回トレー
スするか、一旦発生したら装置リセットまでは繰り返さ
れる障害検出に対してもトレースしないのか、の選択的
設定が可能であり、またこの障害トレース方法設定テー
ブル９は保守ツール１０を介してユーザからの任意のタ
イミングで変更可能としたものである。 Reference Example 5. Different from Reference Example 1 described above, Reference Example 5 related to the present invention includes a failure tracing method setting table 9. In this failure tracing method setting table 9, whether failure tracing is performed for each monitoring cause for each failure factor, It is possible to selectively set whether to trace every occurrence detection or not to repeat the failure detection which is repeated until the device is reset once the failure detection method setting table 9 indicates that the maintenance tool 10 is set. It is possible to change it at any timing from the user.

【００５８】図１６は参考例５による障害トレース方法
の構成を示す概念図である。尚、図１６において図２と
同一または相当する構成には同一符号を付して、その重
複する説明を省略する。図において、９は種々の障害ト
レース方法の設定項目（図１６に示すように、発生した
障害が、トレース対象か、毎回トレースするか、リセッ
トまで１回だけのトレース実行か、トレース抑止対象
か）を格納した障害トレース方法設定テーブル、１０は
障害トレース方法設定テーブル９を任意のタイミングで
更新することができる保守ツールである。 FIG . 16 is a conceptual diagram showing the structure of the fault tracing method according to the reference example 5 . 16 that are the same as or equivalent to those in FIG. 2 are assigned the same reference numerals and overlapping explanations are omitted. In the figure, 9 indicates setting items for various fault tracing methods (whether the fault that has occurred is a trace target, a trace every time, a trace execution only once until reset, or a trace suppression target, as shown in FIG. 16). Is a maintenance tool that can update the failure trace method setting table 9 at any timing.

【００５９】次に動作について説明する。図１７は参考
例５による障害トレース方法の処理動作を説明するため
のフローチャートである。 [0059] Next, the operation will be described. Figure 17 is for reference
11 is a flowchart for explaining the processing operation of the fault tracing method according to example 5 .

【００６０】ＡＴＭ通信システムに障害が発生すると
（ステップＳＴ１）、マイクロプロセッサ１１−１また
はＣＰＵ１２−１は、発生した障害情報を収集する。障
害情報バッファへの書き込み部１が収集した障害情報を
障害トレースバッファ２に書き込みを行う（ステップＳ
Ｔ２）と同時に、障害トレース識別部５に障害情報を出
力する。 When a failure occurs in the ATM communication system (step ST1), the microprocessor 11-1 or the CPU 12-1 collects the failure information that has occurred. Writing to the fault information buffer The fault information collected by the writing unit 1 is written to the fault trace buffer 2 (step S
At the same time as T2), the failure information is output to the failure trace identifying section 5.

【００６１】障害トレース識別部５は、その障害情報を
受けると障害の種別（この種別はシステムに応じて種々
変更されるものである）からトレースすべき障害である
か否かの判定を行う（ステップＳＴ３）。（Ａ）トレースすべき障害でない場合障害トレース識別部５がトレースすべき障害でないと判
断する場合には、何も処理を行わずに処理を終了する。 [0061] fault trace identification unit 5, it is determined whether the failure to be traced from the type of disorder when receiving the fault information (in which those variously changed according to the type system) ( Step ST3). (A) When it is not a failure to be traced When the failure trace identifying unit 5 determines that it is not a failure to be traced, the processing is terminated without performing any processing.

【００６２】（Ｂ）トレースすべき障害である場合障害トレース識別部５がトレースすべき障害であると判
断する場合には、障害トレース識別部５はトレース抑止
制御部６に障害情報を出力する。トレース抑止制御部６
は、障害トレース識別部５から出力された障害情報を受
け取ると、障害トレース方法設定テーブル９を参照し
て、この障害は装置リセットまで検出しないのか否か
（トレース対象か否か）を判定する（ステップＳＴ４
Ｆ）。 [0062] When the (B) If a fault should be traced fault trace identification unit 5 is determined to be failure to be traced, the fault trace identifying unit 5 outputs the failure information to the trace inhibit control unit 6. Trace suppression control unit 6
When receiving the fault information output from the fault trace identifying unit 5, the device refers to the fault tracing method setting table 9 and determines whether or not this fault is not detected until device reset (whether it is a trace target or not) ( Step ST4
F).

【００６３】（Ｉ）リセットまで検出しないに該当する
場合トレース抑止制御部６が、障害トレース識別部５から出
力された障害が装置リセットまで検出しないに該当する
と判定した場合、該当する障害の検出マスクをかけた後
（ステップＳＴ５Ｆ）、近辺トレース部８が、障害発生
カウントに１をセットするとともに（ステップＳＴ６
Ｆ）、障害発生時刻に現在時刻を設定して（ステップＳ
Ｔ７Ｆ）、障害の発生回数及び発生時刻を障害トレース
バッファ２の内容に書き込み、障害トレースファイルへ
の書き込み部３が、障害情報とともに障害発生回数、障
害発生時刻が書き込まれた障害トレースバッファ２の内
容を障害トレースファイル４へ書き込みを実行する（ス
テップＳＴ８Ｆ）。そして、発生監視タイマ７ａの処理
を起動（開始）する（ステップＳＴ９Ｆ）。 [0063] (I) is the trace inhibit control unit 6 if applicable to not detected until reset, if the fault output from fault trace identifying unit 5 determines that corresponds to no detected until system reset, the corresponding fault detection mask After applying (step ST5F), the neighboring trace unit 8 sets the failure occurrence count to 1 (step ST6).
F), the current time is set as the failure occurrence time (step S
T7F), the occurrence frequency and the occurrence time of the fault are written in the contents of the fault trace buffer 2, and the error trace file writing unit 3 writes the fault occurrence count and the fault occurrence time together with the fault information in the fault trace buffer 2. Is written to the fault trace file 4 (step ST8F). Then, the process of the occurrence monitoring timer 7a is activated (started) (step ST9F).

【００６４】（II）リセットまで検出しないに該当しな
い場合トレース抑止制御部６が、障害トレース識別部５から出
力された障害が装置リセットまで検出しないに、該当し
ないと判定した場合、次に、トレース抑止制御部６は、
障害トレース方法設定テーブル９を参照して、この障害
は毎回トレースするか否かの判定を行う（ステップＳＴ
１０Ｆ）。(II) In the case where it does not correspond to not detecting until reset : When the trace suppression control unit 6 determines that the failure output from the failure trace identifying unit 5 does not detect until device reset and does not correspond, the trace The suppression control unit 6
By referring to the fault tracing method setting table 9, it is determined whether or not this fault is traced every time (step ST
10F).

【００６５】（１）毎回トレースするに該当する場合トレース抑止制御部６が、毎回トレースするに該当する
と判定した場合、近辺トレース部８が障害発生カウント
に１をセットするとともに（ステップＳＴ６Ｆ）、障害
発生時刻に現在時刻を設定し（ステップＳＴ７Ｆ）、障
害トレースファイルへの書き込み部３が障害トレースフ
ァイルに書き込みを実行し（ステップＳＴ８Ｆ）、発生
監視タイマ７ａの処理を起動（開始）する（ステップＳ
Ｔ９Ｆ）。 [0065] (1) the trace inhibit control unit 6 if applicable to trace each time, when it is determined that corresponds to trace each, near the trace portion 8 is thereby set to 1 failure count (step ST6F), disorders The present time is set as the occurrence time (step ST7F), the error trace file writing unit 3 writes the error trace file (step ST8F), and starts (starts) the process of the occurrence monitoring timer 7a (step S).
T9F).

【００６６】（２）毎回トレースするに該当しない場合トレース抑止制御部６が、毎回トレースするに該当しな
いと判定した場合、次に、トレース抑止制御部６は、障
害トレース方法設定テーブル９を参照して、この障害が
トレース抑止中か否かの判断を行う（ステップＳＴ１１
Ｆ）。 [0066] (2) if not applicable trace inhibit control unit 6 to trace each time, if it is determined not to correspond to trace each, then the trace inhibit control section 6 refers to the fault trace method setting table 9 Then, it is determined whether or not this failure is in the trace inhibition (step ST11).
F).

【００６７】（ａ）トレース抑止中でない場合トレース抑止中でない場合（即ち、障害が初回発生の場
合）には、近辺トレース部８は、障害発生カウントに１
をセットするとともに（ステップＳＴ１２Ｆ）、障害発
生時刻を現在時刻に設定して（ステップＳＴ１３Ｆ）、
障害の発生回数及び発生時刻を障害トレースバッファ２
の内容に書き込み、障害トレースファイルへの書き込み
部３が、障害情報とともに障害の発生回数及び発生時刻
が書き込まれた障害トレースバッファ２の内容を障害ト
レースファイル４に書き込む（ステップＳＴ１４Ｆ）。
そして、発生監視タイマ７ａの処理を起動（開始）する
（ステップＳＴ１５Ｆ）。尚、発生監視タイマ７ａの処
理は、上記参考例１と同様、図４に示すように行われ
る。 [0067] If the case is not in (a) trace deterrence is not in trace suppression (that is, if the failure of the first generation), the vicinity of the trace unit 8, 1 to failure count
Is set (step ST12F), the failure occurrence time is set to the current time (step ST13F),
The number of times the error occurred and the time at which it occurred
And the failure trace file writing unit 3 writes the content of the failure trace buffer 2 in which the failure occurrence count and the occurrence time are written in the failure trace file 4 together with the failure information (step ST14F).
Then, the process of the occurrence monitoring timer 7a is activated (started) (step ST15F). The process of the occurrence monitoring timer 7a is performed as shown in FIG. 4, as in the first reference example .

【００６８】その後、障害が所定時間（Ｔ２）発生しな
い場合にトレース抑止制御部６のトレース抑止状態を解
除するため、未発生監視タイマ７ｂの初期値にＴ２を設
定し（ステップＳＴ１６Ｆ）、未発生監視タイマ７ｂを
起動する（ステップＳＴ１７Ｆ）。尚、未発生監視タイ
マ７ｂの処理は、上記参考例１と同様、図５に示すよう
に行われる。 [0068] Thereafter, since a failure to release the trace inhibited state of the trace inhibit control unit 6 when a predetermined time (T2) does not occur, set T2 to the initial value of the non-occurrence monitoring timer 7b (step ST16F), non-occurrence The monitoring timer 7b is activated (step ST17F). The process of the non-occurrence monitoring timer 7b is performed as shown in FIG. 5, as in the first reference example .

【００６９】（ｂ）トレース抑止中である場合トレース抑止中である場合には、直接トレース処理は行
わずに、障害発生カウントを更新する。即ち、新たな障
害発生回数分だけ障害発生カウントを加算する（ステッ
プＳＴ１８Ｆ）。そして、障害発生時刻に現在時刻を設
定するのみとする（ステップＳＴ１９Ｆ）。尚、トレー
ス抑止中である場合における発生監視タイマ７ａの処理
は、上記参考例１と同様、図４に示すように行われる。
その後、トレース抑止中でない場合と同様に、未発生監
視タイマ７ｂの初期値にＴ２を設定し（ステップＳＴ１
６Ｆ）、未発生監視タイマ７ｂを起動する（ステップＳ
Ｔ１７Ｆ）。 [0069] (b) in the case when it is being traced deterrence is being traced suppression, without direct tracing, updates the failure count. That is, the failure occurrence count is added by the new failure occurrence count (step ST18F). Then, only the current time is set as the failure occurrence time (step ST19F). The processing of generating the monitoring timer 7a when it is being traced suppression, similar to Reference Example 1 is performed as shown in FIG.
After that, as in the case where the trace suppression is not being performed, T2 is set to the initial value of the non-occurrence monitoring timer 7b (step ST1
6F), the non-occurrence monitoring timer 7b is started (step S
T17F).

【００７０】以上のように、この参考例５によれば、種
々の障害トレース方法の設定項目を格納した障害トレー
ス方法設定テーブル９を備えることにより、選択的設定
を可能とし、またこの障害トレース方法設定テーブル９
を保守ツール１０を介してユーザからの任意のタイミン
グで変更可能としたので、障害の発生を考慮したトレー
ス方法を柔軟に選択でき、障害解析能力を落とさずに、
全体的な障害トレース処理負荷を低減し、トレース容量
を削減することができる。 As described above, according to this reference example 5 , by providing the fault tracing method setting table 9 storing the setting items of various fault tracing methods, the selective setting is possible, and this fault tracing method is also possible. Setting table 9
Can be changed at any timing from the user via the maintenance tool 10, so that a trace method that considers the occurrence of a failure can be flexibly selected, and the failure analysis capability is not reduced.
The overall fault trace processing load can be reduced and the trace capacity can be reduced.

【００７１】尚、上記各参考例及び上記各実施の形態に
おいては、障害トレース方法をＡＴＭ通信システムに適
用した場合について説明したが、これに限る必要はな
く、様々なシステムに適用することも可能である。 In each of the above reference examples and each of the above embodiments, the case where the fault tracing method is applied to the ATM communication system has been described, but the present invention is not limited to this, and various systems can be used. It is also possible to apply to.

【００７２】[0072]

【発明の効果】以上のように、請求項１記載の発明によ
れば、システム内で発生した障害を検出する検出ステッ
プと、この検出ステップが検出した障害内容をトレース
するトレースステップと、上記検出ステップが検出した
障害が繰り返し継続的に発生する場合には、上記トレー
スステップがトレースする障害内容を抑止するトレース
抑止ステップとを備え、トレース抑止ステップが、トレ
ースステップに、最初に発生した障害は、障害内容と障
害発生時刻をトレースさせ、最初の障害が発生してから
所定監視期間内に発生した障害は、障害内容、障害発生
回数、及び上記所定監視期間内に発生した最新障害発生
時刻をトレースさせ、さらに継続して発生した障害は、
上記所定監視期間を順次ｎ倍した監視期間内に発生した
障害の障害内容、発生回数、及び最新障害発生時刻をト
レースさせるように構成している。したがって、トレー
ス領域が頻繁に発生する障害の情報で埋め尽くされ、障
害発生の継続時間や最初に発生した時の情報が消失する
のを防止し、ネットワーク障害の原因解析の妨げとなる
のを防ぐことができるばかりでなく、監視期間を指数関
数的に増加させることができるため、一旦発生すると継
続発生期間が長い障害に対して、トレース領域を更に抑
えることができる効果がある。 As described above, according to the first aspect of the present invention, a detecting step for detecting a fault occurring in the system, a tracing step for tracing the content of the fault detected by the detecting step, and the above detecting If the fault detected by the step repeatedly occurs continuously, the trace inhibiting step includes a trace inhibiting step for inhibiting the content of the error traced by the trace step.
In the first step, the failure that occurred first is
Trace the time of the harm, and after the first failure
The failure that occurred within the specified monitoring period is the failure content and failure occurrence.
Number of times and latest failure occurred within the above specified monitoring period
The time is traced, and the fault that occurred continuously is
Occurred within the monitoring period, which is obtained by multiplying the above predetermined monitoring period by n times.
The failure details, the number of occurrences, and the latest failure occurrence time of the failure are displayed.
It is configured to race. Thus, it filled with information of the failure that the trace region occurs frequently, to prevent the information when generated in the duration and the first fault disappears, prevent hinders cause analysis of network failure it not only can, the monitoring period exponential function
Since it can be increased numerically, once it occurs
The trace area is further suppressed for failures with a long continuous period.
There is an effect that can e Rukoto.

【００７３】請求項２記載の発明によれば、トレース抑
止ステップが、トレースステップに、最初に発生した障
害は、障害内容と障害発生時刻をトレースさせ、最初の
障害が発生してから所定監視期間内に発生した障害は、
障害内容、障害発生回数、及び上記所定監視期間内に発
生した最旧障害発生時刻をトレースさせ、さらに継続し
て発生した障害は、上記所定監視期間を順次ｎ倍した監
視期間内に発生した障害の障害内容、発生回数、最旧障
害発生時刻、及び最新障害発生時刻をトレースさせるよ
うに構成している。したがって、トレース領域が頻繁に
発生する障害の情報で埋め尽くされ、障害発生の継続時
間や最初に発生した時の情報が消失するのを防止し、ネ
ットワーク障害の原因解析の妨げとなるのを防ぐことが
できるばかりでなく、長いトレース抑止期間中で、障害
が平均的に発生しているのか、バースト的に発生してい
るのかの情報を得ることができ、継続発生期間が長い障
害に対してもトレース領域を抑えることができるととも
に、高い障害解析性を発揮することができる効果があ
る。 [0073] According to the second aspect of the invention, it traces depression
The stop step causes the trace step to
Harm causes the content of the failure and the time when the failure occurred to be traced.
The failure that occurred within the specified monitoring period after the failure occurred,
The nature of the failure, the number of times the failure occurred, and the
Trace the oldest failure time that occurred and continue
The failure that occurred was monitored by multiplying the above specified monitoring period by n times.
Fault details, number of occurrences, and oldest faults that occurred during the viewing period
Trace the time when the damage occurred and the time when the latest failure occurred.
It is configured as Therefore, the trace area frequently
When the failure continues, it is filled with information on the failure that occurs.
To prevent the loss of information during
To prevent it from interfering with the cause analysis of
Not only can it fail during long trace suppression periods
Is occurring on average or in a burst
Information that can be obtained for a long time
The trace area can be suppressed even for harm.
In addition, there is an effect that high failure analysis performance can be exhibited.
It

【００７４】請求項３記載の発明によれば、さらに、監
視期間に上限値を設け、上記監視期間が上記上限値に到
達すると該上限値に固定するように構成したので、トレ
ース抑止期間が極端に大きくなり障害解析に役立たなく
なることを未然に防止することができ、また、タイマＨ
／Ｗの計数値を大きくすることによるＨ／Ｗコストの増
加を抑えることができる効果がある。 [0074] According to the third aspect of the present invention, further, superintendent
The upper limit is set for the viewing period, and the monitoring period reaches the upper limit.
When it reaches the upper limit, it is fixed to the upper limit.
The control period becomes extremely large and is not useful for failure analysis.
Can be prevented in advance, and the timer H
Increase in H / W cost by increasing the count value of / W
This has the effect of suppressing the increase.

【００７５】請求項４記載の発明によれば、さらに、検
出ステップが検出した障害が所定期間発生しなくなった
場合にトレースの抑止を解除するトレース抑止解除ステ
ップをも備えたので、障害が継続的に発生しなくなると
トレース抑止状態を解除することにより、このようなト
レース領域が満杯にならない場合には詳細な障害内容
（障害情報）をトレースすることができ、障害発生原因
の究明を通常通り行うことができる効果がある。 [0075] According to the fourth aspect of the present invention, further, search
The fault detected by the outgoing step no longer occurs for the specified period.
If you want to cancel the trace suppression
Since it also has a backup, if the failure does not occur continuously
By removing the trace suppression state,
Detailed obstacle details if the race area is not full
(Fault information) can be traced and the cause of fault occurrence
The effect is that the investigation of can be performed as usual.

[Brief description of drawings]

【図１】ＡＴＭ通信システムを示す構成図である。FIG. 1 is a configuration diagram showing an ATM communication system.

【図２】この発明に関連する参考例１の障害トレース
方法の構成を示す概念図である。FIG. 2 is a conceptual diagram showing a configuration of a fault tracing method of reference example 1 related to the present invention.

【図３】同参考例１に係る障害トレース方法の処理動
作を説明するためのフローチャートである。FIG. 3 is a flowchart for explaining the processing operation of the fault tracing method according to the first reference example .

【図４】同参考例１に係る発生監視タイマ処理動作を
説明するためのフローチャートである。FIG. 4 is a flowchart for explaining an occurrence monitoring timer processing operation according to the first reference example .

【図５】同参考例１に係る未発生監視タイマ処理動作
を説明するためのフローチャートである。FIG. 5 is a flowchart for explaining a non-occurrence monitoring timer processing operation according to the first reference example .

【図６】トレース抑止中に障害発生回数及び障害発生
時刻をトレースする場合を説明するための図である。FIG. 6 is a diagram for explaining a case in which the number of times of failure occurrence and the time of failure occurrence are traced during trace suppression.

【図７】参考例１に係る障害トレース方法によりトレ
ースする一例を示す図である。FIG. 7 is a diagram showing an example of tracing by the fault tracing method according to the first reference example .

【図８】この発明に関連する参考例２の障害トレース
方法の処理動作を説明するためのフローチャートであ
る。FIG. 8 is a flowchart for explaining the processing operation of the fault tracing method of the second reference example related to the present invention.

【図９】トレース抑止中に障害発生回数及び障害発生
時刻をトレースする場合を説明するための図である。FIG. 9 is a diagram for explaining a case where the number of times of failure occurrence and the time of failure occurrence are traced during trace suppression.

【図１０】この発明の実施の形態１による発生監視タ
イマの処理を説明するためのフローチャートである。FIG. 10 is a flowchart for explaining the process of the occurrence monitoring timer according to the first embodiment of the present invention.

【図１１】この発明の実施の形態１による障害トレー
ス方法によりトレースする一例を示す図である。11 is a diagram showing an example of tracing the fault trace method according to the first embodiment of the present invention.

【図１２】この発明の実施の形態２による障害トレー
ス方法の処理動作を説明するためのフローチャートであ
る。FIG. 12 is a flowchart for explaining the processing operation of the fault tracing method according to the second embodiment of the present invention.

【図１３】この発明の実施の形態３に係る発生監視タ
イマの処理を説明するためのフローチャートである。FIG. 13 is a flowchart for explaining processing of an occurrence monitoring timer according to the third embodiment of the present invention.

【図１４】この発明に関連する参考例３に係る発生監
視タイマの処理を説明するためのフローチャートであ
る。FIG. 14 is a flowchart for explaining a process of an occurrence monitoring timer according to a reference example 3 related to the present invention.

【図１５】この発明に関連する参考例４に係る障害検
出処理および検出割り込みマスクタイマ処理を説明する
ためのフローチャートである。FIG. 15 is a flowchart for explaining a fault detection process and a detection interrupt mask timer process according to a reference example 4 related to the present invention.

【図１６】この発明に関連する参考例５による障害ト
レース方法の構成を示す概念図である。FIG. 16 is a conceptual diagram showing the configuration of a fault tracing method according to Reference Example 5 related to the present invention.

【図１７】同参考例５による障害トレース方法の処理
動作を説明するためのフローチャートである。FIG. 17 is a flowchart for explaining the processing operation of the fault tracing method according to the fifth reference example .

【図１８】従来の障害トレース方法の構成を示す概念
図である。FIG. 18 is a conceptual diagram showing a configuration of a conventional fault tracing method.

[Explanation of symbols]

１障害トレースバッファへの書き込み部、２障害ト
レースバッファ、３障害トレースファイルへの書き込
み部、４障害トレースファイル、６トレース抑止制
御部、７監視タイマ、７ａ発生監視タイマ（監視タイ
マ）、７ｂ未発生監視タイマ（監視タイマ）、８近
辺トレース部、９障害トレース方法設定テーブル、Ｓ
Ｔ１，ＳＴ２，ＳＴ３０１〜ＳＴ３０５検出ステッ
プ、ＳＴ７，ＳＴ１０５，ＳＴ１３Ｅ，ＳＴ１４Ｅ，Ｓ
Ｔ４０１，ＳＴ４０２トレースステップ、ＳＴ３〜Ｓ
Ｔ６，ＳＴ８，ＳＴ１１，ＳＴ１２，ＳＴ１０１〜ＳＴ
１０４，ＳＴ１０６〜ＳＴ１０８，ＳＴ１２Ａ，ＳＴ１
３Ａ，ＳＴ１０７Ｂ，ＳＴ１０８Ｂ，ＳＴ６Ｃ，ＳＴ１
２Ｃ〜ＳＴ１４Ｃ，ＳＴ１０７Ｄ，ＳＴ１０９Ｄトレ
ース抑止ステップ、ＳＴ９，ＳＴ１０，ＳＴ２０１〜Ｓ
Ｔ２０３トレース抑止解除ステップ。1 failure trace buffer write section, 2 failure trace buffer, 3 failure trace file write section, 4 failure trace file, 6 trace suppression control section, 7 monitoring timer, 7a occurrence monitoring timer (monitoring timer), 7b no occurrence Monitoring timer (monitoring timer), 8 near trace part, 9 fault tracing method setting table, S
T1, ST2, ST301 to ST305 Detection step, ST7, ST105, ST13E, ST14E, S
T401, ST402 Trace step, ST3-S
T6, ST8, ST11, ST12, ST101-ST
104, ST106 to ST108, ST12A, ST1
3A, ST107B, ST108B, ST6C, ST1
2C to ST14C, ST107D, ST109D Trace suppression step, ST9, ST10, ST201 to S
T203 Trace suppression cancellation step.

フロントページの続き (56)参考文献特開平８−69395（ＪＰ，Ａ) 特開平５−95357（ＪＰ，Ａ) 特開平９−65458（ＪＰ，Ａ) 特開平７−321782（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) H04L 12/26 G06F 11/34 H04L 29/14 Continuation of front page (56) Reference JP-A-8-69395 (JP, A) JP-A-5-95357 (JP, A) JP-A-9-65458 (JP, A) JP-A-7-321782 (JP , A) (58) Fields surveyed (Int.Cl. ⁷ , DB name) H04L 12/26 G06F 11/34 H04L 29/14

Claims

(57) [Claims]

1. A detection step for detecting a fault occurring in the system, a trace step for tracing the content of the fault detected by the detection step, and a case where the fault detected by the detection step repeatedly occurs continuously. , fault trace method odor and a trace suppression step of suppressing the fault contents above trace step traces
The trace inhibiting step includes the trace step.
First, the failure that occurred first is the content of the failure and the time when the failure occurred.
Traced to the specified monitoring period after the first failure.
The failures that occurred in the interval are the details of the failure, the number of failures, and
Trace the latest failure occurrence time that occurred within the specified monitoring period
The above-mentioned specified monitoring
Failure of a failure that occurred within the monitoring period that is n times as long as the visual period
Trace contents, number of occurrences, and latest failure occurrence time
A fault tracing method characterized by the above.

2. A detection step of detecting a fault occurring in the system, a trace step of tracing the content of the fault detected by the detection step, and a fault detected by the detection step in a case where the fault repeatedly occurs continuously. , fault trace method odor and a trace suppression step of suppressing the fault contents above trace step traces
The trace suppression step is
First, the failure that occurred first is the content of the failure and the time when the failure occurred.
Traced to the specified monitoring period after the first failure.
The failures that occurred in the interval are the details of the failure, the number of failures, and
Trace the oldest failure occurrence time that occurred within the above specified monitoring period
The above-mentioned specified monitoring
Failure of a failure that occurred within the monitoring period that is n times as long as the visual period
Content, number of occurrences, oldest failure occurrence time, and latest failure occurrence
Failure characterized by tracing the time
How to trace.

3. An upper limit value is set for the monitoring period, and the monitoring period is set.
Is fixed to the upper limit when
The fault trace according to claim 1 or 2, characterized in that
Method.

4. The fault detected by the detecting step is for a predetermined period.
A trace that cancels the trace suppression when it no longer occurs.
And a step of releasing the source deterrence.
The obstacle tray according to any one of claims 1 to 3.
Way.