JPH0721059A

JPH0721059A - Erroneous log information managing method

Info

Publication number: JPH0721059A
Application number: JP5164509A
Authority: JP
Inventors: Masamichi Hoshino; 正道星野
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1993-07-02
Filing date: 1993-07-02
Publication date: 1995-01-24

Abstract

PURPOSE:To effectively use erroneous log information and to apply uniform and efficient countermeasure and preventive maintenance for failure by regulating the format/content of erroneous log information based on object oriented consideration. CONSTITUTION:In a client server system consisting of plural hierarchies, the failure detail information of hardware failure sampled at every unit in a peripheral device and the erroneous log information representing error statistical information in each hierarchy are stored in the specific areas of external memory devices 25, 29. After recorded information is read out periodically and it is sent to a high-order server sequentially and the information is collected comprehensively, a failure occurrence process list and an abnormal value list when the value exceeds the threshold value of the error statistical information are generated, and they are outputted to the output devices 26, 2A of the high- order server. A unification server 27, after transferring collected erroneous log information to a terminal 2B at a maintenance base, edits a list for maintenance, and sends it to the output device 2D of the maintenance terminal 2B, then, it is used by each peripheral device as feedback data.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、コンピュ−タおよび周
辺装置のハ−ドウェア障害時に、周辺装置内のユニット
毎に採取される障害詳細情報とエラ−統計情報を示すエ
ラ−ログ情報を管理し、障害対策および予防保守を図る
ための編集リストを出力するようにしたエラ−ログ情報
管理方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention manages error log information indicating detailed error information and error statistical information collected for each unit in a peripheral device when a hardware error occurs in the computer and the peripheral device. In addition, the present invention relates to an error log information management method that outputs an edit list for troubleshooting and preventive maintenance.

【０００２】[0002]

【従来の技術】従来より、ＲＡＳ、つまりReliability
(信頼性)、Availability(有用性)およびServiceability
(保守性)について、ある製品を一定周期毎に収集デ−タ
を採取してメモリに記憶する等、単体の製品に対しては
方法が考えられていたが、製品に対する総合的なＲＡＳ
としての考え方は未だ確立されていなかった。すなわ
ち、コンピュ−タ情報処理システムの場合、製品の稼働
から予防保守までを含めたシステムＲＡＳの観点に立っ
てエラ−ログ情報の管理方法を考えたものはなく、あっ
ても内容が明確ではなかった。また、ベンダおよび保守
拠点に対して、ハ−ドウェア製品における稼働品質を向
上させるため、エラ−ログ情報をフィ−ドバックデ−タ
として戻すことにより、有効活用を図るという考え方は
なく、またシステム全体に対して、均一で効率的な障害
対策および予防保守を図る一貫したエラ−ログ情報管理
方法は確立されていなかった。なお、従来、装置内で発
生する各種デ−タを収集し、一定期間毎に外部記憶装置
に書き込む方法としては、実開平４−１１１６４９号公
報に記載された『デ−タ収集記録装置』がある。2. Description of the Related Art Conventionally, RAS, that is, reliability
(Reliability), Availability and Serviceability
Regarding (maintenance), a method was considered for a single product, such as collecting data for a certain product at fixed intervals and storing it in memory, but a comprehensive RAS for the product
The idea of as was not established yet. That is, in the case of a computer information processing system, there is no one that has considered a method of managing error log information from the viewpoint of system RAS including the operation of products to preventive maintenance, and the contents are not clear. It was In addition, there is no concept of effective use by returning error log information as feedback data to the vendors and maintenance bases in order to improve the operating quality of hardware products. On the other hand, a consistent error log information management method for uniform and efficient failure countermeasures and preventive maintenance has not been established. Incidentally, as a conventional method for collecting various data generated in the apparatus and writing the same in an external storage device at regular intervals, there is a "data collection recording device" described in Japanese Utility Model Laid-Open No. 4-111649. is there.

【０００３】[0003]

【発明が解決しようとする課題】従来では、システム全
体として一元管理／集中管理できるようなエラ−ログ情
報の採取、収集、編集、あるいは転送の各機能を備えた
エラ−ログ情報の管理システムは、未だ確立されていな
い。このように、システムＲＡＳの観点に立ったエラ−
ログ情報管理方法が確立されていなかったので、ベンダ
や保守拠点における保守部署に対して、デ−タをフィ−
ドバックさせて、ハ−ドウェア製品の稼働品質向上のた
めに有効活用を図ること、あるいは均一で効率的な障害
対策と予防保守を図ることはできなかった。本発明の目
的は、このような従来の課題を解決し、システム全体と
して一元管理／集中管理することができるようなエラ−
ログ情報の採取、収集、編集、あるいは転送の各機能を
備えたエラ−ログ情報管理方法を提供することにある。Conventionally, an error log information management system having a function of collecting, collecting, editing, or transferring error log information capable of performing centralized / centralized management of the entire system has been known. , Not yet established. In this way, the error from the viewpoint of system RAS
Since the log information management method was not established, the data was sent to the vendor and the maintenance department at the maintenance base.
However, it was not possible to make effective use to improve the operation quality of hardware products by implementing feedback, or to implement uniform and efficient failure countermeasures and preventive maintenance. An object of the present invention is to solve the above-mentioned conventional problems and to perform an error management such that the system as a whole can be managed centrally / centrally.
An object of the present invention is to provide an error log information management method having each function of collecting, collecting, editing, or transferring log information.

【０００４】[0004]

【課題を解決するための手段】上記目的を達成するた
め、本発明のエラ−ログ情報管理方法は、それぞれ外部
記憶装置（２２，２５，２９，２Ｃ）、入出力装置（２
１，２４，２６，２８，２Ａ，２Ｄ）等の周辺装置を接
続したクライアント（２０）、クライアント（２０）を
管理する複数の営業店サ−バ（２３）および営業店サ−
バ（２３）を一元管理する統合サ−バ（２７）、ならび
に統合サ−バ（２７）に接続された保守拠点（２Ｂ）の
複数階層からなるクライアントサ−バシステムにおい
て、各階層の周辺装置内ユニット毎に採取されるハ−ド
ウェア障害の障害詳細情報（５１〜５Ｎ）およびエラ−
統計情報を示すエラ−ログ情報（６１〜６Ｍ）を接続さ
れた外部記憶装置（２２，２５，２９）の特定エリアに
記録し、記録された情報を定期的に読み出して、上位の
サ−バに順次送信することにより、上位サ−バに情報を
一括して収集した後、編集を行って見易くし、障害詳細
情報の障害発生推移リスト（図１０参照）およびエラ−
統計情報の閾値を超えた時の異常値リスト（図１１参
照）を上位のサ−バの出力装置（２６，２Ａ）に出力す
るとともに、統合サ−バ（２７）に収集されたエラ−ロ
グ情報を保守拠点の端末（２Ｂ）に転送した後、編集し
た保守用リストを保守端末（２Ｂ）の出力装置（２Ｄ）
に出力することにより、保守用リストをフィ−ドバック
デ−タとして各周辺装置に活用することを特徴としてい
る。In order to achieve the above object, the error log information management method of the present invention comprises an external storage device (22, 25, 29, 2C) and an input / output device (2).
1, 24, 26, 28, 2A, 2D) and the like connected to the client (20), a plurality of sales office servers (23) for managing the clients (20), and sales office server.
In a client server system having a plurality of layers of an integrated server (27) for centrally managing the server (23) and a maintenance base (2B) connected to the integrated server (27), peripheral devices of each layer Fault detail information (51-5N) of hardware faults collected for each internal unit and error
The error log information (61 to 6M) indicating the statistical information is recorded in a specific area of the connected external storage device (22, 25, 29), and the recorded information is periodically read out to obtain a higher rank server. Information is collected in a high-order server all at once, and then edited to make it easier to see, and the failure occurrence transition list (see FIG. 10) and error information in the failure detailed information are displayed.
An abnormal value list (see FIG. 11) when the threshold value of the statistical information is exceeded is output to the output device (26, 2A) of the upper server, and the error log collected in the integrated server (27). After transferring the information to the maintenance base terminal (2B), the edited maintenance list is output to the maintenance terminal (2B) output device (2D).
It is characterized in that the maintenance list is used as feedback data for each peripheral device by outputting to the peripheral device.

【０００５】[0005]

【作用】本発明においては、オブジェクト指向の考え方
に基づいて、エラ−ログ情報の形式／内容を規定するこ
とにより、システム全体として一元集中管理できるよう
にする。すなわち、ハ−ドウェア障害時、周辺装置内ユ
ニット毎に採取される障害詳細情報およびエラ−統計情
報を示すエラ−ログ情報の構成要素である項目表現の一
貫性、保守性および拡張性を保つために、種々の特性を
持ったプリミティブ（つまり、性質が全く異なる）な属
性情報として定義し、ソフトウェアやシステムに依存し
ないエラ−ログ情報の形式／内容を規定する。そして、
ハ−ドウェア障害時には、周辺装置内のユニット毎に採
取された障害詳細情報とエラ−統計情報を示すエラ−ロ
グ情報を、外部記憶装置の特定領域に書き込むととも
に、これらを定期的に読み出して、順次、上位システム
に一括して集収する。上位システムおよび保守拠点で
は、一括収集した後、見易いように編集して、障害詳細
情報の障害発生推移リストおよびエラ−統計情報の閾値
を超えた時の異常値リストを出力装置から出力する。ま
た、最上位システム、例えば統合サ−バや営業点サ−バ
から収集ずみのエラ−ログ情報を保守拠点毎に分類し
て、保守拠点に設けられた保守端末に転送することによ
り、その保守端末の出力装置から編集した保守用リスト
を出力する。In the present invention, the format / content of the error log information is defined based on the object-oriented concept, so that the system as a whole can be centrally managed. That is, in order to maintain consistency, maintainability, and expandability of the item expression, which is a component of error log information indicating detailed error information and error statistical information collected for each unit in a peripheral device when a hardware error occurs. In addition, it is defined as primitive attribute information having various characteristics (that is, completely different properties), and defines the format / content of error log information that does not depend on software or system. And
In the event of a hardware failure, error log information indicating failure detailed information and error statistical information collected for each unit in the peripheral device is written to a specific area of the external storage device, and these are read periodically, Collected sequentially in a higher system. In the host system and the maintenance base, after collectively collecting, it is edited for easy viewing, and the failure occurrence transition list of the failure detailed information and the abnormal value list when the threshold of the error statistical information is exceeded are output from the output device. In addition, the error log information collected from the highest level system, for example, the integrated server or the sales point server, is classified for each maintenance base and transferred to the maintenance terminal provided in the maintenance base for maintenance. Output the edited maintenance list from the terminal output device.

【０００６】[0006]

【実施例】以下、本発明の実施例を、図面により詳細に
説明する。図１および図２は、本発明を適用した階層シ
ステムを示す全体図である。ここでは、クライアント
（顧客）、その上位の営業店サ−バ（供給者）、さらに
上位の統合サ−バから構成されるクライアントサ−バシ
ステムを示している。図２において、２０はクライアン
トの処理装置で、各種のデ−タを処理する装置、２１は
クライアント処理装置２０に接続されたディスプレイ、
ハ−ドディスクおよびプリンタ等の複数個のユニットか
らなる周辺装置、２２はハ−ドウェア障害時に、周辺装
置内ユニット毎に採取される障害詳細情報およびエラ−
統計情報を示すエラ−ログ情報を記録する外部記憶装置
である。また、図１において、２３は営業点サ−バの処
理装置であって、クライアントから収集したデ−タの処
理を行う装置、２４はキ−ボ−ドディスプレイ等の各種
のコマンドを入力する入力装置、２５は営業店サ−バで
ハ−ドウェア障害が発生したとき、そのサ−バに接続さ
れた周辺装置内ユニット毎に採取される障害詳細情報、
およびエラ−統計情報を示すエラ−ログ情報、およびそ
のサ−バの下に位置するクライアント２０から収集した
エラ−ログ情報を記録するための外部記憶装置、２６は
その営業店システム内の各種情報を編集し、リストにし
て出力するプリンタである。Embodiments of the present invention will now be described in detail with reference to the drawings. 1 and 2 are overall views showing a hierarchical system to which the present invention is applied. Here, a client server system including a client (customer), a sales office server (supplier) above it, and an integrated server above it is shown. In FIG. 2, reference numeral 20 denotes a client processing apparatus, which is an apparatus for processing various data, 21 denotes a display connected to the client processing apparatus 20,
A peripheral device composed of a plurality of units such as a hard disk and a printer. Reference numeral 22 denotes detailed error information and an error collected for each unit in the peripheral device when a hardware error occurs.
The external storage device records error log information indicating statistical information. Further, in FIG. 1, reference numeral 23 is a processing point server processing apparatus for processing data collected from clients, and 24 is an input for inputting various commands such as a keyboard display. When a hardware failure occurs at a sales office server, the device 25 is detailed failure information collected for each unit in the peripheral device connected to the server,
And an external storage device for recording the error log information indicating the error statistical information and the error log information collected from the client 20 located under the server, and 26 is various information in the sales office system. Is a printer that edits and outputs a list.

【０００７】また、図１の最上段の２７は統合サ−バの
処理装置であって、全営業店システムの一元管理／集中
管理する統合サ−バの処理装置であって、営業店サ−バ
から収集したデ−タの処理を行う装置、２８はキ−ボ−
ドディスプレイ等の各種のコマンドを入力する入力装
置、２５は営業店サ−バでハ−ドウェア障害が発生した
とき、そのサ−バに接続された周辺装置内ユニット毎に
採取される障害詳細情報とエラ−統計情報を示すエラ−
ログ情報、および統合サ−バの下位の全営業店サ−バ２
３より収集したエラ−ログ情報を記録するための外部記
憶装置、２Ａはシステム全体としての各種情報を編集
し、リスト出力するためのプリンタ、２Ｂは保守拠点に
設けられた保守端末の処理装置であって、統合サ−バ処
理装置２７との間は回線により接続される。また、２Ｃ
は、統合サ−バで一元管理／集中管理されたエラ−ログ
情報を定期的に受信しながら記録するための外部記録装
置、２Ｄは保守拠点における各種情報を編集し、リスト
出力するプリンタである。Further, the uppermost 27 in FIG. 1 is an integrated server processing device, which is an integrated server processing device for centralized / centralized management of all business office systems. A device for processing the data collected from the server, 28 is a keyboard
An input device for inputting various commands such as a hard disk display, and 25 is a detailed information of a failure collected for each unit in the peripheral device connected to the server when a hardware failure occurs in the sales office server. And an error indicating statistical information
Log information and all sales office servers under the integrated server 2
3 is an external storage device for recording the error log information collected from 3, 2A is a printer for editing various information of the entire system, and is a list output, and 2B is a processing device of a maintenance terminal provided at a maintenance base. Therefore, the integrated server processing device 27 is connected by a line. Also, 2C
Is an external recording device for recording while periodically receiving error log information centrally managed / centralized by the integrated server, and 2D is a printer for editing various information at the maintenance base and outputting the list. .

【０００８】図３は、本発明におけるエラ−ログ情報の
採取、収集、編集、または転送の各機能の説明図であ
る。図３において、３０はクライアント処理装置、３１
は営業店サ−バ処理装置、３２は統合サ−バ処理装置、
３３は保守端末、３４，３５，３６はいずれもエラ−ロ
グ情報ファイル、３７，３９は編集リスト出力、３８は
受信ファイルである。クライアント処理装置３０は、ハ
−ドウェア障害を検知すると、障害詳細情報およびエラ
−統計情報を示すエラ−ログ情報をクライアント処理装
置３０に接続された外部記憶装置内のエラ−ログ情報フ
ァイル３４に採取する。また、営業店サ−バ処理装置３
１は、下位のクライアント３０内に採取されたエラ−ロ
グ情報を収集して、営業店サ−バ処理装置３１に接続さ
れた外部記憶装置内のエラ−ログ情報ファイル３５に書
き込む。また、その営業店サ−バ処理装置３１におい
て、ハ−ドウェア障害を検知すると、エラ−ログ情報を
営業店サ−バ３１に接続された外部記憶装置内のエラ−
ログ情報ファイル３５に採取する。FIG. 3 is an explanatory diagram of each function of collecting, collecting, editing, or transferring error log information according to the present invention. In FIG. 3, 30 is a client processing device, 31
Is a sales office server processing device, 32 is an integrated server processing device,
33 is a maintenance terminal, 34, 35 and 36 are all error log information files, 37 and 39 are edit list outputs, and 38 is a received file. When the client processing device 30 detects a hardware failure, the client processing device 30 collects error log information indicating detailed error information and error statistical information in an error log information file 34 in an external storage device connected to the client processing device 30. To do. In addition, the server processing device 3
1 collects the error log information collected in the lower level client 30 and writes it in the error log information file 35 in the external storage device connected to the sales office server processing device 31. Further, when the sales office server processing device 31 detects a hardware failure, the error log information is stored in the external storage device connected to the sales office server 31.
Collect in the log information file 35.

【０００９】次に、統合サ−バ処理装置３２は、下位の
営業店サ−バ処理装置３１内に採取されたエラ−ログ情
報を収集し、統合サ−バ処理装置３２に接続された外部
記憶装置内の外部記憶装置内のエラ−ログ情報ファイル
３６に書き込む。また、統合サ−バ処理装置３２は、そ
の処理装置内でハ−ドウェア障害を検知すると、エラ−
ログ情報を統合サ−バ処理装置３２に接続された外部記
憶装置内のエラ−ログ情報ファイル３６に採取する。統
合サ−バ処理装置３２は、収集／採取されたエラ−ログ
情報を編集して、出力装置３７にリスト出力する。一
方、保守拠点に設置された保守端末３３は、統合サ−バ
処理装置３２から回線を介して定期的に転送されたその
保守拠点のエラ−ログ情報を編集して、出力装置３９に
リスト出力する。図４は、図１における外部記憶装置に
記録されるエラ−ログ情報ファイルのフォ−マット構成
を示す図である。図４において、４０はエラ−ログ情報
ファイル、４１はエラ−ログ情報ファイル４０内の特定
エリアに記録されている障害詳細情報管理テ−ブル、４
２はエラ−統計情報管理テ−ブルである。以下、図４に
示す障害詳細情報管理テ−ブル４１とエラ−統計情報管
理テ−ブル４２の形式を、図５および図６に示す。Next, the integrated server processing device 32 collects the error log information collected in the subordinate office server processing device 31, and the external server connected to the integrated server processing device 32. Write to the error log information file 36 in the external storage device in the storage device. When the integrated server processing device 32 detects a hardware failure in the processing device, the integrated server processing device 32 returns an error.
The log information is collected in the error log information file 36 in the external storage device connected to the integrated server processing device 32. The integrated server processing device 32 edits the collected / collected error log information and outputs it as a list to the output device 37. On the other hand, the maintenance terminal 33 installed in the maintenance base edits the error log information of the maintenance base periodically transferred from the integrated server processing unit 32 via the line and outputs the list to the output unit 39. To do. FIG. 4 is a diagram showing the format structure of the error log information file recorded in the external storage device in FIG. In FIG. 4, 40 is an error log information file, 41 is a detailed error information management table recorded in a specific area in the error log information file 40, 4
Reference numeral 2 is an error statistical information management table. The formats of the fault detailed information management table 41 and the error statistical information management table 42 shown in FIG. 4 are shown below in FIGS. 5 and 6.

【００１０】図５において、５０はハ−ドウェア障害時
の周辺装置内ユニットに対する障害詳細項目の集合体
（Ｎ個存在する）の障害詳細情報管理テ−ブル、５１は
その周辺装置内ユニットに対する障害詳細項目１であ
り、同じように５２〜５Ｎはそれぞれ障害詳細項目２〜
Ｎである。障害詳細項目１には、障害詳細項目５１に対
する属性情報テ−ブル５１０がある。５１１，５１２，
・・・５１Ｍは、障害詳細項目５１内の属性情報テ−ブ
ル５１０を構成するＭ個のプリミティブな属性情報（Ａ
ttribute）である。同じようにして、障害詳細項目５
２，・・・・・５Ｎに対しても、それぞれＭ個のプリミ
ティブな属性情報がある。図６において、６０はハ−ド
ウェア障害時の周辺装置内ユニットに対するエラ−統計
項目の集合体（Ｎ個存在する）のエラ−統計情報管理テ
−ブル、６１はその周辺装置内ユニット内に対するエラ
−統計項目１であり、同じように６２〜６Ｎはそれぞれ
エラ−統計項目２〜Ｎである。エラ−統計項目１には、
エラ−統計項目６１内の属性情報テ−ブル６１０があ
る。６１１，６１２，・・・・６１Ｍは、エラ−統計項
目６１内の属性情報テ−ブル６１０を構成するＭ個のプ
リミティブな属性情報（Attribute)である。同じように
して、エラ−統計項目６２，・・・・６Ｎに対しても、
それぞれＭ個のプリミティブな属性情報がある。このよ
うに、属性情報で統一化すれば、属性情報に従って編集
すればよいので、編集がやり易くなる。In FIG. 5, reference numeral 50 is a failure detail information management table of a collection of failure detail items (there are N pieces) of a unit in a peripheral apparatus at the time of a hardware failure, and 51 is a failure in the unit in the peripheral apparatus. It is the detailed item 1, and similarly 52 to 5N are the fault detailed items 2 to 2, respectively.
N. The fault detail item 1 includes an attribute information table 510 for the fault detail item 51. 511, 512,
... 51M is M primitive attribute information (A of the attribute information table 510 in the fault detail item 51).
ttribute). In the same way, detail item 5
There are M pieces of primitive attribute information for 2, ..., 5N, respectively. In FIG. 6, reference numeral 60 is an error statistical information management table of an aggregate of error statistical items (there are N pieces) for the unit in the peripheral device at the time of hardware failure, and 61 is an error for the unit in the peripheral device. -Statistics item 1 and similarly 62 to 6N are error statistics items 2 to N, respectively. Error statistics item 1
There is an attribute information table 610 in the error statistical item 61. 611, 612, ..., 61M are M pieces of primitive attribute information (Attribute) forming the attribute information table 610 in the error statistical item 61. Similarly, for the error statistical items 62, ... 6N,
There are M pieces of primitive attribute information. In this way, if the attribute information is unified, it is sufficient to edit according to the attribute information, which facilitates editing.

【００１１】図７は、図５および図６のプリミティブな
属性情報の形式を示す図である。図７において、７０は
個々の属性情報に対する属性識別子であり、一意の識別
番号が与えられる。７１は属性情報のデ−タ内容７２に
対するデ−タ長である。また、７２は属性識別子７０で
示されるデ−タ内容、つまりデ−タ長７１だけの属性情
報の取り得る値である。この方法の考え方としては、個
々の属性情報に対してモジュ−ラ構造的な共通仕様と
し、柔軟性、拡張性、保守性を考えた汎用化、標準化を
図るためのオブジェクト指向に基づいている。図８は、
図５の属性情報テ−ブルをマッピングした場合の障害詳
細項目の形式を示す図である。ここでは、Ｍ＝７個のプ
リミティブな属性情報により構成されている。図８にお
いて、８０は障害詳細項目を示す項目種別、８１は周辺
装置が設置されている設置場所、８２は周辺装置の装置
名称、８３は周辺装置内ユニットで障害が発生した日
付、８４は周辺装置内ユニットにおける障害の名称、８
５は障害コ−ド、８６は障害発生時の各種レジスタやセ
ンサの情報等を示す障害詳細デ−タである。なお、障害
コ−ド８５とは、どのユニットの障害かを表わす障害発
生部位コ−ドであって、どのような現象であるかを示す
障害現象コ−ドとどのような原因であるかを表わす障害
原因コ−ドの分類体系で構成されており、これにより障
害の切り分けができる分解能を示しており、またある選
択された期間で障害が発生した時のその障害コ−ドに対
する障害発生合計件数および障害発生日等の項目からな
る障害発生推移リストを出力するためのキ−項目でもあ
る。FIG. 7 is a diagram showing a format of the primitive attribute information of FIGS. 5 and 6. In FIG. 7, reference numeral 70 denotes an attribute identifier for each attribute information, which is given a unique identification number. Reference numeral 71 is a data length for the data content 72 of the attribute information. Further, 72 is a data content indicated by the attribute identifier 70, that is, a possible value of the attribute information of only the data length 71. The method is based on an object-oriented approach for generalization and standardization in which each attribute information has a modular structure common specification and flexibility, expandability, and maintainability are considered. Figure 8
It is a figure which shows the format of the failure detailed item at the time of mapping the attribute information table of FIG. Here, it is composed of M = 7 pieces of primitive attribute information. In FIG. 8, 80 is an item type indicating detailed items of failure, 81 is an installation location where the peripheral device is installed, 82 is a device name of the peripheral device, 83 is a date when a failure occurs in a unit in the peripheral device, and 84 is a peripheral device. Name of the fault in the device unit, 8
Reference numeral 5 is a fault code, and 86 is fault detail data indicating information of various registers and sensors when a fault occurs. The fault code 85 is a fault occurrence site code indicating which unit has a fault, and a fault phenomenon code indicating what kind of phenomenon and what cause. It is composed of the classification system of the fault cause code that is shown, which shows the resolution that can isolate the fault, and the total fault occurrence for the fault code when the fault occurs in a certain selected period. It is also a key item for outputting a failure occurrence transition list including items such as the number of cases and the failure occurrence date.

【００１２】図９は、図６における属性情報テ−ブルを
マッピングした時のエラ−統計項目の形式を示す図であ
る。ここでは、Ｍ＝９個のプリミティブな属性情報によ
り構成されている。図９において、９０はエラ−統計項
目を示す項目種別、９１は周辺装置が設置されている装
置設置場所、９２は周辺装置の装置名称、９３は周辺装
置内のユニット名称、９４は周辺装置内ユニットのエラ
−カウンタの名称、９５はエラ−カウンタ、９６は周辺
装置内ユニットに入出力起動命令を発行する時の起動回
数、９７はエラ−発生件数の閾値、９８はエラ−発生率
の閾値を示している。なお、エラ−発生件数閾値９７
は、上位システムからの収集コマンドにより、エラ−カ
ウンタ９５の値がそのエラ−発生件数閾値９７を超えた
エラ−統計項目だけを、上位システムに収集する時に判
断する閾値である。また、エラ−発生率閾値９８は、上
位システムからの収集コマンドにより、エラ−カウンタ
９５の値を起動回数９６で割算した値がエラ−発生率閾
値９８を超えたエラ−統計項目だけを、上位システムに
収集する時に判断する閾値である。すなわち、これらの
２つの項目９７，９８は、上位システムに収集するエラ
−統計項目の情報量を最小限にし、収集時間を短縮する
ために必要な項目となる。なお、これらのエラ−発生件
数閾値９７とエラ−発生率閾値９８の各値は、統合サ−
バ処理装置２７の入力装置２８、あるいは営業店サ−バ
処理装置２３の入力装置２４からコマンド指示が入力さ
れることにより、初期設定あるいは値の更新が行われ
る。FIG. 9 is a diagram showing the format of error statistical items when the attribute information table in FIG. 6 is mapped. Here, it is composed of M = 9 pieces of primitive attribute information. In FIG. 9, 90 is an item type indicating an error statistical item, 91 is a device installation location in which the peripheral device is installed, 92 is a device name of the peripheral device, 93 is a unit name in the peripheral device, and 94 is in the peripheral device. The name of the error counter of the unit, 95 is an error counter, 96 is the number of times of activation when an input / output activation command is issued to the unit in the peripheral device, 97 is a threshold of the number of error occurrences, 98 is a threshold of the error occurrence rate Is shown. In addition, the error occurrence threshold 97
Is a threshold value which is determined when collecting in the upper system only the error statistical item in which the value of the error counter 95 exceeds the error occurrence number threshold value 97 by the collecting command from the upper system. Further, the error occurrence rate threshold 98 is set only for the error statistical items for which the value obtained by dividing the value of the error counter 95 by the number of activations 96 exceeds the error occurrence rate threshold 98 by the collection command from the upper system. It is a threshold value that is judged when collecting data in the host system. That is, these two items 97 and 98 are necessary items for minimizing the information amount of error statistical items collected in the host system and shortening the collection time. The values of the error occurrence count threshold value 97 and the error occurrence rate threshold value 98 are the same as those of the integrated service.
Initialization or updating of values is performed by inputting a command instruction from the input device 28 of the bar processing device 27 or the input device 24 of the sales office server processing device 23.

【００１３】図１０は、本発明における障害発生推移リ
ストの出力フォ−マット例を示す図である。図５の障害
詳細情報管理テ−ブルを編集し、これを障害発生推移リ
ストとして営業店サ−バ２３あるいは統合サ−バ２７内
の出力装置２６，２Ａ、または保守端末２８内の出力装
置２Ｄに出力するときには、図１０に示すようなリスト
形式となる。リストの項目には、保守拠点名称、装置設
置場所、障害名称、障害コ−ド、発生件数、障害詳細デ
−タ、および障害発生日等が出力され、それらの各項目
に対応して具体的名称と各値が出力される。図１１は、
本発明におけるエラ−統計項目の出力フォ−マット例を
示す図である。図６のエラ−統計情報管理テ−ブルを編
集し、これをエラ−統計情報における異常値リストとし
て営業店サ−バ２３あるいは統合サ−バ２７および保守
端末２８に接続された各出力装置２６，２Ａ，２Ｄに出
力するときには、図１１に示すようなリスト形式とな
る。リストの項目としては、保守拠点名称、装置設置場
所、装置名称、ユニット名称、エラ−カウンタ名称、エ
ラ−カウンタ値、起動回数、およびエラ−発生率等が出
力され、それらの各項目に対応して具体的名称と各値が
出力される。FIG. 10 is a diagram showing an example of the output format of the failure occurrence transition list according to the present invention. The detailed fault information management table of FIG. 5 is edited and used as a fault occurrence transition list to output devices 26, 2A in the sales office server 23 or integrated server 27, or an output device 2D in the maintenance terminal 28. When output to, the list format is as shown in FIG. In the items of the list, maintenance site name, equipment installation location, failure name, failure code, number of occurrences, failure detail data, failure occurrence date, etc. are output. The name and each value are output. FIG. 11 shows
It is a figure which shows the output format example of an error statistical item in this invention. The error statistical information management table shown in FIG. 6 is edited, and this is used as an abnormal value list in the error statistical information to output server 26 connected to sales office server 23 or integrated server 27 and maintenance terminal 28. , 2A, 2D, the list format is as shown in FIG. As the list items, maintenance site name, device installation location, device name, unit name, error counter name, error counter value, number of startups, error occurrence rate, etc. are output and correspond to each of these items. The specific name and each value are output.

【００１４】図１２〜図１７は、本発明の一実施例を示
すエラ−ログ情報管理方法の動作フロ−チャ−トであ
る。図１２には、クライアント処理装置２０、営業店サ
−バ処理装置２３、統合サ−バ処理装置２７において、
障害詳細情報とエラ−統計情報を示すエラ−ログ情報を
ハ−ドディスク等の外部記憶装置内に採取する場合の各
処理装置２０，２３，２７内のプログラム実行を示すフ
ロ−が示されている。各処理装置２０，２３，２７，お
よび保守拠点２Ｂには、外部記憶装置２２，２５，２
９，２Ｃが接続され、エラ−ログ情報ファイルが記憶さ
れている。図１２では、先ずこれらハ−ドディスク等の
外部記憶装置２２，２５，２９，２Ｃ内に採取するエラ
−ログ情報ファイル内のエラ−統計情報管理テ−ブル４
２に対して初期設定処理を行う（ステップ１００）。初
期設定方法としては、図９のエラ−統計項目内の項目種
別９０、装置設置場所９１、装置名称９２、ユニット名
称９３、エラ−カウンタ名称９４、エラ−カウンタ値９
５、起動回数９６、エラ−発生件数閾値９７、およびエ
ラ−発生率閾値９８を、該当する周辺装置およびユニッ
トが構成する情報に従って初期設定する。なお、上記各
項目のうち、エラ−発生件数閾値９７とエラ−発生率閾
値９８は、上位システムからのコマンド指示により初期
設定または変更を行うことができる。FIGS. 12 to 17 are operation flowcharts of the error log information management method showing an embodiment of the present invention. FIG. 12 shows a client processing device 20, a branch server processing device 23, and an integrated server processing device 27.
A flow chart showing program execution in each processing unit 20, 23, 27 when error log information showing detailed error information and error statistical information is collected in an external storage device such as a hard disk is shown. There is. External storage devices 22, 25, 2 are provided in each of the processing devices 20, 23, 27 and the maintenance base 2B.
9 and 2C are connected and an error log information file is stored. In FIG. 12, first, the error statistical information management table 4 in the error log information file to be collected in the external storage device 22, 25, 29, 2C such as the hard disk.
An initial setting process is performed for 2 (step 100). As the initial setting method, the item type 90, the device installation place 91, the device name 92, the unit name 93, the error counter name 94, and the error counter value 9 in the error statistics item of FIG.
5, the number of activations 96, the error occurrence number threshold 97, and the error occurrence rate threshold 98 are initialized according to the information configured by the corresponding peripheral device and unit. Among the above items, the error occurrence count threshold value 97 and the error occurrence rate threshold value 98 can be initialized or changed by a command from the host system.

【００１５】次に、クライアント処理装置２０、営業店
サ−バ処理装置２３および統合サ−バ処理装置２７にお
ける実行が、通常業務処理中であるか否かを判断する
（ステップ１０１）。通常業務処理中であれば、周辺装
置内ユニットに対して入出力起動命令を発行する毎に、
エラ−統計情報管理テ−ブル６０の周辺装置内ユニット
に対応したエラ−統計項目内の起動回数９６を更新する
（ステップ１０２）。次に、周辺装置内ユニットにおい
て障害を検知した時には（ステップ１０３）、回復可能
な障害であるか否かを判断する（ステップ１０４）。回
復可能な障害であれば、障害回復処理を実行する（ステ
ップ１０５）。次に、外部記憶装置内に採取した図４の
エラ−統計情報管理テ−ブル４２の該当するエラ−統計
項目内のエラ−カウンタ９５の値を１だけ更新する（ス
テップ１０６）。次に、障害回復処理時には、障害回復
が成功したか否かを判断する（ステップ１０７）。障害
回復が成功したならば、業務続行処理を行う（ステップ
１０９）。一方、障害回復が不成功のときには、障害回
復処理の再試行回数が規定回数をオ−バ−したか否かを
判断する（ステップ１０８）。障害回復処理の再試行回
数がオ−バ−していなければ、ステップ１０５に戻っ
て、再度、障害回復処理を実行する。Next, it is judged whether or not the executions in the client processing device 20, the branch server processing device 23, and the integrated server processing device 27 are in the normal business process (step 101). If normal business processing is in progress, each time an I / O start-up command is issued to the peripheral unit,
The number of times of activation 96 in the error statistical item corresponding to the unit in the peripheral device of the error statistical information management table 60 is updated (step 102). Next, when a failure is detected in the peripheral device unit (step 103), it is determined whether or not the failure is a recoverable failure (step 104). If it is a recoverable failure, failure recovery processing is executed (step 105). Next, the value of the error counter 95 in the corresponding error statistical item of the error statistical information management table 42 of FIG. 4 collected in the external storage device is updated by 1 (step 106). Next, during failure recovery processing, it is determined whether failure recovery has succeeded (step 107). If the failure recovery is successful, business continuation processing is performed (step 109). On the other hand, when the failure recovery is unsuccessful, it is determined whether or not the number of retries of the failure recovery processing has exceeded the specified number (step 108). If the number of retries of the failure recovery processing is not over, the process returns to step 105 and the failure recovery processing is executed again.

【００１６】障害回復処理の再試行回数が規定回数より
オ−バ−していたときには（ステップ１０８）、該当す
る周辺装置内ユニットは回復不能障害であるとみなし
て、図８に示す障害詳細項目内に項目種別８０、装置設
置場所８１、装置名称８２、障害発生日付８３、障害名
称８４、障害コ−ド８５、および障害詳細デ−タ８６を
設定した後、外部記憶装置内の図５に示す障害詳細情報
管理テ−ブル５０の障害詳細項目５Ｎの後に追加モ−ド
で次の各事項を書き込む。すなわち、障害詳細項目を設
定して外部記憶装置に採取し（ステップ１１０）、次に
縮退運用が可能であるか否かを判断し（ステップ１１
１）、縮退運用が可能であれば、縮退運用に切り替える
処理を行う（ステップ１１２）。そして、最初の処理ス
テップ１０１に戻る（Ａ）。また、縮退運用が不可能な
場合には、システムダウンの処理を行う（ステップ１１
３）。なお、ステップ１０４において、当該周辺装置内
ユニットが回復不能障害であれば、直ちに障害詳細項目
を採取する（ステップ１１０）。When the number of retries of the failure recovery processing is over the specified number of times (step 108), the relevant peripheral unit is regarded as an unrecoverable failure and the failure detail items shown in FIG. After setting the item type 80, the device installation location 81, the device name 82, the fault occurrence date 83, the fault name 84, the fault code 85, and the fault detailed data 86 in the external storage device, refer to FIG. The following items are written in the additional mode after the fault detail item 5N of the fault detailed information management table 50 shown. That is, the failure detail items are set and collected in the external storage device (step 110), and then it is determined whether or not degenerate operation is possible (step 11).
1) If degenerate operation is possible, a process for switching to degenerate operation is performed (step 112). Then, the process returns to the first processing step 101 (A). If degenerate operation is not possible, system down processing is performed (step 11).
3). In step 104, if the peripheral device unit is an unrecoverable failure, the failure detail item is immediately collected (step 110).

【００１７】図１３には、該当するクライアント処理装
置２０が営業店サ−バ処理装置２３から収集コマンドを
受信した後、そのクライアント処理装置２０で採取され
た障害詳細情報およびエラ−統計情報を示すエラ−ログ
情報を営業店サ−バ２３に送信する場合における処理装
置２０内のプログラムの実行処理フロ−が示されてい
る。先ず、該当するクライアント処理装置２０が営業店
サ−バ処理装置２３から収集コマンドを受信すると（ス
テップ１２０）、処理装置２０は採取された図５に示す
障害詳細情報管理テ−ブル５０の中の全ての障害詳細項
目を営業店サ−バ処理装置２３に送信する（ステップ１
２１）。障害詳細項目の全てが終了した後、障害詳細情
報管理テ−ブル５０をクリアする（ステップ１２２）。
次に、外部記憶装置２２内に採取された図６に示すエラ
−統計情報管理テ−ブル６０の中で、エラ−カウンタ９
５の値がエラ−発生件数閾値９７を超えたエラ−統計項
目、または起動回数９６をエラ−カウンタ９５の値で割
算した値がエラ−発生率閾値９８を超えたエラ−統計項
目だけを営業店サ−バ処理装置２３に送信する（ステッ
プ１２３）。全ての送信を終了した後、エラ−統計情報
管理テ−ブル６０をクリアする（ステップ１２４）。FIG. 13 shows fault detail information and error statistical information collected by the client processing device 20 after the client processing device 20 receives the collection command from the sales office server processing device 23. The execution processing flow of the program in the processing device 20 when the error log information is transmitted to the sales office server 23 is shown. First, when the corresponding client processing device 20 receives a collection command from the sales office server processing device 23 (step 120), the processing device 20 extracts the failure detailed information management table 50 shown in FIG. All detail items of failure are transmitted to the sales office server processing device 23 (step 1).
21). After all the failure detail items have been completed, the failure detail information management table 50 is cleared (step 122).
Next, in the error statistical information management table 60 shown in FIG. 6 collected in the external storage device 22, the error counter 9
Only the error statistical item whose value of 5 exceeds the error occurrence threshold 97 or the error statistical item whose number of activations 96 divided by the value of the error counter 95 exceeds the error occurrence threshold 98 It is transmitted to the sales office server processing device 23 (step 123). After completing all the transmissions, the error statistical information management table 60 is cleared (step 124).

【００１８】図１４には、該当する営業店サ−バ処理装
置２３が統合サ−バ処理装置２７から収集コマンドを受
信した後、その営業店サ−バ処理装置２３で採取した障
害詳細情報およびエラ−統計情報を示すエラ−ログ情報
またはその営業店サ−バ処理装置２３の下位にあるクラ
イアント処理装置２０より収集した障害詳細情報および
エラ−統計情報を示すエラ−ログ情報を統合サ−バ処理
装置２７に送信する場合の処理装置２３内のプログラム
実行動作フロ−が示されている。先ず、該当する営業店
サ−バ処理装置２３は、統合サ−バ処理装置２７から収
集コマンドを受信する（ステップ１３０）。次に、その
営業店サ−バ処理装置２３で採取または収集された障害
詳細情報管理テ−ブル５０を、統合サ−バ処理装置２７
に送信する（ステップ１３１）。全ての送信を終了した
後、障害詳細情報管理テ−ブル５０をクリアする（ステ
ップ１３２）。さらに、外部記憶装置内に採取または収
集されたエラ−統計情報管理テ−ブル６０の中で、エラ
−カウンタ９５の値がエラ−発生件数閾値９７を超えた
エラ−統計項目、または起動回数９６をエラ−カウンタ
９５の値で割算した値がエラ−発生率閾値９８を超えた
エラ−統計項目だけを、統合サ−バ処理装置２７に送信
する（ステップ１３３）。全ての送信が終了した後、エ
ラ−統計情報管理テ−ブル９０をクリアする（ステップ
１３４）。FIG. 14 shows detailed fault information collected by the sales office server processing device 23 after the sales office server processing device 23 receives the collection command from the integrated server processing device 27. The error log information indicating the error statistical information or the failure detailed information collected from the client processing device 20 below the branch server processing device 23 and the error log information indicating the error statistical information are integrated into the server. A program execution operation flow in the processing device 23 when transmitting to the processing device 27 is shown. First, the corresponding sales office server processing device 23 receives a collection command from the integrated server processing device 27 (step 130). Next, the failure detailed information management table 50 collected or collected by the sales office server processing device 23 is transferred to the integrated server processing device 27.
(Step 131). After all the transmissions have been completed, the fault detailed information management table 50 is cleared (step 132). Further, in the error statistical information management table 60 collected or collected in the external storage device, the error statistical item in which the value of the error counter 95 exceeds the error occurrence number threshold value 97, or the number of activations 96 Is divided by the value of the error counter 95 and exceeds the error occurrence rate threshold 98, only the error statistical items are transmitted to the integrated server processing device 27 (step 133). After all transmission is completed, the error statistical information management table 90 is cleared (step 134).

【００１９】図１５には、営業店サ−バ処理装置２３か
らの収集コマンドに従い、下位にある全てのクライアン
ト処理装置２０に採取されたエラ−ログ情報を受信する
場合における処理装置２３内のプログラム実行処理フロ
−が示されている。先ず、営業店サ−バ処理装置２３の
入力装置２４から収集コマンドが入力されると、処理装
置２３は、そのコマンドに従って下位にある全てのクラ
イアント処理装置２０に採取された障害詳細情報管理テ
−ブル５０およびエラ−統計情報管理テ−ブル６０の収
集指示を送信する（ステップ１４０）。下位にある全て
のクライアント処理装置２０から障害詳細情報管理テ−
ブル５０およびエラ−統計情報管理テ−ブル６０を受信
して、これらを外部記憶装置２５に書き込む（ステップ
１４１）。上記テ−ブル５０，６０の全ての受信が終了
すると、営業店サ−バ処理装置２３は、次に障害詳細情
報管理テ−ブル５０およびエラ−統計情報管理テ−ブル
６０に設定された属性情報に従って編集を行い、図１０
に示す障害発生推移リストおよび図１１に示すエラ−統
計情報における異常値リストを作成する（ステップ１４
２）。次に、これらのリストを出力装置２６に出力する
（ステップ１４３）。FIG. 15 shows a program in the processing device 23 when receiving the error log information collected by all the client processing devices 20 in the lower order according to the collection command from the sales office server processing device 23. The execution process flow is shown. First, when a collection command is input from the input device 24 of the branch server processing device 23, the processing device 23 causes the failure detailed information management table collected in all the client processing devices 20 below to follow the command. An instruction to collect the bull 50 and the error statistical information management table 60 is transmitted (step 140). Fault detail information management tables are sent from all the client processing devices 20 in the lower order.
The cable 50 and the error statistical information management table 60 are received and written in the external storage device 25 (step 141). When all of the above-mentioned tables 50 and 60 have been received, the branch server processing device 23 next sets the attributes set in the fault detailed information management table 50 and the error statistical information management table 60. Editing according to the information,
The failure occurrence transition list shown in FIG. 11 and the abnormal value list in the error statistical information shown in FIG. 11 are created (step 14
2). Next, these lists are output to the output device 26 (step 143).

【００２０】図１６には、統合サ−バ処理装置２７から
の収集コマンドに従って、下位にある全ての営業店サ−
バ処理装置２３に採取または収集されたエラ−ログ情報
を受信する場合における処理装置２７のプログラム実行
処理フロ−が示されている。先ず、統合サ−バ処理装置
２７の入力装置２８から収集コマンドが入力されると、
処理装置２７は、下位にある全ての営業店サ−バ処理装
置２３に採取または収集された障害詳細情報管理テ−ブ
ル５０およびエラ−統計情報管理テ−ブル６０の収集指
示を送信する（ステップ１５０）。次に、処理装置２７
は、下位の全ての営業店サ−バ処理装置２３から障害詳
細情報管理テ−ブル５０およびエラ−統計情報管理テ−
ブル６０を受信して、これらを外部記憶装置２９に書き
込む（ステップ１５１）。全ての受信が終了すると、障
害詳細情報管理テ−ブル５０およびエラ−統計情報管理
テ−ブル６０に設定された属性情報に従って編集し、図
１０に示す障害発生推移リストおよび図１１に示すエラ
−統計情報における異常値リストを作成する（ステップ
１５２）。これらのリストを出力装置２Ａに出力する
（ステップ１５３）。次に、処理装置２７は、外部記憶
装置２９に書き込まれた障害詳細情報管理テ−ブル５０
およびエラ−統計情報管理テ−ブル６０を定期的、例え
ば毎日、周毎、月単位に読み出して、保守拠点毎に分類
し、これらを保守端末に転送する（ステップ１５４）。In FIG. 16, according to the collection command from the integrated server processing unit 27, all the subordinate sales office servers are displayed.
The program execution processing flow of the processing device 27 when the error log information collected or collected by the processing device 23 is received is shown. First, when a collecting command is input from the input device 28 of the integrated server processing device 27,
The processing device 27 sends a collection instruction of the fault detailed information management table 50 and the error statistical information management table 60 collected or collected to all the branch office server processing devices 23 (step). 150). Next, the processing device 27
Is the fault detailed information management table 50 and the error statistical information management table from all the subordinate server server processing devices 23.
Bull 60 is received and these are written in the external storage device 29 (step 151). When all the reception is completed, it is edited according to the attribute information set in the fault detailed information management table 50 and the error statistical information management table 60, and the fault occurrence transition list shown in FIG. 10 and the error shown in FIG. An abnormal value list in the statistical information is created (step 152). These lists are output to the output device 2A (step 153). Next, the processing device 27 causes the fault detailed information management table 50 written in the external storage device 29.
Also, the error statistical information management table 60 is read out periodically, for example, on a daily, weekly or monthly basis, classified by maintenance base, and transferred to the maintenance terminal (step 154).

【００２１】図１７には、統合サ−バ処理装置２７から
転送された障害詳細情報管理テ−ブル５０エラ−統計情
報管理テ−ブル６０を編集して、リスト出力する場合の
保守拠点における処理装置２Ｂ内のプログラム実行処理
フロ−が示される。処理装置２Ｂは、統合サ−バ処理装
置２７から定期的（毎日、周毎、月単位）に転送された
障害詳細情報管理テ−ブル５０およびエラ−統計情報管
理テ−ブル６０を受信して、これらを外部記憶装置２Ｃ
に書き込む（ステップ１６０）。全ての受信が終了する
と、障害詳細情報管理テ−ブル５０およびエラ−統計情
報管理テ−ブル６０に設定された属性情報に従って編集
し、図１０に示す障害発生推移リストおよび図１１に示
すエラ−統計情報における異常値リストを作成する（ス
テップ１６１）。それらのリストを出力装置２Ｄに出力
する（ステップ１６２）。一定の期間経過の後、外部記
憶装置２Ｃに書き込まれた障害詳細情報管理テ−ブル５
０およびエラ−統計情報管理テ−ブル６０をクリアする
（ステップ１６３）。FIG. 17 shows the processing at the maintenance base when the detailed error information management table 50 and the statistical information management table 60 transferred from the integrated server processing device 27 are edited and output as a list. A program execution process flow in the device 2B is shown. The processing unit 2B receives the fault detailed information management table 50 and the error statistical information management table 60 which are periodically (daily, weekly, monthly) transferred from the integrated server processing unit 27. , These are external storage devices 2C
(Step 160). When all the reception is completed, it is edited according to the attribute information set in the fault detailed information management table 50 and the error statistical information management table 60, and the fault occurrence transition list shown in FIG. 10 and the error shown in FIG. An abnormal value list in the statistical information is created (step 161). The list is output to the output device 2D (step 162). After a certain period of time has passed, the detailed fault information management table 5 written in the external storage device 2C is displayed.
0 and the error statistical information management table 60 are cleared (step 163).

【００２２】[0022]

【発明の効果】以上説明したように、本発明によれば、
クライアントサ−バシステム等の階層関係にある分散シ
ステムの各種周辺装置に発生したエラ−ログ情報を定量
的に管理することができるので、この情報をフィ−ドバ
ックデ−タとして有効に活用すると同時に、均一で効率
的な障害対策および日常点検、定期点検、異常の事前検
知等の予防保守を行うことが可能となる。As described above, according to the present invention,
Since it is possible to quantitatively manage error log information generated in various peripheral devices of a distributed system having a hierarchical relationship such as a client server system, at the same time effectively utilizing this information as feedback data, It is possible to carry out uniform and efficient fault countermeasures and preventive maintenance such as daily inspections, periodic inspections, and prior detection of abnormalities.

[Brief description of drawings]

【図１】本発明の一実施例を示すクライアントサ−バシ
ステムの構成の一部を示す図である。FIG. 1 is a diagram showing a part of a configuration of a client server system showing an embodiment of the present invention.

【図２】同じく、クライアントサ−バシステムの構成の
他の一部を示す図である。FIG. 2 is a diagram showing another part of the configuration of the client server system.

【図３】図１，図２におけるエラ−ログ情報の採取／収
集／編集／転送の各動作を示す説明図である。FIG. 3 is an explanatory diagram showing each operation of collecting / collecting / editing / transferring error log information in FIGS. 1 and 2;

【図４】図１，図２，図３における外部記憶装置に記録
されるエラ−ログ情報ファイルの要部フォ−マット図で
ある。FIG. 4 is a main part format diagram of an error log information file recorded in the external storage device in FIGS. 1, 2 and 3.

【図５】図４における障害詳細情報管理テ−ブルおよび
属性情報テ−ブルのフォ−マット図である。5 is a format diagram of a fault detailed information management table and an attribute information table in FIG.

【図６】図４におけるエラ−統計情報管理テ−ブルおよ
び属性情報テ−ブルのフォ−マット図である。6 is a format diagram of an error statistical information management table and an attribute information table in FIG.

【図７】図５，図６におけるプリミティブな属性情報の
形式を示す図である。FIG. 7 is a diagram showing a format of primitive attribute information in FIGS. 5 and 6;

【図８】図５における属性情報テ−ブルを具体的にマッ
ピングした場合の障害詳細項目のフォ−マット図であ
る。8 is a format diagram of fault detail items when the attribute information table in FIG. 5 is specifically mapped.

【図９】図６における属性情報テ−ブルを具体的にマッ
ピングした場合のエラ−統計項目のフォ−マット図であ
る。9 is a format diagram of error statistical items when the attribute information table in FIG. 6 is specifically mapped.

【図１０】図３における障害発生推移リストの出力フォ
−マット図である。10 is an output format diagram of the failure occurrence transition list in FIG.

【図１１】図３におけるエラ−統計情報の異常値リスト
の出力フォ−マット図である。11 is an output format diagram of an abnormal value list of error statistical information in FIG.

【図１２】図１，図２における各処理装置のプログラム
実行処理フロ−チャ−トである。FIG. 12 is a program execution processing flowchart of each processing apparatus in FIGS. 1 and 2;

【図１３】図１，図２において、クライアントで採取さ
れた障害詳細情報およびエラ−ログ情報を営業店サ−バ
に送信する場合のクライアント処理装置のプログラム実
行処理フロ−チャ−トである。FIG. 13 is a program execution processing flowchart of the client processing device in the case of transmitting the fault detail information and the error log information collected by the client in FIGS. 1 and 2 to the sales office server.

【図１４】図１，図２において、営業店サ−バで採取ま
たは収集された障害詳細情報およびエラ−ログ情報を統
合サ−バに送信する場合の営業店サ−バ処理装置のプロ
グラム実行処理フロ−チャ−トである。FIG. 14 is a program execution of the sales office server processing device in the case of transmitting the fault detail information and error log information collected or collected by the sales office server to the integrated server in FIGS. It is a processing flow chart.

【図１５】図１，図２において、クライアントに採取さ
れたエラ−ログ情報を受信する場合の営業店サ−バ処理
装置のプログラム実行処理フロ−チャ−トである。FIG. 15 is a program execution processing flowchart of the sales office server processing apparatus when receiving the error log information collected by the client in FIGS. 1 and 2;

【図１６】図１，図２において、営業店サ−バに採取ま
たは収集されたエラ−ログ情報を受信する場合の統合サ
−バ処理装置のプログラム実行処理フロ−チャ−トであ
る。16 is a program execution processing flowchart of the integrated server processing device when receiving the error log information collected or collected by the sales office server in FIG. 1 and FIG.

【図１７】図１，図２において、統合サ−バから転送さ
れた障害詳細情報管理テ−ブルおよびエラ−統計情報管
理テ−ブルを編集し、リスト出力する場合の保守拠点の
処理装置のプログラム実行処理フロ−チャ−トである。FIG. 17 is a diagram showing a processing unit of a maintenance base in the case of editing the failure detailed information management table and error statistical information management table transferred from the integrated server and outputting the list in FIGS. It is a program execution processing flowchart.

[Explanation of symbols]

２０，３０クライアント処理装置２３，３１営業店サ−バ処理装置２７，３２統合サ−バ処理装置２Ｂ，３３保守端末処理装置２１周辺装置２２，２５，２９，２Ｃ外部記憶装置２６，２Ａ，２Ｄ出力装置２４，２８入力装置４０エラ−ログ情報ファイル４１，５０障害詳細情報管理テ−ブル４２，６０エラ−統計情報管理テ−ブル５１〜５Ｎ障害詳細項目９１〜９Ｎエラ−統計項目５１０属性情報テ−ブル５１１〜５１Ｍ属性情報６１０〜６１Ｍ属性情報７０属性識別子７１デ−タ長７２属性情報取り得る値８０障害詳細項目を表す項目種別８１，９１装置設置場所８２装置名称８３障害発生日付８４，９２障害名称８５障害コ−ド８６障害詳細デ−タ９０エラ−統計項目を表す項目種別９３ユニット名称９４エラ−カウンタ名称９５エラ−カウンタ値９６起動回数９７エラ−発生件数閾値９８エラ−発生率閾値 20, 30 Client processing device 23, 31 Sales office server processing device 27, 32 Integrated server processing device 2B, 33 Maintenance terminal processing device 21 Peripheral device 22, 25, 29, 2C External storage device 26, 2A, 2D Output device 24,28 Input device 40 Error log information file 41,50 Fault detailed information management table 42,60 Error statistical information management table 51-5N Fault detailed item 91-9N Error statistical item 510 Attribute information Table 511-51M Attribute information 610-61M Attribute information 70 Attribute identifier 71 Data length 72 Attribute information possible values 80 Item type indicating detailed fault items 81, 91 Device installation location 82 Device name 83 Fault occurrence date 84, 92 Fault name 85 Fault code 86 Fault detailed data 90 Error item type indicating statistical items 93 Unit Name 94 Error counter name 95 Error counter value 96 Number of startups 97 Threshold number of error occurrences 98 Threshold value of error occurrence rate

Claims

[Claims]

1. A client to which peripheral devices such as an external storage device and an input / output device are respectively connected, a plurality of business office servers that manage the clients, and an integrated server that centrally manages the business office servers, In addition, in the client server system including a plurality of layers of maintenance bases connected to the integrated server, detailed error information and error statistical information of the hardware error collected for each unit in the peripheral device of each layer are displayed. The error log information shown is recorded in a specific area of the connected external storage device, and the recorded information is read out periodically and sequentially transmitted to the upper server, so that the information can be collectively sent to the upper server. After collecting the data, edit it to make it easier to see, and output the failure occurrence transition list of the detailed error information and the abnormal value list when the threshold of the error statistical information is exceeded to the output device of the upper server. At the same time, the error log information collected by the integrated server is transferred to the terminal at the maintenance base, and the edited maintenance list is output to the output device of the maintenance terminal to save the maintenance list. An error log information management method characterized in that it is used as feedback data for each peripheral device.