JP2001216166A

JP2001216166A - Maintenance control method for information processor, information processor, creating method for software and software

Info

Publication number: JP2001216166A
Application number: JP2000025359A
Authority: JP
Inventors: Yoshinori Igarashi; 良典五十嵐; Suketaka Nabekura; 祐高鍋倉
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2000-02-02
Filing date: 2000-02-02
Publication date: 2001-08-10

Abstract

PROBLEM TO BE SOLVED: To realize prompt recovery from a fault, preventive maintenance, etc., by efficiently performing remote maintenance of software of external storage devices such as distributively arranged disk array devices. SOLUTION: A disk array device installation site 604 is connected with a fault recovery assistance center 602 with a fault countermeasure data base with information such as a fault countermeasure ID to discriminate faults and the contents of countermeasure and a micro database in which a binary micro program in which the executed fault countermeasure ID and the contents of countermeasure are embedded is stored through a network 603, fault information is reported to the fault recovery assistance center 602 through the network 603 taking the opportunity of generation of a fault of a disk array device 601, the corresponding fault countermeasure ID is detected from the fault countermeasure database, binary retrieval is performed to the micro program itself in the micro database by the obtained fault countermeasure ID, the micro program with the applicable fault countermeasure ID is found, transmitted to the disk array device and updated there.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、情報処理装置の保
守管理技術および情報処理装置ならびにソフトウェアの
作成技術およびソフトウェアに関し、特に、遠隔地等に
分散して配置された複数の外部記憶装置等の情報処理装
置の保守管理等に適用して有効な技術に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a maintenance management technique for an information processing apparatus, an information processing apparatus, and a software creation technique and software, and in particular, to a plurality of external storage apparatuses and the like distributed in remote locations or the like. The present invention relates to a technique that is effective when applied to maintenance management of an information processing device.

【０００２】[0002]

【従来の技術】近年、オンラインシステム等による計算
機システムの急速な利用拡大に伴い、無停止運転・高信
頼性・高可用性のシステムが要求されている。この様な
システムで障害が発生した場合、システムの回復作業は
短時間に、しかも使用状態下で実施しなければならず、
回復手順での操作ミスや作業指示の誤りは絶対に許され
ない。もし、誤った手順で回復させると、２次障害も起
こし兼ねない。そこで、自動的に障害を回避できる部分
に対しては、保守員の介入なしに復旧作業が行なえれ
ば、保守ミスを防ぐことにも繋がる。2. Description of the Related Art In recent years, with the rapid expansion of the use of a computer system by an online system or the like, a system of non-stop operation, high reliability, and high availability has been required. If a failure occurs in such a system, the recovery of the system must be performed in a short period of time and in use.
Operation errors and work instruction errors in the recovery procedure are absolutely unacceptable. If recovery is performed by an incorrect procedure, a secondary failure may occur. Therefore, if a recovery operation can be performed on a portion that can automatically avoid a failure without the intervention of maintenance personnel, maintenance errors can be prevented.

【０００３】たとえば、計算機システムに接続される外
部記憶装置として広範に普及しているディスクアレイ装
置に着目すると、全世界には膨大数のマイクロプログラ
ムがディスクアレイ装置に組込まれて稼働中であり、あ
るマイクロプログラムバージョンで不良が発覚しても、
必ずしも対策したマイクロプログラムバージョンへの更
新を徹底することは困難である。そのため、不良対策は
個別に行なう必要があり、対策内容とマイクロプログラ
ムバージョンの管理が複雑になっている。For example, if attention is paid to a disk array device which is widely used as an external storage device connected to a computer system, a huge number of microprograms are incorporated in the disk array device in the world and are in operation. Even if a defect is discovered in one microprogram version,
It is difficult to always update the microprogram version to take measures. For this reason, it is necessary to individually take a countermeasure against a defect, and management of the content of the countermeasure and the microprogram version is complicated.

【０００４】[0004]

【発明が解決しようとする課題】上述のようにソフトウ
ェアの更新等の保守管理を人手で行った場合には、膨大
な工数を要するとともに、作業ミスを皆無とすることは
困難であり、冗長構成による格納データの高信頼性が売
り物のディスクアレイ装置ではデータ喪失等の重大な障
害が懸念される。When the maintenance management such as the software update is performed manually as described above, it requires a great number of man-hours and it is difficult to eliminate any work mistakes. However, there is a concern that a disk array device that has a high reliability of stored data due to data loss may cause a serious obstacle such as data loss.

【０００５】また、ソフトウェアのバージョン等の管理
については、ソースコードレベルでは注釈の付加によっ
て更新履歴を後から知ることが可能であるが、注釈は、
コンパイルやアセンブル等によるバイナリのロードモジ
ュールへの変換の課程で削除されるため、実際に装置に
実装されて稼働するバイナリのロードモジュールから、
当該ロードモジュールに関する変更履歴を知ることがで
きない。[0005] Regarding management of software version and the like, it is possible to know the update history later by adding an annotation at the source code level.
Since it is deleted in the process of converting to a binary load module by compiling and assembling, etc., from the binary load module that is actually mounted and operated on the device,
The change history of the load module cannot be known.

【０００６】従って、従来では、変更履歴の異なる多数
のロードモジュールを管理する場合、個々のロードモジ
ュールの名前と、当該ロードモジュールに対する障害対
策等の変更履歴とを対応付けて記録／管理するデータベ
ースが必須であり、ロードモジュールの管理運用に多大
なコストや労力が必要となっていた。Therefore, conventionally, when managing a large number of load modules having different change histories, a database for recording / managing the names of the individual load modules and the change histories such as countermeasures against failures for the load modules is provided. It is indispensable, and the management and operation of the load module requires a great deal of cost and labor.

【０００７】また、このような管理の煩雑さから、ロー
ドモジュールが複数のモジュール構成の場合、ソフトウ
ェアの管理や更新は複数のモジュールについて纏めて実
行されることとなり、一部のモジュールでの不具合が発
覚しても、ロードモジュール全体が入れ替え可能になる
まで更新等の対策が遅延したり、取り扱いの単位（たと
えば、ファイルサイズ）が大きくなり、可搬媒体や通信
媒体での搬送コストが嵩む、等の技術的課題もある。[0007] Also, due to such complicated management, when the load module has a plurality of module configurations, software management and update are executed collectively for a plurality of modules. Even if discovered, measures such as updating will be delayed until the entire load module can be replaced, the handling unit (eg, file size) will increase, and transport costs on portable media and communication media will increase. There are also technical issues.

【０００８】なお、プログラム制御の電子装置の遠隔保
守技術としては、従来、たとえば特開平６−１２５３８
９号公報に開示された技術が知られている。すなわち、
リモート保守装置をネットワークを介して複数のディジ
タル構内交換機（ＰＢＸ）に接続し、ＰＢＸ側から到来
する障害情報に基づいて作成される保守履歴のデータベ
ースと、障害対策情報と、当該障害対策情報に対応して
ＰＢＸ側に送出すべきデータを格納した記憶手段を設
け、保守履歴のデータベースに基づいて次の障害を予測
するとともに、個々の障害には対応する障害対策情報の
データをＰＢＸ側に送信する動作を行うものである。し
かしながら、この従来技術では、多種多様な多数のマイ
クロプログラム等のソフトウェアの障害対策履歴の効率
的な蓄積、検索、更新等の保守管理技術は開示されてい
ない。As a remote maintenance technique for a program-controlled electronic device, a conventional technique is disclosed, for example, in Japanese Patent Application Laid-Open No. 6-12538.
The technique disclosed in Japanese Patent Application Laid-Open No. 9-99 is known. That is,
A remote maintenance device is connected to a plurality of digital private branch exchanges (PBXs) via a network, and a database of maintenance history created based on failure information received from the PBX side, troubleshooting information, and corresponding troubleshooting information. A storage means for storing data to be transmitted to the PBX side is provided, the next failure is predicted based on the maintenance history database, and data of failure countermeasure information corresponding to each failure is transmitted to the PBX side. Perform the operation. However, this conventional technique does not disclose a maintenance management technique for efficiently storing, searching, updating, and the like of a failure countermeasure history of software such as a large number of various microprograms.

【０００９】本発明の目的は、分散して配置された複数
の情報処理装置に実装されたソフトウェアの保守管理
を、高い信頼性にて効率よく遂行することが可能な技術
を提供することにある。An object of the present invention is to provide a technique capable of efficiently and efficiently maintaining and managing software mounted on a plurality of information processing apparatuses distributed and arranged. .

【００１０】本発明の他の目的は、任意の情報処理装置
に実装されて稼働するバイナリのソフトウェアの保守管
理や運用を高い信頼性にて効率よく遂行することが可能
な技術を提供することにある。Another object of the present invention is to provide a technique capable of efficiently performing maintenance management and operation of binary software mounted and operated on an arbitrary information processing apparatus with high reliability. is there.

【００１１】本発明の他の目的は、任意の情報処理装置
に実装されて稼働するバイナリのソフトウェアの保守管
理や運用を、より小さなサイズ単位にきめ細かく迅速に
遂行することが可能な技術を提供することにある。Another object of the present invention is to provide a technology capable of performing maintenance management and operation of binary software mounted and operated on an arbitrary information processing device in a small size unit in a fine and quick manner. It is in.

【００１２】本発明の他の目的は、障害発生を契機に、
ネットワークを介して障害が対策されたプログラムをダ
ウンロードし、人員の介入無しにプログラムの入れ替え
作業を行う技術を提供することにある。[0012] Another object of the present invention is to provide a communication system in which
It is an object of the present invention to provide a technology for downloading a program in which a measure against a failure is downloaded via a network, and replacing the program without human intervention.

【００１３】本発明の他の目的は、不良部位の推定を実
行形式プログラムを構成する複数のロードモジュール単
位まで行い、更新もロードモジュール単位で行なえる様
にすることで、早期対策及びネットワーク負荷の軽減を
図ることが可能な技術を提供することにある。Another object of the present invention is to estimate a defective portion up to a plurality of load modules constituting an executable program and to perform updating in units of load modules so that early countermeasures and network load reduction can be achieved. It is an object of the present invention to provide a technology capable of reducing the amount.

【００１４】本発明の他の目的は、ソフトウェアが実装
された情報処理装置が自発的にネットワークを介して保
守支援センタにアクセスすることで、稼働中のプログラ
ムに不良があるかどうかを判断し、障害発生を未然に防
ぐ予防保守が可能な技術を提供することにある。Another object of the present invention is to determine whether or not a running program has a defect by allowing an information processing apparatus in which software is installed to voluntarily access a maintenance support center via a network. It is an object of the present invention to provide a technology capable of preventive maintenance for preventing a failure from occurring.

【００１５】本発明の他の目的は、実行形式のマイクロ
プログラム中にバイナリコードとして過去の障害対策内
容を埋め込むことで、マイクロプログラムバージョンと
障害対策内容を一元的に管理することが可能な技術を提
供することにある。Another object of the present invention is to provide a technique capable of centrally managing the microprogram version and the content of the troubleshooting by embedding the past troubleshooting content as a binary code in an executable microprogram. To provide.

【００１６】[0016]

【課題を解決するための手段】本発明の情報処理装置の
保守管理方法は、ソフトウェアで動作する少なくとも一
つの情報処理装置とソフトウェアを管理する保守支援セ
ンタとを情報ネットワークにて接続し、情報処理装置に
て障害が発生したとき、障害に対する対策が施されたソ
フトウェアを保守支援センタから情報処理装置に送信し
て当該情報処理装置におけるソフトウェアの更新を行う
ものである。According to a maintenance management method for an information processing apparatus of the present invention, at least one information processing apparatus operated by software is connected to a maintenance support center for managing the software via an information network. When a failure occurs in a device, software in which measures against the failure have been taken is transmitted from the maintenance support center to the information processing device to update the software in the information processing device.

【００１７】また、本発明の情報処理装置の保守管理方
法は、ソフトウェアで動作する情報処理装置とソフトウ
ェアを管理する保守支援センタとを情報ネットワークに
て接続し、任意の契機で、情報処理装置にて稼働中のソ
フトウェアおよび保守支援センタに保管されている複数
のソフトウェアの各々に実施済の障害対策を情報ネット
ワークを経由して比較判定し、稼働中のソフトウェアに
未実施の障害対策が、保守支援センタに保管されている
ソフトウェアに施されている場合には、情報処理装置の
保守管理者に潜在的な障害発生の警報を発する第１の操
作、および情報処理装置における稼働中のソフトウェア
を、情報ネットワークを経由して、保守支援センタに保
管されているソフトウェアにて更新する第２の操作、の
少なくとも一方の操作を実行するものである。Further, the maintenance management method for an information processing apparatus according to the present invention connects an information processing apparatus operated by software to a maintenance support center for managing the software via an information network. Of the software running and the multiple software stored at the maintenance support center are compared and determined via the information network. When applied to software stored in the center, the first operation for issuing a warning of a potential failure occurrence to the maintenance manager of the information processing device, and the software running on the information processing device, At least one of a second operation of updating with software stored in the maintenance support center via the network. It is intended to perform the work.

【００１８】また、本発明は、ソフトウェアで動作する
制御部と、情報ネットワークを介して保守支援センタと
接続されるネットワークインタフェースとを含む情報処
理装置において、情報処理装置における障害の発生を監
視する障害監視機能と、障害の発生を保守支援センタに
通知する障害通報機能と、障害に対する障害対策が施さ
れたソフトウェアを保守支援センタから受信して、稼働
中のソフトウェアと交換するソフトウェア交換機能と、
を含む制御論理を備えたものである。According to the present invention, there is provided an information processing apparatus including a control unit operated by software and a network interface connected to a maintenance support center via an information network. A monitoring function, a failure notification function for notifying the occurrence of a failure to the maintenance support center, a software replacement function for receiving software from the maintenance support center for which a measure against the failure has been taken, and replacing the software with an operating software,
Is provided with control logic.

【００１９】また、本発明は、ソースコードを作成また
は変更する第１の工程と、ソースコードを任意の情報処
理装置に実装されるバイナリの実行コードに変換する第
２の工程と、を含むソフトウェアの作成方法において、
第１の工程では、情報処理装置の稼働中に検出された障
害の対策としてソースコードに変更を加えるとき、障害
対策の種別を示す障害対策識別情報を、当該障害対策識
別情報が実行コードまで継承される形式でソースコード
に付け加え、第２の工程で得られる実行コードに、障害
対策識別情報を埋め込むものである。Further, the present invention provides software including a first step of creating or changing a source code, and a second step of converting the source code into a binary executable code mounted on an arbitrary information processing apparatus. In the method of creating
In the first step, when a change is made to the source code as a countermeasure for a fault detected during the operation of the information processing apparatus, the fault countermeasure identification information indicating the type of the fault countermeasure is inherited by the fault countermeasure identification information to the execution code. In addition to the source code in the format described, the fault countermeasure identification information is embedded in the execution code obtained in the second step.

【００２０】また、本発明は、情報処理装置に実装され
るバイナリのソフトウェアであって、ソフトウェアの一
部に、当該ソフトウェアに対してこれまでに施された障
害対策の種別を示す障害対策識別情報が埋め込まれてい
るものである。The present invention also relates to binary software implemented in an information processing apparatus, wherein a part of the software includes fault countermeasure identification information indicating a type of a fault countermeasure performed so far on the software. Is embedded.

【００２１】より具体的には、一例として、障害発生を
契機に、その障害情報と障害発生バージョンをネットワ
ークを介して障害回復支援センタに報告し、障害回復支
援センタでは、障害の種別毎に付与された障害対策ＩＤ
及び障害の原因、現象、対策内容等の情報を備えた障害
対策データベース（障害対策ＤＢ）を具備し、報告され
た障害情報とセンタ側に具備した障害対策ＤＢマッチン
グから障害対策ＩＤを検出し、予め障害対策がなされた
マイクロプログラムに、この障害対策ＩＤを埋め込み、
各マイクロプログラムをバイナリでサーチして、先に検
出した障害対策ＩＤと一致するマイクロプログラムを見
つけて、そのマイクロプログラムをネットワークを介し
てディスクアレイ装置に送付し更新を行う。More specifically, as an example, when a failure occurs, the failure information and the failure occurrence version are reported to a failure recovery support center via a network. Failure countermeasure ID
And a failure countermeasure database (failure countermeasure DB) including information such as cause, phenomenon, and countermeasure contents of the fault, and detects a fault countermeasure ID from the reported fault information and the fault countermeasure DB matching provided on the center side. This failure countermeasure ID is embedded in a microprogram for which a countermeasure has been taken in advance,
Each microprogram is searched for in binary, a microprogram matching the previously detected failure countermeasure ID is found, and the microprogram is sent to the disk array device via the network to be updated.

【００２２】ここで、マイクロプログラム内にバイナリ
コードとして障害対策ＩＤと対策内容を埋め込む為に
は、ソースプログラム中で外部変数として障害対策ＩＤ
と対策内容を記入することで、コンパイル後に出来上が
ったロードモジュールにバイナリコードとして障害対策
ＩＤと対策内容を持たせることができ、これによって障
害対策内容とマイクロプログラムバージョンを一元的に
管理することが可能となる。Here, in order to embed the fault countermeasure ID and the countermeasure contents as a binary code in the microprogram, the fault countermeasure ID is used as an external variable in the source program.
By entering the countermeasure and the countermeasure, the load module created after compiling can have the fault countermeasure ID and countermeasure content as a binary code, which enables centralized management of the fault countermeasure content and the microprogram version Becomes

【００２３】更に、動作中のシステムにおいても、自律
的に障害回復支援センタへ障害対策ＤＢを照会し、当該
マイクロプログラム内に埋め込まれた障害対策ＩＤをサ
ーチすることで、不良が対策されているかを調べること
ができる。仮に障害発生のポテンシャルを秘めたプログ
ラムであると判明した場合、そのことを保守員に報告
し、保守員よりプログラム交換の指示があれば、同じ様
に対策済みのプログラムをダウンロードし、自動的に更
新を図る。Further, even in the operating system, whether the failure is dealt with by autonomously querying the failure recovery DB to the failure recovery support center and searching for the failure countermeasure ID embedded in the microprogram. You can find out. If it turns out that the program has the potential for failure occurrence, report it to maintenance personnel and if maintenance personnel instructs you to replace the program, download the program that has been similarly treated and automatically Try to update.

【００２４】また、全世界には膨大数のマイクロプログ
ラムがディスク装置に組込まれ稼働中である。そこで、
マイクロプログラム内に障害対策ＩＤと対策内容を持た
せて管理することで、ディスク装置で稼動中のマイクロ
プログラムが他のディスク装置で発生した障害が対策済
みかどうか容易に調べることが可能となる。In addition, an enormous number of microprograms are incorporated in disk devices and are in operation throughout the world. Therefore,
By managing the microprogram with the fault countermeasure ID and the content of the countermeasure, it is possible for the microprogram operating in the disk device to easily check whether or not the fault occurring in another disk device has been corrected.

【００２５】[0025]

【発明の実施の形態】以下、本発明の実施の形態を図面
を参照しながら詳細に説明する。Embodiments of the present invention will be described below in detail with reference to the drawings.

【００２６】図１は、本発明の一実施の形態である情報
処理装置の保守管理方法の作用の一例を示すフローチャ
ートであり、図２は、本実施の形態の情報処理装置の保
守管理方法にて用いられる各種データベースの構成の一
例を示す概念図であり、図３は、本実施の形態の情報処
理装置の保守管理方法にてソフトウェアに埋め込まれる
情報の一例を示すテーブルである。また、図４は、本実
施の形態の情報処理装置の保守管理方法において、保守
管理対象となるソフトウェアのソースコードの一例を示
す説明図、図５は、本実施の形態の情報処理装置の保守
管理方法において、保守管理対象となるソフトウェアの
コンパイル後のバイナリの実行形式のモジュールの一例
を、１６進ダンプで示す説明図である。FIG. 1 is a flowchart showing an example of the operation of the information processing apparatus maintenance management method according to one embodiment of the present invention. FIG. 2 is a flowchart showing the operation of the information processing apparatus maintenance management method according to this embodiment. FIG. 3 is a conceptual diagram showing an example of the configuration of various databases used in the present embodiment. FIG. 3 is a table showing an example of information embedded in software in the maintenance management method for the information processing apparatus according to the present embodiment. FIG. 4 is an explanatory diagram showing an example of source code of software to be maintained and managed in the information processing apparatus maintenance management method according to the present embodiment, and FIG. In a management method, it is an explanatory view showing an example of a module of a binary execution form after compilation of software for maintenance management by a hexadecimal dump.

【００２７】また、図６は、本実施の形態の情報処理装
置の保守管理方法が実施される情報処理システムの構成
の一例を示す概念図、図７は、本実施の形態の情報処理
装置の保守管理方法を実施する情報処理装置の一例であ
るディスクアレイ装置の構成の一例を示す概念図、図８
は、図７に例示したディスクアレイ装置に備えられたサ
ービスプロセッサの構成の一例を示す概念図、図９は、
本実施の形態のディスクアレイ装置の遠隔保守を行う障
害回復支援センタの構成の一例を示す概念図である。FIG. 6 is a conceptual diagram showing an example of the configuration of an information processing system in which the information processing apparatus maintenance management method according to the present embodiment is implemented. FIG. FIG. 8 is a conceptual diagram showing an example of a configuration of a disk array device which is an example of an information processing device that implements a maintenance management method.
FIG. 9 is a conceptual diagram illustrating an example of a configuration of a service processor provided in the disk array device illustrated in FIG. 7, and FIG.
FIG. 2 is a conceptual diagram illustrating an example of a configuration of a failure recovery support center that performs remote maintenance of the disk array device according to the present embodiment.

【００２８】まず、本実施の形態のディスク装置の障害
発生からプログラムの自動更新まで実施されるディスク
装置のハードウェア構成について、図６、図７、図８及
び図９を参照して説明する。First, a description will be given of a hardware configuration of a disk device according to the present embodiment, which is performed from the occurrence of a failure in the disk device to the automatic update of a program, with reference to FIGS. 6, 7, 8, and 9.

【００２９】図６にて、本実施の形態の情報処理装置の
保守管理方法が実施される情報処理システムの構成につ
いて説明する。複数のディスクアレイ装置６０１がロー
カルエリアネットワーク６０５（ＬＡＮ）で接続されて
構成されるディスクアレイ装置設置サイト６０４と、デ
ィスクアレイ装置設置サイト６０４で発生した障害に対
する障害回復を支援する為の障害回復支援センタ６０２
が、ネットワーク６０３を介して、通信可能な状態に構
成される。なお、図６では、簡単のため障害回復支援セ
ンタ６０２が一つ例示されているが、一つまたは複数の
障害回復支援センタ６０２をローカルエリアネットワー
ク６０６（ＬＡＮ）を介してネットワーク６０３に接続
する構成とすることができる。Referring to FIG. 6, a description will be given of a configuration of an information processing system in which the maintenance management method for the information processing apparatus according to the present embodiment is performed. A disk array device installation site 604 configured by connecting a plurality of disk array devices 601 via a local area network 605 (LAN), and a failure recovery support for supporting a failure recovery from a failure occurring at the disk array device installation site 604 Center 602
Are configured to be communicable via the network 603. In FIG. 6, one failure recovery support center 602 is illustrated for simplicity, but one or a plurality of failure recovery support centers 602 are connected to a network 603 via a local area network 606 (LAN). It can be.

【００３０】図７にて、ディスクアレイ装置６０１のハ
ードウェアの基本構成の一例を説明する。ディスクアレ
イ装置６０１は、汎用コンピュータ、ワークステーショ
ン、パーソナルコンピュータ等のホストコンピュータ７
０１とチャネル等のインタフェースを介して直接接続し
てデータの授受等を制御するための複数のホストＩ／Ｆ
制御部７０２と、複数のＤｉｓｋ装置７０４を制御する
為のＤｉｓｋ制御部７０３、これらのＩ／Ｆ制御部を接
続する為の共通バス７０５と、各Ｉ／Ｆ制御部が制御情
報を共有する為の共有メモリ７０６から構成される。Referring to FIG. 7, an example of the basic configuration of the hardware of the disk array device 601 will be described. The disk array device 601 is a host computer 7 such as a general-purpose computer, a workstation, or a personal computer.
01 and a plurality of host I / Fs for controlling data transmission / reception by directly connecting them via interfaces such as channels.
A control unit 702, a disk control unit 703 for controlling a plurality of disk devices 704, a common bus 705 for connecting these I / F control units, and a control unit for sharing control information between the I / F control units. Of the shared memory 706.

【００３１】複数のＤｉｓｋ装置７０４には、ホストコ
ンピュータ７０１との間で授受される情報および当該情
報から生成された冗長情報が分散して格納され、ディス
クアレイを構成する。Information exchanged with the host computer 701 and redundant information generated from the information are dispersedly stored in the plurality of disk devices 704 to form a disk array.

【００３２】ホストＩ／Ｆ制御部７０２の各々は、たと
えば制御記憶７０２ａを備え、この制御記憶７０２ａに
格納された複数のモジュールからなるマイクロプログラ
ムにて動作する。同様にＤｉｓｋ制御部７０３の各々
は、たとえば制御記憶７０３ａを備え、この制御記憶７
０３ａに格納された複数のモジュールからなるマイクロ
プログラムにて動作する。図１０に、制御記憶７０２ａ
および７０３ａにおけるマイクロプログラムの格納状態
の一例を示す。制御記憶７０２ａおよび７０３ａに実装
されるバイナリ形式のマイクロプログラム１０００は、
複数のロードモジュール１００１で構成され、個々のロ
ードモジュール１００１の中には、後述のような方法で
埋め込まれた障害対策ＩＤ１００２および注釈としての
対策内容１００３が含まれている。Each of the host I / F control units 702 includes, for example, a control storage 702a, and operates according to a microprogram including a plurality of modules stored in the control storage 702a. Similarly, each of the disk control units 703 includes, for example, a control storage 703a.
It operates with a microprogram consisting of a plurality of modules stored in the module 03a. FIG. 10 shows a control storage 702a.
And 703a show an example of the storage state of the microprogram. The microprogram 1000 in binary format implemented in the control storages 702a and 703a is
The load module 1001 is composed of a plurality of load modules 1001. Each load module 1001 includes a failure countermeasure ID 1002 embedded by a method described later and a countermeasure content 1003 as an annotation.

【００３３】個々のディスクアレイ装置６０１では、当
該ディスクアレイ装置６０１の障害対策等の保守管理を
行う目的で、共通バス７０５と接続されたサービスプロ
セッサ（Service Processor ）（以下、ＳＶＰと呼ぶ）
７０７を具備する。このＳＶＰ７０７は障害対策用のハ
ードウェアとして、図示しないマイクロプロセッサ（Ｍ
ＰＵ）、メモリ、共通バス等を具備した中央処理装置８
００とＣＲＴ等のディスプレイ８０１、キーボード８０
２、マウス８０３等の外部入出力装置、及びＳＶＰ７０
７自体の制御プログラム、障害回復支援センタ６０２等
から受信したり、ＦＤ経由で入力され、ディスクアレイ
装置６０１にロードされるマイクロプログラム１００
０、さらにはデータを記憶するためのハードディスクド
ライブ８０５（ＨＤＤ）、フロッピイディスクドライブ
８０４（ＦＤ）等の外部記憶装置と、ＬＡＮ６０５、ネ
ットワーク６０３を介して外部の障害回復支援センタ６
０２等との間で情報通信を行うネットワークドライバ８
０６等からなるパーソナルコンピュータを使用する。中
央処理装置８００は、後述の障害監視部８０８、障害通
報部８０９、マイクロ交換部８１０等の機能を実現す
る。詳細は図８に示す。In each disk array device 601, a service processor (Service Processor) (hereinafter, referred to as SVP) connected to the common bus 705 for the purpose of performing maintenance management such as troubleshooting for the disk array device 601.
707. The SVP 707 is a microprocessor (M) (not shown) as hardware for troubleshooting.
PU), a memory, a central processing unit 8 having a common bus, etc.
00, display 801 such as CRT, keyboard 80
2. External input / output device such as mouse 803, and SVP70
7, a microprogram 100 received from the failure recovery support center 602 or the like, or input via the FD and loaded into the disk array device 601.
0, an external storage device such as a hard disk drive 805 (HDD) and a floppy disk drive 804 (FD) for storing data, and an external failure recovery support center 6 via a LAN 605 and a network 603.
Network driver 8 for performing information communication with the device 02 or the like
06 or the like is used. The central processing unit 800 implements functions of a failure monitoring unit 808, a failure notification unit 809, a micro exchange unit 810, and the like, which will be described later. Details are shown in FIG.

【００３４】図８にて、図７で示したディスクアレイ装
置６０１に搭載しているＳＶＰ７０７内のソフトウェア
・ハードウェア構成をより詳細に説明する。図中の共通
バス７０５は、図７で示したディスクアレイ装置６０１
内の各Ｉ／Ｆ制御部とＩ／Ｆを持つものである。従っ
て、各Ｉ／Ｆ制御部で検知した障害は、共通バス７０５
を経由してＳＶＰ７０７に伝達される。Referring to FIG. 8, the software and hardware configuration in the SVP 707 mounted on the disk array device 601 shown in FIG. 7 will be described in more detail. The common bus 705 in the figure corresponds to the disk array device 601 shown in FIG.
Each has an I / F control unit and an I / F. Therefore, the fault detected by each I / F control unit is determined by the common bus 705
Is transmitted to the SVP 707 via the

【００３５】ＳＶＰ７０７内では、具体的に障害監視部
８０８が各Ｉ／Ｆ制御部で検知したエラーコードやエラ
ーメッセージ等の障害情報を受け取る。受け取った障害
情報を障害通報部８０９に渡し、障害通報部８０９がＳ
ＶＰ内共通バス８０７、ネットワークドライバ８０６、
後述の図９で示すネットワークドライバ９０６、共通バ
ス９０７を経由して、障害回復支援センタ６０２に通報
する。障害回復支援センタ６０２は、同一部位を経由し
てマイクロ交換部８１０に障害が対策されたプログラム
を送り込む。マイクロ交換部８１０は、送付されてきた
プログラムの更新を行う。In the SVP 707, the fault monitoring unit 808 receives fault information such as an error code and an error message detected by each I / F control unit. The received fault information is passed to the fault notification unit 809, and the fault notification unit 809
VP common bus 807, network driver 806,
A notification is sent to the failure recovery support center 602 via a network driver 906 and a common bus 907 shown in FIG. The failure recovery support center 602 sends a program in which a failure has been dealt with to the micro exchange unit 810 via the same part. The micro exchange unit 810 updates the sent program.

【００３６】図９にて、本実施の形態の障害回復支援セ
ンタ６０２の詳細な構成例を示す。障害回復支援センタ
６０２は、たとえば、図示しないマイクロプロセッサ
（ＭＰＵ）、メモリ、共通バス等を具備した中央処理装
置９００、ＣＲＴ等のディスプレイ９０１、キーボード
９０２、マウス９０３等の外部入出力装置、及びプログ
ラムやデータを記憶するためのハードディスクドライブ
９０５（ＨＤＤ）、フロッピイディスクドライブ９０４
（ＦＤ）等の外部記憶装置と、ＬＡＮ６０６、ネットワ
ーク６０３を介して、外部のディスクアレイ装置６０１
のＳＶＰ７０７等との間で情報通信を行うネットワーク
ドライバ９０６、これらを相互に接続する共通バス９０
７等からなるワークステーション（ＷＳ）を使用する。
このワークステーションには、後述の図２に例示される
形式の障害対策データベース９０８（障害対策ＤＢ）、
および機能や障害対策の状態が異なる複数のマイクロプ
ログラムが蓄積されたマイクロプログラムデータベース
９０９（以下、マイクロＤＢ９０９）が接続されてい
る。FIG. 9 shows a detailed configuration example of the failure recovery support center 602 of this embodiment. The failure recovery support center 602 includes, for example, a central processing unit 900 having a microprocessor (MPU), a memory, a common bus, and the like, a display 901 such as a CRT, an external input / output device such as a keyboard 902, a mouse 903, and a program. Disk drive 905 (HDD) and floppy disk drive 904 for storing data and data
(FD) and an external disk array device 601 via a LAN 606 and a network 603.
Network driver 906 for performing information communication with the SVP 707, etc., and a common bus 90 for interconnecting them.
7 is used.
The workstation includes a failure countermeasure database 908 (failure countermeasure DB) in a format illustrated in FIG.
Also, a microprogram database 909 (hereinafter, microDB909) storing a plurality of microprograms having different functions and fault countermeasure states is connected.

【００３７】中央処理装置９００は、後述の障害対策Ｉ
Ｄ検出プログラム９１０、バイナリサーチプログラム９
１１、マイクロ送付プログラム９１２を実行して、後述
のようなディスクアレイ装置６０１の保守管理等の制御
を行う。The central processing unit 900 has a failure countermeasure I described later.
D detection program 910, binary search program 9
11. The micro-sending program 912 is executed to perform control such as maintenance management of the disk array device 601 as described later.

【００３８】ディスクアレイ装置設置サイト６０４のデ
ィスクアレイ装置６０１で発生した障害がネットワーク
６０３を介して障害回復支援センタ６０２に障害情報と
して通報されると、障害回復支援センタ６０２内の障害
対策ＩＤ検出プログラム９１０が障害対策ＤＢ９０８か
ら障害対策ＩＤ９０８ａを検出する。検出された障害対
策ＩＤ９０８ａを共通バス９０７を介してバイナリサー
チプログラム９１１に渡して、バイナリサーチプログラ
ム９１１が障害が対策されたマイクロＤＢ９０９からマ
イクロプログラムをバイナリでサーチし、障害対策ＩＤ
９０８ａと一致した障害対策ＩＤ１００２を持つマイク
ロプログラムを見つけて、マイクロ送付プログラム９１
２が、障害の対策された当該マイクロプログラムをディ
スクアレイ装置６０１に送付する。When a failure occurred in the disk array device 601 at the disk array device installation site 604 is reported as failure information to the failure recovery support center 602 via the network 603, a failure countermeasure ID detection program in the failure recovery support center 602 910 detects the fault countermeasure ID 908a from the fault countermeasure DB 908. The detected fault countermeasure ID 908a is passed to the binary search program 911 via the common bus 907, and the binary search program 911 searches the micro DB 909 in which the fault has been countermeasured for a microprogram in binary.
908a, find a micro program having the fault countermeasure ID 1002, and send the micro program 91
2 sends to the disk array device 601 the microprogram in which the measure has been taken.

【００３９】図２に、本実施の形態にて用いられる障害
対策データベース９０８の詳細なテーブル構造の一例を
示す。項目としては、障害対策ＩＤ９０８ａ、件名９０
８ｂ、現象９０８ｃ、原因９０８ｄ、対策内容９０８
ｅ、発生プログラムバージョン９０８ｆ、対策プログラ
ムバージョン９０８ｇ、複数の事象９０８ｈがある。こ
こで、事象９０８ｈとは、たとえばエラーメッセージや
エラーコード等の障害情報を表しており、幾つかの障害
に対し、同一障害に分類されると判断した場合は事象９
０８ｈを増やすことも可能である。事象９０８ｈには、
何々機能のリトライ回数５回以上や何々機能の性能パフ
ォーマンス低下等の当該障害の症状等を表す情報が格納
される。FIG. 2 shows an example of a detailed table structure of the failure countermeasure database 908 used in this embodiment. The items are: failure countermeasure ID 908a, subject 90
8b, phenomenon 908c, cause 908d, and countermeasures 908
e, occurrence program version 908f, countermeasure program version 908g, and multiple events 908h. Here, the event 908h indicates fault information such as an error message or an error code. If it is determined that some faults are classified as the same fault, the event 908h is detected.
08h can be increased. Event 908h includes:
Information indicating the symptom of the failure, such as the number of times the function is retried 5 times or more, or the performance degradation of the function, is stored.

【００４０】図３は、本実施の形態にて保守管理される
マイクロプログラム１０００のロードモジュール内に埋
め込む情報の詳細なテーブル構造の一例を示している。
すなわち、マイクロプログラム１０００のロードモジュ
ール１００１には、障害対策データベース９０８の障害
対策ＩＤ９０８ａに対応する障害対策ＩＤ１００２と、
対策内容（注釈：コメント）１００３が埋め込まれる。FIG. 3 shows an example of a detailed table structure of information to be embedded in the load module of the microprogram 1000 maintained and managed in this embodiment.
That is, the load module 1001 of the microprogram 1000 includes a failure countermeasure ID 1002 corresponding to the failure countermeasure ID 908a of the failure countermeasure database 908,
A countermeasure content (comment: comment) 1003 is embedded.

【００４１】図４は、図３のテーブルに例示され情報を
ロードモジュール１００１内に埋め込む為に、マイクロ
プログラム１０００のソースコード１０００ａ中に記入
する外部変数の一例を示している。FIG. 4 shows an example of external variables to be written in the source code 1000a of the microprogram 1000 in order to embed information in the load module 1001 as exemplified in the table of FIG.

【００４２】従来では、任意のマイクロプログラムに施
された障害対策の内容等を当該マイクロプログラムのソ
ースコード中に注釈（コメント）として記入している
が、これを外部変数として定義することで、図５のロー
ドモジュール１００１の１６進ダンプのデータに例示さ
れる様にロードモジュール１００１のバイナリコード中
に障害対策ＩＤ１００２（９０８ａ）と対策内容１００
３（９０８ｅ）を持たせることができる。これによっ
て、当該マイクロプログラム１０００に施された障害対
策の内容とロードモジュール１００１の関係を、特別な
データベース等を必要とすることなく、バイナリの当該
ロードモジュール１００１だけで一元的に管理すること
ができ、障害対策項目とそのロードモジュールを持つプ
ログラムバージョンを一元的に管理することが可能とな
る。Conventionally, the contents of the measures taken against a failure performed on an arbitrary microprogram are written as an annotation (comment) in the source code of the microprogram. By defining this as an external variable, 5, the failure countermeasure ID 1002 (908a) and the countermeasure content 100 in the binary code of the load module 1001 as exemplified in the hexadecimal dump data of the load module 1001.
3 (908e). This makes it possible to centrally manage the relationship between the contents of the failure countermeasures applied to the microprogram 1000 and the load module 1001 using only the binary load module 1001 without requiring a special database or the like. In addition, it is possible to centrally manage the program version having the failure countermeasure item and its load module.

【００４３】以下、本実施の形態の情報処理装置の保守
管理方法を、上述の構成のディスクアレイ装置６０１に
適用した場合において、障害発生からプログラムの自動
更新まで作用の一例を図１のフローチャートで説明す
る。FIG. 1 is a flowchart showing an example of the operation from the occurrence of a failure to the automatic update of a program when the maintenance management method for an information processing apparatus according to the present embodiment is applied to the disk array apparatus 601 having the above configuration. explain.

【００４４】ステップ１０１では、障害発生の監視をす
る。ステップ１０２では、発生した障害情報を障害回復
支援センタ６０２に通報する。ステップ１０３およびス
テップ１０４では、障害情報の通報が入ると、障害回復
支援センタ６０２では、障害対策ＤＢ９０８から、報告
された障害情報と一致する事象９０８ｈを検索し、対応
する障害対策ＩＤ９０８ａを検出する。In step 101, the occurrence of a failure is monitored. In step 102, the generated fault information is reported to the fault recovery support center 602. In step 103 and step 104, when the notification of the fault information is received, the fault recovery support center 602 searches the fault countermeasure DB 908 for an event 908h that matches the reported fault information, and detects the corresponding fault countermeasure ID 908a.

【００４５】この時、障害対策ＤＢ９０８で障害対策Ｉ
Ｄ９０８ａを検出できない場合は、ステップ１１３で障
害情報の解析を行い、ステップ１１４では障害解析の結
果、既に障害対策ＤＢ９０８に登録されている障害対策
ＩＤ９０８ａと、同一障害であると判断した場合は、ス
テップ１１５で同一と判断した障害対策ＩＤ９０８ａに
対して新たに事象９０８ｈを設けて障害情報を登録し、
この時の障害対策ＩＤ９０８ａをステップ１１６で取得
し、ステップ１０５に進む。At this time, the failure countermeasure I is stored in the failure countermeasure DB 908.
If D908a cannot be detected, the failure information is analyzed in step 113. In step 114, as a result of the failure analysis, if it is determined that the failure is the same as the failure countermeasure ID 908a already registered in the failure countermeasure DB 908, A new event 908h is provided for the fault countermeasure ID 908a determined to be the same at 115, and fault information is registered.
The failure countermeasure ID 908a at this time is obtained in step 116, and the process proceeds to step 105.

【００４６】しかし、ステップ１１４での障害解析の結
果、障害対策ＤＢ９０８に登録されていないと判断した
場合は、ステップ１１７に進み、ここで、解析結果がソ
フトウェア不良による障害と判断した場合は、ステップ
１１８ａにて新たに障害対策ＤＢ９０８に新規の障害対
策ＩＤ９０８ａを設定し対応する件名９０８ｂ〜事象９
０８ｈ等の情報の登録するとともに、ステップ１１８ｂ
にて障害対策のためのソースコード１０００ａの手直し
および障害対策ＩＤ１００２、対策内容１００３の埋め
込みを行い、さらにステップ１１８ｃにてソースコード
１０００ａをコンパイルまたはアセンブル等により、障
害対策ＩＤ１００２および対策内容１００３を含むバイ
ナリのロードモジュール１００１を生成し、ステップ１
１８ｄにて、生成されたマイクロプログラム１０００
（ロードモジュール１００１）をマイクロＤＢ９０９に
登録する。However, as a result of the failure analysis in step 114, if it is determined that the failure is not registered in the failure countermeasure DB 908, the process proceeds to step 117. If the analysis result is determined to be a failure due to a software defect, the process proceeds to step 117. At 118a, a new fault countermeasure ID 908a is newly set in the fault countermeasure DB 908, and the corresponding subject 908b to event 9
08h and the like, and step 118b
In step 118c, the source code 1000a is corrected and the fault countermeasure ID 1002 and the countermeasure content 1003 are embedded. Further, in step 118c, the source code 1000a is compiled or assembled to include the fault countermeasure ID 1002 and the binary including the countermeasure content 1003. Of the load module 1001 of step 1
18d, the generated microprogram 1000
(Load module 1001) is registered in the micro DB 909.

【００４７】また、ステップ１１７でハードウェア不良
による障害と判明した場合は、ステップ１１９でハード
ウェア交換等の別途対応を行う。If it is determined in step 117 that the failure is caused by a hardware defect, in step 119, a separate response such as hardware replacement is performed.

【００４８】ステップ１０５では、マイクロＤＢ９０９
から、障害対策の施されたバイナリのマイクロプログラ
ム１０００（ロードモジュール１００１）を取得する。
ステップ１０６では、先に取得したマイクロプログラム
１０００をバイナリでサーチし先に検出した障害対策Ｉ
Ｄ９０８ａと一致するコード（障害対策ＩＤ１００２）
を探す。ここで、一致するコードがない場合は、ステッ
プ１０５に戻り同じ操作を繰り返し行う。In step 105, the micro DB 909
From this, a binary microprogram 1000 (load module 1001) on which a fault measure has been taken is obtained.
In step 106, the previously acquired microprogram 1000 is searched in binary, and the previously detected failure countermeasure I
Code that matches D908a (failure countermeasure ID 1002)
Search for If there is no matching code, the process returns to step 105 and the same operation is repeated.

【００４９】ステップ１０７で障害対策ＩＤ９０８ａと
一致するコード（障害対策ＩＤ１００２）を検出した場
合、ステップ１０８でそのロードモジュール名とマイク
ロプログラムバージョンを記憶する。ステップ１０９
で、一致するコード（障害対策ＩＤ１００２）を含む別
のマイクロプログラム１０００が存在する場合は、ステ
ップ１０５に移行して同じ操作を繰り返し行い、別のマ
イクロプログラム１０００が存在しない場合は、ステッ
プ１０８で記憶したマイクロプログラム１０００をステ
ップ１１０で障害の起きたディスクアレイ装置６０１に
送信する。If a code (fault countermeasure ID 1002) that matches the fault countermeasure ID 908a is detected in step 107, the load module name and microprogram version are stored in step 108. Step 109
If another microprogram 1000 including the matching code (failure countermeasure ID 1002) exists, the process proceeds to step 105 and the same operation is repeated, and if another microprogram 1000 does not exist, it is stored in step 108. The microprogram 1000 is transmitted to the failed disk array device 601 in step 110.

【００５０】なお、ステップ１０５〜ステップ１０９の
処理の反復にて目的の障害対策ＩＤ９０８ａに該当する
障害対策ＩＤ１００２が埋め込まれた複数のロードモジ
ュールが見いだされた場合には、たとえば、ステップ１
１０で、稼働中のロードモジュールのソフトウェアバー
ジョンに最も近いバージョンのロードモジュールを選択
して送付する処理を行う。その理由は、稼働中のマイク
ロプログラム１０００を半数の懸け離れた最新のものに
更新すると予期せぬ不具合が発生する場合があるので、
必要な障害対策が実施済の範囲で、現在稼働中の半数に
近いマイクロプログラム１０００にて更新するためであ
る。なお、このような懸念がない場合には、無条件に最
新の版の更新してもよいことはいうまでもない。If a plurality of load modules in which the failure countermeasure ID 1002 corresponding to the target failure countermeasure ID 908a is found by repeating the processing of steps 105 to 109, for example, step 1 is executed.
At 10, a process of selecting and sending the load module of the version closest to the software version of the operating load module is performed. The reason is that if the running microprogram 1000 is updated to the latest one half apart, unexpected trouble may occur,
This is because the update is performed by the microprogram 1000 that is close to half of the currently operating ones within a range in which necessary fault measures have been implemented. If there is no such concern, it is needless to say that the latest version may be unconditionally updated.

【００５１】ステップ１１１では、ステップ１１０で送
信したマイクロプログラム１０００を障害が発生したデ
ィスクアレイ装置６０１のＳＶＰ７０７で受け取り、ス
テップ１１２では、受け取ったマイクロプログラム１０
００を用いて、マイクロ交換部８１０で稼働中のマイク
ロプログラムの更新を行う。この様に一連の流れで障害
発生を契機にマイクロプログラムの自動更新を行う。In step 111, the microprogram 1000 transmitted in step 110 is received by the SVP 707 of the failed disk array device 601. In step 112, the received microprogram 10
Using 00, the microprogram that is operating in the microexchange unit 810 is updated. In this way, the automatic update of the microprogram is performed in response to the occurrence of a failure in a series of flows.

【００５２】また、障害対策ＩＤ１００２（９０８ａ）
をロードモジュール１００１に埋め込むことで、障害対
策箇所をロードモジュール単位で管理することができ
る。そしてその障害が対策されたロードモジュール１０
０１をネットワーク６０３を介してダウンロードし、更
新を行なうことも可能である。Also, a fault countermeasure ID 1002 (908a)
Is embedded in the load module 1001, it is possible to manage the failure countermeasures in units of the load module. And the load module 10 in which the failure has been taken
01 can be downloaded via the network 603 and updated.

【００５３】また、動作中のディスクアレイ装置６０１
においても、自律的に障害回復支援センタ６０２の障害
対策ＤＢ９０８およびマイクロＤＢ９０９等を参照する
アクセスを行い、マイクロＤＢ９０９内のマイクロプロ
グラム内に埋め込まれた障害対策ＩＤ１００２をサーチ
する（たとえば、自システムで稼働中のマイクロプログ
ラム１０００に埋め込まれている障害対策ＩＤ１００２
と照合する）ことで、稼働中のマイクロプログラム１０
００の障害対策以降に、新たな障害対策が施されたマイ
クロプログラム１０００がマイクロＤＢ９０９内に存在
するか否かを調べることができる。そして、マイクロＤ
Ｂ９０９内のマイクロプログラム１０００に比較して、
稼働中のマイクロプログラム１０００で未実施の障害対
策があると判明した場合、すなわち、稼働中のマイクロ
プログラム１０００が、障害発生のポテンシャルを秘め
たマイクロプログラム１０００であると判明した場合、
保守員に、稼働中のマイクロプログラム１０００には潜
在的な障害発生のおそれがあるとの警報を報告し、保守
員よりプログラム交換の指示があれば、上述の更新操作
と同様に対策済みのマイクロプログラム１０００を、障
害回復支援センタ６０２のマイクロＤＢ９０９からダウ
ンロードし、ディスクアレイ装置６０１の稼働中のマイ
クロプログラム１０００の更新を行う。これにより、障
害の発生を未然に防ぐ予防保守が可能になる。Also, the operating disk array device 601
In this case, an autonomous access is performed with reference to the failure countermeasure DB 908 and the micro DB 909 of the failure recovery support center 602, and the failure countermeasure ID 1002 embedded in the micro program in the micro DB 909 is searched (for example, the system operates on its own system). Troubleshooting ID 1002 embedded in the microprogram 1000 inside
By comparing with the running microprogram 10
After the trouble countermeasure of 00, it is possible to check whether or not the microprogram 1000 provided with the new trouble countermeasure exists in the micro DB 909. And micro D
Compared to the microprogram 1000 in B909,
When it is determined that there is an unexecuted failure countermeasure in the active microprogram 1000, that is, when it is determined that the active microprogram 1000 is a microprogram 1000 with a potential for failure occurrence,
An alert is sent to maintenance personnel indicating that a potential failure may occur in the operating microprogram 1000. If maintenance personnel instructs the replacement of the program, the microcontroller 1000 that has taken measures as in the update operation described above is notified. The program 1000 is downloaded from the micro DB 909 of the failure recovery support center 602, and the operating micro program 1000 of the disk array device 601 is updated. As a result, preventive maintenance for preventing the occurrence of a failure can be performed.

【００５４】上述のような、障害発生契機以外の任意の
契機での、ディスクアレイ装置６０１の自発的なソフト
ウェア保守管理動作の一例を、図１１のフローチャート
に例示する。An example of the voluntary software maintenance management operation of the disk array device 601 at any occasion other than the above-described failure occurrence occasion is illustrated in the flowchart of FIG.

【００５５】すなわち、ディスクアレイ装置６０１に備
えられたＳＶＰ７０７は、障害回復支援センタ６０２に
自発的にアクセスする契機を監視する（ステップ２０
１）。この契機は、あらかじめタイマ等のアクセス時刻
を決めて自動的に決定してもよいし、保守管理者の指示
で随時行う等、任意の契機でよい。That is, the SVP 707 provided in the disk array device 601 monitors the opportunity to voluntarily access the failure recovery support center 602 (step 20).
1). This opportunity may be determined automatically in advance by determining the access time of a timer or the like, or may be any occasion, such as being performed at any time by a maintenance manager's instruction.

【００５６】障害回復支援センタ６０２へのアクセス契
機と判定された場合、ＳＶＰ７０７は、たとえば、自装
置で稼働中のマイクロプログラム１０００に埋め込まれ
ている障害対策ＩＤ１００２を読み出して、キー情報と
して障害回復支援センタ６０２へ照合依頼を送出する。
障害回復支援センタ６０２では、ＳＶＰ７０７から送出
されたキー情報の障害対策ＩＤ１００２と、障害対策Ｄ
Ｂ９０８内の同種のマイクロプログラム１０００に関す
る障害対策ＩＤ９０８ａとを照合し（ステップ２０
２）、照合結果をキー情報の送出元のＳＶＰ７０７に送
出する。この時、必要に応じて、照合結果とともに、未
実施と判明した障害対策の対策内容９０８ｅを送出して
もよい。When it is determined that the access to the failure recovery support center 602 has been triggered, the SVP 707 reads, for example, the failure countermeasure ID 1002 embedded in the microprogram 1000 running on its own device, and uses the readout as the key information. A check request is sent to the center 602.
In the failure recovery support center 602, the failure countermeasure ID 1002 of the key information sent from the SVP 707 and the failure countermeasure D
B908 is compared with the failure countermeasure ID 908a for the same type of microprogram 1000 (step 20).
2) Send the collation result to the SVP 707 that sent the key information. At this time, if necessary, the countermeasure contents 908e of the fault countermeasure determined to have not been performed may be transmitted together with the verification result.

【００５７】そして、ＳＶＰ７０７は、障害対策ＤＢ９
０８内の同種のマイクロプログラム１０００に関する障
害対策ＩＤ９０８ａに存在し、かつ稼働中のマイクロプ
ログラム１０００のキー情報の障害対策ＩＤ１００２に
存在しない障害対策ＩＤ９０８ａがある場合、すなわ
ち、稼働中のマイクロプログラム１０００に未実施の障
害対策が施された同種のマイクロプログラム１０００が
障害回復支援センタ６０２の側に存在し、未実施の障害
対策の障害が発生する可能性（障害ポテンシャル）があ
る、と判明した場合には（ステップ２０３）、まず自装
置の保守管理者に稼働中のマイクロプログラム１０００
には、未実施の障害対策があり、当該障害の発生の可能
性がある旨の警告を出す（ステップ２０４）。このと
き、未実施と判明した障害対策の対策内容９０８ｅを表
示してもよい。The SVP 707 stores the failure countermeasure DB 9
08, there is a fault countermeasure ID 908a that is present in the fault countermeasure ID 908a for the same type of microprogram 1000 and does not exist in the fault countermeasure ID 1002 of the key information of the active microprogram 1000, that is, the active microprogram 1000 has no fault countermeasure ID 908a. If it is determined that the same type of microprogram 1000 that has been subjected to the implemented failure countermeasure exists on the side of the failure recovery support center 602 and that there is a possibility that an unexecuted failure countermeasure may occur (failure potential). (Step 203) First, the microprogram 1000 in operation by the maintenance manager of the own device.
In step 204, a warning is issued to the effect that there is an unexecuted failure countermeasure and that the failure may occur. At this time, the countermeasure content 908e of the fault countermeasure that has been determined not to be performed may be displayed.

【００５８】また、この警告を受けた保守管理者の判断
により、必要に応じて、適時に、障害回復支援センタ６
０２から障害対策済の最新のマイクロプログラム１００
０をダウンロードし、稼働中のマイクロプログラム１０
００と交換する予防保守を行うこともできる（ステップ
２０５）。The maintenance manager who receives the warning may timely, if necessary, determine whether the failure has been recovered.
02 The latest microprogram 100 that has already been fixed
0 and download the running microprogram 10
Preventive maintenance can also be performed to replace with 00 (step 205).

【００５９】また、前記ステップ２０３で、障害ポテン
シャルがないと判明した場合には、必要に応じて、保守
管理者に、その旨を報告する（ステップ２０６）。If it is determined in step 203 that there is no fault potential, the fact is reported to the maintenance manager as necessary (step 206).

【００６０】このように、ディスクアレイ装置６０１に
おける障害発生の有無に関係なく、当該ディスクアレイ
装置６０１のＳＶＰ７０７の側の自発的に動作により、
適時に予防保守が可能となる。As described above, irrespective of whether or not a failure has occurred in the disk array device 601, the SVP 707 side of the disk array device 601 spontaneously operates.
Preventive maintenance can be performed in a timely manner.

【００６１】なお、上述の実施の形態の説明では、保守
管理対象のシステムとして、特定のホストコンピュータ
７０１の配下で稼働するディスクアレイ装置６０１の保
守管理に適用した場合を例に採って説明したが、これに
限らない。In the above description of the embodiment, a case where the present invention is applied to the maintenance management of a disk array device 601 operating under a specific host computer 701 as a system to be managed is described. However, it is not limited to this.

【００６２】たとえば、図１２に例示されるストレージ
エリアネットワーク４００（Storage Area Network :Ｓ
ＡＮ）の保守管理にも適用することができる。For example, a storage area network 400 (Storage Area Network: S) exemplified in FIG.
AN) can also be applied to maintenance management.

【００６３】すなわち、ＳＡＮ４００では、複数のディ
スクアレイ装置４０１を高速な情報ネットワーク４０２
にて接続して仮想的な大記憶容量の外部記憶装置を構成
し、外部のメインフレームコンピュータ、オープン系サ
ーバ、パーソナルコンピュータ等の任意の複数の上位装
置４０３を、それぞれ任意の接続インタフェース４０４
を介して接続することで、データストレージサービスを
提供するシステムである。That is, in the SAN 400, a plurality of disk array devices 401 are connected to a high-speed information network 402.
To configure a virtual large-capacity external storage device, and connect a plurality of upper devices 403 such as an external mainframe computer, an open system server, and a personal computer to an arbitrary connection interface 404.
This is a system that provides a data storage service by connecting via the Internet.

【００６４】ＳＡＮ４００の複数のディスクアレイ装置
４０１は、情報ネットワーク４０２を介してＳＡＮ管理
センタ４０５（上述のＳＶＰ７０７に相当）の配下で一
括して管理運用され、さらに、ＳＡＮ管理センタ４０５
は、公衆回線４０７を介して保守センタ４０６（上述の
障害回復支援センタ６０２に相当）に接続される。A plurality of disk array devices 401 of the SAN 400 are collectively managed and operated under the control of the SAN management center 405 (corresponding to the above-mentioned SVP 707) via the information network 402, and furthermore, the SAN management center 405
Is connected to the maintenance center 406 (corresponding to the above-described failure recovery support center 602) via the public line 407.

【００６５】個々の上位装置４０３は他の上位装置４０
３を意識することなく、当該上位装置４０３に割り当て
られたストレージ領域を、自上位装置４０３に対応した
仮想的なファイルシステムや記憶フォーマットとして利
用できるように、ＳＡＮ４００の側にて、ＳＡＮ４００
内部の所望の記憶形式やファイルシステムから、複数の
上位装置４０３の各々の要求仕様に合わせた記憶形式の
相互変換サービスを行う。Each upper device 403 is connected to another upper device 40.
3 so that the storage area allocated to the host device 403 can be used as a virtual file system or a storage format corresponding to the host device 403 without being aware of the SAN 400.
Based on the desired internal storage format and file system, a mutual conversion service of the storage format is performed according to the required specifications of each of the plurality of higher-level devices 403.

【００６６】この場合、個々の上位装置４０３の所有者
は、自前の外部記憶装置を持つ必要はなく、ＳＡＮ４０
０の提供者から、データストレージサービスを受けるこ
とで、外部記憶装置に関連する購入コストや保守管理コ
スト削減等の恩恵を受けることができる等の利点があ
る。In this case, the owner of each higher-level device 403 does not need to have its own external storage device.
By receiving the data storage service from the 0 provider, there is an advantage that benefits such as reduction in purchase costs and maintenance management costs related to the external storage device can be obtained.

【００６７】このような、ＳＡＮ４００においても、本
実施の形態の情報処理装置の保守管理方法を適用し、Ｓ
ＡＮ管理センタ４０５に上述のＳＶＰ７０７の機能を持
たせ、保守センタ４０６に上述の障害回復支援センタ６
０２の機能を持たせることで、個々のディスクアレイ装
置４０１にて仕様されるマイクロプログラム等のソフト
ウェアの保守管理を効率よく遂行することが可能になる
とともに、予防保守も可能になる。In such a SAN 400 as well, by applying the maintenance management method for the information processing apparatus of this embodiment,
The AN management center 405 has the function of the SVP 707 described above, and the maintenance center 406 has the failure recovery support center 6 described above.
By providing the function 02, maintenance management of software such as a microprogram specified in each disk array device 401 can be efficiently performed, and preventive maintenance can be performed.

【００６８】以上説明したように、本発明の実施の形態
によれば、ディスクアレイ装置設置サイト６０４のディ
スクアレイ装置６０１等における障害発生を契機に、ネ
ットワーク６０３を介して障害回復支援センタ６０２か
ら障害が対策されたマイクロプログラムをダウンロード
し、保守員の介入無しに入れ替え作業を行うことで、保
守員に負担をかけることなく、迅速かつ確実にディスク
アレイ装置６０１等の外部記憶サブシステム等の早期障
害回復を図ることが可能になる。As described above, according to the embodiment of the present invention, when a failure occurs in the disk array device 601 or the like at the disk array device installation site 604, the failure recovery support center 602 via the network 603 triggers the failure. Downloading the microprogram with the countermeasures, and performing the replacement work without the intervention of maintenance personnel, quickly and surely prevent early failure of the external storage subsystem such as the disk array device 601 without burdening maintenance personnel. Recovery can be achieved.

【００６９】また、ディスクアレイ装置６０１の制御に
用いられるマイクロプログラム等のロードモジュール１
００１のバイナリコード中に障害対策ＩＤ１００２と対
策内容１００３を埋め込むことで、外部に特別なデータ
ベース等を必要とすることなく、ロードモジュール１０
０１のみにて、煩雑なデータベース管理等を必要とする
ことなく、当該ロードモジュール１００１に施された障
害対策内容とマイクロプログラムの一元的な管理が可能
となる。The load module 1 such as a microprogram used for controlling the disk array device 601
By embedding the fault countermeasure ID 1002 and the countermeasure content 1003 in the binary code 001, the load module 10 can be loaded without requiring a special external database or the like.
01 alone, it is possible to centrally manage the contents of the fault measures applied to the load module 1001 and the microprogram without the need for complicated database management or the like.

【００７０】また、従来では、個々のロードモジュール
レベル毎に障害対策を管理しようとすると管理が煩雑に
なるため、障害時には障害のロードモジュール１００１
を含むマイクロプログラム１０００の全体を交換してい
たが、本実施の形態の場合には、上述のように個々のロ
ードモジュール１００１とその障害対策の来歴の関係を
簡単に把握できるため、よりデータ量の小さいロードモ
ジュール１００１単位にきめ細かく修正や更新等の保守
管理を実施でき、ネットワーク６０３を経由した更新ソ
フトウェアの授受における通信経路の負荷軽減、通信時
間の短縮、等により、入れ替え作業に要する時間を短縮
することができる。Conventionally, if it is attempted to manage a fault measure for each load module level, the management becomes complicated.
In the present embodiment, the relationship between the individual load modules 1001 and the history of the failure countermeasures can be easily grasped as described above. Maintenance work such as correction and update can be performed in detail for each load module 1001 with a small load module, and the time required for replacement work can be shortened by reducing the load on the communication path when transferring updated software via the network 603 and shortening the communication time. can do.

【００７１】また、ディスクアレイ装置６０１のＳＶＰ
７０７の側から任意の契機で自発的にネットワーク６０
３を介して障害回復支援センタ６０２の障害対策ＤＢ９
０８等を参照することで、稼働中のマイクロプログラム
が障害発生のポテンシャルを秘めたソフトウェアか否か
の把握ができ、必要に応じて、障害対策がなされたマイ
クロプログラムに更新することで、障害の発生を未然に
防ぐ予防保守が可能となる。The SVP of the disk array device 601
707 side voluntarily triggers network 60
3 of the failure recovery support center 602 via the
08, etc., it is possible to determine whether the operating microprogram is software having the potential for failure occurrence, and if necessary, update to a microprogram that has been subjected to failure countermeasures. Preventive maintenance to prevent occurrence is possible.

【００７２】以上、本発明を実施の形態に基づき具体的
に説明したが、本発明は、前記実施の形態に限定される
ものではなく、例えば、ディスク装置としては、上述の
実施の形態に例示したが、ディスクアレイ装置に限ら
ず、一般の情報処理システム等に広く適用することがで
きる。Although the present invention has been specifically described based on the embodiments, the present invention is not limited to the above-described embodiments. For example, a disk device is exemplified in the above-described embodiments. However, the present invention can be widely applied not only to the disk array device but also to general information processing systems and the like.

【００７３】たとえば、本発明は、ソフトウェアのアッ
プグレード（障害を伴わないもの）にも、ソフトウェア
の個々のバージョンに識別子を付与する（埋め込む）こ
とで対応可能である。For example, the present invention can cope with software upgrades (those without failure) by adding (embedding) identifiers to individual versions of software.

【００７４】[0074]

【発明の効果】本発明によれば、分散して配置された複
数の情報処理装置に実装されたソフトウェアの保守管理
を、高い信頼性にて効率よく遂行することができる、と
いう効果が得られる。According to the present invention, the maintenance and management of software mounted on a plurality of information processing apparatuses distributed and arranged can be efficiently performed with high reliability. .

【００７５】本発明によれば、任意の情報処理装置に実
装されて稼働するバイナリのソフトウェアの保守管理や
運用を高い信頼性にて効率よく遂行することができる、
という効果が得られる。According to the present invention, maintenance and operation of binary software mounted and operated on an arbitrary information processing apparatus can be efficiently performed with high reliability.
The effect is obtained.

【００７６】本発明によれば、任意の情報処理装置に実
装されて稼働するバイナリのソフトウェアの保守管理や
運用を、より小さなサイズ単位にきめ細かく迅速に遂行
することができる、という効果が得られる。According to the present invention, there is an effect that maintenance and operation of binary software mounted and operated on an arbitrary information processing apparatus can be performed finely and quickly in smaller size units.

【００７７】本発明によれば、障害発生を契機に、ネッ
トワークを介して障害が対策されたプログラムをダウン
ロードし、人員の介入無しにプログラムの入れ替え作業
を行うことができる、という効果が得られる。According to the present invention, it is possible to obtain an effect that, upon the occurrence of a failure, a program in which the failure has been prevented is downloaded via a network, and the program can be replaced without human intervention.

【００７８】本発明によれば、不良部位の推定を実行形
式プログラムを構成する複数のロードモジュール単位ま
で行い、更新もロードモジュール単位で行なえる様にす
ることで、早期対策及びネットワーク負荷の軽減を図る
ことがができる、という効果が得られる。According to the present invention, the failure portion is estimated for a plurality of load modules constituting the executable program, and the update can be performed for each load module. The effect is that it can be achieved.

【００７９】本発明によれば、ソフトウェアが実装され
た情報処理装置が自発的にネットワークを介して保守支
援センタにアクセスすることで、稼働中のプログラムに
不良があるかどうかを判断し、障害発生を未然に防ぐ予
防保守ができる、という効果が得られる。According to the present invention, the information processing apparatus in which the software is installed spontaneously accesses the maintenance support center via the network to judge whether or not the running program has a defect, and And preventive maintenance can be performed.

【００８０】本発明によれば、実行形式のマイクロプロ
グラム中にバイナリコードとして過去の障害対策内容を
埋め込むことで、マイクロプログラムバージョンと障害
対策内容を一元的に管理することができる、という効果
が得られる。According to the present invention, by embedding the past troubleshooting contents as a binary code in an executable microprogram, it is possible to centrally manage the microprogram version and the troubleshooting contents. Can be

[Brief description of the drawings]

【図１】本発明の一実施の形態である情報処理装置の保
守管理方法の作用の一例を示すフローチャートである。FIG. 1 is a flowchart illustrating an example of an operation of a maintenance management method for an information processing apparatus according to an embodiment of the present invention.

【図２】本発明の一実施の形態である情報処理装置の保
守管理方法にて用いられるデータベースの構成の一例を
示す概念図である。FIG. 2 is a conceptual diagram illustrating an example of a configuration of a database used in a maintenance management method for an information processing apparatus according to an embodiment of the present invention.

【図３】本発明の一実施の形態である情報処理装置の保
守管理方法にてソフトウェアに埋め込まれる情報の一例
を示す概念図である。FIG. 3 is a conceptual diagram showing an example of information embedded in software in the information processing apparatus maintenance management method according to an embodiment of the present invention.

【図４】本発明の一実施の形態である情報処理装置の保
守管理方法において、保守管理対象となるソフトウェア
のソースコードの一例を示す説明図である。FIG. 4 is an explanatory diagram showing an example of a source code of software to be maintained and managed in the information processing apparatus maintenance management method according to an embodiment of the present invention;

【図５】本発明の一実施の形態である情報処理装置の保
守管理方法において、保守管理対象となるソフトウェア
のコンパイル後のバイナリの実行形式のモジュールの一
例を、１６進ダンプで示す説明図である。FIG. 5 is an explanatory diagram showing, by a hexadecimal dump, an example of a binary executable module after compiling software to be maintained and managed in the information processing apparatus maintenance management method according to an embodiment of the present invention; is there.

【図６】本発明の一実施の形態である情報処理装置の保
守管理方法が実施される情報処理システムの構成の一例
を示す概念図である。FIG. 6 is a conceptual diagram illustrating an example of a configuration of an information processing system in which a method for maintaining and managing an information processing apparatus according to an embodiment of the present invention is performed.

【図７】本発明の一実施の形態である情報処理装置の保
守管理方法を実施する情報処理装置の一例であるディス
クアレイ装置の構成の一例を示す概念図である。FIG. 7 is a conceptual diagram illustrating an example of a configuration of a disk array device that is an example of an information processing device that performs a maintenance management method for the information processing device according to an embodiment of the present invention.

【図８】図７に例示したディスクアレイ装置に備えられ
たサービスプロセッサの構成の一例を示す概念図であ
る。FIG. 8 is a conceptual diagram showing an example of a configuration of a service processor provided in the disk array device illustrated in FIG.

【図９】本発明の一実施の形態であるディスクアレイ装
置の遠隔保守を行う障害回復支援センタの構成の一例を
示す概念図である。FIG. 9 is a conceptual diagram showing an example of a configuration of a failure recovery support center for performing remote maintenance of a disk array device according to an embodiment of the present invention.

【図１０】本発明の情報処理の一実施の形態であるディ
スクアレイ装置の制御記憶におけるマイクロプログラム
の格納状態の一例を示す概念図である。FIG. 10 is a conceptual diagram showing an example of a storage state of a microprogram in control storage of a disk array device which is one embodiment of the information processing of the present invention.

【図１１】本発明の一実施の形態である情報処理装置の
保守管理方法の作用の変形例を示すフローチャートであ
る。FIG. 11 is a flowchart showing a modification of the operation of the information processing apparatus maintenance management method according to the embodiment of the present invention;

【図１２】本発明の一実施の形態である情報処理装置の
変形例であるストレージエリアネットワークの構成の一
例を示す概念図である。FIG. 12 is a conceptual diagram illustrating an example of a configuration of a storage area network as a modification of the information processing apparatus according to an embodiment of the present invention.

[Explanation of symbols]

４００…ストレージエリアネットワーク、４０１…ディ
スクアレイ装置、４０２…情報ネットワーク（第１の情
報ネットワーク）、４０３…上位装置、４０４…接続イ
ンタフェース、４０５…ＳＡＮ管理センタ、４０６…保
守センタ、４０７…公衆回線（第２の情報ネットワー
ク）、６０１…ディスクアレイ装置、６０２…障害回復
支援センタ、６０３…ネットワーク、６０４…ディスク
アレイ装置設置サイト、６０５…ローカルエリアネット
ワーク、６０６…ローカルエリアネットワーク、７０１
…ホストコンピュータ、７０２…ホストＩ／Ｆ制御部、
７０２ａ…制御記憶、７０３…Ｄｉｓｋ制御部、７０３
ａ…制御記憶、７０４…Ｄｉｓｋ装置、７０５…共通バ
ス、７０６…共有メモリ、７０７…サービスプロセッ
サ、８００…中央処理装置、８０１…ディスプレイ、８
０２…キーボード、８０３…マウス、８０４…フロッピ
イディスクドライブ、８０５…ハードディスクドライ
ブ、８０６…ネットワークドライバ、８０７…ＳＶＰ内
共通バス、８０８…障害監視部、８０９…障害通報部、
８１０…マイクロ交換部、９００…中央処理装置、９０
１…ディスプレイ、９０２…キーボード、９０３…マウ
ス、９０４…フロッピイディスクドライブ、９０５…ハ
ードディスクドライブ、９０６…ネットワークドライ
バ、９０７…共通バス、９０８…障害対策データベー
ス、９０８ａ…障害対策ＩＤ、９０８ｂ…件名、９０８
ｃ…現象、９０８ｄ…原因、９０８ｅ…対策内容、９０
８ｆ…発生プログラムバージョン、９０８ｇ…対策プロ
グラムバージョン、９０８ｈ…事象、９０９…マイクロ
プログラムデータベース、９１０…障害対策ＩＤ検出プ
ログラム、９１１…バイナリサーチプログラム、９１２
…マイクロ送付プログラム、１０００…マイクロプログ
ラム、１０００ａ…ソースコード、１００１…モジュー
ル、１００１…ロードモジュール、１００２…障害対策
ＩＤ、１００３…対策内容。400: storage area network, 401: disk array device, 402: information network (first information network), 403: host device, 404: connection interface, 405: SAN management center, 406: maintenance center, 407: public line ( Second information network), 601: Disk array device, 602: Failure recovery support center, 603: Network, 604: Disk array device installation site, 605: Local area network, 606: Local area network, 701
ホスト Host computer 702… Host I / F control unit
702a: control storage, 703: Disk control unit, 703
a: control storage, 704: disk device, 705: common bus, 706: shared memory, 707: service processor, 800: central processing unit, 801: display, 8
02, keyboard, 803, mouse, 804, floppy disk drive, 805, hard disk drive, 806, network driver, 807, common bus in SVP, 808, fault monitoring unit, 809, fault notification unit,
810: Micro exchange unit, 900: Central processing unit, 90
1 ... Display, 902 ... Keyboard, 903 ... Mouse, 904 ... Floppy disk drive, 905 ... Hard disk drive, 906 ... Network driver, 907 ... Common bus, 908 ... Troubleshooting database, 908a ... Troubleshooting ID, 908b ... Subject, 908
c: phenomenon, 908d: cause, 908e: countermeasures, 90
8f: occurrence program version, 908g: countermeasure program version, 908h: event, 909: microprogram database, 910: failure countermeasure ID detection program, 911: binary search program, 912
... micro sending program, 1000 ... micro program, 1000a ... source code, 1001 ... module, 1001 ... load module, 1002 ... failure measure ID, 1003 ... measure contents.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ０６Ｆ 13/00 ３５１Ｇ０６Ｆ 15/16 ６２０Ｂ 15/16 ６２０ 15/177 ６７８Ａ 15/177 ６７８ 9/06 ４２０ＭＦターム(参考） 5B042 GA04 JJ02 5B045 AA00 BB12 BB28 BB47 GG06 JJ02 JJ09 JJ49 5B065 BA01 ZA07 5B076 AC07 EA06 5B089 GB03 JA34 JA35 JB07 JB17 KA13 KB09 KC28 KC44 ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) G06F 13/00 351 G06F 15/16 620B 15/16 620 15/177 678A 15/177 678 9/06 420M F Terms (reference) 5B042 GA04 JJ02 5B045 AA00 BB12 BB28 BB47 GG06 JJ02 JJ09 JJ49 5B065 BA01 ZA07 5B076 AC07 EA06 5B089 GB03 JA34 JA35 JB07 JB17 KA13 KB09 KC28 KC44

Claims

[Claims]

At least one information processing device operated by software and a maintenance support center for managing the software are connected via an information network, and when a failure occurs in the information processing device, the information is embedded in the software. By referring to the identifier of the software
A maintenance management method for an information processing apparatus, comprising: transmitting the software with the measure against the failure from the maintenance support center to the information processing apparatus to update the software in the information processing apparatus.

2. The method according to claim 1, wherein at the time of creating the software, a fault countermeasure identification indicating a type of a fault countermeasure applied to the software so far in the software. The maintenance support center reads the failure countermeasure identification information embedded in each of the plurality of pieces of software, identifies the software that has dealt with the failure of the information processing apparatus, and specifies the information processing. A maintenance management method for an information processing device, wherein the method is transmitted to the device.

3. The maintenance management method for an information processing apparatus according to claim 1, wherein the software includes a plurality of modules, and the software is applied to each of the modules so far. And transmitting the software from the maintenance support center to the information processing device and updating the software of the information processing device in units of the module. A maintenance management method for an information processing apparatus characterized by the following.

4. A method according to claim 1, wherein said information processing device is a disk array device, and said software is a microprogram. Maintenance management method.

5. An information processing apparatus operated by software and a maintenance support center for managing the software are connected via an information network, and the software running on the information processing apparatus and the maintenance support are arbitrarily triggered. A failure countermeasure that has been implemented for each of the plurality of software stored in the center is compared and determined via the information network, and the failure countermeasure that has not been implemented in the running software is stored in the maintenance support center. If the software has been applied, the first operation of issuing a warning of a potential failure occurrence to the maintenance manager of the information processing device, and the software running on the information processing device, A second operation of updating with the software stored in the maintenance support center via an information network; A maintenance management method for an information processing apparatus, wherein at least one operation is performed.

6. The maintenance management method for an information processing apparatus according to claim 5, wherein at the time of creating the software, a failure countermeasure identification indicating a type of a failure countermeasure applied to the software so far in the software. Information is embedded, and a comparison is made between the implemented troubleshooting measures based on the troubleshooting countermeasure identification information embedded in each of the software running the information processing apparatus and the software stored in the maintenance support center. A maintenance management method for an information processing apparatus, comprising:

7. The maintenance management method for an information processing apparatus according to claim 5, wherein the software includes a plurality of modules, and the software is applied to each of the modules so far. A failure management identification information indicating the type of the failure management implemented, and performing the comparison determination in units of the module.

8. A method according to claim 5, wherein said information processing device is a disk array device, and said software is a microprogram. Maintenance management method.

9. An information processing apparatus including a control unit operated by software and a network interface connected to a maintenance support center via an information network, wherein a fault monitor for monitoring the occurrence of a fault in the information processing apparatus is provided. Function, a failure notification function for notifying the occurrence of the failure to the maintenance support center, and software for receiving, from the maintenance support center, the software that has been subjected to the failure countermeasure against the failure, and replacing the software with an operating software. An information processing apparatus comprising control logic including an exchange function.

10. A plurality of storage devices, each operated by software, accessed from a plurality of higher-level devices via a first information network, and performing maintenance management of the storage devices via the first information network. An information processing apparatus including a management device for performing the management, and a maintenance support center connected to the management device via a second information network, wherein the management device monitors the occurrence of a failure in the storage device. Means, failure reporting means for notifying the maintenance support center of the occurrence of the failure, and software for receiving from the maintenance support center the software provided with the failure countermeasure for the failure and replacing the software with an operating software. An information processing apparatus comprising: an exchange unit.

11. The information processing apparatus according to claim 9, wherein the software has embedded therein fault countermeasure identification information indicating a type of a fault countermeasure performed so far on the software. Information processing device.

12. The information processing apparatus according to claim 9, wherein the control logic is configured to store the failure countermeasure identification information embedded in the running software and the maintenance support center at an arbitrary opportunity. And comparing with the trouble countermeasure identification information embedded in the software, and when a trouble countermeasure that has not been implemented in the running software is performed in the software on the maintenance support center side, the information processing is performed. A function of alerting a maintenance manager of a potential failure occurrence in the device and, if necessary, replacing the operating software with the software stored in the maintenance support center. Information processing device.

13. The information processing apparatus according to claim 9, 11 or 12, wherein the software is composed of a plurality of modules, and for each of the modules, a countermeasure against a fault that has been applied to the module so far is provided. An information processing apparatus, wherein failure countermeasure identification information indicating a type is embedded, and the software is exchanged for each module.

14. The information processing apparatus according to claim 9, wherein the control unit is a host interface control unit that controls transmission and reception of information to and from a higher-level device; A disk interface control unit that controls transmission and reception of the information to and from a plurality of stored disk storage devices, wherein the information processing device includes a plurality of the disk storage devices,
A disk array device that constitutes a disk array in which the information exchanged with the higher-level device and redundant information generated from the information are stored in a distributed manner; An information processing device, which is a service processor that performs management.

15. A method for creating software, comprising: a first step of creating or changing a source code; and a second step of converting the source code into a binary executable code mounted on an arbitrary information processing apparatus. In the first step, when a change is made to the source code as a countermeasure for a fault detected during the operation of the information processing apparatus, the fault countermeasure identification information indicating the type of the fault countermeasure is stored in the first step. A method for creating software, wherein identification information is added to the source code in a format inherited up to the execution code, and the failure countermeasure identification information is embedded in the execution code obtained in the second step.

16. The software creation method according to claim 15, wherein, when the execution code is composed of a plurality of load modules, the failure countermeasure identification information relating to each of the load modules is embedded. How to create software.

17. Binary software implemented in an information processing apparatus, wherein failure countermeasure identification information indicating a type of a failure countermeasure performed so far for the software is embedded in a part of the software. Software characterized by that.

18. The software according to claim 17, wherein the software includes a plurality of load modules, and the failure countermeasure identification information is embedded in each of the load modules.

19. The software according to claim 17, wherein the information processing device is a disk array device, and the software is a microprogram.