JPH07210331A

JPH07210331A - Disk array device

Info

Publication number: JPH07210331A
Application number: JP6001981A
Authority: JP
Inventors: Kenji Tsutsumi; 健次堤
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1994-01-13
Filing date: 1994-01-13
Publication date: 1995-08-11
Anticipated expiration: 2016-04-23
Also published as: JP3158832B2

Abstract

PURPOSE:To provide a disk array device capable of accurately detecting the states of respective built-in disk devices at the time of start. CONSTITUTION:When an initialization instruction from a host 21 is received, a disk array controller 13 sets parameters, then performs a test on whether or not a first disk device 14i is capable of normally performing read and write and writes management information in the management information storage area 15i of the disk device 14i when it is capable of normally performing the read and write. Also, When an (i)-th disk device 14i is not capable of normally performing the read and write, '1' is added to a counter, the contents of identification information is rewritten to (i), an error message indicating that it can not be operated even as the disk array device whose redundancy degree is zero is outputted when the counter is more than '2' and an initialization processing is ended. '1' is added to (i) when the counter is equal to or less than '1' and the test and the write of the management information are performed to the next disk device when a variable (i) is equal to or less than the number of the entire disk devices.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、ディスクアレイ装置に
係わり、特に、データを冗長構成にして格納するディス
クアレイ装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a disk array device, and more particularly to a disk array device for storing data in a redundant configuration.

【０００２】[0002]

【従来の技術】ディスクアレイ装置は、単体でも動作可
能な複数台のディスク装置を組み合わせた記憶装置であ
り、コンピュータなどのホスト装置からは、１台のディ
スク装置として扱えるように構成される。2. Description of the Related Art A disk array device is a storage device that is a combination of a plurality of disk devices that can operate independently, and is configured so that a host device such as a computer can handle it as one disk device.

【０００３】図１４に、従来のディスクアレイ装置の構
成の一例を示す。ディスクアレイ装置１１は、インタフ
ェース１２とディスクアレイコントローラ１３と複数台
のディスク装置１４₁ 〜１４₅ で構成される。内蔵され
るディスク装置１４の台数は、ディスクアレイ装置によ
り異なる。このようなディスクアレイ装置には、データ
のアクセス速度の向上を目的としたものと、記憶装置と
しての信頼性の向上を目的としたものがあり、後者にお
いては、書き込み要求のなされたデータは、冗長度が付
加されて、ディスクアレイ装置内に格納される。FIG. 14 shows an example of the configuration of a conventional disk array device. The disk array device 11 comprises an interface 12, a disk array controller 13, and a plurality of disk devices 14 _{1 to} 14 ₅ . The number of built-in disk devices 14 differs depending on the disk array device. Such disk array devices include those for the purpose of improving the data access speed and those for the purpose of improving the reliability as a storage device. In the latter, the data for which a write request is made is Redundancy is added and stored in the disk array device.

【０００４】図１５を用いて、このデータの書き込み動
作の説明を簡単に行う。なお、ここで説明に用いる冗長
構成は、ＲＡＩＤ(Redundant Arrays of Inexpensive D
isks) のレベル３と呼ばれるものである。The data write operation will be briefly described with reference to FIG. The redundant configuration used here is a RAID (Redundant Arrays of Inexpensive D).
It is called level 3 of isks).

【０００５】ディスクアレイコントローラ１３は、ホス
トから書き込み要求がなされたデータ２０を、所定のサ
イズ、たとえば、セクタ単位に分割し、ディスク装置１
４₁〜１４₄の記憶領域１６₁ 〜１６₄に分散させて書
き込む。このとき、ディスクアレイコントローラ１３
は、データＤ₁、Ｄ₂、Ｄ₃、Ｄ₄の排他的論理和演算
を行い、その演算結果であるパリティＰを、ディスク装
置１４₅の記録領域１６ ₅に書き込むことも、併せて実
行する。The disk array controller 13 is a host
Data 20 for which a write request has been
Size, for example, divided into sector units, and the disk device 1
Four₁~ 14_FourStorage area 16₁ ~ 16_FourDispersed and written
Imprint At this time, the disk array controller 13
Is the data D₁, D₂, D₃, D_FourExclusive OR operation of
And the parity P that is the calculation result is
Storage 14_FiveRecording area 16 _FiveYou can also write to
To go.

【０００６】このような書き込み動作の結果、ディスク
アレイ装置内には、あるディスク装置に格納されたデー
タが、他のディスク装置に格納されたデータおよびパリ
ティから算出することができる状態が形成される。たと
えば、データＤ₄は、データＤ₁ 、Ｄ₂ 、Ｄ₃、パリテ
ィＰの排他的論理和演算結果と一致する。このため、読
み出し要求があったデータが実際に格納されたディスク
装置を使用することなく、その要求に応答することが可
能である。ディスクアレイ装置では、内蔵するディスク
装置の１台にアクセス障害が発生した際にそのディスク
に対する読み出し要求があった場合には、このような形
で他のディスク装置に格納された情報から、必要とされ
るデータを再構築することにより、その読み出し要求に
応答を行なう。As a result of such a write operation, a state in which the data stored in a certain disk device can be calculated from the data stored in another disk device and the parity is formed in the disk array device. . For example, the data D ₄ matches the exclusive OR operation result of the data D ₁ , D ₂ , D ₃ and the parity P. Therefore, it is possible to respond to the request without using the disk device in which the data requested to be read is actually stored. In a disk array device, if a read request is made to the disk when an access failure occurs in one of the built-in disk devices, it is necessary to use the information stored in other disk devices in this way. Responding to the read request by reconstructing the stored data.

【０００７】この再構築による応答か否かを判断するに
は、ディスクアレイコントローラに、アクセス障害を有
するディスク装置を認識させることが必要である。この
ための手段としては、さまざまなものがあり、たとえ
ば、ディスクアレイコントローラ内のメモリに、障害を
有するディスク装置（障害ディスク装置）を特定する情
報を記憶させておくことも行なわれている。In order to determine whether the response is due to this rebuilding, it is necessary for the disk array controller to recognize the disk device having the access failure. There are various means for this purpose, and, for example, it has been practiced to store information for specifying a disk device having a failure (failed disk device) in a memory in the disk array controller.

【０００８】しかし、障害ディスク装置を特定する情報
が揮発性メモリに記憶されている場合には、その情報
が、システム停止時に失われてしまうので、ディスクア
レイ装置の使用者は、再立ち上げ時に、障害ディスク装
置を特定する情報を再設定しなければならないという問
題があった。このため、障害ディスク装置を特定する情
報を揮発性メモリ以外に格納するさまざまな装置が提案
されている。However, when the information for identifying the failed disk device is stored in the volatile memory, the information is lost when the system is stopped. Therefore, the user of the disk array device is required to restart the system. However, there is a problem that the information specifying the failed disk device must be reset. For this reason, various devices have been proposed that store information specifying the failed disk device in addition to the volatile memory.

【０００９】たとえば、特開平３−２２９３３１号公報
には、２重化ディスク装置の状態記憶に不揮発性メモリ
を用いる装置が開示されている。For example, Japanese Unexamined Patent Publication No. 3-229331 discloses a device using a non-volatile memory for state storage of a dual disk device.

【００１０】図１６に、その２重化ディスク装置の概要
を示す。このシステムは、２台のディスク装置２２とそ
れらを統合的に制御する制御部２３と不揮発性記憶部２
４とオペレーティングシステム（ＯＳ）２５で構成され
る。この２重化ディスク装置では、１台のディスク装置
にアクセス障害が発生した際には、他方のディスク装置
に対応するディスク装置識別情報が、不揮発性記憶部２
４に書き込まれる。そして、装置の再立ち上げ時には、
この不揮発性記憶部２４に書き込まれたディスク装置識
別情報から、正常に機能するディスク装置が特定され
る。FIG. 16 shows an outline of the duplicated disk device. This system includes two disk devices 22, a control unit 23 that integrally controls them, and a nonvolatile storage unit 2.
4 and an operating system (OS) 25. In this duplicated disk device, when an access failure occurs in one disk device, the disk device identification information corresponding to the other disk device is stored in the nonvolatile storage unit 2.
Written to 4. And when restarting the device,
From the disk device identification information written in the non-volatile storage unit 24, a normally functioning disk device is specified.

【００１１】また、特公平２−３２６５２号公報には、
ディスク媒体上に記録された情報から、ディスク装置の
状態を判定する２重化ディスク装置が開示されている。Further, Japanese Patent Publication No. 2-32652 discloses that
A dual disk device is disclosed which determines the status of the disk device from the information recorded on the disk medium.

【００１２】図１７を用いて、この２重化ディスク装置
の構成と動作の説明を行なう。特公平２−３２６５２号
公報記載の２重化ディスク装置は、２台のディスク装置
２２ ₁、２２₂とチャネル装置２６₁、２６₂と中央処
理装置２７で構成されており、各ディスク装置２２のデ
ィスク媒体上には、ヒストリカウンタ部２９とヒストリ
カウンタ検証部３０が設けられ、中央処理装置２７内の
主メモリ２８には、各ディスク装置が正常に機能するか
否かを示す情報を記憶するための制御テーブル３１が設
けられている。Referring to FIG. 17, this dual disk device is used.
The configuration and operation of will be described. Japanese Patent Publication No. 2-32652
The duplicated disk device disclosed in the publication is two disk devices.
22 ₁, 22₂And channel device 26₁, 26₂And central office
It is composed of the processing device 27, and the data of each disk device 22 is
The history counter 29 and the history are recorded on the disk medium.
A counter verification unit 30 is provided, and in the central processing unit 27
Whether each disk device functions normally in the main memory 28
A control table 31 for storing information indicating whether or not
It has been burned.

【００１３】中央処理装置２７は、両方のディスク装置
２２が、正常に機能しているときに、それぞれのヒスト
リカウンタ部２９に同じ値を書き込んでおく。そして、
一方のディスク装置、たとえば、ディスク装置２２₁に
アクセス障害が発生した際には、他方のディスク装置２
２₂のヒストリカウンタ部２９₂の値を増加させる。そ
して、装置の再立ち上げ時には、ヒストリカウンタ部２
９₁と２９₂の値を読み出して比較し、それらの値が異
なるときには、大きな値が書き込まれている方のディス
ク装置が正常に機能する装置であると判断する。The central processing unit 27 writes the same value in each history counter unit 29 when both the disk devices 22 are functioning normally. And
When an access failure occurs in _one disk device, for example, the disk device 22 ₁ , the other disk device 2 1
Increase 2 ₂ values of the history counter 29 _2. When the device is restarted, the history counter unit 2
The values of 9 ₁ and 29 ₂ are read and compared, and when the values are different, it is determined that the disk device in which the larger value is written is a device that functions normally.

【００１４】なお、ヒストリカウンタ検証部３０は、ヒ
ストリカウンタ部２９に隣接する記憶領域であり、この
２重化ディスク装置では、この部分に書き込まれたデー
タの変更の有無を確認することにより、ヒストリカウン
タ部の内容の正否の判定を行っている。The history counter verification unit 30 is a storage area adjacent to the history counter unit 29, and in this duplicated disk device, the history is verified by checking whether or not the data written in this portion has been changed. Whether the contents of the counter section are correct or not is determined.

【００１５】[0015]

【発明が解決しようとする課題】以上説明した２重化デ
ィスク装置に対する２つの技術では、確かに、システム
が停止しても、障害ディスク装置を特定する情報は失わ
れることはない。しかし、これらの技術は、２重化ディ
スク装置に対するものであり、３台以上のディスク装置
を備えるディスクアレイ装置に直接適用できるものでは
ない。According to the two techniques for the duplicated disk device described above, the information for specifying the failed disk device is not lost even if the system is stopped. However, these techniques are for a duplicated disk device, and cannot be directly applied to a disk array device having three or more disk devices.

【００１６】また、これらの技術では、システム停止時
に、ディスク装置の交換がなされた場合に対する対処が
行なわれていないため、以下のような問題が生ずること
が考えられる。たとえば、不揮発性メモリにディスク装
置の状態を記憶する２重化ディスク装置では、システム
停止時にディスク装置が交換された場合には、そのこと
が、不揮発性メモリ内の情報に反映されない。このた
め、正常に動作する方のディスク装置が誤って交換され
た場合、制御部は、情報が格納されていないディスク装
置が正常な装置であると判断して、動作を開始してしま
う。Further, these techniques do not deal with the case where the disk device is replaced when the system is stopped, so that the following problems may occur. For example, in a duplicated disk device that stores the state of the disk device in a non-volatile memory, if the disk device is replaced when the system is stopped, this is not reflected in the information in the non-volatile memory. For this reason, when the normally operating disk device is replaced by mistake, the control unit determines that the disk device in which information is not stored is a normal device and starts operating.

【００１７】また、ディスク媒体上にヒストリカウンタ
を設ける２重化ディスク装置では、正常な方のディスク
装置が、新しいディスク装置に交換されて場合、そのヒ
ストリカウンタ検証部に格納された情報が正しいもので
ないため、その装置は正常に機能するものではないと判
断され、障害ディスク装置の方が正常に機能するものと
して動作が開始される。In the duplicated disk device having the history counter on the disk medium, when the normal disk device is replaced with a new disk device, the information stored in the history counter verification unit is correct. Therefore, it is determined that the device does not function normally, and the operation of the failed disk device is started assuming that it functions normally.

【００１８】そこで本発明の目的は、立ち上げ時に、ア
クセス障害を有するディスク装置を検出することができ
るディスクアレイ装置を提供することにある。Therefore, an object of the present invention is to provide a disk array device capable of detecting a disk device having an access failure at startup.

【００１９】また、本発明の他の目的は、システム停止
時にディスク装置の交換が行なわれても、それぞれのデ
ィスク装置の状態を、正確に判定することができるディ
スクアレイ装置を提供することにある。Another object of the present invention is to provide a disk array device which can accurately determine the state of each disk device even if the disk device is replaced when the system is stopped. .

【００２０】[0020]

【課題を解決するための手段】請求項１記載の発明は、
データを格納するための複数台のディスク装置と、これ
ら全てのディスク装置を巡回する順序を設定する設定手
段と、複数台のディスク装置のそれぞれの所定の記憶領
域に、各ディスク装置の状態の判定に用いる履歴情報と
して同一内容の数値情報を書き込む初期化手段と、ディ
スク装置にアクセス障害が発生したことを検出する検出
手段と、この検出手段によりアクセス障害が検出された
ディスク装置を除いたディスク装置を設定手段で設定さ
れた順序で１台ずつ指定する指定手段と、この指定手段
で指定されたディスク装置の所定の記憶領域に格納され
た履歴情報をその値が所定量増加するように書き換える
書換手段と、立ち上げ時に、複数台のディスク装置の所
定の記憶領域に格納されている履歴情報を読み出す読出
手段と、この読出手段が読みだした履歴情報を基に、設
定手段により後に順序付けられたディスク装置の履歴情
報の方が大きいディスク装置を探索する探索手段と、こ
の探索手段により探索されたディスク装置をアクセス障
害を有するディスク装置であると判断する判断手段とを
具備する。The invention according to claim 1 is
A plurality of disk devices for storing data, setting means for setting the order in which all of these disk devices are circulated, and determination of the status of each disk device in a predetermined storage area of each of the plurality of disk devices Initializing means for writing numerical information having the same contents as history information used for recording, detecting means for detecting occurrence of access failure in the disk device, and disk device except for the disk device in which the access failure is detected by the detecting means. And a rewriting means for rewriting the history information stored in a predetermined storage area of the disk device designated by the designating means in the order set by the setting means so that the value increases by a predetermined amount. A reading means for reading history information stored in a predetermined storage area of a plurality of disk devices at the time of startup, and a reading means for reading the history information. Based on the history information read by the stage, the setting means has a searching means for searching a disk device having a larger history information of the disk devices later ordered, and the disk device searched by this searching means has an access failure. And a determination unit that determines that the device is a disk device.

【００２１】すなわち請求項１記載の発明では、ディス
クアレイ装置の初期化時に、各ディスク装置の所定の記
憶領域に、同一の履歴情報を書き込んでおき、通常動作
中に、ディスク装置にアクセス障害が発生した際には、
その障害ディスク装置を除くディスク装置の履歴情報
を、その内容が所定量、たとえば、“１”だけ増加する
ように書き換えていく。この書換は、設定手段により設
定された順序で行なわれる。That is, according to the first aspect of the invention, when the disk array device is initialized, the same history information is written in a predetermined storage area of each disk device, and during normal operation, an access failure occurs in the disk device. When it happens,
The history information of the disk devices excluding the faulty disk device is rewritten so that the contents increase by a predetermined amount, for example, "1". This rewriting is performed in the order set by the setting means.

【００２２】読出手段は、立ち上げ時に、それぞれのデ
ィスク装置内の履歴情報の読み出しを行い、探索手段
は、設定手段により設定されている順序において、連続
する２台のディスク装置に格納されていた履歴情報の比
較を行い、後に順序付けられたディスク装置の履歴情報
の方が大きくなるディスク装置の探索を行なう。判断手
段は、この探索手段で探索されたディスク装置がアクセ
ス障害を有するディスク装置であると判断する。このよ
うに、所定の順序で履歴情報の書き換えを行なっている
ので、書換手段による履歴情報の書き換えが中断された
場合にも、立ち上げ時に、正確にアクセス障害を有する
ディスク装置が存在するか否かが判定できることにな
る。The reading means reads the history information in each disk device at the time of startup, and the searching means is stored in two consecutive disk devices in the order set by the setting means. The history information is compared, and a disk device in which the ordered history information of the disk device is larger is searched later. The judging means judges that the disk device searched by the searching means is a disk device having an access failure. As described above, since the history information is rewritten in a predetermined order, even if the history information rewriting by the rewriting means is interrupted, whether or not there is a disk device having an accurate access failure at the time of start-up. It will be possible to judge whether or not.

【００２３】請求項２記載の発明は、データを格納する
ための複数台のディスク装置と、これら全てのディスク
装置を巡回する順序を設定する設定手段と、複数台のデ
ィスク装置のそれぞれの所定の記憶領域に、各ディスク
装置の状態の判定に用いる管理情報としてそのディスク
装置が接続されるスロット番号と数値情報である履歴情
報とを書き込む初期化手段と、ディスク装置にアクセス
障害が発生したことを検出する検出手段と、この検出手
段によりアクセス障害が検出されたディスク装置を除い
たディスク装置を設定手段により設定された順序で１台
ずつ指定する指定手段と、この指定手段で指定されたデ
ィスク装置の所定の記憶領域に格納された履歴情報をそ
の値が所定量増加するように書き換える書換手段と、立
ち上げ時に、複数台のディスク装置のそれぞれの所定の
記憶領域に格納されている管理情報を読み出す読出手段
と、この読出手段が読みだした管理情報中のスロット番
号が初期化手段により書き込まれたものと一致している
か否かを判定する判定手段と、この判定手段が一致して
いないと判定したディスク装置から読みだした履歴情報
を初期化手段が書き込みに用いる履歴情報より小さな値
の履歴情報に変更する変更手段と、それぞれのディスク
装置に応じた履歴情報を基に、設定手段により後に順序
付けられたディスク装置の履歴情報の方が大きいディス
ク装置を探索する探索手段と、この探索手段により探索
されたディスク装置に対応する履歴情報が変更手段によ
り変更されたものであった場合に、そのディスク装置が
交換されたディスク装置であると判断し、それ以外の場
合には、そのディスク装置がアクセス障害を有するディ
スク装置であると判断する判断手段とを具備するAccording to a second aspect of the present invention, a plurality of disk devices for storing data, setting means for setting the order in which all of these disk devices are circulated, and a predetermined number for each of the plurality of disk devices. Initializing means for writing the slot number to which the disk device is connected and the history information, which is numerical information, to the storage area as management information used for determining the state of each disk device, and an access failure occurred in the disk device. Detecting means for detecting, specifying means for specifying one by one the disk devices excluding the disk device in which the access failure is detected by the detecting means, in the order set by the setting means, and the disk device specified by this specifying means Rewriting means for rewriting the history information stored in the predetermined storage area so that the value increases by a predetermined amount, and a plurality of Read-out means for reading out the management information stored in each predetermined storage area of the disk device, and whether the slot number in the management information read out by the read-out means matches the one written by the initialization means. Determination means for determining whether or not there is a change means for changing the history information read from the disk device, which is determined by the determination means to be inconsistent, to history information having a smaller value than the history information used by the initialization means for writing Corresponding to a searching means for searching a disk device having a larger history information of the disk device later ordered by the setting means, based on the history information corresponding to each disk device, and the disk device searched by this searching means If the history information to be changed has been changed by the changing means, it is determined that the disk device has been replaced. And, in other cases, the disk apparatus comprises a determining means for determining that the disk device having an access failure

【００２４】すなわち請求項２記載の発明では、ディス
クアレイ装置の初期化時に、各ディスク装置の所定の記
憶領域に、スロット情報と履歴情報とからなる管理情報
を書き込んでおき、通常動作中に、ディスク装置にアク
セス障害が発生した際には、その障害ディスク装置を除
くディスク装置の履歴情報を、その内容が所定量、たと
えば、“１”だけ増加するように書き換えていく。この
書換は、設定手段により設定された順序に従い行なわれ
る。That is, according to the second aspect of the invention, at the time of initialization of the disk array device, management information including slot information and history information is written in a predetermined storage area of each disk device, and during normal operation, When an access failure occurs in the disk device, the history information of the disk devices other than the failed disk device is rewritten so that the contents increase by a predetermined amount, for example, "1". This rewriting is performed according to the order set by the setting means.

【００２５】読出手段は、立ち上げ時に、それぞれのデ
ィスク装置内の管理情報の読み出しを行い、判定手段
は、管理情報中のスロット情報が初期化手段により書き
込まれたものであるか否かを判定する。変更手段は、こ
の判定手段によりスロット情報が正常でないと判定され
たディスク装置に対応する履歴情報を、初期化手段が書
き込みに用いる履歴情報より小さな値に変更する。な
お、この手段では、ディスク装置内の履歴情報が書き換
えられるのではなく、読み出された履歴情報に変更が加
えられるだけである。そして、探索手段は、設定手段に
より連続するよう順序付けられた２台のディスク装置に
対応する履歴情報の比較を行い、後に順序付けられたデ
ィスク装置の履歴情報の方が大きくなるディスク装置の
探索を行なう。判断手段は、この探索手段により探索さ
れたディスク装置に対応する履歴情報が変更手段により
変更されたものであった場合に、そのディスク装置が交
換されたディスク装置であると判断し、その履歴情報が
変更手段により変更されたものでなかった場合には、そ
のディスク装置がアクセス障害を有するディスク装置で
あると判断する。このように、所定の順序による履歴情
報の書き換えと、スロット情報を用いた判定を行なって
いるので、書換手段による履歴情報の書き換えが中断さ
れた場合や、システム停止時にディスク装置が交換され
た場合にも、正確に、アクセス障害を有するディスク装
置および正しく交換されたディスク装置が存在するか否
かが判定できることになる。The reading means reads the management information in each disk device at the time of start-up, and the judging means judges whether or not the slot information in the management information is written by the initializing means. To do. The changing means changes the history information corresponding to the disk device for which the slot information is judged to be not normal by the judging means to a value smaller than the history information used for writing by the initializing means. Note that this means does not rewrite the history information in the disk device, but only changes the read history information. Then, the searching means compares the history information corresponding to the two disk devices sequentially ordered by the setting means, and searches for a disk device in which the history information of the disk devices ordered later becomes larger. . If the history information corresponding to the disk device searched by the searching means is changed by the changing means, the judging means judges that the disk device is a replaced disk device, and the history information Is not changed by the changing means, it is determined that the disk device is a disk device having an access failure. In this way, since the history information is rewritten in the predetermined order and the determination is performed using the slot information, when the rewriting of the history information by the rewriting means is interrupted or the disk device is replaced when the system is stopped. Also, it is possible to accurately determine whether or not there is a disk device having an access failure and a correctly replaced disk device.

【００２６】請求項３記載の発明は、データを格納する
ための複数台のディスク装置と、これら全てのディスク
装置を巡回する順序を設定する設定手段と、複数台のデ
ィスク装置のそれぞれの所定の記憶領域に、各ディスク
装置の状態の判定に用いる履歴情報として第１の履歴情
報を書き込む初期化手段と、ディスク装置にアクセス障
害が発生したことを検出する検出手段と、この検出手段
によりアクセス障害が検出されたディスク装置の設定手
段により後に順序付けられるディスク装置を指定する指
定手段と、この指定手段で指定されたディスク装置の所
定の記憶領域の内容を第１の履歴情報とは異なる第２の
履歴情報に書き換える書換手段と、立ち上げ時に、それ
ぞれのディスク装置の所定の記憶領域に格納された情報
を読み出す読出手段と、この読出手段により読み出され
た情報を基に、第２の履歴情報と一致する情報が記憶さ
れたディスク装置を探索する探索手段と、この探索手段
により探索されたディスク装置の設定手段により前に順
序付けられたディスク装置がアクセス障害を有するディ
スク装置であると判断する判断手段とを具備する。According to a third aspect of the present invention, a plurality of disk devices for storing data, setting means for setting the order in which all of these disk devices are circulated, and a predetermined number for each of the plurality of disk devices. An initialization unit that writes first history information as history information used to determine the state of each disk device in the storage area, a detection unit that detects that an access failure has occurred in the disk device, and an access failure by this detection unit. Of the disk device specified by the specifying means and the content of the predetermined storage area of the disk device specified by this specifying means are different from the first history information. Rewriting means for rewriting to history information and a reading means for reading out information stored in a predetermined storage area of each disk device at startup. And a searching means for searching for a disk device in which information matching the second history information is stored based on the information read by the reading means, and a setting means for the disk device searched by the searching means. And a discriminating means for discriminating that the previously ordered disk device is a disk device having an access failure.

【００２７】すなわち請求項３記載の発明では、ディス
クアレイ装置の初期化時に、各ディスク装置の所定の記
憶領域に、同一内容の第１の履歴情報を書き込んでお
き、通常動作中に、ディスク装置にアクセス障害が発生
した際には、その障害ディスク装置の後に順序付けられ
たディスク装置の履歴情報を、第１の履歴情報とは異な
る第２の履歴情報に書き換える。That is, according to the third aspect of the invention, when the disk array device is initialized, the first history information having the same contents is written in a predetermined storage area of each disk device, and the disk device is operated during normal operation. When an access failure occurs in, the history information of the disk devices ordered after the failed disk device is rewritten to the second history information different from the first history information.

【００２８】読出手段は、立ち上げ時に、それぞれのデ
ィスク装置内の履歴情報の読み出しを行い、探索手段
は、第２の履歴情報が記憶されているディスク装置の探
索を行なう。そして、判断手段は、この探索手段で探索
されたディスク装置の、設定手段により前に順序付けら
れたディスク装置がアクセス障害を有するディスク装置
であると判断する。このように、所定の順序で履歴情報
の書き換えを行なっているので、書換手段による履歴情
報の書き換えが中断された場合にも、立ち上げ時に、正
確にアクセス障害を有するディスク装置が存在するか否
かが判定できることになる。The reading means reads the history information in the respective disk devices at the time of startup, and the searching means searches the disk device in which the second history information is stored. Then, the judging means judges that the disk device searched by the searching means and previously ordered by the setting means is a disk device having an access failure. As described above, since the history information is rewritten in a predetermined order, even if the history information rewriting by the rewriting means is interrupted, whether or not there is a disk device having an accurate access failure at the time of start-up. It will be possible to judge whether or not.

【００２９】請求項４記載の発明は、データを格納する
ための複数台のディスク装置と、これら全てのディスク
装置を巡回する順序を設定する設定手段と、複数台のデ
ィスク装置のそれぞれの所定の記憶領域に、各ディスク
装置の状態の判定に用いる履歴情報として第１の履歴情
報を書き込む初期化手段と、ディスク装置にアクセス障
害が発生したことを検出する検出手段と、この検出手段
によりアクセス障害が検出されたディスク装置の設定手
段により後に順序付けられるディスク装置を指定する指
定手段と、この指定手段で指定されたディスク装置の所
定の記憶領域の内容を第１の履歴情報とは異なる第２の
履歴情報に書き換える書換手段と、立ち上げ時に、それ
ぞれのディスク装置の所定の記憶領域に格納された情報
を読み出す読出手段と、この読出手段により読み出され
た情報を基に、第２の履歴情報と一致する情報が記憶さ
れたディスク装置の設定手段により前に順序付けられた
ディスク装置を特定する第１の特定手段と、読出手段に
より読み出された情報を基に、第１および第２の履歴情
報のいずれとも一致しない情報が記憶されているディス
ク装置を特定する第２の特定手段と、これら第１および
第２の特定手段で特定されたディスク装置が同じ装置で
あったときに、そのディスク装置を正しく交換されたデ
ィスク装置であると判断する第１の判断手段と、第２の
特定手段による特定が行なわれておらず、第１の特定手
段によるディスク装置の特定が行なわれていたときに、
第１の特定手段で特定されたディスク装置がアクセス障
害を有するディスク装置であると判断する第２の判断手
段とを具備する。According to a fourth aspect of the present invention, a plurality of disk devices for storing data, setting means for setting the order in which all of these disk devices are circulated, and a predetermined number for each of the plurality of disk devices. An initialization unit that writes first history information as history information used to determine the state of each disk device in the storage area, a detection unit that detects that an access failure has occurred in the disk device, and an access failure by this detection unit. Of the disk device specified by the specifying means and the content of the predetermined storage area of the disk device specified by this specifying means are different from the first history information. Rewriting means for rewriting to history information and a reading means for reading out information stored in a predetermined storage area of each disk device at startup. And first specifying means for specifying the disk device ordered in advance by the setting means of the disk device in which the information matching the second history information is stored, based on the information read by the reading means. Second identifying means for identifying a disk device storing information that does not match any of the first and second history information based on the information read by the reading means, and the first and second When the disk device specified by the specifying device is the same device, the first judging device for judging that the disk device is a properly replaced disk device and the specifying device by the second specifying device. When the disk device is specified by the first specifying means,
And a second judging means for judging that the disk device specified by the first specifying device is a disk device having an access failure.

【００３０】すなわち請求項４記載の発明では、ディス
クアレイ装置の初期化時に、各ディスク装置の所定の記
憶領域に、同一内容の第１の履歴情報を書き込んでお
き、通常動作中に、ディスク装置にアクセス障害が発生
した際には、その障害ディスク装置の後に順序付けられ
たディスク装置の履歴情報を、第１の履歴情報とは異な
る第２の履歴情報に書き換える。That is, in the invention described in claim 4, the first history information having the same content is written in a predetermined storage area of each disk device at the time of initialization of the disk array device, and the disk device is operated during normal operation. When an access failure occurs in, the history information of the disk devices ordered after the failed disk device is rewritten to the second history information different from the first history information.

【００３１】読出手段は、立ち上げ時に、それぞれのデ
ィスク装置内の履歴情報の読み出しを行い、第１の特定
手段は、第２の履歴情報が記憶されているディスク装置
の探索して、そのようなディスク装置が探索できた場合
には、探索されたディスク装置の前に順序付けられたデ
ィスク装置の特定を行なう。また、第２の特定手段は、
第１および第２の履歴情報とは異なる履歴情報を有する
ディスク装置を探索して、そのようなディスク装置が探
索できた場合には、探索されたディスク装置の特定（識
別情報の記憶）を行なう。The reading means reads the history information in each disk device at the time of start-up, and the first specifying means searches the disk device in which the second history information is stored and If such a disk device can be searched for, an ordered disk device is specified before the searched disk device. The second specifying means is
A disk device having history information different from the first and second history information is searched, and when such a disk device can be searched, the searched disk device is specified (identification information is stored). .

【００３２】そして、第１の判断手段は、これら第１お
よび第２の特定手段で特定されたディスク装置が同じ装
置であったときに、そのディスク装置を正しく交換され
たディスク装置であると判断し、第２の判断手段は、第
２の特定手段による特定が行なわれておらず、第１の特
定手段によるディスク装置の特定が行なわれていたとき
に、第１の特定手段で特定されたディスク装置がアクセ
ス障害を有するディスク装置であると判断する。このよ
うに、２種の履歴情報しか用いず、所定の順序で履歴情
報の書き換えを行なっているので、システム停止時にデ
ィスク装置が交換された場合にも、アクセス障害を有す
るディスク装置および正しく交換されたディスク装置が
存在するか否かが判定できることになる。Then, the first judging means judges that the disk device is a disk device which has been properly exchanged when the disk devices specified by the first and second specifying means are the same device. However, the second determining means is specified by the first specifying means when the second specifying means has not been specified and the first specifying means is specifying the disk device. It is determined that the disk device has the access failure. In this way, since only two types of history information are used and history information is rewritten in a predetermined order, even if a disk device is replaced when the system is stopped, a disk device having an access failure and a correct replacement will be performed. It is possible to determine whether or not there is another disk device.

【００３３】[0033]

【実施例】以下、実施例につき本発明を詳細に説明す
る。EXAMPLES The present invention will be described in detail below with reference to examples.

【００３４】第１の実施例 First embodiment

【００３５】図１に、本発明の第１の実施例によるディ
スクアレイ装置の概要を示す。第１の実施例のディスク
アレイ装置１１は、インターフェース１２とディスクア
レイコントローラ１３と５台のディスク装置１４₁ ない
し１４₅ で構成される。ディスク装置１４は、単独でも
動作可能なディスク装置であり、それぞれのディスク装
置の所定の記憶領域には、ディスクアレイ装置の状態判
定に用いる管理情報を格納するための管理情報記憶領域
１５が設けられている。ディスクアレイコントローラ１
３は、コンピュータなどのホスト２１からのデータのア
クセス要求に応答するために、これら５台のディスク装
置１４₁ ないし１４₅ を統合的に制御する制御部であ
る。なお、ディスクアレイコントローラ１３は、制御用
プロセッサと、制御用プロセッサの動作を規定するプロ
グラムが格納されたプログラムメモリと、その動作時に
使用するパラメータを格納するメモリと、各ディスク装
置とのインタフェースで構成されている。FIG. 1 shows an outline of a disk array device according to the first embodiment of the present invention. The disk array device 11 of the first embodiment comprises an interface 12, a disk array controller 13 and five disk devices 14 ₁ to 14 ₅ . The disk device 14 is a disk device that can operate independently, and a predetermined storage area of each disk device is provided with a management information storage area 15 for storing management information used for determining the state of the disk array device. ing. Disk array controller 1
A control unit ₃ integrally controls these five disk devices 14 ₁ to 14 ₅ in order to respond to a data access request from a host 21 such as a computer. The disk array controller 13 includes a control processor, a program memory that stores a program that defines the operation of the control processor, a memory that stores parameters used during the operation, and an interface with each disk device. Has been done.

【００３６】このディスクアレイ装置の動作は、ディス
クアレイ装置の使用に先駆けて行われる初期化処理と、
通常動作中に、ディスク装置に障害が発生した際に行わ
れる管理情報更新処理と、各ディスク装置の状態を判
定、認識するために、装置の立ち上げ時に行われる状態
判定処理に大別される。まず、初期化処理について説明
を行うことにする。The operation of this disk array device includes an initialization process which is performed prior to the use of the disk array device.
It is roughly divided into management information update processing that is performed when a failure occurs in a disk device during normal operation, and status determination processing that is performed when the device is started up in order to determine and recognize the status of each disk device. . First, the initialization process will be described.

【００３７】図２に、初期化処理時におけるディスクア
レイコントローラの動作の流れを示す。ここで、処理対
象とされる各ディスク装置は、いわゆるフォーマットが
完了した装置であり、ディスクアレイコントローラ１３
は、インターフェース１２を介してホスト２１からの初
期化指示を受信すると、カウンタＣ_FAILと識別情報Ｎ
_FAILに、それぞれ“０”をセットし、また、変数ｉに
“１”をセット（ステップＳ１０１）する。カウンタＣ
_FAILは、正常でないディスク装置の台数の記憶に用いら
れ、識別情報Ｎ_FAILは、カウンタＣ_FAILでカウントされ
たディスク装置のうち、１台のディスク装置の識別情報
の記憶に用いられるパラメータである。FIG. 2 shows a disk array at the time of initialization processing.
The operation flow of the ray controller is shown. Where the processing pair
Each disk device, which is supposed to be an elephant, has a so-called format.
It is a completed device, and the disk array controller 13
From the host 21 via interface 12
When receiving the periodization instruction, the counter C_FAILAnd identification information N
_FAILTo "0" respectively, and to the variable i
"1" is set (step S101). Counter C
_FAILIs used to store the number of abnormal disk units.
Identification information N_FAILIs the counter C_FAILIs counted in
Identification information of one disk device
Is a parameter used to store the.

【００３８】パラメータの設定を終えたディスクアレイ
コントローラ１３は、ｉ番のディスク装置１４_iが正常
に読み書きできるかどうかのテストを行い（ステップＳ
１０２）、正常に読み書き可能である場合には（ステッ
プＳ１０３；Ｙ）、そのディスク装置１４_iの管理情報
記憶領域１５_iに管理情報を書き込み（ステップＳ１０
４）、ステップＳ１０７に進む。なお、ステップＳ１０
４では、ディスクアレイ装置を識別するための装置識別
情報と、接続されたスロット番号に応じた情報であるス
ロット情報と、履歴情報とからなる管理情報が書き込ま
れるが、管理情報中、変数ｉに応じてその内容が変化す
る情報はスロット情報だけである。After setting the parameters, the disk array controller 13 tests whether or not the i-th disk device 14 _i can read and write normally (step S
102), if the data can be read and written normally (step S103; Y), the management information is written in the management information storage area 15 _i of the disk device 14 _i (step S10).
4) and proceeds to step S107. Note that step S10
In 4, the management information including the device identification information for identifying the disk array device, the slot information that is information according to the connected slot number, and the history information is written. The only information whose content changes accordingly is the slot information.

【００３９】また、ｉ番のディスク装置が正常に読み書
きが行なえない装置である場合（ステップＳ１０３；
Ｎ）には、カウンタＣ_FAILに“１”が加算され、識別情
報Ｎ_FA _ILの内容がｉに書き換えられる（ステップＳ１０
５）。そして、カウンタＣ_FAILが“１”以下である場合
（ステップＳ１０６；Ｙ）に限り、ステップＳ１０７に
進む。カウンタＣ_FAILが“２”以上である場合（ステッ
プＳ１０６；Ｎ）には、冗長度ゼロのディスクアレイ装
置としても動作させることができない旨を示すエラーメ
ッセージを出力（ステップＳ１０９）して、初期化処理
を終了する。If the i-th disk device is a device that cannot read and write normally (step S103;
In N), “1” is added to the counter C _FAIL, and the content of the identification information N _FA _IL is rewritten to i (step S10).
5). Then, only when the counter C _FAIL is equal to or less than “1” (step S106; Y), the process proceeds to step S107. When the counter C _FAIL is "2" or more (step S106; N), an error message indicating that the disk array device with zero redundancy cannot be operated is output (step S109), and initialization is performed. The process ends.

【００４０】ステップＳ１０７では、ｉに“１”が加算
され、次に、変数ｉが全ディスク装置数Ｎ_ALL以下であ
るか否かの判断（ステップＳ１０８）が行なわれる。変
数ｉが全ディスク装置数Ｎ_ALL以下である場合（Ｙ）
は、次のディスク装置に対して、テストおよび管理情報
の書き込みを行うために、ステップＳ１０２に戻る。In step S107, "1" is added to i, and then it is judged whether or not the variable i is less than or equal to the total number N _{ALL of} disk devices (step S108). When the variable i is less than or equal to the total number N _{ALL of} disk devices (Y)
Returns to step S102 in order to write the test and management information to the next disk device.

【００４１】変数ｉが全ディスク台数Ｎ_ALLを越えたと
き（ステップＳ１０８；Ｎ）、すなわち、全てのディス
ク装置に対するテストおよび管理情報の書き込みが完了
したときには、カウンタＣ_FAILが“０”であるか否かの
判定を行い（ステップＳ１１０）、“０”でない場合
（Ｎ）には、Ｎ_FAIL番のディスク装置をアクセス対象か
ら除外して（ステップＳ１１１）、初期化処理を終了す
る。なお、アクセス対象から除外するディスク装置の設
定は、ディスクアレイコントローラ１３内のメモリの所
定の記憶領域に、Ｎ_FAIL番のディスク装置が使用不可で
あることを示す情報を書き込むことにより行なわれる。Whether the counter C _FAIL is "0" when the variable i exceeds the total number N _ALL of disks (step S108; N), that is, when the test and the writing of the management information to all the disk devices are completed. Whether or not it is determined (step S110), and if it is not "0" (N), the disk device with N _FAIL number is excluded from the access target (step S111), and the initialization process is ended. The setting of the disk device excluded from the access target is performed by writing information indicating that the N _FAIL disk device cannot be used in a predetermined storage area of the memory in the disk array controller 13.

【００４２】このように第１の実施例のディスクアレイ
装置は、１台のディスク装置に障害があっても使用を開
始することができるように構成されているが、通常、全
てのディスク装置が正常に読み書きできる状態で、その
使用は開始される。As described above, the disk array device of the first embodiment is constructed so that the use can be started even if there is a failure in one disk device, but normally all disk devices are Its use is started in a state where it can be read and written normally.

【００４３】図３に、初期化処理において、全てのディ
スク装置が正常であった場合（Ｃ_FA _IL＝０）における各
ディスク装置の管理情報記憶領域の内容を模式的に示
す。図中、管理情報記憶領域１５の記憶領域１７内に表
記してある“ＵＩＤ”は、装置識別情報であり、記憶領
域１８と記憶領域１９に示してある数値は、それぞれ、
スロット情報と履歴情報であるとする。このように、そ
れぞれのディスク装置の管理情報記憶領域１５には、図
２に示した初期化処理により、スロット情報だけが異な
る管理情報が書き込まれることになる。FIG. 3 schematically shows the contents of the management information storage area of each disk device when all the disk devices are normal (C _FA _IL = 0) in the initialization processing. In the figure, "UID" written in the storage area 17 of the management information storage area 15 is device identification information, and the numerical values shown in the storage areas 18 and 19 are respectively
It is assumed that these are slot information and history information. As described above, the management information different in only the slot information is written in the management information storage area 15 of each disk device by the initialization processing shown in FIG.

【００４４】この管理情報は、各ディスク装置の状態判
定のために用いられる情報であり、第１の実施例のディ
スクアレイ装置では、通常動作中に、ディスク装置に障
害が発生すると、その障害ディスク装置を除くディスク
装置の管理情報記憶領域の内容が書き換えられる。この
管理情報更新処理は、以下に記す手順により実行され
る。This management information is information used to determine the status of each disk device. In the disk array device of the first embodiment, if a disk device fails during normal operation, the failed disk The contents of the management information storage area of the disk device other than the device are rewritten. This management information update processing is executed by the procedure described below.

【００４５】図４に、管理情報更新処理時のディスクア
レイコントローラの動作の流れを示す。ディスクアレイ
コントローラは、ディスク装置から障害が発生したこと
を示す情報を受けると、そのディスク装置の識別情報を
識別情報Ｎ_FAILにセット（ステップＳ２０１）する。そ
して、識別情報Ｎ_FAILが全ディスク台数Ｎ_ALLと等しい
場合（ステップＳ２０２；Ｙ）は、変数ｉに“１”をセ
ット（ステップＳ２０３）し、Ｎ_FAILが全ディスク装置
数Ｎ_ALLと等しくない場合（ステップＳ２０２；Ｎ）に
は、ｉに“Ｎ_FAIL＋１”をセット（ステップＳ２０４）
する。FIG. 4 shows the operation flow of the disk array controller during the management information updating process. When the disk array controller receives information indicating that a failure has occurred from the disk device, the disk array controller sets the identification information of the disk device in the identification information N _FAIL (step S201). When the identification information N _FAIL is equal to the total number of disks N _ALL (step S202; Y), "1" is set to the variable i (step S203), and N _FAIL is not equal to the total number of disk devices N _ALL. In step S202; N, i is set to "N _FAIL +1" (step S204)
To do.

【００４６】その後、ｉ番のディスク装置の管理情報中
の履歴情報が“１”だけ増加するように管理情報の書き
換えを行う（ステップＳ２０５）。このステップでは、
管理情報の読み出しと、読み出した管理情報中の履歴情
報の更新と、更新した管理情報の管理情報記憶領域への
書き込みが行われる。After that, the management information is rewritten so that the history information in the management information of the i-th disk device increases by "1" (step S205). In this step,
The management information is read, the history information in the read management information is updated, and the updated management information is written in the management information storage area.

【００４７】次に、ディスクアレイコントローラは、変
数ｉに“１”を加算（ステップＳ２０６）し、ｉがＮ
_ALLを越える場合（ステップＳ２０７；Ｙ）に限り、ｉ
を“１”に変更（ステップＳ２０８）する。そして、変
数ｉと識別情報Ｎ_FAILの比較を行い、ｉがＮ_FAILと一致
していない場合（ステップＳ２０９；Ｎ）には、次のデ
ィスク装置の管理情報を更新するために、ステップＳ２
０５に戻る。この一連の処理は、変数ｉと識別情報Ｎ
_FAILが一致したとき（ステップＳ２０９；Ｙ）、すなわ
ち、識別情報Ｎ_FAILで指定されるディスク装置を除くデ
ィスク装置の管理情報の更新が行われたときに、終了す
る。Next, the disk array controller adds "1" to the variable i (step S206), and i is N.
Only when _ALL is exceeded (step S207; Y), i
Is changed to "1" (step S208). Then, the variable i and the identification information N _FAIL are compared, and if i does not match N _FAIL (step S209; N), step S2 is performed to update the management information of the next disk device.
Return to 05. This series of processing is performed by the variable i and the identification information N.
_{When the FAILs} match (step S209; Y), that is, when the management information of the disk device other than the disk device designated by the identification information N _FAIL is updated, the process ends.

【００４８】なお、ステップＳ２０２ないしＳ２０４の
処理は、最初に管理情報の更新を行うディスク装置を決
定するための処理であり、これらの処理により、Ｎ_ALL
番（実施例のディスクアレイ装置では“５”）のディス
ク装置が障害ディスク装置であった場合には、１番のデ
ィスク装置が管理情報の更新対象とされ、それ以外の場
合には、障害ディスク装置が接続されたスロット番号よ
り、“１”だけ多いスロット番号を有するスロットに接
続されたディスク装置が最初の更新対象とされる。ステ
ップＳ２０６ないしＳ２０８の処理も、次に管理情報の
更新を行うディスク装置を同様の順序規則に従って、決
定するための処理である。[0048] The processing of steps S202 through S204 is a process for determining a disk apparatus of updating the first management information, by these processes, N _ALL
If the No. 1 disk device (“5” in the disk array device of the embodiment) is the failed disk device, the No. 1 disk device is the update target of the management information; otherwise, the failed disk device. The disk device connected to the slot having a slot number that is larger by "1" than the slot number to which the device is connected is the first update target. The processing of steps S206 to S208 is also processing for determining the disk device for which the management information is updated next, according to the same order rule.

【００４９】図５に、この管理情報更新処理により、各
ディスク装置の管理情報記憶領域の内容が更新されてい
くようすを模式的に示す。各ディスク装置の管理情報の
内容が（ａ）に示したような状態である場合に、たとえ
ば、☆印を付けたディスク装置（２番のディスク装置）
に障害が発生したときには、（ｂ）に示すように、最初
に、３番のディスク装置の管理情報中の履歴情報が書き
換えられ、その後、４番、５番、１番の順序で管理情報
が更新されていき、最終的に、（ｃ）に示したような状
態が形成される。FIG. 5 schematically shows that the contents of the management information storage area of each disk device are updated by this management information updating process. When the content of the management information of each disk device is in the state as shown in (a), for example, a disk device with a star mark (disk device No. 2)
When a failure occurs, the history information in the management information of the disk device No. 3 is first rewritten as shown in (b), and then the management information is updated in the order of No. 4, No. 5, No. 1. It is updated, and finally, the state shown in (c) is formed.

【００５０】この管理情報は、装置の立ち上げ時に読み
だされ、アクセス対象としてはならないディスク装置
や、復旧処理を開始してもよいディスク装置が存在する
かなどの判定に用いられる。この状態判定処理の動作手
順は複雑であるので、その詳細な説明を行う前に、各デ
ィスク装置から読み出される管理情報の内容と、その内
容により、ディスクアレイコントローラが選択すべき動
作内容との関係を説明することにする。This management information is read out when the apparatus is started up and is used for determining whether there is a disk apparatus that should not be accessed or a disk apparatus that may start the recovery process. Since the operation procedure of this state determination process is complicated, before giving a detailed description, the relationship between the content of the management information read from each disk device and the operation content that should be selected by the disk array controller depending on the content. Will be explained.

【００５１】図６および図７に、各ディスク装置から読
み出される場合が考えられる管理情報の内容を模式的に
示す。図６（ｃ）は、２番のディスク装置に障害が発生
し、管理情報の更新処理が完了している場合の各管理情
報の内容を示したものである。この場合、ディスクアレ
イコントローラは、２番のディスク装置をアクセス対象
から除外して、ディスクアレイ装置を冗長度ゼロの記憶
装置として動作させるべきである。なお、２番のディス
ク装置の管理情報記憶領域は、読み出しが可能な場合も
あり、また、不可能な場合もある。6 and 7 schematically show the contents of the management information which may be read from each disk device. FIG. 6C shows the contents of each piece of management information when a failure has occurred in the second disk device and the management information update processing has been completed. In this case, the disk array controller should exclude the second disk device from the access target and operate the disk array device as a storage device with zero redundancy. The management information storage area of the second disk device may or may not be readable.

【００５２】図６（ｄ）は、実線の矢印で示したよう
に、障害ディスク装置が正しく交換された場合の管理情
報の内容を示したものである。交換されたディスク装置
の管理情報記憶領域は、管理情報が書き込まれていない
ことを示すために空白としてある。この場合のディスク
アレイ装置の状態は、２番のディスク装置に対して復旧
処理を行い、全てのディスク装置を利用可能とした後
に、動作させるべき状態である。なお、交換されたディ
スク装置がフォーマット直後のものであれば、その管理
情報記憶領域に対応する記憶領域には、所定のビットパ
ターン（たとえば、１６進表記で“Ｅ５”のビットパタ
ーン）が書き込まれている。FIG. 6D shows the contents of the management information when the faulty disk device is correctly replaced as indicated by the solid arrow. The management information storage area of the replaced disk device is blank to indicate that the management information is not written. The state of the disk array device in this case is a state in which the recovery process is performed on the second disk device to make all the disk devices available and then to operate. If the exchanged disk device has just been formatted, a predetermined bit pattern (for example, a bit pattern of "E5" in hexadecimal notation) is written in the storage area corresponding to the management information storage area. ing.

【００５３】図６（ｅ）は、（ｃ）の状態に対して、破
線の矢印で示したように、誤ったディスク装置交換が行
われた場合の一例を示したものである。この場合、ディ
スクアレイコントローラは、ディスクアレイ装置の状態
を、動作不可能な状態であると判定すべきである。FIG. 6 (e) shows an example of the case of erroneous disk device replacement as indicated by the broken line arrow in the state of (c). In this case, the disk array controller should determine the status of the disk array device as inoperable.

【００５４】また、管理情報更新処理が中断されていた
場合には、ディスクアレイ装置の状態は同じでも、管理
情報の内容が、図６とは異なるものとなる。Further, when the management information updating process is interrupted, the contents of the management information are different from those in FIG. 6 even though the state of the disk array device is the same.

【００５５】図７に、管理情報の更新が中断された場合
の各ディスク装置の管理情報の内容を示す。ここでは、
１台のディスク装置の管理情報の更新が行われた段階
で、その処理が中断されたものとする。図７（ｂ）、
（ｆ）、（ｇ）における各ディスク装置の状態は、それ
ぞれ、図６（ｃ）、（ｄ）、（ｅ）における各ディスク
装置の状態と同じである。すなわち、（ｂ）は、２番の
ディスク装置をアクセス対象から除外して冗長度ゼロの
ディスクアレイ装置として動作させるべき状態であり、
（ｄ）は、２番のディスク装置に対して復旧処理を行っ
た後に、本来の冗長構成を用いたディスクアレイ装置と
して動作させるべき状態であり、また、（ｅ）は、ディ
スクアレイ装置を動作させてはならない状態であり、誤
った交換がなされたことをユーザーに通知すべき状態で
ある。FIG. 7 shows the contents of the management information of each disk device when the update of the management information is interrupted. here,
It is assumed that the processing is interrupted when the management information of one disk device is updated. 7 (b),
The states of the disk devices in (f) and (g) are the same as the states of the disk devices in FIGS. 6 (c), (d), and (e), respectively. That is, (b) is a state in which the second disk device should be excluded from access targets and operated as a disk array device with zero redundancy,
(D) shows a state in which the disk array device should be operated as the disk array device using the original redundant configuration after the recovery process is performed for the second disk device, and (e) shows the operation of the disk array device. This is a state that should not be allowed, and a state in which the user should be notified that an incorrect exchange has been made.

【００５６】第１の実施例のディスクアレイ装置では、
このような状態判定を以下のような手順により実現す
る。In the disk array device of the first embodiment,
Such state determination is realized by the following procedure.

【００５７】図８に、立ち上げ時にディスクアレイコン
トローラが最初に行なう動作の流れを示す。電源が投入
されると、ディスクアレイコントローラは、まず、各デ
ィスク装置の状態を判別するために用いるパラメータを
初期化（ステップＳ３０１）する。このステップで初期
化されるパラメータとしては、Ｃ_FAILとＨ_j（ｊ＝１〜
Ｎ_ALL）とＮ_INITとｉとＮ_UPとＮ_DOWNがあり、Ｃ_FAILと
Ｈ_jとＮ_UPとＮ_DOWNは、それぞれ“０”に、Ｎ_INITとｉ
は“１”に初期化される。なお、Ｈ_jは、主に、管理情
報中の履歴情報を記憶するために用いられるパラメータ
であり、Ｎ_INITは、異常であることが明らかなディスク
装置のスロット番号を記憶するためのパラメータであ
る。また、Ｎ_UPとＮ_DOWNは、図９を用いて後ほど説明す
る動作で用いられるパラメータである。FIG. 8 shows the flow of the first operation performed by the disk array controller at startup. When the power is turned on, the disk array controller first initializes the parameters used to determine the status of each disk device (step S301). The parameters initialized in this step are C _FAIL and H _j (j = 1 to 1).
N _ALL ), N _INIT , i, N _UP, and N _DOWN , and C _FAIL , H _j , N _UP, and N _DOWN are set to “0” and N _INIT and i, respectively.
Is initialized to "1". It should be noted that H _j is a parameter mainly used for storing the history information in the management information, and N _INIT is a parameter for storing the slot number of the disk device which is apparently abnormal. . Further, N _UP and N _DOWN are parameters used in the operation described later with reference to FIG.

【００５８】これらのパラメータの初期化後、ディスク
アレイコントローラは、ｉ番のディスク装置の管理情報
の読み出しを試み（ステップＳ３０２）、正常に読み出
しが行なえなかった場合（ステップＳ３０３；Ｎ）に
は、Ｈ_iに“−１”をセット（ステップＳ３０９）し、
カウンタＣ_FAILに“１”を加算するとともに、Ｎ_INITに
ｉをセット（ステップＳ３１０）してから、ステップＳ
３０７に進む。After the initialization of these parameters, the disk array controller attempts to read the management information of the i-th disk device (step S302), and if it cannot be read normally (step S303; N), Set "-1" to H _i (step S309),
The counter C _FAIL is incremented by “1” and N _INIT is set to i (step S310), and then step S310.
Proceed to 307.

【００５９】また、ｉ番のディスク装置の管理情報の読
み出しが正常に行なえた場合（ステップＳ３０３；Ｙ）
には、読み出した管理情報中の装置識別情報およびスロ
ット情報を、初期化時に書き込んだ装置識別情報および
スロット情報とそれぞれ比較（ステップＳ３０４）する
ことにより、そのディスク装置が正常に接続されたもの
かどうかの判断を行なう。そして、装置識別情報とスロ
ット情報が初期化時に書き込んだ装置識別情報とスロッ
ト情報とそれぞれ一致していた場合には、正常に接続さ
れたディスク装置であると判断（ステップＳ３０５；
Ｙ）して、その管理情報中の履歴情報を、Ｈ_iとして記
憶（ステップＳ３０６）し、ステップＳ３０７に進む。When the management information of the i-th disk device can be read normally (step S303; Y).
Is determined by comparing the device identification information and the slot information in the read management information with the device identification information and the slot information written at the initialization (step S304). Make a decision. Then, if the device identification information and the slot information match the device identification information and the slot information written at the time of initialization, it is determined that the disk device is normally connected (step S305;
Y), the history information in the management information is stored as H _i (step S306), and the process proceeds to step S307.

【００６０】なお、ステップＳ３０５の判断において、
“Ｙ”側に分岐されるディスク装置は、正常に機能する
ディスク装置とは限らない。すなわち、そのディスク装
置が障害ディスク装置であっても、その接続が正しく、
管理情報記憶領域の内容が読み出せる場合には、ステッ
プＳ３０５における判断は、“Ｙ”となる。In the judgment of step S305,
The disk device branched to the “Y” side is not always a normally functioning disk device. That is, even if the disk device is a failed disk device, the connection is correct,
If the contents of the management information storage area can be read, the determination in step S305 is "Y".

【００６１】そして、装置識別情報とスロット番号のい
ずれかが初期化処理時に書き込んだ内容と異なっていた
場合には、そのディスク装置は、正常に接続されたもの
ではないと判断（ステップＳ３０５；Ｎ）して、カウン
タＣ_FAILに“１”を加算するとともに、Ｎ_INITにｉをセ
ット（ステップＳ３１０）して、ステップＳ３０７に進
む。If either the device identification information or the slot number is different from the contents written during the initialization processing, it is determined that the disk device is not normally connected (step S305; N). ), "1" is added to the counter C _FAIL , i is set to N _INIT (step S310), and the process proceeds to step S307.

【００６２】ステップＳ３０７では、ｉに“１”が加算
され、ｉが全ディスク装置数Ｎ_ALL以下である場合（ス
テップＳ３０８；Ｎ）には、ステップＳ３０２に戻り、
次のディスク装置に対して処理が行なわれる。ｉが全デ
ィスク装置数Ｎ_ALLを越えたとき（ステップＳ３０８；
Ｙ）、すなわち、全てのディスク装置の管理情報記憶領
域の読み出しが試みられた後に、この処理は終了する。In step S307, "1" is added to i, and if i is equal to or less than the total number N _{ALL of} disk devices (step S308; N), the process returns to step S302.
Processing is performed for the next disk device. When i exceeds the number N _{ALL of} all disk devices (step S308;
Y), that is, after the reading of the management information storage areas of all the disk devices is attempted, this process ends.

【００６３】この処理が終了すると、管理情報の読み出
しが行なえないディスク装置があった場合には、その装
置に対応する変数Ｈが“−１”となり、装置識別情報ま
たはスロット情報のいずれかが初期化時に書き込んだ情
報と異なっているディスク装置があった場合には、その
装置に対応する変数Ｈの値は、“０”のままとなる。な
お、交換されたディスク装置に対応する変数Ｈの値も、
管理情報記憶領域に対応する記憶領域に、所定のパター
ンの管理情報が書き込まれていないため、ステップＳ３
０５で正常に接続されたものではないと判断され、
“０”のままとなる。When this processing ends, if there is a disk device from which the management information cannot be read, the variable H corresponding to the device becomes "-1", and either the device identification information or the slot information is initialized. If there is a disk device different from the information written at the time of writing, the value of the variable H corresponding to the device remains "0". The value of the variable H corresponding to the exchanged disk device is also
Since the management information of the predetermined pattern is not written in the storage area corresponding to the management information storage area, step S3
In 05, it was judged that it was not connected properly,
It remains “0”.

【００６４】そして、カウンタＣ_FAILには、Ｈの値が
“−１”または“０”であるディスク装置、すなわち、
正常な状態ではないことが確実なディスク装置の台数が
記憶され、変数Ｎ_INITには、そのうち１台のスロット番
号が記憶される。なお、カウンタＣ_FAILが“０”であっ
た場合には、変数Ｎ_INITの書き換え（ステップＳ３１
０）が実行されないため、Ｎ_INITの値は、ステップＳ３
０１で設定された値である“１”のままとなる。The counter C _FAIL has a disk device whose H value is "-1" or "0", that is,
The number of disk devices that are certainly not in a normal state is stored, and the variable N _INIT stores the slot number of one of them. If the counter C _FAIL is “0”, the variable N _INIT is rewritten (step S31).
0) is not executed, the value of N _INIT is
The value set in 01 remains "1".

【００６５】図９に、図８の流れに引き続いてディスク
アレイコントローラが実行する動作の流れを示す。ディ
スクアレイコントローラは、まず、図８の処理において
得られたカウンタＣ_FAILの値が“１”より大きいか否か
の判断を行なう（ステップＳ４０１）。このカウンタＣ
_FAILには、既に説明したように、正常な状態ではないこ
とが確実なディスク装置の台数がセットされているの
で、この値が“２”以上であるときには、ディスクアレ
イ装置を冗長度ゼロの記憶装置としても動作させること
ができないことが確定する。このため、カウンタＣ_FAIL
が、“１”より大きい場合（ステップＳ４０１；Ｙ）に
は、装置を立ち上げることができない旨を表わすエラー
メッセージを出力（ステップＳ４０２）して、処理を終
了する。FIG. 9 shows a flow of operations executed by the disk array controller subsequent to the flow of FIG. The disk array controller first determines whether or not the value of the counter C _FAIL obtained in the processing of FIG. 8 is larger than "1" (step S401). This counter C
_As described above, the number of disk devices that are surely not in a normal state is set in _FAIL , so when this value is "2" or more, the disk array device is stored with zero redundancy. It is decided that the device cannot be operated as well. Therefore, the counter C _FAIL
However, if it is larger than "1" (step S401; Y), an error message indicating that the apparatus cannot be started up is output (step S402) and the process ends.

【００６６】なお、図６および図７に例示した状態のう
ち、ステップＳ４０１の判断が行なわれた段階で、装置
を立ち上げることができないことが確定するのは、
（ｅ）と（ｇ）の状態において、☆印を付した障害ディ
スク装置の管理情報が読み出せなかった場合である。Of the states illustrated in FIGS. 6 and 7, it is determined that the apparatus cannot be started up at the stage when the determination in step S401 is made.
In the states (e) and (g), the management information of the faulty disk device marked with * cannot be read.

【００６７】カウンタＣ_FAILの値が“１”以下である場
合（ステップＳ４０１；Ｎ）には、変数ｉの内容をＮ
_INITに書き換え（ステップＳ４０３）、変数ｊに、その
値が“１”以上、Ｎ_ALL以下になるように、“ｉ＋１”
または“ｉ＋１−Ｎ_ALL”をセット（ステップＳ４０
４）する。そして、図８の処理により内容が設定された
Ｈ _jとＨ_iの大小関係の比較（ステップＳ４０５）を行
ない、Ｈ_jがＨ_iより大きく、Ｎ_UPが“０”である場合
（ステップＳ４０６；Ｙ）には、Ｎ_UPにｉをセット（ス
テップＳ４０７）し、Ｈ_jがＨ_iより大きく、Ｎ_UPが
“０”でない場合（ステップＳ４０６；Ｎ）には、Ｎ_UP
を変更することなく、ステップＳ４１０に進む。Counter C_FAILWhen the value of is less than "1"
(Step S401; N), the content of the variable i is set to N
_INITTo the variable j (step S403)
Value is "1" or more, N_ALL"I + 1" as follows
Or "i + 1-N_ALLSet "(step S40
4) Do. Then, the contents are set by the processing of FIG.
H _jAnd H_iComparison of the size relationship (step S405)
No, h_jIs H_iGreater than N_UPIs "0"
(Step S406; Y), N_UPSet i to
Step S407) and then H_jIs H_iGreater than N_UPBut
If it is not "0" (step S406; N), N_UP
Without changing, the process proceeds to step S410.

【００６８】また、Ｈ_jがＨ_iより小さく、Ｎ_DOWNが
“０”である場合（ステップＳ４０８；Ｙ）には、Ｎ
_DOWNにｉをセット（ステップＳ４０９）し、Ｈ_jがＨ_i
より大きく、Ｎ_DOWNが“０”でない場合（ステップＳ４
０８；Ｎ）には、Ｎ_DOWNを変更することなく、ステップ
Ｓ４１０に進む。また、ステップＳ４０５の比較におい
て、Ｈ_jとＨ_iが一致していた場合には、直接、ステッ
プＳ４１０に進む。If H _j is smaller than H _i and N _DOWN is "0" (step S408; Y), N
I is set to _DOWN (step S409), and H _j is H _i.
If it is larger and N _DOWN is not “0” (step S4)
08; N), the process proceeds to step S410 without changing N _DOWN . Further, in the comparison in step S405, if H _j and H _i match, the process directly proceeds to step S410.

【００６９】ステップＳ４０６またはＳ４０８において
行なっている分岐は、Ｈ_jとＨ_iが最初に異なったとき
の変数ｉの値を、Ｎ_UPまたはＮ_DOWNに記憶させるための
ものである。たとえば、図６（ｅ）の状態において障害
ディスク装置の管理情報が読み出せる場合には、図８の
処理によりＮ_INITに“３”が設定されるので、図９のス
テップＳ４０５では、最初に、Ｈ₄とＨ₃の比較が行な
われ、その後、Ｈ₅とＨ₄、Ｈ₁とＨ₅、Ｈ₂とＨ₁、
Ｈ₃とＨ₂の比較がこの順で行なわれることになる。The branching performed in step S406 or S408 is to store the value of the variable i when H _j and H _i first differ from each other in N _UP or N _DOWN . For example, when the management information of the failed disk device can be read in the state of FIG. _6E , N _INIT is set to “3” by the process of FIG. 8, so that in step S405 of FIG. A comparison of H ₄ and H ₃ is made, after which H ₅ and H ₄ , H ₁ and H ₅ , H ₂ and H ₁ ,
The comparison of H ₃ and H ₂ will be performed in this order.

【００７０】これらの５回の比較で、ステップＳ４０６
側に分岐されるのは、Ｈ₄とＨ₃の比較が行われたとき
だけであるが、ステップＳ４０８側に分岐される場合
は、Ｈ ₂とＨ₁の比較が行なわれたときと、Ｈ₃とＨ₂
の比較が行なわれたときの２回存在する。このような場
合に、ステップＳ４０８においてＮ_DOWNが“０”である
か否かの判断がなされているため、２回目の分岐では、
Ｎ_DOWNの書き換えが行なわれないことになり、結局、最
初に分岐された時の情報が、Ｎ_DOWNに記憶されることに
なる。After these five comparisons, step S406
H is branched to_FourAnd H₃When the comparison of
However, when branching to the step S408 side
Is H ₂And H₁When the comparison of₃And H₂
Exists twice when the comparison is made. Such a place
In step S408, N_DOWNIs “0”
Since it has been determined whether or not it is the second branch,
N_DOWNWill not be rewritten, and in the end,
The information at the time of the first branch is N_DOWNTo be remembered in
Become.

【００７１】ステップＳ４１０においては、全ての比較
が終了したか否かの判定が行われ、ｊがＮ_INITと一致し
ていない場合（ステップＳ４１０；Ｎ）には、比較すべ
き組み合わせが残っているので、ｉの内容をｊに書き換
えて（ステップＳ４１１）、ステップＳ４０４に戻る。
そして、変数ｊとＮ_INITが一致したとき（ステップＳ４
１０；Ｙ）に、この処理は終了し、以下に記す、最終的
な判定処理が実行される。In step S410, it is determined whether or not all comparisons have been completed, and if j does not match N _INIT (step S410; N), there are combinations to be compared. Therefore, the content of i is rewritten to j (step S411), and the process returns to step S404.
When the variable j and N _INIT match (step S4)
10; Y), this process ends, and the final determination process described below is executed.

【００７２】図１０に、図９の流れに引き続いて、ディ
スクアレイコントローラが行なう動作の流れを示す。デ
ィスクアレイコントローラは、まず、Ｎ_UPが“０”であ
るか否かの判定（ステップＳ５０１）を行ない、Ｎ_UPが
“０”である場合（Ｙ）は、全てのディスク装置が正常
に動作可能であると判断して、立ち上げ動作を終了す
る。この判定により、全ディスク装置が正常であるとす
ることができることは、Ｎ_UPが“０”となる場合は、全
てのディスク装置の履歴情報が一致している場合である
ことから容易に理解出来よう。FIG. 10 shows a flow of operations performed by the disk array controller subsequent to the flow of FIG. The disk array controller first determines whether N _UP is "0" (step S501). If N _UP is "0" (Y), all disk devices can operate normally. Then, the startup operation is ended. From this determination, it can be easily understood that all the disk devices can be regarded as normal, because when N _UP is “0”, the history information of all the disk devices is the same. See.

【００７３】また、Ｎ_UPが“０”でない場合（ステップ
Ｓ５０１；Ｎ）には、カウンタＣ_FA _ILが“０”であるか
否かの判定（ステップＳ５０２）を行い、“０”であっ
た場合（Ｙ）には、Ｎ_UP番のディスク装置をアクセス対
象から除外して（ステップＳ５０９）、後処理（ステッ
プＳ５１０）を行なった後に、立ち上げ処理を終了す
る。このステップＳ５０２の判断により状態が決定され
るのは、図７（ｂ）と図６（ｃ）において、障害ディス
ク装置の管理情報記憶領域の読み出しが行なえた場合で
ある。When N _UP is not "0" (step S501; N), it is determined whether the counter C _FA _IL is "0" (step S502) and it is "0". In the case (Y), the N _UP disk device is excluded from the access targets (step S509), post-processing (step S510) is performed, and then the startup processing ends. The state is determined by the determination in step S502 when the management information storage area of the failed disk device can be read in FIGS. 7B and 6C.

【００７４】なお、ステップＳ５１０で行なわれる後処
理は、管理情報更新処理が中断されていた場合に、その
管理情報更新処理を完了させるための処理であり、この
処理により、たとえば、図７（ｂ）の状態である各ディ
スク装置の管理情報は、図６（ｃ）の状態に修正され
る。その実際の動作は、図４に示した管理情報更新処理
を中断されたステップ（Ｎ_DOWNが中断直前に更新された
ディスク装置を示す情報となっている。）から再開する
といったものである。The post-processing performed in step S510 is a processing for completing the management information update processing when the management information update processing is interrupted, and by this processing, for example, FIG. The management information of each disk device in the state of () is corrected to the state of FIG. The actual operation is to restart the management information update process shown in FIG. 4 from the interrupted step (N _DOWN is the information indicating the disk device updated immediately before the interrupt).

【００７５】そして、カウンタＣ_FAILが“０”でなかっ
た場合（ステップＳ５０２；Ｎ）、Ｎ_UP番のＨであるＨ
_Nが“−１”であるか否かの判定（ステップＳ５０３）
を行い、“−１”であった場合（Ｙ）、すなわち、その
ディスク装置が管理情報の読み出しが行なえない装置で
あった場合は、Ｎ_UP番のディスク装置を障害ディスク装
置に特定（ステップＳ５０９）して、後処理（ステップ
Ｓ５１０）を行なった後に立ち上げ処理を終了する。こ
のステップＳ５０３の判断により状態が決定されるの
は、（ｂ）と（ｃ）の状態において、障害ディスク装置
の管理情報記憶領域の読み出しが不可能な場合である。When the counter C _FAIL is not "0" (step S502; N), H which is N _UP number H
Determination whether _N is "-1" (step S503)
If it is “−1” (Y), that is, if the disk device cannot read the management information, the N _UP number disk device is specified as the failed disk device (step S509). ), The post-processing (step S510) is performed, and then the startup processing is ended. The state is determined by the determination in step S503 when the reading of the management information storage area of the failed disk device is impossible in the states of (b) and (c).

【００７６】また、Ｈ_Nが“−１”でなかった場合（ス
テップＳ５０３；Ｎ）、変数ｊにその値が“１”以上、
Ｎ_ALL以下になるように、“Ｎ_DOWN＋１”または“Ｎ
_DOWN＋１−Ｎ_ALL”をセット（ステップＳ５０４）す
る。そして、Ｎ_UPとｊの比較（ステップＳ５０５）を行
い、Ｎ_UPとｊが一致していない場合（ステップＳ５０
５；Ｎ）には、やはり、Ｎ_UP番のディスク装置が障害デ
ィスク装置に特定される。このステップＳ５０５の判断
により状態が決定されるのは、（ｆ）の状態と（ｅ）の
状態において、障害ディスク装置の管理情報記憶領域の
読み出しが可能な場合である。When H _N is not "-1" (step S503; N), the value of variable j is "1" or more,
"N _DOWN +1" or "N" so that N _ALL or less
_DOWN + 1-N _ALL "is set (step S504). Then, N _UP and j are compared (step S505), and when N _UP and j do not match (step S50).
5; N), the disk device numbered N _UP is specified as the failed disk device. The state is determined by the determination in step S505 when the management information storage area of the failed disk device can be read in the states (f) and (e).

【００７７】また、Ｎ_UPとｊが一致していた場合（Ｙ）
には、Ｎ_DOWN番のディスク装置が正常に読み書き可能で
あるかのテスト（ステップＳ５０６）を行なう。そし
て、そのディスク装置が正常であった場合（ステップＳ
５０７；Ｙ）には、Ｎ_UP番のディスク装置を正しく交換
されたディスク装置であるとして、復旧処理を行い（ス
テップＳ５０８）、立ち上げ処理を終了する。このステ
ップＳ５０８では、当然、管理情報の内容の設定も行な
われる。また、Ｎ_DOWN番のディスク装置が正常に読み書
き可能でなかった場合（ステップＳ５０７；Ｎ）には、
Ｎ_UP番のディスク装置を障害ディスク装置に特定（ステ
ップＳ５０９）して、後処理（ステップＳ５１０）を行
なった後に立ち上げ処理を終了する。When N _UP and j match (Y)
For this, a test is performed as to whether or not the N _DOWN disk device can normally read and write (step S506). If the disk device is normal (step S
In 507; Y), it is determined that the N _UP disk device is a disk device that has been correctly replaced, and recovery processing is performed (step S508), and the startup processing ends. Of course, in this step S508, the contents of the management information are also set. If the N _DOWN disk device cannot normally read / write (step S507; N),
The N _UP disk device is specified as the failed disk device (step S509), post-processing (step S510) is performed, and then the start-up process is terminated.

【００７８】このステップＳ５０７の判断まで行なわれ
るのは、（ｄ）と（ｇ）の状態である。これらの状態
は、履歴情報の内容が異なるだけで、その相対的な位置
関係は等しいものとなっており、ステップＳ５０５まで
の処理により得られる情報では、判別不可能である。こ
の判別のために行っている処理がステップＳ５０６およ
びＳ５０７の処理であり、誤った交換がなされた（ｇ）
の状態では、Ｎ_DOWN番のディスク装置が障害ディスク装
置に該当するため、正常に読み書きができないものであ
るのに対し、正しい交換が行なわれた（ｄ）の状態で
は、Ｎ_DOWN番のディスク装置は、正常に読み書きできる
ことを利用して両者の判別を行っている。It is in the states of (d) and (g) that the determination up to step S507 is performed. These states differ only in the contents of the history information, but have the same relative positional relationship, and cannot be discriminated from the information obtained by the processing up to step S505. The processing performed for this determination is the processing of steps S506 and S507, and an incorrect exchange was made (g).
In the state of N, the N _DOWN disk device corresponds to the faulty disk device, so normal reading and writing cannot be performed, whereas in the state of (d), the N _DOWN disk device is properly replaced. Uses the fact that it can read and write normally to distinguish between them.

【００７９】なお、このディスクアレイ装置では、装置
識別情報とスロット情報と履歴情報とからなる管理情報
を使用しているが、スロット情報と履歴情報からなる管
理情報、または、履歴情報だけからなる管理情報を用い
るようにしてもよい。スロット情報と履歴情報からなる
管理情報を用いる場合には、図８のステップＳ３０４に
おいて管理情報中のスロット情報だけが比較されるよう
に装置を構成すればよい。Although this disk array device uses management information consisting of device identification information, slot information, and history information, management information consisting of slot information and history information, or management consisting of history information only. Information may be used. When using the management information including the slot information and the history information, the device may be configured so that only the slot information in the management information is compared in step S304 of FIG.

【００８０】また、履歴情報だけを用いる場合には、履
歴情報として用いる数値の範囲を、たとえば、１から１
００までに限定しておき、図８のステップＳ３０４にお
いて、読み出した履歴情報が、その範囲内であるか否か
を評価し、範囲外であった場合には、ステップＳ３０５
において、“Ｎ”側に分岐するように構成すればよい。
障害ディスク装置の交換には、フォーマットされた状
態、すなわち、データの書き込みが行われていない状態
のディスク装置が用いられるので、管理情報記憶領域に
対応する記憶領域の内容を数値として読みだした場合、
上記のような範囲内の数値になることはない。このた
め、このような判定によっても、交換されたディスク装
置を特定することはでき、他のディスク装置に記憶され
た管理情報（履歴情報）と比較することにより、やは
り、その交換が正しく行われたものであるか否かを判定
することができる。また、交換に用いる前に、そのディ
スク装置の管理情報記憶領域に特定の情報を書き込むこ
とにしておき、立ち上げ時に、その特定の情報の有無を
検出して、そのディスク装置が交換されたものか否かを
検出するようにしてもよい。When only history information is used, the range of numerical values used as history information is, for example, 1 to 1.
It is limited to 00, and in step S304 of FIG. 8, it is evaluated whether or not the read history information is within the range, and if it is out of the range, step S305.
In the above, it may be configured to branch to the “N” side.
When replacing a failed disk unit, a disk unit in a formatted state, that is, in a state in which no data is written, is used, so if the contents of the storage area corresponding to the management information storage area are read out as numerical values ,
It does not fall within the above range. Therefore, even by such a judgment, the replaced disk device can be identified, and by comparing with the management information (history information) stored in another disk device, the replacement is correctly performed. It is possible to determine whether or not it is an item. Also, before use for replacement, specific information is written in the management information storage area of the disk device, and the presence or absence of the specific information is detected at startup, and the disk device is replaced. You may make it detect whether or not.

【００８１】第２の実施例 Second embodiment

【００８２】第１の実施例のディスクアレイ装置は、管
理情報中の履歴情報の大小関係だけを用いて各ディスク
装置の状態判定を行なうが、第２の実施例のディスクア
レイ装置では、その内容に意味を持たせた２種の履歴情
報を用いて、ディスクアレイ装置の状態判定を行う。The disk array device of the first embodiment determines the status of each disk device using only the magnitude relation of the history information in the management information, but the disk array device of the second embodiment does The state of the disk array device is determined by using the two types of history information that have meaning.

【００８３】第２の実施例のディスクアレイ装置の構成
は、図１に示したものと同様のものであるので、その構
成の説明は省略する。また、その初期化時の動作も、図
２を用いて説明した第１の実施例のディスク装置の動作
とほぼ同じものである。ただし、第２の実施例のディス
クアレイ装置で管理情報中の履歴情報として用いる情報
は、２種類しかなく、一方は、初期化時に書き込まれる
情報であり、他方は、ディスク装置に障害が発生した際
に、書き込まれる情報である。これらの履歴情報は、数
値としてではなく、ビット列として、判定に使用され
る。The configuration of the disk array device of the second embodiment is the same as that shown in FIG. 1, so the description of the configuration will be omitted. Also, the operation at the time of initialization is almost the same as the operation of the disk device of the first embodiment described with reference to FIG. However, there are only two types of information used as history information in the management information in the disk array device of the second embodiment, one is information written at initialization, and the other is a failure in the disk device. This is the information written at that time. These pieces of history information are used for determination as bit strings, not as numerical values.

【００８４】図１１に、第２の実施例のディスクアレイ
装置における管理情報更新処理の流れを示す。ディスク
アレイコントローラは、ディスク装置から障害が発生し
たことを示す情報を受けると、そのディスク装置の識別
情報を識別情報Ｎ_FAILにセット（ステップＳ６０１）す
る。そして、Ｎ_FAILが全ディスク台数Ｎ_ALLと等しい場
合（ステップＳ６０２；Ｙ）は、変数ｉに“１”をセッ
ト（ステップＳ６０３）し、Ｎ_FAILが全ディスク装置数
Ｎ_ALLと等しくない場合（ステップＳ６０２；Ｎ）に
は、ｉに“Ｎ_FAIL＋１”をセット（ステップＳ６０４）
する。FIG. 11 shows the flow of management information update processing in the disk array system of the second embodiment. Upon receiving the information indicating that a failure has occurred from the disk device, the disk array controller sets the identification information of the disk device in the identification information N _FAIL (step S601). When N _FAIL is equal to the total number of disks N _ALL (step S602; Y), "1" is set to the variable i (step S603), and when N _FAIL is not equal to the total number of disk devices N _ALL (step S603). In S602; N), i is set to "N _FAIL +1" (step S604).
To do.

【００８５】そして、ｉ番のディスク装置の管理情報中
の履歴情報を第２の履歴情報に書き換えて（ステップＳ
６０５）、処理を終了する。このように、第２の実施例
のディスクアレイ装置では、障害ディスク装置の次に装
置の管理情報の内容だけが書き換えられる。Then, the history information in the management information of the i-th disk device is rewritten to the second history information (step S
605), the process ends. As described above, in the disk array device of the second embodiment, only the contents of the management information of the device next to the failed disk device are rewritten.

【００８６】第２の実施例のディスクアレイ装置では、
立ち上げ時に、まず、図８に示した処理と同じ処理が行
なわれ、ｉ番目のディスク装置の管理情報が読み出せ、
かつ、管理情報中の装置識別情報とスロット情報が初期
化時に書き込んだ情報と一致していた場合には、Ｈ_iに
その履歴情報の内容が記憶される。また、ｉ番のディス
ク装置の管理情報の内容が読み出せない場合には、Ｈ_i
に“−１”が設定され、管理情報中の装置識別情報とス
ロット情報が初期化時に書き込んだ情報と一致していな
い場合には、Ｈ_iに“０”が記憶される。カウンタＣ
_FAILには、Ｈが“−１”または“０”であるディスク装
置の台数が記憶される。そして、第２の実施例のディス
クアレイ装置は、このようにして得られた情報を用い
て、各ディスク装置の状態の判定を行なう。In the disk array device of the second embodiment,
At start-up, first, the same processing as that shown in FIG. 8 is performed to read the management information of the i-th disk device,
Further, when the device identification information and the slot information in the management information match the information written at the time of initialization, the content of the history information is stored in H _i . If the management information of the i-th disk device cannot be read, H _i
Is set to "-1" and the device identification information and slot information in the management information do not match the information written at the initialization, "0" is stored in H _i . Counter C
The number of disk devices whose H is "-1" or "0" is stored in _FAIL . Then, the disk array device of the second embodiment uses the information thus obtained to judge the state of each disk device.

【００８７】図１２に、第２の実施例のディスクアレイ
コントローラが行う判定動作の流れを示す。図８の処理
を終えたディスクアレイコントローラは、まず、カウン
タＣ _FAILが“１”より大きいか否かの判定を行い（ステ
ップＳ７０１）、“１”より大きい場合（Ｙ）には、エ
ラーメッセージを出力（ステップＳ７１１）して、立ち
上げ動作を終了する。FIG. 12 shows the disk array of the second embodiment.
The flow of the determination operation performed by the controller is shown. Processing of FIG.
The disk array controller that has finished
Ta C _FAILIs determined to be greater than “1” (step
S701), if it is larger than “1” (Y),
Error message (step S711) and stand
The lifting operation ends.

【００８８】カウンタＣ_FAILが“１”より小さい場合
（ステップＳ７０１；Ｎ）には、Ｈ_X＝Ｈ_FAILとなるＸ
を探索（ステップＳ７０２）する。ここで、Ｈ_FAILは、
管理情報更新処理時に書き込まれる第２の履歴情報を示
すこととする。そして、そのようなＨ_Xが存在していた
場合（ステップＳ７０３；Ｙ）には、そのディスク装置
の前に順序付けられたディスク装置を識別するための情
報ｉを算出する（ステップＳ７０４）。また、Ｈ_X＝Ｈ
_FAILとなるＨ_Xが存在していなかった場合（ステップＳ
７０３；Ｎ）には、ｉに、“０”をセット（ステップＳ
７０５）する。次に、ディスクアレイコントローラは、
Ｈ_j＝０となるｊの探索を行う（ステップＳ７０６）。
このステップでは、Ｈ_j＝０となるｊが存在していない
場合には、ｊに“０”がセットされる。When the counter C _FAIL is smaller than "1" (step S701; N), _X where H _X = H _FAIL.
Is searched (step S702). Where H _FAIL is
The second history information written during the management information update processing is shown. If such H _X exists (step S703; Y), the information i for identifying the disk device ordered before the disk device is calculated (step S704). Also, H _X = H
If there is no H _X that becomes _FAIL (step S
703; N), i is set to "0" (step S
705). Next, the disk array controller
A search for j for H _j = 0 is performed (step S706).
In this step, if _j for which H _j = 0 does not exist, “0” is set to j.

【００８９】変数ｊが“０”である場合（ステップＳ７
０７；Ｙ）、すなわち、管理情報中の装置識別情報ある
いはスロット情報が初期化時に書き込んだ情報と異なっ
ていたディスク装置が存在していなかった場合には、ｉ
が“０”であるか否かの判断（ステップＳ７０８）を行
う。変数ｉが“０”であること（Ｙ）は、第２の管理情
報が書き込まれたディスク装置が存在していないことを
示すので、ディスクアレイコントローラは、全てのディ
スク装置が正常であると判断して、立ち上げ処理を終了
する。When the variable j is "0" (step S7)
07; Y), that is, if there is no disk device whose device identification information or slot information in the management information is different from the information written at initialization, i
It is determined whether or not is "0" (step S708). The fact that the variable i is “0” (Y) indicates that there is no disk device in which the second management information has been written, so the disk array controller determines that all disk devices are normal. Then, the start-up process ends.

【００９０】また、変数ｉが“０”でない場合（ステッ
プＳ７０８；Ｎ）には、ｉ番のディスク装置をアクセス
対象から除外（ステップＳ７０９）することにより、デ
ィスクアレイ装置を冗長度ゼロの記憶装置として動作さ
せる。If the variable i is not "0" (step S708; N), the i-th disk device is excluded from the access targets (step S709), so that the disk array device is a storage device with zero redundancy. To operate as.

【００９１】変数ｊが“０”でない場合（ステップＳ７
０７；Ｎ）には、ｊ番のディスク装置が管理情報の読み
出しが行えない装置であるか、または、交換されたディ
スク装置であるため、ｊとｉを比較（ステップＳ７１
０）することにより、ｊ番のディスク装置が交換された
ディスク装置であるか否かの判断を行う。ｊとｉの内容
が一致していない場合（Ｎ）には、第２の管理情報の存
在により障害ディスク装置であるとされているｊ番の装
置の他に、使用することができないディスク装置が存在
することになるので、ディスクアレイ装置を動作させる
ことができない旨を出力（ステップＳ７１１）して、立
ち上げ処理を終了する。When the variable j is not "0" (step S7)
07; N), the jth disk device is a device that cannot read the management information or the disk device has been replaced, so j and i are compared (step S71).
By performing 0), it is determined whether or not the j-th disk device is the replaced disk device. If the contents of j and i do not match (N), there is a disk device that cannot be used in addition to the jth device which is considered to be the faulty disk device due to the presence of the second management information. Since it exists, the fact that the disk array device cannot be operated is output (step S711), and the startup process is terminated.

【００９２】ｊとｉの内容が一致している場合（ステッ
プＳ７１０；Ｙ）は、正しい交換が行われた場合である
ので、ｉ（＝ｊ）番のディスク装置に対して復旧処理を
行い（ステップＳ７１２）、ディスクアレイ装置を全て
のディスク装置が使用可能な状態に戻して、立ち上げ処
理を終了する。なお、ステップＳ７１２では、復旧処理
の完了後、全てのディスク装置が正常であることを、デ
ィスク媒体上に記憶させるために、Ｘ番のディスク装置
（すなわち、第２の履歴情報が書き込まれていたディス
ク装置）の管理情報中の履歴情報を、第１の履歴情報に
戻す作業も行われる。If the contents of j and i match (step S710; Y), it means that the correct exchange has been performed, and therefore the recovery process is performed for the i (= j) th disk device ( In step S712), the disk array device is returned to a state in which all the disk devices can be used, and the startup process ends. In step S712, the Xth disk device (that is, the second history information is written in order to store on the disk medium that all the disk devices are normal after the completion of the recovery process. The operation of returning the history information in the management information of the disk device) to the first history information is also performed.

【００９３】ここで、この判定動作により正確にディス
クアレイ装置の状態が判定できることを、図を参照して
説明しておく。Here, the fact that the state of the disk array device can be accurately determined by this determination operation will be described with reference to the drawings.

【００９４】図１３に、第２の実施例のディスクアレイ
コントローラが判断すべき、ディスクアレイ装置の状態
と履歴情報の内容の関係、および、上記の処理を各ディ
スクアレイ装置に加えた場合に得られるｉとｊの値を示
す。この図では、管理情報中の履歴情報だけを示してあ
り、“good”が第１の履歴情報を、“fail”が第２の履
歴情報を示すものとする。また、☆印を付けた履歴情報
が、障害の発生したディスク装置の履歴情報であり、そ
の管理情報記憶領域の読み出しは可能であるとする。FIG. 13 shows the relationship between the state of the disk array device and the contents of the history information, which should be judged by the disk array controller of the second embodiment, and is obtained when the above processing is applied to each disk array device. The values of i and j are shown. In this figure, only the history information in the management information is shown, where “good” indicates the first history information and “fail” indicates the second history information. Further, it is assumed that the history information marked with a star is the history information of the disk device in which the failure has occurred, and the management information storage area can be read.

【００９５】図１３（Ａ）に示したように、すべてのデ
ィスク装置の履歴情報が第１の履歴情報“good”である
場合には、カウンタＣ_FAILは“０”になり、第２の履歴
情報が書き込まれたディスク装置を探索できないため、
ｉは“０”になり、装置識別情報またはスロット情報が
初期化時に書き込んだ値と異なっているディスク装置も
存在しないので、ｊも“０”となる。このため、ステッ
プＳ７０７およびＳ７０８ではいずれも“Ｙ”側に分岐
され、全てのディスク装置が正常であると判断される。As shown in FIG. 13A, when the history information of all the disk devices is the first history information "good", the counter C _FAIL becomes "0" and the second history information is obtained. Since the disk device where the information was written cannot be searched,
i becomes "0", and since there is no disk device whose device identification information or slot information is different from the value written at the time of initialization, j also becomes "0". Therefore, in steps S707 and S708, both branches to the "Y" side, and it is determined that all the disk devices are normal.

【００９６】また、（Ｂ）に示したように、図１１に示
した流れに従って、３番のディスク装置の履歴情報が第
２の履歴情報“fail”に書き換えられている場合、すな
わち、２番のディスク装置に障害が発生していた場合に
は、Ｈ_X＝Ｈ_FAILとなるＨ_Xが存在しているので、ステ
ップＳ７０４において、ｉに“２”が設定される。ま
た、装置識別情報またはスロット情報が初期化時に書き
込んだ値と異なっているディスク装置は存在しないの
で、ｊは“０”となる。結局、この場合は、ステップＳ
７０８で“Ｎ”側に分岐されることになり、２番のディ
スク装置がアクセス対象から除外される。Further, as shown in (B), when the history information of the disk device No. 3 is rewritten to the second history information "fail" according to the flow shown in FIG. If a disk device failure has occurred in, since the H _{_X} = H _FAIL H _X is present, in step S704, "2" is set to i. Further, since there is no disk device in which the device identification information or the slot information is different from the value written at the time of initialization, j becomes "0". After all, in this case, step S
At 708, it is branched to the “N” side, and the second disk device is excluded from the access target.

【００９７】さらに、（Ｂ）の状態からディスク装置の
交換が、１台のディスク装置を取り外して、新しいディ
スク装置を取り付けるという形で行われた場合には、
（Ｃ）のように、正しい交換がなされる場合と、
（Ｄ）、（Ｅ）のように誤った交換がなされる場合が考
えられる。（Ｃ）、（Ｅ）の場合には、当然、（Ｂ）と
同様に、ｉに“２”が設定されるが、交換されたディス
ク装置が存在しているため、ｊは“０”とはならず、
（Ｃ）では、ｊが“２”に、（Ｅ）では、“４”に設定
される。このため、両者は、ステップＳ７１０で判別さ
れ、（Ｃ）は、正しくディスク装置の交換が行われてい
ると判断され、復旧処理が開始され、（Ｅ）は、誤った
交換がなされていると判断されて、エラーメッセージが
出力される。Further, when the disk device is replaced from the state of (B) by removing one disk device and installing a new disk device,
As in (C), when the correct exchange is made,
It is conceivable that the wrong replacement is performed as in (D) and (E). In the cases of (C) and (E), i is set to "2" as in the case of (B), but since the replaced disk device exists, j is set to "0". Not
In (C), j is set to "2", and in (E), it is set to "4". Therefore, both are determined in step S710, (C) is determined that the disk device is correctly exchanged, recovery processing is started, and (E) is erroneous exchange. It is judged and an error message is output.

【００９８】（Ｄ）の状態は、第２の履歴情報が書き込
まれたディスク装置が誤って交換されてしまったという
点で特徴がある状態なのだが、この場合は、ｉが“０”
となり、ｊが“３”となるため、ステップＳ７１０で
“Ｎ”側に分岐されることになり、誤った交換がなされ
ていると判断されて、エラーメッセージが出力される。
なお、☆印を付けた管理情報記憶領域の内容が読みだせ
ない場合には、（Ｄ）、（Ｅ）は、ステップＳ７０１に
おいて、動作不可能と判定される。The state (D) is characteristic in that the disk device in which the second history information has been written is mistakenly replaced, but in this case, i is "0".
Then, since j becomes "3", it is branched to the "N" side in step S710, it is judged that an incorrect exchange is made, and an error message is output.
If the contents of the management information storage area marked with a star cannot be read, (D) and (E) are determined to be inoperable in step S701.

【００９９】第２の実施例のディスクアレイ装置におい
ても、管理情報をスロット情報と履歴情報、または、履
歴情報だけで構成することは可能であり、いずれの管理
情報を用いても、１台のディスク装置を取り外して新し
いディスク装置を取り付けるといった形の交換に対して
は、常に、正確にその状態を判定することができる。し
かし、履歴情報だけでは、たとえば、４番のスロット
に、５番のスロットに接続されるべきディスク装置が接
続された場合などを、識別することができないので、デ
ィスク装置の交換に全部のディスク装置を外すことが必
要なディスクアレイ装置では、管理情報中に少なくとも
スロット情報を含めておくことが望ましい。Also in the disk array device of the second embodiment, the management information can be composed of slot information and history information, or only history information. For replacement such as removing the disk device and installing a new disk device, the state can always be accurately determined. However, the history information alone cannot identify the case where the disk device that should be connected to the slot No. 5 is connected to the slot No. 4, for example. In a disk array device that needs to be removed, it is desirable to include at least slot information in the management information.

【０１００】[0100]

【発明の効果】以上詳細に説明したように、請求項１記
載の発明によれば、立ち上げ時に、ディスク装置内に書
き込んだ情報を基に、障害ディスク装置の特定を行なう
ので、障害ディスク装置に関する情報が、システムが停
止されても失われることがない。As described above in detail, according to the first aspect of the present invention, the failed disk device is specified based on the information written in the disk device at the time of start-up. Information is not lost when the system is stopped.

【０１０１】請求項２記載の発明によれば、ディスク装
置内に書き込んだ情報を基に、障害ディスク装置の特定
と交換されたディスク装置が存在するか否か、また、存
在する場合には、その交換が正しいものであるか否かの
判定を行なえるので、人為的ミスによるデータ喪失を防
止することができる。According to the second aspect of the invention, based on the information written in the disk device, it is determined whether or not there is a disk device that has been replaced with the identification of the failed disk device, and if there is, Since it can be judged whether the exchange is correct or not, it is possible to prevent data loss due to human error.

【０１０２】請求項３記載の発明によれば、立ち上げ時
に、ディスク装置内に書き込んだ情報を基に、障害ディ
スク装置の特定を行なうので、障害ディスク装置に関す
る情報が、システムが停止されても失われることがな
い。According to the third aspect of the present invention, since the failed disk device is specified based on the information written in the disk device at the time of start-up, the information regarding the failed disk device can be obtained even if the system is stopped. Never lost.

【０１０３】請求項４記載の発明によれば、ディスク装
置に２種の管理情報を書き込むことにより、障害ディス
ク装置の特定と交換されたディスク装置が存在するか否
か、また、存在する場合には、その交換が正しいもので
あるか否かの判定を行なえるので、人為的ミスによるデ
ータ喪失を防止することができる。According to the fourth aspect of the present invention, by writing the two types of management information in the disk device, it is determined whether or not there is a disk device that has been exchanged with the identification of the faulty disk device. Can determine whether the exchange is correct or not, so that data loss due to human error can be prevented.

[Brief description of drawings]

【図１】本発明の第１の実施例のディスクアレイ装置
の概要を示す構成図である。FIG. 1 is a configuration diagram showing an outline of a disk array device according to a first embodiment of the present invention.

【図２】第１の実施例のディスクアレイ装置の初期化
動作の流れを示す流れ図である。FIG. 2 is a flowchart showing a flow of an initialization operation of the disk array system of the first embodiment.

【図３】第１の実施例のディスクアレイ装置におい
て、初期化時に各ディスク装置に記憶される管理情報の
概要を示す説明図である。FIG. 3 is an explanatory diagram showing an outline of management information stored in each disk device at initialization in the disk array device of the first embodiment.

【図４】第１の実施例のディスクアレイ装置におい
て、ディスク装置に障害が発生した際に行なわれる管理
情報更新処理の流れを示す流れ図である。FIG. 4 is a flowchart showing a flow of management information update processing performed when a failure occurs in a disk device in the disk array device of the first embodiment.

【図５】第１の実施例のディスクアレイ装置における
管理情報更新処理時に、各ディスク装置に記憶された管
理情報が更新されていく様子を示す説明図である。FIG. 5 is an explanatory diagram showing how the management information stored in each disk device is updated during the management information updating process in the disk array device of the first embodiment.

【図６】第１の実施例のディスクアレイ装置におい
て、システム停止時にディスク装置が交換されることに
より生ずる管理情報の変化の様子を示すための説明図で
ある。FIG. 6 is an explanatory diagram showing how the management information changes in the disk array device of the first embodiment when the disk device is replaced when the system is stopped.

【図７】第１の実施例のディスクアレイ装置におい
て、管理情報更新処理が中断された後に、システム停止
時にディスク装置が交換されることにより生ずる管理情
報の変化の様子を示すための説明図である。FIG. 7 is an explanatory diagram showing how the management information changes in the disk array device of the first embodiment after the management information update process is interrupted and the disk device is replaced when the system is stopped. is there.

【図８】第１の実施例のディスクアレイ装置の立ち上
げ時の動作の流れを示す流れ図である。FIG. 8 is a flowchart showing an operation flow at the time of startup of the disk array system of the first embodiment.

【図９】第１の実施例のディスクアレイ装置が、図８
の流れに引き続いて実行する動作の流れを示す流れ図で
ある。9 is a diagram showing the disk array device of the first embodiment as shown in FIG.
6 is a flowchart showing a flow of operations performed subsequently to the flow of FIG.

【図１０】第１の実施例のディスクアレイ装置が、図
９の流れに引き続いて実行する動作の流れを示す流れ図
である。FIG. 10 is a flow chart showing a flow of operations performed subsequently to the flow of FIG. 9 by the disk array device of the first embodiment.

【図１１】第２の実施例のディスクアレイ装置が、デ
ィスク装置に障害が発生した際に行なう管理情報更新処
理の流れを示す流れ図である。FIG. 11 is a flowchart showing a flow of management information update processing performed by the disk array device of the second embodiment when a failure occurs in the disk device.

【図１２】第２の実施例のディスクアレイ装置が、立
ち上げ時に行なう動作の流れを示す流れ図である。FIG. 12 is a flowchart showing the flow of operations performed at startup by the disk array device of the second embodiment.

【図１３】第２の実施例のディスクアレイ装置が立ち
上げ時の行う判定動作を説明するための説明図である。FIG. 13 is an explanatory diagram illustrating a determination operation performed by the disk array device according to the second embodiment at startup.

【図１４】従来のディスクアレイ装置の概要を示す構
成図である。FIG. 14 is a configuration diagram showing an outline of a conventional disk array device.

【図１５】ディスクアレイ装置におけるデータの書き
込み動作の概要を示すための説明図である。FIG. 15 is an explanatory diagram showing an outline of a data write operation in the disk array device.

【図１６】従来の不揮発性メモリを用いた２重化ディ
スク装置の概要を示す構成図である。FIG. 16 is a configuration diagram showing an outline of a conventional dual disk device using a nonvolatile memory.

【図１７】従来のヒストリカウンタを用いた２重化デ
ィスク装置の概要を示す構成図である。FIG. 17 is a configuration diagram showing an outline of a conventional dual disk device using a history counter.

[Explanation of symbols]

１１…ディスクアレイ装置、１２…インタフェース、１
３…ディスクアレイコントローラ、１４、２２…ディス
ク装置、１５…管理情報記録領域、１６…記録領域、１
７…装置識別情報記憶領域、１８…スロット情報記憶領
域、１９…履歴情報記憶領域、２０…書き込み要求デー
タ、２１…ホストコンピュータ、２３…制御部、２４…
不揮発性記憶部、２５…オペレーティングシステム、２
６…チャネル装置、２７…中央処理装置、２８…主メモ
リ、２９…ヒストリカウンタ部、３０…ヒストリカウン
タ検証部、３１…制御テーブル11 ... Disk array device, 12 ... Interface, 1
3 ... Disk array controller, 14, 22 ... Disk device, 15 ... Management information recording area, 16 ... Recording area, 1
7 ... Device identification information storage area, 18 ... Slot information storage area, 19 ... History information storage area, 20 ... Write request data, 21 ... Host computer, 23 ... Control section, 24 ...
Non-volatile storage unit, 25 ... Operating system, 2
6 ... Channel device, 27 ... Central processing unit, 28 ... Main memory, 29 ... History counter unit, 30 ... History counter verification unit, 31 ... Control table

Claims

[Claims]

1. A plurality of disk devices for storing data, setting means for setting the order in which all of these disk devices are circulated, and a predetermined storage area for each of the plurality of disk devices. An initialization unit that writes numerical information of the same content as history information used to determine the state of the disk device, a detection unit that detects that an access failure has occurred in the disk device, and an access failure is detected by this detection unit. Designating means for designating the disk devices other than the disk device one by one in the order set by the setting means, and the history information stored in the predetermined storage area of the disk device designated by the designating means as its value. And a rewriting unit that rewrites the data so that it increases by a predetermined amount, and the data is stored in the predetermined storage area of the plurality of disk devices at startup. Read means for reading the history information, and search means for searching for a disk device having a larger history information of the disk device later ordered by the setting means, based on the history information read by the reading means, A disc array device comprising: a discriminating unit that discriminates the disc unit searched by the unit as a disc unit having an access failure.

2. A plurality of disk devices for storing data, setting means for setting the order in which all of these disk devices are circulated, and a predetermined storage area for each of the plurality of disk devices. Initializing means for writing the slot number to which the disk device is connected and history information which is numerical information as management information used for judging the state of the disk device, and detecting means for detecting that an access failure has occurred in the disk device. Specifying means for specifying the disk devices one by one in the order set by the setting means, excluding the disk device in which the access failure is detected by the detecting means, and the predetermined operation for the disk device specified by the specifying means. Rewriting means for rewriting the history information stored in the storage area of the plurality of units so that the value increases by a predetermined amount; Read-out means for reading out the management information stored in the predetermined storage area of each of the disk devices, and the slot number in the management information read out by the read-out means coincides with the one written by the initialization means. And the history information read from the disk device determined by the determination means to be different from the history information used for writing by the initialization means. Changing means, searching means for searching for a disk device having a larger history information of the disk device ordered by the setting means after the history information corresponding to each disk device, If the history information corresponding to the disk device searched by is changed by the changing means, the disk is changed. A disk array device comprising: a judgment unit that judges that the device is a replaced disk device, and otherwise judges that the disk device is a disk device having an access failure.

3. A plurality of disk devices for storing data, setting means for setting the order in which all these disk devices are circulated, and a predetermined storage area for each of the plurality of disk devices. An initialization unit that writes first history information as history information used to determine the state of the disk device, a detection unit that detects that an access failure has occurred in the disk device, and an access failure is detected by this detection unit. Specifying means for specifying a disk device to be ordered later by the setting means of the disk device; and a second history different from the first history information in the contents of the predetermined storage area of the disk device specified by the specifying means. Rewriting means for rewriting to information and reading information stored in the predetermined storage area of each disk device at startup The reading means and the second reading means based on the information read by the reading means.
Search means for searching the disk device in which the information matching the history information of the disk device is stored, and the disk device previously ordered by the setting means of the disk device searched by the searching means is a disk device having an access failure. A disk array device comprising: a determination unit that determines that

4. A plurality of disk devices for storing data, setting means for setting the order in which all these disk devices are circulated, and a predetermined storage area for each of the plurality of disk devices. An initialization unit that writes first history information as history information used to determine the state of the disk device, a detection unit that detects that an access failure has occurred in the disk device, and an access failure is detected by this detection unit. Specifying means for specifying a disk device to be ordered later by the setting means of the disk device; and a second history different from the first history information in the contents of the predetermined storage area of the disk device specified by the specifying means. Rewriting means for rewriting to information and reading information stored in the predetermined storage area of each disk device at startup The reading means and the second reading means based on the information read by the reading means.
Based on the information read by the reading means, the first specifying means for specifying the disk device previously ordered by the setting means of the disk device storing the information matching the history information of 1
The second specifying means for specifying the disk device that stores the information that does not match the first and second history information and the disk device specified by the first and second specifying means are the same device. Sometimes, the first discriminating unit discriminates that the disc device is a properly exchanged disc device, and the discriminating unit is not discriminated by the second discriminating unit. A disk array device, comprising: a second judging means for judging that the disk device specified by the first specifying means is a disk device having an access failure when the specifying is performed.