JP2003303057A

JP2003303057A - Method for data recovery and disk array controller in disk array apparatus

Info

Publication number: JP2003303057A
Application number: JP2003085671A
Authority: JP
Inventors: Kyoichi Sasamoto; 享一笹本; Masayuki Takakuwa; 正幸高桑
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2003-03-26
Filing date: 2003-03-26
Publication date: 2003-10-24

Abstract

<P>PROBLEM TO BE SOLVED: To achieve quick detection and recovery of media failures by executing media check processing with a distinction between areas actually in use and not in use at a file system among disk areas of a disk array. <P>SOLUTION: In a media check processing for checking partial failures of a plurality of HDDs (hard disk drives) composed of the disk array, it is determined whether each stripe in the disk areas of the disk array is used by the file system or not based on a disk source management table (S41, S42), and a media check including data reading from the HDDs is executed only for a stripe (a first stripe) used by the file system (S43, S44). <P>COPYRIGHT: (C)2004,JPO

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、ＲＡＩＤ（Redund
ant Arrays of Inexpensive Disks）構成のディスクア
レイ装置におけるデータ修復方法に係り、特にディスク
アレイを構成するメンバーのディスクドライブの部分的
な障害（メディア障害）を検出して修復する場合に好適
なデータ修復方法及びディスクアレイコントローラに関
する。TECHNICAL FIELD The present invention relates to a RAID (Redundant
The present invention relates to a data recovery method in a disk array device having an ant array of Inexpensive Disks), and particularly a data recovery method suitable for detecting and repairing a partial failure (media failure) of a disk drive of a member forming the disk array. And a disk array controller.

【０００２】[0002]

【従来の技術】一般にディスクアレイ装置は、複数のデ
ィスクドライブ、問えば磁気ディスク装置（以下、ＨＤ
Ｄと称する）から構成されるディスクアレイと、このデ
ィスクアレイ内の各ＨＤＤ（メンバーＨＤＤ）に対する
アクセスを制御するディスクアレイコントローラとを備
え、当該各ＨＤＤを並列に動かして読み出し／書き込み
を分散して実行することでアクセスの高速化を図ると共
に、冗長構成によって信頼性の向上を図るようにした外
部記憶装置として知られている。2. Description of the Related Art Generally, a disk array device is composed of a plurality of disk drives, that is, a magnetic disk device (hereinafter referred to as HD
D)) and a disk array controller for controlling access to each HDD (member HDD) in this disk array, and each HDD is moved in parallel to distribute read / write. It is known as an external storage device in which the speed of access is increased by executing it and the reliability is improved by a redundant configuration.

【０００３】上記ディスクアレイコントローラは、ホス
ト計算機から転送される書き込みデータに対して、デー
タ訂正情報としての冗長データを生成し、上記複数のＨ
ＤＤのうちのいずれかに書き込むようになっている。こ
れにより、複数のＨＤＤのうちの１台の故障に対し、こ
の冗長データと残りのＨＤＤのデータを用いて故障した
ＨＤＤのデータを修復することを可能としている。The disk array controller generates redundant data as data correction information for the write data transferred from the host computer, and the plurality of H
It is adapted to write to any one of the DDs. As a result, when one of the plurality of HDDs fails, the redundant HDD data and the remaining HDD data can be used to restore the failed HDD data.

【０００４】データ冗長化の手法の１つとして、ＲＡＩ
Ｄの手法が知られている。ＲＡＩＤ手法では、ＲＡＩＤ
のデータと冗長データとの関連において、種々のＲＡＩ
Ｄレベルに分類されている。ＲＡＩＤレベルの代表的な
ものにレベル３とレベル５がある。As one of data redundancy methods, RAI
The D method is known. In the RAID method, RAID
Various RAIs in relation to redundant data and redundant data
Classified as D level. Levels 3 and 5 are typical RAID levels.

【０００５】レベル３（ＲＡＩＤレベル３）では、ホス
ト計算機から転送される更新データ（書き込みデータ）
を分割して、その分割された更新データ間の排他的論理
和演算を行うことで冗長データとしてのパリティデータ
を生成し、当該パリティデータで複数のＨＤＤのいずれ
かに書き込まれている元のパリティデータを更新する。
一方、レベル５（ＲＡＩＤレベル５）では、ホスト計算
機から転送される更新データ（新データ）と、当該更新
データの格納先となるＨＤＤ内領域に格納されている更
新前のデータ（旧データ）と、当該更新データの格納先
に対応する別のＨＤＤの領域に格納されている更新前の
パリティデータ（旧パリティデータ）との間の排他的論
理和演算を行うことで、更新されたパリティデータ（新
パリティデータ）を生成し、当該更新パリティデータで
元のパリティデータを更新する。At level 3 (RAID level 3), update data (write data) transferred from the host computer
To generate parity data as redundant data by performing an exclusive OR operation between the divided update data, and the original parity written in any of the plurality of HDDs with the parity data. Update the data.
On the other hand, at level 5 (RAID level 5), the update data (new data) transferred from the host computer and the pre-update data (old data) stored in the HDD internal area where the update data is stored. , The parity data updated by performing an exclusive OR operation with the parity data before update (old parity data) stored in the area of another HDD corresponding to the storage destination of the update data. New parity data) is generated, and the original parity data is updated with the updated parity data.

【０００６】このようなＲＡＩＤ構成のディスクアレイ
装置では、ディスクアレイ内のメンバーＨＤＤが故障し
た場合に、故障したＨＤＤ以外のＨＤＤから、ディスク
アレイのディスク領域を管理する単位であるストライプ
毎にデータを読み出して、それらのデータの排他的論理
和演算を行うＲＡＩＤの機能により、故障したＨＤＤの
すべての領域のデータを、故障したＨＤＤに代えて用い
られるＨＤＤ内に修復することができる。この故障した
ＨＤＤに代えて用いられるＨＤＤは、故障したＨＤＤと
交換して用いられるＨＤＤ、またはディスクコントロー
ラに予め接続されていて、故障したＨＤＤの代替として
割り付けられるスペアＨＤＤである。In a disk array device having such a RAID structure, when a member HDD in the disk array fails, data is read from the HDDs other than the failed HDD for each stripe, which is a unit for managing the disk area of the disk array. By the RAID function of reading and performing an exclusive OR operation of those data, data in all areas of the failed HDD can be restored in the HDD used in place of the failed HDD. The HDD used in place of the failed HDD is an HDD used by replacing the failed HDD, or a spare HDD that is previously connected to the disk controller and is assigned as a substitute for the failed HDD.

【０００７】[0007]

【発明が解決しようとする課題】このように、ＲＡＩＤ
構成のディスクアレイ装置では、ＨＤＤが故障しても、
故障したＨＤＤのデータを元通りに修復することができ
る。As described above, the RAID
In the disk array device with the configuration, even if the HDD fails
The data of the failed HDD can be restored to the original state.

【０００８】ところが従来のディスクアレイ装置では、
故障したＨＤＤ内のデータを元通りに修復するのに、す
べてのＨＤＤ領域のデータをＲＡＩＤの機能により修復
していた。このため、近年のようにＨＤＤ容量が増加す
るのに伴い、データの修復に非常に時間がかかるという
問題があった。However, in the conventional disk array device,
In order to restore the data in the failed HDD to its original state, the data in all HDD areas are restored by the RAID function. Therefore, as the HDD capacity has increased in recent years, there has been a problem that it takes a very long time to restore data.

【０００９】また、データ修復をしている最中は、一般
にＲＡＩＤによるデータの冗長性が損なわれる。このた
め、修復に時間がかかるほど更に他のＨＤＤ故障も発生
しやすくなって、データ修復不能となり、データが消失
する危険性が高まる。また、故障したＨＤＤのデータを
修復するためには、その他のＨＤＤの全領域を読み出す
必要がある。もし、この読み出しの対象となるＨＤＤに
てメディア障害（ＨＤＤの部分的な障害）が発生する
と、ＨＤＤの多重障害となってデータ修復不能となるた
め、ディスクアレイのディスク領域（ディスクボリュー
ム）の閉塞またはデータ修復処理の継続不能となる。こ
れによりディスクアレイ装置の信頼性が低下する。During data restoration, data redundancy due to RAID is generally lost. For this reason, the longer the restoration time, the more likely it is that another HDD failure will occur, making the data unrecoverable and increasing the risk of data loss. Further, in order to recover the data of the failed HDD, it is necessary to read all the areas of the other HDDs. If a media failure (partial failure of the HDD) occurs in the HDD that is the target of reading, multiple failures of the HDD occur and data cannot be recovered. Therefore, the disk area (disk volume) of the disk array is blocked. Or, the data restoration process cannot be continued. This reduces the reliability of the disk array device.

【００１０】ところが本発明者は、ディスクアレイ装置
を利用するホスト計算機のファイルシステムが実際に使
用しているＨＤＤ領域以外は、データ修復処理は必ずし
も必要でないことを想到するに至った。そこで、ファイ
ルシステムが実際に使用しているＨＤＤ領域（ディスク
領域）のみを対象にＨＤＤ（ディスクドライブ）のデー
タ修復を行うならば、データ修復処理に要する時間を短
縮して、ＨＤＤ故障などの危険性を減らすことが可能と
なる。しかし、従来のディスクアレイ装置は、ホスト計
算機のファイルシステムが実際に使用しているＨＤＤ領
域を知ることができず、したがってファイルシステムが
使用しているＨＤＤ領域のみを対象にＨＤＤのデータ修
復を行うことはできない。However, the inventor of the present invention has come to the idea that the data recovery process is not necessarily required except for the HDD area actually used by the file system of the host computer that uses the disk array device. Therefore, if the data of the HDD (disk drive) is restored only for the HDD area (disk area) actually used by the file system, the time required for the data restoration process is shortened, and the risk of HDD failure or the like is reduced. It is possible to reduce the sex. However, the conventional disk array device cannot know the HDD area actually used by the file system of the host computer, and therefore the HDD data is restored only for the HDD area used by the file system. It is not possible.

【００１１】また、ＨＤＤのメディア障害、つまりＨＤ
Ｄの部分的な障害であるセクタブロックの障害を早期に
検出しこれを修復する目的で、周期的にＨＤＤの内容を
読み出して検査するＨＤＤメディア検査処理が一般に知
られている。しかし従来のディスクアレイ装置では、フ
ァイルシステムが使用しているＨＤＤ領域が不明のた
め、ＨＤＤの全領域について読み出し検査をする必要が
あった。このため、故障したＨＤＤのデータの修復処理
の場合と同様に、検査に非常に時間を要するという問題
があった。In addition, a HDD media failure, that is, HD
For the purpose of early detection and repair of a sector block failure, which is a partial failure of D, an HDD media inspection process for periodically reading and inspecting the contents of the HDD is generally known. However, in the conventional disk array device, since the HDD area used by the file system is unknown, it is necessary to perform a read test on the entire area of the HDD. For this reason, there is a problem that the inspection takes a very long time, as in the case of the restoration processing of the data of the failed HDD.

【００１２】また、メディア障害を発見した際には、そ
の障害箇所のセクタブロックを他のセクタブロックに代
替する処理を行い、その代替先のブロック内に、代替元
のデータを修復する必要がある。しかし、このデータ修
復が行えるのは、ＲＡＩＤの機能によりデータの冗長性
が確保されている場合に限られる。したがって、ＨＤＤ
故障などに起因して行われるデータの修復中は、ＨＤＤ
メディア検査処理にてメディア障害を見つけても修復す
ることができず、ディスクアレイ装置の信頼性が低下す
るという問題もあった。Further, when a media failure is found, it is necessary to perform processing for replacing the sector block at the failure location with another sector block and restore the replacement source data in the replacement destination block. . However, this data restoration can be performed only when the redundancy of data is secured by the RAID function. Therefore, the HDD
While data is being restored due to a failure, the HDD is
Even if a media failure is found in the media inspection process, it cannot be repaired, and the reliability of the disk array device is reduced.

【００１３】本発明は上記事情を考慮してなされたもの
でその目的は、ディスクアレイのディスク領域のうちフ
ァイルシステムにて実際に使用されている領域と使用さ
れていない領域とを区別してメディア検査処理を実行す
ることで、メディア障害の早期検出・早期修復を実現
し、これによりディスクアレイ装置の信頼性を向上でき
るようにすることにある。The present invention has been made in consideration of the above circumstances, and an object thereof is to perform a media inspection by discriminating an area actually used in a file system and an area not used in a disk area of a disk array. By executing the processing, it is possible to realize the early detection and early repair of the media failure, thereby improving the reliability of the disk array device.

【００１４】[0014]

【課題を解決するための手段】本発明は、ディスクアレ
イのディスク領域を論理ブロック単位に管理するファイ
ルシステムを備えたホスト計算機から、当該ファイルシ
ステムにより使用されている論理ブロックまたは当該論
理ブロックを含むストライプを示す第１のディスク資源
管理情報を取得するステップと、上記第１のディスク資
源管理情報から上記ディスク領域内の各ストライプ毎に
ファイルシステムにより使用されている論理ブロックを
含むか否かを示す第２のディスク資源管理情報を生成し
てディスクアレイ装置内に保持するステップと、上記ホ
スト計算機からデータ書き込み要求を受け取った場合、
当該要求で指定されたデータ書き込み先が属するストラ
イプを特定するステップと、この特定されたストライプ
がファイルシステムにより使用されている論理ブロック
を含む第１のストライプであるか或いはファイルシステ
ムにより使用されている論理ブロックを含まない第２の
ストライプであるかを、上記ディスクアレイ装置内に保
持されている第２のディスク資源管理情報に基づいて判
定するステップと、上記特定されたストライプが上記第
２のストライプであると判定された場合、当該特定され
たストライプが上記第１のストライプであることを示す
ように上記第２のディスク資源管理情報を更新するステ
ップと、ディスクアレイ内の各ディスクドライブの記憶
内容を読み出すことにより当該ディスクドライブの部分
的な障害を検出するメディア検査処理を実行する場合
に、ディスクアレイのディスク領域のすべてのストライ
プについて上記第１のストライプであるか、或いは上記
第２のストライプであるかを上記第２のディスク資源管
理情報に基づいて判定するステップと、第１のストライ
プ、即ちファイルシステムが使用しているストライプに
ついてのみ、ディスクドライブからのデータ読み出しを
含むメディア検査を実行するステップと、このメディア
検査で障害が検出された箇所のデータをＲＡＩＤ機能に
より修復するステップとを備えたことを特徴とする。The present invention includes a logical block used by the file system or a logical block used by the file system from a host computer having a file system for managing the disk area of the disk array in logical block units. A step of acquiring first disk resource management information indicating a stripe and indicating whether each stripe in the disk area includes a logical block used by a file system from the first disk resource management information Generating a second disk resource management information and holding it in the disk array device; and when receiving a data write request from the host computer,
The step of identifying the stripe to which the data write destination specified by the request belongs, and the identified stripe is the first stripe including the logical block used by the file system or is used by the file system. A step of determining whether the stripe is a second stripe that does not include a logical block based on the second disk resource management information held in the disk array device; and the specified stripe is the second stripe. And the storage content of each disk drive in the disk array, the step of updating the second disk resource management information so as to indicate that the specified stripe is the first stripe. To detect a partial failure of the disk drive. When performing the media inspection process, it is determined whether all the stripes in the disk area of the disk array are the first stripes or the second stripes based on the second disk resource management information. To perform the media check including reading data from the disk drive only for the first stripe, that is, the stripe used by the file system, and the data at the location where the failure is detected by this media check. And a step of repairing by a RAID function.

【００１５】このような構成においては、ディスクアレ
イ装置内の例えばディスクアレイコントローラにて、ホ
スト計算機から第１のディスク資源管理情報が取得さ
れ、当該情報から第２のディスク資源管理情報が生成さ
れてディスクアレイ装置内の例えばディスクアレイコン
トローラに保持される。ここでのメディア検査処理は、
ファイルシステムが実際に使用している論理ブロックを
含んでいることが第２のディスク資源管理情報によって
示されているストライプ（第２のストライプ）だけを対
象に実行される。In such a configuration, the first disk resource management information is acquired from the host computer, for example, by the disk array controller in the disk array device, and the second disk resource management information is generated from the information. It is held in, for example, a disk array controller in the disk array device. The media inspection process here is
Only the stripe (second stripe) indicated by the second disk resource management information that the file system contains the logical block actually used is executed.

【００１６】このように、ファイルシステムにて使用さ
れているストライプのみ抽出してメディア検査をするこ
とにより、その検査に要する処理時間を短縮することが
できる。また、処理時間を短縮したことで、その結果と
してメディア障害をより早期に見つけることが可能とな
り、ディスクドライブの信頼性向上が図れる。これによ
りディスクアレイ装置の信頼性も向上する。In this way, by extracting only the stripes used in the file system and inspecting the medium, the processing time required for the inspection can be shortened. Further, by shortening the processing time, as a result, it becomes possible to detect the media failure earlier, and the reliability of the disk drive can be improved. This also improves the reliability of the disk array device.

【００１７】ホスト計算機から取得した第１のディスク
資源管理情報は、当該情報を取得した直前までの論理ブ
ロックの使用状況を表す。したがって、それ以降に発生
したファイルの更新（ディスクアレイ装置へのデータ書
き込み）については反映されていない。しかし、ファイ
ルの更新の度に最新の第１のディスク資源管理情報をホ
スト計算機から取得するのは大幅な性能低下となる。The first disk resource management information acquired from the host computer represents the usage status of the logical block until immediately before the acquisition of the information. Therefore, file updates (data writing to the disk array device) that occurred after that are not reflected. However, acquiring the latest first disk resource management information from the host computer every time the file is updated results in a significant decrease in performance.

【００１８】そこで、ホスト計算機からのデータ書き込
み要求をディスクアレイ装置が受け取った場合、つまり
ディスクアレイ装置でのファイル更新が発生する場合、
当該データ書き込み要求に基づいて、ディスクアレイ装
置にて保持している第２のディスク資源管理情報を、ス
トライプの最新の使用状況を表すようにディスクアレイ
装置自身が更新するとよい。Therefore, when the disk array device receives a data write request from the host computer, that is, when a file update occurs in the disk array device,
Based on the data write request, the disk array device itself may update the second disk resource management information held in the disk array device so as to represent the latest usage status of the stripe.

【００１９】また本発明は、上記メディア検査処理を全
てのストライプについてストライプ毎に順次実行し、デ
ィスクドライブの部分的な障害が検出された場合には、
障害が検出された箇所を含むストライプが第１または第
２のストライプのいずれであるかを判定し、第１のスト
ライプの場合で、即ちファイルシステムが使用している
ストライプである場合で、障害が検出されたディスクド
ライブ以外のディスクドライブが正常な場合には、障害
が検出された箇所のデータをＲＡＩＤ機能により修復
し、第２のストライプの場合で、即ちファイルシステム
が使用していないストライプである場合には、障害が検
出された箇所のデータを固定データにより修復すること
をも特徴とする。Further, according to the present invention, the above-mentioned media inspection process is sequentially executed for all stripes, and when a partial failure of the disk drive is detected,
It is determined whether the stripe including the location where the failure is detected is the first stripe or the second stripe, and in the case of the first stripe, that is, the stripe used by the file system, the failure is detected. When a disk drive other than the detected disk drive is normal, the data at the location where the failure is detected is restored by the RAID function, and in the case of the second stripe, that is, the stripe not used by the file system. In this case, the feature is that the data at the location where the failure is detected is restored by the fixed data.

【００２０】このように、メディア検査で障害が検出さ
れた箇所を含むストライプがファイルシステムにより使
用されている場合で、且つ障害が検出されたディスクド
ライブ以外のディスクドライブが正常な場合だけ、障害
が検出された箇所のデータをＲＡＩＤ機能により修復す
る。これに対し、障害が検出された箇所を含むストライ
プがファイルシステムにより使用されていない場合に
は、そのストライプ内のデータを保持している必要はな
いため、障害が検出されたディスクドライブ以外のディ
スクドライブが正常であるか否かに無関係に、つまりＲ
ＡＩＤ機能によるデータの冗長性が確保されているか否
かに無関係に、障害が検出された箇所のデータを強制的
に固定データにより修復することで、メディア障害の修
復できる可能性が大幅に向上してディスクドライブの信
頼性を向上し、これによりディスクアレイ装置の信頼性
も向上する。As described above, the failure occurs only when the stripe including the location where the failure is detected by the media inspection is used by the file system and when the disk drive other than the disk drive where the failure is detected is normal. The data at the detected location is restored by the RAID function. On the other hand, if the stripe containing the location where the failure was detected is not used by the file system, it is not necessary to hold the data in that stripe, so a disk other than the disk drive where the failure was detected is not required. Irrespective of whether the drive is normal, ie R
Regardless of whether or not data redundancy by the AID function is secured, by forcibly repairing the data at the location where the failure is detected with fixed data, the possibility of being able to repair the media failure is greatly improved. The reliability of the disk drive is improved, and thus the reliability of the disk array device is also improved.

【００２１】なお、以上の方法に係る発明は、装置（デ
ィスクアレイコントローラ、または同ディスクアレイコ
ントローラを備えたディスクアレイ装置、または同ディ
スクアレイ装置を備えた計算機システム）に係る発明と
しても成立する。The invention related to the above method is also realized as an invention related to a device (disk array controller, or a disk array device including the same disk array controller, or a computer system including the same disk array device).

【００２２】[0022]

【発明の実施の形態】以下、本発明の実施の形態につき
図面を参照して説明する。図１は本発明の一実施形態に
係るディスクアレイ装置を備えた計算機システムの構成
を示すブロック図である。図１の計算機システムは、ホ
スト計算機１０と、このホスト計算機１０によって利用
されるディスクアレイ装置２０とから構成される。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing the configuration of a computer system including a disk array device according to an embodiment of the present invention. The computer system shown in FIG. 1 includes a host computer 10 and a disk array device 20 used by the host computer 10.

【００２３】ホスト計算機１０は、当該ホスト計算機１
０と接続されているディスクアレイ装置２０（のディス
ク領域）に格納されるファイルを管理するファイルシス
テム１１を備えている。このファイルシステム１１はＯ
Ｓ（オペレーティングシステム）により提供される機能
の一部である。The host computer 10 is the host computer 1
A file system 11 for managing files stored in (a disk area of) the disk array device 20 connected to the disk array device 0 is provided. This file system 11 is O
It is a part of the function provided by S (operating system).

【００２４】ホスト計算機１０は、ディスクアレイ装置
２０のディスク領域内のすべての論理ブロック（連続す
る複数の物理セクタブロックで構成される固定長のブロ
ック）について、そのブロックがファイルシステム１１
により使用されている（つまり有効なデータが格納され
ている）か、或いは使用されていない（つまりデータは
格納されておらず新しいデータを格納できる）かを示す
管理テーブル（以下、ディスク資源管理テーブルと称す
る）１２を、当該計算機１０が持つ記憶装置、例えばＨ
ＤＤ（図示せず）に保持している。この記憶装置がディ
スクアレイ装置２０であっても構わない。また、ディス
ク資源管理テーブル１２が、ディスク領域内のすべての
物理セクタブロックについて、そのブロックがファイル
システム１１により使用されている否かを示すものであ
っても構わない。The host computer 10 has a file system 11 for all logical blocks (fixed length blocks composed of a plurality of consecutive physical sector blocks) in the disk area of the disk array device 20.
Management table (hereinafter, disk resource management table) that indicates whether it is used (that is, valid data is stored) or is not used (that is, no data is stored and new data can be stored) 12) is a storage device of the computer 10, for example, H
It is held in DD (not shown). The storage device may be the disk array device 20. Also, the disk resource management table 12 may indicate whether or not all physical sector blocks in the disk area are used by the file system 11.

【００２５】ホスト計算機１０（が持つ記憶装置）に
は、ファイルシステム１１から予め定められたタイミン
グでディスク資源管理テーブル１２を取得して、当該テ
ーブル１２から生成されるディスク資源管理情報リスト
９０（図９参照）をディスクアレイ装置２０に送信する
専用ソフトウエア１３がインストールされている。この
ディスク資源管理情報リスト９０は、後述するようにフ
ァイルシステム１１により使用されているすべての論理
ブロックについて、そのブロックの識別情報としての論
理ブロック番号の集合からなる。A disk resource management table 12 is acquired from the file system 11 in the host computer 10 (a storage device of the host computer) at a predetermined timing, and a disk resource management information list 90 (FIG. 9)) is installed in the disk array device 20. The disk resource management information list 90 is composed of a set of logical block numbers as identification information of all logical blocks used by the file system 11 as described later.

【００２６】ディスクアレイ装置２０は、ディスクアレ
イ２１とディスクアレイコントローラ２２とから構成さ
れている。ディスクアレイ２１は、ディスクアレイコン
トローラ２２と接続される複数のディスクドライブ、例
えば４台のＨＤＤ（磁気ディスク装置）２１０-0〜２１
０-3から構成される。ディスクアレイコントローラ２２
には、ＨＤＤ２１０-0〜２１０-3のいずれかに障害が発
生した場合のバックアップディスク用に割り当てられる
スペアＨＤＤ（図示せず）も接続されている。The disk array device 20 comprises a disk array 21 and a disk array controller 22. The disk array 21 includes a plurality of disk drives connected to the disk array controller 22, for example, four HDDs (magnetic disk devices) 210-0 to 210.
It consists of 0-3. Disk array controller 22
Also connected to this is a spare HDD (not shown) that is assigned as a backup disk when any of the HDDs 210-0 to 210-3 fails.

【００２７】ディスクアレイコントローラ２２は、ディ
スクアレイ２１内の各ＨＤＤ２１０-0〜２１０-3に対す
るアクセスを制御する。ディスクアレイコントローラ２
２は、当該コントローラ２２の主制御部をなすマイクロ
プロセッサ２２１と、メモリ２２２とを備えている。メ
モリ２２２には、マイクロプロセッサ２２１が実行する
制御プログラム２２２ａが格納されている。またメモリ
２２２には、ディスク資源管理テーブル領域２２２ｂが
確保されている。このディスク資源管理テーブル領域２
２２ｂは、ホスト計算機１０から送信されるディスク資
源管理情報リスト９０をもとに生成されるディスク資源
管理テーブル１２０を格納するのに用いられる。The disk array controller 22 controls access to each HDD 210-0 to 210-3 in the disk array 21. Disk array controller 2
2 includes a microprocessor 221 which is a main control unit of the controller 22 and a memory 222. The memory 222 stores a control program 222a executed by the microprocessor 221. A disk resource management table area 222b is secured in the memory 222. This disk resource management table area 2
22b is used to store the disk resource management table 120 generated based on the disk resource management information list 90 transmitted from the host computer 10.

【００２８】本実施形態では、ディスクアレイ装置２０
がＲＡＩＤ５レベルで用いられるものとする。この場
合、ＨＤＤ２１０-0〜２１０-3がいずれもデータ並びに
パリティデータ（冗長データ）の格納用（データ・パリ
ティディスク用）に用いられる。なお、ディスクアレイ
装置２０がＲＡＩＤ３レベルで用いられる場合には、Ｈ
ＤＤ２１０-0〜２１０-3のうちの３台がデータ格納用
（データディスク用）に、残りの１台がパリティデータ
格納用（パリティディスク用）に割り当てられる。In this embodiment, the disk array device 20
Shall be used at the RAID 5 level. In this case, the HDDs 210-0 to 210-3 are used for storing data and parity data (redundant data) (for data / parity disk). When the disk array device 20 is used at the RAID3 level, H
Three of the DDs 210-0 to 210-3 are allocated for data storage (for data disk), and the other one is allocated for parity data storage (for parity disk).

【００２９】ディスクアレイ装置２０（内のディスクア
レイコントローラ２２）では、ＨＤＤ２１０-0〜２１０
-3によって実現されるディスクアレイ２１のディスク領
域を、図２に示すように複数のストライプ２３に分割し
て管理する。このストライプ２３のサイズは、１ＨＤＤ
当たり６４Ｋ（キロ）バイト〜２５６Ｋバイト程度に設
定されるのが一般的である。ストライプ２３は、少なく
とも１つの論理ブロック２４から構成される。この論理
ブロック２４は、ホスト計算機１０のファイルシステム
１１がディスクアレイ装置２０（内のディスクアレイ装
置２０）のディスク領域を管理する際の最小単位であ
る。つまり、ディスクアレイ装置２０のディスク領域
は、当該ディスクアレイ装置２０ではストライプ２３を
単位に管理され、ホスト計算機１０では論理ブロック２
４を単位に管理される。通常、論理ブロックのサイズは
１Ｋバイト〜８Ｋバイト程度である。論理ブロック２４
は、連続する複数の物理セクタブロック２５から構成さ
れる。このセクタブロック２５のサイズは５１２バイト
であるのが一般的である。In the disk array device 20 (inside the disk array controller 22), the HDDs 210-0 to 210
The disk area of the disk array 21 realized by -3 is managed by being divided into a plurality of stripes 23 as shown in FIG. The size of this stripe 23 is 1 HDD
It is generally set to about 64 K (kilo) bytes to 256 K bytes. The stripe 23 is composed of at least one logical block 24. The logical block 24 is the minimum unit when the file system 11 of the host computer 10 manages the disk area of the disk array device 20 (the disk array device 20 therein). That is, the disk area of the disk array device 20 is managed in units of stripes 23 in the disk array device 20, and in the host computer 10, the logical block 2 is managed.
It is managed in units of 4. Usually, the size of the logical block is about 1 Kbyte to 8 Kbytes. Logic block 24
Is composed of a plurality of consecutive physical sector blocks 25. The size of this sector block 25 is generally 512 bytes.

【００３０】図３は、ホスト計算機１０内に保持される
ディスク資源管理テーブル１２とディスクアレイコント
ローラ２２のメモリ２２２内のディスク資源管理情報領
域２２２ｂに格納されるディスク資源管理テーブル１２
０のデータ構造例を示す。FIG. 3 shows the disk resource management table 12 held in the host computer 10 and the disk resource management table 12 stored in the disk resource management information area 222b in the memory 222 of the disk array controller 22.
An example of the data structure of 0 is shown.

【００３１】ディスク資源管理テーブル１２の各エント
リの並び順で決まるエントリ番号は、そのまま論理ブロ
ック番号を表すようになっている。このテーブル１２の
各エントリには、そのエントリに固有の論理ブロック番
号で表される論理ブロックがファイルシステム１１によ
り使用されているか否かを示すフラグが設定されてい
る。なお、ディスク資源管理テーブル１２の各エントリ
に、論理ブロック番号と上記フラグとの対が設定される
ものであっても構わない。The entry number determined by the arrangement order of the entries in the disk resource management table 12 directly represents the logical block number. A flag indicating whether or not a logical block represented by a logical block number unique to the entry is used by the file system 11 is set in each entry of the table 12. A pair of the logical block number and the flag may be set in each entry of the disk resource management table 12.

【００３２】一方、ディスク資源管理テーブル１２０の
各エントリの並び順で決まるエントリ番号は、そのまま
ストライプ番号を表すようになっている。このテーブル
１２０の各エントリには、そのエントリに固有のストラ
イプ番号で表されるストライプ（に含まれる論理ブロッ
クの少なくとも１つ）がファイルシステム１１により使
用されているか否かを示すフラグが設定されている。な
お、ディスク資源管理テーブル１２０の各エントリに、
ストライプ番号と上記フラグとの対が設定されるもので
あっても構わない。On the other hand, the entry number determined by the order of arrangement of the entries in the disk resource management table 120 directly represents the stripe number. A flag indicating whether or not a stripe (at least one of logical blocks included in the stripe) represented by a stripe number unique to the entry is used by the file system 11 is set in each entry of the table 120. There is. In addition, in each entry of the disk resource management table 120,
A pair of a stripe number and the above flag may be set.

【００３３】次に、図１の構成の計算機システムにおけ
る動作を、（１）ホスト計算機１０からのディスク資源
管理情報リスト送信時の処理、（２）ホスト計算機１０
からのデータ書き込み要求発行時の処理、（３）ディス
クアレイ装置２０におけるデータ修復処理、（４）ディ
スクアレイ装置２０におけるＨＤＤメディア障害検査処
理を例に順に説明する。Next, the operation of the computer system configured as shown in FIG. 1 will be described as follows: (1) processing when the disk resource management information list is transmitted from the host computer 10, (2) host computer 10
The process for issuing a data write request from (1), (3) data recovery process in the disk array device 20, and (4) HDD media failure inspection process in the disk array device 20 will be sequentially described as an example.

【００３４】（１）ホスト計算機１０からのディスク資
源管理情報リスト送信時の処理まず、ホスト計算機１０からのディスク資源管理情報リ
スト送信時の処理について、図４のフローチャートを参
照して説明する。(1) Processing when transmitting the disk resource management information list from the host computer 10 First, the processing when transmitting the disk resource management information list from the host computer 10 will be described with reference to the flowchart of FIG.

【００３５】ホスト計算機１０は、当該計算機１０（の
記憶装置）にインストールされている専用ソフトウエア
１３に従い、予め定められたタイミングで、その時点に
当該計算機１０（の記憶装置）に保持されているディス
ク資源管理テーブル１２をファイルシステム１１から取
得する。そしてホスト計算機１０は、このディスク資源
管理テーブル１２から図９に示すディスク資源管理情報
リスト９０を作成し、当該リスト９０をディスクアレイ
装置２０に送信する。このときホスト計算機１０は、デ
ィスク資源管理テーブル１２の内容がディスク資源管理
情報リスト９０の送信中に変化するのを防止するため
に、ファイルの更新が発生しないように配慮することが
好ましい。また、ディスク資源管理情報リスト９０は極
めて大きなサイズとなる可能性があり、その場合には当
該リスト９０の送信に長時間を要する。そこで、ディス
ク資源管理情報リスト９０の送信がホスト計算機１０の
効率に影響を及ぼさないように、送信タイミングとし
て、ホスト計算機１０の立ち上げ時、或いはホスト計算
機１０の負荷が少ない夜間等の一定周期を設定するとよ
い。The host computer 10 is held in (the storage device of) the computer 10 at a predetermined timing according to a dedicated software 13 installed in (the storage device of) the computer 10 at a predetermined timing. The disk resource management table 12 is acquired from the file system 11. Then, the host computer 10 creates a disk resource management information list 90 shown in FIG. 9 from the disk resource management table 12 and sends the list 90 to the disk array device 20. At this time, in order to prevent the contents of the disk resource management table 12 from changing during transmission of the disk resource management information list 90, it is preferable that the host computer 10 considers that no file update occurs. Further, the disk resource management information list 90 may have an extremely large size, in which case it takes a long time to transmit the list 90. Therefore, in order to prevent the transmission of the disk resource management information list 90 from affecting the efficiency of the host computer 10, the transmission timing is set to a fixed period such as when the host computer 10 is started up or at night when the load of the host computer 10 is low. Good to set.

【００３６】さて、ホスト計算機１０からディスクアレ
イ装置２０に送信されるディスク資源管理情報リスト９
０は、図９に示すように、ホスト計算機１０内のファイ
ルシステム１１がディスクアレイ装置２０のディスク領
域（ディスクボリューム）を扱う際の論理ブロック２４
のサイズ９１と、このディスク領域内のすべての論理ブ
ロック２４のうち、ファイルシステム１１により使用さ
れている論理ブロック２４のブロック番号（論理ブロッ
ク番号）９２，９２…の集合とから構成される。このよ
うに、ディスク資源管理情報リスト９０内に、ファイル
システム１１により使用されていない論理ブロック２４
の情報（論理ブロック番号）が含まれていないのは、当
該リスト９０のサイズを小さくすることで、当該リスト
９０をホスト計算機１０からディスクアレイ装置２０に
送信するのに要する時間を短縮するためである。通常、
ディスクアレイ装置２０のディスク領域のうち、ファイ
ルシステム１１によって使用されている領域の占める割
合は少ない。このような場合、ファイルシステム１１に
より使用されていない論理ブロック２４の情報をディス
ク資源管理情報リスト９０に含まないことは、当該リス
ト９０の送信時間を短縮するのに特に効果がある。な
お、ディスク資源管理テーブル１２自体をホスト計算機
１０からディスクアレイ装置２０に送信するようにして
も構わない。また、ディスク資源管理情報リスト９０ま
たはディスク資源管理テーブル１２０のうちデータ量の
少ない方を送信するようにしても構わない。この場合、
リスト９０またはテーブル１２０のいずれの送信である
かを示す情報を付加して送信するとよい。Now, the disk resource management information list 9 transmitted from the host computer 10 to the disk array device 20
As shown in FIG. 9, 0 is a logical block 24 when the file system 11 in the host computer 10 handles the disk area (disk volume) of the disk array device 20.
, And a set of block numbers (logical block numbers) 92, 92 ... Of the logical blocks 24 used by the file system 11 among all the logical blocks 24 in this disk area. Thus, in the disk resource management information list 90, the logical blocks 24 that are not used by the file system 11 are
Information (logical block number) is not included because the size of the list 90 is reduced to shorten the time required to transmit the list 90 from the host computer 10 to the disk array device 20. is there. Normal,
The ratio of the area used by the file system 11 to the disk area of the disk array device 20 is small. In such a case, not including the information of the logical block 24 not used by the file system 11 in the disk resource management information list 90 is particularly effective in reducing the transmission time of the list 90. The disk resource management table 12 itself may be transmitted from the host computer 10 to the disk array device 20. Further, one of the disk resource management information list 90 or the disk resource management table 120 having the smaller data amount may be transmitted. in this case,
Information indicating which of the list 90 and the table 120 is to be transmitted may be added and transmitted.

【００３７】ディスクアレイ装置２０内のディスクアレ
イコントローラ２２（に設けられたマイクロプロセッサ
２２１）は、ホスト計算機１０からディスク資源管理情
報リスト９０が送信されると、当該リスト９０を受信す
る（ステップＳ１）。するとディスクアレイコントロー
ラ２２は、ディスク資源管理情報リスト９０に含まれて
いるすべての論理ブロック番号をもとに、ファイルシス
テム１１により使用されていない論理ブロックの論理ブ
ロック番号を判別し、ディスクアレイ装置２０のディス
ク領域内のすべての論理ブロックについて、そのブロッ
クを示す論理ブロック番号の例えば昇順に、そのブロッ
クが使用されているか否かを示すエントリが配置され
た、図３に示すディスク資源管理テーブル１２を復元す
る（ステップＳ２）。When the disk resource management information list 90 is transmitted from the host computer 10, the disk array controller 22 (microprocessor 221 provided therein) in the disk array device 20 receives the list 90 (step S1). . Then, the disk array controller 22 determines the logical block number of the logical block not used by the file system 11 based on all the logical block numbers included in the disk resource management information list 90, and the disk array device 20. For all the logical blocks in the disk area, the disk resource management table 12 shown in FIG. 3 in which entries indicating whether or not the block is used are arranged in ascending order of the logical block numbers indicating the block, for example. It is restored (step S2).

【００３８】次にディスクアレイコントローラ２２は、
ホスト計算機１０により管理される論理ブロック番号と
ディスクアレイ装置２０により管理されるストライプ番
号との対応付けを行う（ステップＳ３）。この対応付け
は次のように行われる。Next, the disk array controller 22
The logical block number managed by the host computer 10 and the stripe number managed by the disk array device 20 are associated (step S3). This association is performed as follows.

【００３９】まずディスクアレイコントローラ２２は、
「ストライプ当たりの論理ブロック数」を、自身が管理
している「ストライプのサイズ」と、ホスト計算機１０
から送信されたディスク資源管理情報リスト９０に含ま
れている「論理ブロックのサイズ」９１とから、「ストライプ当たりの論理ブロック数」＝「ストライプ
のサイズ」／「論理ブロックのサイズ」により算出する。First, the disk array controller 22
The "number of logical blocks per stripe" is managed by the "stripe size" managed by itself and the host computer 10
From the “logical block size” 91 included in the disk resource management information list 90 transmitted from the device, the number of logical blocks per stripe = “stripe size” / “logical block size” is calculated.

【００４０】次にディスクアレイコントローラ２２は、
すべての「論理ブロック番号」について、その「論理ブ
ロック番号」と「ストライプ当たりの論理ブロック数」
とから、「論理ブロック番号」の示す論理ブロック２４
が含まれているストライプ２３を示す「ストライプ番
号」を、「ストライプ番号」＝｛「論理ブロック番号」／「スト
ライプ当たりの論理ブロック数」｝の整数部により算出する。例えば、「ストライプ当たりの論理ブ
ロック数」を「４」とすると、「論理ブロック番号」が
「０」〜「３」の論理ブロックを含むストライプの「ス
トライプ番号」はいずれも「０」である。この結果、ホ
スト計算機１０により管理される論理ブロック番号とデ
ィスクアレイ装置２０により管理されるストライプ番号
との対応付けが行われたことになる。Next, the disk array controller 22
For every "logical block number", its "logical block number" and "number of logical blocks per stripe"
From the logical block 24 indicated by the "logical block number"
The "stripe number" indicating the stripe 23 including "is included" is calculated by the integer part of "stripe number" = {"logical block number" / "number of logical blocks per stripe"}. For example, when the “number of logical blocks per stripe” is “4”, the “stripe number” of the stripes including the logical blocks whose “logical block number” is “0” to “3” is “0”. As a result, the logical block number managed by the host computer 10 and the stripe number managed by the disk array device 20 are associated with each other.

【００４１】ディスクアレイコントローラ２２は、論理
ブロック番号とストライプ番号との対応付けを行うと、
その対応付けの結果と復元されたディスク資源管理テー
ブル１２とから、ディスクアレイ装置２０内のすべての
ストライプについて、そのストライプを示すストライプ
番号の例えば昇順に、そのストライプがファイルシステ
ム１１により使用されているか否かを示すエントリが配
置された、図３に示すディスク資源管理テーブル１２０
を作成する（ステップＳ４）。ここでは、ファイルシス
テム１１によって使用されている論理ブロックを１つで
も含むストライプは、ファイルシステム１１により使用
されていると判定されて、対応するエントリに使用中を
示すフラグが設定される。これに対し、ファイルシステ
ム１１によって使用されている論理ブロックを含まない
ストライプは、ファイルシステム１１によって使用され
ていないと判定されて、対応するエントリに未使用（不
使用）を示すフラグが設定される。When the disk array controller 22 associates the logical block number with the stripe number,
From the association result and the restored disk resource management table 12, for all the stripes in the disk array device 20, is the stripe used by the file system 11 in the ascending order of the stripe numbers indicating the stripes? A disk resource management table 120 shown in FIG. 3 in which an entry indicating whether or not is arranged is arranged.
Is created (step S4). Here, a stripe including at least one logical block used by the file system 11 is determined to be used by the file system 11, and a flag indicating busy is set in the corresponding entry. On the other hand, a stripe that does not include a logical block used by the file system 11 is determined not to be used by the file system 11, and a flag indicating unused (not used) is set in the corresponding entry. .

【００４２】ディスクアレイコントローラ２２はディス
ク資源管理テーブル１２０を作成すると、当該テーブル
１２０をメモリ２２２内のディスク資源管理テーブル領
域２２２ｂに上書きコピーする（ステップＳ５）。After creating the disk resource management table 120, the disk array controller 22 overwrites the table 120 on the disk resource management table area 222b in the memory 222 (step S5).

【００４３】上記のように、論理ブロック番号とストラ
イプ番号との対応付けをディスクアレイ装置２０で行う
場合、ホスト計算機１０ではディスクアレイ装置２０に
固有のストライプのサイズを考慮する必要がない。但
し、ディスクアレイ装置２０では、ホスト計算機１０に
固有の論理ブロックのサイズを考慮する必要がある。As described above, when the logical block number and the stripe number are associated with each other in the disk array device 20, it is not necessary for the host computer 10 to consider the stripe size unique to the disk array device 20. However, in the disk array device 20, it is necessary to consider the size of the logical block unique to the host computer 10.

【００４４】これに対し、図９のディスク資源管理情報
リスト９０に代えて、図１０のデータ構造のディスク資
源管理情報リスト１００を用いるならば、ディスクアレ
イ装置２０では、ホスト計算機１０に固有の論理ブロッ
クのサイズを考慮する必要がない。この図１０の構造の
ディスク資源管理情報リスト１００は、ファイルシステ
ム１１により使用されている論理ブロック２４が含まれ
るストライプを示すストライプ番号１０１，１０１…の
集合から構成される。但し、図１０の構造のディスク資
源管理情報リスト１００をホスト計算機１０で用意する
には、当該ホスト計算機１０がディスクアレイ装置２０
内のディスクアレイコントローラ２２から予めストライ
プのサイズを取得し、論理ブロック番号とストライプ番
号との対応付けを、当該ホスト計算機１０にインストー
ルされている専用ソフトウエア１３に従って実行する必
要がある。On the other hand, if the disk resource management information list 90 having the data structure shown in FIG. 10 is used in place of the disk resource management information list 90 shown in FIG. 9, the disk array device 20 has a logic unique to the host computer 10. There is no need to consider the block size. The disk resource management information list 100 having the structure shown in FIG. 10 is composed of a set of stripe numbers 101, 101 ... Which indicate stripes including the logical block 24 used by the file system 11. However, in order to prepare the disk resource management information list 100 having the structure shown in FIG. 10 in the host computer 10, the host computer 10 needs to prepare the disk array device 20.
It is necessary to acquire the stripe size in advance from the disk array controller 22 in the internal disk array controller 22 and to associate the logical block number with the stripe number in accordance with the dedicated software 13 installed in the host computer 10.

【００４５】（２）ホスト計算機１０からのデータ書き
込み要求発行時の処理次に、メモリ２２２内のディスク資源管理テーブル領域
２２２ｂに、図３に示すディスク資源管理テーブル１２
０が格納されている状態で、ホスト計算機１０からディ
スクアレイ装置２０内のディスクアレイコントローラ２
２に対してデータの書き込み要求が発行された場合の処
理について、図５のフローチャートを参照して説明す
る。(2) Processing when issuing a data write request from the host computer 10 Next, in the disk resource management table area 222b in the memory 222, the disk resource management table 12 shown in FIG.
With 0 stored, the disk array controller 2 in the disk array device 20 from the host computer 10
A process when a data write request is issued to the data 2 will be described with reference to the flowchart of FIG.

【００４６】まずディスクアレイコントローラ２２（内
のマイクロプロセッサ２２１）は、ホスト計算機１０
（内のファイルシステム１１）からディスクアレイ装置
２０に対してデータ書き込み要求が発行されると、当該
要求を受信する（ステップＳ１１）。この要求には、ア
ドレス（開始アドレス）とサイズとが含まれている。First, the disk array controller 22 (the microprocessor 221 therein) is connected to the host computer 10.
When a data write request is issued from (the file system 11 in) to the disk array device 20, the request is received (step S11). This request includes the address (starting address) and the size.

【００４７】次にディスクアレイコントローラ２２は、
受信したデータ書き込み要求に含まれている開始アドレ
スとサイズとから、書き込み対象となるストライプを示
すストライプ番号を算出する（ステップＳ１２）。Next, the disk array controller 22
From the start address and size included in the received data write request, the stripe number indicating the stripe to be written is calculated (step S12).

【００４８】次にディスクアレイコントローラ２２は、
ディスク資源管理テーブル領域２２２ｂに格納されてい
るディスク資源管理テーブル１２０内のエントリのう
ち、ステップＳ１２で算出したストライプ番号で指定さ
れるエントリを参照することにより、当該ストライプ番
号の示す書き込み対象ストライプがファイルシステム１
１にて既に使用されているか否かを判定する（ステップ
Ｓ１３）。もし、書き込み対象ストライプがそれまで使
用されていなかった場合、ディスクアレイコントローラ
２２はステップＳ１３で参照したディスク資源管理テー
ブル１２０内のエントリの内容（フラグの状態）を、未
使用から使用中を示すように更新する（ステップＳ１
４）。このように、ホスト計算機１０からのデータ書き
込み要求で指定されるデータ書き込みにより、当該要求
で指定される論理ブロックを含むストライプの状態が未
使用から使用中に変化すると判定された場合には、当該
ストライプに対応するディスク資源管理テーブル１２０
内のエントリの内容が使用中を示すように更新される。Next, the disk array controller 22
Among the entries in the disk resource management table 120 stored in the disk resource management table area 222b, by referring to the entry specified by the stripe number calculated in step S12, the write target stripe indicated by the stripe number is a file. System 1
It is determined whether or not it is already used in 1 (step S13). If the stripe to be written has not been used until then, the disk array controller 22 indicates the content (flag state) of the entry in the disk resource management table 120 referred to in step S13 from unused to used. To (step S1
4). As described above, when it is determined by the data writing specified by the data writing request from the host computer 10 that the state of the stripe including the logical block specified by the request changes from unused to busy, Disk resource management table 120 corresponding to stripes
The contents of the entry in are updated to indicate busy.

【００４９】ディスクアレイコントローラ２２はステッ
プＳ１４を実行すると、ディスクアレイ２１に対してホ
スト計算機１０からのデータ書き込み要求で指定された
データ書き込みを行う（ステップＳ１５）。After executing step S14, the disk array controller 22 writes the data specified by the data write request from the host computer 10 to the disk array 21 (step S15).

【００５０】またディスクアレイコントローラ２２は、
ステップＳ１２で算出したストライプ番号の示す書き込
み対象ストライプがファイルシステム１１にて既に使用
されている場合には（ステップＳ１３）、ステップＳ１
４をスキップしてステップＳ１５に進み、ホスト計算機
１０からのデータ書き込み要求で指定されたデータ書き
込みを行う。Further, the disk array controller 22 is
When the write target stripe indicated by the stripe number calculated in step S12 is already used in the file system 11 (step S13), step S1
4 is skipped and the process proceeds to step S15 to write the data specified by the data write request from the host computer 10.

【００５１】（３）ディスクアレイ装置２０におけるデ
ータ修復処理次に、ディスクアレイ装置２０におけるデータ修復処理
について、図６のフローチャートを参照して説明する。(3) Data Restoration Process in Disk Array Device 20 Next, the data restoration process in the disk array device 20 will be described with reference to the flowchart of FIG.

【００５２】今、ディスクアレイ２１内のＨＤＤ２１０
-0〜２１０-3のうちＨＤＤ２１０-3が故障したために、
その故障したＨＤＤ（旧ＨＤＤ）２１０-3を新たなＨＤ
Ｄ（新ＨＤＤ）２１０-3に交換して、旧ＨＤＤ２１０-3
内のデータを新ＨＤＤ２１０-3に修復するものとする。
ここでは便宜的に、新ＨＤＤ、つまり修復先となるＨＤ
Ｄにも、故障した旧ＨＤＤ２１０-3と同一符号“２１０
-3”を付してある。なお、修復先となるＨＤＤがディス
クアレイコントローラ２２に予め接続されているスペア
ＨＤＤであっても構わない。Now, the HDD 210 in the disk array 21
-Since the HDD 210-3 out of 0 to 210-3 has failed,
Replace the failed HDD (old HDD) 210-3 with a new HD
Replace the D (new HDD) 210-3 with the old HDD 210-3
It is assumed that the data in the HDD is restored to the new HDD 210-3.
Here, for convenience, the new HDD, that is, the HD that is the restoration destination
D also has the same code "210" as the old HDD 210-3 that failed.
-3 "is added. The HDD that is the restoration destination may be a spare HDD that is connected in advance to the disk array controller 22.

【００５３】ディスクアレイコントローラ２２（内のマ
イクロプロセッサ２２１）はＨＤＤ２１０-3が故障した
場合、その故障したＨＤＤ（旧ＨＤＤ）２１０-3内のデ
ータを、新ＨＤＤ２１０-3に修復する処理を、ストライ
プ番号ｎが０の先頭のストライプから順番に次のように
実行する（ステップＳ２１）。When the HDD 210-3 fails, the disk array controller 22 (inside the microprocessor 221) stripes the processing for restoring the data in the failed HDD (old HDD) 210-3 to the new HDD 210-3. The following processing is executed in order from the first stripe with the number n of 0 (step S21).

【００５４】まずディスクアレイコントローラ２２は、
ストライプ番号ｎ（ｎの初期値は０）のストライプ２３
のデータ修復のために、そのストライプ番号ｎで指定さ
れるディスク資源管理テーブル１２０内のエントリを参
照して、そのストライプ番号ｎの示すストライプ２３
（に含まれている論理ブロックの少なくとも１つ）がフ
ァイルシステム１１により使用されているか否かを判定
する（ステップＳ２２）。First, the disk array controller 22
Stripe 23 with stripe number n (the initial value of n is 0)
To recover the data of the stripe number n, the entry in the disk resource management table 120 designated by the stripe number n is referred to, and the stripe 23 indicated by the stripe number n is referred to.
It is determined whether (at least one of the logical blocks included in) is used by the file system 11 (step S22).

【００５５】もし、ストライプ番号ｎの示すストライプ
２３が使用されているならば、ディスクアレイコントロ
ーラ２２は、従来から知られているＲＡＩＤの機能に従
う通常のデータ修復処理を以下に述べる手順で図１１に
示すように行う。If the stripe 23 indicated by the stripe number n is used, the disk array controller 22 performs the normal data restoration process according to the conventionally known RAID function as shown in FIG. Do as shown.

【００５６】まず、ディスクアレイコントローラ２２
は、修復するストライプ２３について、正常なすべての
ＨＤＤ２１０-0〜２１０-2からのデータ読み出し１１１
を行う（ステップＳ２３）。次にディスクアレイコント
ローラ２２は、読み出したデータを使用してＲＡＩＤの
機能により、つまり排他的論理和演算１１２により、そ
の演算結果として修復されたデータを取得する（ステッ
プＳ２４）。そしてディスクアレイコントローラ２２
は、取得したデータをストライプ２３に含まれる新ＨＤ
Ｄ２１０-3内の領域に書き込む動作１１３を実行する
（ステップＳ２５）。これにより旧ＨＤＤ２１０-3内の
データが新ＨＤＤ２１０-3に修復される。First, the disk array controller 22
Reads data from all normal HDDs 210-0 to 210-2 for the stripe 23 to be repaired 111
Is performed (step S23). Next, the disk array controller 22 uses the read data to acquire the data restored as the operation result by the RAID function, that is, the exclusive OR operation 112 (step S24). And the disk array controller 22
Is the new HD included in the stripe 23
The operation 113 of writing to the area in D210-3 is executed (step S25). As a result, the data in the old HDD 210-3 is restored in the new HDD 210-3.

【００５７】これに対し、ストライプ番号ｎの示すスト
ライプ２３がファイルシステム１１により使用されてい
ないならば、ディスクアレイコントローラ２２は当該ス
トライプ２３内には修復すべき有効なデータが格納され
ていないものと判断する。この場合、ディスクアレイコ
ントローラ２２は図１２に示す動作を行う。即ちディス
クアレイコントローラ２２は、ストライプ２３に含まれ
る正常なすべてのＨＤＤ２１０-0〜２１０-2内の各領域
に対して予め定められた固定データを書き込む動作２１
１を実行すると共に、その固定データの排他的論理和値
をストライプ２３に含まれる新ＨＤＤ２１０-3内の領域
に書き込む動作２１２を実行する（ステップＳ２６）。
ここで、固定データの排他的論理和値は、実際に排他的
論理和演算を行うことにより取得されるものでも、予め
定められた固定値であっても構わない。つまり、ステッ
プＳ２６では、ファイルシステム１１により使用されて
いないストライプ２３への固定データの書き込みが行わ
れる。このステップＳ２６の動作は、正常なＨＤＤ２１
０-0〜２１０-2からの読み出しを必要としないため、フ
ァイルシステム１１により使用されているストライプ２
３のデータを修復する場合（ステップＳ２３〜Ｓ２５）
に比べて、短時間で実行できる。また、ＨＤＤ２１０-0
〜２１０-2からの読み出しが行われないことにより、Ｈ
ＤＤの多重障害となる危険性が大幅に低下する。On the other hand, if the stripe 23 indicated by the stripe number n is not used by the file system 11, the disk array controller 22 determines that valid data to be restored is not stored in the stripe 23. to decide. In this case, the disk array controller 22 performs the operation shown in FIG. That is, the disk array controller 22 writes the predetermined fixed data into each area in all the normal HDDs 210-0 to 210-2 included in the stripe 23 (operation 21)
1 is executed, and the operation 212 of writing the exclusive OR value of the fixed data to the area in the new HDD 210-3 included in the stripe 23 is executed (step S26).
Here, the exclusive OR value of the fixed data may be obtained by actually performing an exclusive OR operation or may be a predetermined fixed value. That is, in step S26, fixed data is written to the stripe 23 that is not used by the file system 11. The operation of this step S26 is performed by the normal HDD 21.
Stripe 2 used by file system 11 because it does not need to read from 0-0 to 210-2
When recovering data of No. 3 (steps S23 to S25)
It can be executed in a shorter time than In addition, HDD210-0
Since the reading from ~ 210-2 is not performed, H
The risk of multiple failures of DD is greatly reduced.

【００５８】ディスクアレイコントローラ２２はステッ
プＳ２５またはＳ２６を終了すると、ストライプ番号ｎ
を１だけインクリメントし（ステップＳ２７）、そのイ
ンクリメント後のストライプ番号ｎが最終ストライプ番
号を越えたか否かにより、最終ストライプまで修復処理
を終了したか否かを判定する（ステップＳ２８）。ディ
スクアレイコントローラ２２は、ステップＳ２８で未終
了を判定したならば、ステップＳ２１以降の動作を再度
実行し、終了を判定したならば一連のデータ修復処理を
終了する。When the disk array controller 22 finishes step S25 or S26, the stripe number n
Is incremented by 1 (step S27), and it is determined whether or not the restoration processing has been completed up to the final stripe depending on whether or not the stripe number n after the increment exceeds the final stripe number (step S28). If it is determined in step S28 that the disk array controller 22 has not ended, the disk array controller 22 re-executes the operations in step S21 and subsequent steps.

【００５９】なお、以上に述べた故障したＨＤＤ内のデ
ータを修復する処理では、ファイルシステム１１により
使用されていないストライプについては、当該ストライ
プへの固定データの書き込み（ステップＳ２６）が行わ
れるものとして説明した。しかし、ファイルシステム１
１により使用されていないストライプ内には修復すべき
有効なデータが格納されていないことから、図６のフロ
ーチャートにおいて破線６０で示すように、このステッ
プＳ２６の動作（固定データによるストライプの修復動
作）をスキップするようにしても構わない。In the above-described processing for recovering data in the failed HDD, it is assumed that fixed data is written to the stripe (step S26) that is not used by the file system 11. explained. However, file system 1
Since valid data to be restored is not stored in the stripe which is not used by No. 1, the operation of this step S26 (stripe restoration operation with fixed data) as indicated by the broken line 60 in the flowchart of FIG. You may skip.

【００６０】しかし、ステップＳ２６をスキップする
と、ファイルシステム１１により使用されていないスト
ライプについては、ＲＡＩＤレベル５におけるデータと
冗長データ（パリティデータ）との整合性が得られなく
なる。つまり、データに対して正しいパリティデータが
生成されていない状態となる。したがって、ステップＳ
２６の動作をスキップする方法を適用する場合、ホスト
計算機１０からのデータ書き込み要求を、図７のフロー
チャートに示す手順に従って図１３に示すように処理す
る必要がある。However, if step S26 is skipped, the consistency between the data in RAID level 5 and the redundant data (parity data) cannot be obtained for the stripes not used by the file system 11. That is, the correct parity data is not generated for the data. Therefore, step S
When applying the method of skipping the operation of No. 26, it is necessary to process the data write request from the host computer 10 as shown in FIG. 13 according to the procedure shown in the flowchart of FIG.

【００６１】まずディスクアレイコントローラ２２は、
ホスト計算機１０からディスクアレイ装置２０に対して
データ書き込み要求が発行された場合、当該要求で指定
されたデータの書き込み対象となるストライプ２３のス
トライプ番号を算出し、そのストライプ２３がファイル
システム１１により使用されているか否かをディスク資
源管理テーブル１２０に基づいて判定する（ステップＳ
３１〜Ｓ３３）。ここまでは図５のフローチャートのス
テップＳ１１〜Ｓ１３と同様である。First, the disk array controller 22
When the host computer 10 issues a data write request to the disk array device 20, the stripe number of the stripe 23 to which the data specified by the request is written is calculated, and the stripe 23 is used by the file system 11. It is determined based on the disk resource management table 120 (step S
31-S33). The process up to this point is the same as steps S11 to S13 in the flowchart of FIG.

【００６２】もし、データの書き込み対象となるストラ
イプ２３がファイルシステム１１により使用されていな
い場合、ディスクアレイコントローラ２２は当該ストラ
イプ２３はＲＡＩＤレベル５によるパリティデータの整
合性が得られていないものと判断する。この場合、まず
ディスクアレイコントローラ２２は、ホスト計算機１０
からのデータ書き込み要求で指定された書き込みデータ
（新規書き込みデータ）１３１と予め定められた、“デ
ィスクアレイ２１を構成するＨＤＤの数−２”個のＨＤ
Ｄ用の固定データ１３３との排他的論理和値１３５を正
しいパリティデータ（冗長データ）として取得する（ス
テップＳ３４）。ここで、固定データ１３３を全ビット
が“０”のデータとするならば、排他的論理和値１３５
は書き込みデータ１３１に一致する。この場合、書き込
みデータ１３１を排他的論理和値（冗長データ）１３５
とすることができるため、排他的論理和演算を必要とし
ない。If the stripe 23 to which data is to be written is not used by the file system 11, the disk array controller 22 determines that the stripe 23 does not have parity data consistency according to RAID level 5. To do. In this case, first, the disk array controller 22 sets the host computer 10
From the write data (new write data) 131 specified by the data write request from the HDD, the number of the HDDs constituting the disk array 21-2 HDs.
The exclusive OR value 135 with the fixed data 133 for D is acquired as correct parity data (redundant data) (step S34). Here, if the fixed data 133 is data in which all bits are “0”, the exclusive OR value 135
Matches the write data 131. In this case, the write data 131 is converted into the exclusive OR value (redundant data) 135.
Therefore, the exclusive OR operation is not required.

【００６３】次にディスクアレイコントローラ２２は、
データの書き込み対象となるストライプ２３に含まれる
すべてのＨＤＤ２１０-0〜２１０-3内の領域に、各ＨＤ
Ｄ毎に、書き込みデータ１３１、固定データ１３３、ま
たは排他的論理和値（冗長データ）１３５を書き込む
（ステップＳ３５）。ここでは、ホスト計算機１０から
のデータ書き込み要求で指定されたデータの書き込み先
ＨＤＤがＨＤＤ２１０-0であり、ストライプ２３におい
て冗長データが格納されているＨＤＤがＨＤＤ２１０-3
であるものとすると、ＨＤＤ２１０-0に対する書き込み
データ１３１の書き込み１３２と、ＨＤＤ２１０-1及び
２１０-2に対する固定データ１３３の書き込み１３４
と、ＨＤＤ２１０-3に対する排他的論理和値（冗長デー
タ）１３５の書き込み１３６とが、それぞれ行われる。
これにより、故障したＨＤＤのデータ修復の際に修復を
スキップしたストライプについてもデータの冗長性を保
証できる。Next, the disk array controller 22
In each HDD 210-0 to 210-3 area included in the stripe 23 to which data is to be written, each HD
The write data 131, the fixed data 133, or the exclusive OR value (redundant data) 135 is written for each D (step S35). Here, the data write destination HDD designated by the data write request from the host computer 10 is the HDD 210-0, and the HDD storing redundant data in the stripe 23 is the HDD 210-3.
, The write data 131 is written 132 to the HDD 210-0, and the fixed data 133 is written 134 to the HDD 210-1 and 210-2.
And writing 136 of the exclusive OR value (redundant data) 135 to the HDD 210-3 are performed.
As a result, data redundancy can be assured even for stripes for which repair has been skipped when repairing data on a failed HDD.

【００６４】ディスクアレイコントローラ２２はステッ
プＳ３５を終了すると、ステップＳ３３で参照したディ
スク資源管理テーブル１２０内のエントリの内容、つま
りデータの書き込み対象となったストライプ２３の使用
の有無を示すディスク資源管理テーブル１２０内のエン
トリの内容を、未使用から使用中を示すように更新する
（ステップＳ３６）。When the disk array controller 22 finishes the step S35, the contents of the entry in the disk resource management table 120 referred to in the step S33, that is, the disk resource management table indicating whether or not the stripe 23 to which the data is written is used. The contents of the entry in 120 are updated to indicate unused to in use (step S36).

【００６５】なお、データの書き込み対象となるストラ
イプ２３がファイルシステム１１により使用されている
場合には、ディスクアレイコントローラ２２は通常のＲ
ＡＩＤ手法によるデータ書き込みを行う（ステップＳ３
７）。ここでは、データ書き込み要求で指定されたデー
タ（新データ）と、当該新データの格納先となるＨＤＤ
内領域に格納されているデータ（旧データ）と、同じス
トライプ２３に含まれている別のＨＤＤ内領域に格納さ
れているパリティデータ（旧パリティデータ）との間の
排他的論理和演算を行うことで、新パリティデータ（新
冗長データ）を生成し、当該新パリティデータで旧パリ
ティデータを更新する。When the stripe 23 to which data is to be written is used by the file system 11, the disk array controller 22 operates as a normal R disk.
Data writing is performed by the AID method (step S3)
7). Here, the data (new data) specified by the data write request and the HDD that is the storage destination of the new data
An exclusive OR operation is performed between the data (old data) stored in the inner area and the parity data (old parity data) stored in another HDD inner area included in the same stripe 23. Thus, new parity data (new redundant data) is generated, and the old parity data is updated with the new parity data.

【００６６】（４）ディスクアレイ装置２０におけるＨ
ＤＤメディア障害検査処理次に、ディスクアレイ装置２０におけるＨＤＤメディア
障害検査処理について、図８のフローチャートを参照し
て説明する。(4) H in the disk array device 20
DD Media Failure Inspection Processing Next, HDD media failure inspection processing in the disk array device 20 will be described with reference to the flowchart of FIG.

【００６７】ディスクアレイコントローラ２２は、ＨＤ
Ｄメディア検査を例えば当該コントローラ２２の有する
パトロール機能により周期的に実行する。ここではディ
スクアレイコントローラ２２は、ＨＤＤメディア検査
を、ストライプ番号ｎが０の先頭のストライプから順番
に次のように実行する（ステップＳ４１）。The disk array controller 22 is an HD
The D media inspection is periodically executed by the patrol function of the controller 22, for example. Here, the disk array controller 22 executes the HDD media inspection in the following order from the first stripe with the stripe number n of 0 (step S41).

【００６８】まずディスクアレイコントローラ２２は、
ストライプ番号ｎ（ｎの初期値は０）のストライプ２３
のＨＤＤメディア検査のために、そのストライプ番号ｎ
で指定されるディスク資源管理テーブル１２０内のエン
トリを参照して、そのストライプ番号ｎの示すストライ
プ２３がファイルシステム１１により使用されているか
否かを判定する（ステップＳ４２）。First, the disk array controller 22
Stripe 23 with stripe number n (the initial value of n is 0)
Stripe number n for HDD media inspection of
By referring to the entry in the disk resource management table 120 designated by, it is determined whether or not the stripe 23 indicated by the stripe number n is used by the file system 11 (step S42).

【００６９】もし、ストライプ番号ｎの示すストライプ
２３が使用されているならば、ディスクアレイコントロ
ーラ２２は、当該ストライプについて、すべてのＨＤＤ
２１０-0〜２１０-3からのデータ読み出しを行う（ステ
ップＳ４３）。If the stripe 23 indicated by the stripe number n is used, the disk array controller 22 makes all HDD
Data is read from 210-0 to 210-3 (step S43).

【００７０】次にディスクアレイコントローラ２２は、
ＨＤＤ２１０-0〜２１０-3からのデータ読み出し結果を
チェックして、読み出しに成功したか否かを判定する
（ステップＳ４４）。Next, the disk array controller 22
The result of reading the data from the HDDs 210-0 to 210-3 is checked to determine whether the reading has been successful (step S44).

【００７１】もし、ＨＤＤ２１０-0〜２１０-3のいずれ
かからのデータ読み出しに失敗したならば、ディスクア
レイコントローラ２２は、その失敗したセクタブロッ
ク、つまりメディア障害が検出された不良セクタブロッ
クを、同じＨＤＤ内の別のセクタ（交代セクタ）に代替
する代替処理を行う（ステップＳ４５）。例えば、図１
４に示すように、ＨＤＤ２１０-1内のセクタブロック１
４１が不良セクタブロックとして検出された場合であれ
ば、当該セクタブロック１４１を同じＨＤＤ２１０-1内
の任意の交代セクタ１４２に代替する代替処理１４３が
行われる。If the data read from any of the HDDs 210-0 to 210-3 fails, the disk array controller 22 sets the same sector block as the failed sector block, that is, the defective sector block in which the media failure is detected. Substitution processing is performed to substitute another sector (replacement sector) in the HDD (step S45). For example, in FIG.
As shown in FIG. 4, sector block 1 in HDD 210-1
If 41 is detected as a bad sector block, a substitution process 143 is performed for substituting the sector block 141 with an arbitrary replacement sector 142 in the same HDD 210-1.

【００７２】次にディスクアレイコントローラ２２は、
ＲＡＩＤの機能を使用して、不良セクタブロック１４１
の修復されたデータを算出し、そのデータを交代セクタ
１４２に書き込む動作１４４を行う（ステップＳ４
６）。そしてディスクアレイコントローラ２２はステッ
プＳ４７に進む。Next, the disk array controller 22
Using the RAID function, the bad sector block 141
Of the restored data in step S4 and write the data in the alternate sector 142 (step S4).
6). Then, the disk array controller 22 proceeds to step S47.

【００７３】これに対し、ストライプ番号ｎの示すスト
ライプ２３についてＨＤＤ２１０-0〜２１０-3からのデ
ータ読み出しに成功したならば、ディスクアレイコント
ローラ２２はステップＳ４５，Ｓ４６ををスキップして
ステップＳ４７に進む。On the other hand, if the data read from the HDDs 210-0 to 210-3 for the stripe 23 indicated by the stripe number n is successful, the disk array controller 22 skips steps S45 and S46 and proceeds to step S47. .

【００７４】また、ストライプ番号ｎの示すストライプ
２３がファイルシステム１１によって使用されていない
ならば、ディスクアレイコントローラ２２は当該ストラ
イプ２３の検査をせずに、ステップＳ４３〜Ｓ４６をス
キップしてステップＳ４７に進む。If the stripe 23 indicated by the stripe number n is not used by the file system 11, the disk array controller 22 does not inspect the stripe 23 and skips steps S43 to S46 and goes to step S47. move on.

【００７５】ディスクアレイコントローラ２２は、ステ
ップＳ４７においてストライプ番号ｎを１だけインクリ
メントし、そのインクリメント後のストライプ番号ｎが
最終ストライプ番号を越えるまで（ステップＳ４８）、
ステップＳ４１以降の動作を繰り返す。The disk array controller 22 increments the stripe number n by 1 in step S47, and until the incremented stripe number n exceeds the final stripe number (step S48).
The operation after step S41 is repeated.

【００７６】以上に述べたＨＤＤメディア障害検査処理
では、ファイルシステム１１により使用されているスト
ライプのみ、ＨＤＤからのデータ読み出しによる検査
（メディア検査）を実行する場合について説明したが、
これに限るものではない。例えば図１５のフローチャー
トに示すように、ディスクアレイ装置２０のディスク領
域のすべてのストライプについて、その使用の有無に無
関係にＨＤＤからのデータ読み出しによる検査を実行す
るようにしてもよい（ステップＳ５２，Ｓ５３）。ここ
では、データ読み出しによる検査で、メディア障害とな
ったセクタブロック（不良セクタブロック）が検出され
た場合に、ディスク資源管理テーブル１２０を参照し
て、そのセクタブロックを含むストライプがファイルシ
ステム１１により使用されているか否かを判定する（ス
テップＳ５４）。In the HDD media failure inspection process described above, the case where the inspection (media inspection) by reading the data from the HDD is executed only for the stripes used by the file system 11 has been described.
It is not limited to this. For example, as shown in the flowchart of FIG. 15, all stripes in the disk area of the disk array device 20 may be inspected by reading data from the HDD regardless of whether they are used (steps S52 and S53). ). Here, when the sector block (defective sector block) in which the media failure has occurred is detected by the inspection by reading the data, the disk resource management table 120 is referred to and the stripe including the sector block is used by the file system 11. It is determined whether it has been done (step S54).

【００７７】もし、上記ストライプがファイルシステム
１１により使用されていない場合、そのストライプ内の
データを保持する必要性はない。そこで、この場合に
は、ＲＡＩＤの冗長性の有無に拘わらずに、図１２に示
すように不良セクタブロック（１４１）の代替処理（１
４３）を行った後に（ステップＳ５５）、交代セクタ
（１４２）に（修復データではなくて）予め定められた
固定データの書き込み（１４４）を行って修復する（ス
テップＳ５６）。このステップＳ５６では、上記ステッ
プＳ２６におけるストライプに対するのと同様に、交代
セクタに対応する他の正常なＨＤＤ内のセクタに固定デ
ータが書き込まれる。If the stripe is not used by the file system 11, it is not necessary to hold the data in the stripe. Therefore, in this case, the replacement process (1) of the defective sector block (141) as shown in FIG. 12 is performed regardless of the presence or absence of RAID redundancy.
After performing step 43) (step S55), a predetermined fixed data (not restoration data) is written (144) to the alternate sector (142) to restore (step S56). In this step S56, fixed data is written in the sector in another normal HDD corresponding to the alternate sector, as in the case of the stripe in step S26.

【００７８】一方、上記ストライプがファイルシステム
１１により使用されている場合、他のＨＤＤにて故障な
どが発生しておらず（ステップＳ５７）、したがってデ
ータの冗長性が確保されている場合は、不良セクタブロ
ック（１４１）を交代セクタ（１４２）に代替する処理
（１４３）を行った後（ステップＳ５８）、ＲＡＩＤの
機能を使用して、不良セクタブロック（１４１）のデー
タを修復し、その修復データの交代セクタ（１４２）へ
の書き込み（１４４）を行う（ステップＳ５９）。On the other hand, if the stripe is used by the file system 11, no failure occurs in another HDD (step S57), and therefore, if data redundancy is ensured, a defect occurs. After performing the process (143) of substituting the sector block (141) with the alternate sector (142) (step S58), the data of the defective sector block (141) is restored using the RAID function, and the restored data is restored. Writing (144) to the alternate sector (142) is performed (step S59).

【００７９】これに対し、他のＨＤＤが故障しているた
めにデータの冗長性がなくなっている場合は（ステップ
Ｓ５７）、不良セクタブロック（１４１）のデータをＲ
ＡＩＤの機能により修復することができない。この場
合、上記ステップＳ５８，Ｓ５９をスキップし、不良セ
クタブロック（１４１）をそのまま放置する。On the other hand, when the data redundancy is lost because another HDD has failed (step S57), the data of the defective sector block (141) is set to R.
It cannot be repaired due to the function of AID. In this case, steps S58 and S59 are skipped and the defective sector block (141) is left as it is.

【００８０】なお、以上の実施形態では、ディスクアレ
イ装置２０がＲＡＩＤ５レベルで用いられるものとして
説明したが、本発明は、ＲＡＩＤ３レベルなど、他のＲ
ＡＩＤレベルで用いられるディスクアレイ装置にも、デ
ータ修復の方法は異なるものの、ＲＡＩＤ５レベルの場
合と同様に適用できる。In the above embodiments, the disk array device 20 is described as being used at the RAID 5 level, but the present invention is not limited to RAID 3 level and other R disks.
Although the data recovery method is different, it can be applied to the disk array device used at the AID level as in the case of the RAID5 level.

【００８１】なお、本発明は、上記実施形態に限定され
るものではなく、実施段階ではその要旨を逸脱しない範
囲で種々に変形することが可能である。更に、上記実施
形態には種々の段階の発明が含まれており、開示される
複数の構成要件における適宜な組み合わせにより種々の
発明が抽出され得る。例えば、実施形態に示される全構
成要件から幾つかの構成要件が削除されても、発明が解
決しようとする課題の欄で述べた課題が解決でき、発明
の効果の欄で述べられている効果の少なくとも１つが得
られる場合には、この構成要件が削除された構成が発明
として抽出され得る。The present invention is not limited to the above-described embodiment, but can be variously modified in an implementation stage without departing from the scope of the invention. Furthermore, the embodiments include inventions at various stages, and various inventions can be extracted by appropriately combining a plurality of disclosed constituent elements. For example, even if some constituent elements are deleted from all the constituent elements shown in the embodiment, the problem described in the section of the problem to be solved by the invention can be solved, and the effect described in the section of the effect of the invention can be solved. When at least one of the above is obtained, the configuration in which this constituent element is deleted can be extracted as the invention.

【００８２】[0082]

【発明の効果】以上詳述したように本発明によれば、デ
ィスクアレイのディスク領域のうちファイルシステムに
て実際に使用されている領域を判定し、このファイルシ
ステムにて実際に使用されている領域に絞ってディスク
ドライブのメディア障害検査を行うようにしたので、処
理時間を短縮することができ、その結果、メディア障害
の早期検出が可能となり、ディスクアレイ装置の信頼性
を向上することができる。As described above in detail, according to the present invention, of the disk areas of the disk array, the area actually used by the file system is determined and is actually used by this file system. Since the disk drive media failure inspection is performed focusing on the area, the processing time can be shortened, and as a result, the media failure can be detected early and the reliability of the disk array device can be improved. .

【００８３】また本発明によれば、ホスト計算機からデ
ータ書き込み要求を受け取った場合、当該要求で指定さ
れたデータ書き込み先が属するストライプが、ファイル
システムにより使用されている論理ブロックを含む第１
のストライプであるか或いはファイルシステムにより使
用されている論理ブロックを含まない第２のストライプ
であるかを、ディスクアレイ装置内に保持されているデ
ィスク資源管理情報（第２のディスク資源管理情報）に
基づいて判定し、第２のストライプであるならば、当該
データ書き込み先が属するストライプが第１のストライ
プであることを示すように第２のディスク資源管理情報
を更新するようにしたので、当該第２のディスク資源管
理情報によりストライプの最新の使用状況を表すことが
でき、ファイルの更新の度に最新の第１のディスク資源
管理情報をホスト計算機から取得しなくても済む。Further, according to the present invention, when a data write request is received from the host computer, the stripe to which the data write destination specified by the request belongs includes the first block including the logical block used by the file system.
Disk stripe management information (second disk resource management information) held in the disk array device, indicating whether the stripe is the second stripe or the second stripe that does not include the logical block used by the file system. If it is the second stripe, the second disk resource management information is updated so as to indicate that the stripe to which the data write destination belongs is the first stripe. The second disk resource management information can represent the latest usage status of the stripe, and it is not necessary to obtain the latest first disk resource management information from the host computer every time the file is updated.

[Brief description of drawings]

【図１】本発明の一実施形態に係るディスクアレイ装置
２０を備えた計算機システムの構成を示すブロック図。FIG. 1 is a block diagram showing a configuration of a computer system including a disk array device 20 according to an embodiment of the present invention.

【図２】ディスクアレイ２１のディスク領域を管理する
のに用いられるストライプ、論理ブロック及びセクタブ
ロックの関係を説明するための図。FIG. 2 is a diagram for explaining the relationship between stripes, logical blocks, and sector blocks used to manage the disk area of the disk array 21.

【図３】ホスト計算機１０内に保持されるディスク資源
管理テーブル１２とディスクアレイコントローラ２２の
メモリ２２２内のディスク資源管理情報領域２２２ｂに
格納されるディスク資源管理テーブル１２０のデータ構
造例を示す図。FIG. 3 is a diagram showing an example of a data structure of a disk resource management table 12 stored in a host computer 10 and a disk resource management table 120 stored in a disk resource management information area 222b in a memory 222 of a disk array controller 22.

【図４】ディスク資源管理情報リスト送信時の処理手順
を示すフローチャート。FIG. 4 is a flowchart showing a processing procedure when transmitting a disk resource management information list.

【図５】ホスト計算機１０からのデータ書き込み要求発
行時の処理手順を示すフローチャート。FIG. 5 is a flowchart showing a processing procedure when a data write request is issued from the host computer 10.

【図６】ディスクアレイ装置２０におけるデータ修復処
理手順を示すフローチャート。FIG. 6 is a flowchart showing a data recovery processing procedure in the disk array device 20.

【図７】ホスト計算機１０からのデータ書き込み要求発
行時の処理手順の変形例を示すフローチャート。FIG. 7 is a flowchart showing a modification of the processing procedure when a data write request is issued from the host computer 10.

【図８】ディスクアレイ装置２０におけるＨＤＤメディ
ア障害検査の処理手順を示すフローチャート。FIG. 8 is a flowchart showing a processing procedure of an HDD media failure inspection in the disk array device 20.

【図９】ホスト計算機１０からディスクアレイ装置２０
に送信されるディスク資源管理情報リストのデータ構造
例を示す図。FIG. 9 shows a host computer 10 to a disk array device 20.
The figure which shows the example of a data structure of the disk resource management information list transmitted to FIG.

【図１０】上記ディスク資源管理情報リストの変形例を
示す図。FIG. 10 is a diagram showing a modified example of the disk resource management information list.

【図１１】ＲＡＩＤレベル５における通常のデータ修復
処理を説明するための図。FIG. 11 is a diagram for explaining a normal data restoration process in RAID level 5.

【図１２】使用されていないストライプへの固定データ
書き込み時の動作を説明するための図。FIG. 12 is a diagram for explaining an operation at the time of writing fixed data to an unused stripe.

【図１３】使用されていないストライプへの新規データ
書き込み時の動作を説明するための図。FIG. 13 is a diagram for explaining an operation at the time of writing new data to an unused stripe.

【図１４】ＨＤＤメディア障害検査で不良セクタブロッ
クが検出された場合のデータ修復動作を説明するための
図。FIG. 14 is a diagram for explaining a data recovery operation when a defective sector block is detected in the HDD media failure inspection.

【図１５】図８に示したＨＤＤメディア障害検査の処理
手順の変形例を示すフローチャートFIG. 15 is a flowchart showing a modification of the processing procedure of the HDD media failure inspection shown in FIG.

[Explanation of symbols]

１０…ホスト計算機１１…ファイルシステム１２…ディスク資源管理テーブル１３…専用ソフトウエア２０…ディスクアレイ装置２１…ディスクアレイ２２…ディスクアレイコントローラ２３…ストライプ２４…論理ブロック２５…セクタブロック９０，１００…ディスク資源管理情報リスト（第１のデ
ィスク資源管理情報）１２０…ディスク資源管理テーブル（第２のディスク資
源管理情報）２１０-0〜２１０-3…ＨＤＤ（ディスクドライブ）２２２ｂ…ディスク資源管理テーブル領域10 ... Host computer 11 ... File system 12 ... Disk resource management table 13 ... Dedicated software 20 ... Disk array device 21 ... Disk array 22 ... Disk array controller 23 ... Stripe 24 ... Logical block 25 ... Sector block 90, 100 ... Disk resource Management information list (first disk resource management information) 120 ... Disk resource management table (second disk resource management information) 210-0 to 210-3 ... HDD (disk drive) 222b ... Disk resource management table area

Claims

[Claims]

1. A RAID (Redundant Arrays of Inexpensive Disks) composed of a plurality of disk drives.
A method for recovering data in a disk array device having a disk array having a configuration, wherein a logical block used by the file system from a host computer having a file system for managing the disk area of the disk array in logical block units Or obtaining a first disk resource management information indicating a stripe including the logical block, and using the file system for each stripe in the disk area of the disk array from the first disk resource management information. Generating a second disk resource management information indicating whether or not it contains a logical block and holding it in the disk array device; and when a data write request is received from the host computer, it is specified by the request. The data write destination belongs to Specifying a slice, and the specified stripe is a first stripe including a logical block used by the file system or a second stripe not including a logical block used by the file system Determining based on the second disk resource management information held in the disk array device, and if the identified stripe is the second stripe, Updating the second disk resource management information held in the disk array device to indicate that the specified stripe is the first stripe; and each disk in the disk array. By reading the contents stored in the drive, the part of the disk drive A case to perform media inspection processing of detecting a failure, for all the stripes of the disk areas of the disk array, or a first stripe comprising a logic block used by the file system,
Alternatively, a step of determining whether the second stripe does not include a logical block used by the file system based on the second disk resource management information, and based on the determination result of the determination step, , Performing a media inspection including data reading from the disk drive only for the first stripe, and R for data of a location where a failure is detected in the media inspection.
And a step of recovering by the AID function.

2. A RAID (Redundant Arrays of Inexpensive Disks) composed of a plurality of disk drives.
A method for recovering data in a disk array device having a disk array having a configuration, wherein a logical block used by the file system from a host computer having a file system for managing the disk area of the disk array in logical block units Or obtaining a first disk resource management information indicating a stripe including the logical block, and using the file system for each stripe in the disk area of the disk array from the first disk resource management information. Generating a second disk resource management information indicating whether or not it contains a logical block and holding it in the disk array device; and when a data write request is received from the host computer, it is specified by the request. The data write destination belongs to Specifying a slice, and the specified stripe is a first stripe including a logical block used by the file system or a second stripe not including a logical block used by the file system Determining based on the second disk resource management information held in the disk array device, and if the identified stripe is the second stripe, Updating the second disk resource management information held in the disk array device to indicate that the specified stripe is the first stripe; and each disk in the disk array. By reading the contents stored in the drive, the part of the disk drive A step of sequentially performing a media inspection process for detecting various faults for all stripes in the disk area of the disk array, and when a partial fault of the disk drive is detected in the media inspection process, The stripe containing the location where the failure was detected is the first stripe containing the logical block used by the file system,
Alternatively, a step of determining whether the second stripe does not include a logical block used by the file system, based on the second disk resource management information held in the disk array device, If the disk drive other than the disk drive in which the failure is detected is normal among the plurality of disk drives in the disk array determined to be the first stripe in the determination step, the failure is detected. And a step of recovering the data of the location where the failure is detected by the fixed data when the determination step determines that the stripe is the second stripe. A method for recovering data in a disk array device, comprising:

3. A RAID (Redundant Arrays of Inexpensive Disks) composed of a plurality of disk drives.
In a disk array controller for controlling a disk array having a configuration, a logical block used by the file system or the logical block transmitted from a host computer having a file system that manages the disk area of the disk array in logical block units. From the first disk resource management information indicating the stripe including the block, the second disk resource management indicating whether or not each stripe in the disk area of the disk array includes the logical block used by the file system. A means for generating information, a memory for storing the second disk resource management information, and a means for specifying a stripe to which a data write destination designated by the request belongs when a data write request is received from the host computer And the identifying means When the stripe to which the data write destination belongs is specified by, the stripe is the first stripe including the logical block used by the file system, or the logical block used by the file system. Second not including
If the first determination means determines that the stripe identified by the identification means is the second stripe, the stripe is the first stripe. The second stored in the memory to indicate that it is a stripe of
And a first stripe including a logical block used by the file system for all stripes of the disk area of the disk array, or used by the file system. The second stripe that does not include the logical block
Second judging means for judging based on the disk resource management information, and media checking means for detecting a partial failure of the disk drive by reading the storage contents of each of the disk drives in the disk array. A media inspection unit that executes a media inspection including data reading from the disk drive only for the stripe determined to be the first stripe by the second determination unit; and a failure by the media inspection unit. A disk array controller, comprising: a data recovery unit that recovers data at a location where is detected by a RAID function.

4. A RAID (Redundant Arrays of Inexpensive Disks) composed of a plurality of disk drives.
In a disk array controller for controlling a disk array having a configuration, a logical block used by the file system or the logical block transmitted from a host computer having a file system that manages the disk area of the disk array in logical block units. From the first disk resource management information indicating the stripe including the block, the second disk resource management indicating whether or not each stripe in the disk area of the disk array includes the logical block used by the file system. A means for generating information, a memory for storing the second disk resource management information, and a means for specifying a stripe to which a data write destination designated by the request belongs when a data write request is received from the host computer And the identifying means When the stripe to which the data write destination belongs is specified by, the stripe is the first stripe including the logical block used by the file system, or the logical block used by the file system. Second not including
If the first determination means determines that the stripe identified by the identification means is the second stripe, the stripe is the first stripe. The second stored in the memory to indicate that it is a stripe of
Means for updating the disk resource management information, and a media inspection for executing, for each stripe, a media inspection process for detecting a partial failure of the disk drive by reading the storage contents of each disk drive in the disk array. Means, and when the media inspection means detects a partial failure of the disk drive, the stripe including the location where the failure is detected is the first stripe including the logical block used by the file system. Yes, or a second stripe that does not contain a logical block used by the file system,
A second determining unit that determines based on the second disk resource management information; and a plurality of disk drives that are determined to be the first stripe by the second determining unit and that are in the disk array. When a disk drive other than the disk drive in which the failure is detected is normal, first data recovery means for recovering data at the location where the failure is detected by a RAID function, and the second determination means A disk array controller comprising: a second data repairing unit that repairs the data at the location where the failure is detected with fixed data when it is determined to be the second stripe.