JP2003036146A

JP2003036146A - Disk array control system

Info

Publication number: JP2003036146A
Application number: JP2001221566A
Authority: JP
Inventors: Seiji Kaneko; 誠司金子
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2001-07-23
Filing date: 2001-07-23
Publication date: 2003-02-07

Abstract

PROBLEM TO BE SOLVED: To enables a disk control part with low error detecting capability to detect such trouble that old data are read out by mistake and also detect even an error of data by using an added check code. SOLUTION: A disk array control system is characterized by that a disk array device constituted by using a plurality of disk units writes added check codes 220, 221, and 222 generated according to data written to data disks constituting a parity group to the data disks, writes all the added check codes 220, 221, and 222 to parity disks constituting the parity group, and reads added check codes of one of the plurality of data disks and of parity disks to check an error.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、ディスクアレイ装
置の障害検出方式に関し、特に、低価格で障害検出能力
の低いディスク装置を組み合わせて、高信頼なディスク
アレイ装置を構成する技術に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a failure detecting method for a disk array device, and more particularly to a technique for constructing a highly reliable disk array device by combining disk devices which are inexpensive and have a low failure detecting ability.

【０００２】[0002]

【従来の技術】一般的に、ディスクは磁気記録媒体や光
記録媒体を使用しているために、円板上の一部分が読み
書きできなくなる障害が発生する。これをセクタ障害と
いう。セクタ障害の原因としては、円板上の傷や磁性体
の劣化等が考えられる。ディスク装置では、ＥＣＣ（Ｅ
ｒｒｏｒＣｏｒｒｅｃｔｉｎｇＣｏｄｅ）を付加す
ることで、ある程度のセクタ障害に対しては、復旧する
ことが可能である。また、付加するＥＣＣ符号の種類に
より、セクタ中の何ビットまでのエラーを訂正すること
が可能であるかが変化する。2. Description of the Related Art Generally, a disk uses a magnetic recording medium or an optical recording medium, so that a part of the disk cannot be read or written. This is called a sector failure. The cause of the sector failure may be scratches on the disk or deterioration of the magnetic material. In the disk device, ECC (E
By adding a error correcting code, it is possible to recover from a certain degree of sector failure. Also, depending on the type of ECC code to be added, the number of bits in a sector in which an error can be corrected changes.

【０００３】低価格のドライブでは、上記のセクタ障害
が起きる確率が比較的高いため、複数のディスクを並列
に動作させることで高速制御を実現し、また、パリティ
と呼ぶ冗長データをパリティディスクと呼ぶ特定のディ
スクに格納することにより、万一、データを格納する１
台のディスクが故障しても、他のディスクとパリティデ
ィスクのパリティとから故障したディスクのデータを再
現することができ、耐ディスク障害信頼性を高めること
ができるＲＡＩＤと呼ばれるディスク制御の方法が提案
された。In a low-priced drive, since the above-mentioned sector failure is relatively likely to occur, a high speed control is realized by operating a plurality of disks in parallel, and redundant data called parity is called a parity disk. By storing on a specific disk, data is stored by any chance 1
Even if one disk fails, a disk control method called RAID that can reproduce the data of the failed disk from other disks and the parity of the parity disk and can improve the reliability of disk failure is proposed. Was done.

【０００４】このＲＡＩＤに関しては、「ＡＣａｓｅ
ｆｏｒＲｅｄｕｎｄａｎｔＡｒｒａｙｓｏｆ
ＩｎｅｘｐｅｎｓｉｖｅＤｉｓｋｓ（ＲＡＩ
Ｄ）」；ＩｎＰｒｏｃ．ＡＣＭＳＩＧＭＯＤ，Ｊｕ
ｎｅ１９８８（カリフォルニア大学バークレー校発
行）に詳しい。ＲＡＩＤは、そのパリティの格納の方法
によりレベル１からレベル５があるが、現在広く耐障害
性を持つディスクアレイで用いられているのはレベル１
とレベル５である。Regarding this RAID, "A Case
for Redundant Arrays of
Inexpensive Disks (RAI
D) "; In Proc. ACM SIGMOD, Ju
Learn more about ne 1988 (published by the University of California, Berkeley). RAID has levels 1 to 5 depending on the method of storing the parity, but the level 1 is currently widely used in disk arrays having fault tolerance.
And level 5.

【０００５】冗長構成を有する上記ＲＡＩＤ５構成を取
るディスクアレイ装置では、セクタ障害が発生した場
合、同一パリティグループに属する残りのディスク装置
によってデータを復元し、交代領域への書き込みを行う
ことにより障害データを回復させていた。In a disk array device having the above-mentioned RAID 5 structure having a redundant structure, when a sector failure occurs, the data is restored by the remaining disk devices belonging to the same parity group, and the failure data is written to the alternate area. Had been recovered.

【０００６】ここで、図３に示す４つのディスクへのデ
ータ配置を例に取って、レイド５構成について述べる
と、ディスクアレイ制御装置に転送されてきたデータを
０，１，２のように例えば５１２バイトの固定長に分け
てディスク１，２，３に転送するとともに、パリティ部
Ｐとしてデータ０，１，２のエクシクルーシブオアを取
って５１２バイトの固定長を形成してディスク４に転送
する。そして、このデータ０，１，２とパリティＰが同
一パリティグループを形成しているのである。図３の矢
印に示すように、データ０，１，２，パリティＰ，デー
タ０，１，２，パリティＰ，…と続くディスク１〜４へ
のデータ配置はレイド５の規格である。Here, taking the data arrangement on the four disks shown in FIG. 3 as an example, the RAID 5 configuration will be described. The data transferred to the disk array control device is, for example, 0, 1, 2. The data is divided into a fixed length of 512 bytes and transferred to the disks 1, 2 and 3, and the exclusive OR of the data 0, 1 and 2 is taken as the parity part P to form a fixed length of 512 bytes and transferred to the disk 4. To do. The data 0, 1, and 2 and the parity P form the same parity group. As shown by the arrow in FIG. 3, the data allocation to the disks 1 to 4 following the data 0, 1, 2, parity P, the data 0, 1, 2, parity P, ... Is the standard of RAID 5.

【０００７】そして、ＲＡＩＤ５構成を取るひとつのデ
ィスクからデータを読み出そうとする場合、パリティグ
ループを構成する全データディスクとパリティディスク
からデータを読みとってエラーの検出をすることは可能
であるが、複数のディスクからデータを読み出すため大
きな性能低下を招く。When it is attempted to read data from one disk having the RAID5 configuration, it is possible to read the data from all the data disks and the parity disks which form the parity group to detect the error. Since data is read from a plurality of disks, a large performance drop will occur.

【０００８】このため、この様なＲＡＩＤ装置の場合、
所定のデータにエラー検出用のコードを付け、その両方
をデータとして書き込み、読み出しをおこなうことによ
り単体ディスクの読み出しでエラー検出が行える様に構
成するのが通常である。即ち、データ０，１，２には、
前述した５１２バイトの生データに加えて、ディスクの
ロジカルアドレスＬＡとチェックビットＣＨを付加して
いる。そして、このＬＡやＣＨによって単体ディスクで
のエラー検出を行っている（図２を参照すると、図２の
データ２０が生データに対応し、図２のＬＡ２１がロジ
カルアドレスに対応し、ＲＣＣ２３がチェックビットに
対応するものである）。Therefore, in the case of such a RAID device,
It is usual to add a code for error detection to predetermined data, write both of them as data, and perform reading so that the error can be detected by reading a single disk. That is, the data 0, 1, 2
In addition to the 512-byte raw data described above, a logical address LA of the disc and a check bit CH are added. Then, error detection is performed on a single disk by this LA or CH (refer to FIG. 2, the data 20 of FIG. 2 corresponds to raw data, the LA 21 of FIG. 2 corresponds to a logical address, and the RCC 23 checks. It corresponds to a bit).

【０００９】また、特開平１０ー１７１６０８号公報に
は、データ部に対してアドレス情報と時間的要素を表す
情報とを付加することが開示されている。このようなデ
ータフォーマットを採用することで、磁気ディスク装置
のキャッシュレジスタの古いデータを誤って読みだした
場合でも磁気ディスク装置用データに変換する際にデー
タにタイムスタンプを追加記憶しておくので、データの
読み出し又は書き込みの際にデータの異常を検出できる
ようになっている。Further, Japanese Laid-Open Patent Publication No. 10-171608 discloses that address information and information indicating a temporal element are added to the data part. By adopting such a data format, even if the old data in the cache register of the magnetic disk device is erroneously read, the time stamp is additionally stored in the data when converting to the data for the magnetic disk device. A data abnormality can be detected when reading or writing data.

【００１０】[0010]

【発明が解決しようとする課題】従来技術として説明し
た技術は、主にディスクの媒体に対する書き込み及び読
み出し時のセクタ障害に代表されるエラーの検出、及び
インターフェース関連のエラー検出とその回復を目的と
する手段である。The technique described as the prior art is mainly aimed at detecting an error represented by a sector failure at the time of writing and reading to and from a disk medium, and detecting and recovering an error related to an interface. Is a means to do.

【００１１】一方、近年ディスクドライブはパーソナル
コンピュータ用を中心に著しい価格の低下が進んだが、
それにともない低価格化を重視してメディア（記録媒
体）の障害に比して発生頻度の低い内部制御系のチェッ
クの省略が行われたため、ディスクの制御部でのエラー
検出能力は相対的に低くなっている。この場合でも、デ
ィスク内部で発生したエラーが、例えばデータ系のビッ
ト化けなどの単純なデータ誤りとして書きこまれるなら
ば、従来技術で説明したＲＡＩＤデータのエラー検出符
号によって検出可能であるが、制御部用のデータを保持
するディスク内のバッファに障害が発生した場合、書き
込もうとしたデータを誤った位置に書き込む誤動作が発
生し得る。On the other hand, in recent years, the price of disk drives has been remarkably reduced mainly for personal computers, but
Along with that, since the cost of price reduction was emphasized, the check of the internal control system, which occurs less frequently than the failure of the medium (recording medium), was omitted, so the error detection capability of the disk controller was relatively low. Has become. Even in this case, if the error generated inside the disk is written as a simple data error such as a garbled bit of the data system, it can be detected by the error detection code of the RAID data described in the prior art. When a failure occurs in the buffer in the disk that holds the copy data, a malfunction may occur in which the data to be written is written in the wrong position.

【００１２】この種のエラーでは、書き込まれた先のデ
ータ（誤った位置に書き込まれたデータ）も、上書きさ
れないで残ったデータ（本来、上書きされて消去される
べき旧データ）も正しいチェックコードを持つ場合に、
特に、書き込まれるべきデータが誤った場所に書き込ま
れたことによって書き込まれないで残った旧データは、
これ自体単独では矛盾していないものなので、これが誤
っていることを検出する手段が従来の技術では存在しな
い。In this kind of error, correct data is written for both the written data (data written in the wrong position) and the data left unwritten (the original data that should be overwritten and erased). If you have
In particular, the old data that was left unwritten because the data that should be written was written in the wrong location,
There is no means in the prior art to detect that this is wrong, as it is not inconsistent by itself.

【００１３】敷衍すると、５１２Ｂの生データ、位置情
報のＬＡ、チェックビットのＣＨ、からなるデータ部
と、生データから生成されたパリティデータ、ＬＡ及び
ＣＨからなるパリティ部と、で構成する従来のＲＡＩＤ
の場合、パリティ部（Ｐ）と全てのデータ部（０，１，
２）との読み出しでエラーチェックすれば間違いなくエ
ラー検出できるが、前述した読み出しの性能低下を考慮
すると、データ部の読み出しでデータとチェックコード
ＣＨのチェックで問題なければ正のデータとしてホスト
に上げるという取り扱いをする場合がある。このような
場合において、データが誤った位置に書き込まれた際
に、書き込まれるべき位置に存するデータの読み出し指
令に対して（ディスクアレイ制御装置は当該位置に新し
いデータが書き込まれていると認識しているから）、上
書きで消去されずに残っていた旧データが読み出される
こととなる。旧データはそれ自体ではデータとＣＨとが
一致するものであるから、この旧データの読み出しにエ
ラー検出が掛からない（ＬＡも問題がないのであるか
ら）こととなる。[0013] According to the conventional method, a data portion including 512B of raw data, LA of position information, CH of check bits, and parity data generated from the raw data and a parity portion of LA and CH are used. RAID
, The parity part (P) and all data parts (0, 1,
Although an error can be definitely detected by performing an error check by reading with 2), considering the above-mentioned read performance degradation, if there is no problem in checking the data and check code CH in the reading of the data section, it is sent to the host as positive data. May be handled. In such a case, when the data is written in the wrong position, the disk array controller recognizes that new data has been written in the position where the data should be written. Therefore, the old data that has not been erased by overwriting and remains, will be read. Since the old data itself is the same as the CH, the error detection does not occur in the reading of the old data (since LA has no problem).

【００１４】本発明の目的は、互いに共通するデータに
基づいた付加チェックコードをデータ部とパリティ部の
両方に保持して、データ部とパリティ部の２つのディス
クでその付加チェックコードの検出を行うことによっ
て、誤って旧データが読まれている障害を検出するとと
もに、付加チェックコードによってデータのエラーをも
検出でき得ることにある。An object of the present invention is to hold an additional check code based on common data in both the data section and the parity section, and detect the additional check code in two disks, the data section and the parity section. As a result, it is possible to detect a fault in which old data is erroneously read and also to detect a data error by the additional check code.

【００１５】[0015]

【課題を解決するための手段】本発明は、主として次の
ようなディスクアレイ制御方式を採用することによっ
て、従来技術では為し得なかったエラー検出を実現する
機能乃至作用を有するものである。（１）パリティ部のパリティデータに加えて、パリティ
付随情報として、ＲＡＩＤを構成するデータ部に書き込
んだ識別情報（図２に示すＣＣ０、ＣＣ１又はＣＣ２で
あり、データ２０に基づいて生成されるもの）を付加チ
ェックデータとして加えて書き込みを行う。（２）常にデータ部とパリティ部の二つのディスク単位
でデータの読み出しを行い、前記データ部の識別情報と
パリティ部のパリティ付随情報とを比較し、誤って古い
データが読まれている障害を検出する。SUMMARY OF THE INVENTION The present invention mainly has a function or action for realizing error detection, which cannot be achieved by the prior art, by adopting the following disk array control system. (1) Identification information (CC0, CC1 or CC2 shown in FIG. 2 which is generated based on the data 20 and which is written in the data portion forming the RAID as the additional information of the parity in addition to the parity data of the parity portion. ) Is added as additional check data and writing is performed. (2) Data is always read out in units of two disks, the data section and the parity section, and the identification information of the data section and the parity accompanying information of the parity section are compared with each other to detect a fault in which old data is erroneously read. To detect.

【００１６】本発明においては、読み出しには必ず二個
のディスクにアクセスするため、読みだし性能は多少低
下するが、古いデータを誤って読み出す障害を検出する
ことができる。According to the present invention, since two disks are always accessed for reading, the reading performance is somewhat lowered, but a failure to read old data by mistake can be detected.

【００１７】以上のような機能乃至作用を果たすため
に、本発明は主として次のような構成を採用する。複数
のディスク装置を使って構成したディスクアレイ装置に
おいて、パリティグループを構成する複数のデータディ
スクに、各データディスクに書き込まれた各データに基
づいて生成された各付加チェックコードをそれぞれ書き
込むとともに、前記パリティグループを構成するパリテ
ィディスクに、全ての各付加チェックコードを書き込
み、前記複数データディスクの１つと前記パリティディ
スクの前記付加チェックコードを読み出してエラーチェ
ックを行うディスクアレイ制御方式。In order to achieve the above functions and actions, the present invention mainly employs the following configurations. In a disk array device configured using a plurality of disk devices, each additional check code generated based on each data written in each data disk is written in each of the plurality of data disks forming the parity group, and A disk array control method in which all of the additional check codes are written to a parity disk that constitutes a parity group, and one of the plurality of data disks and the additional check code of the parity disk are read to perform an error check.

【００１８】[0018]

【発明の実施の形態】本発明の実施形態に係るディスク
アレイ制御方式について、図１、図２及び図３を用いて
以下詳細に説明する。図１は本発明の実施形態に係るデ
ィスクアレイ装置の概略的構成を示す図であり、図２は
本実施形態におけるディスクに書き込むデータフォーマ
ットを示す図であり、図３は本実施形態におけるディス
ク内のデータ配置を示す図である。BEST MODE FOR CARRYING OUT THE INVENTION A disk array control system according to an embodiment of the present invention will be described in detail below with reference to FIGS. 1, 2 and 3. FIG. 1 is a diagram showing a schematic configuration of a disk array device according to an embodiment of the present invention, FIG. 2 is a diagram showing a data format to be written to a disk according to this embodiment, and FIG. 3 is an inside disk of this embodiment. It is a figure which shows the data arrangement of.

【００１９】図１において、１１０はホストとなるサー
バ１０への通信路を示す。この通信路１１０には、一般
にＳＣＳＩやファイバーチャネル等が使われる。図の１
２は、本実施形態における制御を実施するディスクアレ
イ制御装置であり、ホスト通信制御部１２１、装置制御
を行うプロセッサ部１２２、データのキャッシングを行
うキャッシュ部１２３、ディスクの制御を行うディスク
制御部１２４、から構成される。また、ディスクアレイ
制御装置１２にはディスク１３が接続される。In FIG. 1, reference numeral 110 indicates a communication path to the server 10 which serves as a host. For this communication path 110, generally SCSI or fiber channel is used. Figure 1
Reference numeral 2 denotes a disk array control device that performs control in the present embodiment, and includes a host communication control unit 121, a processor unit 122 that performs device control, a cache unit 123 that caches data, and a disk control unit 124 that controls disks. ,,. A disk 13 is connected to the disk array controller 12.

【００２０】本実施形態では、図３を参照して、３個の
データに付きパリティデータを１つ用意し、４つのディ
スクでパリティグループを構成するＲＡＩＤ５構成を取
るため、４の整数倍のディスク台数が必要であり、図３
では簡単のため４台記載している。データとパリティ自
体は、図３に示す様に分散されて配置されるため、デー
タ用のディスクとパリティ用の専用ディスクに分かれて
いるわけではない。In the present embodiment, referring to FIG. 3, one piece of parity data for three pieces of data is prepared, and a RAID5 configuration in which a parity group is formed by four disks is adopted. The number is required, and Fig. 3
For simplicity, 4 units are shown. Since the data and the parity itself are distributed and arranged as shown in FIG. 3, they are not divided into a data disk and a dedicated parity disk.

【００２１】図２は、ＲＡＩＤ装置に書き込むデータの
フォーマットを示す。データ部とパリティ部は同じフォ
ーマットを使用する。図２の２０はディスクの保持する
データ、図２の２１はそのデータのディスク中の位置情
報、図２の２２は本発明の特徴である付加チェックコー
ドデータ、図２の２３はデータ、位置情報、付加チェッ
クコードから生成されるチェックコードである。FIG. 2 shows a format of data to be written in the RAID device. The data part and the parity part use the same format. Reference numeral 20 in FIG. 2 is data held by the disc, 21 in FIG. 2 is position information of the data in the disc, 22 in FIG. 2 is additional check code data which is a feature of the present invention, and 23 in FIG. 2 is data and position information. , A check code generated from the additional check code.

【００２２】本実施形態では、パリティディスクに書き
込む付加チェックコードは、データ部の書き込みに用い
る付加チェックコードと同一のもの（但し、データと位
置情報から生成されたもの）を用い、またデータ部で用
いる付加チェックコードは、パリティ部に用いる付加チ
ェックコードを用いる。実際に書き込むデータに付随す
るチェックコードＲＣＣは、データ、位置情報、付加チ
ェックコードから生成された全体のデータに対するチェ
ックコードである。ここで、付加チェックコードは前述
したようにデータ２０に基づいて生成されたチェックコ
ードであることが特徴であり、ＣＲＣ（Ｃｙｃｌｉｃ
ＲｅｄｕｎｄａｎｃｙＣｈｅｃｋ）コードを用いても
良い。いずれもデータ２０に基づいたチェックコードで
あるから、この付加チェックコードを用いて、読み出し
データのエラーチェックも可能であり、更に、付加チェ
ックコードの生成には既存のデータ及び／又は位置情報
ＬＡを利用していて、特別な他の情報源を必要とした
り、利用するものではな。また、付加チェックビットと
してデータを書き込んだ際に書き込みを一意に識別でき
る識別子を用いても良い。In this embodiment, the additional check code to be written to the parity disk is the same as the additional check code used for writing the data section (however, it is generated from the data and the position information), and the additional check code is used in the data section. The additional check code used is the additional check code used in the parity part. The check code RCC accompanying the data to be actually written is a check code for the entire data generated from the data, the position information, and the additional check code. Here, the additional check code is characterized by being a check code generated based on the data 20 as described above, and CRC (Cyclic).
A Redundancy Check) code may be used. Since each is a check code based on the data 20, it is possible to check the read data for an error by using this additional check code. Furthermore, existing data and / or position information LA is used to generate the additional check code. You are using it and do not need or use any other special sources of information. Further, an identifier that can uniquely identify writing when writing data may be used as the additional check bit.

【００２３】図３は、本実施形態のおけるディスク内の
データ配置を示す。パリティグループを構成する各ディ
スク３０，３１，３２，３３に、３つのデータ３４，３
５，３６及びパリティ３７が図示の様に格納されるが、
これは図示するように６４ＫＢｙｔｅｓ程度の細かい単
位で別のディスクを使うようになっており、それによっ
て負荷が高くなるパリティ部を全ディスクに分散させて
負荷の均一化と性能向上を図っている。FIG. 3 shows the data arrangement in the disc in this embodiment. Each disk 30, 31, 32, 33 forming the parity group has three data 34, 3
5, 36 and parity 37 are stored as shown,
As shown in the figure, another disk is used in a fine unit of about 64 KBytes, whereby the parity part, which increases the load, is distributed to all the disks to make the load uniform and improve the performance.

【００２４】本実施形態の実際の動作を以下詳細に記載
する。本実施形態では、ホスト１０側から通信路１１０
経由でデータの書き込みが指定された場合、ホストから
送られてきた書き込むべきデータを５１２Ｂｙｔｅｓ単
位に分割して、まずキャッシュメモリ部１２３にバッフ
ァする。次に、制御プロセッサ１２２により、ディスク
１３に書くべきデータを図２のフォーマットに従って生
成する。ここで、データ２０は、ホストから送られてき
たデータそのものであり、位置情報２１は、内部で管理
している書き込むべきデータのディスクアレイ内の位置
である。これは８バイトからなり、上位２バイトは、デ
ィスクアレイ制御装置１２につながったディスクの通番
（接続位置情報）であり、下位６Ｂｙｔｅｓは、ディス
ク内のデータの書き込み開始セクタ番号を示す。The actual operation of this embodiment will be described in detail below. In this embodiment, the communication path 110 is transmitted from the host 10 side.
When data writing is designated via the data, the data to be written sent from the host is divided into 512 bytes and buffered in the cache memory unit 123 first. Next, the control processor 122 generates data to be written on the disk 13 according to the format shown in FIG. Here, the data 20 is the data itself sent from the host, and the position information 21 is the position in the disk array of the data to be written managed internally. This consists of 8 bytes, the upper 2 bytes are the serial number (connection position information) of the disk connected to the disk array control device 12, and the lower 6 bytes indicate the write start sector number of the data in the disk.

【００２５】本実施形態では、書き込みの際には、対象
とするデータが配置されるディスクと、パリティディス
クの両方に対して同じ形式の図２に示す付加チェックコ
ード２２を持つデータを書き込む。通常のＲＡＩＤの場
合にもチェックコードとしてパリティデータを書き込む
ので、本実施形態が従来フォーマットと異なる点は、前
記付加チェックコード２２を付随データとして書き込む
点である。In this embodiment, at the time of writing, the data having the additional check code 22 shown in FIG. 2 of the same format is written to both the disk where the target data is arranged and the parity disk. Since parity data is written as a check code even in the case of normal RAID, the present embodiment differs from the conventional format in that the additional check code 22 is written as ancillary data.

【００２６】この時、書き込むデータの付加チェックコ
ードデータ２２には、本実施形態のＲＡＩＤ構成に対応
した、３つの付加チェックコードのフィールド２２０，
２２１，２２２があるが、データ部の付加チェックコー
ドは、自データの位置の付加チェックコードのみ更新、
パリティデータの付加チェックコードは、書き込んだデ
ータに対応する付加チェックコードのみ更新し、他の部
分は元のままとする。At this time, in the additional check code data 22 of the data to be written, there are three additional check code fields 220, which correspond to the RAID configuration of this embodiment.
221 and 222 exist, but the additional check code of the data part is only the additional check code of the position of the own data updated.
As for the additional check code of the parity data, only the additional check code corresponding to the written data is updated, and the other parts remain unchanged.

【００２７】例えば、パリティグループの第一のデータ
領域３４にデータを書き込んだ場合（図３参照）、その
データからパリティデータを再計算し、パリティデータ
の付加チェックコードをデータ領域３４のフィールド２
２０の位置に書き、データから生成したチェックコード
をパリティデータ３７のフィールド２２０の位置に書
く。最後に、このデータ全体（データ２０、ＬＡ２１、
付加チェックコード２２）に対して所定の検出能力を持
ったチェックコード２３を生成し、データとして追加す
る。本実施形態では、８Ｂｙｔｅｓのチェックコードを
付けており、５１２Ｂｙｔｅｓのデータにつき実際にデ
ィスクに書き込まれるのは５５２Ｂｙｔｅｓである。こ
の５５２Ｂｙｔｅｓのデータをディスク制御装置１２か
ら実際のディスク１３に書き込む。For example, when data is written in the first data area 34 of the parity group (see FIG. 3), the parity data is recalculated from the data and the additional check code of the parity data is stored in the field 2 of the data area 34.
The check code generated from the data is written in the position 220 in the field 220 of the parity data 37. Finally, this whole data (data 20, LA21,
A check code 23 having a predetermined detection capability is generated for the additional check code 22) and added as data. In the present embodiment, a check code of 8 Bytes is attached, and 552 Bytes are actually written to the disk for 512 Bytes of data. This 552 Bytes data is written from the disk controller 12 to the actual disk 13.

【００２８】以上説明したディスクデータフォーマット
について、図２の（２）を用いて再度説明する。ホスト
サーバ１０から送られてきたデータは、ディスクアレイ
制御装置１２で、データ０，１，２のように固定長デー
タに分割される。更に、パリティＰはデータ０，１，２
からエクシクルーシブオアを取って作成された５１２Ｂ
のデータである。これらデータ０、１，２，パリティＰ
は、図３に示すようにディスク１〜４に配置される。こ
こで、データ０のデータフォーマットについて、本実施
形態の特徴である付加チェックコードＣＣ０が５１２Ｂ
のデータ、ＬＡ２１に基づいて作成されてＬＡ２１に続
いて書き込まれ、更に、チェックコードＲＣＣ２３が５
１２Ｂデータ、ＬＡ２１及びＣＣ０（２２０）に基づい
て作成され書き込まれる。データ１及びデータ２につい
ても、同様に付加チェックコードＣＣ１（２２１）及び
ＣＣ２（２２２）が図示のように書き込まれる。また、
パリティＰについては、データ２０、ＬＡ２１に続い
て、ＣＣ０（２２０）、ＣＣ１（２２１）、ＣＣ２（２
２２）が全て書き込まれる。The disk data format described above will be described again with reference to FIG. The data sent from the host server 10 is divided by the disk array controller 12 into fixed length data such as data 0, 1, 2. Furthermore, the parity P is data 0, 1, 2
512B created by taking exclusive OR from
Data. These data 0, 1, 2, parity P
Are arranged on the disks 1 to 4 as shown in FIG. Here, regarding the data format of the data 0, the additional check code CC0 which is the feature of this embodiment is 512B.
Data, which is created based on LA21 and is written following LA21, and the check code RCC23 is 5
It is created and written based on 12B data, LA21 and CC0 (220). Similarly, for data 1 and data 2, additional check codes CC1 (221) and CC2 (222) are written as shown. Also,
As for the parity P, CC0 (220), CC1 (221), CC2 (2
22) are all written.

【００２９】一方、本実施形態では読み出しの際の処理
は通常と異なり、本実施形態に即して必ず対象とするデ
ータが配置されるディスク３０〜３３の一つとパリティ
データ３７の両方を読み込む。読み出した付加チェック
コード２２が読み出したデータの対応する位置の付加チ
ェックコードと一致しているかを確認してエラーの検出
を行う。本実施形態の場合、エラー検出は以下の判定で
行う。On the other hand, in the present embodiment, the processing at the time of reading is different from usual, and in accordance with the present embodiment, one of the disks 30 to 33 in which the target data is arranged and the parity data 37 are always read. An error is detected by checking whether the read additional check code 22 matches the additional check code at the corresponding position of the read data. In the case of the present embodiment, error detection is performed by the following judgment.

【００３０】（１）チェックコード２３とデータ２０が
不一致の場合には、そのデータはエラーである。これは
ディスク内部の書き込み時のデータ系の障害によって正
しいデータが書き込まれなかったものであると考えられ
る。(1) If the check code 23 and the data 20 do not match, the data is an error. It is considered that this is because correct data was not written due to a failure of the data system during writing in the disc.

【００３１】（２）チェックコード２３とデータ２０は
一致しているが、読み出されたデータに埋め込まれた位
置情報２１と、そのデータ２０が本来あるべき位置（デ
ィスクアレイ制御装置が書き込みを指令した位置）が一
致していなかった場合は、その不一致となっているデー
タはエラーである。これはディスク１３の内蔵制御部障
害によって誤った位置に書き込まれた先のものが読み出
されたケースである。(2) The check code 23 and the data 20 match, but the position information 21 embedded in the read data and the position where the data 20 should be (the disk array control device issues a write command). If the specified position) does not match, the mismatched data is an error. This is a case where the destination of the data written in the wrong position is read due to the failure of the built-in control unit of the disk 13.

【００３２】（３）付加チェックコード２２が一致して
いなかった場合には、まずパリティグループのデータを
読み、そのデータから不一致の各データが誤っていたこ
とを仮定した二通りのデータ再構築を行い、その結果と
付加チェックコード２２を比較して矛盾している側が誤
ったデータである。これは、制御部障害によって書き込
まれるべきデータが誤った位置に書き込まれたため、旧
データが読み出されているケースである。即ち、図２の
（２）の場合を例にすると、データ０とパリティＰを読
み出して付加チェックコードが不一致の場合、データ０
とパリティＰのいずれかがエラーである。そこで、デー
タ１とパリティＰ、データ２とパリティＰ、の組み合わ
せで付加チェックコードの一致をみて、データ０とパリ
ティＰのいずれかのエラーを検出する。データ０がエラ
ーであると解ると、データ１、データ２及びパリティＰ
からデータ０の回復を行う。(3) If the additional check codes 22 do not match, the data of the parity group is first read, and two types of data reconstruction are performed on the assumption that each mismatched data is incorrect. The result is compared with the additional check code 22, and the inconsistent side is erroneous data. This is a case where the old data is being read because the data to be written was written in the wrong position due to the control unit failure. That is, taking the case of (2) of FIG. 2 as an example, when the data 0 and the parity P are read and the additional check code does not match, the data 0
Or parity P is in error. Therefore, the combination of the data 1 and the parity P and the combination of the data 2 and the parity P are checked for the coincidence of the additional check code to detect any error of the data 0 and the parity P. If data 0 is found to be an error, data 1, data 2 and parity P
Data 0 is recovered from.

【００３３】（４）不一致やチェックコードとの不整合
がなかった時にはデータは正しく、そのまま用いて良
い。(4) If there is no mismatch or mismatch with the check code, the data is correct and can be used as it is.

【００３４】上記判定によりデータのエラーが検出され
た場合には、改めてアレイを構成する全データを読み込
み、そのデータから障害によって失われたデータを回復
する。リカバリ時は、付加チェックコードデータを読み
出したデータから再構築することを除けば通常のＲＡＩ
Ｄデータ回復手順と全く同じ処理である。If a data error is detected by the above judgment, all the data that make up the array are read again, and the data lost due to the failure is recovered from that data. At the time of recovery, except for rebuilding the additional check code data from the read data, the normal RAI
The process is exactly the same as the D data recovery procedure.

【００３５】本実施形態では、データディスク３つに対
してパリティディスク１つを置く構成を取ったが、チェ
ックコード２３のフィールド数を増減させることにより
他のＲＡＩＤ構成に対応させることも容易に可能であ
る。In this embodiment, one parity disk is placed for three data disks, but it is possible to easily adapt to other RAID configurations by increasing or decreasing the number of fields of the check code 23. Is.

【００３６】本実施形態では、識別子としてＣＲＣ等の
情報を用いたが、これはＲＡＩＤ装置としてのエラー回
復処理が容易となるためである。例えば、データ部を書
き込んだ書き込み通算番号をパリティ部に保持する様に
してもＲＡＩＤ５では、同様のエラー検出能力を得るこ
とができる。この場合３台以下のパリティグループ構成
では、どちらのデータが正しいかを知ることができない
という難点があり、チェックコードを更に追加する必要
がある。In the present embodiment, information such as CRC is used as the identifier because this facilitates the error recovery process as a RAID device. For example, even if the write total number in which the data part is written is held in the parity part, the same error detection capability can be obtained in RAID5. In this case, with a parity group configuration of three or less, it is difficult to know which data is correct, and it is necessary to add a check code.

【００３７】[0037]

【発明の効果】本発明によれば、ディスク制御部でのエ
ラー検出能力の低いディスクを使いながら、エラー検出
能力を備え、かつ性能低下を抑えたＲＡＩＤ装置を構成
することができる。As described above, according to the present invention, it is possible to construct a RAID device having error detection capability and suppressing performance degradation while using a disk with low error detection capability in the disk control unit.

[Brief description of drawings]

【図１】本発明の実施形態に係るディスクアレイ装置の
概略的構成を示す図である。FIG. 1 is a diagram showing a schematic configuration of a disk array device according to an embodiment of the present invention.

【図２】本実施形態におけるディスクに書き込むデータ
フォーマットを示す図である。FIG. 2 is a diagram showing a data format to be written on the disc in the present embodiment.

【図３】本実施形態におけるディスク内のデータ配置を
示す図である。FIG. 3 is a diagram showing a data arrangement in a disc in the present embodiment.

[Explanation of symbols]

１２ディスクアレイ制御装置１３ディスク２０データ２１位置情報３４，３５，３６データ領域３７パリティデータ領域１１０通信路１２１ホスト通信制御部１２２制御プロセッサ１２３キャッシュメモリ部１２４ディスク制御部 12 Disk array controller 13 discs 20 data 21 Location information 34,35,36 data area 37 Parity data area 110 communication path 121 Host communication controller 122 Control Processor 123 cash memory section 124 Disk controller

Claims

[Claims]

1. In a disk array device configured by using a plurality of disk devices, a plurality of data disks forming a parity group,
While writing each additional check code generated based on each data written to each data disk, to the parity disk that constitutes the parity group,
A disk array control method characterized by writing all additional check codes.

2. In a disk array device configured by using a plurality of disk devices, a plurality of data disks forming a parity group are
Each additional check code generated based on each data written to each data disk is written, and all the additional check codes are written to the parity disks forming the parity group. A disk array control method, wherein an error check is performed by reading the additional check code of the parity disk.

3. The disk array control method according to claim 2, wherein the plurality of data disks and the parity disk are
A disk array control method characterized in that position information of data to be written in the disk array is written respectively, and the position information is used for error checking of data written at an incorrect position.

4. The disk array control system according to claim 3, wherein a check code generated from each piece of data, each piece of position information, and each piece of additional check code is written on a data disk and a parity disk that form a parity group. A disk array control method characterized by writing at the end of the format.