JP2005258866A

JP2005258866A - Disk array device and program

Info

Publication number: JP2005258866A
Application number: JP2004070384A
Authority: JP
Inventors: Shigeru Suganuma; 滋菅沼
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2004-03-12
Filing date: 2004-03-12
Publication date: 2005-09-22

Abstract

<P>PROBLEM TO BE SOLVED: To provide a disk array device for assuring the redundancy of data even if a spare disk is exhausted. <P>SOLUTION: In the disk array device, having a spare disk separate from a disk for storing data and redundant data, a processing cut/divide section 33 starts a restoration section 34 if the spare disk is present and starts an integral section 35 if no spare disks are present, when a disk failure occurs in a striping group having redundancy. The restoration section 34 restores the content of the faulty disk onto the spare disk for restoring redundancy to the striping group that has lost redundancy by the disk failure. The integral section 35 integrates the striping group that has lost redundancy by disk failure with another striping group having redundancy, thus generating one striping group having redundancy. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明はディスクアレイ装置に関し、特にデータ及び冗長データを記憶するディスクとは別に予備のディスクを有するディスクアレイ装置に関する。 The present invention relates to a disk array device, and more particularly to a disk array device having a spare disk separately from a disk for storing data and redundant data.

コンピュータシステムにおける長期記憶システムとして、磁気ディスクを用いたストレージシステムが一般的に使用されている。特に信頼性を要求される分野では、複数の磁気ディスクにデータを分散し、冗長データを付加することで信頼性を高めたディスクアレイ装置が使用されている。 As a long-term storage system in a computer system, a storage system using a magnetic disk is generally used. In particular, in a field where reliability is required, a disk array device is used in which reliability is increased by distributing data to a plurality of magnetic disks and adding redundant data.

ディスクアレイ装置の構成は、ＲＡＩＤ（ＲｅｄｕｎｄａｎｔＡｒｒａｙｏｆＩｎｅｘｐｅｎｓｉｖｅＤｉｓｋｓ）と呼ばれる形式で分類されており、データの分散方法および冗長データの記録方式により、ＲＡＩＤ０からＲＡＩＤ５までの６種類に分類されている。 The configuration of the disk array device is classified in a format called RAID (Redundant Array of Inexpensive Disks), and is classified into six types from RAID 0 to RAID 5 according to a data distribution method and a redundant data recording method.

ディスクアレイ装置は、冗長データを含まないＲＡＩＤ０構成を除き、データに冗長性を持たせてあるため、一部の磁気ディスクに異常が生じても、他の磁気ディスクからデータを復元することが可能である。そのため、ディスクアレイ装置では、磁気ディスクに異常が発生した場合、その磁気ディスクを使用不可（縮退とよぶ）にすることにより、異常な磁気ディスクを切り離して、運用を継続する。 Except for RAID 0 configuration that does not include redundant data, the disk array device has data redundancy so that even if some magnetic disks fail, data can be restored from other magnetic disks. It is. Therefore, in the disk array apparatus, when an abnormality occurs in the magnetic disk, the abnormal magnetic disk is disconnected and the operation is continued by disabling the magnetic disk (referred to as degeneration).

また、多くのディスクアレイ装置は、磁気ディスクに異常が生じた場合の、代替用の磁気ディスク（予備ディスクまたはホットスペアと呼ばれる）が搭載されており、磁気ディスク障害発生時に、速やかに障害ディスクの内容を、予備ディスク上に再構築（復旧と呼ばれる）することが可能となっている（例えば特許文献１、２参照）。このため、予備ディスクを装備したディスクアレイ装置では、人手による保守操作を行うことなく、データの復旧および冗長性の回復が可能であり、十分な台数の予備ディスクを装備したディスクアレイ装置では、長期間、保守操作を行うことなく、運用を継続することが可能である。このことは、保守作業を削減し、保守費用を低減する効果がある。 In addition, many disk array devices are equipped with a replacement magnetic disk (called a spare disk or hot spare) in the event of an abnormality in the magnetic disk. Can be reconstructed (called recovery) on a spare disk (see, for example, Patent Documents 1 and 2). For this reason, a disk array device equipped with spare disks can recover data and restore redundancy without manual maintenance. A disk array device equipped with a sufficient number of spare disks is Operation can be continued without performing maintenance operations for a period of time. This has the effect of reducing maintenance work and maintenance costs.

ディスクアレイ装置の設計者、コンピューター・システムベンダあるいはユーザは、各自の使用目的や要求に適合したＲＡＩＤ構成のディスクアレイ装置を選択する。そのため、多くのディスクアレイ装置では、制御プログラムの変更や装置のスイッチ設定等により、装置のＲＡＩＤ構成を変更することが可能である。
特開２００３−８５０１９号公報特開平１１−３３８６５０号公報 A disk array device designer, computer system vendor or user selects a disk array device having a RAID configuration suitable for his / her purpose and requirements. Therefore, in many disk array devices, the RAID configuration of the device can be changed by changing the control program, setting the device switch, or the like.
JP 2003-85019 A JP 11-338650 A

ディスクアレイ装置では、データ又は冗長データを記憶する磁気ディスクに障害が発生した場合、予備ディスクにデータまたは冗長データを復旧することにより、データの冗長性および上位装置への応答性を回復するが、度重なるディスク障害によって予備ディスクが枯渇した場合には、データが復旧できないため、データは冗長保証されない。データの冗長性が保証されない状態で磁気ディスクに障害が発生すると、データ喪失の可能性がある。 In a disk array device, when a failure occurs in a magnetic disk storing data or redundant data, the data redundancy and responsiveness to the host device are recovered by restoring the data or redundant data to the spare disk. If the spare disk is exhausted due to repeated disk failures, the data cannot be recovered, so the data is not guaranteed to be redundant. If a failure occurs in a magnetic disk in a state where data redundancy is not guaranteed, there is a possibility of data loss.

本発明の目的は、予備ディスクが枯渇した場合であっても、データの冗長性を保証することのできるディスクアレイ装置を提供することにある。 An object of the present invention is to provide a disk array device capable of guaranteeing data redundancy even when a spare disk is exhausted.

本発明の第１のディスクアレイ装置は、データ及び冗長データを記憶するディスクとは別に予備のディスクを有するディスクアレイ装置において、
ディスク障害により冗長性を失ったストライピング・グループに対し、障害ディスクの内容を予備ディスク上に復旧して冗長性を回復させる復旧手段と、
冗長性を持たないストライピング・グループと冗長性を持つストライピング・グループとを統合し、冗長性を持つ１つのストライピング・グループを生成する統合手段と、
冗長性を持つストライピング・グループにディスク障害が発生した際、予備ディスクが存在する場合は、前記復旧手段を起動し、予備ディスクが存在しない場合は、前記統合手段を起動する処理切り分け手段とを備えている。 A first disk array device of the present invention is a disk array device having a spare disk separately from a disk for storing data and redundant data.
For the striping group that has lost redundancy due to disk failure, recovery means to recover the redundancy by restoring the contents of the failed disk on the spare disk,
An integration means for integrating a non-redundant striping group with a redundant striping group and generating one redundant striping group;
When a disk failure occurs in a striping group having redundancy, the recovery unit is activated if a spare disk exists, and the separation unit is activated to activate the integration unit if there is no spare disk. ing.

本発明の第２のディスクアレイ装置は、第１のディスクアレイ装置において、前記統合手段は、冗長性を持つＲＡＩＤ４（Ｄｉ＋Ｐ）構成のストライピング・グループと冗長性を持たないＲＡＩＤ４（Ｄｊ＋Ｐ−１）構成のストライピング・グループとを統合し、冗長性を持つＲＡＩＤ４（Ｄ（ｉ＋ｊ）＋Ｐ）構成のストライピング・グループを生成する。 According to a second disk array device of the present invention, in the first disk array device, the integration means includes a RAID4 (Di + P) configuration striping group having redundancy and a RAID4 (Dj + P-1) configuration having no redundancy. And a striping group having a redundancy RAID 4 (D (i + j) + P) configuration is generated.

本発明の第３のディスクアレイ装置は、第２のディスクアレイ装置において、前記統合手段は、冗長性を持つＲＡＩＤ４（Ｄｉ＋Ｐ）構成のストライピング・グループを第１のストライピング・グループ、冗長性を持たないＲＡＩＤ４（Ｄｊ＋Ｐ−１）構成のストライピング・グループを第２のストライピング・グループとするとき、第１のストライピング・グループのパリティディスクに記憶されたパリティと、第２のストライピング・グループの全ての正常ディスクのデータをリードしてバッファに格納し、第２のストライピング・グループの正常ディスクからリードしたデータどうしの排他的論理和をとることで障害ディスクのデータを生成すると共に、第１のストライピング・グループのパリティディスクからリードしたパリティと第２のストライピング・グループのパリティとの排他的論理和をとることで、統合後のストライピング・グループ用の新パリティを生成してバッファに格納し、前記生成した障害ディスクのデータを第１のストライピング・グループのパリティディスクにおける前記リードした箇所にライトすると共に、前記生成した新パリティを第２のストライピング・グループの１つの正常ディスクの前記リードした箇所にライトする動作を繰り返すことで、第１および第２のストライピング・グループを統合したストライピング・グループを生成する。 According to a third disk array device of the present invention, in the second disk array device, the integration unit does not have a redundancy in a RAID4 (Di + P) configuration striping group having redundancy as a first striping group. When the striping group having the RAID 4 (Dj + P-1) configuration is set as the second striping group, the parity stored in the parity disk of the first striping group and all the normal disks of the second striping group The data is read and stored in the buffer, the data of the failed disk is generated by taking the exclusive OR of the data read from the normal disks of the second striping group, and the parity of the first striping group Parity read from disk By taking an exclusive OR with the parity of the striping group of 2, a new parity for the integrated striping group is generated and stored in the buffer, and the data of the generated failed disk is stored in the first striping group. By repeating the operation of writing to the read location on the parity disk of the group and writing the generated new parity to the read location of one normal disk of the second striping group, the first and second A striping group in which the striping groups are integrated is generated.

本発明の第４のディスクアレイ装置は、第１のディスクアレイ装置において、前記統合手段は、冗長性を持つＲＡＩＤ１構成のストライピング・グループと冗長性を持たないＲＡＩＤ１構成のストライピング・グループとを統合し、冗長性を持つＲＡＩＤ４（Ｄ２＋Ｐ）構成のストライピング・グループを生成する。 According to a fourth disk array device of the present invention, in the first disk array device, the integration unit integrates a RAID1 configuration striping group having redundancy and a RAID1 configuration striping group having no redundancy. Then, a striping group having a RAID4 (D2 + P) configuration with redundancy is generated.

本発明の第５のディスクアレイ装置は、第４のディスクアレイ装置において、前記統合手段は、冗長性を持つＲＡＩＤ１構成のストライピング・グループを第１のストライピング・グループ、冗長性を持たないＲＡＩＤ１構成のストライピング・グループを第２のストライピング・グループとするとき、第１のストライピング・グループの何れか１つの磁気ディスクに記憶されたデータと、第２のストライピング・グループの正常ディスクのデータとをリードしてバッファに格納し、前記リードしたデータどうしの排他的論理和をとることで、統合後のＲＡＩＤ４（Ｄ２＋Ｐ）用のパリティを生成し、このパリティを第１のストライピング・グループの何れか１つの磁気ディスクにライトする動作を繰り返すことで、第１および第２のストライピング・グループを統合したストライピング・グループを生成する。 According to a fifth disk array device of the present invention, in the fourth disk array device, the integration means uses a RAID1 configuration with redundancy as a first striping group, and a RAID1 configuration without redundancy. When the striping group is the second striping group, the data stored in one of the magnetic disks in the first striping group and the data on the normal disks in the second striping group are read. A parity for RAID 4 (D2 + P) after integration is generated by storing in the buffer and taking the exclusive OR of the read data, and this parity is stored in one of the magnetic disks in the first striping group By repeating the operation of writing to the first and second strata To produce a striping group that integrates ping group.

本発明の第６のディスクアレイ装置は、第１のディスクアレイ装置において、前記統合手段は、冗長性を持つＲＡＩＤ５（Ｄｉ＋Ｐ）構成のストライピング・グループと冗長性を持たないＲＡＩＤ５（Ｄｊ＋Ｐ−１）構成のストライピング・グループとを統合し、冗長性を持つＲＡＩＤ５（Ｄ（ｉ＋ｊ）＋Ｐ）構成のストライピング・グループを生成する。 According to a sixth disk array device of the present invention, in the first disk array device, the integration means includes a RAID5 (Di + P) configuration striping group having redundancy and a RAID5 (Dj + P-1) configuration having no redundancy. And a striping group having a redundant RAID 5 (D (i + j) + P) configuration is generated.

本発明の第７のディスクアレイ装置は、第６のディスクアレイ装置において、前記統合手段は、冗長性を持つＲＡＩＤ５（Ｄｉ＋Ｐ）構成のストライピング・グループを第１のストライピング・グループ、冗長性を持たないＲＡＩＤ５（Ｄｊ＋Ｐ−１）構成のストライピング・グループを第２のストライピング・グループとするとき、第１のストライピング・グループのパリティと第２のストライピング・グループの全ての正常ディスクのデータとをリードしてバッファに格納し、第２のストライピング・グループの正常ディスクからリードしたデータどうしの排他的論理和をとることで障害ディスクのデータを生成すると共に、第１のストライピング・グループからリードしたパリティと第２のストライピング・グループのパリティとの排他的論理和をとることで、統合後のストライピング・グループ用の新パリティを生成してバッファに格納し、前記生成した障害ディスクのデータがパリティかどうかを判別し、パリティであれば、前記生成した新パリティを第１のストライピング・グループの前記読み出したパリティに上書きし、前記生成した障害ディスクのデータがパリティでなければ、前記生成した障害ディスクのデータを第２のストライピング・グループから読み出したパリティに上書きすると共に、前記生成した新パリティを第１のストライピング・グループの前記読み出したパリティに上書きする動作を繰り返すことで、第１および第２のストライピング・グループを統合したストライピング・グループを生成する。 According to a seventh disk array device of the present invention, in the sixth disk array device, the integration unit does not have a redundancy striping group having a RAID 5 (Di + P) configuration as a first striping group. When the striping group having the RAID5 (Dj + P-1) configuration is set as the second striping group, the parity of the first striping group and the data of all the normal disks in the second striping group are read and buffered. And data of the failed disk is generated by taking the exclusive OR of the data read from the normal disks of the second striping group and the parity read from the first striping group and the second Striping group elimination with parity New parity for the integrated striping group is generated and stored in the buffer, and it is determined whether or not the data of the generated failed disk is parity. If the read parity of the first striping group is overwritten with the new parity and the data of the generated failed disk is not parity, the generated data of the failed disk is replaced with the parity read from the second striping group. Overwriting and repeating the operation of overwriting the generated new parity with the read parity of the first striping group, thereby generating a striping group in which the first and second striping groups are integrated.

本発明のプログラムは、データ及び冗長データを記憶するディスクとは別に予備のディスクを有するディスクアレイ装置のディスクアレイ制御部を構成するコンピュータを、
ディスク障害により冗長性を失ったストライピング・グループに対し、障害ディスクの内容を予備ディスク上に復旧して冗長性を回復させる復旧手段、
冗長性を持たないストライピング・グループと冗長性を持つストライピング・グループとを統合し、冗長性を持つ１つのストライピング・グループを生成する統合手段、
冗長性を持つストライピング・グループにディスク障害が発生した際、予備ディスクが存在する場合は、前記復旧手段を起動し、予備ディスクが存在しない場合は、前記統合手段を起動する処理切り分け手段、
として機能させる。 The program of the present invention is a computer that constitutes a disk array control unit of a disk array device having a spare disk separately from a disk for storing data and redundant data.
Recovery means for recovering redundancy by restoring the contents of a failed disk on a spare disk for a striping group that has lost redundancy due to a disk failure,
A means for integrating a non-redundant striping group with a redundant striping group to generate a single redundant striping group;
When a disk failure occurs in a striping group having redundancy, if a spare disk exists, the recovery unit is activated, and if there is no spare disk, a process separation unit that activates the integration unit,
To function as.

本発明によれば、予備ディスクが枯渇した場合であっても、データの冗長性を保証することができ、データ喪失確率を低減させることができる。 According to the present invention, even when the spare disk is exhausted, data redundancy can be guaranteed and the data loss probability can be reduced.

その理由は、冗長性を持たないストライピング・グループと冗長性を持つストライピング・グループとを統合し、冗長性を持つ１つのストライピング・グループを生成するからであり、冗長性を持たないストライピング・グループと冗長性を持つストライピング・グループとを統合し、冗長性を持つ一つのストライピング・グループにする方が、データ喪失確率が小さくなるためである。 The reason is that a non-redundant striping group and a redundant striping group are integrated to generate a single redundant striping group. This is because the probability of data loss becomes smaller when a striping group having redundancy is integrated into one striping group having redundancy.

図１を参照すると、本発明の第１実施例にかかるディスクアレイ装置１は、複数の磁気ディスク１００〜１２７と、ホストバス５を通じて図示しないホストコンピュータから与えられるコマンドを解釈し、ディスク制御バス４０〜４５を介して磁気ディスク１００〜１２７を制御するディスクアレイ制御部３とで構成されている。なお、磁気ディスク１００〜１２７は全て同じ記憶容量を有している。 Referring to FIG. 1, the disk array device 1 according to the first embodiment of the present invention interprets commands given from a plurality of magnetic disks 100 to 127 and a host computer (not shown) through the host bus 5, and a disk control bus 40. The disk array control unit 3 controls the magnetic disks 100 to 127 via .about.45. The magnetic disks 100 to 127 all have the same storage capacity.

複数の磁気ディスク１００〜１２７のうち、磁気ディスク１００、１１０および１２０、磁気ディスク１０１、１１１および１２１、磁気ディスク１０２、１１２および１２２、磁気ディスク１０３、１１３および１２３、磁気ディスク１０４、１１４および１２４、磁気ディスク１０５、１１５および１２５は、それぞれＲＡＩＤ４（Ｄ２＋Ｐ）のストライピング・グループ２０、２１、２２、２３、２４、２５を構成しており、残りの磁気ディスク１０６、１１６および１２６、磁気ディスク１０７、１１７および１２７は予備ディスクとなっている。ここで、ストライピング・グループとは冗長グループのことである。なお、各磁気ディスク１００〜１２７中、ディスク内にＤｘｘ（ｘは任意の数値）を付記したものはデータディスクを示し、Ｐｘと付記したものはパリティディスクを示し、Ｓｘと付記したたものは予備ディスクを示している。 Among the plurality of magnetic disks 100 to 127, magnetic disks 100, 110 and 120, magnetic disks 101, 111 and 121, magnetic disks 102, 112 and 122, magnetic disks 103, 113 and 123, magnetic disks 104, 114 and 124, The magnetic disks 105, 115, and 125 constitute RAID 4 (D2 + P) striping groups 20, 21, 22, 23, 24, and 25, respectively, and the remaining magnetic disks 106, 116, and 126, and magnetic disks 107, 117 are configured. And 127 are spare disks. Here, the striping group is a redundant group. Of the magnetic disks 100 to 127, those with Dxx (x is an arbitrary numerical value) in the disk indicate data disks, those with Px indicate parity disks, and those with Sx are spares. Shows the disc.

図２を参照すると、ディスクアレイ制御部３は、ＩＯ処理部３１、ディスク制御部３２、処理切り分け部３３、復旧部３４、統合部３５および記憶部３６を含んで構成される。記憶部３６はＲＡＭなどのメモリで構成され、各機能手段３１〜３５は、マイクロプロセッサ等のプロセッサとそれに内蔵あるいは外付けされたＲＯＭ等のメモリに記憶されたプログラムとで実現することができる。つまり、ＲＯＭ等に記憶されたプログラムは、プロセッサに読み取られ、そのプロセッサの動作を制御することにより、そのプロセッサ上に各機能手段３１〜３５を実現する。 Referring to FIG. 2, the disk array control unit 3 includes an IO processing unit 31, a disk control unit 32, a process isolation unit 33, a recovery unit 34, an integration unit 35, and a storage unit 36. The storage unit 36 is configured by a memory such as a RAM, and each of the functional units 31 to 35 can be realized by a processor such as a microprocessor and a program stored in a memory such as a ROM built in or externally attached thereto. That is, the program stored in the ROM or the like is read by the processor, and the functional units 31 to 35 are realized on the processor by controlling the operation of the processor.

記憶部３６には、予備ディスクの台数を計数する予備ディスクカウンタ３６１と、各ストライピング・グループの状態を示すストライピング・テーブル３６２と、その他必要な制御情報が記憶される。 The storage unit 36 stores a spare disk counter 361 that counts the number of spare disks, a striping table 362 that indicates the status of each striping group, and other necessary control information.

ストライピング・テーブル３６２には、磁気ディスク１００〜１２７で構成される各ストライピング・グループ２０〜２５毎に、そのＲＡＩＤタイプ、構成ディスク番号の情報が記録されている。 In the striping table 362, information on the RAID type and the constituent disk number is recorded for each of the striping groups 20 to 25 composed of the magnetic disks 100 to 127.

図３にストライピング・テーブル３６２の構成例を示す。ストライピング・テーブル３６２は、複数のエントリＥ０〜Ｅ７を有し、各エントリＥ０〜Ｅ７は、ストライピング・ステータスを設定するフィールド３６２１、ストライピング・グループを構成する磁気ディスクを一意に識別する物理ディスク番号を設定するｎ個のフィールド３６２２−１〜３６２２−ｎとで構成される。フィールド３６２１に設定するストライピング・ステータスは、「グループ名／ＲＡＩＤ／構成ディスク数」の形式で記述される。例えば、エントリＥ０は、ストライピング・グループ２０の状態を記録しており、そのＲＡＩＤタイプはＲＡＩＤ４で構成ディスク数が３であり、使用する磁気ディスクは磁気ディスク１００、１１０、１２０であることを示す。以下、エントリＥ１〜Ｅ５に、ストライピング・グループ２１〜２５の状態が記録されている。 FIG. 3 shows a configuration example of the striping table 362. The striping table 362 has a plurality of entries E0 to E7, and each entry E0 to E7 sets a field 3621 for setting a striping status, and a physical disk number for uniquely identifying the magnetic disks constituting the striping group. N fields 3622-1 to 3622 -n. The striping status set in the field 3621 is described in the format of “group name / RAID / number of constituent disks”. For example, the entry E0 records the state of the striping group 20, the RAID type is RAID 4, the number of constituent disks is 3, and the magnetic disks to be used are the magnetic disks 100, 110, and 120. Hereinafter, the states of the striping groups 21 to 25 are recorded in the entries E1 to E5.

また、ストライピング・テーブル３６２には、エントリＥ６、Ｅ７に示されるように、予備ディスクも便宜的にストライピング・グループと同様にグループ化して記録されている。例えば、エントリＥ６は、フィールド３６２１に「グループ２６／スペア用／３」と記述されており、グループ２６に３つの予備ディスクが存在することを示し、フィールド３６２２−１〜３６２２−３にそれら３つの予備ディスクが磁気ディスク１０６、１１６、１２６であることが記述されている。 In the striping table 362, as indicated by the entries E6 and E7, the spare disks are also grouped and recorded in the same manner as the striping group for convenience. For example, the entry E6 is described as “Group 26 / Spare / 3” in the field 3621, indicating that there are three spare disks in the group 26, and these three in the fields 3622-1 to 3622-3. It is described that the spare disk is the magnetic disk 106, 116, 126.

なお、各ストライピング・グループを構成する磁気ディスクのうち、どの磁気ディスクがデータディスクで、どの磁気ディスクがパリティディスクであるかの情報や、図示しないホストコンピュータで稼働するソフトウェアが認識する論理ディスクと物理ディスクとの対応表など、ディスクアレイを制御するためのその他の情報も記憶部３６で管理されるが、図示は省略してある。 Of the magnetic disks that make up each striping group, information on which magnetic disk is a data disk and which magnetic disk is a parity disk, and logical disks and physical disks recognized by software running on a host computer (not shown) Other information for controlling the disk array, such as a correspondence table with disks, is also managed by the storage unit 36, but is not shown.

ディスクアレイ制御部３のディスク制御部３２は、ＩＯ処理部３１、復旧部３４および統合部３５からリードおよびライト要求された磁気ディスクに対して、ディスク制御バス４０〜４５を通じてリードおよびライトする制御を実行する部分である。このディスク制御部３２は、磁気ディスクの故障を検出する故障検出部３２１を有しており、何れかの磁気ディスクの故障を故障検出部３２１で検出すると、処理切り分け部３３に故障した磁気ディスクの番号を通知する。 The disk control unit 32 of the disk array control unit 3 performs control to read and write the magnetic disk requested to be read and written by the IO processing unit 31, the recovery unit 34, and the integration unit 35 through the disk control buses 40 to 45. The part to execute. The disk control unit 32 includes a failure detection unit 321 that detects a failure of a magnetic disk. When the failure detection unit 321 detects a failure of any one of the magnetic disks, the process separation unit 33 detects the failure of the magnetic disk. Give the number.

ＩＯ処理部３１は、ホストバス５を通じて図示しないホストコンピュータから受信したリードコマンド、ライトコマンドを処理する部分であり、記憶部３６に記憶されているストライピング・グループの状態などを参照して、ディスク制御部３２を介して必要な磁気ディスク１００〜１２７をアクセスし、結果をホストバス５を通じて要求元のホストコンピュータに返却する。 The IO processing unit 31 is a part that processes a read command and a write command received from a host computer (not shown) through the host bus 5, and refers to the status of the striping group stored in the storage unit 36, etc. The necessary magnetic disks 100 to 127 are accessed via the unit 32, and the result is returned to the requesting host computer via the host bus 5.

復旧部３４は、ディスク障害により冗長性を失ったストライピング・グループに対し、障害ディスクの内容を予備ディスク上に復旧して冗長性を回復させる処理を行う部分である。復旧部３４は、処理切り分け部３３から、復旧対象となるストライピング・グループと、障害ディスクに代えて使用する１つの予備ディスクとが指示されて起動されると、図４に示す処理を実行する。まず、ディスク制御部３２を介してストライピング・グループの全ての正常ディスクから所定単位でデータをリードして図示しないバッファに格納し（Ｓ１１）、次に、このリードしたデータの排他的論理和をとることで障害ディスクのデータを生成してバッファに格納し（Ｓ１２）、この生成したデータをディスク制御部３２を介して予備ディスクにライトする（Ｓ１３）。この動作を繰り返し、障害ディスクの全データを予備ディスク上に復元し終えると（Ｓ１４でＹＥＳ）、記憶部３６上の予備ディスクカウンタ３６１の値を−１し、ストライピング・テーブル３６２中の当該ストライピング・グループの情報を更新し、障害ディスクの内容を復旧した予備ディスクを、新たな現用ディスクとして組み入れる（Ｓ１５）。また、論理ディスクと物理ディスクとの対応を更新する。 The recovery unit 34 is a part that performs processing for recovering the redundancy by restoring the contents of the failed disk on the spare disk for the striping group that has lost redundancy due to the disk failure. The recovery unit 34 executes the processing shown in FIG. 4 when the processing partitioning unit 33 instructs and starts the striping group to be recovered and one spare disk to be used instead of the failed disk. First, data is read in predetermined units from all normal disks in the striping group via the disk control unit 32 and stored in a buffer (not shown) (S11), and then the exclusive OR of the read data is obtained. Thus, the data of the failed disk is generated and stored in the buffer (S12), and the generated data is written to the spare disk via the disk controller 32 (S13). When this operation is repeated and all the data of the failed disk is restored on the spare disk (YES in S14), the value of the spare disk counter 361 on the storage unit 36 is decremented by 1, and the striping table 362 in the striping table 362 is concerned. The spare disk whose group information is updated and the contents of the failed disk are restored is incorporated as a new working disk (S15). Also, the correspondence between the logical disk and the physical disk is updated.

統合部３５は、冗長性を持たないストライピング・グループと冗長性を持つストライピング・グループとを統合して、冗長性を持つ１つのストライピング・グループを生成する処理を行う部分である。統合部３５は、処理切り分け部３３から、冗長性を持つ第１のストライピング・グループと冗長性を持たない第２のストライピング・グループとが指示されて起動されると、図５に示す処理を実行する。まず、磁気ディスクの先頭領域から順に、所定単位毎に、ディスク制御部３２を介して第１のストライピング・グループのパリティディスクに記憶されたパリティと、第２のストライピング・グループの全ての正常ディスクのデータをリードして図示しないバッファに格納し（Ｓ２１）、次に、第２のストライピング・グループの正常ディスクからリードしたデータどうしの排他的論理和をとることで障害ディスクのデータを生成し（Ｓ２２）、また、第１のストライピング・グループのパリティディスクからリードしたパリティと第２のストライピング・グループのパリティ（これは、パリティディスクが障害ディスクでなければステップＳ２１でリードされており、障害ディスクであればステップＳ２２で復元されている）との排他的論理和をとることで、統合後のストライピング・グループ用の新パリティを生成してバッファに格納し（Ｓ２３）、ステップＳ２２で生成した障害ディスクのデータを第１のストライピング・グループのパリティディスクにおける前記リードした箇所にライトすると共に、ステップＳ２３で生成した新パリティを第２のストライピング・グループの何れか１つの正常ディスク（パリティディスクが正常であればパリティディスク、パリティディスクが障害であればデータディスク）の前記リードした箇所にライトする（Ｓ２４）。この動作を繰り返し、全データの復元を終えると（Ｓ２５でＹＥＳ）、ストライピング・テーブル３６２中の第１のストライピング・グループおよび第２のストライピング・グループの情報を更新し、第１のストライピング・グループが、前記新パリティを格納した磁気ディスクをパリティディスクとして持ち、データディスクとして、障害ディスクのデータを格納した磁気ディスクと元のデータディスクと第２のストライピング・グループの残りの正常なデータディスクとを持つように変更すると共に、第２のストライピング・グループを未使用状態とする（Ｓ２６）。また、論理ディスクと物理ディスクとの対応などを更新する。 The integration unit 35 is a part that performs a process of generating a single striping group having redundancy by integrating a striping group having no redundancy and a striping group having redundancy. The integration unit 35 executes the processing shown in FIG. 5 when the processing separation unit 33 is instructed and activated by the first striping group having redundancy and the second striping group having no redundancy. To do. First, in order from the top area of the magnetic disk, the parity stored in the parity disk of the first striping group via the disk control unit 32 and every normal disk of the second striping group in predetermined units. The data is read and stored in a buffer (not shown) (S21), and the data of the failed disk is generated by taking the exclusive OR of the data read from the normal disks in the second striping group (S22). In addition, the parity read from the parity disk of the first striping group and the parity of the second striping group (this is read in step S21 if the parity disk is not a failed disk, Exclusive theory) that has been restored in step S22) By taking the sum, a new parity for the integrated striping group is generated and stored in the buffer (S23), and the failure disk data generated in step S22 is read from the parity disk of the first striping group. And write the new parity generated in step S23 to one of the normal disks in the second striping group (a parity disk if the parity disk is normal, a data disk if the parity disk is faulty). Write to the read location (S24). When this operation is repeated and the restoration of all data is completed (YES in S25), the information on the first striping group and the second striping group in the striping table 362 is updated, and the first striping group is updated. The magnetic disk storing the new parity is used as the parity disk, and the data disk includes the magnetic disk storing the data of the failed disk, the original data disk, and the remaining normal data disks in the second striping group. And change the second striping group to the unused state (S26). Also, the correspondence between the logical disk and the physical disk is updated.

処理切り分け部３３は、冗長性を持つストライピング・グループにディスク障害が発生した際、予備ディスクが存在すれば復旧部３４を起動し、予備ディスクが存在しない場合は統合部３５を起動する処理を行う部分である。その処理例を図６に示す。故障検出部３２１から磁気ディスクに故障が発生した旨の通知が出されると、処理切り分け部３３は、その通知を受信し（Ｓ３１）、その障害ディスクを持つストライピング・グループの状態を記憶部３６から参照して、冗長性を有していたかどうかを判別する（Ｓ３２）。もし、冗長性を有していなかった場合（Ｓ３２でＮＯ）、データ喪失となる。この場合は他の処理へ進むが、本発明とは直接関係しないので、その場合の処理の説明は省略する。 When a disk failure occurs in a striping group having redundancy, the process isolation unit 33 activates the recovery unit 34 if there is a spare disk, and activates the integration unit 35 if there is no spare disk. Part. An example of the processing is shown in FIG. When a notification that a failure has occurred in the magnetic disk is issued from the failure detection unit 321, the process isolation unit 33 receives the notification (S 31), and stores the status of the striping group having the failed disk from the storage unit 36. Referring to it, it is determined whether or not there is redundancy (S32). If there is no redundancy (NO in S32), data is lost. In this case, the process proceeds to another process, but since it is not directly related to the present invention, description of the process in that case is omitted.

磁気ディスクが故障したストライピング・グループが冗長性を有していた場合（Ｓ３２でＹＥＳ）、処理切り分け部３３は、記憶部３６のストライピング・テーブル３６２における当該ストライピング・グループのステータス情報を更新することにより、障害ディスクを縮退し、運用から論理的に切り離す（Ｓ３３）。次に、記憶部３６の予備ディスクカウンタ３６１の値を読み出し、値０と比較する（Ｓ３４）。この比較の結果、予備ディスクカウンタ３６１の値が０より大きければ（Ｓ３４でＹＥＳ）、つまり予備ディスクが少なくとも１台存在すれば、復旧対象となるストライピング・グループと、障害ディスクに代えて使用する１つの予備ディスクとを指示して復旧部３４を起動する（Ｓ３５）。このとき、図示しない排他制御機構により、復旧部３４の起動時に、復旧対象となるストライピング・グループへのＩＯ処理部３１による通常アクセスを禁止し、復旧部３４による処理完了時に禁止を解除すれば、復旧処理中のストライピング・グループへの通常アクセスを抑止することができる。 If the striping group in which the magnetic disk has failed has redundancy (YES in S32), the process separation unit 33 updates the status information of the striping group in the striping table 362 of the storage unit 36. Then, the failed disk is degenerated and logically separated from the operation (S33). Next, the value of the spare disk counter 361 in the storage unit 36 is read and compared with the value 0 (S34). As a result of this comparison, if the value of the spare disk counter 361 is greater than 0 (YES in S34), that is, if at least one spare disk exists, the striping group to be restored and 1 used instead of the failed disk. The recovery unit 34 is activated by instructing one spare disk (S35). At this time, if the exclusive control mechanism (not shown) prohibits normal access by the IO processing unit 31 to the recovery target striping group when the recovery unit 34 is started, and cancels the prohibition when the processing by the recovery unit 34 is completed, Normal access to the striping group during recovery processing can be suppressed.

他方、予備ディスクカウンタ３６１の値が０であれば（Ｓ３４でＮＯ）、つまり予備ディスクが１台も残っていない場合、ステップＳ３３で障害ディスクを縮退したストライピング・グループを第２のストライピング・グループとして指示し、ストライピング・テーブル３６２に設定されている冗長性を有する適当なストライピング・グループを第１のストライピング・グループとして指示して、統合部３５を起動する（Ｓ３６）。冗長性を有する他のストライピング・グループが存在しない場合、ステップＳ３６はスキップされ、統合部３５は起動されない。また、統合部３５の起動時に、統合対象となる２つのストライピング・グループへのＩＯ処理部３１による通常アクセスを禁止し、統合部３５による処理完了時に禁止を解除すれば、統合処理中のストライピング・グループへの通常アクセスを抑止することができる。 On the other hand, if the value of the spare disk counter 361 is 0 (NO in S34), that is, if no spare disk remains, the striping group in which the failed disk is degenerated in step S33 is set as the second striping group. Then, an appropriate striping group having redundancy set in the striping table 362 is indicated as the first striping group, and the integration unit 35 is activated (S36). If there is no other striping group having redundancy, step S36 is skipped and the integration unit 35 is not activated. Further, when the integration unit 35 is started, normal access by the IO processing unit 31 to the two striping groups to be integrated is prohibited, and when the processing is completed by the integration unit 35, the prohibition is canceled. Normal access to the group can be suppressed.

次に本実施の形態にかかるディスクアレイ装置の動作を説明する。 Next, the operation of the disk array apparatus according to this embodiment will be described.

まず、図１に示すように合計６個のＲＡＩＤ４（Ｄ２＋Ｐ）のストライピング・グループ２０〜２５が運用されており、予備ディスクが合計６個存在している状態で、何れかのストライピング・グループの構成ディスクに障害が発生した場合の動作を説明する。 First, as shown in FIG. 1, a total of six RAID4 (D2 + P) striping groups 20 to 25 are operated, and there are a total of six spare disks, and any of the striping groups is configured. The operation when a disk failure occurs will be described.

何れかのストライピング・グループの或る磁気ディスク、例えばストライピング・グループ２１の磁気ディスク１０１が故障すると、そのことがディスクアレイ制御部３の故障検出部３２１で検出され、処理切り分け部３３に通知される。処理切り分け部３３は、その通知を受信すると（図６のＳ３１）、故障したディスク装置１０１が属するストライピング・グループ２１の状態をストライピング・テーブル３６２から参照して冗長性を有しているかどうかを判定し（Ｓ３２）、冗長性を有するので、故障した磁気ディスク１０１を縮退し（Ｓ３３）、予備ディスクカウンタ３６１の値を０と比較する（Ｓ３４）。図１に示す状態の場合、磁気ディスク１０６、１０７、１１６、１１７、１２６、１２７の合計６台が予備ディスクであり、予備ディスクカウンタ３６１の値は６になっているため、処理切り分け部３３は、ストライピング・グループ２１と予備ディスク（例えば磁気ディスク１０７とする）を指定して復旧部３４を起動する。 When a certain magnetic disk of any striping group, for example, the magnetic disk 101 of the striping group 21, fails, this is detected by the failure detection unit 321 of the disk array control unit 3 and notified to the processing separation unit 33. . When receiving the notification (S31 in FIG. 6), the process separating unit 33 refers to the state of the striping group 21 to which the failed disk device 101 belongs from the striping table 362 and determines whether or not there is redundancy. However, since it has redundancy, the failed magnetic disk 101 is degenerated (S33), and the value of the spare disk counter 361 is compared with 0 (S34). In the state shown in FIG. 1, since a total of 6 magnetic disks 106, 107, 116, 117, 126, 127 are spare disks and the value of the spare disk counter 361 is 6, The recovery unit 34 is activated by designating the striping group 21 and the spare disk (for example, the magnetic disk 107).

復旧部３４は、指定されたストライピング・グループ２１の情報をストライピング・テーブル３６２から参照し、障害ディスク１０１以外の正常な磁気ディスク１１１、１２１のデータに基づいて、指定された予備ディスク１０７上に障害ディスク１０１のデータを復元する（図４のＳ１１〜Ｓ１５）。これにより、ストライピング・グループ２１は再び冗長性を回復し、運用が継続される。 The recovery unit 34 refers to the information on the specified striping group 21 from the striping table 362 and, based on the data of the normal magnetic disks 111 and 121 other than the failed disk 101, causes a failure on the specified spare disk 107. The data on the disk 101 is restored (S11 to S15 in FIG. 4). As a result, the striping group 21 recovers the redundancy again, and the operation is continued.

ストライピング・グループ２１以外の他のストライピング・グループ２０、２２〜２５に磁気ディスクの故障が発生した場合や、一度復旧処理されたストライピング・グループに再び磁気ディスクの故障が発生した場合も、その時点で予備ディスクが存在する限り、ストライピング・グループ２１と同様に予備ディスクを使用して冗長性が回復される。 When a magnetic disk failure occurs in another striping group 20, 22 to 25 other than the striping group 21, or when a magnetic disk failure occurs again in a striping group that has been recovered once, at that time As long as there is a spare disk, redundancy is restored using the spare disk as in the striping group 21.

図７は図１の状態においてストライピング・グループ２１、２２、２３、２４および２５の磁気ディスク１０１、１１２、１０３、１２４および１１５が縮退し、予備ディスク１０７、１１６、１２７、１２６および１１７を用いた復旧処理が行われた後の、予備ディスクカウンタ３６１、ストライピング・テーブル３６２、各磁気ディスクの状態を示しており、この時点で予備ディスクカウンタ３６１の値は６から１へと変化している。 7 shows that the magnetic disks 101, 112, 103, 124 and 115 of the striping groups 21, 22, 23, 24 and 25 are degenerated in the state of FIG. 1, and the spare disks 107, 116, 127, 126 and 117 are used. The state of the spare disk counter 361, the striping table 362, and each magnetic disk after the restoration processing is shown is shown. At this time, the value of the spare disk counter 361 changes from 6 to 1.

図８は図７の状態においてストライピング・グループ２４の磁気ディスク１０４が縮退し、予備ディスク１０６を用いた復旧処理が行われた後の、予備ディスクカウンタ３６１、ストライピング・テーブル３６２、各磁気ディスクの状態を示しており、この時点で予備ディスクカウンタ３６１の値は０になる。 FIG. 8 shows the state of the spare disk counter 361, striping table 362, and each magnetic disk after the magnetic disk 104 of the striping group 24 is degenerated in the state of FIG. 7 and the recovery process using the spare disk 106 is performed. At this time, the value of the spare disk counter 361 becomes zero.

次に、図８に示すように合計６個のＲＡＩＤ４（Ｄ２＋Ｐ）のストライピング・グループ２０〜２５が運用されており、予備ディスクが１台も残っていない状態で、何れかのストライピング・グループの構成ディスクに障害が発生した場合の動作を説明する。 Next, as shown in FIG. 8, a total of six RAID4 (D2 + P) striping groups 20 to 25 are operated, and no spare disk is left, and any of the striping groups is configured. The operation when a disk failure occurs will be described.

何れかのストライピング・グループの或るディスク装置、例えばストライピング・グループ２１の磁気ディスク１０７が故障すると、そのことがディスクアレイ制御部３の故障検出部３２１で検出され、処理切り分け部３３に通知される。処理切り分け部３３は、その通知を受信すると（図６のＳ３１）、故障した磁気ディスク１０７が属するストライピング・グループ２１の状態をストライピング・テーブル３６２から参照して冗長性を有しているかどうかを判定し（Ｓ３２）、冗長性を有するので、故障した磁気ディスク１０７を縮退する（Ｓ３３）。そして、予備ディスクカウンタ３６１の値を０と比較する（Ｓ３４）。今の場合、予備ディスクカウンタ３６１の値が０になっているため、処理切り分け部３３は、統合部３５を起動する（Ｓ３６）。このとき、処理切り分け部３３は、統合部３５に対して、ストライピング・グループ２１を第２のストライピング・グループとして指定し、第１のストライピング・グループとしては冗長性を有する他のストライピング・グループ、例えばストライピング・グループ２０を指定する。 When a certain disk device in any striping group, for example, the magnetic disk 107 in the striping group 21, fails, this is detected by the failure detection unit 321 of the disk array control unit 3 and notified to the process isolation unit 33. . When receiving the notification (S31 in FIG. 6), the process separation unit 33 refers to the state of the striping group 21 to which the failed magnetic disk 107 belongs from the striping table 362 and determines whether or not there is redundancy. (S32) Since it has redundancy, the failed magnetic disk 107 is degenerated (S33). Then, the value of the spare disk counter 361 is compared with 0 (S34). In this case, since the value of the spare disk counter 361 is 0, the process separation unit 33 activates the integration unit 35 (S36). At this time, the process separation unit 33 designates the striping group 21 as the second striping group to the integration unit 35, and the first striping group has another striping group having redundancy, for example, Specifies the striping group 20.

起動された統合部３５の動作を図５のフローチャートと図９の動作説明図を参照して説明する。統合部３５は、まず、ディスク制御部３２を介してストライピング・グループ２０のパリティディスク１２０に記憶されたパリティＰ０と、ストライピング・グループ２１の全ての正常ディスク１１１、１２１のデータＤ１１、パリティＰ１とをリードし、図示しないバッファに格納する（Ｓ２１）。次に、ストライピング・グループ２１の正常ディスク１１１、１２１からリードしたデータＤ１１とパリティＰ１の排他的論理和をとることで、障害ディスク１０７のデータＤ０１を生成する（Ｓ２２）。また、ストライピング・グループ２０のパリティディスク１２０からリードしたパリティＰ０とストライピング・グループ２１のパリティディスク１２１のパリティＰ１との排他的論理和をとることで、統合後のストライピング・グループ用の新パリティＰｘを生成し、バッファに格納する（Ｓ２３）。そして、生成したデータＤ０１をストライピング・グループ２０のパリティディスク１２０における前記パリティＰ０をリードした箇所にライトすると共に、生成した新パリティＰｘをストライピング・グループ２１のパリティディスク１２１の前記パリティＰ１をリードした箇所にライトする（Ｓ２４）。 The operation of the activated integration unit 35 will be described with reference to the flowchart of FIG. 5 and the operation explanatory diagram of FIG. First, the integration unit 35 stores the parity P0 stored in the parity disk 120 of the striping group 20 via the disk control unit 32 and the data D11 and parity P1 of all the normal disks 111 and 121 of the striping group 21. Read and store in a buffer (not shown) (S21). Next, the data D01 of the failed disk 107 is generated by taking the exclusive OR of the data D11 read from the normal disks 111 and 121 of the striping group 21 and the parity P1 (S22). Also, by taking the exclusive OR of the parity P0 read from the parity disk 120 of the striping group 20 and the parity P1 of the parity disk 121 of the striping group 21, a new parity Px for the striping group after integration is obtained. It is generated and stored in the buffer (S23). Then, the generated data D01 is written to the location where the parity P0 is read from the parity disk 120 of the striping group 20, and the generated new parity Px is read from the parity P1 of the parity disk 121 of the striping group 21 (S24).

以上のような動作を所定単位で繰り返し、全データの復元を終えると（Ｓ２５でＹＥＳ）、ストライピング・テーブル３６２中のストライピング・グループ２０およびストライピング・グループ２１の情報を更新し、ストライピング・グループ２０が、新パリティＰｘを格納した磁気ディスク１２１をパリティディスクとして持ち、データディスクとして、障害ディスク１０７のデータＤ０１を格納した磁気ディスク１２０と元のデータディスク１００、１１０とストライピング・グループ２１の残りの正常なデータディスク１１１とを持つように変更すると共に、ストライピング・グループ２１を未使用状態とする（Ｓ２６）。 When the above operations are repeated in predetermined units and the restoration of all data is completed (YES in S25), the information on the striping group 20 and the striping group 21 in the striping table 362 is updated, and the striping group 20 The magnetic disk 121 storing the new parity Px is used as a parity disk, and as the data disk, the magnetic disk 120 storing the data D01 of the failed disk 107, the original data disks 100 and 110, and the remaining normal data in the striping group 21 are stored. The striping group 21 is set to an unused state (S26).

このときの予備ディスクカウンタ３６１、ストライピング・テーブル３６２、各磁気ディスクの状態を図１０に示す。ストライピング・グループ２０のステータスは、統合により、ＲＡＩＤ４（Ｄ２＋Ｐ）からＲＡＩＤ４（Ｄ４＋Ｐ）に変更され、ストライピング・グループ２１のステータスは未使用に変更されている。 FIG. 10 shows the status of the spare disk counter 361, the striping table 362, and each magnetic disk at this time. The status of the striping group 20 is changed from RAID 4 (D2 + P) to RAID 4 (D4 + P) by integration, and the status of the striping group 21 is changed to unused.

ストライピング・グループ２１以外の他のストライピング・グループ２０、２２〜２５に磁気ディスクの故障が発生した場合も、ストライピング・グループ２１と同様に冗長性を有する他のストライピング・グループと統合され、冗長性が回復される。なお、同じストライピング・グループに対して何度まで統合を許可するかは任意である。例えば、２度までの統合を許可する場合、一度統合された図１０のストライピング・グループ２１は必要に応じて別のストライピング・グループと再度統合されることになる。 When a magnetic disk failure occurs in another striping group 20, 22 to 25 other than the striping group 21, the redundancy is integrated with other striping groups having redundancy in the same manner as the striping group 21. Recovered. It is arbitrary how many times integration is permitted for the same striping group. For example, when the integration up to two times is permitted, the striping group 21 of FIG. 10 once integrated is re-integrated with another striping group as necessary.

このように本実施の形態によれば、ストライピング・グループがディスク障害により冗長性を失った場合、予備ディスクがあればその予備ディスクを使って冗長性を回復させ、予備ディスクがない場合には冗長性を持つ他のストライピング・グループと統合することにより、冗長性を回復させることにより、データ喪失確率を低減させる。以下に、データ喪失確率が低減する理由を説明する。 Thus, according to this embodiment, when a striping group loses redundancy due to a disk failure, if there is a spare disk, the spare disk is used to restore redundancy, and if there is no spare disk, redundancy is achieved. Reduce the probability of data loss by restoring redundancy by integrating with other striping groups that have sex. Hereinafter, the reason why the data loss probability is reduced will be described.

磁気ディスクｎ台によって形成される、冗長性を有するストラピング・グループのデータ喪失確率ＰＬ１は、２台以上の磁気ディスクが故障する確率で表される。２台以上の磁気ディスクに故障が発生する確率は、故障が未発生または１台のみ故障が発生する場合の補集合として求められる。個々の磁気ディスク故障が独立事象であり、その故障確率をｐとすると、データ喪失確率ＰＬ１は、下記の式（１）で表される。
ＰＬ１＝１―｛（１−ｐ）ⁿ ＋ｎ×ｐ×（１−ｐ）^(n-1)｝
＝１− （１−ｐ）ⁿ −ｎ×ｐ×（１−ｐ）^(n-1)
…（１） The data loss probability PL1 of a strapping group having redundancy formed by n magnetic disks is expressed by the probability that two or more magnetic disks will fail. The probability that a failure occurs in two or more magnetic disks is obtained as a complement when no failure occurs or only one failure occurs. If each magnetic disk failure is an independent event and its failure probability is p, the data loss probability PL1 is expressed by the following equation (1).
PL1 = 1 − {(1−p) ⁿ + n × p × (1−p) ⁽ⁿ⁻¹⁾ }
= 1− (1−p) ⁿ −n × p × (1−p) ⁽ⁿ⁻¹⁾
... (1)

上式（１）で、右辺第２項はｎ台の磁気ディスクで故障が発生しない確率、右辺第３項は、ｎ台の磁気ディスクの内、１台のみ故障が発生する確率である。 In the above equation (1), the second term on the right side is the probability that no failure will occur in the n magnetic disks, and the third term on the right side is the probability that only one of the n magnetic disks will fail.

次に、磁気ディスクｎ−１台によって形成される冗長性のないストライピング・グループで、データ喪失の発生する確率ＰＬ２は、１台以上の磁気ディスクで障害が発生する場合で、障害が発生しない場合の補集合として次式（２）で求められる。
ＰＬ２＝１−（１−ｐ）^(n-1)
…（２） Next, in the non-redundant striping group formed by n-1 magnetic disks, the probability of data loss PL2 is when a failure occurs in one or more magnetic disks and no failure occurs. Is obtained by the following equation (2).
PL2 = 1- (1-p) ^(n-1)
... (2)

よって、冗長性を持つｎ台の磁気ディスクを有するストライピング・グループと、冗長性を持たないｎ−１台の磁気ディスクを有するストライピング・グループの何れか一方、または、両方でデータ喪失が発生する確率ＰＬ３は、下記のように求められる。
ＰＬ３＝ＰＬ１×（１−ＰＬ２）＋（１−ＰＬ１）×ＰＬ２＋ＰＬ１×ＰＬ２
＝１−（１−ｐ）^(2n-1) −ｎ×ｐ×（１−ｐ）^(2n-2)
…（３） Therefore, the probability of data loss occurring in one or both of a striping group having n magnetic disks with redundancy and a striping group having n-1 magnetic disks without redundancy PL3 is obtained as follows.
PL3 = PL1 * (1-PL2) + (1-PL1) * PL2 + PL1 * PL2
= 1- (1-p) ^(2n-1) -n * p * (1-p) ^(2n-2)
... (3)

次に、２組のストライピング・グループを結合し、（２ｎ−１）台の磁気ディスクで冗長性を持つストライピング・グループを組む場合の、データ喪失確率ＰＬ４を求める。ＰＬ４は、式１におけるｎを２ｎ−１に置き換えることで、ＰＬ１より直ちに求められる。
ＰＬ４＝１−（１−ｐ）^(2n-1)−（２ｎ−１）×ｐ×（１−ｐ）^(2n-2)
…（４） Next, two sets of striping groups are combined to determine a data loss probability PL4 when a striping group having redundancy is formed by (2n-1) magnetic disks. PL4 is obtained immediately from PL1 by replacing n in Equation 1 with 2n-1.
PL4 = 1- (1-p) ^(2n-1) -(2n-1) * p * (1-p) ^(2n-2)
(4)

ＰＬ３とＰＬ４を比較することで、何れの構成が信頼性が高いかを判断できる。
ＰＬ３−ＰＬ４＝（ｎ−１）×ｐ×（１−ｐ）^(2n-2)＞０
…（５）
上式（５）が０より大きい、即ちＰＬ４がＰＬ３よりも小さいため、冗長性を持たないストライピング・グループと冗長性を持つストライピング・グループとを統合し、冗長性を持つ一つのストライピング・グループにする方が、データ喪失確率が小さくなる。 By comparing PL3 and PL4, it can be determined which configuration has high reliability.
PL3-PL4 = (n-1) * p * (1-p) ^(2n-2) > 0
... (5)
Since the above equation (5) is greater than 0, that is, PL4 is smaller than PL3, the non-redundant striping group and the redundant striping group are integrated into a single redundant striping group. This will reduce the probability of data loss.

以上の実施例は、ＲＡＩＤ４構成のストライピング・グループを持つディスクアレイ装置に対して本発明を適用したものであるが、本発明はＲＡＩＤ１、ＲＡＩＤ２、ＲＡＩＤ３、ＲＡＩＤ５のストライピング・グループを持つディスクアレイ装置に対しても適用可能である。以下、ＲＡＩＤ１とＲＡＩＤ５へ適用した実施例について説明する。 In the above embodiment, the present invention is applied to a disk array device having a striping group of RAID4 configuration. However, the present invention is applied to a disk array device having a striping group of RAID1, RAID2, RAID3, and RAID5. It can also be applied to. Hereinafter, embodiments applied to RAID 1 and RAID 5 will be described.

図１１は、ＲＡＩＤ１構成での実施例であり、複数の磁気ディスク１００〜１１７のうち、磁気ディスク１００および１１０、磁気ディスク１０１および１１１、磁気ディスク１０２および１１２、磁気ディスク１０３および１１３、磁気ディスク１０４および１１４、磁気ディスク１０５および１１５は、それぞれＲＡＩＤ１のストライピング・グループ２０、２１、２２、２３、２４、２５を構成しており、残りの磁気ディスク１０６、１０７、１１６および１１７は予備ディスクとなっている。 FIG. 11 shows an example of a RAID 1 configuration. Among the plurality of magnetic disks 100 to 117, the magnetic disks 100 and 110, the magnetic disks 101 and 111, the magnetic disks 102 and 112, the magnetic disks 103 and 113, and the magnetic disk 104 are shown. And 114 and magnetic disks 105 and 115 constitute RAID1 striping groups 20, 21, 22, 23, 24, and 25, respectively, and the remaining magnetic disks 106, 107, 116, and 117 are spare disks. Yes.

図１２は図１１の状態においてストライピング・グループ２１、２２、２４および２５の磁気ディスク１０１、１１２、１０４および１１５が縮退し、予備ディスク１０７、１１６、１０６および１１７を用いた復旧処理が行われた後の、予備ディスクカウンタ３６１、ストライピング・テーブル３６２、各磁気ディスクの状態を示しており、この時点で予備ディスクカウンタ３６１の値は４から０へと変化している。 In FIG. 12, the magnetic disks 101, 112, 104, and 115 of the striping groups 21, 22, 24, and 25 are degenerated in the state of FIG. 11, and the recovery process using the spare disks 107, 116, 106, and 117 is performed. The state of the spare disk counter 361, the striping table 362, and each magnetic disk later is shown. At this time, the value of the spare disk counter 361 changes from 4 to 0.

次に、図１２に示すように合計６個のＲＡＩＤ１のストライピング・グループ２０〜２５が運用されており、予備ディスクが１台も残っていない状態で、何れかのストライピング・グループの構成ディスクに障害が発生した場合の動作を説明する。 Next, as shown in FIG. 12, a total of six RAID1 striping groups 20 to 25 are in operation, and there is no spare disk remaining, and there is a failure in any of the striping group's constituent disks. The operation when this occurs will be described.

起動された統合部３５の動作を図１３の動作説明図を参照して説明する。統合部３５は、まず、磁気ディスクの先頭領域から順に、所定単位毎に、ディスク制御部３２を介してストライピング・グループ２０の何れか一方の磁気ディスク（例えば１１０）に記憶されたデータＤ００と、ストライピング・グループ２１の正常ディスク１１１のデータＤ０１とをリードして図示しないバッファに格納し、このデータＤ００とＤ０１の排他的論理和をとることで、統合後のＲＡＩＤ４（Ｄ２＋Ｐ）用のパリティＰ０を生成し、このパリティＰ０をストライピング・グループ２０の磁気ディスク１００或いは１１０（図示の例は前者を示す）にライトする。以上のような動作を繰り返し、全データの統合を終えると、ストライピング・テーブル３６２中のストライピング・グループ２０およびストライピング・グループ２１の情報を更新し、ストライピング・グループ２０が、パリティＰ０を格納した磁気ディスク１００をパリティディスクとして持ち、データディスクとして、磁気ディスク１１０と１１１とを持つＲＡＩＤ４（Ｄ２＋Ｐ）に変更すると共に、ストライピング・グループ２１を未使用状態とする。 The operation of the activated integration unit 35 will be described with reference to the operation explanatory diagram of FIG. First, the integration unit 35 sequentially stores data D00 stored in one of the magnetic disks (for example, 110) of the striping group 20 via the disk control unit 32 for each predetermined unit in order from the head area of the magnetic disk. The data D01 of the normal disk 111 of the striping group 21 is read and stored in a buffer (not shown), and an exclusive OR of the data D00 and D01 is taken to obtain a parity P0 for RAID 4 (D2 + P) after integration. The parity P0 is generated and written to the magnetic disk 100 or 110 of the striping group 20 (the illustrated example indicates the former). When the above operation is repeated and integration of all data is completed, information on the striping group 20 and the striping group 21 in the striping table 362 is updated, and the striping group 20 stores the parity P0. 100 is used as a parity disk, and the data disk is changed to RAID 4 (D2 + P) having magnetic disks 110 and 111, and the striping group 21 is set to an unused state.

このときの予備ディスクカウンタ３６１、ストライピング・テーブル３６２、各磁気ディスクの状態を図１４に示す。ストライピング・グループ２０のステータスは、統合により、ＲＡＩＤ１からＲＡＩＤ４（Ｄ２＋Ｐ）に変更され、ストライピング・グループ２１のステータスは未使用に変更されている。 FIG. 14 shows the status of the spare disk counter 361, the striping table 362, and each magnetic disk at this time. The status of the striping group 20 is changed from RAID 1 to RAID 4 (D2 + P) by integration, and the status of the striping group 21 is changed to unused.

ストライピング・グループ２１以外の他のストライピング・グループ２０、２２〜２５に磁気ディスクの故障が発生した場合も、ストライピング・グループ２１と同様に冗長性を有する他のストライピング・グループと統合され、冗長性が回復される。 When a magnetic disk failure occurs in another striping group 20, 22 to 25 other than the striping group 21, the redundancy is integrated with other striping groups having redundancy in the same manner as the striping group 21. Recovered.

図１５は、ＲＡＩＤ５構成での実施例であり、複数の磁気ディスク１００〜１２５のうち、磁気ディスク１００、１１０および１２０、磁気ディスク１０１、１１１および１２１、磁気ディスク１０２、１１２および１２２、磁気ディスク１０３、１１３および１２３は、それぞれＲＡＩＤ５のストライピング・グループ２０、２１、２２、２３を構成しており、残りの磁気ディスク１０４、１０５、１１４、１１５、１２４および１２５は予備ディスクとなっている。ここで、磁気ディスク１００〜１２５の内容は、４つの記憶領域に分割されている。例えば、磁気ディスク１００の内容は（Ｄ００，Ｐ０２、Ｄ１４，Ｄ０６）である。他の実施例との違いは、ストライピング・テーブル３６２の物理ディスク番号が記載されているフィールドに、各磁気ディスクに記録されているデータの情報が格納されている点である。 FIG. 15 shows an embodiment with a RAID 5 configuration, and among the plurality of magnetic disks 100 to 125, the magnetic disks 100, 110 and 120, the magnetic disks 101, 111 and 121, the magnetic disks 102, 112 and 122, and the magnetic disk 103. , 113 and 123 constitute RAID 5 striping groups 20, 21, 22, and 23, respectively, and the remaining magnetic disks 104, 105, 114, 115, 124, and 125 are spare disks. Here, the contents of the magnetic disks 100 to 125 are divided into four storage areas. For example, the contents of the magnetic disk 100 are (D00, P02, D14, D06). A difference from the other embodiments is that information of data recorded on each magnetic disk is stored in a field in which a physical disk number of the striping table 362 is described.

図１６は図１５の状態においてストライピング・グループ２０、２１、２２および２３の磁気ディスク１２０、１０１、１２２および１１３が縮退し、予備ディスク１２４、１０５、１１４および１１５を用いた復旧処理が行われた後の、予備ディスクカウンタ３６１、ストライピング・テーブル３６２、各磁気ディスクの状態を示しており、この時点で予備ディスクカウンタ３６１の値は６から２へと変化している。なお、障害が発生した磁気ディスクを縮退し、予備ディスク上に復旧を行うとき、磁気ディスクの内容は、データであるか冗長データであるかを問わず、対応する予備ディスク上に復旧される。そのため、物理ディスク番号の記載フィールドに格納されているデータの情報は、書き換える必要はない。 In FIG. 16, the magnetic disks 120, 101, 122, and 113 of the striping groups 20, 21, 22, and 23 are degenerated in the state of FIG. 15, and the recovery process using the spare disks 124, 105, 114, and 115 is performed. The state of the spare disk counter 361, the striping table 362, and each magnetic disk later is shown. At this time, the value of the spare disk counter 361 changes from 6 to 2. When the failed magnetic disk is degenerated and restored on the spare disk, the contents of the magnetic disk are restored on the corresponding spare disk regardless of whether it is data or redundant data. Therefore, it is not necessary to rewrite the data information stored in the physical disk number description field.

図１７は図１６の状態においてストライピング・グループ２２および２３の磁気ディスク１１４および１１５が縮退し、予備ディスク１０４および１２５を用いた復旧処理が行われた後の、予備ディスクカウンタ３６１、ストライピング・テーブル３６２、各磁気ディスクの状態を示しており、この時点で予備ディスクカウンタ３６１の値は０になる。 FIG. 17 shows the spare disk counter 361 and the striping table 362 after the magnetic disks 114 and 115 of the striping groups 22 and 23 are degenerated in the state of FIG. 16 and the recovery process using the spare disks 104 and 125 is performed. The state of each magnetic disk is shown, and the value of the spare disk counter 361 becomes 0 at this time.

次に、図１７に示すように合計４個のＲＡＩＤ５（Ｄ２＋Ｐ）のストライピング・グループ２０〜２３が運用されており、予備ディスクが１台も残っていない状態で、何れかのストライピング・グループの構成ディスクに障害が発生した場合の動作を説明する。 Next, as shown in FIG. 17, a total of four RAID5 (D2 + P) striping groups 20 to 23 are operated, and no spare disk is left, and any of the striping groups is configured. The operation when a disk failure occurs will be described.

何れかのストライピング・グループの或るディスク装置、例えばストライピング・グループ２１の磁気ディスク１０５が故障すると、そのことがディスクアレイ制御部３の故障検出部３２１で検出され、処理切り分け部３３に通知される。処理切り分け部３３は、その通知を受信すると（図６のＳ３１）、故障したディスク装置１０５が属するストライピング・グループ２１の状態をストライピング・テーブル３６２から参照して冗長性を有しているかどうかを判定し（Ｓ３２）、冗長性を有するので、故障した磁気ディスク１０５を縮退する（Ｓ３３）。そして、予備ディスクカウンタ３６１の値を０と比較する（Ｓ３４）。今の場合、予備ディスクカウンタ３６１の値が０になっているため、処理切り分け部３３は、統合部３５を起動する（Ｓ３６）。このとき、処理切り分け部３３は、統合部３５に対して、ストライピング・グループ２１を第２のストライピング・グループとして指定し、第１のストライピング・グループとしては冗長性を有する他のグループ、例えばストライピング・グループ２０を指定する。 When a certain disk device of any striping group, for example, the magnetic disk 105 of the striping group 21, fails, this is detected by the failure detection unit 321 of the disk array control unit 3 and notified to the processing separation unit 33. . When receiving the notification (S31 in FIG. 6), the process isolation unit 33 refers to the state of the striping group 21 to which the failed disk device 105 belongs from the striping table 362 and determines whether or not there is redundancy. (S32) Since it has redundancy, the failed magnetic disk 105 is degenerated (S33). Then, the value of the spare disk counter 361 is compared with 0 (S34). In this case, since the value of the spare disk counter 361 is 0, the process separation unit 33 activates the integration unit 35 (S36). At this time, the process separating unit 33 designates the striping group 21 as the second striping group to the integrating unit 35, and the first striping group is another group having redundancy, such as a striping group. Group 20 is designated.

図１８はＲＡＩＤ５構成の場合の統合部３５の処理例を示すフローチャートである。統合部３５は、処理切り分け部３３から、冗長性を持つ第１のストライピング・グループと冗長性を持たない第２のストライピング・グループとが指示されて起動されると、まず、磁気ディスクの先頭領域から順に、所定単位毎に、ディスク制御部３２を介して第１のストライピング・グループのパリティと、第２のストライピング・グループの全ての正常ディスクのデータをリードして図示しないバッファに格納し（Ｓ４１）、次に、第２のストライピング・グループの正常ディスクからリードしたデータどうしの排他的論理和をとることで障害ディスクのデータを生成し（Ｓ４２）、また、第１のストライピング・グループからリードしたパリティと第２のストライピング・グループのパリティ（これは、パリティが障害ディスク以外に格納されていればステップＳ４１でリードされており、障害ディスクに格納されていればステップＳ４２で復元されている）との排他的論理和をとることで、統合後のストライピング・グループ用の新パリティを生成してバッファに格納する（Ｓ４３）。そして、ステップＳ４２で生成したデータがパリティかどうかを判別し（Ｓ４４）、パリティであれば、ステップＳ４３で生成した新パリティを第１のストライピング・グループの前記読み出したパリティに上書きする（Ｓ４５）。他方、ステップＳ４２で生成したデータがパリティでなくデータであれば、生成したデータを第２のストライピング・グループからステップＳ４１で読み出したパリティに上書きし、ステップＳ４３で生成した新パリティを第１のストライピング・グループの前記読み出したパリティに上書きする（Ｓ４６）。 FIG. 18 is a flowchart illustrating a processing example of the integration unit 35 in the case of a RAID 5 configuration. When the integration unit 35 is instructed by the process separation unit 33 to designate the first striping group having redundancy and the second striping group having no redundancy, the integration unit 35 first starts the head area of the magnetic disk. From the beginning, the parity of the first striping group and the data of all normal disks of the second striping group are read via the disk control unit 32 and stored in a buffer (not shown) for each predetermined unit (S41). Next, the data of the failed disk is generated by taking the exclusive OR of the data read from the normal disks of the second striping group (S42), and read from the first striping group. Parity and parity of the second striping group (this means that the parity is not a failed disk) If it is stored, it is read in step S41, and if it is stored in the failed disk, it is restored in step S42). Is generated and stored in the buffer (S43). Then, it is determined whether or not the data generated in step S42 is parity (S44), and if it is parity, the new parity generated in step S43 is overwritten on the read parity of the first striping group (S45). On the other hand, if the data generated in step S42 is not parity, the generated data is overwritten on the parity read in step S41 from the second striping group, and the new parity generated in step S43 is changed to the first striping. Overwrite the read parity of the group (S46).

以上の動作を繰り返し、全データの復元を終えると（Ｓ４７でＹＥＳ）、ストライピング・テーブル３６２中の第１のストライピング・グループおよび第２のストライピング・グループの情報を更新し、第１のストライピング・グループが、元の磁気ディスクに加えて、第２のストライピング・グループの正常ディスクを持つように変更すると共に、第２のストライピング・グループを未使用状態とする（Ｓ４８）。また、論理ディスクと物理ディスクとの対応などを更新する。 When the above operation is repeated and the restoration of all data is completed (YES in S47), the information on the first striping group and the second striping group in the striping table 362 is updated, and the first striping group is updated. However, in addition to the original magnetic disk, it is changed to have a normal disk in the second striping group, and the second striping group is set to an unused state (S48). Also, the correspondence between the logical disk and the physical disk is updated.

図１７の状態において、ストライピング・グループ２１の構成ディスク１０５に障害が発生し、処理切り分け部３３から、ストライピング・グループ２０を第１のストライピング・グループ、ストライピング・グループ２１を第２のストライピング・グループとして指定されて起動された統合部３５による処理が完了した状態を図１９に示す。このように、ＲＡＩＤ５構成時も、ストライピング・グループの動的変更が可能であり、これにより、データ喪失確率を低減することができる。 In the state shown in FIG. 17, a failure occurs in the configuration disk 105 of the striping group 21, and the processing grouping unit 33 sets the striping group 20 as the first striping group and the striping group 21 as the second striping group. FIG. 19 shows a state in which the processing by the integrated unit 35 specified and started is completed. As described above, even when RAID 5 is configured, the striping group can be dynamically changed, thereby reducing the data loss probability.

以上本発明の実施例について説明したが、本発明は以上の例に限定されず、その他各種の付加変更が可能である。 Although the embodiments of the present invention have been described above, the present invention is not limited to the above examples, and various other additions and modifications can be made.

本発明の第１の実施例にかかるディスクアレイ装置のブロック図である。1 is a block diagram of a disk array device according to a first exemplary embodiment of the present invention. 本発明の第１の実施例にかかるディスクアレイ装置におけるディスクアレイ制御部の構成例を示すブロック図である。It is a block diagram which shows the structural example of the disk array control part in the disk array apparatus concerning the 1st Example of this invention. 本発明の第１の実施例にかかるディスクアレイ装置におけるストライピング・テーブルの構成例を示す図である。It is a figure which shows the structural example of the striping table in the disk array apparatus concerning the 1st Example of this invention. 本発明の第１の実施例にかかるディスクアレイ装置における復旧部の処理例を示すフローチャートである。It is a flowchart which shows the example of a process of the recovery part in the disk array apparatus concerning 1st Example of this invention. 本発明の第１の実施例にかかるディスクアレイ装置における統合部の処理例を示すフローチャートである。It is a flowchart which shows the process example of the integration part in the disk array apparatus concerning 1st Example of this invention. 本発明の第１の実施例にかかるディスクアレイ装置における処理切り分け部の処理例を示すフローチャートである。It is a flowchart which shows the process example of the process isolation | separation part in the disk array apparatus concerning 1st Example of this invention. 本発明の第１の実施例にかかるディスクアレイ装置の復旧動作の説明図である。It is explanatory drawing of the recovery operation | movement of the disk array apparatus concerning 1st Example of this invention. 本発明の第１の実施例にかかるディスクアレイ装置の復旧動作の説明図である。It is explanatory drawing of the recovery operation | movement of the disk array apparatus concerning 1st Example of this invention. 本発明の第１の実施例にかかるディスクアレイ装置の統合動作の説明図である。It is explanatory drawing of the integrated operation | movement of the disk array apparatus concerning 1st Example of this invention. 本発明の第１の実施例にかかるディスクアレイ装置の統合動作の説明図である。It is explanatory drawing of the integrated operation | movement of the disk array apparatus concerning 1st Example of this invention. 本発明の第２の実施例にかかるディスクアレイ装置の構成説明図である。FIG. 6 is a configuration explanatory diagram of a disk array device according to a second example of the present invention. 本発明の第２の実施例にかかるディスクアレイ装置の復旧動作の説明図である。It is explanatory drawing of recovery operation | movement of the disk array apparatus concerning the 2nd Example of this invention. 本発明の第２の実施例にかかるディスクアレイ装置の統合動作の説明図である。It is explanatory drawing of the integration operation | movement of the disk array apparatus concerning the 2nd Example of this invention. 本発明の第２の実施例にかかるディスクアレイ装置の統合動作の説明図である。It is explanatory drawing of the integration operation | movement of the disk array apparatus concerning the 2nd Example of this invention. 本発明の第３の実施例にかかるディスクアレイ装置の構成説明図である。It is a block diagram explaining the configuration of a disk array device according to a third example of the present invention. 本発明の第３の実施例にかかるディスクアレイ装置の復旧動作の説明図である。It is explanatory drawing of recovery operation | movement of the disk array apparatus concerning the 3rd Example of this invention. 本発明の第３の実施例にかかるディスクアレイ装置の復旧動作の説明図である。It is explanatory drawing of recovery operation | movement of the disk array apparatus concerning the 3rd Example of this invention. 本発明の第３の実施例にかかるディスクアレイ装置における統合部の処理例を示すフローチャートである。It is a flowchart which shows the process example of the integration part in the disk array apparatus concerning the 3rd Example of this invention. 本発明の第３の実施例にかかるディスクアレイ装置の統合動作の説明図である。It is explanatory drawing of the integration operation | movement of the disk array apparatus concerning the 3rd Example of this invention.

Explanation of symbols

１…ディスクアレイ装置
１００〜１２７…磁気ディスク
２０〜２７…ストライピング・グループ
３…ディスクアレイ制御部
３１…ＩＯ処理部
３２…ディスク制御部
３２１…故障検出部
３３…処理切り分け部
３４…復旧部
３５…統合部
３６…記憶部
３６１…予備ディスクカウンタ
３６２…ストライピング・テーブル
４０〜４５…ディスク制御バス
５…ホストバス DESCRIPTION OF SYMBOLS 1 ... Disk array apparatus 100-127 ... Magnetic disks 20-27 ... Striping group 3 ... Disk array control part 31 ... IO process part 32 ... Disk control part 321 ... Failure detection part 33 ... Process isolation | separation part 34 ... Restoration part 35 ... Integration unit 36 ... storage unit 361 ... spare disk counter 362 ... striping tables 40 to 45 ... disk control bus 5 ... host bus

Claims

In a disk array device having a spare disk separately from a disk for storing data and redundant data,
For the striping group that has lost redundancy due to disk failure, recovery means to recover the redundancy by restoring the contents of the failed disk on the spare disk,
An integration means for integrating a non-redundant striping group with a redundant striping group and generating one redundant striping group;
When a disk failure occurs in a striping group having redundancy, the recovery unit is activated if a spare disk exists, and the separation unit is activated to activate the integration unit if no spare disk exists. A disk array device characterized by the above.

The integration means integrates a striping group having a RAID 4 (Di + P) configuration with redundancy and a striping group having a RAID 4 (Dj + P-1) configuration without redundancy to provide a RAID 4 (D (i + j) having redundancy. 2. The disk array device according to claim 1, wherein a striping group having a configuration of + P) is generated.

The integration means includes a striping group having a RAID 4 (Di + P) configuration having redundancy as a first striping group, and a striping group having a RAID 4 (Dj + P-1) configuration having no redundancy as a second striping group. When reading, the parity stored in the parity disk of the first striping group and the data of all the normal disks of the second striping group are read and stored in the buffer, and the normal disks of the second striping group are read. The data of the failed disk is generated by taking the exclusive OR of the data read from 1 and the exclusive logic of the parity read from the parity disk of the first striping group and the parity of the second striping group By taking the sum, A new parity for the combined striping group is generated and stored in the buffer, and the data of the generated failed disk is written to the read location in the parity disk of the first striping group, and the generated new parity is also generated. The striping group in which the first and second striping groups are integrated is generated by repeating the operation of writing the parity to the read location of one normal disk of the second striping group. 2. The disk array device according to 2.

The integration means integrates a RAID1 configuration striping group with redundancy and a RAID1 configuration striping group without redundancy to generate a RAID4 (D2 + P) configuration striping group with redundancy. The disk array device according to claim 1.

The integration means uses a first striping group when a RAID1 configuration striping group having redundancy is a first striping group and a RAID1 configuration striping group without redundancy is a second striping group. By reading the data stored in any one magnetic disk of the group and the data of the normal disk of the second striping group and storing them in the buffer, and taking the exclusive OR of the read data The parity for RAID 4 (D2 + P) after integration is generated, and the operation of writing this parity to any one of the magnetic disks in the first striping group is repeated, so that the first and second striping groups are changed. To create an integrated striping group. The disk array system according to claim 4, wherein.

The integration unit integrates a striping group having a RAID 5 (Di + P) configuration with redundancy and a striping group having a RAID 5 (Dj + P-1) configuration without redundancy into a RAID 5 (D (i + j) having redundancy. 2. The disk array device according to claim 1, wherein a striping group having a configuration of + P) is generated.

The integrating means includes a striping group having a RAID 5 (Di + P) configuration having redundancy as a first striping group, and a striping group having a RAID 5 (Dj + P-1) configuration having no redundancy as a second striping group. When reading, the parity of the first striping group and the data of all the normal disks of the second striping group are read and stored in the buffer, and the data read from the normal disks of the second striping group are The data of the failed disk is generated by taking the exclusive OR, and the exclusive OR of the parity read from the first striping group and the parity of the second striping group is taken. New parity for striping groups Generate and store in a buffer, determine whether the data of the generated failed disk is parity, and if so, overwrite the generated new parity with the read parity of the first striping group, If the generated failed disk data is not parity, the generated failed disk data is overwritten on the parity read from the second striping group, and the generated new parity is read from the first striping group. 7. The disk array device according to claim 6, wherein a striping group in which the first and second striping groups are integrated is generated by repeating the operation of overwriting the parity.

A computer constituting a disk array control unit of a disk array device having a spare disk separately from a disk for storing data and redundant data;
Recovery means for recovering redundancy by restoring the contents of a failed disk on a spare disk for a striping group that has lost redundancy due to a disk failure,
A means for integrating a non-redundant striping group with a redundant striping group to generate a single redundant striping group;
When a disk failure occurs in a striping group having redundancy, if a spare disk exists, the recovery unit is activated, and if there is no spare disk, a process separation unit that activates the integration unit,
Program to function as.