JP2010244130A

JP2010244130A - Disk array device and disk array control method

Info

Publication number: JP2010244130A
Application number: JP2009089267A
Authority: JP
Inventors: Masaki Kobayashi; 正樹小林
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2009-04-01
Filing date: 2009-04-01
Publication date: 2010-10-28

Abstract

<P>PROBLEM TO BE SOLVED: To maintain responsiveness to host access even when any obstacle occurs in a portion of disk devices. <P>SOLUTION: A disk array device 10 includes: a replication management unit 250 for generating a backup logical disk from an RAID array different from an RAID array configuring a master logical disk in response to a replication instruction; and an RAID control unit 260 for, when the master logical disk and the backup logical disk are put in a synchronized state to host access, and when it is detected that any obstacle occurs in the disk device configuring the master logical disk, substituting the disk device in which the obstacle has occurred in the master logical disk with the disk device in which any obstacle has not occurred in the backup logical disk, and for changing the synchronized state of the master logical disk and the backup logical disk to a separated state to the host access. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、一部のディスク装置に障害が発生してもホストアクセスに対する応答性能を維持し得るディスクアレイ装置及びディスクアレイ制御方法に関する。 The present invention relates to a disk array device and a disk array control method that can maintain response performance to host access even if a failure occurs in some of the disk devices.

ディスクアレイ装置は、ハードディスクドライブ等のディスク装置を複数台並列に接続し、それら全体を１つの記憶装置のように制御することで、データの読み書きを高速化したり、障害に対する耐久性を上げたりする。例えば、ＲＡＩＤ（Redundant Array of Inexpensive Disks）等の技術を用いることにより、一部のディスク装置に障害が生じた場合でも、その復元処理を実行することができる。 A disk array device connects multiple disk devices such as hard disk drives in parallel, and controls them as a single storage device, thereby speeding up data reading and writing and increasing durability against failures. . For example, by using a technique such as RAID (Redundant Array of Inexpensive Disks), even if a failure occurs in some of the disk devices, the restoration process can be executed.

ところで、ディスクアレイ装置は、ホスト装置からの命令に従ってデータを読み書きする外部記憶装置である。そのため、一部のディスク装置の復元処理中にホストアクセスがあると、ヘッドがランダムに動作してしまい、復元処理が遅くなる。また、ホストアクセスの応答も遅くなる。 Incidentally, the disk array device is an external storage device that reads and writes data according to instructions from the host device. Therefore, if there is a host access during the restoration process of some disk devices, the head operates randomly, and the restoration process is delayed. Also, the host access response is delayed.

特開２００２−２５９０６２号公報JP 2002-259062 A

しかしながら、ディスクアレイ装置は、常に性能低下が生じないように制御する必要がある。 However, it is necessary to control the disk array device so that performance degradation does not always occur.

本発明は上記問題に鑑みてなされたものであり、一部のディスク装置に障害が発生してもホストアクセスに対する応答性能を維持し得るディスクアレイ装置及びディスクアレイ制御方法を提供することを目的とする。 The present invention has been made in view of the above problems, and an object of the present invention is to provide a disk array device and a disk array control method capable of maintaining response performance to host access even if a failure occurs in some disk devices. To do.

本発明は上記課題を解決するために以下の手段を講じる。 The present invention takes the following means in order to solve the above problems.

第１の発明は、スライス単位で管理されるマスタ論理ディスクとバックアップ論理ディスクとが異なるディスク装置の集合から構成されるディスクアレイ装置であって、前記マスタ論理ディスク及び前記バックアップ論理ディスクのスライスを示す論理アドレスとディスク装置の物理アドレスとを対応付けたアドレス管理情報を記憶するアドレス記憶手段と、前記マスタ論理ディスクを構成するディスク装置に障害が生じたか否かを検出する障害検出手段と、前記マスタ論理ディスク及び前記バックアップ論理ディスクがホストアクセスに対して同期状態であるときに、前記障害検出手段により前記マスタ論理ディスクを構成するディスク装置に障害が生じたことが検出された場合、該マスタ論理ディスクと前記バックアップ論理ディスクとのスライスを入れ替えることにより、前記マスタ論理ディスクの障害が生じたディスク装置と前記バックアップ論理ディスクの障害が生じていないディスク装置とを入れ替える入替手段と、前記入替処理が実行された場合、前記マスタ論理ディスク及び前記バックアップ論理ディスクをホストアクセスに対して切り離し状態に変更する状態変更手段と、を備えたディスクアレイ装置である。 A first invention is a disk array device comprising a set of disk devices in which a master logical disk and a backup logical disk managed in units of slices are different, and shows slices of the master logical disk and the backup logical disk Address storage means for storing address management information in which a logical address and a physical address of a disk device are associated with each other, failure detection means for detecting whether or not a failure has occurred in a disk device constituting the master logical disk, and the master When the logical disk and the backup logical disk are in synchronization with the host access and the failure detection means detects that a failure has occurred in the disk device constituting the master logical disk, the master logical disk And the backup logical disk By replacing the slices, a replacement unit that replaces the disk device in which the failure of the master logical disk has failed and a disk device in which the failure of the backup logical disk has not occurred; and when the replacement process is executed, the master logical disk And a state changing means for changing the backup logical disk to a disconnected state with respect to host access.

第２の発明は、マスタ論理ディスク及びバックアップ論理ディスクのスライスに対応する論理アドレスとディスク装置の物理アドレスとを対応付けたアドレス管理情報を記憶するアドレス記憶手段と、レプリケーション命令に応じて、マスタ論理ディスクと、該マスタ論理ディスクを構成するディスク装置の集合とは異なるディスク装置の集合からバックアップ論理ディスクとを生成し、生成した各論理ディスクのアドレス管理情報を前記アドレス記憶手段に書き込む手段と、コピー命令に応じて、前記マスタ論理ディスクのデータを前記バックアップ論理ディスクにコピーする手段と、前記マスタ論理ディスクを構成するディスク装置に障害が生じたか否かを検出する障害検出手段と、前記マスタ論理ディスク及び前記バックアップ論理ディスクがホストアクセスに対して同期状態であるときに、前記障害検出手段により前記マスタ論理ディスクを構成するディスク装置に障害が生じたことが検出された場合、該マスタ論理ディスクと前記バックアップ論理ディスクとのスライスを入れ替えることにより、前記マスタ論理ディスクの障害が生じたディスク装置と前記バックアップ論理ディスクの障害が生じていないディスク装置とを入れ替える入替手段と、前記入替処理が実行された場合、前記マスタ論理ディスク及び前記バックアップ論理ディスクをホストアクセスに対して切り離し状態に変更する状態変更手段と、を備えたディスクアレイ装置である。 According to a second aspect of the invention, address storage means for storing address management information in which a logical address corresponding to a slice of a master logical disk and a backup logical disk and a physical address of a disk device are associated, and a master logical disk Means for generating a backup logical disk from a disk and a set of disk devices different from the set of disk devices constituting the master logical disk, and writing address management information of each generated logical disk to the address storage means; In response to an instruction, means for copying data of the master logical disk to the backup logical disk, failure detection means for detecting whether a failure has occurred in a disk device constituting the master logical disk, and the master logical disk And the backup logical data If the failure detection means detects that a failure has occurred in the disk device constituting the master logical disk when the disk is in a synchronized state with respect to host access, the master logical disk and the backup logical disk And replacing the disk device in which the failure of the master logical disk has failed and the disk device in which the failure of the backup logical disk has not occurred, and when the replacement process is executed, A disk array device comprising: a disk and a state changing unit that changes the backup logical disk to a disconnected state with respect to host access.

第３の発明は、マスタ論理ディスク及びバックアップ論理ディスクのスライスに対応する論理アドレスとディスク装置の物理アドレスとを対応付けたアドレス管理情報を記憶するアドレス記憶手段と、レプリケーション命令に応じて、マスタ論理ディスク及びバックアップ論理ディスクをディスク装置の集合から生成し、生成した各論理ディスクのアドレス管理情報を前記アドレス記憶手段に書き込む手段と、コピー命令に応じて、前記マスタ論理ディスクのデータを前記バックアップ論理ディスクにコピーする際、該マスタ論理ディスクのスライスのデータと同一のデータが前記バックアップ論理ディスクのスライスに記憶されている場合、該バックアップ論理ディスクのスライスを破棄し、前記マスタ論理ディスクを構成するディスク装置とは異なるディスク装置に前記バックアップ論理ディスクの新たなスライスを設定し、該新たなスライスを設定したバックアップ論理ディスクに前記マスタ論理ディスクのデータをコピーする手段と、前記マスタ論理ディスクを構成するディスク装置に障害が生じたか否かを検出する障害検出手段と、前記マスタ論理ディスク及び前記バックアップ論理ディスクがホストアクセスに対して同期状態であるときに、前記障害検出手段により前記マスタ論理ディスクを構成するディスク装置に障害が生じたことが検出された場合、該マスタ論理ディスクと前記バックアップ論理ディスクとのスライスを入れ替えることにより、前記マスタ論理ディスクの障害が生じたディスク装置と前記バックアップ論理ディスクの障害が生じていないディスク装置とを入れ替える入替手段と、前記入替処理が実行された場合、前記マスタ論理ディスク及び前記バックアップ論理ディスクをホストアクセスに対して切り離し状態に変更する状態変更手段と、を備えたディスクアレイ装置である。 According to a third aspect of the invention, there is provided address storage means for storing address management information in which a logical address corresponding to a slice of a master logical disk and a backup logical disk and a physical address of a disk device are associated, and a master logical disk A disk and a backup logical disk are generated from a set of disk devices, and address management information of each generated logical disk is written to the address storage means, and data of the master logical disk is transferred to the backup logical disk in response to a copy command When the same data as the slice data of the master logical disk is stored in the slice of the backup logical disk, the slice of the backup logical disk is discarded and the disk device constituting the master logical disk is copied. Means for setting a new slice of the backup logical disk in a different disk device, copying the data of the master logical disk to the backup logical disk in which the new slice is set, and a disk device constituting the master logical disk A failure detection means for detecting whether or not a failure has occurred, and a disk constituting the master logical disk by the failure detection means when the master logical disk and the backup logical disk are in a synchronized state with respect to host access. When it is detected that a failure has occurred in the device, the failure of the master logical disk and the backup logical disk have occurred by replacing the slices of the master logical disk and the backup logical disk. Not disk A disk array device comprising: replacement means for replacing a device; and state changing means for changing the master logical disk and the backup logical disk into a disconnected state with respect to host access when the replacement process is executed. .

本発明によれば、一部のディスク装置に障害が発生してもホストアクセスに対する応答性能を維持することができる。また、ディスク装置障害による予備ディスク装置へのデータ復元動作及び、障害の発生したディスク装置の交換によるデータ復元動作がホストアクセスの影響を受けずに実行できる。 According to the present invention, response performance to host access can be maintained even if a failure occurs in some disk devices. Further, the data restoration operation to the spare disk device due to the disk device failure and the data restoration operation by replacing the failed disk device can be executed without being affected by the host access.

本発明の第１の実施形態に係るディスクアレイ装置１０の構成を示す模式図である。1 is a schematic diagram showing a configuration of a disk array device 10 according to a first embodiment of the present invention. 同実施形態に係るＣＰＵ２１により実現される機能を説明するための模式図である。It is a schematic diagram for demonstrating the function implement | achieved by CPU21 which concerns on the embodiment. 同実施形態に係るアドレス管理部２４０が保持するデータ構造の一例を示す模式図である。It is a schematic diagram which shows an example of the data structure which the address management part 240 which concerns on the embodiment hold | maintains. スライスの概念を説明するための模式図である。It is a schematic diagram for demonstrating the concept of a slice. 同実施形態に係るディスクアレイ装置１０のレプリケーション方法を説明するためのフローチャートである。4 is a flowchart for explaining a replication method of the disk array device 10 according to the embodiment. 同実施形態に係るディスクアレイ装置１０に障害が生じた場合の動作を説明するためのフローチャートである。4 is a flowchart for explaining an operation when a failure occurs in the disk array device 10 according to the embodiment. 同実施形態に係るディスクアレイ装置１０に障害が生じた場合の動作を説明するためのフローチャートである。4 is a flowchart for explaining an operation when a failure occurs in the disk array device 10 according to the embodiment. 本発明の第２の本実施形態に係るレプリケーション方法を説明するためのフローチャートである。It is a flowchart for demonstrating the replication method which concerns on the 2nd this embodiment of this invention.

以下、図面を参照して本発明の実施形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

＜第１の実施形態＞
図１は本発明の第１の実施形態に係るディスクアレイ装置１０の構成を示す模式図である。ディスクアレイ装置１０は、アレイコントローラ２０と複数のＲＡＩＤアレイから構成されるプール３０とを備えている。ディスクアレイ装置１０は、接続路１を介してホスト装置５と接続し、ホスト装置５からの要求に応じてデータを記憶する。 <First Embodiment>
FIG. 1 is a schematic diagram showing the configuration of a disk array device 10 according to the first embodiment of the present invention. The disk array device 10 includes an array controller 20 and a pool 30 composed of a plurality of RAID arrays. The disk array device 10 is connected to the host device 5 via the connection path 1 and stores data in response to a request from the host device 5.

アレイコントローラ２０は、ＣＰＵ２１・プログラムメモリ２２・キャッシュメモリ２３・ホスト接続部２４・ＨＤＤ接続部２５を備えている。ＣＰＵ２１は、各種処理を実行する演算装置である。 The array controller 20 includes a CPU 21, a program memory 22, a cache memory 23, a host connection unit 24, and an HDD connection unit 25. The CPU 21 is an arithmetic device that executes various processes.

プログラムメモリ２２は、プログラム及びデータの保存領域を記憶する記憶媒体である。このプログラムメモリ２２に記憶されたプログラムがＣＰＵ２１に読み込まれることにより、後述するアレイコントローラ２０の各機能が実現される。 The program memory 22 is a storage medium that stores a storage area for programs and data. When the program stored in the program memory 22 is read by the CPU 21, each function of the array controller 20 described later is realized.

キャッシュメモリ２３は、接続路１を介してコンピュータもしくは他のデータ記憶装置からのＲＡＩＤアレイに対する入出力要求に対するバッファとして機能するものである。たとえば、ホスト装置５からのＲＡＩＤアレイへの書き込み処理は、キャッシュメモリ２３にデータがエントリされてからＲＡＩＤアレイに書き込みデータが渡される。また、読み込み処理についても、キャッシュメモリ２３にデータがエントリされていれば、そのデータをホスト装置５に送信する。また、キャッシュメモリ２３にデータがエントリされていない場合には、ＨＤＤ３０からキャッシュメモリ２３にデータが読み出される。 The cache memory 23 functions as a buffer for an input / output request to the RAID array from the computer or other data storage device via the connection path 1. For example, in the write processing from the host device 5 to the RAID array, the write data is transferred to the RAID array after the data is entered in the cache memory 23. Also for the read process, if data is entered in the cache memory 23, the data is transmitted to the host device 5. If no data is entered in the cache memory 23, the data is read from the HDD 30 to the cache memory 23.

ホスト接続部２４は、ホスト装置５とアレイコントローラ２０とを接続するインタフェースである。 The host connection unit 24 is an interface that connects the host device 5 and the array controller 20.

ＨＤＤ接続部２５は、各ＲＡＩＤアレイを構成するＨＤＤとアレイコントローラ２０とを接続するインタフェースである。 The HDD connection unit 25 is an interface that connects the HDDs constituting each RAID array and the array controller 20.

プール３０は、いわゆるハードディスクドライブ（ＨＤＤ：Hard Disk Drive）である複数のディスク装置により構成されたＲＡＩＤアレイ３１０〜３６０の集合である。ここで、論理ディスクは、定義されたスライス単位に分割される。なお、スライスは論理ディスク作成時に全容量分確保される。 The pool 30 is a set of RAID arrays 310 to 360 configured by a plurality of disk devices which are so-called hard disk drives (HDDs). Here, the logical disk is divided into defined slice units. Note that slices are reserved for the entire capacity when a logical disk is created.

本実施形態に係るディスクアレイ装置１０では、スライス単位で管理されるマスタ論理ディスクとバックアップ論理ディスクとが異なるＲＡＩＤアレイから構成される。例えば外部記憶装置としてマスタ論理ディスクが利用され、マスタ論理ディスクのデータ回復用やテープ装置へのバックアップのコピー元にバックアップ論理ディスクが利用される。 In the disk array device 10 according to the present embodiment, the master logical disk and backup logical disk managed in units of slices are configured from different RAID arrays. For example, a master logical disk is used as an external storage device, and a backup logical disk is used for data recovery of the master logical disk or a backup copy source to a tape device.

図２はＣＰＵ２１により実現される機能を説明するための模式図である。ＣＰＵ２１は、プログラムメモリ２２に記憶されたプログラムが読み込まれることにより、ホストＩ／Ｆ制御部２１０、ＩＯ管理部２２０、キャッシュメモリ管理部２３０、アドレス管理部２４０、レプリケーション管理部２５０、ＲＡＩＤ制御部２６０、およびディスクＩ／Ｆ制御部２７０として機能する。 FIG. 2 is a schematic diagram for explaining functions realized by the CPU 21. When the program stored in the program memory 22 is read, the CPU 21 reads the host I / F control unit 210, the IO management unit 220, the cache memory management unit 230, the address management unit 240, the replication management unit 250, and the RAID control unit 260. And the disk I / F control unit 270.

ホストＩ／Ｆ制御部２１０は、ホスト接続部２４を制御するものであり、ホスト装置５からのＩＯ要求およびホスト装置５へのＩＯ要求を受け付ける。ＩＯ管理部２２０は、ホスト装置５からのＩＯアクセスの管理、ＩＯアクセス領域制御を行なう。キャッシュメモリ管理部２３０は、キャッシュメモリ２３におけるＩＯアクセス管理を行なうものである。 The host I / F control unit 210 controls the host connection unit 24 and accepts an IO request from the host device 5 and an IO request to the host device 5. The IO management unit 220 performs management of IO access from the host device 5 and IO access area control. The cache memory management unit 230 performs IO access management in the cache memory 23.

アドレス管理部２４０は、マスタ論理ディスク及びバックアップ論理ディスクのスライスに対応する論理アドレスを、ＨＤＤの物理アドレスに対応付けた「アドレス管理情報」を記憶している。そして、アドレス管理部２４０は、このアドレス管理情報を用いて、論理アドレスと物理アドレスとの変換処理を行なう。例えば、アドレス管理部２４０は、ホスト装置５のＩＯ要求を受信した場合、各論理ディスクの論理アドレス（ＬＢＡ）に対応するスライスを読み出し、そのスライス管理データ構造内のＲＡＩＤアレイ番号とシーケンス番号とに変換し、ＲＡＩＤ制御部２６０に対してＩＯ処理を要求する。なお、アドレス管理部２４０が保持するデータ構造の一例は図３に示すとおりである。すなわち、アドレス管理部２４０は、論理ディスク管理データ構造として、論理ディスク番号に関連付けて、物理リストポインタを記憶している。また、スライス管理データ構造として、ＲＡＩＤアレイ番号に対応付けて、スライスシーケンス番号とネクストシーケンスポインタを記憶している。 The address management unit 240 stores “address management information” in which logical addresses corresponding to slices of the master logical disk and the backup logical disk are associated with physical addresses of the HDD. Then, the address management unit 240 uses the address management information to perform a conversion process between a logical address and a physical address. For example, when receiving an IO request from the host device 5, the address management unit 240 reads a slice corresponding to the logical address (LBA) of each logical disk, and uses the RAID array number and sequence number in the slice management data structure. Convert the data and request the RAID control unit 260 for IO processing. An example of the data structure held by the address management unit 240 is as shown in FIG. That is, the address management unit 240 stores a physical list pointer as a logical disk management data structure in association with a logical disk number. In addition, as a slice management data structure, a slice sequence number and a next sequence pointer are stored in association with the RAID array number.

さらに、アドレス管理部２４０は、ＲＡＩＤアレイの作成時に、予め定められたサイズＤで論理ディスクを分割して、スライスＳを設定する機能を有する（図４参照）。そして、ＲＡＩＤアレイ上の論理アドレス順にスライスシーケンス番号を割り振りスライス管理データ構造で管理する。具体的には、アドレス管理部２４０では、論理ディスク作成時に指定された容量分のスライスを確保し、論理ディスク管理データ構造にシーケンシャルに繋げる。ここで、確保するスライスは、任意でも良いし、ＲＡＩＤレベル等の信頼性や、構成するＨＤＤの種別・ＨＤＤの内外周を考慮する等の性能といった要素を用いて選別しても良い。 Furthermore, the address management unit 240 has a function of setting a slice S by dividing a logical disk by a predetermined size D when creating a RAID array (see FIG. 4). Then, slice sequence numbers are assigned in the order of logical addresses on the RAID array and managed by the slice management data structure. Specifically, the address management unit 240 secures slices for the capacity specified at the time of logical disk creation, and sequentially connects them to the logical disk management data structure. Here, the slice to be secured may be arbitrary, or may be selected using factors such as reliability such as a RAID level, performance such as considering the type of HDD to be configured and the inner and outer circumferences of the HDD.

なお、アドレス管理部２４０は、論理アドレスから該当アドレスが割り当てられているＲＡＩＤアレイのアドレスへの変換を可能にするが、hashやtree形式にして高速に変換可能な手段を採用してもよい。 Note that the address management unit 240 enables conversion from a logical address to an address of a RAID array to which the corresponding address is assigned, but may adopt a means capable of high-speed conversion in a hash or tree format.

レプリケーション管理部２５０は、レプリケーションの制御を行なうものである。具体的には、レプリケーション管理部２５０は、ホスト装置５からのレプリケーション命令等に応じて、マスタ論理ディスクと、そのマスタ論理ディスクを構成するＲＡＩＤアレイとは異なるＲＡＩＤアレイからバックアップ論理ディスクとを生成する。また、レプリケーション管理部２５０は、生成した各論理ディスクのアドレス管理情報をアドレス管理部２４０に書き込む。 The replication management unit 250 controls replication. Specifically, the replication management unit 250 generates a master logical disk and a backup logical disk from a RAID array different from the RAID array that constitutes the master logical disk in response to a replication command from the host device 5 or the like. . In addition, the replication management unit 250 writes the generated address management information of each logical disk in the address management unit 240.

なお、「レプリケーション」とは、マスタ論理ディスクの複製であるバックアップ論理ディスクを作成することをいう。レプリケーションの状態には、「同期状態（sync）」と「切り離し状態(split)」とがある。同期状態は、ホスト装置５からのマスタ論理ディスクに対する書き込みデータがバックアップ論理ディスクにも同時に書き込まれる状態であり（ミラーリング)、両者の内容が常に一致している。切り離し状態は、ホスト装置５からのアクセスに対して、マスタ論理ディスクだけに書き込みし、バックアップ論理ディスクには書き込まない状態のことである。 Note that “replication” means creating a backup logical disk that is a duplicate of the master logical disk. The replication state includes a “synchronization state (sync)” and a “detachment state (split)”. The synchronization state is a state in which write data from the host device 5 to the master logical disk is simultaneously written to the backup logical disk (mirroring), and the contents of both are always the same. The disconnected state is a state in which, with respect to access from the host device 5, data is written only to the master logical disk and not written to the backup logical disk.

また、レプリケーション管理部２５０は、ホスト装置５からコピー命令等を受けると、マスタ論理ディスクのデータをバックアップ論理ディスクにコピーする。ここで、レプリケーション管理部２５０は、マスタ論理ディスクのデータとバックアップ論理ディスクのデータとの差分を示す「差分管理用ビットマップ」を記憶しており、マスタ論理ディスクとバックアップ論理ディスクとをレプリケーションペアとして管理する。なお、レプリケーション管理部２５０が、レプリケーションを作成した初期状態では、マスタ論理ディスクとバックアップ論理ディスクとの差分は１００％である。また、レプリケーションを作成した初期状態では、マスタ論理ディスクとバックアップ論理ディスクとは“切り離し状態”となっている。 In addition, when receiving a copy command or the like from the host device 5, the replication management unit 250 copies the data on the master logical disk to the backup logical disk. Here, the replication management unit 250 stores a “difference management bitmap” indicating the difference between the data on the master logical disk and the data on the backup logical disk, and the master logical disk and the backup logical disk as a replication pair. to manage. In the initial state where the replication management unit 250 creates a replication, the difference between the master logical disk and the backup logical disk is 100%. Further, in the initial state where the replication is created, the master logical disk and the backup logical disk are in a “detached state”.

レプリケーション管理部２５０は、ＲＡＩＤ制御部２６０により入替処理が実行されたときに、マスタ論理ディスクとバックアップ論理ディスクとが同期状態であった場合、ホストアクセスに対して両者を切り離し状態に変更する。レプリケーション管理部２５０では、同期状態から切り離し状態に移行するときには、その時点でのキャッシュ上のダーティデータをマスタ論理ディスク及びバックアップ論理ディスクに書き込み、バックアップ側のデータを確定させる。 If the master logical disk and the backup logical disk are in a synchronized state when the replacement processing is executed by the RAID control unit 260, the replication management unit 250 changes both of them to a disconnected state for host access. When the replication management unit 250 shifts from the synchronized state to the disconnected state, the dirty data on the cache at that time is written in the master logical disk and the backup logical disk, and the data on the backup side is determined.

また、レプリケーション管理部２５０は、ＲＡＩＤ制御部２６０によりバックアップ論理ディスクの障害回復処理が実行された場合は、バックアップ論理ディスクとマスタ論理ディスクとを同期状態にする。レプリケーション管理部２５０では、切り離し状態から同期状態へ移行するときには、差分管理用ビットマップを参照して、差分と判定される領域のデータをマスタ論理ディスクからバックアップ論理ディスクにコピーする。 In addition, when the RAID control unit 260 performs failure recovery processing for the backup logical disk, the replication management unit 250 brings the backup logical disk and the master logical disk into a synchronized state. When the replication management unit 250 shifts from the disconnected state to the synchronous state, the replication management unit 250 refers to the difference management bitmap and copies the data of the area determined to be the difference from the master logical disk to the backup logical disk.

ＲＡＩＤ制御部２６０は、ＲＡＩＤに関する制御を行なうものであり、障害検出機能とスライス入替機能と障害回復機能とを有している。ＲＡＩＤ制御部２６０は、障害検出機能により、マスタ論理ディスクを構成するディスク装置に障害が生じたか否かを検出する。 The RAID control unit 260 performs control related to RAID and has a failure detection function, a slice replacement function, and a failure recovery function. The RAID control unit 260 detects whether or not a failure has occurred in the disk device constituting the master logical disk by the failure detection function.

また、ＲＡＩＤ制御部２６０は、マスタ論理ディスク及びバックアップ論理ディスクがホストアクセスに対して同期状態であるときに、マスタ論理ディスクを構成するディスク装置に障害が生じたことが検出された場合、スライス入替機能によりそのマスタ論理ディスクとバックアップ論理ディスクとのスライスを入れ替えることにより、ＨＤＤの障害により縮退したマスタ論理ディスクのＲＡＩＤアレイをバックアップ論理ディスクのＲＡＩＤアレイと入れ替える。 Further, when the RAID controller 260 detects that a failure has occurred in the disk device constituting the master logical disk when the master logical disk and the backup logical disk are in synchronization with the host access, the slice replacement is performed. By replacing the slices of the master logical disk and the backup logical disk by the function, the RAID array of the master logical disk that has been degraded due to the failure of the HDD is replaced with the RAID array of the backup logical disk.

一方、ＲＡＩＤ制御部２６０は、マスタ論理ディスク及びバックアップ論理ディスクがホストアクセスに対して切り離し状態であるときに、マスタ論理ディスクを構成するディスク装置に障害が生じたことが検出された場合、差分管理用ビットマップに基づいて、マスタ論理ディスクとバックアップ論理ディスクとのスライスのうち、差分が生じていないスライスに対応するＲＡＩＤアレイを入れ替える。 On the other hand, when the RAID controller 260 detects that a failure has occurred in the disk device constituting the master logical disk when the master logical disk and the backup logical disk are disconnected from the host access, the RAID controller 260 performs differential management. Based on the bitmap for use, the RAID array corresponding to the slice in which no difference is generated among the slices of the master logical disk and the backup logical disk is replaced.

また、ＲＡＩＤ制御部２６０は、障害回復機能により、レプリケーション管理部２５０により切り離し状態になったバックアップ論理ディスクに対し障害回復処理を実行する。
ディスクＩ／Ｆ制御部２７０は、ＨＤＤ接続部２５を制御するものであり、各ＲＡＩＤアレイ３１０〜３６０に対するアクセスを制御する。 Further, the RAID control unit 260 executes failure recovery processing for the backup logical disk that has been disconnected by the replication management unit 250 by the failure recovery function.
The disk I / F control unit 270 controls the HDD connection unit 25 and controls access to each of the RAID arrays 310 to 360.

次に本実施形態に係るディスクアレイ装置１０の動作を説明する。
ディスクアレイ装置１０では、レプリケーション管理部２５０が、マスタ論理ディスクの複製であるバックアップ論理ディスクを生成する。レプリケーションの初期状態ではデータの差分が１００％の状態であるので、マスタ論理ディスクからバックアップ論理ディスクにデータを全面コピーする。 Next, the operation of the disk array device 10 according to the present embodiment will be described.
In the disk array device 10, the replication management unit 250 generates a backup logical disk that is a duplicate of the master logical disk. Since the data difference is 100% in the initial replication state, the entire data is copied from the master logical disk to the backup logical disk.

図５は本実施形態に係るディスクアレイ装置１０のレプリケーション方法を説明するためのフローチャートである。
まず、ホスト装置５からのレプリケーション命令を受けると、レプリケーション管理部２５０が、コピーアドレスが論理ディスクをスライスサイズＤでアラインした先頭アドレスまたは、コピーアドレス＋コピーサイズがスライスサイズDでアラインした境界を跨ぐか否かを判定する（Ｓ１）。 FIG. 5 is a flowchart for explaining the replication method of the disk array device 10 according to the present embodiment.
First, upon receiving a replication command from the host device 5, the replication management unit 250 straddles the start address where the copy address is aligned with the logical disk with the slice size D or the boundary where the copy address + copy size is aligned with the slice size D. It is determined whether or not (S1).

判定の結果、境界を跨いでいる場合には、コピー領域が新規スライスに到達したことになる。この場合、レプリケーション管理部２５０は、マスタ論理ディスク及びバックアップ論理ディスクのＲＡＩＤアレイ番号をアドレス管理部２４０から読み出す。この際、マスタ論理ディスクとバックアップ論理ディスクとのＲＡＩＤアレイ番号が同一である場合（Ｓ２−Ｙｅｓ）、レプリケーション管理部２５０は、バックアップスライスを破棄し、マスタ論理ディスクとは異なるＲＡＩＤアレイから新規スライスを確保する（Ｓ３）。 As a result of the determination, if the boundary is crossed, the copy area has reached a new slice. In this case, the replication management unit 250 reads the RAID array numbers of the master logical disk and the backup logical disk from the address management unit 240. At this time, if the RAID array numbers of the master logical disk and the backup logical disk are the same (S2-Yes), the replication management unit 250 discards the backup slice and creates a new slice from a RAID array different from the master logical disk. Secure (S3).

そして、レプリケーション管理部２５０は、マスタ論理ディスクに属するＲＡＩＤアレイ以外から確保したスライスをバックアップ論理ディスクに新たに割り当て（Ｓ４）、マスタ論理ディスクのデータをコピーする（Ｓ５）。この際、バックアップ論理ディスクのＲＡＩＤアドレスを更新する（Ｓ６）。そして、この制御をマスタ論理ディスクの全スライスのコピーが完了するまで実行する（Ｓ７）。 Then, the replication management unit 250 newly allocates a slice secured from other than the RAID array belonging to the master logical disk to the backup logical disk (S4), and copies the data of the master logical disk (S5). At this time, the RAID address of the backup logical disk is updated (S6). This control is executed until copying of all slices of the master logical disk is completed (S7).

これにより、マスタ論理ディスクのデータをバックアップ論理ディスクにコピーしたときには、マスタ論理ディスクとバックアップ論理ディスクとを構成するＲＡＩＤアレイが必ず異なることとなる。 As a result, when the data of the master logical disk is copied to the backup logical disk, the RAID arrays constituting the master logical disk and the backup logical disk are always different.

次に、ディスクアレイ装置１０に障害が生じた場合の動作を説明する。
ＲＡＩＤアレイがそれを構成するＨＤＤの障害により縮退した場合、スペアディスクへの復元を行なう。この復元は、障害が生じたＨＤＤのデータを残ったシャドウＨＤＤから単純にコピーすることや（ＲＡＩＤ１）、残ったＨＤＤのデータ及びパリティを読み出して再生し（ＲＡＩＤ５）、スペアＨＤＤに書き込む制御により行なわれる。 Next, the operation when a failure occurs in the disk array device 10 will be described.
When the RAID array is degenerated due to a failure of the HDD constituting the RAID array, restoration to the spare disk is performed. This restoration is performed by simply copying the data of the failed HDD from the remaining shadow HDD (RAID 1), reading and reproducing the remaining HDD data and parity (RAID 5), and writing to the spare HDD. It is.

図６は本実施形態に係るディスクアレイ装置１０に障害が生じた場合の動作を説明するためのフローチャートである。ここでは、障害時のレプリケーション状態が“同期状態”であった場合を説明する。
まず、ＲＡＩＤ制御部２６０によりＨＤＤに障害が生じたことが検出されると、マスタ論理ディスク及びバックアップ論理ディスクを構成しているスライスのＲＡＩＤアレイ番号を論理アドレスの昇順にアドレス管理部２４０から読み込み、障害が生じたスライスのＲＡＩＤアレイ番号を確認する（Ｔ１）。この際、マスタ論理ディスクのスライスが縮退したＲＡＩＤアレイに属していた場合、バックアップ論理ディスクは縮退したＲＡＩＤアレイに属していない。マスタ論理ディスクとバックアップ論理ディスクとは異なるＲＡＩＤアレイにより構成されているからである。 FIG. 6 is a flowchart for explaining the operation when a failure occurs in the disk array device 10 according to the present embodiment. Here, a case where the replication state at the time of failure is “synchronized state” will be described.
First, when it is detected by the RAID control unit 260 that a failure has occurred in the HDD, the RAID array numbers of the slices constituting the master logical disk and the backup logical disk are read from the address management unit 240 in ascending order of logical addresses, The RAID array number of the slice in which the failure has occurred is confirmed (T1). At this time, if the master logical disk slice belongs to the degraded RAID array, the backup logical disk does not belong to the degraded RAID array. This is because the master logical disk and the backup logical disk are configured by different RAID arrays.

このような場合（Ｔ２−Ｙｅｓ）、ＲＡＩＤ制御部２６０が、マスタ論理ディスクにおける縮退したＲＡＩＤアレイに対応するスライスを、バックアップ論理ディスクのスライスと入れ替える（Ｔ３）。 In such a case (T2-Yes), the RAID control unit 260 replaces the slice corresponding to the degenerated RAID array in the master logical disk with the slice of the backup logical disk (T3).

そして、この操作をマスタ論理ディスクの全領域に対して行なう（Ｔ４，Ｔ５）。スライスの入れ替えが1組でも生じている場合（Ｔ６−Ｙｅｓ）は、レプリケーション管理部２５０が、マスタ論理ディスクとバックアップ論理ディスクとを切り離し状態に変更する（Ｔ７）。これにより、マスタ論理ディスクを構成するスライス群に復元中のＲＡＩＤアレイに属するスライスが存在しなくなる。従って、前述のマスタ論理ディスクに対するホストＩ／Ｏと復元処理との相互の影響を排除することが出来る。 Then, this operation is performed on the entire area of the master logical disk (T4, T5). If even one set of slice replacement occurs (T6-Yes), the replication management unit 250 changes the master logical disk and the backup logical disk to a disconnected state (T7). As a result, no slice belonging to the RAID array being restored exists in the slice group constituting the master logical disk. Accordingly, it is possible to eliminate the mutual influence between the host I / O and the restoration process on the master logical disk.

図７は本実施形態に係るディスクアレイ装置１０に障害が生じた場合の動作を説明するためのフローチャートである。ここでは、障害時のレプリケーション状態が“切り離し状態”であった場合を説明する。
この場合も、ＲＡＩＤ制御部２６０が、マスタ論理ディスク及びバックアップ論理ディスクを構成しているスライスのＲＡＩＤアレイ番号を論理アドレスの昇順にアドレス管理部２４０から読み込み、障害が生じたスライスのＲＡＩＤアレイ番号を確認する（Ｕ１）。 FIG. 7 is a flowchart for explaining the operation when a failure occurs in the disk array device 10 according to the present embodiment. Here, a case will be described in which the replication state at the time of the failure is “detached state”.
Also in this case, the RAID control unit 260 reads the RAID array numbers of the slices constituting the master logical disk and the backup logical disk from the address management unit 240 in the ascending order of the logical addresses, and the RAID array numbers of the failed slices are read. Confirm (U1).

この際、マスタ論理ディスクのスライスが縮退したＲＡＩＤアレイに属していた場合（Ｕ２−Ｙｅｓ）、差分管理用ビットマップにより、該当スライスが割り当てられた論理ディスク上の領域に差分が生じているか否かを判定する（Ｕ３）。 At this time, if the slice of the master logical disk belongs to the degenerated RAID array (U2-Yes), whether or not a difference has occurred in the area on the logical disk to which the corresponding slice is assigned by the difference management bitmap. Is determined (U3).

判定の結果、差分が生じていない場合（Ｕ３−Ｙｅｓ）は、ＲＡＩＤ制御部２６０が、マスタ論理ディスクにおける縮退したＲＡＩＤアレイに対応するスライスを、バックアップ論理ディスクのスライスと入れ替える（Ｕ４）。そして、この操作をマスタ論理ディスクの全領域に対して行なう（Ｕ５，Ｕ６）。 If no difference is found as a result of the determination (U3-Yes), the RAID control unit 260 replaces the slice corresponding to the degenerated RAID array in the master logical disk with the slice of the backup logical disk (U4). This operation is performed on the entire area of the master logical disk (U5, U6).

これにより、マスタ論理ディスクを構成するスライス群の中で復元中のＲＡＩＤアレイに属するスライスを減らすことが可能となる。従って、前述のマスタ論理ディスクに対するホストＩ／Ｏと復元処理との相互の影響を減少させることが出来る。 As a result, it is possible to reduce the number of slices belonging to the RAID array being restored in the slice group constituting the master logical disk. Accordingly, it is possible to reduce the mutual influence between the host I / O and the restoration processing on the master logical disk.

以上説明したように、本実施形態に係るディスクアレイ装置１０は、レプリケーション命令に応じて、マスタ論理ディスクを構成するＲＡＩＤアレイとは異なるＲＡＩＤアレイからバックアップ論理ディスクを生成するレプリケーション管理部２５０と、マスタ論理ディスク及びバックアップ論理ディスクがホストアクセスに対して同期状態であるときに、マスタ論理ディスクを構成するディスク装置に障害が生じたことが検出された場合、マスタ論理ディスクの障害が生じたディスク装置とバックアップ論理ディスクの障害が生じていないディスク装置とを入れ替え、マスタ論理ディスク及びバックアップ論理ディスクをホストアクセスに対して切り離し状態に変更するＲＡＩＤ制御部２６０とを備えているので、障害が発生したディスク装置をマスタ論理ディスクから除くとともに、障害が発生したディスク装置へのホストアクセスを遮断することができ、一部のディスク装置に障害が発生してもホストアクセスに対する応答性能を維持することができる。 As described above, the disk array device 10 according to the present embodiment includes the replication management unit 250 that generates a backup logical disk from a RAID array that is different from the RAID array that configures the master logical disk, and the master in response to the replication command. When it is detected that a failure has occurred in the disk device constituting the master logical disk when the logical disk and the backup logical disk are in synchronization with the host access, the disk device in which the failure of the master logical disk has occurred A disk in which a failure has occurred because it includes a RAID controller 260 that replaces the disk device in which no failure has occurred in the backup logical disk and changes the master logical disk and the backup logical disk to a disconnected state for host access. With the exception of location from a master logical disk, it is possible to block the host access to the disk device failed, even if a part of the disk drive fails can maintain the response performance to the host access.

要するに、本実施形態に係るディスクアレイ装置１０では、マスタ論理ディスクをバックアップする際、異なるＲＡＩＤアレイに属するスライスからミラーリングされるようにバックアップ論理ディスクを構成する。ホスト装置からスナップ取得要求受信時にキャッシュ上のダーティデータが両ディスクに反映されるようにする。そして、あるＲＡＩＤアレイの障害時に、マスタ論理ディスクとバックアップ論理ディスクとが同期状態である場合は、障害が発生したＲＡＩＤアレイに属するスライスのうち、マスタ論理ディスクに属するスライスを、ミラーリングされたバックアップ論理ディスクのスライスと入れ替え、レプリケーションを切り離し状態にする。それゆえ、ＲＡＩＤアレイに障害が発生しても、マスタ論理ディスクには縮退したＲＡＩＤアレイのスライスが存在しなくなるので、ディスク装置の故障による復元性能の低下を回避できる。また、レプリケーションを切り離し状態にするため、マスタ論理ディスクに対してのみホストアクセスの処理をすることになる。そのため、同期状態に比して、ホストアクセスに対する応答性能の低下を防ぐことができる。 In short, in the disk array device 10 according to the present embodiment, when backing up the master logical disk, the backup logical disk is configured to be mirrored from slices belonging to different RAID arrays. When the snap acquisition request is received from the host device, the dirty data in the cache is reflected on both disks. If the master logical disk and the backup logical disk are in synchronization when a certain RAID array fails, the slices belonging to the master logical disk among the slices belonging to the failed RAID array are mirrored to the backup logical disk. Replace the disk with a slice, and put replication in a disconnected state. Therefore, even if a failure occurs in the RAID array, the master logical disk does not have a degenerated RAID array slice, so that a reduction in restoration performance due to a disk device failure can be avoided. In addition, in order to put replication in a disconnected state, host access processing is performed only for the master logical disk. Therefore, it is possible to prevent a decrease in response performance to host access as compared with the synchronization state.

なお、ディスクアレイ装置１０は、ＲＡＩＤ制御部２６０により、切り離し状態になったバックアップ論理ディスクに対し障害回復処理を実行する。そして、バックアップ論理ディスクを障害回復した場合、バックアップ論理ディスクとマスタ論理ディスクとを同期状態にする。これにより、マスタ論理ディスクを稼動している間に、縮退が生じたＲＡＩＤアレイの障害回復処理を実行できる。それゆえ、このようなディスクアレイ装置によれば、ＲＡＩＤアレイの縮退に伴うスペアディスクの復元処理によるホストアクセス性能への影響及び、スペアディスクへの復元性能に対するホストアクセスからの影響の相互干渉を最小限に留めることが出来る。 In the disk array device 10, the RAID control unit 260 executes failure recovery processing for the backup logical disk that has been disconnected. When the backup logical disk is recovered from a failure, the backup logical disk and the master logical disk are synchronized. Thereby, it is possible to execute failure recovery processing for the RAID array in which degeneration occurs while the master logical disk is operating. Therefore, according to such a disk array device, the influence on the host access performance by the restoration process of the spare disk accompanying the degeneration of the RAID array and the mutual interference of the influence from the host access on the restoration performance to the spare disk are minimized. Can be limited.

＜第２の実施形態＞
図１を参照して、第２の実施形態に係るレプリケーション方法を説明する。本実施形態では、論理ディスク構成方法が仮想的であることを想定している。仮想的であるとは、論理ディスクの構成時に全てのスライスを割り当てるのでは無く、ホスト装置から書き込みのあった部分のみスライスを順次割り当てていくことにより容量効率を上げる方式である。 <Second Embodiment>
A replication method according to the second embodiment will be described with reference to FIG. In this embodiment, it is assumed that the logical disk configuration method is virtual. The term “virtual” refers to a method of increasing capacity efficiency by not allocating all slices at the time of logical disk configuration but by allocating slices sequentially only to the portion written from the host device.

レプリケーション管理部２５０は、レプリケーション命令に応じて、マスタ論理ディスク及びバックアップ論理ディスクをＲＡＩＤアレイから生成し、生成した各論理ディスクのアドレス管理情報をアドレス管理部２４０に書き込む。 In response to the replication command, the replication management unit 250 generates a master logical disk and a backup logical disk from the RAID array, and writes the generated address management information of each logical disk to the address management unit 240.

また、レプリケーション管理部２５０は、コピー命令に応じて、マスタ論理ディスクのデータをバックアップ論理ディスクにコピーする際、マスタ論理ディスクのスライスのデータと同一のデータがバックアップ論理ディスクのスライスに記憶されている場合、バックアップ論理ディスクのスライスを破棄する。そして、マスタ論理ディスクを構成するディスク装置とは異なるディスク装置にバックアップ論理ディスクの新たなスライスを設定し、新たなスライスを設定したバックアップ論理ディスクにマスタ論理ディスクのデータをコピーする。 Further, when the replication management unit 250 copies the master logical disk data to the backup logical disk according to the copy command, the same data as the slice data of the master logical disk is stored in the backup logical disk slice. If so, destroy the slice of the backup logical disk. Then, a new slice of the backup logical disk is set in a disk device different from the disk device constituting the master logical disk, and the master logical disk data is copied to the backup logical disk in which the new slice is set.

図８は本実施形態に係るレプリケーション方法を説明するためのフローチャートである。
まず、ホスト装置５からコピー命令を受けると、レプリケーション管理部２５０が、コピーアドレスが論理ディスクをスライスサイズＤでアラインした先頭アドレスまたは、コピーアドレス＋コピーサイズがスライスサイズＤでアラインした境界を跨ぐか否かを判定する（Ｖ１）。 FIG. 8 is a flowchart for explaining the replication method according to this embodiment.
First, when a copy command is received from the host device 5, the replication management unit 250 determines whether the copy address crosses the start address where the logical disk is aligned with the slice size D or the boundary where the copy address + copy size is aligned with the slice size D. It is determined whether or not (V1).

判定の結果、境界を跨いでいる場合には、コピー領域が新規スライスに到達したことになる。この場合、レプリケーション管理部２５０は、マスタ論理ディスク及びバックアップ論理ディスクのＲＡＩＤアレイ番号が既に割り当てられているか否かをアドレス管理部２４０のアドレス管理情報を参照して確認する（Ｖ２）。 As a result of the determination, if the boundary is crossed, the copy area has reached a new slice. In this case, the replication management unit 250 checks whether the RAID array numbers of the master logical disk and the backup logical disk have already been assigned by referring to the address management information of the address management unit 240 (V2).

ここで、レプリケーション管理部２５０は、マスタ論理ディスクのＲＡＩＤアレイ番号が割り当てられていない場合であって（Ｖ２−Ｎｏ）、バックアップ論理ディスクのＲＡＩＤアレイ番号が既に割り当てられているとき（Ｖ３−Ｙｅｓ）は、バックアップ論理ディスクのスライスを破棄し、当該スライスサイズ分のコピーをスキップする（Ｖ４）。 Here, the replication management unit 250 is a case where the RAID array number of the master logical disk is not assigned (V2-No), and the RAID array number of the backup logical disk is already assigned (V3-Yes). Discards the slice of the backup logical disk and skips copying for the slice size (V4).

また、レプリケーション管理部２５０は、バックアップ論理ディスクのＲＡＩＤアレイ番号のみ割り当てられていない場合（Ｖ２−Ｙｅｓ，Ｖ５−Ｎｏ）、マスタ論理ディスクのＲＡＩＤアレイとは異なるＲＡＩＤアレイから新たなスライスを確保して（Ｖ８）、コピーを実行する（Ｖ９）。そして、バックアップ論理ディスクのＲＡＩＤアレイ番号と論理アドレスとの対応関係を更新し（Ｖ１０）、全領域のコピーが終了するまで上記処理を繰り返す（Ｖ１１−Ｎｏ）。 If only the RAID array number of the backup logical disk is not assigned (V2-Yes, V5-No), the replication management unit 250 secures a new slice from a RAID array different from the RAID array of the master logical disk. (V8) Copy is executed (V9). Then, the correspondence relationship between the RAID array number of the backup logical disk and the logical address is updated (V10), and the above processing is repeated until the copying of all areas is completed (V11-No).

一方、レプリケーション管理部２５０は、マスタ論理ディスクとバックアップ論理ディスクとの双方にＲＡＩＤアレイ番号が既に割り当て済みであった場合（Ｖ２−Ｙｅｓ，Ｖ５−Ｙｅｓ）、両ＲＡＩＤアレイ番号が同一であるか否かを判定する（Ｖ６）。そして、判定の結果、同一のＲＡＩＤアレイ番号である場合（Ｖ６−Ｙｅｓ）、アドレス管理部２４０にバックアップ論理ディスクのスライスを破棄するとともに、バックアップ論理ディスクに対し、異なるＲＡＩＤアレイから新規のスライスを確保するようにする（Ｖ７）。 On the other hand, if the RAID array number has already been assigned to both the master logical disk and the backup logical disk (V2-Yes, V5-Yes), the replication management unit 250 determines whether or not both RAID array numbers are the same. Is determined (V6). If it is determined that the RAID array numbers are the same (V6-Yes), the address management unit 240 discards the slice of the backup logical disk and secures a new slice from a different RAID array for the backup logical disk. (V7).

以上説明したように、本実施形態に係るディスクアレイ装置１０は、コピー命令に応じて、マスタ論理ディスクのデータをバックアップ論理ディスクにコピーする際、マスタ論理ディスクのスライスのデータと同一のデータがバックアップ論理ディスクのスライスに記憶されている場合、バックアップ論理ディスクのスライスを破棄し、マスタ論理ディスクを構成するディスク装置とは異なるディスク装置にバックアップ論理ディスクの新たなスライスを設定し、新たなスライスを設定したバックアップ論理ディスクにマスタ論理ディスクのデータをコピーするレプリケーション管理部２５０を備えているので、マスタ論理ディスクからバックアップ論理ディスクへデータのコピーを完了したときに、レプリケーションペアの同一の論理アドレスのＲＡＩＤアレイが必ず異なることを保障できる。 As described above, the disk array device 10 according to the present embodiment backs up the same data as the slice data of the master logical disk when copying the data of the master logical disk to the backup logical disk according to the copy command. If it is stored in a logical disk slice, discard the backup logical disk slice, set a new slice for the backup logical disk on a disk unit different from the disk unit that configures the master logical disk, and set a new slice. Since the replication manager 250 is provided to copy the master logical disk data to the backup logical disk, the same logical address of the replication pair when the data copy from the master logical disk to the backup logical disk is completed. Vinegar RAID array can guarantee always different.

また、ディスクアレイ装置１０に障害が生じた場合の動作は第１の実施形態と同様である。すなわち、ＲＡＩＤ制御部２６０が、マスタ論理ディスク及びバックアップ論理ディスクがホストアクセスに対して同期状態であるときに、マスタ論理ディスクを構成するディスク装置に障害が生じたことが検出された場合、マスタ論理ディスクの障害が生じたディスク装置をバックアップ論理ディスクの障害が生じていないディスク装置と入れ替え、マスタ論理ディスク及びバックアップ論理ディスクをホストアクセスに対して切り離し状態に変更する。 The operation when a failure occurs in the disk array device 10 is the same as in the first embodiment. That is, when the RAID controller 260 detects that a failure has occurred in the disk device constituting the master logical disk when the master logical disk and the backup logical disk are in synchronization with the host access, the master logical disk The disk device in which the disk failure has occurred is replaced with a disk device in which the failure of the backup logical disk has not occurred, and the master logical disk and the backup logical disk are changed to the disconnected state for host access.

よって、本実施形態に係るディスクアレイ装置１０では、データをコピーするときに、マスタ論理ディスクとバックアップ論理ディスクとが異なるディスク装置の集合から構成されていなくても、障害が発生したディスク装置をマスタ論理ディスクから除くとともに、障害が発生したディスク装置へのホストアクセスを遮断することができる。 Therefore, in the disk array device 10 according to the present embodiment, when copying data, even if the master logical disk and the backup logical disk are not composed of different sets of disk devices, the failed disk device is designated as the master. In addition to being removed from the logical disk, host access to the disk device in which the failure has occurred can be blocked.

＜その他＞
なお、本発明は、上記実施形態に限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合わせにより種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。更に、異なる実施形態に構成要素を適宜組み合わせてもよい。 <Others>
In addition, this invention is not limited to the said embodiment, In the implementation stage, it can embody by modifying a component in the range which does not deviate from the summary. In addition, various inventions can be formed by appropriately combining a plurality of components disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment. Furthermore, you may combine a component suitably in different embodiment.

なお、上記実施形態に記載した手法は、コンピュータに実行させることのできるプログラムとして、磁気ディスク、光ディスク、光磁気ディスク、半導体メモリなどの記憶媒体に格納して頒布することもできる。また、この記憶媒体としては、プログラムを記憶でき、かつコンピュータが読み取り可能な記憶媒体であれば、その記憶形式は何れの形態であっても良い。 The method described in the above embodiment can be stored and distributed in a storage medium such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory as a program that can be executed by a computer. In addition, as long as the storage medium can store a program and can be read by a computer, the storage format may be any form.

また、記憶媒体からコンピュータにインストールされたプログラムの指示に基づきコンピュータ上で稼働しているオペレーティングシステムや、データベース管理ソフト、ネットワークソフト等のミドルウェア等が上記実施形態を実現するための各処理の一部を実行しても良い。 Further, an operating system running on the computer based on instructions of a program installed in the computer from the storage medium, middleware such as database management software, network software, etc., part of each process for realizing the above embodiment May be executed.

さらに、本発明における記憶媒体は、コンピュータと独立した媒体に限らず、ＬＡＮやインターネット等により伝送されたプログラムをダウンロードして記憶または一時記憶した記憶媒体も含まれる。 Furthermore, the storage medium in the present invention is not limited to a medium independent of a computer, but also includes a storage medium in which a program transmitted via a LAN, the Internet, or the like is downloaded and stored or temporarily stored.

また、記憶媒体は１つに限らず、複数の媒体から上記実施形態における処理が実行される場合も本発明における記憶媒体に含まれ、媒体構成は何れの構成であっても良い。 Further, the number of storage media is not limited to one, and the case where the processing in the above embodiment is executed from a plurality of media is also included in the storage media in the present invention, and the media configuration may be any configuration.

尚、本発明におけるコンピュータは、記憶媒体に記憶されたプログラムに基づき、上記実施形態における各処理を実行するものであって、パソコン等の１つからなる装置、複数の装置がネットワーク接続されたシステム等の何れの構成であっても良い。 The computer according to the present invention executes each process in the above-described embodiment based on a program stored in a storage medium, and is a single device such as a personal computer or a system in which a plurality of devices are connected to a network. Any configuration may be used.

また、本発明におけるコンピュータとは、パソコンに限らず、情報処理機器に含まれる演算処理装置、マイコン等も含み、プログラムによって本発明の機能を実現することが可能な機器、装置を総称している。 In addition, the computer in the present invention is not limited to a personal computer, but includes an arithmetic processing device, a microcomputer, and the like included in an information processing device, and is a generic term for devices and devices that can realize the functions of the present invention by a program. .

５・・・ホスト装置、１０・・・ディスクアレイ装置、２０・・・アレイコントローラ、２１・・・ＣＰＵ、２２・・・プログラムメモリ、２３・・・キャッシュメモリ、２４・・・ホスト接続部、２５・・・ＨＤＤ接続部、３０・・・プール、２１０・・・ホストＩ／Ｆ制御部、２２０・・・ＩＯ管理部、２３０・・・キャッシュメモリ管理部、２４０・・・アドレス管理部、２５０・・・レプリケーション管理部、２６０・・・ＲＡＩＤ制御部、２７０・・・ディスクＩ／Ｆ制御部、３１０〜３６０・・・ＲＡＩＤアレイ。 5 ... Host device, 10 ... Disk array device, 20 ... Array controller, 21 ... CPU, 22 ... Program memory, 23 ... Cache memory, 24 ... Host connection unit, 25 ... HDD connection unit, 30 ... pool, 210 ... host I / F control unit, 220 ... IO management unit, 230 ... cache memory management unit, 240 ... address management unit, 250... Replication management unit, 260... RAID control unit, 270... Disk I / F control unit, 310 to 360.

Claims

A disk array device composed of a set of disk devices in which the master logical disk and backup logical disk managed in units of slices are different,
Address storage means for storing address management information in which logical addresses indicating slices of the master logical disk and the backup logical disk are associated with physical addresses of disk devices;
Failure detection means for detecting whether or not a failure has occurred in the disk device constituting the master logical disk;
When the master logical disk and the backup logical disk are in a synchronized state with respect to host access, when the failure detection unit detects that a failure has occurred in the disk device constituting the master logical disk, the master logical disk A replacement means for replacing the disk device in which the failure of the master logical disk has failed and the disk device in which the failure of the backup logical disk has not occurred by replacing the slices of the logical disk and the backup logical disk;
When the replacement process is executed, state changing means for changing the master logical disk and the backup logical disk to a disconnected state for host access;
A disk array device comprising:

Address storage means for storing address management information in which logical addresses corresponding to slices of a master logical disk and a backup logical disk are associated with physical addresses of disk devices;
In response to the replication command, a backup logical disk is generated from a master logical disk and a set of disk units different from the set of disk units constituting the master logical disk, and the address management information of each generated logical disk is stored in the address Means for writing to the storage means;
Means for copying data of the master logical disk to the backup logical disk in response to a copy instruction;
Failure detection means for detecting whether or not a failure has occurred in the disk device constituting the master logical disk;
When the master logical disk and the backup logical disk are in a synchronized state with respect to host access, when the failure detection unit detects that a failure has occurred in the disk device constituting the master logical disk, the master logical disk A replacement means for replacing the disk device in which the failure of the master logical disk has failed and the disk device in which the failure of the backup logical disk has not occurred by replacing the slices of the logical disk and the backup logical disk;
When the replacement process is executed, state changing means for changing the master logical disk and the backup logical disk to a disconnected state for host access;
A disk array device comprising:

The disk array device according to claim 1 or 2,
Means for executing failure recovery processing on the backup logical disk that has been put into a disconnected state by the state changing means;
Means for synchronizing the backup logical disk with the master logical disk when the backup logical disk is recovered from a failure;
A disk array device comprising:

Address storage means for storing address management information in which logical addresses corresponding to slices of a master logical disk and a backup logical disk are associated with physical addresses of disk devices;
In response to the replication command, a master logical disk and a backup logical disk are generated from a set of disk devices, and address management information of each generated logical disk is written to the address storage means;
When copying the master logical disk data to the backup logical disk in response to a copy command, if the same data as the slice data of the master logical disk is stored in the backup logical disk slice, the backup logical disk The slice of the logical disk is discarded, a new slice of the backup logical disk is set in a disk device different from the disk device constituting the master logical disk, and the master logical disk is set in the backup logical disk in which the new slice is set A means of copying the data of
Failure detection means for detecting whether or not a failure has occurred in the disk device constituting the master logical disk;
When the master logical disk and the backup logical disk are in a synchronized state with respect to host access, when the failure detection unit detects that a failure has occurred in the disk device constituting the master logical disk, the master logical disk A replacement means for replacing the disk device in which the failure of the master logical disk has failed and the disk device in which the failure of the backup logical disk has not occurred by replacing the slices of the logical disk and the backup logical disk;
When the replacement process is executed, state changing means for changing the master logical disk and the backup logical disk to a disconnected state for host access;
A disk array device comprising:

A disk having address storage means for storing address management information in which a logical address indicating a slice of the master logical disk and the backup logical disk and a physical address of the disk device are associated, and the master logical disk and the backup logical disk are different A disk array control method used for a disk array device configured by a set of devices,
A failure detection step of detecting whether or not a failure has occurred in a disk device constituting the master logical disk;
When the master logical disk and the backup logical disk are in a synchronized state with respect to host access, if the failure detection step detects that a failure has occurred in a disk device constituting the master logical disk, the master logical disk Replacing a disk device in which a failure of the master logical disk and a disk device in which a failure of the backup logical disk has not occurred by replacing slices of a logical disk and the backup logical disk;
A state change step for changing the master logical disk and the backup logical disk to a disconnected state for host access when the replacement process is executed;
A disk array control method comprising: