JP2000076021A

JP2000076021A - Disk array control method and disk array device

Info

Publication number: JP2000076021A
Application number: JP10242000A
Authority: JP
Inventors: Atsushi Miyauchi; 敦宮内; Seishi Tomonaga; 誠史友永
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1998-08-27
Filing date: 1998-08-27
Publication date: 2000-03-14

Abstract

PROBLEM TO BE SOLVED: To provide a method and a device for controlling disk array with which data access is accelerated when the data is read by providing a data arranging method considering the characteristics of disk storage devices compos ing a disk array. SOLUTION: Devices having the disks of zone bit recording(ZBR) system are used for disk storage devices 12-1 to 12-5, a storage area for data to be originally stored is arranged on the outer peripheral side of disk and a storage area on a disk for redundant data is arranged on the inner peripheral side of disk than the storage area for such data. Concerning the ZBR system, data transfer speed is accelerated toward the outer periphery and by arranging the data to be originally stored on the outer peripheral side of disk storage devices 12-1 to 12-5, data transfer speed is accelerated.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明はディスクアレイ制御
方法及びディスクアレイ装置に関し、さらに詳しくは、
情報配信型サーバまたはデータベース用サーバに用いて
好適なディスクアレイ制御方法及びディスクアレイ装置
に関する。[0001] 1. Field of the Invention [0002] The present invention relates to a disk array control method and a disk array device,
The present invention relates to a disk array control method and a disk array device suitable for use in an information distribution type server or a database server.

【０００２】[0002]

【従来の技術】近年のコンピュータシステムの発展に伴
い、外部記憶装置に対する要求も高まっている。その要
求に応えるべく開発されたのがディスクアレイ装置であ
る。ディスクアレイ装置は、複数のディスク記憶装置を
内蔵することにより大容量化を図り、各ディスク記憶装
置を並列に動かすことによって、記録あるいは再生を高
速化すると共に、冗長データを付加することにより、信
頼性を上げている。2. Description of the Related Art With the recent development of computer systems, demands for external storage devices have been increasing. A disk array device has been developed to meet the demand. A disk array device achieves a large capacity by incorporating a plurality of disk storage devices, and speeds up recording or reproduction by moving each disk storage device in parallel, and adds redundant data to achieve reliability. I'm raising sex.

【０００３】このようなディスクアレイ装置はＲＡＩＤ
（Redundant Arrays of Inexpensive Disks ）と呼ば
れ、ＵＣバークレイ校の David A. Patterson らによる
「A Case for Redundant Arrays of Inexpensive Disks
(RAID)」UCB/CSD 87/391(December 1987) において、そ
の基本概念が紹介されている。[0003] Such a disk array device has a RAID
(Redundant Arrays of Inexpensive Disks) by David A. Patterson and others at UC Berkeley's A Case for Redundant Arrays of Inexpensive Disks.
(RAID) ”UCB / CSD 87/391 (December 1987) introduces the basic concept.

【０００４】その論文には、本来記憶すべきデータに冗
長データ（Parity）を付加して記憶することにより、デ
ィスク記憶装置がたとえ１台故障したときでも、その冗
長データを使い、読み出せないデータを演算により回復
することができることや、ディスク記憶装置に並列にア
クセスすることによりデータアクセスの高速化を実現す
る方法が記述されている。[0004] In that paper, by adding redundant data (Parity) to data to be originally stored and storing the data, even if one of the disk storage devices fails, the redundant data can be used and read out. And a method for realizing high-speed data access by accessing disk storage devices in parallel.

【０００５】しかし、冗長データの配置方法について
は、図８で示すＲＡＩＤ−３方式のように、ディスクア
レイ装置を構成するディスク記憶装置１２−１〜１２−
（Ｎ＋１）のＮ＋１台の内、１台、例えばディスク記憶
装置１２−（Ｎ＋１）を全て冗長データ用のディスク記
憶装置として使用しているために、データ読み出し時の
並列性がＮ＋１倍ではなくＮ倍に制約されたり、図９で
示すＲＡＩＤ−５方式のように、冗長データを、ディス
ク記憶装置１２−１〜１２−（Ｎ＋１）のそれぞれのデ
ィスク全面にバラバラに配置する方式を使うことによ
り、データ読み出し時に最悪の場合は、ディスクの内周
から外周までのフルシークが起こり得るため、アクセス
速度が遅くなることがある。However, regarding the method of arranging redundant data, as in the RAID-3 system shown in FIG.
Since one of the (N + 1) N + 1 units, for example, the disk storage device 12- (N + 1) is all used as the disk storage device for redundant data, the parallelism at the time of data reading is not N + 1 times but N times. By using a method of arranging redundant data randomly on the entire surface of each of the disk storage devices 12-1 to 12- (N + 1) as in the RAID-5 method shown in FIG. In the worst case when reading data, a full seek from the inner circumference to the outer circumference of the disk may occur, and the access speed may be reduced.

【０００６】なお、ＲＡＩＤ−３方式はビット単位のス
トライピングであるのに対し、ＲＡＩＤ−５方式はブロ
ック単位のストライピングであり、図９に示す各々のブ
ロックはデータブロックを示すものとする。The RAID-3 system is striping in units of bits, while the RAID-5 system is striping in units of blocks, and each block shown in FIG. 9 indicates a data block.

【０００７】また、ディスクアレイ装置に内蔵される個
々のディスク記憶装置に記憶されるデータを、本来記憶
すべきデータと冗長データに分けて考えると、本来記憶
すべきデータや冗長データを個々のディスク記憶装置の
アクセス特性を考慮した配置にする方法に関しては、従
来の文献では言及されていない。When data stored in individual disk storage devices incorporated in a disk array device is divided into data to be stored and redundant data, data to be stored and redundant data are stored in individual disk storage devices. A conventional document does not mention a method of arranging storage devices in consideration of access characteristics.

【０００８】[0008]

【発明が解決しようとする課題】本発明の課題は、ディ
スクアレイを構成するディスク記憶装置の特性を考慮し
たデータ配置方法を実現することにより、データ読みだ
し時のデータアクセスを高速にするディスクアレイ制御
方法及びディスクアレイ装置を提供することである。SUMMARY OF THE INVENTION It is an object of the present invention to provide a disk array which realizes a high-speed data access at the time of data reading by realizing a data arranging method in consideration of characteristics of a disk storage device constituting a disk array. An object of the present invention is to provide a control method and a disk array device.

【０００９】[0009]

【課題を解決するための手段】上記課題を解決するため
に、ディスク記憶装置にゾーン・ビット・レコーディン
グ方式のディスクを有するものを使用し、その特徴を生
かしたデータ配置にする。すなわち、請求項１のディス
クアレイ制御方法では、入力データに冗長データを加
え、ゾーン・ビット・レコーディング方式のディスクを
有するディスク記憶装置の複数台に、入力データ及び冗
長データを分けて記憶するディスクアレイ制御方法であ
って、冗長データのディスクにおける記憶領域を、入力
データの記憶領域よりディスクの内周側に配置すること
を特徴とする。In order to solve the above-mentioned problem, a disk storage device having a zone bit recording type disk is used, and a data arrangement utilizing the characteristics is used. In other words, in the disk array control method according to the first aspect, the redundant data is added to the input data, and the input data and the redundant data are separately stored in a plurality of disk storage devices having a zone bit recording type disk. The control method is characterized in that a storage area of a disk for redundant data is arranged on an inner peripheral side of the disk with respect to a storage area of input data.

【００１０】また、請求項２のディスクアレイ制御方法
では、請求項１のディスクアレイ制御方法に加え、入力
データの記憶領域を、ディスクの使用可能領域の中で、
外周側から配置することを特徴とする。In the disk array control method according to the second aspect, in addition to the disk array control method according to the first aspect, the storage area of the input data is set in the usable area of the disk.
It is characterized by being arranged from the outer peripheral side.

【００１１】また、請求項３のディスクアレイ制御方法
では、請求項１のディスクアレイ制御方法に加え、入力
データの、入力データの記憶領域内での配置方法は、入
力データの分割数をＮ、ディスク記憶装置の総数をＮ＋
１、ディスク記憶装置の番号を１からＮ＋１としたと
き、番号１のディスク記憶装置から番号Ｎ＋１のディス
ク記憶装置の方向へ順に配置し、番号Ｎ＋１のディスク
記憶装置の次は番号１のディスク記憶装置であって次の
記憶位置に配置するものであり、冗長データの、冗長デ
ータの記憶領域内での配置方法は、冗長データに対応す
る分割された入力データが配置されていないディスク記
憶装置に配置することを特徴とする。In the disk array control method according to a third aspect, in addition to the disk array control method according to the first aspect, the method of arranging input data in the storage area of the input data is as follows. N + the total number of disk storage devices
1, when the numbers of the disk storage devices are 1 to N + 1, the disk storage devices of the number 1 are arranged in order from the disk storage device of the number 1 to the disk storage device of the number N + 1. The method of arranging redundant data in the storage area of the redundant data is performed by arranging the redundant data in a disk storage device in which the divided input data corresponding to the redundant data is not arranged. It is characterized by doing.

【００１２】また、請求項４のディスクアレイ装置で
は、入力データに冗長データを加え、ゾーン・ビット・
レコーディング方式のディスクを有するディスク記憶装
置の複数台に、入力データ及び冗長データを分けて記憶
させるディスクアレイ装置であって、冗長データのディ
スクにおける記憶領域を、入力データの記憶領域よりデ
ィスクの内周側に配置する構成であることを特徴とす
る。Further, in the disk array device according to the present invention, redundant data is added to input data, and zone data is added.
A disk array device for separately storing input data and redundant data in a plurality of disk storage devices having a recording type disk, wherein a storage area of the disk for redundant data is set at an inner circumference of the disk from a storage area of the input data. It is characterized in that it is arranged on the side.

【００１３】また、請求項５のディスクアレイ装置で
は、請求項４のディスクアレイ装置に加え、入力データ
の記憶領域を、ディスクの使用可能領域の中で、外周側
から配置する構成であることを特徴とする。According to a fifth aspect of the present invention, in addition to the disk array device of the fourth aspect, a storage area for input data is arranged from an outer peripheral side in a usable area of the disk. Features.

【００１４】また、請求項６のディスクアレイ装置で
は、請求項４のディスクアレイ装置に加え、入力データ
の、入力データの記憶領域内での配置構成は、入力デー
タの分割数をＮ、ディスク記憶装置の総数をＮ＋１、デ
ィスク記憶装置の番号を１からＮ＋１としたとき、番号
１のディスク記憶装置から番号Ｎ＋１のディスク記憶装
置の方向へ順に配置し、番号Ｎ＋１のディスク記憶装置
の次は番号１のディスク記憶装置であって次の記憶位置
に配置する構成であり、冗長データの、冗長データの記
憶領域内での配置構成は、冗長データに対応する分割さ
れた入力データが配置されていないディスク記憶装置に
配置する構成であることを特徴とする。According to a sixth aspect of the present invention, in addition to the disk array device of the fourth aspect, the arrangement of the input data in the storage area of the input data is as follows: Assuming that the total number of devices is N + 1 and the numbers of the disk storage devices are 1 to N + 1, the disk storage devices are arranged in order from the disk storage device of number 1 to the disk storage device of number N + 1. And the redundant data is arranged in the storage area of the redundant data in the disk storage device in which the divided input data corresponding to the redundant data is not arranged. It is characterized by being arranged in a storage device.

【００１５】上述した手段による作用を説明する。本来
記憶すべき入力データはデータ領域に、冗長データはデ
ータ領域とは別の冗長データ領域に記憶される。しか
も、冗長データ領域は、ゾーン・ビット・レコーディン
グ（以下、「ＺＢＲ」という。）方式のディスクでデー
タ領域より内周側に配置されている。冗長データは本来
必要のないデータであり、ディスク記憶装置が故障時に
使われるリカバリ用のデータである。すなわち、通常読
み出し時には必要のないデータである。The operation of the above means will be described. The input data to be originally stored is stored in the data area, and the redundant data is stored in a redundant data area different from the data area. In addition, the redundant data area is disposed on the inner peripheral side of the data area in a zone bit recording (hereinafter, referred to as "ZBR") type disk. The redundant data is essentially unnecessary data and is recovery data used when a disk storage device fails. That is, the data is not necessary at the time of normal reading.

【００１６】本発明のディスクアレイ制御方法で配置さ
れたデータを読み出す時には、通常はデータ領域の読み
出しのみ行えばよいため、冗長データのある内周側への
シーク動作は行われない。ＺＢＲ方式において外周側ほ
どデータ転送速度が速く、データ領域はディスク記憶装
置の外周側に配置されていることにより、通常の読み出
し時のデータ転送速度が速くなる。また、本来記憶すべ
きデータがまとめて配置されているので、冗長データを
含まないだけデータ転送速度が速くなり、シーク範囲も
狭くなることからアクセス時間の短縮も可能となる。デ
ータ転送速度が速くなるということは、逆にいえば、デ
ータ転送時間が短縮されることである。When data arranged by the disk array control method of the present invention is read, usually only the reading of the data area needs to be performed, and therefore, the seek operation to the inner peripheral side with the redundant data is not performed. In the ZBR system, the data transfer speed is higher on the outer peripheral side, and the data area is arranged on the outer peripheral side of the disk storage device, so that the data transfer speed at the time of normal reading is higher. In addition, since the data to be stored is arranged in a lump, the data transfer speed is increased without redundant data, and the seek range is narrowed, so that the access time can be reduced. Increasing the data transfer speed means, conversely, reducing the data transfer time.

【００１７】なお、冗長データは、冗長データに対応す
るデータが配置されていないディスク記憶装置に配置す
るので、ディスク記憶装置の内の１台が故障してもデー
タを回復できるというＲＡＩＤ本来の機能は失わない。Since redundant data is allocated to a disk storage device in which data corresponding to the redundant data is not allocated, even if one of the disk storage devices fails, the original function of RAID is that data can be recovered. Do not lose.

【００１８】[0018]

【発明の実施の形態】本発明の実施の形態例について図
１ないし図７を参照して説明する。ここで、図１は情報
配信用サーバの構成図であり、図２は５台のディスク記
憶装置で構成した場合のデータ配置例であり、図３はＮ
＋１台のディスク記憶装置で構成した場合のデータ配置
例であり、図４はＲＡＩＤコントローラの必要な領域確
保の処理のフローチャートであり、図５はＲＡＩＤコン
トローラの書き込み時の処理のフローチャートであり、
図６はＲＡＩＤコントローラの読み出し時の処理のフロ
ーチャートであり、図７は読み出し時のシーク範囲を示
す図である。なお、図中の構成要素で従来の技術と同様
の構造を成しているものについては、同一の参照符号を
付すものとする。DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described with reference to FIGS. Here, FIG. 1 is a configuration diagram of an information distribution server, FIG. 2 is an example of data arrangement in a case where the information distribution server is configured by five disk storage devices, and FIG.
FIG. 4 is a flowchart of a process for securing a necessary area of the RAID controller, and FIG. 5 is a flowchart of a process at the time of writing by the RAID controller;
FIG. 6 is a flowchart of a process at the time of reading by the RAID controller, and FIG. 7 is a diagram showing a seek range at the time of reading. Note that components in the figure that have the same structure as the conventional technology are denoted by the same reference numerals.

【００１９】まず、図１に沿って、情報配信用サーバの
構成例を説明する。サーバ内部にはホストバス１により
ホストＣＰＵ２、メモリ３、本サーバの外部との通信が
行われるネットワークアダプタ４が接続されている。デ
ィスクアレイを装備したサーバの場合、ＲＡＩＤコント
ローラ５がＢＩＵ（Bus Interface Unit）６を介してホ
ストバス１に接続されている。First, a configuration example of the information distribution server will be described with reference to FIG. Inside the server, a host CPU 2, a memory 3, and a network adapter 4 for communicating with the outside of the server are connected by a host bus 1. In the case of a server equipped with a disk array, the RAID controller 5 is connected to the host bus 1 via a BIU (Bus Interface Unit) 6.

【００２０】ＢＩＵ６は、ホストバス１とコントロール
ＣＰＵバス７間のインターフェイス機能およびバスブリ
ッジ機能を有するものである。このＢＩＵ６の中にＰＣ
Ｉバスインターフェイス機能が含まれる構成も考えられ
る。また、直接ＢＩＵ６を介して接続する代わりに、ホ
ストバス１に情報配信用として付けられているネットワ
ークアダプタ４以外の、さらに別のネットワークアダプ
タを付加して、このネットワークを経由してＲＡＩＤコ
ントローラ５を接続するという形態も可能である。The BIU 6 has an interface function between the host bus 1 and the control CPU bus 7 and a bus bridge function. PC in this BIU6
A configuration including an I bus interface function is also conceivable. Instead of directly connecting via the BIU 6, another network adapter other than the network adapter 4 attached to the host bus 1 for information distribution is added, and the RAID controller 5 is connected via this network. A form of connection is also possible.

【００２１】ＲＡＩＤコントローラ５内のコントロール
ＣＰＵバス７にはＢＩＵ６、コントロールＣＰＵ８、メ
モリ９、冗長データ計算回路１０、ディスクコントロー
ラ部１１が接続されている。メモリ９は、コントロール
ＣＰＵ８のワークエリア、各種テーブル、リスト用に使
用されると共に、ディスクキャッシュとして使われる。
冗長データ計算回路１０は、ディスク記憶装置１２−１
〜１２−（Ｎ＋１）に使われる冗長データを計算するた
めに使用される。冗長データは、たとえばパリティであ
る。A BIU 6, a control CPU 8, a memory 9, a redundant data calculation circuit 10, and a disk controller 11 are connected to a control CPU bus 7 in the RAID controller 5. The memory 9 is used for a work area of the control CPU 8, various tables and lists, and is also used as a disk cache.
The redundant data calculation circuit 10 includes a disk storage device 12-1.
Used to calculate redundant data used for ~ 12- (N + 1). The redundant data is, for example, parity.

【００２２】ＲＡＩＤコントローラ５は、ホストＣＰＵ
２から書き込み指示されたデータを一定の大きさに分割
（以下、「ストライプ化」という。）し、このストライ
プ化されたデータを一つの単位（ストライプ）として冗
長データを計算する。この冗長データを使うことによ
り、ディスク記憶装置１２−１〜１２−（Ｎ＋１）のう
ち、いずれか１台が故障してデータが失われた場合で
も、元の失われたデータを回復することができる。The RAID controller 5 has a host CPU
2, the data instructed to be written is divided into a certain size (hereinafter, referred to as "striping"), and redundant data is calculated using the striped data as one unit (stripe). By using the redundant data, even if one of the disk storage devices 12-1 to 12- (N + 1) fails and data is lost, the original lost data can be recovered. it can.

【００２３】ＲＡＩＤコントローラ５とディスク記憶装
置１２−１〜１２−（Ｎ＋１）とはディスクコントロー
ラ部１１を介してディスクバス１３により接続されてお
り、ＲＡＩＤコントローラ５内で計算された冗長データ
とストライプ化されたデータが各々のディスク記憶装置
１２−１〜１２−（Ｎ＋１）に並列に書き込まれる。The RAID controller 5 and the disk storage devices 12-1 to 12- (N + 1) are connected by a disk bus 13 via a disk controller unit 11, and the redundant data calculated in the RAID controller 5 and the stripe data are stripped. The written data is written in parallel to each of the disk storage devices 12-1 to 12- (N + 1).

【００２４】本実施の形態例のディスク記憶装置１２−
１〜１２−（Ｎ＋１）は、ＺＢＲ方式の、複数のデータ
転送レートが異なるゾーンを備えているディスクを使用
するものである。記録密度が一定かつディスクの回転速
度が一定という条件の下では、内周より外周の方でトラ
ックあたりのセクタ数を数多く取ることができる。ＺＢ
Ｒ方式はこの性質を利用して、記録面をディスクの中心
からの距離に応じて複数のゾーンに分け、外側のゾーン
では内側のゾーンよりトラックあたりのセクタ数を多く
するようにしてディスクの記録容量を増やすようにした
フォーマット方式である。The disk storage device 12- of the present embodiment
Nos. 1 to 12- (N + 1) use a disk having a plurality of zones with different data transfer rates in the ZBR system. Under the condition that the recording density is constant and the rotation speed of the disk is constant, a larger number of sectors per track can be obtained on the outer periphery than on the inner periphery. ZB
The R system utilizes this property to divide the recording surface into a plurality of zones according to the distance from the center of the disk, and to make the outer zone use more sectors per track in the inner zone than the inner zone. This is a format method to increase the capacity.

【００２５】ＺＢＲ方式のディスクを使用するディスク
記憶装置では、ディスク回転数が一定で、外周の方が内
周よりセクタ数が多いので、同一のデータサイズであっ
ても外周側に記録されているデータのほうが内周側に記
録されているデータより読み出しが速い。かかるアクセ
ス特性を生かして、ストライプ化されたデータをディス
クの外周側に記録し、冗長データをそのデータよりディ
スクの内周側に記録する。In a disk storage device using a ZBR type disk, since the disk rotation speed is constant and the number of sectors is larger on the outer circumference than on the inner circumference, data is recorded on the outer circumference even if the data size is the same. Data is read faster than data recorded on the inner circumference. Taking advantage of such access characteristics, striped data is recorded on the outer peripheral side of the disk, and redundant data is recorded on the inner peripheral side of the disk from the data.

【００２６】５台のディスク記憶装置で使用した場合の
データ配置例を図２で説明する。ＳＳ１からＳＳ５は１
ストライプを構成するストライプ化されたデータのセッ
トで、それぞれ４つのデータから構成されている。たと
えばＳＳ１はＤ１からＤ４のデータで構成される。スト
ライプ化されたデータは、ディスク記憶装置１２−１か
らディスク記憶装置１２−５へ順に配置され、それぞれ
のディスク記憶装置内ではディスクの外周側から順に配
置する。An example of data arrangement when used in five disk storage devices will be described with reference to FIG. SS1 to SS5 are 1
A set of striped data constituting a stripe, each of which is composed of four data. For example, SS1 is composed of data D1 to D4. The striped data is arranged in order from the disk storage device 12-1 to the disk storage device 12-5, and in each disk storage device, arranged in order from the outer periphery of the disk.

【００２７】Ｐ（Ｄ１〜Ｄ４）からＰ（Ｄ３７〜Ｄ４
０）はストライプ化されたデータに対応する冗長データ
で、たとえばＰ（Ｄ１〜Ｄ４）はＳＳ１の冗長データで
ある。冗長データはデータの配置より内周側からまとめ
て順に配置する。Ｐ（Ｄ１〜Ｄ４）からＰ（Ｄ１７〜Ｄ
２０）は冗長データ記憶領域の先頭アドレスに配置さ
れ、Ｐ（Ｄ２１〜Ｄ２４）からＰ（Ｄ３７〜Ｄ４０）は
冗長データ記憶領域の先頭アドレス＋１（ストライプ内
の各データブロックの大きさ分）のアドレスに配置され
る。From P (D1 to D4) to P (D37 to D4)
0) is redundant data corresponding to the striped data. For example, P (D1 to D4) is SS1 redundant data. The redundant data is arranged collectively from the inner circumference side in order from the data arrangement. From P (D1 to D4) to P (D17 to D
20) is located at the start address of the redundant data storage area, and P (D21 to D24) to P (D37 to D40) are the address of the start address of the redundant data storage area + 1 (the size of each data block in the stripe). Placed in

【００２８】しかし、ディスク記憶装置１２−１〜１２
−５の１台が故障してもデータの回復ができるために
は、データが配置されているディスク記憶装置と異なる
ディスク記憶装置に配置する必要がある。このため、Ｓ
Ｓ１の構成要素であるＤ１〜Ｄ４はディスク記憶装置１
２−１〜１２−４のそれぞれに配置されているので、Ｓ
Ｓ１の冗長データであるＰ（Ｄ１〜Ｄ４）は、ディスク
記憶装置１２−５に配置する。同様にＳＳ２の冗長デー
タであるＰ（Ｄ５〜Ｄ８）はディスク記憶装置１２−４
に配置する。However, the disk storage devices 12-1 to 12-12
In order to be able to recover data even if one of the -5s fails, the data must be located in a disk storage device different from the disk storage device in which the data is located. Therefore, S
The components D1 to D4 of S1 are disk storage devices 1
2-1 to 12-4, so that S
P (D1 to D4), which is redundant data of S1, is arranged in the disk storage device 12-5. Similarly, P (D5 to D8) which is redundant data of SS2 is stored in the disk storage device 12-4.
To place.

【００２９】ディスク記憶装置をＮ＋１台にして一般化
したときの構成例を図３で説明する。ここでＮはストラ
イプ化に使用されるデータ用のディスク記憶装置の台数
である。入力データの、入力データの記憶領域内での配
置は、ディスク記憶装置１２−１からディスク記憶装置
１２−（Ｎ＋１）の方向へ順に配置し、ディスク記憶装
置１２−（Ｎ＋１）の次は、ディスク記憶装置１２−１
であって次の記憶位置に配置する。An example of the configuration when the number of disk storage devices is generalized to N + 1 will be described with reference to FIG. Here, N is the number of disk storage devices for data used for striping. The input data is arranged in the storage area of the input data in the order from the disk storage device 12-1 to the disk storage device 12- (N + 1), and the disk storage device 12- (N + 1) is followed by the disk storage device 12- (N + 1). Storage device 12-1
And is located at the next storage location.

【００３０】冗長データの、冗長データの記憶領域内で
の配置は、冗長データに対応するストライプ化されたデ
ータが配置されていないディスク記憶装置に配置する。
たとえば、冗長データＰ（ＤＮ＋１〜Ｄ２Ｎ）は、対応
するデータＤＮ＋１〜Ｄ２Ｎが配置されていないディス
ク記憶装置１２−Ｎに配置する。The arrangement of the redundant data in the storage area of the redundant data is arranged in a disk storage device in which the striped data corresponding to the redundant data is not arranged.
For example, the redundant data P (DN + 1 to D2N) is arranged in the disk storage device 12-N in which the corresponding data DN + 1 to D2N are not arranged.

【００３１】なお、図２および図３で、データ領域に連
続して冗長データ領域があるが、冗長データ領域はデー
タ領域よりディスクの内周側であればどこでもよく、デ
ータ領域に連続している必要はない。In FIGS. 2 and 3, there is a redundant data area continuous with the data area. The redundant data area may be anywhere on the inner peripheral side of the disk with respect to the data area, and is continuous with the data area. No need.

【００３２】これらの領域の設定方法を含め、本実施の
形態例のＲＡＩＤコントローラ５の処理を、初期設定、
書き込み前の必要な領域確保の処理、書き込み時の処
理、消去時の処理、読み出し時の処理に分け、順に説明
する。The processing of the RAID controller 5 of the present embodiment, including the method of setting these areas, is initialized,
The process for securing a necessary area before writing, the process for writing, the process for erasing, and the process for reading will be described separately.

【００３３】まず初期設定から説明する。初期設定で
は、ＺＢＲ方式のゾーン別のアドレス範囲を示す、ゾー
ン情報テーブルを作成する。この作成方法は、ディスク
記憶装置内のディスクのゾーンの境界を割り出し、対応
する物理アドレス範囲をあらかじめ調査し、その結果を
メモリ内部に保存する。ゾーンの境界を割り出す方法
は、たとえばゾーン境界値に対するバイナリーサーチを
行う方法があるが、それに限定するものではない。初期
設定は、本サーバシステムを稼動する前に行われるので
本サーバシステムの書き込み時、あるいは読み出し時の
性能には影響しない。このゾーン情報テーブルはメモリ
９に記憶される。First, the initial setting will be described. In the initial setting, a zone information table indicating an address range for each zone of the ZBR system is created. In this creation method, the boundaries of the zones of the disk in the disk storage device are determined, the corresponding physical address range is checked in advance, and the result is stored in the memory. As a method of determining a zone boundary, for example, there is a method of performing a binary search for a zone boundary value, but the method is not limited to this. Since the initial setting is performed before the operation of the server system, it does not affect the writing or reading performance of the server system. This zone information table is stored in the memory 9.

【００３４】次に、書き込み前の必要な領域確保の処理
を説明する。図４のフローチャートに沿って説明する。
書き込み指令の前に、サーバのホストＣＰＵ２からＲＡ
ＩＤコントローラ５へ、書き込むデータの名前とサイズ
が指定されて送られてくる。ステップＳ１０１では、指
定されたサイズからディスク記憶装置でデータを格納す
るために必要な領域のサイズを決め、その領域のサイズ
をメモリ９に書く。Next, processing for securing a necessary area before writing will be described. This will be described with reference to the flowchart of FIG.
Before the write command, the host CPU 2 of the server
The name and size of the data to be written are specified and sent to the ID controller 5. In step S101, the size of an area necessary for storing data in the disk storage device is determined from the designated size, and the size of the area is written in the memory 9.

【００３５】ステップＳ１０２からステップＳ１０４
は、ストライプ化されたデータを配置するデータ領域リ
ストを作成し、メモリ９に記憶する。ステップＳ１０２
では、ステップＳ１０１で決められた領域のサイズを、
初期設定で作成したゾーン情報テーブルとメモリ９にあ
る未使用の領域リストを使って、ディスク記憶装置内の
ディスクの外周側から未使用の領域を割り当てることに
より確保する。未使用の領域リストは未使用の領域の先
頭アドレスと領域のサイズが入っているリストである。
未使用の領域は、ディスク記憶装置に連続的に存在して
いる場合と断片的に存在している場合がある。Steps S102 to S104
Creates a data area list for arranging striped data and stores it in the memory 9. Step S102
Then, the size of the area determined in step S101 is
Using the zone information table created in the initial setting and the unused area list in the memory 9, an unused area is allocated from the outer peripheral side of the disk in the disk storage device and secured. The unused area list is a list including the start address of the unused area and the size of the area.
The unused area may exist continuously in the disk storage device or may exist in a fragmentary manner.

【００３６】ステップＳ１０３では、作成するデータ領
域リストにステップＳ１０２で確保したサイズ分の領域
を書き込む。ステップＳ１０４では、必要なサイズ分を
確保できたか判断し、確保できなかった場合はステップ
Ｓ１０２に戻り、確保できた場合はステップＳ１０５へ
進む。データ領域リストはメモリ９に記憶する。In step S103, an area of the size secured in step S102 is written in the data area list to be created. In step S104, it is determined whether the required size has been secured. If the required size has not been secured, the process returns to step S102. If the size has been secured, the process proceeds to step S105. The data area list is stored in the memory 9.

【００３７】ステップＳ１０５からステップＳ１０８
は、冗長データを配置する冗長データ領域リストを作成
し、メモリ９に記憶する。ステップＳ１０５では冗長デ
ータの格納に必要な領域のサイズを計算する。冗長デー
タは対応するデータの１ストライプあたり１データとす
ると、冗長データに必要な領域のサイズは、データ領域
のサイズを１ストライプを構成するデータの分割数で割
算することにより計算される。Steps S105 to S108
Creates a redundant data area list for arranging redundant data and stores it in the memory 9. In step S105, the size of the area required for storing the redundant data is calculated. Assuming that the redundant data is one data per stripe of the corresponding data, the size of the area required for the redundant data is calculated by dividing the size of the data area by the number of divisions of the data constituting one stripe.

【００３８】ステップＳ１０６では、ステップＳ１０５
で計算されたサイズ分の領域を、初期設定で作成したゾ
ーン情報テーブルとメモリ９にある未使用の領域リスト
を使って、取得したデータ領域より内周側で、対応する
データが記憶されないディスク記憶装置にある領域を割
り当てることにより、冗長データ領域を確保する。In step S106, step S105
Using the zone information table created by the initial setting and the unused area list in the memory 9, the area corresponding to the size calculated in the above is stored in the disk storage where the corresponding data is not stored on the inner side of the acquired data area. By allocating an area to the device, a redundant data area is secured.

【００３９】ステップＳ１０７では、作成する冗長デー
タ領域リストにステップＳ１０６で確保したサイズ分の
領域を書き込む。ステップＳ１０８では、必要なサイズ
分を確保できたか判断し、確保できなかった場合はステ
ップＳ１０６に戻り、確保できた場合はステップＳ１０
９へ進む。In step S107, the area of the size secured in step S106 is written in the redundant data area list to be created. In step S108, it is determined whether the required size has been secured. If the required size has not been secured, the process returns to step S106.
Go to 9.

【００４０】ステップＳ１０９ではファイルシステムを
初期化する。その内容は、入力するデータと冗長データ
の論理アドレスと、ステップＳ１０１からステップＳ１
０８で確保した領域の物理アドレスとの、情報間のマッ
ピングをして、その結果をアドレスマッピング・テーブ
ルとしてメモリ９に保存することである。アドレスマッ
ピング・テーブルは、論理アドレス、物理ストライプ番
号、物理アドレスから構成される。In step S109, the file system is initialized. The contents include the logical addresses of the input data and the redundant data, and the contents of steps S101 to S1.
The mapping between information and the physical address of the area secured in step 08 is performed, and the result is stored in the memory 9 as an address mapping table. The address mapping table includes a logical address, a physical stripe number, and a physical address.

【００４１】次に、データの書き込み時の処理を図５の
フローチャートに沿って説明する。なお、説明の簡便化
のため、確保したデータ領域の先頭アドレスを物理アド
レス０とし、内周側へいくにしたがってアドレス値が大
きくなるものとする。また、確保したデータ領域は連続
しているものとする。Next, the data writing process will be described with reference to the flowchart of FIG. For the sake of simplicity, it is assumed that the head address of the secured data area is physical address 0, and the address value increases toward the inner circumference. The secured data area is assumed to be continuous.

【００４２】ステップＳ２０１では、ＲＡＩＤコントロ
ーラ５のＯＳ（Operating System）から渡されたデータ
の論理アドレスを、前述のステップＳ１０９で作成され
たアドレスマッピング・テーブルを使って物理ストライ
プ番号に変換する。すなわち、論理アドレスから物理ス
トライプ番号が判明し変換される。In step S201, the logical address of the data passed from the OS (Operating System) of the RAID controller 5 is converted into a physical stripe number using the address mapping table created in step S109. That is, the physical stripe number is determined from the logical address and converted.

【００４３】ステップＳ２０２以降では、個々のデータ
を書き込む物理アドレスを計算する。ステップＳ２０１
で求めた物理ストライプ番号中の、どのディスク記憶装
置の、どのアドレスにデータを書くかについては、次の
ように計算される。論理アドレスをＲＡ（block 位
置）、ストライプ内のデータの分割サイズをＳＺ（bloc
k ）、ＲＡＩＤを構成するディスク記憶装置数をＤＫと
する。冗長データによりディスク１台分のサイズだけ実
際にはデータを書き込めるサイズが減るため、データを
書くためのディスク記憶装置数は、ＤＫ−１台となり、
これによりデータ用のストライプが構成されるものとす
る。In and after step S202, the physical address at which the individual data is to be written is calculated. Step S201
The data to be written to which address of which disk storage device in the physical stripe number obtained in the above is calculated as follows. The logical address is RA (block position) and the data division size in the stripe is SZ (bloc
k), the number of disk storage devices constituting the RAID is DK. Since the size in which data can be actually written is reduced by the size of one disk due to the redundant data, the number of disk storage devices for writing data is DK-1.
Thus, a data stripe is configured.

【００４４】１ストライプ内に書けるデータの大きさ
は、ＳＺ×（ＤＫ−１）となるため、式（１）の商によ
り、そのストライプ番号内の何番目のディスク記憶装置
の位置に書くかが計算される。式（１）で、％は割算の
余りである。（ＲＡ％（ＳＺ×（ＤＫ−１）））／ＳＺ（１）さらに、式（１）の余りの値により、ディスク記憶装置
内の物理ストライプ番号に対応する先頭アドレスからの
オフセット位置を計算することができる。Since the size of the data that can be written in one stripe is SZ × (DK−1), the quotient of equation (1) determines the number of the disk storage device in the stripe number to be written. Is calculated. In equation (1),% is the remainder of the division. (RA% (SZ × (DK-1))) / SZ (1) Further, the offset position from the head address corresponding to the physical stripe number in the disk storage device is calculated based on the remainder of the equation (1). be able to.

【００４５】例として、図２のようにディスク記憶装置
を５台でＲＡＩＤを構成し、ストライプサイズＳＺを１
６block と仮定したケースを考える。この場合、データ
記憶用のディスク記憶装置数は、冗長データ用のサイズ
分を除いた４台分となるため、ＤＫ−１＝４となる。As an example, as shown in FIG. 2, a RAID is composed of five disk storage devices, and the stripe size SZ is 1
Consider the case assuming 6 blocks. In this case, the number of disk storage devices for data storage is four, excluding the size for redundant data, so that DK-1 = 4.

【００４６】論理アドレスとして８１がＯＳから渡され
た場合、式（１）から（８１％（１６×４））／１６＝１７／１６＝１（商）となるため、ステップＳ２０１で渡された物理ストライ
プ番号内の複数のディスク記憶装置のうち、ストライプ
の先頭にあるドライブ＋１台目、すなわち先頭の次の位
置のディスク記憶装置に書けばよい事がわかる。また、
この式の余り値は１であるため、そのディスク記憶装置
内にデータを書き込むためのアドレスのオフセットは、
１block 分であることがわかる。When 81 is passed as the logical address from the OS, (81% (16 × 4)) / 16 = 17/16 = 1 (quotient) is obtained from the equation (1). It can be seen that, of the plurality of disk storage devices in the physical stripe number, writing to the drive at the head of the stripe plus the first drive, that is, the disk storage device at the position next to the head, is sufficient. Also,
Since the remainder of the equation is 1, the offset of the address for writing data into the disk storage device is:
It can be seen that it is one block worth.

【００４７】なお、物理ストライプ番号により、ストラ
イプの開始するディスク記憶装置番号が異なるが、それ
は式（２）で求まる。式（２）で、ＳＮはデータの物理
ストライプ番号、ＤＫはＲＡＩＤを構成するディスク記
憶装置数、％は割算の余りである。ただし、物理ストラ
イプ番号は１から始まるものとする。また、ディスク記
憶装置の番号付けも１からの順とする。ＤＫ−（（ＳＮ＋ＤＫ−２）％ＤＫ）（２）Note that the disk storage device number at which the stripe starts depends on the physical stripe number, which is obtained by equation (2). In equation (2), SN is the physical stripe number of the data, DK is the number of disk storage devices constituting the RAID, and% is the remainder of the division. However, the physical stripe number starts from 1. The numbering of the disk storage devices is also performed in the order from 1. DK-((SN + DK-2)% DK) (2)

【００４８】たとえば、５台のディスク記憶装置で、デ
ータの物理ストライプ番号が３であるストライプが開始
するディスク記憶装置番号は、ＤＫ＝５、ＳＮ＝３を式
（２）に代入すると４が得られるので４番になる。この
結果は図２で物理ストライプ番号３、すなわちＳＳ３の
先頭データＤ９がディスク記憶装置１２−４にあること
により確認される。For example, in five disk storage devices, the disk storage device number at which the stripe whose data physical stripe number is 3 starts is 4 by substituting DK = 5 and SN = 3 into equation (2). No. 4 This result is confirmed by the fact that the physical stripe number 3, ie, the leading data D9 of SS3 is in the disk storage device 12-4 in FIG.

【００４９】ステップＳ２０３では、ステップＳ２０１
で変換された物理ストライプ番号に対応する冗長データ
を書き込む物理アドレスを計算する。冗長データを書き
込むディスク記憶装置は、物理ストライプ番号に対応す
るデータが配置されていないディスク記憶装置を割り当
てる必要がある。具体的には、ストライプ内の先頭のデ
ータが書かれるディスク記憶装置より一つ前の位置にあ
るディスク記憶装置に冗長データを書く。In step S203, step S201
Calculate the physical address to write the redundant data corresponding to the physical stripe number converted by the above. A disk storage device to which redundant data is written needs to allocate a disk storage device in which data corresponding to a physical stripe number is not arranged. More specifically, redundant data is written to a disk storage device at a position immediately before a disk storage device in which the first data in the stripe is written.

【００５０】そのディスク記憶装置のどこのアドレス
に、書き込むデータに対応する冗長データを書くかは、
冗長データ領域の先頭物理アドレスを便宜上０とする
と、式（３）で計算される。式（３）でＩＮＴは整数値
という意味で、括弧内の割算の余りを除いた商の分だけ
オフセットした位置となる。ＩＮＴ（（ＳＮ−１）／ＤＫ）（３）At which address of the disk storage device the redundant data corresponding to the data to be written is to be written
Assuming that the leading physical address of the redundant data area is 0 for convenience, it is calculated by equation (3). In the expression (3), INT means an integer value, which is a position offset by the quotient excluding the remainder of the division in parentheses. INT ((SN-1) / DK) (3)

【００５１】実際には、図２及び図３のＤ１、Ｄ２等
は、ストライプ化されたあるサイズのデータブロックと
考えると、式（３）の結果とストライプ化されたデータ
ブロック数の乗算分だけオフセットされたアドレスが冗
長データを書く位置となる。Actually, assuming that D1, D2, etc. in FIGS. 2 and 3 are striped data blocks of a certain size, only the product of the result of equation (3) and the number of striped data blocks is obtained. The offset address becomes a position where redundant data is written.

【００５２】たとえば、５台のディスク記憶装置で、デ
ータの物理ストライプ番号が６の冗長データは、ＤＫ＝
５、ＳＮ＝６を式（３）に代入すると、１が得られるの
で冗長データ領域の先頭物理アドレス＋１（ストライプ
内の各データブロックの大きさ分）のアドレスになるこ
とがわかる。この結果は図２で物理ストライプ番号６の
冗長データＰ（Ｄ２１〜Ｄ２４）が、冗長データ領域の
先頭物理アドレス＋１（ストライプ内の各データブロッ
クの大きさ分）であることにより確認される。以上のス
テップＳ２０１からステップＳ２０３により入力される
データに対応する全ての物理アドレスが具体的に決定さ
れる。For example, in five disk storage devices, redundant data having a physical stripe number of 6 is DK =
By substituting 5, SN = 6 into equation (3), 1 is obtained, and it can be seen that this is the address of the top physical address of the redundant data area + 1 (the size of each data block in the stripe). This result is confirmed by the fact that the redundant data P (D21 to D24) of the physical stripe number 6 in FIG. 2 is the first physical address of the redundant data area + 1 (the size of each data block in the stripe). All physical addresses corresponding to the data input in steps S201 to S203 are specifically determined.

【００５３】ステップＳ２０４では冗長データを計算す
る。これは入力されたデータをハードウエアの冗長デー
タ計算回路１０で行ってもよいし、ＲＡＩＤのアルゴリ
ズムに従ってソフトウエア処理で冗長データを計算して
もよい。ステップＳ２０５では、データ用物理アドレス
にストライプ化されたデータを書きこむと共に、冗長デ
ータ用物理アドレスにストライプ化されたデータに対応
する冗長データを書きこむ。以上によって全ての書き込
み処理が行われる。In step S204, redundant data is calculated. This may be performed by using the redundant data calculation circuit 10 of hardware for the input data, or calculating the redundant data by software processing according to a RAID algorithm. In step S205, the striped data is written to the data physical address, and the redundant data corresponding to the striped data is written to the redundant data physical address. As described above, all the writing processes are performed.

【００５４】書き込み処理では、データを書く位置と冗
長データを書く位置が離れているために、１データごと
に逐次に書き込みを行うと、シークが多発しアクセス速
度の面で不利であるので、ＲＡＩＤコントローラ５内の
メモリ９に冗長データをまとめて保存しておき、ある程
度冗長データがまとまったらディスク記憶装置に書くよ
うにする。In the write processing, since the position where data is written and the position where redundant data are written are distant from each other, if writing is performed sequentially for each data, many seeks occur, which is disadvantageous in terms of access speed. The redundant data is collectively stored in the memory 9 in the controller 5, and is written to the disk storage device when the redundant data is collected to some extent.

【００５５】次に、消去時の処理を説明する。サーバか
ら所定データの消去指示が来ると、ＲＡＩＤコントロー
ラ５は、所定データのデータ領域リストの内容を未使用
な領域リストに移動させ、所定データの冗長データ領域
リストの内容を未使用な領域リストに移動させる。この
ようにして、不要になったデータ領域と冗長データ領域
を解放し、次のデータ書き込み時のデータ割り当てに使
用可能にする。Next, processing at the time of erasing will be described. When a predetermined data erasure instruction is received from the server, the RAID controller 5 moves the contents of the data area list of the predetermined data to the unused area list, and changes the contents of the redundant data area list of the predetermined data to the unused area list. Move. In this way, the unnecessary data area and redundant data area are released, and can be used for data allocation at the time of next data writing.

【００５６】次に、読み出す時の処理を図６のフローチ
ャートに沿って説明する。通常は冗長データをアクセス
せずに、データのみのアクセスになる。冗長データは通
常は不要であるからである。読み出すデータの名前とサ
イズは、サーバのホストＣＰＵ２からＲＡＩＤコントロ
ーラ５へ指令される。Next, the reading process will be described with reference to the flowchart of FIG. Normally, only data is accessed without accessing redundant data. This is because redundant data is usually unnecessary. The name and size of the data to be read are instructed from the host CPU 2 of the server to the RAID controller 5.

【００５７】ステップＳ３０１はステップＳ２０１と同
じであり、ステップＳ３０２はステップＳ２０２と同様
の内容である。すなわち、ＲＡＩＤコントローラ５のＯ
Ｓから渡された指定データの論理アドレスを物理ストラ
イプ番号に変換し、さらに、データを読み出す物理アド
レスを計算する。ステップＳ３０３では、ステップＳ３
０１、Ｓ３０２で計算された物理先頭アドレスから、必
要サイズのデータをディスク記憶装置から読み込む。Step S301 is the same as step S201, and step S302 is the same as step S202. That is, the O of the RAID controller 5
The logical address of the designated data passed from S is converted into a physical stripe number, and the physical address from which data is read is calculated. In step S303, step S3
01, data of a required size is read from the disk storage device from the physical head address calculated in S302.

【００５８】通常の読み出し時にはデータ領域の読み出
しのみ行うため、冗長データのある内周側へのシーク動
作は行われない。ＺＢＲ方式において外周ほどデータ転
送速度が速く、本来記憶すべきデータはディスク記憶装
置の外周側に配置されていることにより、データ転送速
度が速くなる。また、データはまとめて配置されている
ので、冗長データを含まないだけデータ転送速度が速く
なる。データ転送速度が速いということは、逆にいえ
ば、データ転送時間が短縮されることである。During normal reading, only the reading of the data area is performed, so that the seek operation to the inner peripheral side with the redundant data is not performed. In the ZBR method, the data transfer speed is higher at the outer periphery, and the data transfer speed is higher because the data to be stored is arranged on the outer periphery of the disk storage device. Also, since the data is arranged collectively, the data transfer speed is increased as much as the redundant data is not included. Faster data transfer speed means, conversely, shorter data transfer time.

【００５９】さらに、図７（ａ）に示すように、従来の
シーク範囲はデータと冗長データの領域２０であった
が、図７（ｂ）に示すようにデータ領域２１と冗長デー
タ領域２２が分離されていれば、シーク範囲はデータ領
域２１だけでよく、シーク範囲が狭くなることからアク
セス時間の短縮も可能となる。Further, as shown in FIG. 7A, the conventional seek range is the data and redundant data area 20, but as shown in FIG. 7B, the data area 21 and the redundant data area 22 are divided. If they are separated, the seek range only needs to be the data area 21, and the seek range is narrowed, so that the access time can be reduced.

【００６０】冗長データを読み出す必要があるのはＲＡ
ＩＤを構成するディスク記憶装置の内、１台が故障した
場合である。この場合でも、正常なディスク記憶装置に
書かれているデータおよび冗長データを、ステップＳ２
０２及びステップＳ２０３と同じ方法で計算することに
より、記憶したデータ及び冗長データのアドレスを特定
して読み出しされるので、故障したディスク記憶装置に
書かれている内容を回復することができる。すなわち、
１台が故障してもデータを回復できるというＲＡＩＤ本
来の機能は失わない。It is necessary to read the redundant data from RA
This is a case where one of the disk storage devices constituting the ID has failed. Even in this case, the data written in the normal disk storage device and the redundant data are transferred to step S2.
02 and the same method as in step S203, the addresses of the stored data and the redundant data are specified and read out, so that the contents written in the failed disk storage device can be recovered. That is,
Even if one unit fails, the original function of RAID that data can be recovered is not lost.

【００６１】このアルゴリズムによると、１台のディス
クが故障した場合は、フルシークが発生する可能性があ
るが、ＲＡＩＤを構成するディスク記憶装置の内、１台
が故障する確率は、全てのディスク記憶装置が正常な場
合に比べ低いので、実用上問題は少ない。たとえば、現
在のハードディスク記憶装置では１台当たりのＭＴＢＦ
（Mean Time Between Failure ）は１００万時間である
ため、これを１０台使用したＲＡＩＤにおいては、この
うちの１台が故障するまでのＭＴＢＦは１０万時間にな
り、冗長データへのアクセスは滅多に起こらず、通常は
データのみのアクセスとなる。According to this algorithm, if one disk fails, a full seek may occur, but the probability of failure of one of the disk storage devices constituting the RAID is determined by the probability that all disk storage devices will fail. Since the device is lower than the case where the device is normal, there is little practical problem. For example, with current hard disk storage devices, one MTBF
(Mean Time Between Failure) is one million hours, so in a RAID using ten of them, the MTBF until one of them fails is 100,000 hours, and access to redundant data is rare. This does not occur, and is usually only data access.

【００６２】また、全く同じデータを持つ完全２重化運
転されたサーバのようなときは、片側のサーバの１台の
ディスク記憶装置が故障したときに、別のサーバで運転
を引き継ぐような構成であるため、故障時のサービス品
質の低下は考えなくて済む。Further, in the case of a completely duplicated server having completely the same data, when one disk storage device of one server fails, another server takes over the operation. Therefore, there is no need to consider a decrease in service quality at the time of failure.

【００６３】実施形態として、ＲＡＩＤコントローラ５
無しで、ホストＣＰＵ２のソフトウエア処理のみで冗長
データを計算し、本来記憶すべき入力データと冗長デー
タのディスク記憶装置における配置位置を決定する機能
を有し、ホストＣＰＵ２のデータ配置指示に従ってディ
スク記憶装置１２−１〜１２−（Ｎ＋１）へデータを振
り分けるだけの機能を有する形態も可能である。また、
ディスク記憶装置１２−１〜１２−（Ｎ＋１）のファー
ムウエアでこれらの機能の一部を実現する形態も可能で
ある。As an embodiment, the RAID controller 5
A function of calculating redundant data only by software processing of the host CPU 2 and determining an arrangement position of input data and redundant data to be originally stored in the disk storage device. A form having a function of only distributing data to the devices 12-1 to 12- (N + 1) is also possible. Also,
A form in which some of these functions are realized by the firmware of the disk storage devices 12-1 to 12- (N + 1) is also possible.

【００６４】ディスク記憶装置のディスクは、ＺＢＲ方
式でフォーマット化されているもの、またはＺＢＲ方式
のフォーマット化が可能なものであれば、ハードディス
クであっても光ディスクであってもよいし、交換可能な
形態であってもよい。The disk of the disk storage device may be a hard disk or an optical disk as long as the disk is formatted by the ZBR system, or a disk which can be formatted by the ZBR system. It may be in a form.

【００６５】[0065]

【発明の効果】本発明のディスクアレイ制御方法及びデ
ィスクアレイ装置によれば、本来記憶すべき入力データ
と冗長データに分け、入力データをＺＢＲ方式のディス
ク記憶装置のディスクの外周側から記憶し、ディスク記
憶装置が故障したときにしか使われない冗長データを入
力データよりディスクの内周側に記憶することにより、
通常の読み出し時のデータ転送時間の短縮が可能とな
る。そのため、本発明を利用した、ＶＯＤ（Video On D
emand ）のような情報配信型サーバで、読み出し時のデ
ータ転送時間の短縮、アクセス時間の短縮を実現でき
る。また、一般的に読み出し頻度が高いデータベース用
サーバにも同様な効果が得られる。According to the disk array control method and the disk array device of the present invention, input data to be originally stored and redundant data are divided, and the input data is stored from the outer peripheral side of the disk of the ZBR type disk storage device. By storing redundant data that is used only when the disk storage device has failed on the inner side of the disk from the input data,
Data transfer time during normal reading can be reduced. Therefore, VOD (Video On D
With an information distribution server such as emand), it is possible to reduce the data transfer time at the time of reading and the access time. In addition, the same effect can be obtained for a database server that is generally read frequently.

[Brief description of the drawings]

【図１】情報配信用サーバの構成例を示す図である。FIG. 1 is a diagram illustrating a configuration example of an information distribution server.

【図２】５台のディスク記憶装置で構成した場合のデ
ータ配置例を示す図である。FIG. 2 is a diagram illustrating an example of a data arrangement in a case where the disk is configured by five disk storage devices.

【図３】Ｎ＋１台のディスク記憶装置で構成した場合
のデータ配置例を示す図である。FIG. 3 is a diagram showing an example of a data arrangement in the case of being configured with N + 1 disk storage devices.

【図４】ＲＡＩＤコントローラの必要な領域確保の処
理のフローチャートである。FIG. 4 is a flowchart of a process for securing a necessary area of the RAID controller.

【図５】ＲＡＩＤコントローラの書き込み時の処理の
フローチャートである。FIG. 5 is a flowchart of a write process performed by a RAID controller.

【図６】ＲＡＩＤコントローラの読み出し時の処理の
フローチャートである。FIG. 6 is a flowchart of a process at the time of reading by the RAID controller.

【図７】読みだし時のシーク範囲を示す図であり、図
７（ａ）は従来のシーク範囲、図７（ｂ）は本実施の形
態例のシーク範囲を示す図である。7A and 7B are diagrams showing a seek range at the time of reading, FIG. 7A is a diagram showing a conventional seek range, and FIG. 7B is a diagram showing a seek range of the present embodiment.

【図８】ＲＡＩＤ−３方式のデータの配置方法を示す
図である。FIG. 8 is a diagram showing a method of allocating data in the RAID-3 system.

【図９】ＲＡＩＤ−５方式のデータの配置方法を示す
図である。FIG. 9 is a diagram showing a method of allocating data in the RAID-5 system.

[Explanation of symbols]

１…ホストバス、２…ホストＣＰＵ、３…メモリ、４…
ネットワークアダプタ、５…ＲＡＩＤコントローラ、６
…ＢＩＵ、７…コントロールＣＰＵバス、８…コントロ
ールＣＰＵ、９…メモリ、１０…冗長データ計算回路、
１１…ディスクコントローラ部、１２−１〜１２−（Ｎ
＋１）…ディスク記憶装置、１３…ディスクバス、２０
…データと冗長データの領域、２１…データ領域、２２
…冗長データ領域、ＳＳ１、ＳＳ２、ＳＳ３、ＳＳ４、
ＳＳ５、ＳＳ６、ＳＳ７…ストライプ化されたデータの
セットDESCRIPTION OF SYMBOLS 1 ... Host bus, 2 ... Host CPU, 3 ... Memory, 4 ...
Network adapter, 5 ... RAID controller, 6
... BIU, 7 ... control CPU bus, 8 ... control CPU, 9 ... memory, 10 ... redundant data calculation circuit,
11: Disk controller unit, 12-1 to 12- (N
+1) Disk storage device, 13 Disk bus, 20
... data and redundant data area, 21 ... data area, 22
... redundant data area, SS1, SS2, SS3, SS4,
SS5, SS6, SS7 ... Striped data set

Claims

[Claims]

1. A disk array control method in which redundant data is added to input data and the input data and the redundant data are separately stored in a plurality of disk storage devices having a zone bit recording type disk. A disk array control method, wherein a storage area of data on the disk is arranged on an inner peripheral side of the disk with respect to a storage area of the input data.

2. The disk array control method according to claim 1, wherein a storage area of the input data is arranged from an outer peripheral side in a usable area of the disk.

3. A method of arranging the input data in a storage area of the input data, wherein the number of divisions of the input data is N, the total number of the disk storage devices is N + 1, and the number of the disk storage device is 1 to 1 N
When +1 is set, the disk storage device of the number 1 is arranged in order from the disk storage device of the number 1 to the disk storage device of the number N + 1. The method of arranging the redundant data in the storage area of the redundant data is characterized in that the redundant data is arranged in a disk storage device in which the divided input data corresponding to the redundant data is not arranged. 2. The disk array control method according to claim 1, wherein:

4. A disk array device in which redundant data is added to input data and the input data and redundant data are separately stored in a plurality of disk storage devices having a zone bit recording type disk. The disk array device according to claim 1, wherein a storage area of said disk is arranged on an inner peripheral side of said disk with respect to a storage area of said input data.

5. The disk array device according to claim 4, wherein a storage area for the input data is arranged from an outer peripheral side in a usable area of the disk.

6. An arrangement configuration of the input data in a storage area of the input data, wherein the number of divisions of the input data is N, the total number of the disk storage devices is N + 1, and the number of the disk storage device is 1 to 1. N
When +1 is set, the disk storage device of the number 1 is arranged in order from the disk storage device of the number 1 to the disk storage device of the number N + 1. The arrangement of the redundant data in the storage area of the redundant data is such that the divided input data corresponding to the redundant data is arranged in a disk storage device in which the divided input data is not arranged. The disk array device according to claim 4, wherein: